SDSU CS 696 Intro to Big Data: Tools and Methods
Spring Semester, 2020
Lecture Notes
San Diego State University -- This page last updated 23-Apr-20

This page contains links to lecture notes for the CS 696 Intro to Big Data: Tools and Methods course. This page will be updated as more notes become available.

Lecture Notes By Topic
  1. Class Intro
  2. Big Data Intro
  3. Python
  4. SciPy
  5. Pandas Data Structures (zipped notebook)
  6. DataFrame (zipped notebook)
  7. Statistics, Sampling, Bloom
  8. Plotting (zipped notebook)
  9. Data Manipulation (zipped notebook)
  10. Dask
  11. Regression
  12. Scikit Learn, Bayes
  13. Clustering
  14. Spark Intro
  15. Spark 2
  16. Spark Cluster, AWS
  17. Spark ML
  18. Spark Clustering
  19. Spark on AWS
  20. Kafka
  21. Kafka, Spark Streaming
  22. Cassandra
  23. Kafka Pipelines, Microservices
  24. Display, End Remarks

Lecture Video By Date
Tue                                   Thur
Jan 21                                Jan 23  Class Intro
Jan 28  Big Data Intro                Jan 30  Pandas Data Structures
Feb 4   DataFrame, Plotting           Feb 6   Data Manipulation
Feb 11  No Class                      Feb 13  No Class
Feb 18  Dask, Statistics              Feb 20  No Video, technical issue
Feb 25  Scikit Learn, Bayes           Feb 27  Clustering
Mar 3   Assignment 1                  Mar 5   Assignment 2, Spark
Mar 10  Spark, Spark Master/Slave     Mar 12  Assignment 2 Questions, Spark ML
Mar 17  More Spark ML                 Mar 19  Spark ML, Spark Cluster
Mar 24  Spark Neural Networks, S3     Mar 26  Spark on AWS
Mar 31  Spring Break                  Apr 2   Spring Break
Apr 7   Kafka                         Apr 9   Kafka, Spark Streaming
Apr 14  Kafka, Spark Streaming        Apr 16  No Video
Apr 21  Cassandra, Kafka Pipelines    Apr 23  Display, End Remarks
Apr 28                                Apr 30
May 5                                 May 7   Last Class
May 12                                May 14  Projects Due