SDSU |
CS 649 Big Data: Tools and Methods
Spring Semester, 2022 Lecture Notes |
DCS |
---|---|---|
San Diego State University -- This page last updated 3-May-22 |
Tuesday | Thursday |
---|---|
Jan 18 | Jan 20 Course Intro |
Jan 25 Big Data Intro | Jan 27 Python, SciPy, Panda Series |
Feb 1 Dataframe, Data Manipulation | Feb 3 Data Manipulation |
Feb 8 Ploting | Feb 10 Dashboards, Spark Intro |
Feb 15 Spark-Panda API, PySpark 2 | Feb 17 PySpark 2, Statistics |
Feb 22 Statistics, Sampling | Feb 24 Sampling, Bloom, Panda Alternatives |
Mar 1 Panda Alternatives | Mar 3 Regression |
Mar 8 Assignment 1, Regression | Mar 10 Regression, Scikit Learn, Bayes |
Mar 15 Clustering | Mar 17 Spark ML |
Mar 22 Spark ML | Mar 24 Spark Clustering |
Mar 29 No Class Spring Break | Mar 31 No Class Spring Break |
Apr 5 | Apr 7 |
Apr 12 Running Spark | Apr 14 Running Spark, Partition |
Apr 19 No SQL, Cassandra | Apr 21 Cassandra |
Apr 26 Kafka | Apr 28 Kafka, Spark Streaming |
May 3 | May 5 |
May 10 | May 12 Project Due |