| SDSU | CS 649 Big Data: Tools and Methods, Spring Semester, 2022 Lecture Notes | DCS |
|---|---|---|

San Diego State University -- This page last updated 3-May-22

| Tuesday | Thursday |
|---|---|
| Jan 18 | Jan 20 Course Intro |
| Jan 25 Big Data Intro | Jan 27 Python, SciPy, Pandas Series |
| Feb 1 Dataframe, Data Manipulation | Feb 3 Data Manipulation |
| Feb 8 Plotting | Feb 10 Dashboards, Spark Intro |
| Feb 15 Spark-Pandas API, PySpark 2 | Feb 17 PySpark 2, Statistics |
| Feb 22 Statistics, Sampling | Feb 24 Sampling, Bloom, Pandas Alternatives |
| Mar 1 Pandas Alternatives | Mar 3 Regression |
| Mar 8 Assignment 1, Regression | Mar 10 Regression, scikit-learn, Bayes |
| Mar 15 Clustering | Mar 17 Spark ML |
| Mar 22 Spark ML | Mar 24 Spark Clustering |
| Mar 29 No Class Spring Break | Mar 31 No Class Spring Break |
| Apr 5 | Apr 7 |
| Apr 12 Running Spark | Apr 14 Running Spark, Partition |
| Apr 19 NoSQL, Cassandra | Apr 21 Cassandra |
| Apr 26 Kafka | Apr 28 Kafka, Spark Streaming |
| May 3 | May 5 |
| May 10 | May 12 Project Due |