|
|
The course schedule will be posted here.
Week | Date | Topic | Reference Material |
1 | 08-24 | Introduction and overview |
[1],
[2],
[3],
[4],
[5],
pptx,
pdf
|
| 08-26 |
Introduction to parallel computing with Spark |
pptx,
pdf
|
2 | 08-31 | Introduction to parallel computing with Spark (contd.) |
|
| 09-02 | BTrace tutorial |
|
3 | 09-07 | Deep dive into techniques for parallel execution |
pptx,
pdf
|
| 09-09 |
Deep dive into techniques for parallel execution (contd.)
|
pptx,
pdf
|
4 | 09-14 | Introduction to the MapReduce computation model |
pptx,
pdf,
algorithms
|
| 09-16 | How MapReduce works |
ppt,
pdf
|
5 | 09-21 | How MapReduce works (contd.) |
ppt,
pdf
|
| 09-23 | Data Partitioning and Assignment |
Chapter 2 from Foundations article
|
6 | 09-28 | Introduction to Amazon Web Services |
|
| 09-30 | Midterm 1 |
|
7 | 10-05 | SQL Query Processing |
ppt,
pdf
|
| 10-07 | SQL Query Processing (contd.) |
ppt,
pdf
|
8 | 10-12 | Fall Break |
|
| 10-14 | Pipelined Query Execution |
ppt,
pdf,
notes
|
9 | 10-19 | SQL Query Plan Selection |
ppt,
pdf
|
| 10-21 | SQL Query Plan Selection (contd.) |
ppt,
pdf
|
10 | 10-26 | Introduction to Data Stream Processing |
ppt,
pdf
|
| 10-28 | Role of Kafka in Modern Data Processing |
reading
|
11 | 11-02 | Distributed Data Stream Processing (Execution) |
reading
(along with the other three parts of this blog series),
reading
|
| 11-04 | Distributed Data Stream Processing (Fault Tolerance) |
reading
(along with the other three parts of this blog series),
reading,
reading
|
12 | 11-09 | Midterm 1 Review |
Some solutions were posted on Sakai
|
| 11-11 | Midterm 2 |
|
13 | 11-16 | No class |
|
| 11-18 | Data Stream Processing with Micro Batches |
reading,
reading
|
14 | 11-23 | Graph and Iterative Processing |
pptx,
pdf
|
| 11-25 | Thanksgiving break |
|
15 | 11-30 | NoSQL Systems |
reading,
pptx,
pdf
|
| 12-02 | NoSQL Systems |
pptx,
pdf
|
|