Date |
Topic |
Assignment |
|
Notes
- The exact topics of the lectures are subject to change.
- We do not anticipate changing any of the other dates (exams, assignments, etc.)
- To watch the lecture videos, sign in to YouTube using your andrew.cmu.edu account.
|
Aug |
29 |
Why parallelism?
(slides;
video
) |
|
|
Aug |
31 |
Modern multicore processors
(slides;
video) |
|
Sep |
2 |
Parallel programming models
(slides;
video
) |
Assignment 1 out |
|
|
Sep |
7 |
Graphic processing units and CUDA (slides ,
video (part 1),
video (part 2))
|
|
Sep |
9 |
Parallel programming basics (slides
, video ) |
|
|
|
Sep |
12 |
Work distribution and scheduling (slides
, video
) |
Assignment 1 due, assignment 2 out |
|
Sep |
14 |
Locality, communication, and contention
(slides
, video
) |
|
|
Sep |
16 |
Application case studies
(slides
, video
) |
|
|
|
Sep |
19 |
Workload-driven performance evaluation
(slides
,video (part 1), video (part 2), video (part 3)
) |
|
|
Sep |
21 |
Snooping-based cache coherence (slides, video (part 1),
video (part 2),
video (part 3)) |
|
|
Sep |
23 |
Directory-based cache coherence (slides, video (part 1),
video (part 2))
|
|
|
|
Sep |
26 |
Snooping implementation (slides,
video (part 1),
video (part 2)) |
Assignment 3 out |
|
Sep |
28 |
Performance Measurement and Tuning (slides. video (part 1),
video (part 2))
|
Assignment 2 due |
|
Sep |
30 |
Virtual Memory
(slides,
video)
|
|
|
|
Oct |
3 |
Interconnection networks (slides,
video (part 1),
video (part 2),
exam review video (NOTE: ignore the discussion of exam logistics - it is outdated)) |
)
|
|
Oct |
5 |
Exam 1 |
|
|
Oct |
7 |
Memory consistency
(slides
, video
) |
|
|
|
Oct |
10 |
Implementing synchronization (slides,
video (part 1),
video (part 2),
video (part 3)) |
|
|
Oct |
12 |
Fine-grained synchronization, lock-free programming
(slides,
video (part 1),
video (part 2),
video (part 3)) |
Assignment 3 due, Assignment 4 out |
|
Oct |
14 |
Tolerating latency through prefetching
(slides
,
video
) |
|
|
|
Oct |
24 |
Heterogenous parallelism (slides,
video (part 1),
video (part 2),
video (part 3)) |
|
|
Oct |
26 |
Parallel Deep Learning (Part 1)
(slides ,
video for data parallelism) |
|
|
|
|
Oct |
31 |
Parallel Deep Learning (Part 2)
(slides ,
video for model parallelism, video for pipeline parallelism) |
|
|
|
Nov |
2 |
Guest lecture (Randy Huang, Amazon AWS) |
Assignment 4 due |
|
Nov |
4 |
Domain-specific programming languages
(slides ,
video) |
|
|
|
Nov |
7 |
Project Idea Meetings |
|
|
Nov |
9 |
Domain-specific programming on graphs
(slides ,
video) |
Project Proposal Due |
|
Nov |
11 |
Under the Hood: Message Passing Implementation
(slides
, video
)
| |