Generative AI

10-423 + 10-623, Spring 2024
School of Computer Science
Carnegie Mellon University


Important Notes

This schedule is tentative and subject to change. Please check back often.

Tentative Schedule

Date Lecture Readings Announcements

Generative models of text

Wed, 17-Jan Lecture 1 : RNN LMs / Autodiff
[Slides] [Slides (Inked)]

HW0 out

Fri, 19-Jan Recitation: HW0
[Handout]

Mon, 22-Jan Lecture 2 : Transformer LMs
[Slides] [Slides (Inked)]

Wed, 24-Jan Lecture 3 : Learning LLMs / Decoding
[Slides] [Slides (Inked)]

HW0 due

Fri, 26-Jan Recitation: HW1
[Handout]

HW1 out (L1-L4)

Mon, 29-Jan Lecture 4 : Pre-training, fine-tuning / Modern Transformers / CNNs
[Slides] [Slides (Inked)]

Add/audit/tuition drop deadline

Generative models of images

Wed, 31-Jan Lecture 5 : Encoder-only Transformers / Vision Transformers
[Slides] [Slides (Inked)]

Quiz 1 (in-class, L1-L3)

Fri, 2-Feb (No Recitation)

Mon, 5-Feb Lecture 6 : Generative Adversarial Networks (GANs)
[Slides] [Slides (Inked)]

Wed, 7-Feb Lecture 7 : Diffusion models (Part I)
[Slides] [Slides (Inked)] [Whiteboard]

Fri, 9-Feb Lecture 8 : Diffusion models (Part II)
[Slides] [Slides (Inked)]

HW1 due

HW2 out (L4-L7)

Mon, 12-Feb Recitation: HW2
[Handout]

Wed, 14-Feb Lecture 9 : Variational Autoencoders (VAEs)
[Slides] [Slides (Inked)]

Quiz 2 (in-class, L4-L8)

Fri, 16-Feb (No Recitation)

Applying and adapting foundation models

Mon, 19-Feb Lecture 10 : In-context learning for text & for vision
[Slides] [Slides (Inked)]

Tue, 20-Feb

HW2 due

HW3 out (L8-L11)

Wed, 21-Feb Lecture 11 : Parameter-efficient fine tuning
[Slides] [Slides (Inked)]

Fri, 23-Feb Recitation: HW3
[Handout]

Mon, 26-Feb Lecture 12 : Reinforcement learning with human feedback (RLHF)
[Slides] [Slides (Inked)]

(Quiz 3 in-class, L9-L11)

Semester Course Drop Deadline

Multimodal foundation models

Wed, 28-Feb Lecture 13 : Text-to-image generation / Aligning multimodal representations (CLIP)
[Slides] [Slides (Inked)]

Send out course Survey

Thu, 29-Feb Lecture 13.5 : Prompt-to-Prompt
[Slides] [Slides (Inked)]

HW3 due

Fri, 1-Mar (No Recitation)

Mon, 4-Mar Spring break

Tue, 5-Mar

Wed, 6-Mar Spring break

Thu, 7-Mar

Fri, 8-Mar Spring break

Mon, 11-Mar Lecture 14 : Visual-language models
[Slides]

Midsemester Grades Due

Wed, 13-Mar Lecture 15 : Scaling Up (Part 1)
[Slides]

HW4 out (L12-L15)

Fri, 15-Mar Recitation: HW4
[Handout] [Supplemental Material]

Scaling up

Mon, 18-Mar Lecture 16 : Scaling Up Part II: Mixture of Experts
[Slides]

Wed, 20-Mar Lecture 17 : Scaling Up Part 3: Attention
[Slides]

(Quiz 4 in-class, L12-L15)

Fri, 22-Mar (No Recitation)

Sun, 24-Mar

HW4 due

Mon, 25-Mar Lecture 18 : Distributed training
[Slides]

Wed, 27-Mar (No lecture)

Project description out

Thu, 28-Mar

HW623 out

Fri, 29-Mar Lecture 19 : How to Train Large Language Models

Mon, 1-Apr In-Class Exam

Pass/no pass & withdrawal deadline

Tue, 2-Apr

Project team formation due by 2pm

Advanced Topics

Wed, 3-Apr Lecture 20 : Long Context in LLM

Fri, 5-Apr (No Recitation)

Project proposal due

Mon, 8-Apr Lecture 21 : Generative Models for Videos

Tue, 9-Apr

HW623 due

Wed, 10-Apr Lecture 22 : Legacy Lecture

Thu, 11-Apr Spring Carnival - No Class

Fri, 12-Apr Spring Carnival - No Class

Mon, 15-Apr Lecture 23 : Speech Recognition Models

(Quiz 5 in-class, L16-L20)

Registration week

Wed, 17-Apr Lecture 24 : Introduction to Cuda Programming

Thu, 18-Apr

Fri, 19-Apr (No Recitation)

Sat, 20-Apr

Project midway report due

Mon, 22-Apr Lecture 25 : State Space Model

Wed, 24-Apr Lecture 26 : Introduction to Infrastructures of Large-Scale Language Model Training (>10K GPU nodes)

Fri, 26-Apr (No Recitation)

Last day of classes

Wed, 1-May

Project poster due

Fri, 3-May Project Final Presentations

Project final report due