11-411/611: Natural Language Processing (Fall 2025)

Place and Time: HOA 160, TR 2:00-3:20P

Course Description: This course is designed to be accessible to Masters and advanced undergraduate students who seek the basic skills necessary to implement practical Natural Language Processing (NLP) applications using Language Models (LMs) in specific information domains. The syllabus includes learning materials on the core concepts of NLP and LMs, and how they are applied in closed commercial systems (e.g. ChatGPT) as well as open systems (e.g. Llama, T5). Students complete a set of hands-on exercises in Python that develop skills in applying NLP for various practical problems.

Textbook: Jurafsky and Martin, "Speech and Language Processing"

Prerequisite Knowledge: Strong programming skills (in Python); A course in data structures and algorithms (or equivalent experience); A basic knowledge of probability theory and linear algebra

Course Goals: Students acquire basic knowledge of NLP approaches, including language representations, probability theory and language modeling, logistic and softmax regression, word embeddings, neural networks and large language models; and NLP tasks, such as document classification, parsing, knowledge representation and reasoning, translation, and question answering.

Grading (S'25, subject to revision):

Midterm Exam = 25%
Final Exam = 25%
Homeworks (4) = 50%

Syllabus (S'25, subject to revision):

NLP Landscape and History, Course Objectives
Representation in NLP
Designing, Evaluating, and Incrementally Improving NLP Systems
Probability Theory and Language Modeling
Naive Bayes and Document Classification
Logistic Regression
Softmax Regression
Feed-Forward Neural Networks
Word Embeddings & Distributional Semantics
Modeling Sequences: RNNs and NER
Encoder-Decoder Models, Beam Search
Self-Attention and Transformers
LLMs I: Pretraining, Encoder-only (BERT), Finetuning
LLMs II: Encoder-Decoder (T5) and Decoder-Only (GPT), ICL
LLMs III: RLHF, DPO, Guardrails
Ethics and NLP
Syntax and Parsing
Semantics and Reasoning over Knowledge Representations
Natural Language Inference
Machine Translation
Multilingual NLP
Information Extraction & Coreference
Question Answering I (information retrieval, information extraction)
Question Answering II (LLMs, prompting, RAG)

Last Updated March 25, 2025