Calendar
Week 1
- Sep 4
- Holiday: Labor Day
Week 2
- Sep 11
- Lecture: LLVM Introduction 1
- Saining Xie
LLVM Neural Network Architectures: Past, Present and Paths Forward
- Lecture: LLVM Introduction 2
- Saining Xie
LLVM Training objectives: Supervised learning, Self-Supervised Learning, Generative Models and Beyond
Reading materials
- The Illustrated Transformer
- Attention is All You Need (Optional, but recommended!)
- Sep 14
- Early Assignment Due: 5:00 PM ET
Week 3
- Sep 18
- Lecture: LLVM Introduction 3
- Saining Xie
LLVM Training objectives: Supervised learning, Self-Supervised Learning, Generative Models and Beyond
LLVM Data, Infrastructure, Model Training, Inference, Complexity, Metrics
- Lecture: Paper Reading and Reviewing
- Saining Xie
How to be a Good Citizen of the LLVM Community
Week 4
- Sep 25
- Paper Reading: Panel Discussion - Vision & Language
CLIP: Learning Transferable Visual Models From Natural Language Supervision, Radford et al.
- Paper Reading: Panel Discussion - Vision & Language
Week 5
- Oct 2
- Paper Reading: Panel Discussion - LLM & RLHF
- Paper Reading: Panel Discussion - LLM & RLHF
WebGPT: Browser-assisted question-answering with human feedback, Nakano et al.
Reading materials
- Scaling Laws for Neural Language Models, Kaplan et al.
- OpenAI’s blog on ChatGPT
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis et al.
- Learning to summarize from human feedback, Stiennon et al.
- Training language models to follow instructions with human feedback, Ouyang et al.
- John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
- LLaMA: Open and Efficient Foundation Language Models, Touvron et al.
- Alpaca: A Strong, Replicable Instruction-Following Model
- Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality
- Llama 2: Open Foundation and Fine-Tuned Chat Models, Touvron et al.
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models, Wei et al.
- Oct 5
- Proposal Due: 4:00 PM ET
Week 6
- Oct 9
- Proposal Review Sessions With Instructor (Check Email for Scheduling)
- Paper Reading: Panel Discussion - Multimodal LLM
Flamingo: a Visual Language Model for Few-Shot Learning, Alayrac et al.
- Paper Reading: Panel Discussion - Multimodal LLM
Week 7
- Oct 16
- Paper Reading: Panel Discussion - GenAI 1: Image
Diffusion Models Beat GANs on Image Synthesis, Dhariwal et al.
- Paper Reading: Panel Discussion - GenAI 1: Image
High-Resolution Image Synthesis with Latent Diffusion Models, Rombach et al.
Week 8
- Oct 23
- Paper Reading: Panel Discussion - GenAI 2: Video & 3D
Imagen Video: High Definition Video Generation with Diffusion Models, Ho et al.
- Paper Reading: Panel Discussion - GenAI 2: Video & 3D
Week 9
- Oct 30
- Paper Reading: Panel Discussion - Open World Perception
- Paper Reading: Panel Discussion - Open World Perception
DINOv2: Learning Robust Visual Features without Supervision, Oquab et al.
Week 10
- Nov 6
- Paper Reading: Panel Discussion - LLM Agent
Generative Agents: Interactive Simulacra of Human Behavior, Park et al.
- Paper Reading: Panel Discussion - LLM Agent
PaLM-E: An Embodied Multimodal Language Model, Driess et al.
Week 11
- Nov 13
- Paper Reading: Panel Discussion - Data
Whisper: Robust Speech Recognition via Large-Scale Weak Supervision, Radford et al.
- Paper Reading: Panel Discussion - Data
Reading materials
- Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling, Gandhi et al.
- WhisperX: Time-Accurate Speech Transcription of Long-Form Audio, Bain et al.
- DataComp: In search of the next generation of multimodal datasets, Gadre et al.
- LAION-AESTHETICS, Schuhmann et al.
- WebLI - PaLI Dataset, Chen et al.
Week 12
- Nov 20
- Paper Reading: Panel Discussion - Infra and Training
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning, Dao
- Paper Reading: Panel Discussion - Infra and Training
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models, Rajbhandari et al.
Week 13
- Nov 27
- Paper Reading: Panel Discussion - AI Safety, Privacy, Law, Alignment
Llama 2: Open Foundation and Fine-Tuned Chat Models, Touvron et al. (with a focus on the safety design and discussions)
- Paper Reading: Panel Discussion - AI Safety, Privacy, Law, Alignment
Reading materials
- Extracting Training Data from Large Language Models, Carlini et al.
- Towards Measuring the Representation of Subjective Global Opinions in Language Models, Durmus et al.
- Generative AI is a minefield for copyright law, Mahari et al.
- Frontier AI Regulation: Managing Emerging Risks to Public Safety, Anderljung et al.
- MART: Improving LLM Safety with Multi-round Automatic Red-Teaming, Ge et al.
- CSCI-GA 3850-007 PhD Seminar: AI: Challenges in Law, Policy, and Ethics
Week 14
- Dec 4
- Paper Reading: Panel Discussion - AI4Science
- Paper Reading: Panel Discussion - AI4Science
Week 15
- Dec 11
- Final Presentation: Final Project Presentation