Lectures
You can download the lectures here. We will try to upload lectures prior to their corresponding classes.
-
Maximum Likelihood Estimation
tl;dr: Basics of Probability and Statistics, Information Theory, Maximum Likelihood Estimation
[slides] [recording]
Suggested Readings:
- Basics of Probability and Statistics
- Basics of Information Theory
- Discriminative vs. Generative Models
- Basics of Maximum Likelihood Estimation
- Basics of Optimization
-
Hidden Markov Models
tl;dr:
[slides] [recording 1] [recording 2]
Suggested Readings:
- Introduction to Hidden Markov Models.
- Inference, Deccoding, and Learning for Hidden Markov Models.
- The Viterbi Algorithm.
- The Baum-Welch Algorithm.
-
Linear Dynamical Systems
tl;dr:
[slides] [recording 1] [recording 2] [recording 3]
Suggested Readings:
- Introduction to Linear Dynamical Systems.
- Filtering and Smoothing for Linear Dynamical Systems.
- State Estimation for Linear Dynamical Systems.
- EM for Linear Dynamical Systems.
-
Recurrent Neural Networks
tl;dr:
[slides] [recording 1] [recording 2] [recording 3]
Suggested Readings:
- Introduction the RNN architecture
- Unfolding and Backpropagation through Time.
- Variants of RNNs: LSTMs, GRUs
-
BERT and Vision Transformers
tl;dr:
[slides] [recording 1] [recording 2]
Suggested Readings:
- BERT (https://arxiv.org/abs/1810.04805)
- ImageGPT (https://cdn.openai.com/papers/Generative_Pretraining_from_Pixels_V2.pdf)
- ViT (https://arxiv.org/abs/2010.11929)
-
Transformers for Vision and Language
tl;dr:
[slides] [recording 1] [recording 2]
Suggested Readings:
- ViLBERT (https://arxiv.org/abs/1908.02265)
- CLIP (https://arxiv.org/abs/2103.00020)
- DALL-E (https://arxiv.org/abs/2102.12092)
- LLaVA (https://arxiv.org/abs/2304.08485)
-
Denoising Diffusion Probabilistic Models
tl;dr:
[slides] [recording 1] [recording 2]
Suggested Readings:
- Introduction to Denoising Diffusion Probabilistic Models