Foundational Papers That Made AI What It Is Today
From early neural networks and optimization methods to transformers, diffusion models, and LLMs.
1. Foundations of Machine Learning
Early work on perceptrons, backpropagation, and neural network approximation laid the groundwork for deep learning.
2. Optimization & Training Deep Networks
Techniques that made training deep neural networks stable and efficient at scale.
3. Deep Learning for Computer Vision
Breakthrough convolutional network architectures that unlocked modern computer vision.
4. Representation Learning
Learning compact, useful representations of data—embeddings, language models, and more.
5. Transformers & Large Language Models
The architectures and models that defined the transformer era in NLP and beyond.
6. Reinforcement Learning Landmarks
From tabular Q-learning to deep RL and game-playing agents like AlphaGo and AlphaZero.
7. Generative Models
GANs, VAEs, and diffusion models that power modern image and media generation.
8. Scaling & Foundation Models
Work on scaling laws, open large models, and efficient fine-tuning that drives current foundation model development.