RL Milestones
Table of Contents
Overview
Timeline Overview
1. TD-Gammon (1992)
Achievement
Core Algorithm
Key Formula
Historical Significance
2. DQN: Deep Q-Network (2013/2015)
Achievement
Core Algorithm
Key Innovation
Historical Significance
3. AlphaGo (2016)
Achievement
Core Algorithm
System Architecture
Historical Significance
4. AlphaZero (2017)
Achievement
Core Improvements
Key Results
Historical Significance
5. OpenAI Five (2019)
Achievement
Core Algorithm
Technical Details
Historical Significance
6. AlphaStar (2019)
Achievement
Core Algorithm
League Training Architecture
Historical Significance
7. MuZero (2020)
Achievement
Core Algorithm
Comparison with AlphaZero
Historical Significance
8. RLHF and ChatGPT (2022)
Achievement
Core Algorithm
Key Papers
Historical Significance
9. RT-2: Robotic Transformer (2023)
Achievement
Core Algorithm
Key Innovation
Historical Significance
10. o1: Reasoning Enhancement (2024)
Achievement
Core Approach
Key Insight
Subsequent Developments
Historical Significance
Milestone Summary
Development Trends
References
Further Reading