Jeff Liu Lab
HomeProjectsWorkshopAI WikiAI LabShop
Sign In
All
Computing Science
Artificial Intelligence
Deep Learning
Reinforcement Learning
AI Agents
Embodied Intelligence
Robot Engineering
Human-Like Intelligence
AI Engineering
← Back to Wiki
Reinforcement Learning
RL Landscape
RL Milestones

Comments (0)

Sign in to comment

Table of Contents
OverviewTimeline Overview1. TD-Gammon (1992)AchievementCore AlgorithmKey FormulaHistorical Significance2. DQN: Deep Q-Network (2013/2015)AchievementCore AlgorithmKey InnovationHistorical Significance3. AlphaGo (2016)AchievementCore AlgorithmSystem ArchitectureHistorical Significance4. AlphaZero (2017)AchievementCore ImprovementsKey ResultsHistorical Significance5. OpenAI Five (2019)AchievementCore AlgorithmTechnical DetailsHistorical Significance6. AlphaStar (2019)AchievementCore AlgorithmLeague Training ArchitectureHistorical Significance7. MuZero (2020)AchievementCore AlgorithmComparison with AlphaZeroHistorical Significance8. RLHF and ChatGPT (2022)AchievementCore AlgorithmKey PapersHistorical Significance9. RT-2: Robotic Transformer (2023)AchievementCore AlgorithmKey InnovationHistorical Significance10. o1: Reasoning Enhancement (2024)AchievementCore ApproachKey InsightSubsequent DevelopmentsHistorical SignificanceMilestone SummaryDevelopment TrendsReferencesFurther Reading

© 2026 Jeff Liu Lab. All rights reserved.

AboutPricingPrivacy & TermsContact