Blog
You Can't Learn From Zero Rewards
October 10, 2025 • 1 min read
Advice on advice
September 29, 2025 • 2 min read
AI-to-Human Knowledge Distillation
September 20, 2025 • 1 min read
Addicted to Cursor
September 16, 2025 • 3 min read
Startup as RL problem
September 14, 2025 • 4 min read
Bottom-up vs Top-down
September 14, 2025 • 3 min read
DeepSeek's open-source week
March 3, 2025 • 34 min read
Guide