↓
Skip to main content
davethehuman.com
About
Blog
Brain
Projects
About
Blog
Brain
Projects
Notes
2026
RL
13 May 2026
·
3 words
·
1 min
reinforcement learning
13 May 2026
·
116 words
·
1 min
reasoning
13 May 2026
·
127 words
·
1 min
inference-time compute scaling
13 May 2026
·
66 words
·
1 min
inference-compute scaling
13 May 2026
·
4 words
·
1 min
supervised fine-tuning
12 May 2026
·
23 words
·
1 min
preference tuning
12 May 2026
·
22 words
·
1 min
LLM training pipeline
12 May 2026
·
20 words
·
1 min
chain-of-thought
7 May 2026
·
135 words
·
1 min
pre-training
5 May 2026
·
59 words
·
1 min
←
1
2
3
→
↑