Acknowledgments
Preface
Part I: The Foundation
Chapter 1: Post-Training Essentials: What Is and Why It Matters
Chapter 2: Prerequisites for Success: Before You Fine-Tune
Part II: The Tools
Chapter 3: Supervised Fine-Tuning: The Foundation Technique
Chapter 4: Reinforcement Learning: Better Each Time
Chapter 5: Preference Optimization Modern Alternatives to PPO
Chapter 6: Evaluation Strategies: Measuring Model Quality
Part III: The Craft
Chapter 7: Efficiency Techniques: Quantization and Compression
Chapter 8: Domain Adaptation: Make It Yours
Chapter 9: Agentic Models: Deeds, Not Words
Chapter 10: Reasoning Capabilities: Training for Complex Thought
Part IV: The Frontier
Chapter 11: Synthetic Training: Self-Play and Generated Data
Chapter 12: Multimodal Systems: Post-Training Beyond Text
Chapter 13: Future Directions: What Comes Next
Bibliography
The chapters in red are included in this Early Access PDF.
