WANT SWEET DEALS? JOIN OUR MAILING LIST
The Craft of Post-Training

The Craft of Post-Training

A Practical Guide for AI Engineers and Developers
by Chris von Csefalvay
September 2026, 416 pp.
ISBN-13: 
9781718505209
Use coupon code PREORDER to get 25% off!

If you're a practitioner who has watched a promising AI demo fail to survive contact with production, where prompting hits its ceiling, retrieval isn't enough, and the model still can't be trusted with your domain, post-training is what you've been missing.

The Craft of Post-Training is a practical guide to turning foundation models into production-ready systems — reshaping behavior, aligning to your values, and deploying with confidence. Each technique is taught concept-first, then implementation-through-code, so you understand not just what to run, but what you're actually changing inside the model.

You'll leave with the skills to:

  • Fine-tune models on curated datasets using supervised fine-tuning, LoRA, and QLoRA without destroying the base model's general capabilities
  • Apply reinforcement learning from human feedback and modern preference optimization methods, including GRPO, ORPO, and beyond, to shape model behavior
  • Evaluate models rigorously: design benchmarks, detect regression, and measure quality claims that survive scrutiny
  • Adapt models to specialized domains, from clinical language to legal text, turning general capability into a defensible competitive advantage
  • Train agentic models that take sequences of actions reliably, not just models that talk about taking actions
  • Quantize and compress fine-tuned models for deployment without sacrificing the gains you trained for

Post-training is where models stop being impressive and start being useful. This book teaches you to do it right.

Author Bio 

Chris von Csefalvay is a Principal at HCLTech's AI Practice, where he leads post-training research and clinical intelligence. He has held senior data science leadership roles across major enterprises, published extensively on distributed computing for ML, and designed language models for applications ranging from pharmacovigilance to social dynamics. He holds degrees from the University of Oxford and Cardiff University and is a Fellow of the Royal Society for Public Health and Senior Member of IEEE.

Table of contents 

Acknowledgments
Preface

Part I: The Foundation
Chapter 1: Post-Training Essentials: What Is and Why It Matters
Chapter 2: Prerequisites for Success: Before You Fine-Tune

Part II: The Tools
Chapter 3: Supervised Fine-Tuning: The Foundation Technique
Chapter 4: Reinforcement Learning: Better Each Time
Chapter 5: Preference Optimization Modern Alternatives to PPO
Chapter 6: Evaluation Strategies: Measuring Model Quality

Part III: The Craft
Chapter 7: Efficiency Techniques: Quantization and Compression
Chapter 8: Domain Adaptation: Make It Yours
Chapter 9: Agentic Models: Deeds, Not Words
Chapter 10: Reasoning Capabilities: Training for Complex Thought

Part IV: The Frontier
Chapter 11: Synthetic Training: Self-Play and Generated Data
Chapter 12: Multimodal Systems: Post-Training Beyond Text
Chapter 13: Future Directions: What Comes Next
Bibliography

The chapters in red are included in this Early Access PDF.