It’s here: My new book on Language Models

The Hundred-Page Language Models Book by Andriy Burkov is arrived

Jan 16, 2025

After 9 months of hard work, I'm excited to announce that my book, The Hundred-Page Language Models Book is now available for order on Amazon and Leanpub.

If you've been following my project from the beginning and know what it's about, here's the quick link. If not, read on to learn everything about this release.

The Hundred-Page Language Models Book, a follow-up to my bestselling The Hundred-Page Machine Learning Book, takes readers on a hands-on journey from machine learning and language modeling fundamentals to the cutting edge of modern Large Language Models (LLMs). Following my signature hundred-page format, readers will master both theoretical concepts and practical implementations, making this an invaluable resource for technical leaders and data scientists.

The Hundred-Page Language Models Book enables you to:

Master the mathematical foundations of modern machine learning and neural networks needed for language model development
Build and train three different language model architectures using Python
Understand and implement a Transformer language model from scratch in PyTorch
Work effectively with LLMs through instruction finetuning and prompt engineering techniques

Written in a hands-on style with working Python code examples, the book progressively builds your understanding of language modeling. All code examples are available on Github and run in Google Colab, making it accessible to anyone with a modern laptop.

Get $150 in Free GPU Credits

Purchase the book and receive $150 in free GPU credits on Lambda. Simply email your proof of purchase to author@thelmbook.com to claim your credits.

Not every hands-on book comes with top cloud GPU access. This one does!

About the technology

Large language models have fundamentally transformed how machines process and generate information. They are reshaping white-collar jobs at a pace comparable only to the revolutionary impact of personal computers. Understanding the mathematical foundations and inner workings of language models has become crucial for maintaining relevance and competitiveness in an increasingly automated workforce.

About the book

The Hundred-Page Language Models Book takes a unique approach by introducing language modeling concepts gradually, starting with foundational methods before advancing to modern architectures. Each chapter builds upon the previous one, making complex concepts accessible through clear explanations, diagrams, and practical implementations.

What's inside

Essential machine learning and neural network fundamentals
Text representation techniques and basic language modeling
Implementation of RNNs and Transformer architectures with PyTorch
Practical guidance on finetuning language models and prompt engineering
Important considerations on hallucinations and ways to evaluate models
Additional resources for advanced topics through the book's wiki at thelmbook.com/wiki

About the reader

Readers should have programming experience in Python. While familiarity with PyTorch and tensors is helpful, it's not required. College-level math knowledge is beneficial, but the book presents mathematical concepts intuitively with clear examples and diagrams.

Endorsements

Vint Cerf, Internet pioneer and Turing Award recipient: "This book cleared up a lot of conceptual confusion for me about how Machine Learning actually works—it is a gem of clarity."

Tomáš Mikolov, the author of word2vec and FastText: "The book is a good start for anyone new to language modeling who aspires to improve on state of the art."

Bob van Luijt, CEO and Co-Founder of Weaviate: “Andriy's long-awaited sequel in his "The Hundred-Page" series of machine learning textbooks is a masterpiece of concision.”

Jorge Torres, CEO at MindsDB: “Andriy has this almost supernatural talent for shrinking epic AI concepts down to bite-sized, ‘Ah, now I get it!’ moments.”

Florian Douetteau, Co-founder and CEO at Dataiku: “Andriy paints for us, in 100 marvelous strokes, the journey from linear algebra basics to the implementation of transformers.”

Andre Zayarni, Co-founder and CEO at Qdrant: “Andriy's book is an incredibly concise, clear, and accessible introduction to machine learning.”

Jerry Liu, Co-founder and CEO at LlamaIndex: “This is one of the most comprehensive yet concise handbooks out there for truly understanding how LLMs work under the hood.”

Where to buy the book

The book is now available in both paperback and e-book formats on Amazon and Leanpub. You can find it on your local Amazon website by searching for "The Hundred-Page Language Models Book."Links to some stores:

And don't forget: as a bonus, you'll receive $150 in GPU cloud credits for your hands-on experiments. To claim them, simply email your proof of purchase to author@thelmbook.com.

I hope you enjoy reading this book as much as I enjoyed writing it!

Andriy

True Positive Weekly

Discussion about this post