Alice's adventures in a differentiable wonderland

https://www.sscardapane.it/alice-book

Neural networks surround us, in the form of large language models, speech transcription systems, molecular discovery algorithms, robotics, and much more. Stripped of anything else, neural networks are compositions of differentiable primitives, and studying them means learning how to program and how to interact with these models, a particular example of what is called differentiable programming.

This primer is an introduction to this fascinating field imagined for someone, like Alice, who has just ventured into this strange differentiable wonderland. I overview the basics of optimizing a function via automatic differentiation, and a selection of the most common designs for handling sequences, graphs, texts, and audios. The focus is on a intuitive, self-contained introduction to the most important design techniques, including convolutional, attentional, and recurrent blocks, hoping to bridge the gap between theory and code (PyTorch and JAX) and leaving the reader capable of understanding some of the most advanced models out there, such as large language models (LLMs) and multimodal architectures.

Download or buy the book

Buy the book on the Amazon stores (independently published):
United States | United Kingdom | Germany | France | Spain | Italy | Netherlands | Poland | Sweden | Canada | Australia
Downloaded the updated full draft (29/08/2025).
A static preprint is also available on arXiv (arXiv 2404.17625).
For the differences between the versions: errata list.

License

The book is released under CC BY-SA license. This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. If you remix, adapt, or build upon the material, you must license the modified material under identical terms.

Additional material

Guided lab sessions covering most of the topics in the book (PyTorch, JAX), updated continuously.
Unofficial Italian translation (incomplete) by Giovanni Borrelli.

Additional chapters

I will publish here additional chapters on advanced material that I could not fit into the first volume. Eventually, I hope these will be part of a second volume. More probably, they will languish here forever.

Advanced automatic differentiation [link], covers differentiation in generic vector spaces, computational graphs, implicit differentiation (briefly), and provides an overview of multilinear algebra (draft, may have many typos, updated 02/09/25).
Post-training algorithms [link], has an historical overview of post-training LLMs (from 2012 to the present) and a technical focus on reinforcement learning (draft, may have many typos, updated 10/12/25).

Chapter 1: Foreword and introduction
Chapter 2: Mathematical preliminaries
Chapter 3: Datasets and losses
Chapter 4: Linear models
Chapter 5: Fully-connected layers
Chapter 6: Automatic differentiation
Chapter 7: Convolutional layers
Chapter 8: Convolutions beyond images
Chapter 9: Scaling up the models
Chapter 10: Transformer models
Chapter 11: Transformers in practice
Chapter 12: Graph layers
Chapter 13: Recurrent layers
Appendix A: Probability theory
Appendix B: 1D universal approximation

{
  "by": "tosh",
  "descendants": 97,
  "id": 40213292,
  "kids": [
    40214349,
    40215053,
    40224216,
    40215080,
    40213790,
    40214712,
    40213461
  ],
  "score": 235,
  "time": 1714496616,
  "title": "Alice's adventures in a differentiable wonderland",
  "type": "story",
  "url": "https://www.sscardapane.it/alice-book"
}

{
  "author": "TL;DR 👇",
  "date": null,
  "description": "My personal website, where I collect slides, publications, and presentations.",
  "image": "https://www.sscardapane.it/assets/alice/Alice.png",
  "logo": null,
  "publisher": "Simone Scardapane",
  "title": "Book: Alice’s Adventures in a differentiable wonderland",
  "url": "https://sscardapane.it/alice-book/"
}

{
  "url": "https://sscardapane.it/alice-book/",
  "title": "Book: Alice’s Adventures in a differentiable wonderland",
  "description": "Neural networks surround us, in the form of large language models, speech transcription systems, molecular discovery algorithms, robotics, and much more. Stripped of anything else, neural networks are...",
  "links": [
    "https://sscardapane.it/alice-book/",
    "https://www.sscardapane.it/alice-book"
  ],
  "image": "",
  "content": "<section>\n<p><img src=\"https://sscardapane.it/assets/alice/Alice.png\" /></p>\n<p><strong>Neural networks</strong> surround us, in the form of large language models, speech transcription systems, molecular discovery algorithms, robotics, and much more. Stripped of anything else, neural networks are compositions of <strong>differentiable primitives</strong>, and studying them means learning how to program and how to interact with these models, a particular example of what is called <a target=\"_blank\" href=\"https://www.facebook.com/yann.lecun/posts/10155722686332143\">differentiable programming</a>.</p>\n<p>This primer is an introduction to this fascinating field imagined for someone, like Alice, who has just ventured into this strange <em>differentiable</em> wonderland. I overview the basics of optimizing a function via automatic differentiation, and a selection of the most common designs for handling sequences, graphs, texts, and audios. The focus is on a intuitive, <strong>self-contained introduction</strong> to the most important design techniques, including convolutional, attentional, and recurrent blocks, hoping to bridge the gap between theory and code (PyTorch and JAX) and leaving the reader capable of understanding some of the most advanced models out there, such as large language models (LLMs) and multimodal architectures.</p>\n<h3 id=\"download-or-buy-the-book\">Download or buy the book</h3>\n<ul>\n  <li>Buy the book on the Amazon stores (independently published): <br />  <a target=\"_blank\" href=\"https://www.amazon.com/dp/B0D9QHS5NG\">United States</a> | <a target=\"_blank\" href=\"https://www.amazon.co.uk/dp/B0D9QHS5NG\">United Kingdom</a> | <a target=\"_blank\" href=\"https://www.amazon.de/dp/B0D9QHS5NG\">Germany</a> | <a target=\"_blank\" href=\"https://www.amazon.fr/dp/B0D9QHS5NG\">France</a> | <a target=\"_blank\" href=\"https://www.amazon.es/dp/B0D9QHS5NG\">Spain</a> | <a target=\"_blank\" href=\"https://www.amazon.it/dp/B0D9QHS5NG\">Italy</a> | <a target=\"_blank\" href=\"https://www.amazon.nl/dp/B0D9QHS5NG\">Netherlands</a> | <a target=\"_blank\" href=\"https://www.amazon.pl/dp/B0D9QHS5NG\">Poland</a> | <a target=\"_blank\" href=\"https://www.amazon.se/dp/B0D9QHS5NG\">Sweden</a> | <a target=\"_blank\" href=\"https://www.amazon.ca/dp/B0D9QHS5NG\">Canada</a> | <a target=\"_blank\" href=\"https://www.amazon.com.au/dp/B0D9QHS5NG\">Australia</a></li>\n  <li>Downloaded the <a target=\"_blank\" href=\"https://sscardapane.it/assets/alice/Alice_book_volume_1.pdf\">updated full draft</a> (<strong>29/08/2025</strong>).</li>\n  <li>A static preprint is also available on <a target=\"_blank\" href=\"https://arxiv.org/abs/2404.17625\">arXiv</a> (arXiv 2404.17625).</li>\n  <li>For the differences between the versions: <a target=\"_blank\" href=\"https://sscardapane.it/assets/alice/errata_list.pdf\">errata list</a>.</li>\n</ul>\n<h3 id=\"license\">License</h3>\n<p><img src=\"https://sscardapane.it/assets/images/by-sa.png\" /></p>\n<p>The book is released under <a target=\"_blank\" href=\"https://creativecommons.org/licenses/by-sa/4.0/\">CC BY-SA license</a>. This license enables <em>reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. If you remix, adapt, or build upon the material, you must license the modified material under identical terms</em>.</p>\n<h3 id=\"additional-material\">Additional material</h3>\n<ul>\n  <li><a target=\"_blank\" href=\"https://www.notion.so/sscardapane/Guided-lab-sessions-18c25bd12a8c8068b972f7612fcde8d5\">Guided lab sessions</a> covering most of the topics in the book (PyTorch, JAX), updated continuously.</li>\n  <li>Unofficial <a target=\"_blank\" href=\"https://github.com/GiovanniBorrelli/Traduzione-di-Alice-s-Adventures-in-a-differentiable-wonderland/tree/main\">Italian translation</a> (incomplete) by Giovanni Borrelli.</li>\n</ul>\n<h3 id=\"additional-chapters\">Additional chapters</h3>\n<p>I will publish here additional chapters on advanced material that I could not fit into the first volume. Eventually, I hope these will be part of a second volume. More probably, they will languish here forever.</p>\n<ol>\n  <li><strong>Advanced automatic differentiation</strong> [<a target=\"_blank\" href=\"https://sscardapane.it/assets/alice/Alice_volume_2_autodiff.pdf\">link</a>], covers differentiation in generic vector spaces, computational graphs, implicit differentiation (briefly), and provides an overview of multilinear algebra (<em>draft</em>, may have many typos, updated 02/09/25).</li>\n  <li><strong>Post-training algorithms</strong> [<a target=\"_blank\" href=\"https://sscardapane.it/assets/alice/Alice_volume_2_rl.pdf\">link</a>], has an historical overview of post-training LLMs (from 2012 to the present) and a technical focus on reinforcement learning (<em>draft</em>, may have many typos, updated 10/12/25).</li>\n</ol>\n<h3 id=\"table-of-contents\">Table of contents</h3>\n<div>\n<ol>\n<li> Chapter 1: Foreword and introduction</li>\n<li> Chapter 2: Mathematical preliminaries</li>\n<li> Chapter 3: Datasets and losses</li>\n<li> Chapter 4: Linear models</li>\n<li> Chapter 5: Fully-connected layers</li> \n<li> Chapter 6: Automatic differentiation</li>\n<li> Chapter 7: Convolutional layers</li>\n<li> Chapter 8: Convolutions beyond images</li>\n<li> Chapter 9: Scaling up the models</li>\n<li> Chapter 10: Transformer models</li>\n<li> Chapter 11: Transformers in practice</li>\n<li> Chapter 12: Graph layers</li>\n<li> Chapter 13: Recurrent layers</li>\n<li> Appendix A: Probability theory</li>\n<li> Appendix B: 1D universal approximation</li>\n</ol>\n</div>\n    </section>",
  "author": "TL;DR 👇",
  "favicon": "",
  "source": "sscardapane.it",
  "published": "",
  "ttr": 95,
  "type": "website"
}