LLM Fundamentals
The Art of Sampling: Controlling Randomness in LLMs
A Mental Model for Temperature, Top-k, and Top-p
Machine Learning
Deep Dive into LLMs like ChatGPT by Andrej Karpathy - TL;DW version
I think everything that Andrej Karpathy shares on X or YouTube is a goldmine of information, and his latest video,
LLM Scaling
How to think about LLM Model Size
Breaking Down Parameters, Training Data, and Compute