5/5 Your First Model: Navigating a Hugging Face Repo
A model is a folder. config.json describes the architecture. generation_config.json holds the sampling defaults. vocab.json is the tokenizer's vocabulary. Here's how to read them, and where to change the knobs.
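To make the "a model is a folder" idea concrete, here is a minimal sketch that opens the three files named above and pulls out a few fields. It assumes the repo has already been downloaded to a local directory. The function names are hypothetical. The keys it reads (`model_type`, `vocab_size`, `temperature`, `top_k`, `top_p`) are common in these files but not guaranteed, and many repos ship a `tokenizer.json` instead of `vocab.json`, so missing files and keys are treated as normal.

```python
import json
from pathlib import Path


def load_json(folder: Path, name: str) -> dict:
    """Return the parsed JSON file, or an empty dict if the file is absent."""
    path = folder / name
    if not path.exists():
        return {}
    with path.open(encoding="utf-8") as f:
        return json.load(f)


def summarize_model_folder(folder: str) -> dict:
    """Collect a few commonly present fields from a local model folder.

    Hypothetical helper for illustration; any key a given repo does not
    define comes back as None.
    """
    root = Path(folder)
    config = load_json(root, "config.json")                   # architecture
    generation = load_json(root, "generation_config.json")    # sampling defaults
    vocab = load_json(root, "vocab.json")                     # token string -> id
    return {
        "model_type": config.get("model_type"),
        "vocab_size": config.get("vocab_size"),
        "temperature": generation.get("temperature"),
        "top_k": generation.get("top_k"),
        "top_p": generation.get("top_p"),
        "tokens_in_vocab_file": len(vocab),
    }
```

Calling `summarize_model_folder("./my-model")` (a placeholder path) returns a plain dict you can print. Editing the values in `generation_config.json` is one way to change the sampling defaults without touching the weights.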
Articles in the Tokens In Logits Out series:
The model produces 50,257 scores. Sampling decides which one becomes the next token. Temperature, top-k, and top-p are your mixer sliders — here's what each one does, with numbers.
Attention is a weighted mix. Multi-head is a filter bank. The causal mask means no spoilers. Here's the transformer architecture without the math — then with just enough of it.
Tokenization is at the heart of every weird LLM behavior. Why they can't reverse strings, why Japanese costs more, why 'SolidGoldMagikarp' breaks them. Here's why — and what you can do about it.
You use ChatGPT, Gemini, Claude every day. But what's inside the box? Assistants are not models. Once you see the difference, you unlock controls most people don't know exist.