Tokens In Logits Out

Browse subfolders and articles in Tokens In Logits Out

Tags in this section

AI API BPE GPT-2 Groq Hugging Face LLM OpenAI SentencePiece attention sampling temperature tokenization top-k top-p transformer

Articles

August 30, 2025 · 5 min read

5/5 Your First Model: Navigating a Hugging Face Repo

A model is a folder. config.json is the architecture. generation_config.json is the sampling defaults. vocab.json is the tokenizer. Here's how to read them — and where to change the knobs.

#LLM #Hugging Face #GPT-2 #API #Groq #OpenAI #AI

August 27, 2025 · 6 min read

4/5 Be the Language DJ: Temperature, Top-k, and Top-p

The model produces 50,257 scores. Sampling decides which one becomes the next token. Temperature, top-k, and top-p are your mixer sliders — here's what each one does, with numbers.

#LLM #sampling #temperature #top-p #top-k #GPT-2 #AI

August 24, 2025 · 6 min read

3/5 The Transformer in 90 Seconds (Then the Other 900)

Attention is a weighted mix. Multi-head is a filter bank. The causal mask means no spoilers. Here's the transformer architecture without the math — then with just enough of it.

#LLM #transformer #attention #GPT-2 #AI

August 21, 2025 · 5 min read

2/5 Why Can't LLMs Spell 'Raspberry'? It's Tokenization.

Tokenization is at the heart of every weird LLM behavior. Why they can't reverse strings, why Japanese costs more, why 'SolidGoldMagikarp' breaks them. Here's why — and what you can do about it.

#LLM #tokenization #BPE #SentencePiece #GPT-2 #AI

August 18, 2025 · 4 min read

1/5 Tokens In, Logits Out: What's Actually Inside ChatGPT

You use ChatGPT, Gemini, Claude every day. But what's inside the box? Assistants are not models. Once you see the difference, you unlock controls most people don't know exist.

#LLM #GPT-2 #transformer #AI

Posts