technique

Fine-tuning

Fine-tuning updates a pretrained model's weights on task-specific data, baking the new behaviour into the model rather than relying on prompts.

Fine-tuning takes a foundation model and trains it further on domain-specific examples (often via LoRA or QLoRA adapters rather than full weight updates) so the new behaviour is reproducible without long prompts. It is the right call when prompting plateaus on a high-volume narrow task, when latency budget excludes few-shot examples, or when you need to ship a tone/style consistently. It is overkill when the task is rare, when data is small, or when system-prompt + few-shot already meets quality. In 2026 most teams fine-tune open-weight models (Llama 4, Mistral, Qwen) rather than closed APIs because the cost is lower and ownership matters.

When to use fine-tuning

Common mistakes

FAQ

What is fine-tuning?

Fine-tuning updates a pretrained model's weights on task-specific data, baking the new behaviour into the model rather than relying on prompts.

When should I use fine-tuning?

High-volume narrow task where prompting plateaus. Strict latency budget excluding long prompts. Custom domain language (legal, medical, code).

What are the most common mistakes with fine-tuning?

Fine-tuning before exhausting prompt + few-shot options. Tiny datasets (under ~500 examples) — usually noise.

Last updated: 2026-06-01. Raw markdown: https://promtable.com/glossary/fine-tuning.md.