# Spot instance (AI training)

**Source:** https://promtable.com/glossary/spot-instance

> A spot instance is a cloud GPU rented at a discount (often 50-90% off) on the condition that the provider can reclaim it on short notice — used for cost-sensitive training that can checkpoint and resume.

---
A spot instance is a cloud GPU rented at a discount (often 50-90% off) on the condition that the provider can reclaim it on short notice — used for cost-sensitive training that can checkpoint and resume.

Spot pricing for GPUs (AWS, GCP, Vast.ai, Lambda) is the cheapest path to large-scale training in 2026 — at the cost of operational complexity. Training jobs must checkpoint frequently and resume cleanly when instances are reclaimed. Frameworks like Ray Train, Modal restart hooks, and SkyPilot handle the orchestration. For non-resumable workloads (interactive notebooks, live inference) spot is the wrong choice. For batch training, fine-tuning, and large embedding jobs, spot can cut compute cost by 5-10× vs on-demand.

## When to use

- Batch training and fine-tuning jobs with checkpointing.
- Large embedding generation pipelines.

## Common mistakes

- No checkpointing — losing hours of training when an instance is reclaimed.
- Using spot for live inference — interruptions break user UX.

## Related terms

- [batched-inference](https://promtable.com/glossary/batched-inference)
- [fine-tuning](https://promtable.com/glossary/fine-tuning)
- [cold-start](https://promtable.com/glossary/cold-start)

*Last updated: 2026-06-01*
---

Original page: https://promtable.com/glossary/spot-instance
Maintained by Promtable (https://promtable.com). Content: CC BY 4.0. Cite as "Promtable — https://promtable.com/glossary/spot-instance".
Contact: info@vibecodingturkey.com.