# ControlNet

**Source:** https://promtable.com/glossary/controlnet

> ControlNet is a neural-network architecture that conditions a diffusion image model on extra spatial inputs — edges, depth, pose, segmentation — for precise control over output structure.

---

ControlNet (Zhang et al., 2023) adds a parallel conditioning branch to a pretrained diffusion model so that it accepts a structural hint alongside the text prompt. Common control types include Canny edges, depth maps, OpenPose skeletons, normal maps, scribbles, and line art. The model then generates images that match both the text prompt and the spatial structure. ControlNet is a standard production workflow for AI photography, AI animation, and architectural visualisation, anywhere the composition has to be exact. The richest ControlNet ecosystem is on Stable Diffusion 1.5 and SDXL; Flux ControlNets are catching up; Midjourney has no first-party ControlNet.
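The parallel branch can be illustrated with a toy sketch. In the ControlNet paper the branch is a trainable copy of the model's encoder blocks, joined to the frozen original through zero-initialised convolutions, so the combined model initially behaves exactly like the base model. The code below is only a rough illustration under loud assumptions: small linear layers on Python lists stand in for convolutions and network blocks, and the names `linear`, `zero_linear`, and `controlled_block` are hypothetical, not part of any library.

```python
from typing import Callable, List

Vector = List[float]
Layer = Callable[[Vector], Vector]


def linear(weights: List[Vector], bias: Vector) -> Layer:
    """Return a layer computing W @ x + b on plain Python lists."""
    def apply(x: Vector) -> Vector:
        return [sum(w * v for w, v in zip(row, x)) + b
                for row, b in zip(weights, bias)]
    return apply


def zero_linear(size: int) -> Layer:
    """Layer with all-zero weights and bias (toy stand-in for a zero convolution)."""
    return linear([[0.0] * size for _ in range(size)], [0.0] * size)


def controlled_block(x: Vector, hint: Vector, frozen: Layer,
                     trainable_copy: Layer, zero_in: Layer,
                     zero_out: Layer) -> Vector:
    """Frozen block output plus a residual from the conditioning branch."""
    base = frozen(x)  # original model path, weights never change
    # Inject the structural hint into the branch input.
    conditioned = [a + b for a, b in zip(x, zero_in(hint))]
    # The branch result re-enters the main path through a second connecting layer.
    residual = zero_out(trainable_copy(conditioned))
    return [a + b for a, b in zip(base, residual)]


weights, bias = [[1.0, 2.0], [0.5, -1.0]], [0.1, 0.0]
frozen = linear(weights, bias)
trainable_copy = linear(weights, bias)  # starts as a copy of the frozen block
x, hint = [1.0, 3.0], [0.7, -0.2]

# At initialisation both connecting layers are zero, so the hint has no effect.
at_init = controlled_block(x, hint, frozen, trainable_copy,
                           zero_linear(2), zero_linear(2))

# Assumption for illustration: training has moved the connecting layers off zero.
identity = linear([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
half = linear([[0.5, 0.0], [0.0, 0.5]], [0.0, 0.0])
after_training = controlled_block(x, hint, frozen, trainable_copy, identity, half)
```

Here `at_init` equals `frozen(x)`, while `after_training` differs from it. In this toy setting that mirrors the idea of the design: the pretrained model is left untouched and the structural hint only gains influence as the connecting layers are trained.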

## When to use

- Locking a specific pose, edge map, or depth structure across generations.
- Architectural visualisation, product photography, character keyframing.

## Common mistakes

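- Using a misaligned control input, for example a control map whose framing or aspect ratio does not match the target image. The result is a broken hybrid output.
- Combining too many ControlNets at once. Competing conditions make the model lose coherence.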
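Both mistakes can be caught with a quick pre-flight check before generation. The sketch below is a hypothetical helper, not part of any ControlNet tooling: it treats "misaligned" narrowly as a size or aspect-ratio mismatch between a control map and the target image, and the default limit of two ControlNets (`max_controls`) and the `ratio_tolerance` value are arbitrary assumptions to tune per workflow.

```python
from typing import List, Tuple

Size = Tuple[int, int]  # (width, height) in pixels


def check_control_setup(target: Size, control_maps: List[Size],
                        max_controls: int = 2,
                        ratio_tolerance: float = 0.01) -> List[str]:
    """Return human-readable warnings; an empty list means no issue was found."""
    problems: List[str] = []
    target_ratio = target[0] / target[1]
    for index, (width, height) in enumerate(control_maps):
        if (width, height) == target:
            continue  # same size as the output, nothing to report
        if abs(width / height - target_ratio) > ratio_tolerance:
            # Different shape: resizing would stretch or crop the structure.
            problems.append(
                f"control {index}: aspect ratio of {width}x{height} "
                f"differs from target {target[0]}x{target[1]}")
        else:
            # Same shape, different scale: a plain resize is enough.
            problems.append(
                f"control {index}: resize {width}x{height} "
                f"to {target[0]}x{target[1]}")
    if len(control_maps) > max_controls:
        problems.append(
            f"{len(control_maps)} ControlNets stacked, "
            f"more than the assumed limit of {max_controls}")
    return problems


problems = check_control_setup(
    target=(1024, 768),
    control_maps=[(1024, 768), (512, 512), (512, 384)],
)
for line in problems:
    print(line)
```

A size check like this does not detect semantic misalignment, such as a pose skeleton that contradicts the prompt; that still needs a visual review of the control map.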

## Related terms

- [diffusion-model](https://promtable.com/glossary/diffusion-model)
- [lora](https://promtable.com/glossary/lora)
- [seed](https://promtable.com/glossary/seed)
- [cfg-scale](https://promtable.com/glossary/cfg-scale)

## Sources

- [Zhang et al. 2023 (arXiv)](https://arxiv.org/abs/2302.05543)

*Last updated: 2026-06-01*
---

Maintained by Promtable (https://promtable.com). Content: CC BY 4.0. Cite as "Promtable — https://promtable.com/glossary/controlnet".
Contact: info@vibecodingturkey.com.