Skip to content

Decoder-Only Model

A transformer architecture that generates text left-to-right, predicting one token at a time based on all previous tokens. GPT, Claude, and Llama are decoder-only models. This architecture dominates generative language tasks.

Related terms

TransformerCausal Language ModelEncoder-Decoder Model
← Back to glossary