Decoder-Only Model
A transformer architecture that generates text left-to-right, predicting one token at a time based on all previous tokens. GPT, Claude, and Llama are decoder-only models. This architecture dominates generative language tasks.
A transformer architecture that generates text left-to-right, predicting one token at a time based on all previous tokens. GPT, Claude, and Llama are decoder-only models. This architecture dominates generative language tasks.