Softmax
A mathematical function that converts a vector of raw scores (logits) into a probability distribution where all values sum to one. In language models, softmax transforms logit scores into token probabilities used for sampling.
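As a rough illustration of the definition above, softmax is commonly written as softmax(z)_i = exp(z_i) / sum_j exp(z_j). The Python sketch below is one minimal way to compute it and then sample a token index from the result. The function names `softmax` and `sample_token` and the example logits are hypothetical, chosen only for illustration. Subtracting the maximum logit before exponentiating is a standard numerical-stability trick, not something the definition requires.

```python
import math
import random


def softmax(logits):
    """Map raw scores (logits) to probabilities that sum to one."""
    if not logits:
        raise ValueError("logits must be non-empty")
    # Shift by the largest logit so exp() cannot overflow; the shift
    # cancels in the ratio, so the resulting probabilities are unchanged.
    largest = max(logits)
    exps = [math.exp(x - largest) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, rng=random):
    """Draw one token index according to its softmax probability."""
    probs = softmax(logits)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Hypothetical logits for a three-token vocabulary.
example_logits = [2.0, 1.0, 0.1]
example_probs = softmax(example_logits)
example_index = sample_token(example_logits)
```

Here a larger logit always receives a larger probability, and `sample_token` picks index 0 most often but not always, which is what sampling from the distribution means in practice.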