AI Glossary
A comprehensive encyclopedia of artificial intelligence and context management terminology — with definitions, in-depth articles, and authoritative sources.
Tokens
Also known as: Token, Subword Token, BPE Token
The basic units of text that language models process, typically representing words, subwords, or characters. Token counts determine context window usage and API costs.
Training Data
Also known as: Training Dataset, Training Corpus, Training Set
The curated dataset used to train machine learning models, whose quality, diversity, size, and representativeness directly determine the model's capabilities and limitations.
Transformer
Also known as: Transformer Architecture, Transformer Model
A neural network architecture based on self-attention mechanisms that processes input sequences in parallel, forming the foundation of virtually all modern large language models.
3 terms in "T"