Training Data
Also known as: Training Dataset, Training Corpus, Training Set
The curated dataset used to train machine learning models, whose quality, diversity, size, and representativeness directly determine the model's capabilities and limitations.
Sources & References
Related Terms
Bias in AI
Systematic errors in AI system outputs that create unfair outcomes for certain groups, typically arising from biased training data, flawed model design, or biased evaluation metrics.
Fine-Tuning
The process of further training a pre-trained AI model on a specialized dataset to adapt its behavior, knowledge, or output style for a specific domain or task.
Machine Learning
A subset of artificial intelligence that enables systems to learn and improve from experience without being explicitly programmed, using algorithms that identify patterns in data.
Supervised Learning
A machine learning paradigm where models are trained on labeled datasets containing input-output pairs, learning to map inputs to correct outputs for prediction and classification tasks.