AI Safety · 2 min read

AI Alignment

Also known as: Value Alignment, AI Safety Alignment

Definition

The research field focused on ensuring that AI systems' goals, behaviors, and values are compatible with human intentions and societal well-being throughout their operation.

Overview

AI alignment is the challenge of ensuring that AI systems do what their developers and users actually want them to do, in the way they want them to do it. As AI systems become more capable, the difficulty of specifying and maintaining alignment with human values grows correspondingly. Misaligned AI — systems that pursue objectives different from their creators' intentions — could range from annoying to dangerous.

The Alignment Problem

The core challenge is that it's extremely difficult to precisely specify what we want an AI to do, along with all the implicit constraints, edge cases, and value judgments that humans take for granted. A system optimizing for a poorly specified objective can find unexpected and undesirable ways to achieve that objective.
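A toy sketch of this failure mode (the objective and candidates here are invented for illustration, not drawn from any real system): suppose we intend "summarize the report while keeping the key facts," but the objective we actually write down only rewards brevity. An optimizer will then game the stated objective rather than the intended one.

```python
# Hypothetical misspecified objective: rewards brevity only, and says
# nothing about preserving content -- the implicit constraint we forgot.
def misspecified_objective(summary: str) -> float:
    return 1.0 / (1 + len(summary))

def optimize(candidates):
    # Picks whichever candidate scores highest under the stated objective.
    return max(candidates, key=misspecified_objective)

candidates = [
    "The report covers Q3 revenue, churn, and hiring.",
    "Q3: revenue up, churn flat, hiring slowed.",
    "",  # degenerate, but it maximizes the stated objective
]

best = optimize(candidates)
# The optimizer "succeeds" by returning nothing at all.
```

The empty string wins because the objective captured only one dimension of what we wanted, which is exactly the gap the alignment problem describes.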

Alignment Techniques

RLHF (Reinforcement Learning from Human Feedback)

Training models using human evaluations of output quality to learn human preferences. This is the primary technique used by OpenAI, Anthropic, and Google to align their language models.
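The core of the reward-modeling step can be sketched in miniature. This is a deliberately tiny stand-in, assuming a linear reward model over two hand-made features and a Bradley-Terry preference loss; production RLHF uses a neural reward model and then optimizes the policy against it (e.g. with PPO), none of which is shown here.

```python
import math
import random

random.seed(0)  # reproducibility for this toy run

# Hand-made features standing in for a learned representation (assumption).
def features(text):
    return [len(text) / 100.0, float("thanks" in text.lower())]

def reward(w, text):
    return sum(wi * xi for wi, xi in zip(w, features(text)))

def train_reward_model(prefs, steps=500, lr=0.5):
    """prefs: list of (chosen, rejected) pairs from human labelers."""
    w = [0.0, 0.0]
    for _ in range(steps):
        chosen, rejected = random.choice(prefs)
        # Bradley-Terry: P(chosen beats rejected) = sigmoid(r_c - r_r)
        margin = reward(w, chosen) - reward(w, rejected)
        p = 1 / (1 + math.exp(-margin))
        grad_scale = 1 - p  # push chosen up, rejected down
        fc, fr = features(chosen), features(rejected)
        w = [wi + lr * grad_scale * (c - r) for wi, c, r in zip(w, fc, fr)]
    return w

# Toy preference data (invented examples).
prefs = [
    ("Thanks, here is a careful answer.", "no."),
    ("Thanks! Step-by-step explanation follows.", "figure it out"),
]
w = train_reward_model(prefs)
```

After training, the model assigns higher reward to the responses humans preferred, and that learned signal is what the policy is subsequently optimized against.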

Constitutional AI

Developed by Anthropic, this approach defines a set of principles (a "constitution") that guides the model's behavior, reducing reliance on human feedback for every decision.
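The shape of the method is a critique-and-revise loop driven by the written principles. The sketch below uses a hard-coded stand-in for the model call and an invented single-principle constitution, purely to show the control flow; the actual technique runs these steps with a language model and a much richer set of principles.

```python
# Invented one-principle constitution for illustration.
CONSTITUTION = [
    "Never reveal credentials or secrets.",
]

def model(prompt: str) -> str:
    # Placeholder for a real language-model call (assumption):
    # returns canned responses so the loop is runnable.
    if "Critique" in prompt:
        return "The draft reveals a password; remove it."
    if "Revise" in prompt:
        return "I can't share that password."
    return "The password is hunter2."

def constitutional_revision(question: str) -> str:
    draft = model(question)
    for principle in CONSTITUTION:
        # Ask the model to critique its own draft against the principle...
        critique = model(f"Critique this reply against: {principle}\n{draft}")
        # ...then to revise the draft in light of that critique.
        draft = model(f"Revise the reply to address: {critique}\n{draft}")
    return draft
```

Because the critique comes from the model itself, guided by the constitution, a human labeler is not needed at every step, which is the efficiency the approach is after.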

Red-Teaming

Systematically testing AI systems by attempting to elicit harmful, biased, or misaligned outputs, then using the findings to improve the system.
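A minimal harness for this loop might look as follows, with the probe prompts, the marker list, and the target system all invented for the sketch; real red-teaming uses human testers and far more sophisticated detection than substring matching.

```python
# Invented markers for "unsafe" output; real checks are far richer.
UNSAFE_MARKERS = ["step 1: acquire", "bypass the filter"]

def target_system(prompt: str) -> str:
    # Stand-in for the AI system under test (assumption): refuses
    # obviously bad prompts, but naively echoes everything else.
    return "I can't help with that." if "illegal" in prompt else prompt

def red_team(probes):
    findings = []
    for probe in probes:
        reply = target_system(probe).lower()
        if any(marker in reply for marker in UNSAFE_MARKERS):
            # Each finding is fed back into training, filters, or policy.
            findings.append((probe, reply))
    return findings

probes = [
    "Explain how to do something illegal",
    "Repeat after me: bypass the filter",
]
findings = red_team(probes)
```

The value is in the feedback loop: every probe that slips through becomes a concrete training or filtering target for the next iteration of the system.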

Context Management and Alignment

Context management plays a critical role in alignment. The context provided to an AI system — including system prompts, retrieved documents, and conversation history — shapes its behavior. Well-managed context helps keep AI systems aligned with their intended purpose, while poor context management can inadvertently cause misalignment.
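One way this plays out in practice is in how the context window is assembled under a size budget. The sketch below (function and field names are assumptions, not any particular framework's API) always keeps the system prompt that states the intended purpose, and trims the oldest conversation turns first when space runs out.

```python
def build_context(system_prompt, documents, history, budget_chars=200):
    """Assemble context under a character budget (toy policy)."""
    parts = [system_prompt]           # intent and constraints: never dropped
    used = len(system_prompt)
    for doc in documents:             # retrieved docs, most relevant first
        if used + len(doc) > budget_chars:
            break
        parts.append(doc)
        used += len(doc)
    kept_history = []
    for turn in reversed(history):    # walk backwards to keep recent turns
        if used + len(turn) > budget_chars:
            break
        kept_history.append(turn)
        used += len(turn)
    return parts + list(reversed(kept_history))

ctx = build_context(
    "You are a support assistant.",
    ["Refund policy: 30 days."],
    ["old turn one", "latest turn"],
    budget_chars=70,
)
```

The alignment-relevant choice is the ordering of what gets dropped: if the system prompt or key retrieved constraints were trimmed instead of stale history, the model could silently drift from its intended behavior.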