Attention Mechanism
Also known as: Self-Attention, Scaled Dot-Product Attention, Multi-Head Attention
A neural network component that allows models to selectively focus on the most relevant parts of their input, dynamically weighting the importance of different elements in a sequence.
Overview
The attention mechanism is the core innovation that makes modern large language models possible. It allows a model to dynamically determine which parts of its input are most relevant to generating each part of its output. Unlike earlier architectures that processed text sequentially, attention mechanisms can relate any part of the input to any other part, regardless of distance.
How Attention Works
In the standard attention formulation, three vectors are computed for each element in the input sequence:
- Query (Q): What information am I looking for?
- Key (K): What information do I contain?
- Value (V): What information do I provide if selected?
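These three vectors are combined by scoring each query against every key, normalizing the scores with a softmax, and using the resulting weights to mix the values. A minimal sketch in NumPy, assuming the standard scaled dot-product form softmax(QK^T / sqrt(d_k))V and omitting the learned projection matrices for brevity:

```python
# Minimal sketch of scaled dot-product attention (no learned projections).
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max before exponentiating for numerical stability.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    """softmax(Q K^T / sqrt(d_k)) V"""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)     # similarity of each query to each key
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ V                  # weighted mixture of value vectors

# Toy example: a sequence of 3 tokens, each a 4-dimensional vector.
rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 4))
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
out = attention(Q, K, V)
print(out.shape)  # (3, 4): one output vector per query
```

The division by sqrt(d_k) keeps the dot products from growing with vector size, which would otherwise push the softmax into a near-one-hot regime with vanishing gradients.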
Types of Attention
Self-Attention
Each element in a sequence attends to all other elements in the same sequence. This is the primary mechanism in Transformer encoders and decoders.
Cross-Attention
Elements in one sequence (e.g., a decoder) attend to elements in another sequence (e.g., an encoder). Used in encoder-decoder models for translation and summarization.
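The only structural difference from self-attention is where the vectors come from: queries are derived from the decoder sequence, while keys and values are derived from the encoder sequence. A hedged sketch, again omitting the learned projections and using the encoder states directly as keys and values:

```python
# Sketch of cross-attention: queries from one sequence, keys/values from another.
import numpy as np

def cross_attention(decoder_states, encoder_states):
    d_k = decoder_states.shape[-1]
    scores = decoder_states @ encoder_states.T / np.sqrt(d_k)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over encoder positions
    return weights @ encoder_states  # one output per decoder position

rng = np.random.default_rng(1)
dec = rng.normal(size=(2, 4))  # 2 decoder positions
enc = rng.normal(size=(5, 4))  # 5 encoder positions
print(cross_attention(dec, enc).shape)  # (2, 4)
```

Note that the two sequences may have different lengths; the output always has one vector per query (decoder) position.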
Causal (Masked) Attention
Each element can only attend to previous elements, preventing information from "leaking" from the future. Used in autoregressive language models like GPT.
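The standard way to implement this, sketched below, is to set the scores for all future positions to negative infinity before the softmax, so they receive exactly zero weight:

```python
# Sketch of causal (masked) self-attention: each token attends only to
# itself and earlier positions.
import numpy as np

def causal_attention(Q, K, V):
    n, d_k = Q.shape
    scores = Q @ K.T / np.sqrt(d_k)
    mask = np.triu(np.ones((n, n), dtype=bool), k=1)  # True above the diagonal
    scores[mask] = -np.inf                            # block future positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

rng = np.random.default_rng(2)
x = rng.normal(size=(4, 8))
_, w = causal_attention(x, x, x)  # self-attention: Q = K = V
print(np.allclose(w, np.tril(w)))  # True: zero weight on future tokens
```

Because exp(-inf) is 0, the masked positions drop out of the softmax entirely, and the attention weight matrix is lower-triangular.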
Attention as Context Management
The attention mechanism is, in essence, a context management mechanism built into the model architecture. It automatically determines which context is most relevant for each decision, performing real-time context prioritization at every layer of the network.
Related Terms
Context Window
The maximum amount of text (measured in tokens) that a language model can process in a single interaction, determining how much information the model can consider when generating a response.
Large Language Model
A type of AI model trained on vast amounts of text data that can understand, generate, and manipulate human language, typically based on the transformer architecture with billions of parameters.
Neural Network
A computing system inspired by biological neural networks, consisting of interconnected nodes (neurons) organized in layers that process information using learnable weights and activation functions.
Transformer
A neural network architecture based on self-attention mechanisms that processes input sequences in parallel, forming the foundation of virtually all modern large language models.