
Attention Mask

A control mechanism that determines which positions a model is allowed to attend to during the attention computation.

An attention mask makes context access in attention mechanisms explicit and rule-based. It can be used to ignore padding tokens, to hide future positions (causal masking), or to restrict attention to specific regions of the input. Without a mask, the model may attend to irrelevant or disallowed information, for example to tokens that lie in the future relative to the current position. In Transformer training, correct masking is therefore essential for the semantic correctness of the architecture.
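The mechanism can be sketched in a few lines: masked positions have their raw attention scores set to negative infinity before the softmax, so they receive exactly zero weight. This is a minimal plain-Python illustration, not a production implementation; the helper names `softmax` and `masked_attention` and the uniform scores are chosen here for clarity.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of scores.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def masked_attention(scores, mask):
    # scores: n x n raw attention scores (rows: queries, columns: keys).
    # mask:   n x n booleans; False means "may not attend here".
    # Masked positions get a score of -inf, so softmax assigns them weight 0.
    weights = []
    for row_scores, row_mask in zip(scores, mask):
        masked = [s if allowed else float("-inf")
                  for s, allowed in zip(row_scores, row_mask)]
        weights.append(softmax(masked))
    return weights

# Causal mask for 3 positions: each query may attend only to itself
# and to earlier positions, hiding the future.
n = 3
causal = [[k <= q for k in range(n)] for q in range(n)]
scores = [[0.0] * n for _ in range(n)]  # uniform scores for illustration
w = masked_attention(scores, causal)
```

With uniform scores, the first query attends only to position 0 (weight 1.0), the second splits its weight evenly over positions 0 and 1, and the third over all three; every future position gets weight exactly 0, which is precisely the effect described above.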