
Transformer Feed-Forward Network

A Transformer sub-block that operates independently on each token, applying a nonlinear transformation to enrich its representation.

The feed-forward network inside a Transformer provides a token-wise (position-wise) nonlinear transformation that attention alone does not supply. It typically consists of two linear layers with a nonlinear activation (such as ReLU or GELU) in between: the first layer expands the hidden dimension, commonly by a factor of four, and the second projects it back to the model dimension. Although it operates independently on each token, it contributes a major portion of the model’s overall capacity, and in large language models a substantial share of the parameters resides in this substructure.
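The structure described above can be sketched as follows. This is a minimal illustration in NumPy, not any particular model's implementation; the dimensions (`d_model = 8`, `d_ff = 32`) and the tanh approximation of GELU are illustrative assumptions.

```python
import numpy as np

def gelu(x):
    # tanh approximation of the GELU activation
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def feed_forward(x, W1, b1, W2, b2):
    # x has shape (seq_len, d_model); the same weights are applied
    # to every token independently, so tokens never mix here
    return gelu(x @ W1 + b1) @ W2 + b2

# illustrative sizes: d_ff is commonly 4 * d_model
d_model, d_ff = 8, 32
rng = np.random.default_rng(0)
W1 = rng.standard_normal((d_model, d_ff)) * 0.02   # expansion layer
b1 = np.zeros(d_ff)
W2 = rng.standard_normal((d_ff, d_model)) * 0.02   # projection back down
b2 = np.zeros(d_model)

x = rng.standard_normal((5, d_model))              # 5 tokens
y = feed_forward(x, W1, b1, W2, b2)
print(y.shape)                                     # (5, 8): d_model is preserved
```

Because the transformation is token-wise, running the network on a single token produces the same result as the corresponding row of the full-sequence output, which is what distinguishes this sub-block from attention.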