Technical Glossary: Generative AI and LLM

Pretraining

The initial training stage in which a model learns broad patterns from large-scale general data.

Pretraining underlies the foundation-model paradigm: during this stage the model acquires broad knowledge of language, visual structure, or multimodal patterns. Much of the adaptability needed for downstream tasks is gained here, and the diversity, volume, and quality of the pretraining data directly shape the model's later capability.
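To make the idea of "learning broad patterns from data" concrete, here is a deliberately minimal sketch: a bigram language model trained by counting token transitions in a tiny stand-in corpus. Real pretraining uses neural networks and web-scale data, but the self-supervised objective, predicting the next token from preceding context, is the same in spirit. The corpus and function names here are illustrative, not from any specific system.

```python
from collections import defaultdict

# Tiny stand-in corpus; actual pretraining uses large-scale general data.
corpus = "the cat sat on the mat . the dog sat on the rug ."

# Count bigram transitions over whitespace tokens: the simplest form of
# self-supervised learning, where the data itself provides the targets.
counts = defaultdict(lambda: defaultdict(int))
tokens = corpus.split()
for prev, nxt in zip(tokens, tokens[1:]):
    counts[prev][nxt] += 1

def next_token_probs(prev):
    """Maximum-likelihood estimate of P(next token | previous token)."""
    total = sum(counts[prev].values())
    return {tok: c / total for tok, c in counts[prev].items()}

print(next_token_probs("sat"))  # in this corpus, "sat" is always followed by "on"
```

Even this toy model illustrates why data diversity matters: the model can only assign probability to patterns it has seen, so broader and higher-quality corpora yield broader capability downstream.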