Skip to content
Category

Data Engineering and AI Infrastructure

109 terms in the Data Engineering and AI Infrastructure domain — each bilingual TR/EN with related-term graph.

ETL / ELTData PipelinesData LakeData WarehouseBatch ProcessingStream ProcessingMetadata ManagementFeature StoreVector DatabasesData Lineage

Most Read

All Terms (109)

D
15 terms
🕸️

DAG

A core orchestration structure that models data processing tasks as a directed acyclic dependency graph.

📊

Dashboard Lineage

A trace structure that shows which datasets, queries, and transformations feed the metrics and visuals inside a dashboard.

📜

Data Contract Enforcement

An approach in which schema, quality, and delivery expectations are not only defined, but actively enforced by the system.

🌊

Data Lake

A storage layer where structured and unstructured data is kept in raw or lightly processed form at scale.

🗂️

Data Lifecycle Tiering

An approach in which data is moved across storage tiers as its access frequency, age, and business value change.

🧭

Data Lineage

The traceable lifecycle journey of a data element from source through transformations to final usage.

🛤️

Data Pipeline

A processing chain that reliably moves data from a source, through transformations, into one or more target systems.

📦

Data Product Lineage

A trace structure showing which sources feed a data product, how it is produced, and which consumers it serves.

🪪

Data Provenance

A source-reliability perspective that describes the origin, creation conditions, and processing history of a data element.

🏢

Data Warehouse

A structured, integrated, query-optimized data storage environment built for reporting, analytics, and decision support.

🗺️

Dataset Dependency Map

A mapping structure that systematically shows dependency relationships among datasets.

🔗

Dependency Management

The process of managing the dependencies among tasks, datasets, and execution orders within a data workflow.

🔗

Dependency Resolution

The process of determining in what order and under what conditions tasks and data assets in a workflow should execute.

📐

Dimensional Modeling

A modeling approach that organizes analytical data structures around facts and dimensions.

⚠️

Downstream Breakage Risk

A risk measure describing the likelihood that a change in a data asset will cause breakage in connected reports, models, or services.

S
13 terms
📖

Schema-on-Read

A flexible data-processing approach in which schema is applied when data is read rather than when it is written.

🧠

Semantic Layer

A layer that abstracts business metrics, definitions, and query logic consistently above technical data structures.

🧠

Semantic Lineage

A lineage approach that shows how data assets are derived and connected not only technically, but also at the business-meaning level.

📐

Similarity Metric

The core retrieval criterion that defines how proximity between vectors is computed.

🕰️

Slowly Changing Dimension

A warehouse approach that defines how changing dimension attributes should be preserved historically over time.

🪞

Source System Replication

An approach in which data from a source system is replicated into another environment for analytical or operational use.

🪜

Staging Area

An intermediate preparation layer where source data is temporarily held before final transformation.

Star Schema

A classic analytical warehouse design with a central fact table surrounded by dimension tables.

🗄️

State Store

A persistent or semi-persistent data structure that stores historical context and intermediate computation state during stream processing.

🪢

Stream Join

The operation of joining multiple continuous data streams by key and time logic to create meaningful event context.

🐢

Stream Lag

A core stream-health metric that expresses the delay gap between produced events and consumed events.

Stream Processing

A processing approach based on handling continuously arriving data events with low latency.

🪟

Stream Windowing

An approach that groups continuous data streams into defined time or event intervals for computation.