Skip to content
Category

Mathematics, Statistics and Optimization

111 terms in the Mathematics, Statistics and Optimization domain — each bilingual TR/EN with related-term graph.

Linear AlgebraProbability TheoryStatistical ConceptsDistributionsHypothesis TestingOptimization MethodsDerivatives and GradientsLoss FunctionsInformation TheoryStatistical Model Comparison

Most Read

All Terms (111)

C
12 terms
🧪

Calibration

A property describing how well a model’s predicted probabilities align with actual observed frequencies.

⛓️

Chain Rule

A rule that computes the derivative of a composite function through the derivatives of its inner and outer functions.

📶

Channel Capacity

The theoretical maximum amount of information that a communication channel can transmit without error.

🧮

Chi-Square Distribution

A distribution derived from the sum of squared standard normal variables and widely used in statistical testing.

⚠️

Condition Number

An important linear algebra indicator that measures how numerically sensitive or unstable a matrix is.

🔗

Conditional Probability

A concept that measures how likely an event is given that another event has already occurred.

📈

Consistency

The property of an estimator converging to the true value as the sample size grows.

⛓️

Constrained Optimization

An optimization approach in which the solution must satisfy not only the objective function but also specified constraints.

🛤️

Convex Optimization

A class of optimization problems where the objective and constraints satisfy favorable geometric conditions that enable more reliable solutions.

🔗

Correlation

A standardized measure of the direction and strength of the linear relationship between two variables.

🔄

Covariance

A fundamental statistical measure that shows how two variables change together.

🎯

Cross-Entropy Loss

A core classification loss that measures the mismatch between the true distribution and the model’s predicted probability distribution.

M
14 terms
⚖️

Mann-Whitney U Test

A test used to compare two independent groups without relying on strong parametric assumptions.

🔄

Markov Property

A property stating that a system’s future depends only on its current state, not on the full past history.

🔲

Matrix

A structure of numbers arranged in rows and columns, central to data representation and transformations.

🧠

Maximum A Posteriori Estimation (MAP)

A Bayesian estimation approach that accounts for prior knowledge while selecting parameters that explain the data.

🔍

Maximum Likelihood Estimation (MLE)

A fundamental statistical estimation method based on selecting the parameters that make the observed data most likely.

🧪

McNemar Test

A test used to compare the error behavior of two classifiers on the same set of examples.

📏

Mean Absolute Error (MAE)

A regression loss function that averages the absolute differences between predictions and true values, offering greater robustness.

📉

Mean Squared Error (MSE)

A common regression loss function that averages the squared differences between predictions and true values.

📍

Mean, Median, and Mode

Fundamental statistical measures that summarize the central tendency of a dataset from different perspectives.

📦

Mini-Batch Gradient Descent

A widely used optimization approach that splits training data into small batches to balance efficiency and stability.

🗜️

Minimum Description Length (MDL)

An information-theoretic principle stating that a good model is one that describes the data in the shortest sufficient way.

🌪️

Momentum

A method that speeds up optimization by incorporating the direction of past gradient updates.

🧪

Multiple Comparison Correction

A correction approach used to control false positives when multiple hypotheses are tested.

🔁

Mutual Information

A concept that measures how much knowing one variable reduces uncertainty about another.