# Data Mixing Math: Sampling Temperature, DoReMi, Domain Reweighting

> Source: https://sukruyusufkaya.com/en/learn/fine-tuning-cookbook/ftc-data-mixing-math-doremi-temperature
> Updated: 2026-05-14T14:42:50.805Z
> Category: Fine-Tuning Cookbook (Model-by-Model)
> Module: Part II — Tokenizer & Data Engineering

**TLDR:** How should you mix multiple datasets? With naïve concatenation, the largest dataset dominates. This section covers sampling temperature, proportional mixing, and the DoReMi algorithm (Xie et al. 2023) for dynamic reweighting. Worked example: a Turkish SFT mix of 40% TR-Alpaca + 25% OASST + 20% ShareGPT-TR + 15% custom, and why these percentages make sense.
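
To make the "large dataset dominates" problem concrete, here is a minimal sketch of temperature-based sampling. It assumes one common convention, where the sampling probability of dataset i is proportional to n_i^(1/T) and n_i is its example count. Other write-ups use an exponent alpha = 1/T instead. The function name `temperature_mix` and the dataset sizes are hypothetical and chosen only for illustration.

```python
def temperature_mix(sizes, temperature=1.0):
    """Return sampling probabilities p_i proportional to n_i ** (1 / temperature).

    Assumed convention: temperature = 1 gives proportional mixing (same as
    naive concatenation); larger temperatures flatten the mix toward uniform.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = {name: n ** (1.0 / temperature) for name, n in sizes.items()}
    total = sum(scaled.values())
    return {name: value / total for name, value in scaled.items()}


# Hypothetical example counts, not taken from any real dataset.
sizes = {"large": 1_000_000, "medium": 100_000, "small": 10_000}

proportional = temperature_mix(sizes, temperature=1.0)  # naive concatenation
flattened = temperature_mix(sizes, temperature=2.0)     # square-root scaling

for name in sizes:
    print(f"{name:>6}: T=1 -> {proportional[name]:.3f}, T=2 -> {flattened[name]:.3f}")
```

At T = 1 the large dataset takes roughly 90% of the samples. Raising T moves probability mass to the smaller datasets, at the cost of repeating their examples more often.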

