# Continuous Batching

> Source: https://sukruyusufkaya.com/en/glossary/continuous-batching
> Updated: 2026-05-13T19:59:32.675Z
> Type: glossary
> Category: uretken-yapay-zeka-ve-llm
**TLDR:** A serving approach that increases throughput by dynamically merging requests arriving at different times into the same processing flow.

<p>Continuous batching has become a critical technique for improving resource utilization in LLM serving. Instead of using fixed batches, requests are grouped dynamically as they flow through the system. This can increase GPU utilization and improve overall serving efficiency. It plays an important role in balancing low latency with high throughput.</p>