Easiest way to keep cache hit high?

Cut Cost up to 90% with Prompt Caching

Cache stable system prompts, large few-shot blocks, and long documents to slash input cost.

Şükrü Yusuf KAYA

11 min read

6/26/2026

Intermediate

Cache Anatomisi#

Prompt'un hangi bölümünü cache'leyeceğini

cache_control

ile işaretlersin. İlk çağrıda 'cache write' biraz pahalı; sonraki çağrılar bu kısım için %90'a varan tasarruf.

python

resp = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    system=[
        {
            "type":"text",
            "text": LONG_STABLE_SYSTEM_PROMPT,
            "cache_control": {"type":"ephemeral"},
        },
    ],
    messages=[{"role":"user","content": user_text}],
)
# response.usage içinde cache_read_input_tokens / cache_creation_input_tokens alanları olur

System prompt cache — en yaygın kullanım.

python

# Cache hit oranı simülasyonu
total_in = 0
cached_in = 0
calls = [
    {"input": 1200, "cached": 0},     # ilk çağrı, cache miss
    {"input": 1200, "cached": 1100},  # cache hit
    {"input": 1200, "cached": 1100},
    {"input": 1200, "cached": 1100},
]
for c in calls:
    total_in += c["input"]
    cached_in += c["cached"]
 
print(f"Cache hit oranı: {cached_in/total_in*100:.1f}%")

Cache hit oranı — yüksek tut, maliyetin sahibi sen ol.

Boşluk doldur · text

Cache _____ flag'i ile sistem promptunun bölümleri cache'lenir. Tipik _____ TTL kısa, dakikalar mertebesindedir. Hit oranını izlemek için response _____ alanı kullanılır.

Frequently Asked Questions

Structurally separate variable bits from the system prompt. Move customer name and language outside the cache; keep role, rules, glossary inside.

Yorumlar & Soru-Cevap

(0)

Yorum yazmak için giriş yap.

Yorumlar yükleniyor...

Cut Cost up to 90% with Prompt Caching

Cache Anatomisi#

Frequently Asked Questions

Easiest way to keep cache hit high?

Yorumlar & Soru-Cevap

Related Content

Batch API: Bulk Async Workloads

Eval Sets and LLM-as-Judge

What is Claude? The New Generation of AI Assistants

Subscribe to Newsletter