Can Claude compute p-values?

Q: Can Claude compute p-values?

It can write code to compute them with your data; relying on its head math is unreliable — always validate with code.

Statistical Interpretation and Visualization

Stories, not just numbers — interpret statistical results with Claude, with significance thresholds and visualization choices.

Şükrü Yusuf KAYA

12 min read

6/24/2026

Intermediate

İstatistik yorumlama üçgeni: efekt boyutu, anlamlılık, görselleştirme

Sayıyı Hikâyeye Çevir#

Bir A/B testi sonucu '%2.7 daha iyi, p < 0.05' olduğunda Claude'a sadece "yorumla" demek yetmez. Üç soruyu sor:

Efekt boyutu ne? (büyüklüğü iş için anlamlı mı)
Anlamlılık ne? (istatistiksel güven nasıl)
Karar nedir? (yayınla, devam et, kapat)

text

<role>Kıdemli istatistikçi + ürün analisti.</role>
 
<context>
Test: yeni onboarding akışı vs. eski.
Metrik: 7. günde aktif kullanım oranı.
N: kontrol 12,403; varyant 12,389.
Sonuç: control 28.4%, variant 31.1%.
p-value: 0.012, CI95: [+1.6%, +3.8%].
</context>
 
<task>
1) Etki boyutunu sözel ifade et (delta, oransal artış).
2) Belirsizliği aralıklarla ifade et.
3) İş etkisi tahminin (kabaca) — eğer trafik 1M/ay ise.
4) Önerin: yayınla / devam et / kapat?
5) Önerinin altında 2 risk yaz.
</task>

Üretim seviyesi A/B yorumlama prompt'u.

A/B testi sonuçlarını sözel hâle çevir#

python

def interpret_ab(c_rate, v_rate, p, ci95_lo, ci95_hi, monthly_users):
    delta_abs = (v_rate - c_rate) * 100  # %
    delta_rel = (v_rate - c_rate) / c_rate * 100
    incremental = (v_rate - c_rate) * monthly_users
    return {
        "delta_pp": round(delta_abs, 2),
        "delta_rel_pct": round(delta_rel, 2),
        "p": p,
        "ci95_pp": [round(ci95_lo*100, 2), round(ci95_hi*100, 2)],
        "incremental_users_per_month": round(incremental),
        "verdict": "yayınla" if p < 0.05 and ci95_lo > 0 else "devam et",
    }
 
print(interpret_ab(0.284, 0.311, 0.012, 0.016, 0.038, 1_000_000))

Boşluk doldur · text

İstatistiksel sonuçta efekt _____ , anlamlılık ve karar üçü birlikte yorumlanır. Çoklu test için _____ veya Benjamini-Hochberg düzeltmesi gerekir. Korelasyon _____ değildir.

Frequently Asked Questions

It can write code to compute them with your data; relying on its head math is unreliable — always validate with code.

Yorumlar & Soru-Cevap

(0)

Yorum yazmak için giriş yap.

Yorumlar yükleniyor...

Statistical Interpretation and Visualization

Sayıyı Hikâyeye Çevir#

A/B testi sonuçlarını sözel hâle çevir#

Frequently Asked Questions

Can Claude compute p-values?

Yorumlar & Soru-Cevap

Related Content

Long Document Summarization and Synthesis

Catching Hallucination and Verifying Sources

What is Claude? The New Generation of AI Assistants

Subscribe to Newsletter