Bias Eval TR: BBQ-TR — Gender / Ethnicity / Sect / Age / Socioeconomic Probe + Mitigation
BBQ (Bias Benchmark for QA, Parrish 2022) TR adaptation: gender, ethnicity (Turkish/Kurdish/Arab/Armenian), sect (Sunni/Alevi), age, socioeconomic, physical appearance — 9 categories bias probe. 1200 ambiguous question pairs. Cookbook's mitigation recipe: balanced SFT data + DPO bias-rejection examples.
Şükrü Yusuf KAYA
28 min read
Advanced✅ Teslim
- BBQ-TR (cookbook reference) ile model bias score ölç. 2) DPO bias-rejection pair üret. 3) Sonraki ders: 18.7 — Red-Teaming Lab.
Yorumlar & Soru-Cevap
(0)Yorum yazmak için giriş yap.
Yorumlar yükleniyor...
Related Content
Part 0 — Engineering Foundations
Welcome to the Fine-Tuning Cookbook: System, Stage Taxonomy, and the Reproducibility Contract
Start LearningPart 0 — Engineering Foundations
Reproducibility Stack: Seeds, cuDNN Flags, and Deterministic CUDA — End the 'Works on My Machine' Problem
Start LearningPart 0 — Engineering Foundations