Turkish LLM Performance Comparator

Compare 16+ LLMs on Turkish benchmarks by use-case score, domain (banking, legal, health), cost, and region.

Definition
Turkish LLM Benchmark
Standard evaluation sets that measure LLM performance in Turkish: MMLU-TR, TruthfulQA-TR, Reasoning-TR, and sector-specific domain tests, plus token-efficiency measurements.
Also known as: TR-MMLU, Turkish LLM eval, TR benchmark, Cosmos, Trendyol LLM
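
The page does not say how token efficiency is measured. One common way to express it is the average number of tokens a tokenizer emits per word, which tends to be higher for agglutinative languages such as Turkish. The sketch below illustrates that ratio only; `tokens_per_word` and `toy_tokenize` are hypothetical names, and the toy tokenizer is a stand-in for a real model tokenizer, not the method this tool uses.

```python
from typing import Callable, Iterable, List


def tokens_per_word(texts: Iterable[str],
                    tokenize: Callable[[str], List[str]]) -> float:
    """Average number of tokens emitted per whitespace-separated word."""
    total_tokens = 0
    total_words = 0
    for text in texts:
        total_tokens += len(tokenize(text))
        total_words += len(text.split())
    if total_words == 0:
        raise ValueError("no words in input")
    return total_tokens / total_words


def toy_tokenize(text: str, piece_len: int = 4) -> List[str]:
    """Hypothetical stand-in tokenizer (assumption, not a real model's):
    cuts each word into fixed-size character pieces."""
    pieces: List[str] = []
    for word in text.split():
        pieces.extend(word[i:i + piece_len]
                      for i in range(0, len(word), piece_len))
    return pieces


# Illustrative Turkish sample sentences; swap in a real tokenizer's
# encode function to compare models on the same text.
sample = ["evlerimizden geliyoruz", "bu bir testtir"]
ratio = tokens_per_word(sample, toy_tokenize)
print(f"tokens per word: {ratio:.2f}")
```

A lower ratio means the same Turkish text costs fewer tokens, which matters when comparing models on price per token.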

Frequently Asked Questions

  • Scores draw on MMLU-TR, TruthfulQA-TR, Belebele, and the Artificial Analysis benchmark set, plus an internal Q1 2026 calibration.