Skip to content
Şükrü Yusuf KAYA

Areas of Expertise

RAG SystemsLLMOpsAI GovernancePrompt EngineeringAgentic AIPrivate LLM DeploymentAI ArchitectureEnterprise AI StrategyEU AI Act ComplianceTürkçe Doğal Dil İşlemeTürkçe LLM EğitimiAI Ürün StratejisiSektör Bazlı AI ÇözümleriKurumsal Yapay Zeka EğitmenliğiKurumsal Yapay Zeka DanışmanlığıYapay Zeka EğitmeniYapay Zeka DanışmanıYapay Zeka UzmanıTürkiye Kurumsal AI EğitimiTürkçe RAG EğitimiTürkçe Agentic AI Eğitimi

Education & Certifications

  • Stanford University
  • Harvard University
  • Yıldız Teknik Üniversitesi
  • Beykent Üniversitesi
  • Anadolu Üniversitesi

Latest Articles

View All
6/27/2026

The 2026 Guide to Cutting LLM Costs: Prompt Caching, Model Routing, Quantization and Observability

I walk through how I cut a production LLM bill in half, sometimes to a fifth: prompt caching, model routing, self-hosted quantization and the observability that makes it all visible. With a Turkey and KVKK lens, concrete cost math and a tactics table.

6/27/2026

Choosing a Vector Database for Enterprise RAG in 2026: pgvector or a Dedicated Solution?

For enterprise RAG in 2026, pgvector or a dedicated solution like Pinecone, Qdrant, Weaviate, Milvus? A field-tested decision guide through the lens of scale, cost, hybrid search and data sovereignty.

6/27/2026

The Enterprise Agent Race of 2026: What Does Google's Gemini Enterprise Agent Platform Bring, and What Does It Mean for Turkish Organizations?

At Cloud Next '26, Google turned Vertex AI into the Gemini Enterprise Agent Platform and put a single unified platform on the table in the enterprise agent race. From a practitioner's chair: what does this move mean, where does it stand against OpenAI and Anthropic, and how should Turkish organizations decide while keeping KVKK and data sovereignty in mind?

6/27/2026

RAG and Compliance Assistants in Banking: A KVKK + BDDK-Compliant, Auditable AI Architecture (2026)

Why is RAG in banking different from a "chatbot"? Cited answers, audit trails, on-prem/sovereign deployment and BDDK/KVKK compliance. A field use-case inventory, architecture layers and an 8-week pilot recipe.

6/27/2026

Why Does Agentic AI Break in Production? 2026 Resilience Patterns (Error Handling, Oversight, Evaluation)

Why do agents that work flawlessly in a demo collapse in production? From the field, I walk through 2026's real failure patterns and resilience fixes: infinite loops, error cascades, cost explosions, oversight, and evaluation.

6/27/2026

The EU AI Act's August 2 Just Vanished: A Full Anatomy of the Digital Omnibus Deferral and a 16-Month Readiness Plan

The Digital Omnibus defers high-risk AI obligations from 2 August 2026 to 2 December 2027. Full timeline, what Annex III means, Article 99 penalties, the KVKK overlap, and a concrete 16-month readiness plan from the field.

Get in Touch

Reach out directly for projects, training or collaboration opportunities.

Send Message