
Author Profile
Şükrü Yusuf KAYA
AI Expert & Consultant
Areas of Expertise
Education & Certifications
- Stanford University
- Harvard University
- Yıldız Teknik Üniversitesi
- Beykent Üniversitesi
- Anadolu Üniversitesi
Latest Articles
The 2026 Guide to Cutting LLM Costs: Prompt Caching, Model Routing, Quantization and Observability
I walk through how I cut a production LLM bill in half, sometimes to a fifth: prompt caching, model routing, self-hosted quantization and the observability that makes it all visible. With a Turkey and KVKK lens, concrete cost math and a tactics table.
Choosing a Vector Database for Enterprise RAG in 2026: pgvector or a Dedicated Solution?
For enterprise RAG in 2026, pgvector or a dedicated solution such as Pinecone, Qdrant, Weaviate or Milvus? A field-tested decision guide through the lens of scale, cost, hybrid search and data sovereignty.
The Enterprise Agent Race of 2026: What Does Google's Gemini Enterprise Agent Platform Bring, and What Does It Mean for Turkish Organizations?
At Cloud Next '26, Google turned Vertex AI into the Gemini Enterprise Agent Platform and put a single unified platform on the table in the enterprise agent race. From a practitioner's chair: what does this move mean, where does it stand against OpenAI and Anthropic, and how should Turkish organizations decide while keeping KVKK and data sovereignty in mind?
RAG and Compliance Assistants in Banking: A KVKK + BDDK-Compliant, Auditable AI Architecture (2026)
Why is RAG in banking different from a "chatbot"? Cited answers, audit trails, on-prem/sovereign deployment and BDDK/KVKK compliance. A field use-case inventory, architecture layers and an 8-week pilot recipe.
Why Does Agentic AI Break in Production? 2026 Resilience Patterns (Error Handling, Oversight, Evaluation)
Why do agents that work flawlessly in a demo collapse in production? From the field, I walk through 2026's real failure patterns and resilience fixes: infinite loops, error cascades, cost explosions, oversight, and evaluation.
The EU AI Act's August 2 Just Vanished: A Full Anatomy of the Digital Omnibus Deferral and a 16-Month Readiness Plan
The Digital Omnibus defers high-risk AI obligations from 2 August 2026 to 2 December 2027. Full timeline, what Annex III means, Article 99 penalties, the KVKK overlap, and a concrete 16-month readiness plan from the field.
Get in Touch
Reach out directly for projects, training or collaboration opportunities.