About this training
A comprehensive 3-day advanced training for AI engineers who want to take DeepSeek V3 / R1, Qwen 3, Gemma 3, Llama 3.3, and Turkish-fine-tuned models (Trendyol LLM, Cosmos LLM) into production in a KVKK-compliant, self-hosted architecture. Ollama, vLLM, LoRA fine-tuning, Turkish RAG, and quantization.
This training is designed for: AI Engineers and ML Engineers who want to take open-source LLMs into production on Turkish tasks Technical teams of banking, healthcare, energy, and regulated sectors that must build a self-hosted, KVKK-compliant LLM infrastructure Data scientists and researchers who want to comparatively evaluate the DeepSeek, Qwen 3, Gemma 3, and Llama 3.3 families Platform Engineer and DevOps teams managing Turkish RAG, fine-tuning, and customized model projects Startup CTOs and technical founders who want to reduce API token costs and secure data privacy Organizations that want to build their own enterprise AI inference platform, internal AI gateway, or on-prem agent infrastructure
Why this course matters: Addresses the paradigm shift that DeepSeek V3 / R1 created in the open-source ecosystem with architectural depth. Positions the Turkish open-source LLM landscape (Qwen 3, Gemma 3, Llama 3.3, Trendyol LLM, Cosmos LLM, AYDA) comparatively. Clarifies the layer distinction between local execution (Ollama, LM Studio) and production-grade serving (vLLM, TGI, SGLang). Covers Turkish fine-tuning with LoRA and QLoRA hands-on end to end, from dataset preparation to adapter merging. Shows how to make self-hosted LLM decisions under regulations like KVKK, BDDK, EPDK, SGK, and the EU AI Act. Designed as Turkey's most comprehensive enterprise-focused reference training in an area where Turkish DeepSeek + open-source LLM content is virtually nonexistent.
Learning outcomes by the end of the programme: Make use-case-based correct selections among the DeepSeek model family and its distilled versions. Make comparative selections for Turkish tasks among Qwen 3, Gemma 3, Llama 3.3, and local models. Set up a local LLM environment on a developer machine with Ollama and LM Studio. Deploy production-grade inference serving with vLLM, TGI, and SGLang. Train your own custom model with Turkish instruction fine-tuning via LoRA and QLoRA. Build a RAG with hybrid search and rerankers using Turkish-optimized embedding models. Optimize hardware costs with GGUF, AWQ, GPTQ, and FP8 quantization strategies. Design a KVKK-compliant on-prem and air-gapped deployment topology. Produce an end-to-end Turkish LLM stack architecture tailored to your organization in the capstone.
Prerequisites and recommended background: Active Python experience (intermediate to advanced), and use of pip / uv / poetry Linux command-line and git experience Basic knowledge of REST APIs and JSON Schema Cloud, container, or GPU-server experience (Docker, Kubernetes preferred) Access to a GPU machine or cloud GPU during the training (Colab Pro / RunPod / Lambda is sufficient) A Hugging Face account (can be created with the instructor's help)
- Turkey's most comprehensive open-source LLM training, addressing DeepSeek V3, V3.1, R1, and Distill models together with Qwen 3, Gemma 3, Llama 3.3, and Turkish-fine-tuned local models (Trendyol LLM, Cosmos LLM)
- A structure that comparatively covers the Ollama, LM Studio, vLLM, TGI, and SGLang inference engines and teaches you to choose correctly between local prototypes and production-grade serving
- An end-to-end practical methodology covering Turkish instruction fine-tuning with LoRA and QLoRA, dataset preparation, TRL SFTTrainer usage, and adapter merging
- A production-grade Turkish RAG architecture with a comparison of multilingual-e5, jina-embeddings-v3, and bge-m3, plus hybrid search + cross-encoder reranker layers
- An approach that helps you mature the self-hosted vs API decision matrix through GGUF, AWQ, GPTQ, and FP8 quantization methods and cost-per-token analysis
- An enterprise compliance perspective covering KVKK 'cross-border transfer' rules, BDDK/EPDK/SGK regulations, air-gapped Kubernetes deployment, and governance documentation
Key Takeaways
- Make use-case-based correct selections among the DeepSeek model family and its distilled versions.
- Make comparative selections for Turkish tasks among Qwen 3, Gemma 3, Llama 3.3, and local models.
- Set up a local LLM environment on a developer machine with Ollama and LM Studio.
- Deploy production-grade inference serving with vLLM, TGI, and SGLang.
- Train your own custom model with Turkish instruction fine-tuning via LoRA and QLoRA.
- Build a RAG with hybrid search and rerankers using Turkish-optimized embedding models.
- Optimize hardware costs with GGUF, AWQ, GPTQ, and FP8 quantization strategies.
- Design a KVKK-compliant on-prem and air-gapped deployment topology.
- Produce an end-to-end Turkish LLM stack architecture tailored to your organization in the capstone.
DeepSeek and Turkish Open-Source LLM Usage Training
A comprehensive 3-day advanced training for AI engineers who want to take DeepSeek V3 / R1, Qwen 3, Gemma 3, Llama 3.3, and Turkish-fine-tuned models (Trendyol LLM, Cosmos LLM) into production in a KVKK-compliant, self-hosted architecture. Ollama, vLLM, LoRA fine-tuning, Turkish RAG, and quantization.
About This Course
This training is designed for AI engineers, ML engineers, data scientists, and platform engineers who want to run open-source large language models with high quality on Turkish tasks and bring them into a KVKK-compliant self-hosted infrastructure for regulated sectors that require data privacy. At the heart of the program is the following approach: productizing open-source LLMs is not simply downloading a model to a server and running it. Real enterprise value comes from selecting the right base model (DeepSeek V3 / R1, Qwen 3, Gemma 3, Llama 3.3), deciding on fine-tuning by Turkish task type, choosing correctly among Ollama / vLLM / TGI as the inference engine, building a Turkish RAG with an embedding and vector-DB stack, optimizing hardware cost via a quantization strategy, establishing on-prem deployment in line with KVKK 'cross-border transfer' rules, and binding all of this to an auditable governance layer.
DeepSeek fundamentally transformed the open-source LLM ecosystem during late 2024 and throughout 2025. The DeepSeek V3 (671B total parameters, 37B active parameters in a Mixture-of-Experts architecture), V3.1 (hybrid reasoning), and R1 (open-source reasoning model) series offered open-source alternatives to closed reasoning models like OpenAI o1 and Claude Opus 4.7 Deep Think. Distill models (from R1-Distill-Qwen-1.5B to 70B) enabled smaller companies and solo developers to access reasoning capabilities at low cost. This training covers the DeepSeek ecosystem with architectural depth: MoE mechanics, FP8 native quantization, V3.1 hybrid reasoning mode, the R1 chain-of-thought training paradigm, and the correct use-case scenarios for distill models are covered in detail.
The training's Turkish focus is another critical dimension. Participants gain the competence to survey, compare, and correctly choose open-source base models with strong Turkish performance. The Qwen 3 (Alibaba) family offers a broad range from 0.5B to 72B with strong Turkish coverage; the Gemma 3 (Google) open-weight family is strong with multimodal capabilities; Llama 3.3 (Meta) 70B is consistent in Turkish instruction following. Alongside these, Turkish-fine-tuned local models — Trendyol LLM (e-commerce-focused), KUIS Cosmos LLM (Koç University general-purpose), AYDA, the BERTurk family — hold significant positions in their niches. The training teaches systematically evaluating these models with MTEB Turkish, Belebele, MMLU-TR, TruthfulQA-TR, and organization-specific custom eval sets.
The training covers local execution and production-inference layers together. On a developer machine and a single-node server, how DeepSeek-R1-Distill, Qwen 3, and Turkish models can be run in seconds with Ollama and LM Studio; GGUF format management, quantization-level selection, and exposing OpenAI-compatible APIs are shown hands-on. On the production side, vLLM with PagedAttention and continuous batching, Text Generation Inference (TGI) with Hugging Face native deployment, and SGLang with structured generation and constrained decoding are covered comprehensively. Tensor parallelism and multi-GPU deployment address high-throughput needs.
Perhaps one of the strongest modules of the program is dedicated to Turkish fine-tuning with LoRA and QLoRA. Adapting open-source base models to organization-specific Turkish tasks saves up to 99% of hardware costs compared to full fine-tune via the PEFT (Parameter-Efficient Fine-Tuning) approach. The training covers end-to-end Turkish instruction-dataset preparation (Alpaca, ShareGPT, ChatML formats), using SFTTrainer with the Hugging Face TRL library, hyperparameter selection (learning rate, batch size, gradient accumulation), adapter merging, GGUF conversion, and final model deployment. As a result, participants reach a level where they can train a Turkish-optimized custom LLM for their own company.
Turkish RAG architecture is also one of the program's core modules. To strengthen open-source LLMs with Turkish document-based systems, the comparison of embedding models like multilingual-e5-large, jina-embeddings-v3, and bge-m3; Turkish-morphology-aware chunking strategies (recursive, semantic, sentence-window); self-hosted vector-DB deployment with Qdrant, Weaviate, and pgvector; BM25 + vector hybrid search architecture; and a cross-encoder reranker layer with bge-reranker-v2 are covered in detail. This provides an architectural foundation directly applicable to enterprise knowledge management, customer service, document summarization, and compliance products.
Quantization strategies are a critical component of the program. GGUF (Q2_K, Q4_K_M, Q5_K_M, Q6_K, Q8_0), AWQ, GPTQ, FP8 native, EXL2, and other modern quantization methods are addressed comparatively; measuring quantization-induced quality regression, selecting levels by hardware constraints, and the self-hosted cost model (GPU, electricity, ops) are covered in detail. Thus, participants reach a level where they can make the right quantization choice from an architectural perspective in the triangle of quality, speed, and cost.
A distinguishing point of the program is a module dedicated to KVKK-compliant on-prem and air-gapped deployment. In banking (BDDK), energy (EPDK), healthcare (SGK), and regulation-heavy sectors, self-hosted LLMs are not merely a technological preference but a mandatory architectural decision in terms of data privacy, regulation, and audit requirements. The training comprehensively covers model download and signature verification, the transfer process in restricted-network environments, on-prem Kubernetes clusters and the GPU operator, network segmentation and mTLS, PII masking and audit-log layers, governance documentation, and a compliance checklist.
In the capstone project, each participant designs an end-to-end Turkish LLM stack tailored to their own organization: use case → base model → fine-tune decision → inference engine → embedding and vector DB → KVKK-compliant deployment topology. Participants present this architecture together with a diagram, deployment plan, and eval report, receiving peer review and instructor feedback. By the end of the training, participants will have the technical and architectural competence to understand the DeepSeek and Turkish open-source LLM ecosystem at a strategic level, professionally establish local and production-inference layers, perform Turkish LoRA fine-tuning, design a Turkish RAG architecture, optimize their quantization strategy, and build a KVKK-compliant self-hosted AI infrastructure. The training consists of 3 days, 12 modules, and over 70 hands-on lessons.
Training Methodology
Turkey's most comprehensive open-source LLM training, addressing DeepSeek V3, V3.1, R1, and Distill models together with Qwen 3, Gemma 3, Llama 3.3, and Turkish-fine-tuned local models (Trendyol LLM, Cosmos LLM)
A structure that comparatively covers the Ollama, LM Studio, vLLM, TGI, and SGLang inference engines and teaches you to choose correctly between local prototypes and production-grade serving
An end-to-end practical methodology covering Turkish instruction fine-tuning with LoRA and QLoRA, dataset preparation, TRL SFTTrainer usage, and adapter merging
A production-grade Turkish RAG architecture with a comparison of multilingual-e5, jina-embeddings-v3, and bge-m3, plus hybrid search + cross-encoder reranker layers
An approach that helps you mature the self-hosted vs API decision matrix through GGUF, AWQ, GPTQ, and FP8 quantization methods and cost-per-token analysis
An enterprise compliance perspective covering KVKK 'cross-border transfer' rules, BDDK/EPDK/SGK regulations, air-gapped Kubernetes deployment, and governance documentation
Who Is This For?
Why This Course?
Addresses the paradigm shift that DeepSeek V3 / R1 created in the open-source ecosystem with architectural depth.
Positions the Turkish open-source LLM landscape (Qwen 3, Gemma 3, Llama 3.3, Trendyol LLM, Cosmos LLM, AYDA) comparatively.
Clarifies the layer distinction between local execution (Ollama, LM Studio) and production-grade serving (vLLM, TGI, SGLang).
Covers Turkish fine-tuning with LoRA and QLoRA hands-on end to end, from dataset preparation to adapter merging.
Shows how to make self-hosted LLM decisions under regulations like KVKK, BDDK, EPDK, SGK, and the EU AI Act.
Designed as Turkey's most comprehensive enterprise-focused reference training in an area where Turkish DeepSeek + open-source LLM content is virtually nonexistent.
Learning Outcomes
Requirements
Course Curriculum
92 LessonsInstructor

Şükrü Yusuf KAYA
AI Architect | Enterprise AI & LLM Training | Stanford University | Software & Technology Consultant
Şükrü Yusuf KAYA is an internationally experienced AI Consultant and Technology Strategist leading the integration of artificial intelligence technologies into the global business landscape. With operations spanning 6 different countries, he bridges the gap between the theoretical boundaries of technology and practical business needs, overseeing end-to-end AI projects in data-critical sectors such as banking, e-commerce, retail, and logistics. Deepening his technical expertise particularly in Generative AI and Large Language Models (LLMs), KAYA ensures that organizations build architectures that shape the future rather than relying on short-term solutions. His visionary approach to transforming complex algorithms and advanced systems into tangible business value aligned with corporate growth targets has positioned him as a sought-after solution partner in the industry. Distinguished by his role as an instructor alongside his consulting and project management career, Şükrü Yusuf KAYA is driven by the motto of "Making AI accessible and applicable for everyone." Through comprehensive training programs designed for a wide spectrum of professionals—from technical teams to C-level executives—he prioritizes increasing organizational AI literacy and establishing a sustainable culture of technological transformation.
Frequently Asked Questions
Apply for Training
Boutique training with limited seats.
Pre-register for Next Groups
Leave your info to be the first to know when the next batch opens.
1-on-1 Mentorship
Book a private session.
Related programs
Professional Software Development with Claude Code Training
A comprehensive, advanced 4-day training program for software professionals seeking enterprise-level mastery of Anthropic's agentic coding platform, Claude Code. Production-grade agent architecture with MCP integrations, Hooks, Sub-agents, Skills, and the Claude Agent SDK.
4 GünadvancedLLM Alignment Engineering with RLHF, DPO, and GRPO Training
A 3-day advanced Turkish LLM alignment training that covers the RLHF (PPO), DPO, KTO, IPO, SimPO, ORPO, and DeepSeek R1 GRPO algorithms at both math and code level; and teaches reward modeling, Constitutional AI, RLAIF, reasoning-model alignment, and the TRL/Axolotl/LLaMA-Factory/OpenRLHF/verl toolchain at production grade.
3 GünadvancedBuilding AI Agents with the Claude Agent SDK Training
A comprehensive, advanced 4-day program for software engineers who want to develop production-grade AI agents with Anthropic's Claude Agent SDK. Tool-use orchestration, MCP server development, multi-agent patterns, prompt caching, and evaluation engineering.
4 Günadvanced