# 8 Years After the Transformer: An Anatomy from 'Attention Is All You Need' to GPT-5

> Source: https://sukruyusufkaya.com/en/learn/llm-muhendisligi/transformer-sonrasi-8-yil-tam-anatomi
> Updated: 2026-05-13T13:00:24.446Z
> Category: LLM Engineering
> Module: Module 3: The Philosophical History of Deep Learning

**TLDR:** A detailed map of the Transformer's eight-year evolution, from Vaswani et al. (2017) to the GPT-5 era of 2026, covering BERT, the GPT series, T5, BART, Llama, Claude, DeepSeek, Mistral, and Qwen. Topics include how the pre-training paradigm settled, scaling laws, RLHF, multimodal capabilities, and reasoning models.

