# What is an AI Agent? Autonomous AI Architectures in 2026 — A Comprehensive End-to-End Guide

> Source: https://sukruyusufkaya.com/en/blog/ai-agent-otonom-yapay-zeka
> Updated: 2026-05-13T19:58:04.858Z
> Type: blog
> Category: yapay-zeka

**TLDR:** A comprehensive 2026 reference explaining how AI agents work, which architectures solve which problems, and what they mean for Turkish enterprises. Covers ReAct, multi-agent, MCP, tool use, computer use, browser agents, frameworks (LangGraph / AutoGen / CrewAI / Claude Code), production concerns, evaluation, security, KVKK compliance, and three anonymized Turkish case studies.

<tldr data-summary="[&#34;An AI Agent is an autonomous AI system that perceives its environment, plans, uses tools, and takes actions to reach a goal — traditional LLMs only produce responses; agents take actions.&#34;,&#34;An agent has four components: an LLM brain, memory (short + long), planner, and tool/executor. The looped operation of these four produces autonomy.&#34;,&#34;2026 ecosystem: single-agent (ReAct), supervisor (LangGraph), multi-agent collaboration (AutoGen/CrewAI), browser & computer use (Operator, Claude Computer Use). MCP is the emerging standard for tool integration.&#34;,&#34;Agents can multiply token cost 10-100x; without eval, observability, guardrails, and human-in-the-loop, they cannot scale to production.&#34;,&#34;Under KVKK and the EU AI Act, autonomous decision-making agents are evaluated as high-risk; human oversight, audit logs, and recordkeeping are mandatory.&#34;]" data-one-line="An AI Agent is a next-generation AI system architecture that adds planning and tool-use layers to the LLM’s response capability — capable of carrying out multi-step work autonomously."></tldr>

## 1. What is an AI Agent? — One-Sentence and Extended Definition

The essential difference between an LLM and an AI Agent can be summed up in one sentence: **LLMs produce responses; agents take actions.** While an LLM answers you in a ChatGPT window, an Agent — given the same query — researches, sends emails, edits files, and opens CRM records, not in a single shot but along a multi-step plan.

<definition-box data-term="AI Agent" data-definition="An autonomous AI system that perceives its environment, plans, uses tools, and takes actions to achieve a specific goal. Typical architecture: goal + LLM brain + tool catalog + memory + iterative decision loop. Proactive rather than reactive; multi-step rather than single-step; goal-directed rather than deterministic." data-also="Agentic AI, Autonomous AI, LLM Agent"></definition-box>

This is **not science fiction**; it is a concrete paradigm shift observed in production through 2024-2026. Claude Code, GitHub Copilot Workspace, Cursor Agent, Replit Agent, Devin, OpenAI Operator, Anthropic Computer Use, Microsoft Copilot Studio — all are tangible products of this paradigm.

### Traditional LLM Call vs Agent

Traditional use: "Summarize this PDF" → one prompt, one response. Agent use: "Analyze the customer's orders over the last 6 months; if the inventory of their most-bought category was low last month, create a purchase request" → the agent queries the database, analyzes tables, checks the inventory system, opens a purchase request, sends emails.

<callout-box data-variant="tip" data-title="A Useful Distinction: Workflow vs Agent">

A nuance LangChain's Harrison Chase often highlights: a **Workflow** is a predefined sequence of LLM calls (deterministic DAG); an **Agent** is a dynamic process where the LLM itself decides the next step. Workflows are more predictable and cheaper; agents are more flexible but more expensive and error-prone. Most production systems are **hybrid** — critical steps as workflows, flexible decision points as agents.

</callout-box>

## 2. The Anatomy of an AI Agent: Four Core Components

Four core components make up an AI Agent. You cannot build a durable agent without designing each separately.

### 2.1. LLM Brain

The core reasoning and decision engine. As of 2026, flagship agent models:

- **Claude Opus 4.7** — long context (1M), tool use, leads in agent use; Anthropic's agent-centric training focus
- **GPT-5** — function calling, multi-step reasoning, OpenAI Operator integration
- **Gemini 3 Pro** — multimodal agent tasks, Google Workspace integration
- **Open alternatives** — Llama 4 70B, DeepSeek V3, Qwen 2.5 (with tool-use support)

### 2.2. Memory

An agent's ability to "remember the past" works in two layers:

- **Short-term memory:** Conversation history, intermediate outputs, and plan state held in the context window during the active task.
- **Long-term memory:** Past interactions, user preferences, organizational knowledge stored in a vector DB. Usually integrated with a RAG architecture.

<definition-box data-term="Agent Memory" data-definition="The information-retention layer of an AI agent across and within tasks. Short-term memory lives in the context window; long-term memory is stored in vector DBs or structured databases. Subtypes can include episodic (events experienced), semantic (knowledge learned), and procedural (workflows learned)."></definition-box>

#### Three Memory Types in Practice

- **Episodic memory:** Time-bound events like "Last week we had this chat with customer X." Typical architecture: vector DB + timestamp metadata.
- **Semantic memory:** Inferred, stable facts like "The customer's preferred channel is email." Usually stored in a structured DB (Postgres, MongoDB).
- **Procedural memory:** Learned workflows like "Invoice-dispute replies in this sector follow these steps." Typically prompt templates + example-based few-shot references.
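
As a rough sketch, the three memory types map to different record shapes (all names here are illustrative, not any specific framework's API):

```python
import time

# Illustrative records for the three memory types (names are hypothetical).
def episodic_record(event: str) -> dict:
    """Time-bound event, destined for a vector DB with timestamp metadata."""
    return {"type": "episodic", "text": event, "ts": time.time()}

def semantic_fact(subject: str, attribute: str, value: str) -> dict:
    """Stable inferred fact, destined for a structured DB row."""
    return {"type": "semantic", "subject": subject,
            "attribute": attribute, "value": value}

def procedural_template(task: str, steps: list[str]) -> dict:
    """Learned workflow, typically a prompt template with few-shot examples."""
    return {"type": "procedural", "task": task, "steps": steps}

fact = semantic_fact("customer-42", "preferred_channel", "email")
```

The split matters at retrieval time: episodic records are searched by similarity and recency, semantic facts are looked up directly, and procedural templates are injected into the prompt.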

#### Memory Frameworks

- **Mem0** — open source, automatic fact extraction + retrieval
- **Zep** — per-user long-term memory + temporal graph
- **LangMem** — LangChain memory management (semantic + episodic blend)
- **Letta (formerly MemGPT)** — virtual context (long-context simulation)

<callout-box data-variant="answer" data-title="When is memory critical?">

Long-term customer relationships, assistants that learn user preferences, and internal team agents that learn across sessions benefit significantly from memory. For one-shot tasks (e.g., summarizing a single email), memory investment is unnecessary.

</callout-box>

### 2.3. Planner

The component that answers the agent's "what should I do next?" question. The main strategies used in practice:

- **Chain-of-Thought (CoT):** "Think step by step" prompting; the model verbalizes its reasoning.
- **ReAct (Reason + Act):** Thought → Action → Observation → Thought loop. The most common base pattern in modern agents.
- **Tree-of-Thoughts (ToT):** Generate multiple plan branches and select the best. Improves quality on complex problems but costs 3-10x.
- **Plan-and-Solve:** First produce the full plan, then execute step by step. Plan-execution separation eases evaluation and enables human approval for the plan.
- **ReWOO (Reasoning WithOut Observation):** Builds a multi-step plan without waiting for tool output and then runs in parallel. Parallelizable steps **cut latency by 40-60%**.
- **Self-Discover:** Lets the model **discover its own reasoning structure** for the given problem (Google DeepMind, 2024). Reports of +10-25% quality on complex problems.
- **Reflexion:** Agents that **analyze their own mistakes and correct in the next attempt**. Single-iteration improvement can exceed 20% on test/code-writing tasks; a max-iter cap is mandatory to avoid loops.
- **Graph-of-Thoughts (GoT):** A generalization of ToT — feedback links between ideas. In academic research; usually unnecessary in production.
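
As a sketch, the Reflexion pattern with its mandatory iteration cap reduces to a short loop; `generate`, `critique`, and `passes` below are toy stand-ins for real LLM calls and checks:

```python
from typing import Callable

def reflexion_loop(task: str,
                   generate: Callable[[str, str], str],
                   critique: Callable[[str], str],
                   passes: Callable[[str], bool],
                   max_iter: int = 3) -> str:
    """Generate -> self-critique -> retry, with a hard cap against loops."""
    feedback, attempt = "", ""
    for _ in range(max_iter):
        attempt = generate(task, feedback)   # LLM call (stubbed here)
        if passes(attempt):                  # e.g. tests pass, judge approves
            return attempt
        feedback = critique(attempt)         # LLM self-analysis of the failure
    return attempt                           # best effort after max_iter

# Toy stubs: the second attempt succeeds once feedback names the bug.
result = reflexion_loop(
    "fix the bug",
    generate=lambda t, fb: "patched" if "off-by-one" in fb else "draft",
    critique=lambda a: "off-by-one error in loop bound",
    passes=lambda a: a == "patched",
)
# result == "patched" after the second attempt
```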

<callout-box data-variant="tip" data-title="Practical Advice: Which Planning Strategy?">

**ReAct** suffices for 70% of use cases. For complex multi-step tasks, move to **Plan-and-Solve** or **ReWOO**. For feedback-rich tasks like code and tests, add **Reflexion**. ToT and GoT should only be tried if your eval plateaus on existing strategies.

</callout-box>

### 2.4. Tool / Executor

The layer through which the agent affects the outside world. The tool catalog typically includes:

- **API calls** — CRM, ERP, ticketing, compute services
- **Database queries** — SQL, vector search
- **File system operations** — read, write, transform
- **Web** — browser, search APIs
- **Code execution** — Python sandbox, JavaScript runtime
- **Communication** — sending email, Slack messages, Teams notifications
- **MCP servers** — standardized third-party tool integration

## 3. The Agent Decision Loop

An agent completes its task in the following loop:

<howto-steps data-name="Typical AI Agent Decision Loop" data-description="An agent's steps from goal to completion." data-time="PT15M" data-steps="[{&#34;name&#34;:&#34;1. Goal Interpretation&#34;,&#34;text&#34;:&#34;The user request in natural language is decomposed into actionable sub-goals.&#34;},{&#34;name&#34;:&#34;2. Plan Generation&#34;,&#34;text&#34;:&#34;The LLM produces a plan: which tools, in what order, with what arguments.&#34;},{&#34;name&#34;:&#34;3. Tool Selection&#34;,&#34;text&#34;:&#34;For the first action in the plan, the right tool is selected and arguments are formed.&#34;},{&#34;name&#34;:&#34;4. Execution&#34;,&#34;text&#34;:&#34;The tool is called; the result (output, error, exception) is handled.&#34;},{&#34;name&#34;:&#34;5. Observation and Reflection&#34;,&#34;text&#34;:&#34;The result is evaluated: are we closer to the goal? Should the plan change?&#34;},{&#34;name&#34;:&#34;6. Plan Update or Termination&#34;,&#34;text&#34;:&#34;If complete, the final response is produced; otherwise the loop continues.&#34;},{&#34;name&#34;:&#34;7. Memory Write&#34;,&#34;text&#34;:&#34;After the task, a record is written to episodic memory for future context.&#34;}]"></howto-steps>

Completing a task through this loop is **not a single LLM call** — a typical agent task can involve 5-50 LLM calls. Cost and latency management is therefore critical.
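
The seven steps above can be sketched as a compact loop; `plan_next` and the tool registry below are stubs standing in for real LLM and API calls:

```python
def run_agent(goal: str, plan_next, tools: dict, max_steps: int = 10):
    """Minimal decision loop: plan -> act -> observe until done or step cap."""
    history = [("goal", goal)]
    for _ in range(max_steps):
        decision = plan_next(history)               # LLM: pick tool + args, or finish
        if decision["action"] == "finish":
            history.append(("final", decision["answer"]))
            return decision["answer"], history
        tool = tools[decision["action"]]
        try:
            observation = tool(**decision["args"])  # execute the chosen tool
        except Exception as exc:                    # errors become observations too
            observation = f"error: {exc}"
        history.append((decision["action"], observation))
    return None, history                            # step cap hit

# Toy run: look up stock once, then finish.
tools = {"check_stock": lambda sku: 3}
def plan_next(history):
    if history[-1][0] == "goal":
        return {"action": "check_stock", "args": {"sku": "A-1"}}
    return {"action": "finish", "answer": f"stock is {history[-1][1]}"}

answer, trace = run_agent("is SKU A-1 in stock?", plan_next, tools)
# answer == "stock is 3"
```

The memory write (step 7) would happen after the loop returns, persisting `trace` to episodic storage.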

## 4. Agent Architectural Patterns (5)

There is no single right agent architecture; five main patterns are preferred by problem shape.

### 4.1. Single Agent

The simplest form. One LLM, one tool catalog, a ReAct loop. Ideal for narrow tasks like customer service chatbots, internal productivity tools, and personal assistants.

<comparison-table data-caption="Single Agent vs Multi-Agent" data-headers="[&#34;Dimension&#34;,&#34;Single Agent&#34;,&#34;Multi-Agent&#34;]" data-rows="[{&#34;feature&#34;:&#34;Complexity&#34;,&#34;values&#34;:[&#34;Single-domain&#34;,&#34;Multiple expertise areas&#34;]},{&#34;feature&#34;:&#34;Cost&#34;,&#34;values&#34;:[&#34;Lower&#34;,&#34;Higher (token multiplies)&#34;]},{&#34;feature&#34;:&#34;Eval&#34;,&#34;values&#34;:[&#34;Relatively easier&#34;,&#34;Very hard&#34;]},{&#34;feature&#34;:&#34;Debug&#34;,&#34;values&#34;:[&#34;Direct&#34;,&#34;Requires tracing communication&#34;]},{&#34;feature&#34;:&#34;Failure Modes&#34;,&#34;values&#34;:[&#34;Low&#34;,&#34;High (cascading errors)&#34;]}]"></comparison-table>

### 4.2. Supervisor (Orchestration)

A "manager" agent (supervisor) delegates sub-tasks to specialized sub-agents and synthesizes results. This is **LangGraph's flagship pattern** and the most common multi-agent layout in 2025-2026 production systems.

**Typical structure:**

- Supervisor: understands the goal and selects the right sub-agent
- Researcher: gathers information from web/RAG
- Analyzer: performs data analysis
- Writer: produces the report/response
- Critic: evaluates the output
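
A minimal sketch of this routing, with a keyword router standing in for the supervisor's LLM "which specialist?" decision (agent names are illustrative):

```python
# Minimal supervisor: route a sub-task to a specialist, then return its output.
SUB_AGENTS = {
    "researcher": lambda task: f"[research notes for: {task}]",
    "analyzer":   lambda task: f"[analysis of: {task}]",
    "writer":     lambda task: f"[report on: {task}]",
}

def supervisor(task: str) -> str:
    """Keyword routing stands in for an LLM classification call."""
    lowered = task.lower()
    if "analy" in lowered:
        agent = "analyzer"
    elif "report" in lowered or "write" in lowered:
        agent = "writer"
    else:
        agent = "researcher"
    return SUB_AGENTS[agent](task)

out = supervisor("analyze Q3 churn")
# out == "[analysis of: analyze Q3 churn]"
```

In a real system the supervisor would also synthesize multiple sub-agent outputs and loop until the goal is met; frameworks like LangGraph express this as a stateful graph.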

### 4.3. Hierarchical

A tree-shaped agent organization where supervisors have supervisors. Very complex projects (e.g., autonomous software development — Devin) use this layout.

### 4.4. Swarm

Peer-level agents running in parallel and referencing each other's outputs. OpenAI's "Swarm" framework and CrewAI's "process" mode support this style.

### 4.5. Network (A2A — Agent-to-Agent)

Agents communicate as independent services over the network. By late 2025 / early 2026, **A2A protocol** standardization efforts began (Google's A2A initiative). Still early, but likely the next step.

<callout-box data-variant="answer" data-title="Which pattern should I pick?">

Practical rule: **always start with single-agent for MVPs**. Move to supervisor + 2-3 sub-agents once eval (faithfulness, success rate, latency) is solid and you actually need specialization. Hierarchical and swarm patterns are overkill until single-agent eval is solved at 85%+.

</callout-box>

### 4.6. Agent vs Workflow vs RAG vs Fine-tuning — A Decision Matrix

Not every problem needs an agent. The matrix below helps pick the right tool.

<comparison-table data-caption="Which Approach for Which Problem?" data-headers="[&#34;Need&#34;,&#34;Workflow&#34;,&#34;RAG&#34;,&#34;Agent&#34;,&#34;Fine-tuning&#34;]" data-rows="[{&#34;feature&#34;:&#34;Deterministic multi-step&#34;,&#34;values&#34;:[&#34;✓ Ideal&#34;,&#34;-&#34;,&#34;-&#34;,&#34;-&#34;]},{&#34;feature&#34;:&#34;Access to fresh information&#34;,&#34;values&#34;:[&#34;-&#34;,&#34;✓ Ideal&#34;,&#34;Partial&#34;,&#34;-&#34;]},{&#34;feature&#34;:&#34;Answer from documents&#34;,&#34;values&#34;:[&#34;-&#34;,&#34;✓ Ideal&#34;,&#34;-&#34;,&#34;-&#34;]},{&#34;feature&#34;:&#34;Dynamic decision-making&#34;,&#34;values&#34;:[&#34;-&#34;,&#34;-&#34;,&#34;✓ Ideal&#34;,&#34;-&#34;]},{&#34;feature&#34;:&#34;Multi-tool use&#34;,&#34;values&#34;:[&#34;Limited&#34;,&#34;-&#34;,&#34;✓ Ideal&#34;,&#34;-&#34;]},{&#34;feature&#34;:&#34;Style/format locking&#34;,&#34;values&#34;:[&#34;-&#34;,&#34;-&#34;,&#34;-&#34;,&#34;✓ Ideal&#34;]},{&#34;feature&#34;:&#34;Low cost&#34;,&#34;values&#34;:[&#34;✓&#34;,&#34;✓&#34;,&#34;Expensive&#34;,&#34;One-off&#34;]},{&#34;feature&#34;:&#34;Debug ease&#34;,&#34;values&#34;:[&#34;High&#34;,&#34;Medium&#34;,&#34;Low&#34;,&#34;Low&#34;]},{&#34;feature&#34;:&#34;Time to production&#34;,&#34;values&#34;:[&#34;Weeks&#34;,&#34;Weeks-months&#34;,&#34;Months-quarter&#34;,&#34;Quarter&#34;]}]"></comparison-table>

**Hybrid Approach — Common Production Architecture:**

Most mature production systems use **all four together**:

- **Workflow** runs deterministic main flows (e.g., order processing steps)
- **RAG** answers information questions (e.g., product catalog, regulations)
- **Agent** handles points requiring dynamic decisions (e.g., customer-objection triage)
- **Fine-tuning** locks brand tone and format templates

## 5. Core Capabilities: What Can an Agent Do?

Modern agent capabilities fall into five main categories.

### 5.1. Tool Use / Function Calling

Structured API calls produced by the agent. OpenAI Function Calling (June 2023), Anthropic Tool Use (May 2024), Gemini Function Calling — all serve the same purpose: LLMs producing parameterized function calls in JSON.
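
As an illustration, a parameterized tool definition in the JSON-schema style these APIs share (field names follow Anthropic's tool-use shape; the CRM tool itself is invented):

```python
import json

# A tool definition in the Anthropic tool-use shape (the CRM tool is invented).
create_ticket_tool = {
    "name": "create_ticket",
    "description": "Open a support ticket in the CRM for a given customer.",
    "input_schema": {
        "type": "object",
        "properties": {
            "customer_id": {"type": "string", "description": "CRM customer ID"},
            "priority": {"type": "string", "enum": ["low", "normal", "high"]},
            "summary": {"type": "string"},
        },
        "required": ["customer_id", "summary"],
    },
}

# The model answers with a parameterized call that the runtime executes:
model_call = {"name": "create_ticket",
              "input": {"customer_id": "C-1042", "priority": "high",
                        "summary": "Card payment fails at checkout"}}
print(json.dumps(model_call, indent=2))
```

The runtime validates `model_call["input"]` against `input_schema`, executes the real API call, and feeds the result back as the next observation.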

### 5.2. Code Execution

Running Python (most common) in a secure sandbox. ChatGPT Code Interpreter / Advanced Data Analysis, Claude's "execute code" tool, Replit Agent — all leverage this. The main power source for data analysis, computation, and transformation tasks.

### 5.3. Web Browsing

Using a real browser or search API to gather up-to-date information. OpenAI's "Browse" feature, Anthropic Claude's Web Search, Gemini Deep Research belong here. Solves the knowledge-cutoff problem.

### 5.4. Computer Use

Agents controlling a computer's screen with mouse and keyboard actions by "seeing" the screen. **Anthropic Claude Computer Use (Oct 2024)** brought this mainstream; **OpenAI Operator (Jan 2025)** is the rival. The new generation of autonomous process automation.

<stat-callout data-value="3-10x" data-context="Browser/computer-use agents like Anthropic Computer Use and OpenAI Operator reduce automation build time" data-outcome="by 3-10x compared with traditional RPA solutions, because they work with visual understanding + reasoning instead of macros." data-source="{&#34;label&#34;:&#34;Anthropic Computer Use Announcement&#34;,&#34;url&#34;:&#34;https://www.anthropic.com/news/3-5-models-and-computer-use&#34;,&#34;date&#34;:&#34;2024-10&#34;}"></stat-callout>

### 5.5. Multi-Modal Perception

Image, audio, and video understanding expand an agent's "senses." An agent can read an error message in a screenshot, transcribe a customer voice, or extract key moments from a video presentation.

## 6. Popular Agent Frameworks

Which framework you choose depends on your agent's complexity, production goals, and team capabilities.

<comparison-table data-caption="2026 Agent Framework Comparison" data-headers="[&#34;Framework&#34;,&#34;Provider&#34;,&#34;Strength&#34;,&#34;Production Maturity&#34;,&#34;Turkish Docs&#34;]" data-rows="[{&#34;feature&#34;:&#34;LangGraph&#34;,&#34;values&#34;:[&#34;LangChain&#34;,&#34;Stateful, supervisor pattern, output control&#34;,&#34;High&#34;,&#34;Limited&#34;]},{&#34;feature&#34;:&#34;AutoGen&#34;,&#34;values&#34;:[&#34;Microsoft&#34;,&#34;Multi-agent conversation, code execution&#34;,&#34;High&#34;,&#34;Limited&#34;]},{&#34;feature&#34;:&#34;CrewAI&#34;,&#34;values&#34;:[&#34;CrewAI Inc.&#34;,&#34;Fast prototype, role-based agents&#34;,&#34;Mid-high&#34;,&#34;Limited&#34;]},{&#34;feature&#34;:&#34;OpenAI Agents SDK&#34;,&#34;values&#34;:[&#34;OpenAI&#34;,&#34;Operator, native function calling, Assistants v2&#34;,&#34;High&#34;,&#34;Limited&#34;]},{&#34;feature&#34;:&#34;Anthropic + Claude Code&#34;,&#34;values&#34;:[&#34;Anthropic&#34;,&#34;Computer use, code writing, MCP native&#34;,&#34;High&#34;,&#34;Limited&#34;]},{&#34;feature&#34;:&#34;Vercel AI SDK&#34;,&#34;values&#34;:[&#34;Vercel&#34;,&#34;JS/TS, streaming, Next.js native&#34;,&#34;High&#34;,&#34;Available&#34;]},{&#34;feature&#34;:&#34;Smolagents&#34;,&#34;values&#34;:[&#34;Hugging Face&#34;,&#34;Lightweight, open source&#34;,&#34;Mid&#34;,&#34;None&#34;]},{&#34;feature&#34;:&#34;Agency Swarm&#34;,&#34;values&#34;:[&#34;Community&#34;,&#34;Built on OpenAI Swarm&#34;,&#34;Mid&#34;,&#34;None&#34;]},{&#34;feature&#34;:&#34;Semantic Kernel&#34;,&#34;values&#34;:[&#34;Microsoft&#34;,&#34;Plugin-based, .NET/Python&#34;,&#34;Mid&#34;,&#34;Limited&#34;]},{&#34;feature&#34;:&#34;PydanticAI&#34;,&#34;values&#34;:[&#34;Pydantic&#34;,&#34;Type-safe, schema-first&#34;,&#34;Mid&#34;,&#34;None&#34;]}]"></comparison-table>

### Detailed Framework Selection Guide

**LangGraph** — The 2026 reference for production multi-agent. Stateful graph architecture, supervisor pattern native, integrated observability (LangSmith). Most common framework choice in Turkish enterprises.

**AutoGen** — Microsoft Research origin. Strong multi-agent "conversation" paradigm; native code execution. Natural choice for Microsoft / Azure ecosystem.

**CrewAI** — Fast prototyping with role-based thinking (researcher / writer / critic). Ideal for MVPs and POCs; many teams migrate to LangGraph as they scale.

**Anthropic Claude Code + MCP** — The new generation of agent development experience for 2025-2026. MCP standardizes the tool catalog; Claude's native agent capability reduces framework requirements.

**Vercel AI SDK** — The TypeScript / Next.js world's choice. Streaming, tool use, agent loops are native. The practical choice for enterprise sites built on Next.js (like sukruyusufkaya.com).

## 7. Model Context Protocol (MCP) — The Most Important Standard of 2025

Every team building agents faced the same problem: each tool integration (Slack, Gmail, CRM, file system) required separate code. **Anthropic's MCP, introduced November 2024**, standardized this.

<definition-box data-term="MCP (Model Context Protocol)" data-definition="An open protocol introduced by Anthropic for connecting AI models to external data sources and tools in a secure, standardized way. Tool providers publish an MCP server; agent developers connect any MCP-client model. What USB-C did for hardware, MCP does for AI tool integration." data-also="Model Context Protocol, AI Tool Standard"></definition-box>

### MCP's Structure

- **MCP Server:** Publishes a tool / data source (e.g., Slack MCP, Postgres MCP, Filesystem MCP)
- **MCP Client:** The agent-running app (Claude Code, Claude Desktop, Cursor, etc.)
- **Transport:** JSON-RPC over Stdio, HTTP-SSE, or WebSocket
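
Regardless of transport, the wire format is JSON-RPC 2.0. A sketch of what a tool invocation might look like on the wire (the server's tool name and arguments are invented for illustration):

```python
import json

# A JSON-RPC 2.0 request an MCP client might send (tool name is illustrative).
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {"name": "query_postgres",
               "arguments": {"sql": "SELECT count(*) FROM orders"}},
}

# The general shape of a matching response from the MCP server.
response = {
    "jsonrpc": "2.0",
    "id": 1,
    "result": {"content": [{"type": "text", "text": "42"}]},
}

assert response["id"] == request["id"]  # responses are matched to requests by id
print(json.dumps(request))
```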

### MCP Ecosystem as of 2026

- **150+ community MCP servers** — Slack, GitHub, Linear, Notion, Postgres, Google Drive, Jira, Salesforce
- **Official adoption** — OpenAI (March 2025), Microsoft Copilot Studio, Google (Spring 2025)
- **Local Turkish tools** — examples of KVKK-compliant MCP servers are starting to emerge

<callout-box data-variant="tip" data-title="Why MCP is Strategically Important">

MCP prevents the **agent ecosystem from fragmenting**. A tool author writes once and works simultaneously with all major model providers (Anthropic, OpenAI, Google). This makes third-party SaaS agent-compatibility cheap. Within two years, Turkish software companies may need to position their SaaS products as "MCP-compatible" as a baseline.

</callout-box>

## 8. Production Concerns: Shipping an Agent

Moving an agent from POC to production is much harder than classic LLM applications. Five critical concerns:

### 8.1. Cost (Token Explosion)

A single-prompt LLM call may consume 2-5K tokens, while an agent task can consume 20-100K tokens. Multi-agent tasks reach 200-500K. Budget tracking is mandatory.

<stat-callout data-value="10-100x" data-context="A typical agent task's token consumption compared with the same task executed as a traditional single-prompt LLM call can be" data-outcome="10-100x higher; shipping an agent without a cost model creates financial risk." data-source="{&#34;label&#34;:&#34;Anthropic Engineering: Building Effective Agents&#34;,&#34;url&#34;:&#34;https://www.anthropic.com/research/building-effective-agents&#34;,&#34;date&#34;:&#34;2024-12&#34;}"></stat-callout>

#### Practical Cost Formula

Estimated token cost of a single agent task:

<code>Cost = (Step count) × (avg input tokens × input price + avg output tokens × output price) + Tool-call costs</code>

**Example.** A 10-step agent task with average 4K input + 500 output tokens per step, Claude Opus 4.7 ($15 input / $75 output per 1M):

- Per-step cost: (4000 × $15 + 500 × $75) / 1M = $0.0975
- Total task: 10 × $0.0975 = **$0.975** (~$1)
- Same task on Claude Haiku 4.5 (~$1 input / $5 output): **~$0.065**

A ~15x cost gap = at 10K monthly tasks: **$9,750 vs $650**. Model routing (simple steps to Haiku, complex to Opus) typically yields 60-80% total savings.
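
The formula and worked example above translate directly to code (prices in USD per 1M tokens, as in the text):

```python
def task_cost(steps: int, avg_in: int, avg_out: int,
              in_price: float, out_price: float) -> float:
    """Token cost of one agent task; prices are USD per 1M tokens."""
    per_step = (avg_in * in_price + avg_out * out_price) / 1_000_000
    return steps * per_step

# 10-step task, 4K input + 500 output tokens per step.
opus = task_cost(10, 4000, 500, in_price=15, out_price=75)
haiku = task_cost(10, 4000, 500, in_price=1, out_price=5)
print(f"Opus: ${opus:.3f}, Haiku: ${haiku:.3f}")            # $0.975 vs $0.065
print(f"At 10K tasks/month: ${10_000 * opus:,.0f} vs ${10_000 * haiku:,.0f}")
```

Extending this with a per-tool-call cost term and a routing table (which model handles which step type) gives a usable pre-launch budget model.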

#### Cost Optimization Checklist

- [ ] **Prompt caching** — 50-90% discount on repeated system prompts (Anthropic, OpenAI cached input pricing)
- [ ] **Model routing** — dynamic LLM selection by step complexity
- [ ] **Tool result caching** — cache hit when a tool is called with identical args
- [ ] **Max-iter limit** — strict upper bound on the agent loop (e.g., max 20 steps)
- [ ] **Streaming + early-stop** — stop early when the user is satisfied
- [ ] **Batch API** — 50% discount for async workloads on OpenAI/Anthropic

### 8.2. Reliability

Agents are probabilistic — the same input can produce different outputs. For production, a good pattern is to **keep deterministic parts in workflows and flexible parts in agents**. Lock critical paths with strict schemas (Pydantic, Zod).
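
Locking a critical path with a strict schema looks like this with Pydantic (the refund model and its business limits are invented for illustration):

```python
from pydantic import BaseModel, Field, ValidationError

# Lock the agent's critical output to a strict schema (Pydantic v2 API).
class RefundDecision(BaseModel):
    order_id: str
    approve: bool
    amount_try: float = Field(ge=0, le=5000)  # hard business limit
    reason: str

raw = {"order_id": "ORD-7", "approve": True,
       "amount_try": 120.0, "reason": "damaged item"}
decision = RefundDecision.model_validate(raw)  # LLM output parsed + validated

try:
    RefundDecision.model_validate({**raw, "amount_try": 999_999})
except ValidationError:
    print("rejected: amount exceeds the allowed range")
```

Out-of-range or malformed LLM output fails loudly at the boundary instead of silently flowing into a payment system; Zod plays the same role on the TypeScript side.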

### 8.3. Latency

In multi-step tasks, total response time can stretch from 30 seconds to minutes. Solutions:

- **Streaming** — surface progress to the user
- **Parallel tool calls** — independent steps in parallel
- **Model routing** — small models for simple steps, large for complex

### 8.4. Observability

Tracing agent behavior is **much more complex than classic logging**. 2026 tools:

- **LangSmith** — LangChain ecosystem
- **Langfuse** — open-source alternative
- **Helicone** — simple, fast setup
- **Arize Phoenix** — advanced eval integration
- **OpenLLMetry** — OpenTelemetry-based

### 8.5. Security and Guardrails

Because an agent takes actions, **a safety layer is mandatory**:

- **Tool permissions** — which agent can access which tool?
- **Dry-run mode** — destructive actions (delete, payment) are simulated first
- **Human-in-the-Loop (HITL)** — human approval for critical actions
- **Prompt-injection defenses** — against user input manipulating system prompts
- **Sandbox** — code execution must always be isolated
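
A minimal sketch of the dry-run + HITL gate described above (the tool names and the approval flag are illustrative):

```python
# Minimal guardrail gate: destructive tools require dry-run + human approval.
DESTRUCTIVE = {"delete_record", "send_payment"}

def guarded_call(tool_name: str, run_tool, approved_by_human: bool = False):
    """Simulate destructive actions first; execute only with explicit approval."""
    if tool_name in DESTRUCTIVE and not approved_by_human:
        return {"status": "pending_approval",
                "dry_run": f"{tool_name} would run (simulated, no side effects)"}
    return {"status": "executed", "result": run_tool()}

pending = guarded_call("send_payment", lambda: "paid")
done = guarded_call("send_payment", lambda: "paid", approved_by_human=True)
# pending["status"] == "pending_approval"; done["result"] == "paid"
```

In production the `pending_approval` branch would enqueue an approval request (Slack button, ticket) and write both the request and the decision to the audit log.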

## 9. Agent Eval: Why It Differs from LLM Eval

An LLM response is evaluated at a single point (faithfulness, relevance). An agent task involves **multiple steps, multiple tools, and multiple possible outputs**. Eval dimensions:

<comparison-table data-caption="Agent Eval Dimensions" data-headers="[&#34;Dimension&#34;,&#34;Measures&#34;,&#34;Critical Question&#34;]" data-rows="[{&#34;feature&#34;:&#34;Task Success&#34;,&#34;values&#34;:[&#34;Did we reach the goal?&#34;,&#34;Did the user-desired result happen?&#34;]},{&#34;feature&#34;:&#34;Plan Quality&#34;,&#34;values&#34;:[&#34;Was the right tool order chosen?&#34;,&#34;Are there inefficient steps?&#34;]},{&#34;feature&#34;:&#34;Tool-Use Accuracy&#34;,&#34;values&#34;:[&#34;Are arguments correct, calls valid?&#34;,&#34;Does it match the tool schema?&#34;]},{&#34;feature&#34;:&#34;Step Efficiency&#34;,&#34;values&#34;:[&#34;How many steps to solve?&#34;,&#34;Is it near optimal?&#34;]},{&#34;feature&#34;:&#34;Cost&#34;,&#34;values&#34;:[&#34;Token + tool-call cost&#34;,&#34;Within budget?&#34;]},{&#34;feature&#34;:&#34;Latency&#34;,&#34;values&#34;:[&#34;Total task duration&#34;,&#34;Within p50/p95 targets?&#34;]},{&#34;feature&#34;:&#34;Safety&#34;,&#34;values&#34;:[&#34;Any destructive/wrong action?&#34;,&#34;Did it detect where HITL is needed?&#34;]}]"></comparison-table>

Eval infrastructure: **LangSmith**, **Langfuse**, **Patronus**, **Braintrust**, **DeepEval Agent module**. A combination of manual test sets (50-200 tasks) + automated LLM-as-judge + human evaluation is the practical standard.
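
A minimal eval-harness skeleton over such a test set might look like this (the `check` callables stand in for exact-match graders or LLM-as-judge calls):

```python
# Tiny eval harness: run each test task, record success + cost, report rates.
def evaluate(agent, test_set):
    results = []
    for case in test_set:
        answer, cost = agent(case["task"])       # agent returns (answer, $cost)
        results.append({"task": case["task"],
                        "success": case["check"](answer),
                        "cost": cost})
    n = len(results)
    return {"success_rate": sum(r["success"] for r in results) / n,
            "avg_cost": sum(r["cost"] for r in results) / n}

# Toy agent + two graders; in practice the set holds 50-200 real tasks.
toy_agent = lambda task: (task.upper(), 0.02)
report = evaluate(toy_agent, [
    {"task": "ping", "check": lambda a: a == "PING"},
    {"task": "pong", "check": lambda a: a == "wrong"},
])
# report["success_rate"] == 0.5
```

Adding per-step traces (plan quality, tool-argument validity) turns the same loop into the multi-dimensional eval the table above describes.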

## 10. Agents Under KVKK + EU AI Act

An autonomous decision-making AI system is **particularly sensitive** under regulatory frameworks.

### Under KVKK

- **Personal data automation.** If an agent processes customer data across multiple systems, the KVKK privacy notice must cover this automation.
- **Automated decision-making.** Fully automated decision agents (e.g., credit approval) fall under KVKK Article 11 — right to object to automated processing.
- **Audit log requirement.** Every agent action must be auditably recorded.

### Under EU AI Act

- **High-risk classification.** Running agents in HR selection, credit scoring, education assessment automatically qualifies as high-risk.
- **Human oversight (Article 14).** Critical decisions by high-risk agents require human approval flows.
- **Transparency.** Users must know they are interacting with an agent.

<callout-box data-variant="warning" data-title="Autonomous Action = High Accountability">

When an agent takes action on your company's behalf, **the responsibility is yours**. An HR agent's wrong candidate evaluation, a customer-service agent's wrong discount offer, a trading agent's wrong transaction — all fall under your company's accountability. That is why HITL and audit logs are not optional.

</callout-box>

## 11. Agent Use Cases for Turkish Enterprises

### 11.1. Customer Service Agent

Not just chatting but opening tickets, querying order status, initiating returns, sending contracts. An active investment area for Turkish telco and e-commerce companies in 2025-2026.

### 11.2. Internal Operations Agent

HR approval flows, finance reports, IT ticket triage, purchase request initiation. Typically Slack/Teams integrated, connecting to internal systems via MCP.

### 11.3. Sales / SDR Agent

Lead research, personalized outreach, follow-up emails, CRM updates. The foundation of the AI Automation Agency (AAA) business model.

### 11.4. Research Agent

Market research, competitor analysis, academic literature scans, investment due diligence. As a strategic decision-support tool, it saves executives significant time.

### 11.5. Code Agent (Developer Assistant)

Cursor Agent, Claude Code, Devin, GitHub Copilot Workspace. Agents that open pull requests, write tests, refactor. **Reported to lift software-team productivity by 30-50%.**

### 11.6. Legal Assistant Agent

Contract analysis, regulatory change tracking, case precedent scans. A RAG + agent hybrid for law firms.

### 11.7. Operational Monitoring Agent

When a monitoring alarm fires, an agent that triages autonomously, analyzes logs, and proposes (or automates) initial responses (rollback, restart). A DevOps/SRE agent.

## 12. Case Studies (Anonymized Turkish Enterprises)

### Case 1 — Turkish Bank: Internal Knowledge Agent

**Problem.** Bank employees (especially call-center agents and branch staff) were constantly searching the internal knowledge base for product questions, regulatory changes, and operational procedures. They had RAG but each question required a manual query.

**Solution.** LangGraph supervisor + 3 sub-agents (Product, Regulation, Operations). Native Slack/Teams integration. Via MCP, automatic information retrieval from internal wiki, product catalog, regulation repo. Employees ask in natural language "Is there a card commission change?" — the agent routes to the right sub-agent and returns the correct answer with citations.

**Result.** Information-search time per employee dropped from 3.2 hours per week to 1.1 hours. Employee satisfaction +18 points. ROI: 4x payback in 9 months.

### Case 2 — Law Firm: Contract Analysis Agent

**Problem.** Contract analysts manually read every document to extract risk clauses, missing terms, and case precedents. A standard contract analysis took 4-6 hours.

**Solution.** CrewAI + 4 role-based agents: **Reader** (article-by-article structural chunking), **Risk Analyst** (risk scoring), **Regulator** (KVKK, TBK, TMK comparison via RAG), **Writer** (final summary). Claude Opus 4.7 (1M context — ideal for long contracts) base.

**Result.** Contract analysis time dropped from 4-6 hours to 35 minutes. Lawyers received citation-grounded reports; the final decision still rests with the lawyer. Average case duration shortened by 22%; additional $480K annual revenue.

### Case 3 — E-Commerce Marketplace: Supplier Sales Agent

**Problem.** Onboarding a new seller required a personalized offer package (market research, product fit analysis, pricing proposal, contract draft) — days of work per prospect.

**Solution.** OpenAI Operator-based agent + computer-use capability. The agent scans the CRM, gathers company information from LinkedIn, reviews the product catalog, creates a personalized offer package, and submits to a sales rep for approval.

**Result.** New-seller onboarding time dropped from 5 days to 1.5 days. Monthly new sellers onboarded: 2.4x. ROI: 7x in 6 months.

## 13. Agent Development Roadmap

<howto-steps data-name="From Zero to Production: An Agent Development Roadmap" data-description="A 6-month plan to ship a production-grade agent at a Turkish enterprise." data-time="P6M" data-steps="[{&#34;name&#34;:&#34;Weeks 1-2: Use-Case Validation&#34;,&#34;text&#34;:&#34;Which process benefits from an agent? Cost of the current solution? Expected ROI? Single vs multi-agent fit?&#34;},{&#34;name&#34;:&#34;Weeks 3-4: Tool Inventory and MCP Strategy&#34;,&#34;text&#34;:&#34;Which systems to integrate (CRM, ERP, tickets, files, mail)? MCP servers existing or custom? KVKK risk assessment.&#34;},{&#34;name&#34;:&#34;Weeks 4-8: MVP Build&#34;,&#34;text&#34;:&#34;Single-agent ReAct MVP. LangGraph or Vercel AI SDK choice. Claude Opus 4.7 or GPT-5 default LLM. Basic tool set (5-10 tools).&#34;},{&#34;name&#34;:&#34;Weeks 8-10: Eval Harness&#34;,&#34;text&#34;:&#34;50-100 task test set. Task success rate, plan quality, cost-per-task, latency p50/p95. Langfuse or LangSmith setup.&#34;},{&#34;name&#34;:&#34;Weeks 10-14: Guardrails and HITL&#34;,&#34;text&#34;:&#34;Destructive action list, permission matrix, HITL approval flow, audit log, observability dashboard.&#34;},{&#34;name&#34;:&#34;Weeks 14-18: Production Hardening&#34;,&#34;text&#34;:&#34;Streaming, parallel tool calls, rollback procedures, prompt-injection tests.&#34;},{&#34;name&#34;:&#34;Weeks 18-22: Pilot Production&#34;,&#34;text&#34;:&#34;Limited user group, daily metric tracking, fast iteration.&#34;},{&#34;name&#34;:&#34;Weeks 22-26: Full Production&#34;,&#34;text&#34;:&#34;Open to all users, multi-agent if needed, finalize KVKK compliance and documentation.&#34;}]"></howto-steps>

## 14. Common Mistakes and Anti-Patterns

Mistakes that repeatedly appear in production agent projects:

### 14.1. The "Single Mega-Agent" Trap

One agent given 30+ tools and told to "do everything." Result: the planner overloads, wrong tool selections multiply, eval becomes impossible. **Fix:** Narrow the task scope or split into supervisor + specialist sub-agents.
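As a rough sketch of the split, the supervisor owns only the routing decision while each specialist owns a narrow tool set. All names here are illustrative, and the keyword routing is a stand-in for what would be an LLM call in practice:

```python
# Minimal supervisor sketch: route each task to a narrow specialist
# instead of one mega-agent holding 30+ tools. All names are illustrative.

SPECIALISTS = {
    "crm": ["lookup_customer", "update_record"],    # CRM-only tool set
    "billing": ["fetch_invoice", "issue_refund"],   # billing-only tool set
    "support": ["search_kb", "create_ticket"],      # support-only tool set
}

def route(task: str) -> str:
    """Pick a specialist by keyword match (an LLM call in a real system)."""
    keywords = {
        "crm": ("customer", "lead"),
        "billing": ("invoice", "refund"),
        "support": ("ticket", "help"),
    }
    for name, words in keywords.items():
        if any(w in task.lower() for w in words):
            return name
    return "support"  # fallback specialist

specialist = route("Issue a refund for invoice #1234")
print(specialist, SPECIALISTS[specialist])
```

The point of the pattern is that each specialist's planner only ever sees 2-3 tools, so tool-selection errors and eval scope both shrink.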

### 14.2. Shipping Without Eval

Skipping the eval harness with "we'll test in beta." The first real bug becomes a user-facing incident. **Fix:** A 50+ task eval set is mandatory before production; run in CI on every PR.
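A minimal eval harness along these lines — a fixed task set, a success-rate threshold, and a hard failure for CI — can be sketched as follows. The agent function and task set are stand-ins for your real ones:

```python
# Sketch of a tiny eval harness: run the agent over a fixed task set and
# fail CI (via assertion / non-zero exit) if success rate regresses.

TASKS = [
    {"query": "2+2", "expected": "4"},
    {"query": "capital of France", "expected": "Paris"},
]

def fake_agent(query: str) -> str:
    """Placeholder for the real agent call."""
    return {"2+2": "4", "capital of France": "Paris"}.get(query, "")

def run_eval(agent, tasks, threshold=0.9) -> float:
    passed = sum(agent(t["query"]) == t["expected"] for t in tasks)
    rate = passed / len(tasks)
    # In CI this assertion is what blocks the merge on regression:
    assert rate >= threshold, f"eval regression: {rate:.0%} < {threshold:.0%}"
    return rate

print(run_eval(fake_agent, TASKS))
```

Real harnesses add per-task cost, latency, and LLM-as-judge plan scores, but the shape — fixed set, threshold, hard failure — stays the same.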

### 14.3. No HITL

An agent that decides everything autonomously, skipping human approval on critical actions. KVKK + EU AI Act risk. **Fix:** HITL is mandatory for destructive, financial, or high-user-impact actions.
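One way to sketch the approval gate: classify actions as high- or low-stakes, and execute high-stakes ones only after an explicit human decision. Action names and the approval callback are illustrative:

```python
# HITL gate sketch: high-stakes actions wait for human approval,
# low-stakes actions execute directly. Names are illustrative.

HIGH_STAKES = {"delete_record", "send_payment", "change_account"}

def execute(action: str, approve) -> str:
    """`approve` is a callback standing in for a real approval UI or queue."""
    if action in HIGH_STAKES:
        if not approve(action):
            return "blocked: human rejected"
        return f"executed after approval: {action}"
    return f"auto-executed: {action}"

print(execute("search_kb", approve=lambda a: False))    # low-stakes: runs
print(execute("send_payment", approve=lambda a: True))  # high-stakes: gated
```

In production the `approve` callback becomes an asynchronous queue with a timeout and an audit-log entry, but the classification step is the part teams most often skip.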

### 14.4. Infinite Loops

In a reflection loop the agent keeps re-evaluating its own answer. Token bomb. **Fix:** Hard caps on max-iter (e.g., 20), max-cost ($0.50/task), and max-time (5 min).

### 14.5. Prompt-Injection-Open Tool Use

Untrusted input manipulates the system prompt and the agent ends up calling unauthorized tools. **Fix:** Strict input validation, tool authorization, sandboxed code execution.
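A sketch of the runtime-side defense: the model may *request* any tool, but the executor checks an allowlist and validates arguments before running anything. Names and limits are illustrative:

```python
# Tool authorization sketch: never trust the model's tool request blindly.
# The executor enforces a per-agent allowlist and validates arguments.

ALLOWED = {"support_bot": {"search_kb", "create_ticket"}}

def call_tool(caller: str, tool: str, args: dict) -> str:
    if tool not in ALLOWED.get(caller, set()):
        raise PermissionError(f"{caller} may not call {tool}")
    # Validate arguments before execution (illustrative checks):
    if not all(isinstance(v, str) and len(v) < 1000 for v in args.values()):
        raise ValueError("invalid tool arguments")
    return f"ran {tool}({args})"

print(call_tool("support_bot", "search_kb", {"query": "refund policy"}))
try:
    call_tool("support_bot", "delete_record", {"id": "42"})
except PermissionError as e:
    print("blocked:", e)
```

The key design choice: authorization lives in the executor, not in the prompt, so a successful injection can change what the model *asks for* but not what actually runs.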

### 14.6. Shipping Without Observability

Cannot answer "why did the agent do this?". **Fix:** Langfuse / LangSmith / Helicone from day 1; persist every tool call, planner decision, and eval score.
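The "persist every tool call" rule can be prototyped as a tracing decorator before wiring up Langfuse or LangSmith. This in-memory version records name, arguments, result, and duration; all names are illustrative:

```python
import json
import time

# Day-1 observability sketch: every tool call becomes a structured event,
# so "why did the agent do this?" is answerable. In production the TRACE
# list would be a Langfuse/LangSmith exporter; here it is in-memory.

TRACE: list[dict] = []

def traced(tool):
    """Decorator that records tool name, args, result, and duration."""
    def wrapper(*args, **kwargs):
        start = time.monotonic()
        result = tool(*args, **kwargs)
        TRACE.append({
            "tool": tool.__name__,
            "args": args,
            "kwargs": kwargs,
            "result": result,
            "ms": round((time.monotonic() - start) * 1000, 1),
        })
        return result
    return wrapper

@traced
def search_kb(query: str) -> str:
    return f"3 results for '{query}'"

search_kb("refund policy")
print(json.dumps(TRACE, default=str, indent=2))
```

The same wrapper point is where planner decisions and eval scores get recorded; the habit of instrumenting at the call boundary matters more than the backend choice.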

### 14.7. The "No Transparency" Pattern

Users not knowing they are talking to an agent — an EU AI Act transparency violation. **Fix:** Clear AI disclosure, agent action summaries, user controls.

### 14.8. Cost Surprise

Going to production without a token budget; end-of-month invoice 10x the expectation. **Fix:** Per-user, per-task, per-day budget caps + alert thresholds.
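A per-user daily cap with an alert threshold can be as simple as the following sketch. The limits and the alert sink are illustrative; production versions live in middleware or a billing service:

```python
from collections import defaultdict

# Budget-cap sketch: per-user daily spend limit plus an alert threshold,
# so the cost surprise shows up as an alert, not an invoice.

DAILY_LIMIT = 5.00   # $ per user per day (illustrative)
ALERT_AT = 0.8       # warn at 80% of the cap

spend: dict[str, float] = defaultdict(float)
alerts: list[str] = []

def charge(user: str, cost: float) -> bool:
    """Record cost; return False (block the call) when over budget."""
    if spend[user] + cost > DAILY_LIMIT:
        return False
    spend[user] += cost
    if spend[user] >= DAILY_LIMIT * ALERT_AT:
        alerts.append(f"{user} at {spend[user] / DAILY_LIMIT:.0%} of budget")
    return True

assert charge("alice", 3.00)       # fine
assert charge("alice", 1.50)       # crosses 80% -> alert fired
assert not charge("alice", 1.00)   # would exceed $5.00 -> blocked
print(alerts)
```

The same pattern extends to per-task and per-day global caps; the essential property is that `charge` runs *before* the LLM call, so a blocked call costs nothing.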

## 15. The 2026-2030 Future of Agents

**1. The MCP standard spreads.** Publishing an MCP server becomes essentially mandatory for SaaS products by 2027; AI engines start disadvantaging products without one.

**2. Computer use goes mainstream.** With Anthropic Computer Use and OpenAI Operator maturing in 2026, the RPA market is fundamentally transformed. Legacy RPA players like UiPath and Automation Anywhere face pressure from AI-native products.

**3. Multi-agent A2A standardizes.** Google's A2A protocol and similar initiatives enable agents to communicate as independent network services.

**4. Specialized vertical agents.** Domain-trained agent platforms emerge for law, health, finance, retail. The "one general agent" gives way to "one agent per sector."

**5. Agent eval frameworks mature.** By end of 2026, "agent benchmarks" reach the maturity LLM benchmarks have today.

**6. Self-improving agents (limited).** Agents that improve themselves via reflection + memory + fine-tuning loops are in research; production by 2027-2028.

**7. Regulatory tightening.** EU AI Act implementation in 2026-2027 brings concrete obligations for autonomous decision-making agents; US states and Turkey debate similar laws.

## 16. Frequently Asked Questions

<callout-box data-variant="answer" data-title="What is the difference between an AI Agent and a chatbot?">

A chatbot **produces a response**; an agent **takes action**. A chatbot answers an order-status question with text; an agent queries the order, contacts the courier, and proactively notifies the customer. Advanced versions of modern assistants (ChatGPT, Claude) can do both.

</callout-box>

<callout-box data-variant="answer" data-title="Which LLM is best for agents?">

As of 2026: Claude Opus 4.7 (Anthropic's agent-use training focus), GPT-5 (function-calling maturity), and Gemini 3 Pro (for multimodal agent tasks) lead. Open alternatives: Llama 4 70B and DeepSeek V3 with tool-use support are sufficient.

</callout-box>

<callout-box data-variant="answer" data-title="Why are agents so expensive?">

Agent tasks consume 10-100x more tokens than single-prompt calls; plan, observation, reflection, and retry are each separate LLM calls. Multi-agent setups multiply cost further. Do not ship without cost-aware architecture (model routing, caching, parallel calls).

</callout-box>

<callout-box data-variant="answer" data-title="Which framework should I build an agent with?">

Decision matrix: **MVP / fast prototype:** CrewAI; **production multi-agent:** LangGraph; **TypeScript / Next.js:** Vercel AI SDK; **Microsoft / .NET:** AutoGen or Semantic Kernel; **Anthropic-focused:** Claude Code + MCP. For single-agent, a minimal library / native API is enough.

</callout-box>

<callout-box data-variant="answer" data-title="How autonomous should an agent be?">

Sector consensus: **HITL (Human-in-the-Loop) for critical decisions**, automation for routine ones. High-stakes actions (payments, deletions, account changes) require human approval; low-stakes tasks (information retrieval, draft creation, report writing) can be fully automated.

</callout-box>

<callout-box data-variant="answer" data-title="Can I build agents without MCP?">

Yes — MCP is not mandatory but in 2026 **strategically the right choice**. Without MCP, your tool integrations are tied to one LLM provider; switching requires rewrites. MCP is the standard way to avoid vendor lock-in.

</callout-box>

<callout-box data-variant="answer" data-title="How safe is Computer Use?">

Anthropic currently recommends running Claude Computer Use in **a sandboxed VM** to restrict access to systems the model is not entitled to reach. For production deployments, sandboxing is mandatory; giving direct access to the live OS is high-risk.

</callout-box>

<callout-box data-variant="answer" data-title="How do KVKK and the EU AI Act apply to agents?">

If an agent processes personal data: **privacy notice** (informing the user), **right to object to automated decisions** (Article 11), **audit log**, **data minimization**. For high-risk EU AI Act categories: human oversight, documentation, quality management. A detailed compliance guide is available on this site.

</callout-box>

<callout-box data-variant="answer" data-title="How do I evaluate an agent?">

Build a 50-200 representative task set (user query examples + expected results). For each task measure: task success (boolean), plan quality (LLM-as-judge), step count, tool accuracy, latency, cost. Build a dashboard with LangSmith or Langfuse. Do not ship a new model/prompt version **without passing eval**.

</callout-box>

<callout-box data-variant="answer" data-title="Multi-agent vs single-agent — which to choose?">

80% of cases are solved by a single agent. Multi-agent is needed when **specialization** is required (each sub-agent in a different domain), for **parallelization**, or for **long-tail tasks**. Multi-agent eval and debugging are 3-5x harder — start single-agent until operational maturity warrants the added complexity.

</callout-box>

<callout-box data-variant="answer" data-title="Are autonomous coding agents like Devin real?">

Partially. Devin, Replit Agent, Claude Code, Cursor Agent deliver impressive results on **specific tasks** (CRUD endpoints, bug fixes, adding tests). But major architectural decisions, complex refactoring, and domain business logic still require human developer oversight. As of 2026, "fully replacing a senior developer" is hype; "2-3x'ing a senior developer's productivity" is realistic.

</callout-box>

<callout-box data-variant="answer" data-title="Which framework has the best Turkish support?">

All major frameworks (LangGraph, AutoGen, CrewAI, Vercel AI SDK) work seamlessly with Turkish input/output; you can provide Turkish natural-language tool descriptions and agent instructions. In terms of Turkish docs/community, **Vercel AI SDK** and the **LangChain Turkish community** are the most active resources.

</callout-box>

## 17. Next Steps

To define your agent strategy or move an existing agent application to production quality:

1. **Agent architecture workshop.** Use-case evaluation, single-vs-multi decision, framework selection, tool inventory, KVKK risk map — clarified in a 4-hour session.
2. **Agent eval harness setup.** A 50-200 task test set, observability stack, monitoring dashboard. Brings your existing agent up to a measurable quality bar.
3. **Production audit.** If you have a live agent: 360° audit on cost, latency, errors, security, compliance with an improvement roadmap.

Reach out via the contact form on the site.

<references-list data-items="[{&#34;title&#34;:&#34;Building Effective Agents&#34;,&#34;url&#34;:&#34;https://www.anthropic.com/research/building-effective-agents&#34;,&#34;author&#34;:&#34;Anthropic&#34;,&#34;publishedAt&#34;:&#34;2024-12-19&#34;,&#34;publisher&#34;:&#34;Anthropic&#34;},{&#34;title&#34;:&#34;ReAct: Synergizing Reasoning and Acting in Language Models&#34;,&#34;url&#34;:&#34;https://arxiv.org/abs/2210.03629&#34;,&#34;author&#34;:&#34;Yao et al.&#34;,&#34;publishedAt&#34;:&#34;2022-10-06&#34;,&#34;publisher&#34;:&#34;ICLR 2023&#34;},{&#34;title&#34;:&#34;Reflexion: Language Agents with Verbal Reinforcement Learning&#34;,&#34;url&#34;:&#34;https://arxiv.org/abs/2303.11366&#34;,&#34;author&#34;:&#34;Shinn et al.&#34;,&#34;publishedAt&#34;:&#34;2023-03-20&#34;,&#34;publisher&#34;:&#34;NeurIPS 2023&#34;},{&#34;title&#34;:&#34;Toolformer: Language Models Can Teach Themselves to Use Tools&#34;,&#34;url&#34;:&#34;https://arxiv.org/abs/2302.04761&#34;,&#34;author&#34;:&#34;Schick et al.&#34;,&#34;publishedAt&#34;:&#34;2023-02-09&#34;,&#34;publisher&#34;:&#34;NeurIPS 2023&#34;},{&#34;title&#34;:&#34;Tree of Thoughts: Deliberate Problem Solving&#34;,&#34;url&#34;:&#34;https://arxiv.org/abs/2305.10601&#34;,&#34;author&#34;:&#34;Yao et al.&#34;,&#34;publishedAt&#34;:&#34;2023-05-17&#34;,&#34;publisher&#34;:&#34;NeurIPS 2023&#34;},{&#34;title&#34;:&#34;Model Context Protocol Specification&#34;,&#34;url&#34;:&#34;https://modelcontextprotocol.io/&#34;,&#34;author&#34;:&#34;Anthropic&#34;,&#34;publishedAt&#34;:&#34;2024-11&#34;,&#34;publisher&#34;:&#34;Anthropic&#34;},{&#34;title&#34;:&#34;LangGraph Documentation&#34;,&#34;url&#34;:&#34;https://langchain-ai.github.io/langgraph/&#34;,&#34;author&#34;:&#34;LangChain&#34;,&#34;publishedAt&#34;:&#34;2025&#34;,&#34;publisher&#34;:&#34;LangChain&#34;},{&#34;title&#34;:&#34;AutoGen: Enabling Next-Gen LLM Applications&#34;,&#34;url&#34;:&#34;https://microsoft.github.io/autogen/&#34;,&#34;author&#34;:&#34;Microsoft Research&#34;,&#34;publishedAt&#34;:&#34;2024&#34;,&#34;publisher&#34;:&#34;Microsoft&#34;},{&#34;title&#34;:&#34;CrewAI Documentation&#34;,&#34;url&#34;:&#34;https://docs.crewai.com/&#34;,&#34;author&#34;:&#34;CrewAI Inc.&#34;,&#34;publishedAt&#34;:&#34;2025&#34;,&#34;publisher&#34;:&#34;CrewAI&#34;},{&#34;title&#34;:&#34;OpenAI Operator&#34;,&#34;url&#34;:&#34;https://openai.com/index/introducing-operator/&#34;,&#34;author&#34;:&#34;OpenAI&#34;,&#34;publishedAt&#34;:&#34;2025-01&#34;,&#34;publisher&#34;:&#34;OpenAI&#34;},{&#34;title&#34;:&#34;Anthropic Computer Use&#34;,&#34;url&#34;:&#34;https://www.anthropic.com/news/3-5-models-and-computer-use&#34;,&#34;author&#34;:&#34;Anthropic&#34;,&#34;publishedAt&#34;:&#34;2024-10&#34;,&#34;publisher&#34;:&#34;Anthropic&#34;},{&#34;title&#34;:&#34;Vercel AI SDK&#34;,&#34;url&#34;:&#34;https://sdk.vercel.ai/&#34;,&#34;author&#34;:&#34;Vercel&#34;,&#34;publishedAt&#34;:&#34;2025&#34;,&#34;publisher&#34;:&#34;Vercel&#34;},{&#34;title&#34;:&#34;EU Artificial Intelligence Act&#34;,&#34;url&#34;:&#34;https://artificialintelligenceact.eu/&#34;,&#34;author&#34;:&#34;European Commission&#34;,&#34;publishedAt&#34;:&#34;2024-03&#34;,&#34;publisher&#34;:&#34;EU&#34;},{&#34;title&#34;:&#34;KVKK - Law No. 6698&#34;,&#34;url&#34;:&#34;https://www.kvkk.gov.tr/&#34;,&#34;author&#34;:&#34;Republic of Turkiye - KVKK&#34;,&#34;publishedAt&#34;:&#34;2016-04-07&#34;,&#34;publisher&#34;:&#34;Republic of Turkiye&#34;}]"></references-list>

---

This is a living document; the AI Agent ecosystem (frameworks, MCP standards, computer-use capabilities) shifts every quarter, so it is **updated quarterly**.