<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Abhi</title>
    <description>The latest articles on Forem by Abhi (@abhigk).</description>
    <link>https://forem.com/abhigk</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F194331%2F6cc6422d-1e3a-45ab-8888-e4e269d8439c.jpeg</url>
      <title>Forem: Abhi</title>
      <link>https://forem.com/abhigk</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/abhigk"/>
    <language>en</language>
    <item>
      <title>Agents that ship: Breakdown of the 3-part architecture that survived real-world chaos</title>
      <dc:creator>Abhi</dc:creator>
      <pubDate>Mon, 24 Nov 2025 11:28:58 +0000</pubDate>
      <link>https://forem.com/abhigk/agents-that-ship-breakdown-of-the-3-part-architecture-that-survived-real-world-chaos-2679</link>
      <guid>https://forem.com/abhigk/agents-that-ship-breakdown-of-the-3-part-architecture-that-survived-real-world-chaos-2679</guid>
      <description>&lt;h2&gt;
  
  
  &lt;strong&gt;1. The Big Picture: From Passive AI to Autonomous Agents&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Historical Context&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Traditional AI was &lt;strong&gt;passive&lt;/strong&gt; — it responded to prompts, answered questions, or translated text.&lt;/li&gt;
&lt;li&gt;The new wave is about &lt;strong&gt;autonomous, goal-oriented AI agents&lt;/strong&gt; — systems that &lt;strong&gt;plan, act, and solve complex problems&lt;/strong&gt; over multiple steps without constant supervision.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;The Core Idea&lt;/strong&gt;
&lt;/h3&gt;

&lt;blockquote&gt;
&lt;p&gt;Agents don’t just talk — they act.&lt;br&gt;
They execute actions in the real (or digital) world to achieve defined goals.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fytv4zldjlmoy0zg7lqo9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fytv4zldjlmoy0zg7lqo9.png" alt="Agentic AI problem-solving process" width="800" height="491"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h6&gt;
  
  
  (Agentic AI problem-solving process from Whitepaper - Introduction to Agents and Agent architectures)
&lt;/h6&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;2. The Agent Anatomy: Three Core Parts&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The white paper breaks down an agent into three key components:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;The Model (Brain)&lt;/strong&gt; – The reasoning and decision-making core.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The Tools (Hands)&lt;/strong&gt; – The interfaces to act on the world.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The Orchestration Layer (Conductor)&lt;/strong&gt; – The system that coordinates everything.&lt;/li&gt;
&lt;/ol&gt;




&lt;h3&gt;
  
  
  &lt;strong&gt;A. The Model – “The Brain”&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;The &lt;strong&gt;LLM (Language Model)&lt;/strong&gt; serves as the reasoning engine.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Its main function: &lt;strong&gt;managing the context window&lt;/strong&gt; — constantly deciding &lt;em&gt;what’s important right now&lt;/em&gt; from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The mission goal&lt;/li&gt;
&lt;li&gt;Memory&lt;/li&gt;
&lt;li&gt;Tool outputs&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;It determines what matters for the next reasoning step.&lt;/p&gt;




&lt;h3&gt;
  
  
  &lt;strong&gt;B. The Tools – “The Hands”&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Tools are how agents &lt;strong&gt;interact with the outside world&lt;/strong&gt; — APIs, functions, databases, vector stores, etc.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Examples:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Look up customer data&lt;/li&gt;
&lt;li&gt;Check inventory&lt;/li&gt;
&lt;li&gt;Query a vector database&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;The model decides &lt;strong&gt;which tool&lt;/strong&gt; to use, while the orchestration layer executes the call and feeds results back into the model.&lt;/p&gt;




&lt;h3&gt;
  
  
  &lt;strong&gt;C. The Orchestration Layer – “The Conductor”&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Governs the &lt;strong&gt;entire reasoning loop&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Planning&lt;/li&gt;
&lt;li&gt;Memory/state management&lt;/li&gt;
&lt;li&gt;Reasoning strategy (e.g., &lt;em&gt;Chain-of-Thought&lt;/em&gt;, &lt;em&gt;ReAct&lt;/em&gt;)&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;The ReAct Loop&lt;/strong&gt;
&lt;/h4&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Think:&lt;/strong&gt; Based on the goal, decide next step.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Act:&lt;/strong&gt; Use a tool.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Observe:&lt;/strong&gt; Take in the result.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Think again:&lt;/strong&gt; Iterate.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This &lt;strong&gt;think–act–observe loop&lt;/strong&gt; is what transforms an LLM into a true agent capable of executing complex, adaptive workflows.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;3. Example: The Agentic Loop in Action&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Scenario:&lt;/strong&gt; Organizing a Team’s Travel
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Mission:&lt;/strong&gt; “Organize my team’s travel.”&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scan the Scene:&lt;/strong&gt; Identify tools — calendar, booking APIs, etc.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Plan:&lt;/strong&gt; “First, get the team roster.”&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Act:&lt;/strong&gt; Call &lt;code&gt;getTeamRoster()&lt;/code&gt; tool.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Observe &amp;amp; Iterate:&lt;/strong&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;ul&gt;
&lt;li&gt;Receive team list → update context.&lt;/li&gt;
&lt;li&gt;Next step: check availability, then book travel.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This cycle continues until the mission is completed.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;4. Levels of Agent Capability (Taxonomy)&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxof3u2rvc1b1s38zhobn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxof3u2rvc1b1s38zhobn.png" alt="Agent Taxonomy Levels" width="800" height="382"&gt;&lt;/a&gt;&lt;br&gt;
(Agent Taxonomy Levels from Whitepaper - Introduction to Agents and Agent architectures)&lt;/p&gt;

&lt;p&gt;Designing an agent requires defining its &lt;strong&gt;capability level&lt;/strong&gt;:&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Level 0: Basic LLM&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Just the model.&lt;/li&gt;
&lt;li&gt;No tools or external access.&lt;/li&gt;
&lt;li&gt;Can explain concepts but &lt;strong&gt;cannot access real-time data&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Level 1: Connected Problem Solver&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Model + Tools.&lt;/li&gt;
&lt;li&gt;Gains real-world awareness.&lt;/li&gt;
&lt;li&gt;Example: Looks up current sports scores via a search API.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Level 2: Strategic Problem Solver&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Handles &lt;strong&gt;multi-step tasks&lt;/strong&gt; using &lt;strong&gt;context engineering&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Example: “Find a coffee shop halfway between two addresses.”&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Uses a map tool → gets midpoint coordinates.&lt;/li&gt;
&lt;li&gt;Then queries coffee shops near that point with ratings &amp;gt;4.0.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Level 3: Collaborative Multi-Agent System&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;A &lt;strong&gt;team of agents&lt;/strong&gt; working together.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Example:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Project Manager Agent&lt;/strong&gt; → delegates to&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Market Research Agent&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Data Analysis Agent&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;&lt;p&gt;Enables &lt;strong&gt;goal delegation&lt;/strong&gt; and &lt;strong&gt;independent sub-agent reasoning&lt;/strong&gt;.&lt;/p&gt;&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Level 4: Self-Evolving System&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Agents that &lt;strong&gt;identify and fill their own capability gaps&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Example:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Realizes it needs social media sentiment analysis.&lt;/li&gt;
&lt;li&gt;Creates a &lt;strong&gt;new agent&lt;/strong&gt; to perform that task.&lt;/li&gt;
&lt;li&gt;Configures access and integrates it automatically.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;5. Building Reliable Production-Grade Agents&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Model Selection&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Don’t just chase benchmarks.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Choose models that are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Strong in reasoning.&lt;/li&gt;
&lt;li&gt;Reliable with tool usage.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;Use &lt;strong&gt;Model Routing&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Heavy reasoning → Gemini 1.5 Pro.&lt;/li&gt;
&lt;li&gt;Simple tasks → Gemini 1.5 Flash.&lt;/li&gt;
&lt;li&gt;Balances &lt;strong&gt;cost and performance&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;




&lt;h3&gt;
  
  
  &lt;strong&gt;Tool Design&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Two main categories:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Retrieval Tools (RAG, Vector DBs)&lt;/strong&gt; – Ground the agent in factual data.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Action Tools (APIs, Scripts)&lt;/strong&gt; – Allow real-world execution.&lt;/li&gt;
&lt;/ol&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Function Calling&lt;/strong&gt;
&lt;/h4&gt;

&lt;ul&gt;
&lt;li&gt;Tools must have &lt;strong&gt;clear specifications (e.g., OpenAPI format)&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;The model must know:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What the tool does.&lt;/li&gt;
&lt;li&gt;What parameters it requires.&lt;/li&gt;
&lt;li&gt;What output to expect.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;This ensures the &lt;strong&gt;loop stays stable and accurate&lt;/strong&gt;.&lt;/p&gt;




&lt;h3&gt;
  
  
  &lt;strong&gt;Memory Management&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Short-Term Memory:&lt;/strong&gt;
Current context and reasoning trace for the task.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Long-Term Memory:&lt;/strong&gt;
Persistent storage — preferences, user history, learned data.
Often implemented via &lt;strong&gt;vector databases&lt;/strong&gt; as RAG tools.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;6. Testing and Debugging (AgentOps)&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Evaluation&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Traditional testing doesn’t work — outputs vary.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Use &lt;strong&gt;AI-as-a-judge&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Another model grades outputs against a rubric.&lt;/li&gt;
&lt;li&gt;Checks factual grounding and adherence to constraints.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Observability&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;OpenTelemetry Traces&lt;/strong&gt; track every step:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompts, reasoning, tools used, parameters, outputs.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;&lt;p&gt;Acts as a &lt;strong&gt;flight recorder&lt;/strong&gt; for debugging.&lt;/p&gt;&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;User Feedback&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Every failure → a &lt;strong&gt;new test case&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Builds a “golden dataset” that prevents recurring issues.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;7. Security and Governance&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;The Trust Trade-Off&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;More capabilities = more risk.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Requires &lt;strong&gt;Defense-in-Depth&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Hard-coded guardrails (policy engines).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AI-based guard models&lt;/strong&gt; to detect risky behavior pre-execution.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Agent Identity&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Each agent needs a &lt;strong&gt;secure digital identity&lt;/strong&gt; (e.g., &lt;em&gt;SXBF standard&lt;/em&gt;).&lt;/li&gt;
&lt;li&gt;Enables &lt;strong&gt;least-privilege access control&lt;/strong&gt; — limit what each agent can do.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Agent Governance&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Prevent &lt;strong&gt;agent sprawl&lt;/strong&gt; with a &lt;strong&gt;central control plane&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Routes all traffic (user ↔ agent, agent ↔ tool).&lt;/li&gt;
&lt;li&gt;Enforces policies and authentication.&lt;/li&gt;
&lt;li&gt;Monitors logs and performance metrics.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;8. Continuous Learning and Adaptation&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Agents evolve through:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Runtime logs and traces&lt;/li&gt;
&lt;li&gt;User feedback&lt;/li&gt;
&lt;li&gt;Policy and data updates&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Simulation Environments (“Agent Gym”)&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Safe sandbox for testing complex multi-agent behaviors.&lt;/li&gt;
&lt;li&gt;Enables experimentation with synthetic data before deployment.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;9. Real-World Examples&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Google Co-Scientist&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;A &lt;strong&gt;Level 3–4 system&lt;/strong&gt; for scientific research.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Acts as a virtual collaborator:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Formulates hypotheses.&lt;/li&gt;
&lt;li&gt;Designs experiments.&lt;/li&gt;
&lt;li&gt;Analyzes data.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;&lt;p&gt;Uses multiple agents under a &lt;strong&gt;supervisor agent&lt;/strong&gt;.&lt;/p&gt;&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftf1qodezt7j3fbp5iwwj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftf1qodezt7j3fbp5iwwj.png" alt="The AI co-scientist design system" width="800" height="458"&gt;&lt;/a&gt;&lt;br&gt;
(The AI co-scientist design system from Whitepaper - Introduction to Agents and Agent architectures)&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh14m1azjdjkqkybaiao2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh14m1azjdjkqkybaiao2.png" alt="The AI co-scientist design system" width="800" height="402"&gt;&lt;/a&gt;&lt;br&gt;
(The AI co-scientist design system from Whitepaper - Introduction to Agents and Agent architectures)&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;AlphaVolve&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;A &lt;strong&gt;Level 4 AI system&lt;/strong&gt; focused on algorithm discovery.&lt;/li&gt;
&lt;li&gt;Generates, tests, and evolves algorithms autonomously.&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Has achieved improvements in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data center efficiency.&lt;/li&gt;
&lt;li&gt;Matrix multiplication algorithms.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;&lt;p&gt;Humans guide the process by defining &lt;strong&gt;evaluation metrics&lt;/strong&gt;.&lt;/p&gt;&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flvoon7nn89mc4gi80mte.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flvoon7nn89mc4gi80mte.png" alt="Alpha Evolve design system" width="800" height="321"&gt;&lt;/a&gt;&lt;br&gt;
(Alpha Evolve design system )&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;10. The Takeaway: Becoming an AI Architect&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Building successful agents isn’t about having the smartest model — it’s about &lt;strong&gt;engineering rigor&lt;/strong&gt;.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;The Core Components&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model&lt;/strong&gt; → Reasoning&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Tools&lt;/strong&gt; → Action&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Orchestration&lt;/strong&gt; → Management&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;What Matters Most&lt;/strong&gt;
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Architecture&lt;/li&gt;
&lt;li&gt;Governance&lt;/li&gt;
&lt;li&gt;Security&lt;/li&gt;
&lt;li&gt;Testing&lt;/li&gt;
&lt;li&gt;Observability&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Role as a developer is evolving:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;From coder to &lt;strong&gt;architect&lt;/strong&gt; — designing intelligent, autonomous systems that act as collaborative partners, not just tools.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Next I will share how to create an AI Agent from scratch&lt;br&gt;
Until then 👋&lt;/p&gt;

&lt;p&gt;[Reference: Whitepaper: Introduction to Agents. Authors: Alan Blount, Antonio Gulli, Shubham Saboo, Michael Zimmermann, and Vladimir Vuskovic]&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>webdev</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>An open letter to Microsoft</title>
      <dc:creator>Abhi</dc:creator>
      <pubDate>Fri, 20 Jan 2023 03:07:52 +0000</pubDate>
      <link>https://forem.com/abhigk/an-open-letter-to-microsoft-ei5</link>
      <guid>https://forem.com/abhigk/an-open-letter-to-microsoft-ei5</guid>
      <description>&lt;p&gt;Dear Microsoft,&lt;/p&gt;

&lt;p&gt;I am writing to express my deep concern about the decision to put accessible features, such as live captions, behind a paywall. This decision has a significant impact on individuals who are hard of hearing or deaf, as it makes it difficult for them to access important information and communicate effectively.&lt;/p&gt;

&lt;p&gt;Access to live captions is a basic need for many people with hearing impairments, and it is not something that should be restricted by cost. It is essential for them to be able to understand and participate in conversations and meetings, both in personal and professional settings. Without access to live captions, these individuals may miss important information and be excluded from full participation in their communities.&lt;/p&gt;

&lt;p&gt;In addition to being essential for communication, live captions are also a valuable tool for education and personal development. Many deaf and hard of hearing individuals struggle to access educational and professional opportunities due to barriers such as a lack of captioning. By making live captions available to all users, Microsoft can help to level the playing field and provide more opportunities for these individuals to reach their full potential.&lt;/p&gt;

&lt;p&gt;Furthermore, I would like to bring to your attention that, accessibility features such as live captions are not only beneficial for people with hearing impairments but also for people who are non-native speakers, people in noisy environments, or people with cognitive or learning disabilities. Therefore, it is not only important for people with hearing impairments but for everyone in general.&lt;/p&gt;

&lt;p&gt;I urge Microsoft to consider the needs of individuals with hearing impairments and to make live captions available to all users, regardless of their subscription status. This would make a significant difference in the lives of many people and would demonstrate the company's commitment to accessibility and inclusivity.&lt;/p&gt;

&lt;p&gt;I would like to thank you for your time and consideration, and I look forward to hearing about any steps that Microsoft is taking to address this issue.&lt;/p&gt;

&lt;p&gt;Sincerely,&lt;br&gt;
Abhishek Gupta &lt;/p&gt;

</description>
      <category>seo</category>
      <category>ai</category>
      <category>digital</category>
      <category>marketing</category>
    </item>
  </channel>
</rss>
