<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Ajaykumar Yavagal</title>
    <description>The latest articles on Forem by Ajaykumar Yavagal (@akyavagal).</description>
    <link>https://forem.com/akyavagal</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3920419%2F9fbaaace-6101-455e-b824-2f6c253cec70.png</url>
      <title>Forem: Ajaykumar Yavagal</title>
      <link>https://forem.com/akyavagal</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/akyavagal"/>
    <language>en</language>
    <item>
      <title>Hermes Agent — The System That Doesn’t Stop When the Task Ends</title>
      <dc:creator>Ajaykumar Yavagal</dc:creator>
      <pubDate>Fri, 15 May 2026 23:49:59 +0000</pubDate>
      <link>https://forem.com/akyavagal/hermes-agent-the-system-that-doesnt-stop-when-the-task-ends-hi6</link>
      <guid>https://forem.com/akyavagal/hermes-agent-the-system-that-doesnt-stop-when-the-task-ends-hi6</guid>
      <description>&lt;h1&gt;
  
  
  Hermes Isn't a Chatbot. It's an Agent Runtime.
&lt;/h1&gt;

&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/hermes-agent-2026-05-15"&gt;Hermes Agent Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;The first time you run Hermes, nothing about it feels unusual.&lt;/p&gt;

&lt;p&gt;A CLI.&lt;br&gt;
A loop.&lt;br&gt;
A few commands.&lt;/p&gt;

&lt;p&gt;Another agent.&lt;/p&gt;

&lt;p&gt;And that’s why most people will underestimate it.&lt;/p&gt;

&lt;p&gt;Because if you stop there, you miss what’s actually happening.&lt;/p&gt;

&lt;p&gt;Hermes is not optimizing responses.&lt;/p&gt;

&lt;p&gt;It is beginning to remember.&lt;/p&gt;


&lt;h2&gt;
  
  
  The Misunderstanding
&lt;/h2&gt;

&lt;p&gt;Most people encountering Hermes will interpret it as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;a coding assistant&lt;/li&gt;
&lt;li&gt;a tool wrapper&lt;/li&gt;
&lt;li&gt;a prompt loop with memory&lt;/li&gt;
&lt;li&gt;a nicer interface over LLMs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All reasonable conclusions.&lt;/p&gt;

&lt;p&gt;All incomplete.&lt;/p&gt;

&lt;p&gt;Hermes is not fundamentally a chatbot.&lt;/p&gt;

&lt;p&gt;It is an &lt;strong&gt;agent runtime&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;And more importantly:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;It is structured like something that expects to stay alive.&lt;/p&gt;
&lt;/blockquote&gt;


&lt;h2&gt;
  
  
  The Shift: From Responses to Runtime
&lt;/h2&gt;

&lt;p&gt;Most AI systems today operate like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Input → Prompt → Model → Output → End
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Hermes does something fundamentally different:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;State → Context → Reason → Act → Store → Continue
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This is the shift from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;answering → operating&lt;/li&gt;
&lt;li&gt;stateless → persistent&lt;/li&gt;
&lt;li&gt;reactive → continuous&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What Hermes Actually Builds
&lt;/h2&gt;

&lt;p&gt;At the center of Hermes is not an interface.&lt;/p&gt;

&lt;p&gt;It is a loop.&lt;/p&gt;

&lt;p&gt;A managed, long-lived, stateful loop.&lt;/p&gt;

&lt;p&gt;Everything else orbits that loop:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;CLI&lt;/li&gt;
&lt;li&gt;messaging gateways&lt;/li&gt;
&lt;li&gt;schedules&lt;/li&gt;
&lt;li&gt;batch jobs&lt;/li&gt;
&lt;li&gt;protocol adapters&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is not how you design a chatbot.&lt;/p&gt;

&lt;p&gt;This is how you design a runtime.&lt;/p&gt;




&lt;h2&gt;
  
  
  Architecture That Reveals Intent
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;User / External Surface
→ Interfaces (CLI, Gateway, MCP, Scheduler)
→ Agent Runtime
→ Context Engine + Memory Manager
→ Tools + Integrations
→ Providers
→ Persistent State
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Every layer isolates responsibility.&lt;/p&gt;

&lt;p&gt;Every layer can evolve.&lt;/p&gt;

&lt;p&gt;Hermes is not an app.&lt;/p&gt;

&lt;p&gt;It is a system that can host intelligence.&lt;/p&gt;




&lt;h2&gt;
  
  
  Memory Is Not a Feature — It's a Foundation
&lt;/h2&gt;

&lt;p&gt;Hermes separates memory into distinct layers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;curated long-term memory&lt;/li&gt;
&lt;li&gt;searchable session history&lt;/li&gt;
&lt;li&gt;external memory providers&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It distinguishes between:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;what must persist&lt;/li&gt;
&lt;li&gt;what can be retrieved&lt;/li&gt;
&lt;li&gt;what should be summarized&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is not prompt engineering.&lt;/p&gt;

&lt;p&gt;That is information architecture.&lt;/p&gt;




&lt;h2&gt;
  
  
  Context Is Treated Like Lifecycle
&lt;/h2&gt;

&lt;p&gt;Hermes does not treat context overflow as failure.&lt;/p&gt;

&lt;p&gt;It treats it as evolution.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;compresses intelligently&lt;/li&gt;
&lt;li&gt;preserves critical context&lt;/li&gt;
&lt;li&gt;rotates sessions&lt;/li&gt;
&lt;li&gt;maintains lineage&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Context becomes a managed lifecycle rather than a limitation.&lt;/p&gt;




&lt;h2&gt;
  
  
  Tools Are Capabilities
&lt;/h2&gt;

&lt;p&gt;Hermes defines a structured tool system:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;tools register themselves&lt;/li&gt;
&lt;li&gt;define schemas&lt;/li&gt;
&lt;li&gt;execute safely&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The model does not just generate text.&lt;/p&gt;

&lt;p&gt;It selects actions within a system.&lt;/p&gt;




&lt;h2&gt;
  
  
  Delegation Changes Everything
&lt;/h2&gt;

&lt;p&gt;Hermes can spawn sub-agents.&lt;/p&gt;

&lt;p&gt;Those sub-agents:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;run in isolation&lt;/li&gt;
&lt;li&gt;have bounded context&lt;/li&gt;
&lt;li&gt;use restricted tools&lt;/li&gt;
&lt;li&gt;return results&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This shifts intelligence from linear to distributed.&lt;/p&gt;




&lt;h2&gt;
  
  
  Agents as Processes
&lt;/h2&gt;

&lt;p&gt;Hermes treats agents not as calls, but as processes.&lt;/p&gt;

&lt;p&gt;Not something invoked once.&lt;/p&gt;

&lt;p&gt;Something that runs.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="k"&gt;while&lt;/span&gt; &lt;span class="n"&gt;alive&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
    &lt;span class="nf"&gt;observe&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
    &lt;span class="nf"&gt;reason&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
    &lt;span class="nf"&gt;act&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
    &lt;span class="nf"&gt;update&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This loop is the system.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;AI is moving from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;response systems&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;runtime systems&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Where the system itself:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;holds memory&lt;/li&gt;
&lt;li&gt;coordinates actions&lt;/li&gt;
&lt;li&gt;persists over time&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  The Bigger Shift
&lt;/h2&gt;

&lt;p&gt;The useful unit of AI is no longer the prompt.&lt;/p&gt;

&lt;p&gt;It is the runtime.&lt;/p&gt;

&lt;p&gt;Not isolated responses.&lt;/p&gt;

&lt;h2&gt;
  
  
  Persistent systems.
&lt;/h2&gt;

&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;Hermes is not important because of what it does today.&lt;/p&gt;

&lt;p&gt;It is important because of what it implies.&lt;/p&gt;

&lt;p&gt;The moment AI systems stop resetting...&lt;/p&gt;

&lt;p&gt;They stop behaving like assistants.&lt;/p&gt;

&lt;p&gt;They begin to persist.&lt;/p&gt;




&lt;h2&gt;
  
  
  Tags
&lt;/h2&gt;

&lt;p&gt;&lt;code&gt;#ai&lt;/code&gt; &lt;code&gt;#agents&lt;/code&gt; &lt;code&gt;#systemsdesign&lt;/code&gt; &lt;code&gt;#opensource&lt;/code&gt;&lt;/p&gt;

</description>
      <category>hermesagentchallenge</category>
      <category>devchallenge</category>
      <category>agents</category>
      <category>systemdesign</category>
    </item>
    <item>
      <title>Gemma 4 and the End of API-Dependent AI</title>
      <dc:creator>Ajaykumar Yavagal</dc:creator>
      <pubDate>Fri, 15 May 2026 16:06:22 +0000</pubDate>
      <link>https://forem.com/akyavagal/gemma-4-and-the-end-of-api-dependent-ai-1bb8</link>
      <guid>https://forem.com/akyavagal/gemma-4-and-the-end-of-api-dependent-ai-1bb8</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/google-gemma-2026-05-06"&gt;Gemma 4 Challenge: Write About Gemma 4&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;For years, we built AI systems by renting intelligence — this explores what happens when we finally start owning it.&lt;/p&gt;

&lt;p&gt;We called APIs.&lt;br&gt;&lt;br&gt;
We paid per token.&lt;br&gt;&lt;br&gt;
We accepted latency, outages, pricing changes, and vendor lock-in as normal.&lt;/p&gt;

&lt;p&gt;And eventually, we stopped questioning it.&lt;/p&gt;

&lt;p&gt;If you wanted serious AI capability, you didn’t own it.&lt;/p&gt;

&lt;p&gt;You leased it.&lt;/p&gt;


&lt;h2&gt;
  
  
  The API Era Shaped How We Build
&lt;/h2&gt;

&lt;p&gt;Modern AI systems were designed around centralized intelligence.&lt;/p&gt;

&lt;p&gt;Your application didn’t contain intelligence.&lt;br&gt;&lt;br&gt;
It depended on it.&lt;/p&gt;

&lt;p&gt;That decision shaped everything:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Architecture
&lt;/li&gt;
&lt;li&gt;Cost structure
&lt;/li&gt;
&lt;li&gt;Performance
&lt;/li&gt;
&lt;li&gt;Privacy
&lt;/li&gt;
&lt;li&gt;Scalability
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A large number of “AI products” became thin layers over external models.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;User Input → Backend → API → Model → Response → Cost
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This created a strange reality:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Core product capabilities were external
&lt;/li&gt;
&lt;li&gt;Margins depended on someone else’s pricing
&lt;/li&gt;
&lt;li&gt;Reliability depended on another company
&lt;/li&gt;
&lt;li&gt;Scaling increased dependency instead of reducing it&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We accepted it because we had no meaningful alternative.&lt;/p&gt;




&lt;h2&gt;
  
  
  Gemma 4 Changes the Assumption
&lt;/h2&gt;

&lt;p&gt;Gemma 4 doesn’t matter because it wins every benchmark.&lt;/p&gt;

&lt;p&gt;It matters because it changes something deeper:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;You can now own capable AI instead of permanently renting it.&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
That single shift changes how modern software gets designed.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;For the first time, developers can realistically ask:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;How much of my system actually needs a remote model?&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That’s a very different question from:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Which model is best?&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Benchmarks Don’t Build Systems
&lt;/h2&gt;

&lt;p&gt;AI discussions are increasingly dominated by:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Benchmark scores
&lt;/li&gt;
&lt;li&gt;Reasoning rankings
&lt;/li&gt;
&lt;li&gt;Throughput metrics
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But products don’t ship benchmarks.&lt;/p&gt;

&lt;p&gt;They ship systems.&lt;/p&gt;

&lt;p&gt;And real systems care about:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Latency
&lt;/li&gt;
&lt;li&gt;Cost predictability
&lt;/li&gt;
&lt;li&gt;Deployment flexibility
&lt;/li&gt;
&lt;li&gt;Privacy
&lt;/li&gt;
&lt;li&gt;Control
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The question is no longer:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;“Is this the smartest model?”&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It’s:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;“Is this model sufficient to own the stack?”&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Capability vs Practicality
&lt;/h2&gt;

&lt;p&gt;Frontier models still lead in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Deep reasoning
&lt;/li&gt;
&lt;li&gt;Complex planning
&lt;/li&gt;
&lt;li&gt;Advanced synthesis
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But most real-world workloads don’t need maximum intelligence.&lt;/p&gt;

&lt;p&gt;They need:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Summarization
&lt;/li&gt;
&lt;li&gt;Transformation
&lt;/li&gt;
&lt;li&gt;Structured outputs
&lt;/li&gt;
&lt;li&gt;Log analysis
&lt;/li&gt;
&lt;li&gt;Lightweight reasoning
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In these scenarios, a model that is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Local
&lt;/li&gt;
&lt;li&gt;Fast enough
&lt;/li&gt;
&lt;li&gt;Private
&lt;/li&gt;
&lt;li&gt;Low-cost
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;can create a better overall system, even if it scores lower on benchmarks.&lt;/p&gt;




&lt;h2&gt;
  
  
  Local AI Changes the Development Experience
&lt;/h2&gt;

&lt;p&gt;This shift is not just technical — it’s experiential.&lt;/p&gt;

&lt;p&gt;With APIs, every interaction introduces friction:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Network calls
&lt;/li&gt;
&lt;li&gt;Latency gaps
&lt;/li&gt;
&lt;li&gt;Rate limits
&lt;/li&gt;
&lt;li&gt;Cost awareness
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;With local models:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Responses begin immediately
&lt;/li&gt;
&lt;li&gt;Iteration becomes effectively free
&lt;/li&gt;
&lt;li&gt;Experimentation accelerates
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Even if raw speed is slower, the system often feels faster.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;AI stops being a service and becomes part of the system.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Privacy Becomes Structural
&lt;/h2&gt;

&lt;p&gt;Privacy is often treated as a feature.&lt;/p&gt;

&lt;p&gt;Local AI makes it architectural.&lt;/p&gt;

&lt;p&gt;Entire categories of software become easier to build:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Internal tools
&lt;/li&gt;
&lt;li&gt;Proprietary code analysis
&lt;/li&gt;
&lt;li&gt;Security systems
&lt;/li&gt;
&lt;li&gt;Regulated environments
&lt;/li&gt;
&lt;li&gt;Offline applications
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You’re no longer asking:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Can we send this data out?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Because you don’t have to.&lt;/p&gt;




&lt;h2&gt;
  
  
  From Renting Intelligence to Owning It
&lt;/h2&gt;

&lt;p&gt;The biggest shift is economic.&lt;/p&gt;

&lt;h3&gt;
  
  
  API-first AI
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Pay per request
&lt;/li&gt;
&lt;li&gt;Costs scale with usage
&lt;/li&gt;
&lt;li&gt;Dependency increases
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Local-first AI
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Costs stabilize
&lt;/li&gt;
&lt;li&gt;Control increases
&lt;/li&gt;
&lt;li&gt;Systems become customizable
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This moves AI from:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;a metered service → to infrastructure&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  A Small but Real Example
&lt;/h2&gt;

&lt;p&gt;While exploring this shift, I built a local-first system using Gemma 4&lt;br&gt;&lt;br&gt;
to process and interpret security events.&lt;/p&gt;

&lt;p&gt;Instead of sending logs to external services, the system:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Analyzes event patterns locally
&lt;/li&gt;
&lt;li&gt;Generates structured threat explanations
&lt;/li&gt;
&lt;li&gt;Provides actionable recommendations
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;What stood out was not just capability — but the workflow shift:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;No concern about API cost
&lt;/li&gt;
&lt;li&gt;Faster iteration loops
&lt;/li&gt;
&lt;li&gt;Full control over sensitive data
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It revealed something subtle but powerful:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Owning the intelligence changes how the system behaves.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Real-World Scenario
&lt;/h2&gt;

&lt;p&gt;Consider a small hospital or organization without a dedicated security team.&lt;/p&gt;

&lt;p&gt;A sequence of events occurs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Multiple failed login attempts
&lt;/li&gt;
&lt;li&gt;A successful login from an unusual source
&lt;/li&gt;
&lt;li&gt;Execution of a suspicious script
&lt;/li&gt;
&lt;li&gt;Persistence mechanisms being installed
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In most systems, these appear as isolated log entries.&lt;/p&gt;

&lt;p&gt;No clear narrative.&lt;br&gt;&lt;br&gt;
No immediate action.&lt;/p&gt;

&lt;p&gt;But when processed locally by a system like this:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The sequence is recognized as a coordinated attack
&lt;/li&gt;
&lt;li&gt;The risk is clearly explained
&lt;/li&gt;
&lt;li&gt;Immediate response actions are generated
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Instead of raw logs, the system produces:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A clear threat explanation
&lt;/li&gt;
&lt;li&gt;Context-aware insights
&lt;/li&gt;
&lt;li&gt;Actionable remediation steps
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is the difference between:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;detecting events&lt;/strong&gt; and &lt;strong&gt;understanding threats&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;And importantly, all of this happens locally — without sending sensitive system data outside the organization.&lt;/p&gt;




&lt;h2&gt;
  
  
  AI Watchdog in Action
&lt;/h2&gt;

&lt;p&gt;Below is a real example of AI Watchdog analyzing a multi-stage attack using Gemma 4 running locally:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdupm39p55bgshevewjow.PNG" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdupm39p55bgshevewjow.PNG" alt=" " width="800" height="465"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This example shows:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A sequence of suspicious events across a single host
&lt;/li&gt;
&lt;li&gt;Real-time threat classification (Critical, High, Medium)
&lt;/li&gt;
&lt;li&gt;Structured AI-generated insights
&lt;/li&gt;
&lt;li&gt;Actionable response recommendations
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The system transforms fragmented logs into a coherent attack narrative — locally, without external APIs.&lt;/p&gt;




&lt;h2&gt;
  
  
  This Shift Has Happened Before
&lt;/h2&gt;

&lt;p&gt;Computing has always moved in cycles:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Mainframes → centralized
&lt;/li&gt;
&lt;li&gt;PCs → decentralized
&lt;/li&gt;
&lt;li&gt;Cloud → centralized again
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI is beginning its own shift.&lt;/p&gt;

&lt;p&gt;For years, advanced intelligence lived in remote systems.&lt;br&gt;&lt;br&gt;
Now, it’s moving closer to developers again.&lt;/p&gt;




&lt;h2&gt;
  
  
  Frontier Models Still Matter
&lt;/h2&gt;

&lt;p&gt;This is not the end of APIs.&lt;/p&gt;

&lt;p&gt;Frontier models still lead in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Advanced reasoning
&lt;/li&gt;
&lt;li&gt;Complex problem-solving
&lt;/li&gt;
&lt;li&gt;Research-grade tasks
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But the gap between:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;“best possible”&lt;/strong&gt; and &lt;strong&gt;“good enough for real systems”&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;is shrinking quickly.&lt;/p&gt;

&lt;p&gt;And that’s where disruption happens.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Gemma 4 Represents
&lt;/h2&gt;

&lt;p&gt;Gemma 4 is not just another model release.&lt;/p&gt;

&lt;p&gt;It represents a change in assumption:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Powerful AI does not have to remain centralized.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;And once developers realize capable AI can increasingly run locally, the economics and architecture of software start changing with it.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;System design
&lt;/li&gt;
&lt;li&gt;Cost models
&lt;/li&gt;
&lt;li&gt;Developer workflows
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;For years, building with AI meant renting intelligence.&lt;/p&gt;

&lt;p&gt;Gemma 4 suggests a different future:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Local
&lt;/li&gt;
&lt;li&gt;Private
&lt;/li&gt;
&lt;li&gt;Controllable
&lt;/li&gt;
&lt;li&gt;Deployable anywhere
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Not perfect.&lt;/p&gt;

&lt;p&gt;But increasingly sufficient.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;In software, “sufficient and owned” often beats “perfect and rented.”&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The real question is no longer:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Which model is smartest?&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It’s:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Which model lets you build the best system?&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Maybe the future of AI is not about accessing the smartest model on Earth.&lt;/p&gt;

&lt;p&gt;Maybe it’s about owning intelligence that is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Good enough
&lt;/li&gt;
&lt;li&gt;Always available
&lt;/li&gt;
&lt;li&gt;Fully under your control
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And for the first time in a long while, that shift feels within reach.&lt;/p&gt;




&lt;h2&gt;
  
  
  Tags
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;#gemma
#gemmachallenge
#ai
#opensource
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



</description>
      <category>devchallenge</category>
      <category>gemmachallenge</category>
      <category>gemma</category>
      <category>privacy</category>
    </item>
    <item>
      <title>ProdSeer — AI-Powered Production Failure Prediction™</title>
      <dc:creator>Ajaykumar Yavagal</dc:creator>
      <pubDate>Fri, 08 May 2026 16:31:24 +0000</pubDate>
      <link>https://forem.com/akyavagal/prodseer-ai-powered-production-failure-prediction-3icc</link>
      <guid>https://forem.com/akyavagal/prodseer-ai-powered-production-failure-prediction-3icc</guid>
      <description>&lt;h1&gt;
  
  
  Building ProdSeer: AI-Powered Production Failure Prediction™
&lt;/h1&gt;

&lt;p&gt;Modern systems rarely fail because of one bug.&lt;/p&gt;

&lt;p&gt;They fail because of hidden operational complexity:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;cascading dependencies&lt;/li&gt;
&lt;li&gt;infrastructure bottlenecks&lt;/li&gt;
&lt;li&gt;observability blind spots&lt;/li&gt;
&lt;li&gt;external API failures&lt;/li&gt;
&lt;li&gt;scaling assumptions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So for the MeDo Hackathon, I built &lt;strong&gt;ProdSeer&lt;/strong&gt; — an AI-powered Production Failure Prediction™ platform.&lt;/p&gt;

&lt;p&gt;ProdSeer analyzes GitHub repositories, simulates cascading infrastructure failures, and forecasts production survivability before deployment using structured AI reasoning workflows.&lt;/p&gt;

&lt;h2&gt;
  
  
  What ProdSeer Does
&lt;/h2&gt;

&lt;p&gt;ProdSeer goes beyond static code analysis.&lt;/p&gt;

&lt;p&gt;It:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;analyzes repository architecture&lt;/li&gt;
&lt;li&gt;infers production topology&lt;/li&gt;
&lt;li&gt;identifies operational bottlenecks&lt;/li&gt;
&lt;li&gt;simulates cascading failure scenarios&lt;/li&gt;
&lt;li&gt;forecasts survivability under production pressure&lt;/li&gt;
&lt;li&gt;generates infrastructure redesign recommendations&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Some Features
&lt;/h2&gt;

&lt;p&gt;⚡ Repository intelligence&lt;br&gt;
🕸️ Infrastructure topology visualization&lt;br&gt;
🔥 Cascading failure simulation&lt;br&gt;
📉 Survival probability forecasting&lt;br&gt;
🛡️ Operational risk analysis&lt;br&gt;
💬 Conversational infrastructure reasoning&lt;br&gt;
📄 Executive production-readiness reports&lt;/p&gt;

&lt;h2&gt;
  
  
  The Most Interesting Part
&lt;/h2&gt;

&lt;p&gt;One of the wildest moments during development was watching AI reason about:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;AI-agent architectures&lt;/li&gt;
&lt;li&gt;degraded operational modes&lt;/li&gt;
&lt;li&gt;LLM dependency bottlenecks&lt;/li&gt;
&lt;li&gt;secure command execution&lt;/li&gt;
&lt;li&gt;infrastructure survivability&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The system began generating surprisingly believable operational redesigns involving:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Kubernetes&lt;/li&gt;
&lt;li&gt;Redis&lt;/li&gt;
&lt;li&gt;observability pipelines&lt;/li&gt;
&lt;li&gt;secure execution sandboxes&lt;/li&gt;
&lt;li&gt;API resilience layers&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Tech Stack
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;MeDo AI Framework&lt;/li&gt;
&lt;li&gt;Gemini 2.5 Flash&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;AI is making software development dramatically faster.&lt;/p&gt;

&lt;p&gt;But as AI-generated systems become more complex, operational uncertainty also increases.&lt;/p&gt;

&lt;p&gt;ProdSeer was an experiment in exploring what AI-native operational intelligence could look like.&lt;/p&gt;

&lt;p&gt;And honestly… watching AI simulate infrastructure collapse scenarios was mind-bending 😄&lt;/p&gt;

&lt;h2&gt;
  
  
  Links
&lt;/h2&gt;

&lt;p&gt;🚀 Live App: &lt;a href="https://app-bhgywf5bbuv5.appmedo.com/" rel="noopener noreferrer"&gt;https://app-bhgywf5bbuv5.appmedo.com/&lt;/a&gt;&lt;br&gt;&lt;br&gt;
🎥 Demo Video: &lt;a href="https://www.youtube.com/watch?v=y_sFMDpOYxs" rel="noopener noreferrer"&gt;https://www.youtube.com/watch?v=y_sFMDpOYxs&lt;/a&gt;  &lt;/p&gt;

&lt;h1&gt;
  
  
  BuiltWithMeDo
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>lowcode</category>
      <category>cybersecurity</category>
    </item>
  </channel>
</rss>
