<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Adarsh Agrahari</title>
    <description>The latest articles on Forem by Adarsh Agrahari (@adarsh_agrahari_47f6e1681).</description>
    <link>https://forem.com/adarsh_agrahari_47f6e1681</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3823501%2F686f5fbc-263a-44ac-9fba-f72bf22715b9.png</url>
      <title>Forem: Adarsh Agrahari</title>
      <link>https://forem.com/adarsh_agrahari_47f6e1681</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/adarsh_agrahari_47f6e1681"/>
    <language>en</language>
    <item>
      <title>most coding agents are still too stateless for real software workflows</title>
      <dc:creator>Adarsh Agrahari</dc:creator>
      <pubDate>Sun, 15 Mar 2026 18:53:48 +0000</pubDate>
      <link>https://forem.com/adarsh_agrahari_47f6e1681/most-coding-agents-are-still-too-stateless-for-real-software-workflows-4320</link>
      <guid>https://forem.com/adarsh_agrahari_47f6e1681/most-coding-agents-are-still-too-stateless-for-real-software-workflows-4320</guid>
      <description>&lt;p&gt;Inside a single prompt… coding agents can look impressive.&lt;/p&gt;

&lt;p&gt;Across longer software workflows… they still get brittle.&lt;/p&gt;

&lt;p&gt;That was the repeated frustration behind &lt;strong&gt;Nexus Prime&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;I kept seeing the same pattern:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;context drifted across steps&lt;/li&gt;
&lt;li&gt;prior decisions got lost&lt;/li&gt;
&lt;li&gt;execution became messy&lt;/li&gt;
&lt;li&gt;too much depended on one growing prompt&lt;/li&gt;
&lt;li&gt;parallel work became hard to isolate cleanly&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The issue did not feel like raw model quality.&lt;/p&gt;

&lt;p&gt;It felt like a systems problem.&lt;/p&gt;

&lt;h2&gt;
  
  
  The real gap
&lt;/h2&gt;

&lt;p&gt;Most coding agents are still too &lt;strong&gt;stateless&lt;/strong&gt; at the workflow level.&lt;/p&gt;

&lt;p&gt;They can generate well in the moment.&lt;br&gt;&lt;br&gt;
But they struggle to carry forward the right context over time.&lt;/p&gt;

&lt;p&gt;That matters more than it sounds.&lt;/p&gt;

&lt;p&gt;Because real software work is rarely one step.&lt;/p&gt;

&lt;p&gt;It is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;inspect code&lt;/li&gt;
&lt;li&gt;recover prior context&lt;/li&gt;
&lt;li&gt;choose the right files&lt;/li&gt;
&lt;li&gt;make changes&lt;/li&gt;
&lt;li&gt;verify what happened&lt;/li&gt;
&lt;li&gt;continue without silently losing state&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Once the workflow gets longer… the cracks show up.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I built
&lt;/h2&gt;

&lt;p&gt;I built &lt;strong&gt;Nexus Prime&lt;/strong&gt; to explore that missing layer.&lt;/p&gt;

&lt;p&gt;It is a &lt;strong&gt;local-first control plane for coding agents&lt;/strong&gt; built around:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;session bootstrap with memory recovery&lt;/li&gt;
&lt;li&gt;token-aware context assembly&lt;/li&gt;
&lt;li&gt;orchestrator-first execution&lt;/li&gt;
&lt;li&gt;reusable skills… workflows… hooks… and automations&lt;/li&gt;
&lt;li&gt;runtime truth surfaced in a dashboard&lt;/li&gt;
&lt;li&gt;isolated git worktree execution for bounded parallel work&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The goal is not to make agents sound smarter.&lt;/p&gt;

&lt;p&gt;The goal is to make them more usable over longer horizons.&lt;/p&gt;

&lt;h2&gt;
  
  
  What changed in my thinking
&lt;/h2&gt;

&lt;p&gt;Building this pushed me toward a stronger belief:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The next leap in coding agents may come less from better models alone&lt;br&gt;&lt;br&gt;
and more from better systems around memory… continuity… orchestration… and execution boundaries.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;In other words:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The problem with most coding agents is not just intelligence. It is state.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Why bounded execution matters
&lt;/h2&gt;

&lt;p&gt;One thing that became clearer while building Nexus Prime is that unbounded agent workflows do not just get expensive.&lt;/p&gt;

&lt;p&gt;They get misleading.&lt;/p&gt;

&lt;p&gt;Stale context accumulates.&lt;br&gt;&lt;br&gt;
Older assumptions bleed into later decisions.&lt;br&gt;&lt;br&gt;
The prompt becomes a blurry substitute for explicit state.&lt;/p&gt;

&lt;p&gt;That is why I became much more biased toward:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;bounded execution&lt;/li&gt;
&lt;li&gt;explicit memory recovery&lt;/li&gt;
&lt;li&gt;isolated worktrees&lt;/li&gt;
&lt;li&gt;runtime traces over vague conversational continuity&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If context has to be compacted… durable state should move somewhere explicit first.&lt;/p&gt;

&lt;p&gt;Otherwise compaction just hides distortion.&lt;/p&gt;

&lt;h2&gt;
  
  
  Early signal that this problem is real
&lt;/h2&gt;

&lt;p&gt;After launching Nexus Prime… the most interesting early signal was not just usage.&lt;/p&gt;

&lt;p&gt;It was recognition.&lt;/p&gt;

&lt;p&gt;Different technical communities kept coming back to the same idea in different words:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;coding agents need better state&lt;/li&gt;
&lt;li&gt;memory matters more than people think&lt;/li&gt;
&lt;li&gt;orchestration and runtime discipline are the missing middle layer&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That was useful confirmation that this is not just a personal frustration.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I am curious about next
&lt;/h2&gt;

&lt;p&gt;I think we are still early in understanding what the right systems layer for coding agents looks like.&lt;/p&gt;

&lt;p&gt;Some open questions I am thinking about:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;what should count as durable state vs disposable context&lt;/li&gt;
&lt;li&gt;how much memory recovery should happen automatically&lt;/li&gt;
&lt;li&gt;when should orchestration choose a workflow vs a specialist vs a tool&lt;/li&gt;
&lt;li&gt;what is the right balance between flexibility and bounded execution&lt;/li&gt;
&lt;li&gt;how much of agent usefulness comes from model quality vs control-plane design&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  If you are building in this space
&lt;/h2&gt;

&lt;p&gt;I would love to hear how you are thinking about:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;memory across workflows&lt;/li&gt;
&lt;li&gt;bounded execution&lt;/li&gt;
&lt;li&gt;context recovery&lt;/li&gt;
&lt;li&gt;worktree-based parallelism&lt;/li&gt;
&lt;li&gt;practical coding-agent architecture&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Nexus Prime is here if you want to explore it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Website: &lt;a href="https://nexus-prime.cfd" rel="noopener noreferrer"&gt;https://nexus-prime.cfd&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;GitHub: &lt;a href="https://github.com/sir-ad/nexus-prime" rel="noopener noreferrer"&gt;https://github.com/sir-ad/nexus-prime&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Product Hunt: &lt;a href="https://www.producthunt.com/products/nexus-prime" rel="noopener noreferrer"&gt;https://www.producthunt.com/products/nexus-prime&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you have hit similar problems… I would be especially curious where your current setup breaks.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Question for people building with coding agents:&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Do they mainly need better models from here… or better systems around state… continuity… and execution boundaries?&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>mcp</category>
      <category>agents</category>
    </item>
  </channel>
</rss>
