<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Ziel</title>
    <description>The latest articles on Forem by Ziel (@ziel).</description>
    <link>https://forem.com/ziel</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3810073%2Fd488a68d-f43b-4984-809e-165ba0b0dc68.png</url>
      <title>Forem: Ziel</title>
      <link>https://forem.com/ziel</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/ziel"/>
    <language>en</language>
    <item>
      <title>Aionis: AI Agents Don’t Have a Context Problem. They Have an Execution Memory Problem</title>
      <dc:creator>Ziel</dc:creator>
      <pubDate>Mon, 16 Mar 2026 13:29:59 +0000</pubDate>
      <link>https://forem.com/ziel/aionis-ai-agents-dont-have-a-context-problem-they-have-an-execution-memory-problem-2jbe</link>
      <guid>https://forem.com/ziel/aionis-ai-agents-dont-have-a-context-problem-they-have-an-execution-memory-problem-2jbe</guid>
      <description>&lt;p&gt;A lot of people think AI agents need more context.&lt;/p&gt;

&lt;p&gt;I don’t think that’s the core issue.&lt;/p&gt;

&lt;p&gt;I think the real issue is that most agents still don’t have a durable execution memory layer. They can generate. They can retrieve. They can sometimes act. But they still struggle to continue work reliably across sessions, interruptions, and handoffs.&lt;/p&gt;

&lt;p&gt;That’s why I built Aionis.&lt;/p&gt;

&lt;p&gt;Aionis is not another vector database wrapper. It is not a generic “memory plugin.” It is execution memory infrastructure for coding agents: a system for storing task state, structured handoff, replayable execution artifacts, and policy-linked memory that can actually help an agent continue work.&lt;/p&gt;

&lt;p&gt;The reason I think this matters is not just conceptual. We already have public evidence that this layer changes system behavior in measurable ways.&lt;/p&gt;

&lt;p&gt;The problem isn’t recall. It’s continuity.&lt;br&gt;
The standard memory story in AI is still too shallow. Store some history, retrieve the relevant bits, inject them back into the prompt, and hope the model stays coherent.&lt;/p&gt;

&lt;p&gt;That works for lightweight scenarios.&lt;/p&gt;

&lt;p&gt;It breaks down in real ones.&lt;/p&gt;

&lt;p&gt;In real coding workflows, the hard part is not “finding relevant text.” The hard part is preserving enough execution structure that the next session, the next runtime, or the next agent can actually pick up the task without re-deriving everything from scratch.&lt;/p&gt;

&lt;p&gt;That’s the gap Aionis is designed to close.&lt;/p&gt;

&lt;p&gt;And the public numbers already point in that direction.&lt;/p&gt;

&lt;p&gt;Public evidence so far&lt;br&gt;
On a continuation benchmark over pallets/click, Aionis reduced:&lt;/p&gt;

&lt;p&gt;input tokens by 30.03%&lt;br&gt;
output tokens by 77%&lt;br&gt;
total tokens by 33.24%&lt;br&gt;
That matters because the gain did not come from making the model “smarter.” It came from avoiding repeated context reconstruction in follow-up sessions.&lt;/p&gt;

&lt;p&gt;On handoff, the public evidence is even more direct:&lt;/p&gt;

&lt;p&gt;cross-runtime recovery improved from 33.33% to 100%&lt;br&gt;
real-repository handoff improved from 0% to 100%&lt;br&gt;
That is the kind of result I care about. Not whether an agent looks impressive in a single run, but whether it can reliably resume and continue when the session boundary changes.&lt;/p&gt;

&lt;p&gt;On policy routing, public A/B evidence showed improvement from 0% to 100%, with routing converging toward rg and pytest-focused behavior in real repository workflows. That suggests memory isn’t just useful for recall. It can become part of the control loop.&lt;/p&gt;

&lt;p&gt;On replay, strict replay on replay1 and replay2 ran with 0 model tokens. That’s important because it shows some execution paths can be recovered or reproduced without paying model cost again.&lt;/p&gt;

&lt;p&gt;On the SDK side, Aionis ran a route-to-SDK audit on 2026-03-14 across 65 non-admin, non-control-plane routes and found no missing public SDK surface in either the TypeScript SDK or the Python SDK.&lt;/p&gt;

&lt;p&gt;So this is not just an idea anymore. There is already a public implementation and public evidence across continuation, handoff, replay, policy, and SDK coverage.&lt;/p&gt;

&lt;p&gt;Why this is different from “more context”&lt;br&gt;
Bigger context windows help. Retrieval helps. But neither one is the same as execution memory.&lt;/p&gt;

&lt;p&gt;Context is what the model can see right now.&lt;br&gt;
Execution memory is what the system can rely on later.&lt;/p&gt;

&lt;p&gt;That difference matters a lot.&lt;/p&gt;

&lt;p&gt;A system can retrieve the right text and still fail to resume a task properly. It can remember facts but lose decisions. It can preserve notes but fail to preserve execution state.&lt;/p&gt;

&lt;p&gt;Aionis is built around the idea that agents need memory of work, not just memory of content.&lt;/p&gt;

&lt;p&gt;That includes:&lt;/p&gt;

&lt;p&gt;what happened&lt;br&gt;
what artifacts were produced&lt;br&gt;
what state the task is in&lt;br&gt;
what handoff is available&lt;br&gt;
what can be replayed&lt;br&gt;
what policy signals should influence the next step&lt;br&gt;
That’s why I increasingly think “execution memory” is the right category. It is more precise than “agent memory,” and much more useful than just saying “context management.”&lt;/p&gt;

&lt;p&gt;My bet&lt;br&gt;
My bet is simple:&lt;/p&gt;

&lt;p&gt;The next wave of useful agents won’t be defined by who can stuff the most context into a prompt. They’ll be defined by who can build systems that preserve state, enforce control, and carry execution forward reliably.&lt;/p&gt;

&lt;p&gt;That means memory has to become infrastructure.&lt;/p&gt;

&lt;p&gt;Not memory in the loose sense of “things the model once saw.”&lt;br&gt;
Memory in the hard sense of “state the runtime can operate on.”&lt;/p&gt;

&lt;p&gt;That’s what Aionis is trying to build.&lt;/p&gt;

&lt;p&gt;And if the current public results keep holding, I think this layer is going to matter much more than most people expect.&lt;/p&gt;

</description>
      <category>agents</category>
      <category>ai</category>
      <category>architecture</category>
      <category>showdev</category>
    </item>
    <item>
      <title>Replayable Execution Memory for AI Agents: Building Aionis</title>
      <dc:creator>Ziel</dc:creator>
      <pubDate>Fri, 06 Mar 2026 15:00:47 +0000</pubDate>
      <link>https://forem.com/ziel/replayable-execution-memory-for-ai-agents-building-aionis-d74</link>
      <guid>https://forem.com/ziel/replayable-execution-memory-for-ai-agents-building-aionis-d74</guid>
      <description>&lt;p&gt;Aionis: Replayable Execution Memory for AI Agents&lt;/p&gt;

&lt;p&gt;Large language models are getting extremely good at reasoning.&lt;/p&gt;

&lt;p&gt;Agents built on top of them can plan tasks, call tools, and automate workflows. But there is still a major limitation in most agent systems today:&lt;/p&gt;

&lt;p&gt;Agents don’t remember how work gets done.&lt;/p&gt;

&lt;p&gt;They remember conversations.&lt;br&gt;
They remember embeddings.&lt;br&gt;
But they rarely remember execution.&lt;/p&gt;

&lt;p&gt;Every time an agent performs a task, it often needs to reason through the entire workflow again.&lt;/p&gt;

&lt;p&gt;This leads to:&lt;br&gt;
    • high token usage&lt;br&gt;
    • slow execution&lt;br&gt;
    • unstable results&lt;/p&gt;

&lt;p&gt;We kept running into the same problem while building agent workflows. Even after an agent successfully completed a task, the next run still required the model to re-plan everything.&lt;/p&gt;

&lt;p&gt;So we built something different.&lt;/p&gt;

&lt;p&gt;From Conversation Memory to Execution Memory&lt;/p&gt;

&lt;p&gt;Most agent memory systems store text.&lt;/p&gt;

&lt;p&gt;Typical examples include:&lt;br&gt;
    • chat history&lt;br&gt;
    • vector embeddings&lt;br&gt;
    • entity memory&lt;br&gt;
    • preference storage&lt;/p&gt;

&lt;p&gt;These help agents recall information, but they don’t help agents reuse workflows.&lt;/p&gt;

&lt;p&gt;Aionis focuses on a different kind of memory:&lt;/p&gt;

&lt;p&gt;execution memory.&lt;/p&gt;

&lt;p&gt;Instead of storing what the agent said, Aionis records how the agent completed a task.&lt;/p&gt;

&lt;p&gt;The lifecycle looks like this:&lt;/p&gt;

&lt;p&gt;Agent Run&lt;br&gt;
↓&lt;br&gt;
Execution Trace&lt;br&gt;
↓&lt;br&gt;
Compile Playbook&lt;br&gt;
↓&lt;br&gt;
Replay Workflow&lt;/p&gt;

&lt;p&gt;Once a workflow succeeds, it can be replayed later.&lt;/p&gt;

&lt;p&gt;Instead of asking the model to reason again, the system executes the compiled workflow.&lt;/p&gt;

&lt;p&gt;Why This Matters&lt;/p&gt;

&lt;p&gt;Consider a typical agent task like setting up a development environment.&lt;/p&gt;

&lt;p&gt;Without execution memory, every run looks like this:&lt;/p&gt;

&lt;p&gt;User request&lt;br&gt;
↓&lt;br&gt;
LLM reasoning&lt;br&gt;
↓&lt;br&gt;
Tool planning&lt;br&gt;
↓&lt;br&gt;
Execution&lt;/p&gt;

&lt;p&gt;Even if the exact same task has already succeeded before.&lt;/p&gt;

&lt;p&gt;With execution memory, the process becomes:&lt;/p&gt;

&lt;p&gt;First run:&lt;br&gt;
LLM reasoning → execution trace → playbook&lt;/p&gt;

&lt;p&gt;Later runs:&lt;br&gt;
Replay playbook&lt;/p&gt;

&lt;p&gt;This dramatically reduces overhead and increases stability.&lt;/p&gt;

&lt;p&gt;What Replay Actually Means&lt;/p&gt;

&lt;p&gt;Replay in Aionis is not token replay.&lt;/p&gt;

&lt;p&gt;It does not attempt to reproduce the exact LLM token sequence.&lt;/p&gt;

&lt;p&gt;Instead, Aionis replays actions and artifacts.&lt;/p&gt;

&lt;p&gt;Typical replay steps include:&lt;br&gt;
    • shell commands&lt;br&gt;
    • tool invocations&lt;br&gt;
    • file operations&lt;br&gt;
    • environment changes&lt;/p&gt;

&lt;p&gt;This makes replay deterministic and avoids the complexity of reproducing model reasoning.&lt;/p&gt;

&lt;p&gt;The goal is not to replay thoughts.&lt;/p&gt;

&lt;p&gt;The goal is to replay execution.&lt;/p&gt;

&lt;p&gt;Replay Modes&lt;/p&gt;

&lt;p&gt;Aionis provides three replay modes depending on how strict you want execution to be.&lt;/p&gt;

&lt;p&gt;simulate&lt;/p&gt;

&lt;p&gt;Simulation mode performs validation without executing commands.&lt;/p&gt;

&lt;p&gt;This checks:&lt;br&gt;
    • environment readiness&lt;br&gt;
    • dependencies&lt;br&gt;
    • preconditions&lt;/p&gt;

&lt;p&gt;Useful for auditing workflows.&lt;/p&gt;

&lt;p&gt;strict&lt;/p&gt;

&lt;p&gt;Strict mode executes the workflow exactly as recorded.&lt;/p&gt;

&lt;p&gt;If any step fails, execution stops immediately.&lt;/p&gt;

&lt;p&gt;This is useful for deterministic automation.&lt;/p&gt;

&lt;p&gt;guided&lt;/p&gt;

&lt;p&gt;Guided mode executes workflows but allows repair suggestions if failures occur.&lt;/p&gt;

&lt;p&gt;Repair patches can be generated through:&lt;br&gt;
    • heuristics&lt;br&gt;
    • external synthesis services&lt;br&gt;
    • optional LLM assistance&lt;/p&gt;

&lt;p&gt;However, repairs are not automatically applied.&lt;/p&gt;

&lt;p&gt;Governance and Safety&lt;/p&gt;

&lt;p&gt;Automation systems require governance.&lt;/p&gt;

&lt;p&gt;Aionis follows an audit-first design philosophy.&lt;/p&gt;

&lt;p&gt;Typical repair flow:&lt;/p&gt;

&lt;p&gt;guided run&lt;br&gt;
↓&lt;br&gt;
repair suggestion&lt;br&gt;
↓&lt;br&gt;
human review&lt;br&gt;
↓&lt;br&gt;
shadow validation&lt;br&gt;
↓&lt;br&gt;
promotion&lt;/p&gt;

&lt;p&gt;By default:&lt;br&gt;
    • repairs require review&lt;br&gt;
    • validation happens in shadow mode&lt;br&gt;
    • workflows are not automatically promoted&lt;/p&gt;

&lt;p&gt;This keeps automation systems observable and controllable.&lt;/p&gt;

&lt;p&gt;Benchmark&lt;/p&gt;

&lt;p&gt;In a simple workflow benchmark, we observed the following performance:&lt;/p&gt;

&lt;p&gt;Baseline reasoning execution:&lt;/p&gt;

&lt;p&gt;≈ 2.3 seconds&lt;/p&gt;

&lt;p&gt;Replay execution:&lt;/p&gt;

&lt;p&gt;≈ 0.27 seconds&lt;/p&gt;

&lt;p&gt;Warm replay:&lt;/p&gt;

&lt;p&gt;≈ 0.11 seconds&lt;/p&gt;

&lt;p&gt;This corresponds to roughly:&lt;/p&gt;

&lt;p&gt;8× – 20× speed improvement&lt;/p&gt;

&lt;p&gt;Replay stability across 100 runs was approximately:&lt;/p&gt;

&lt;p&gt;≈98%&lt;/p&gt;

&lt;p&gt;While this is not a universal benchmark, it demonstrates how replayable execution can significantly reduce latency and token usage.&lt;/p&gt;

&lt;p&gt;Where Aionis Fits in the Agent Stack&lt;/p&gt;

&lt;p&gt;Modern agent systems increasingly resemble layered architectures.&lt;/p&gt;

&lt;p&gt;LLM&lt;br&gt;
↓&lt;br&gt;
Agent Planner&lt;br&gt;
↓&lt;br&gt;
Execution Memory&lt;br&gt;
↓&lt;br&gt;
Tools / Environment&lt;/p&gt;

&lt;p&gt;LLMs provide reasoning.&lt;/p&gt;

&lt;p&gt;Agent frameworks orchestrate tasks.&lt;/p&gt;

&lt;p&gt;Aionis provides the execution memory layer that allows agents to reuse successful workflows.&lt;/p&gt;

&lt;p&gt;OpenClaw Integration&lt;/p&gt;

&lt;p&gt;We also built an OpenClaw plugin to integrate agents with Aionis.&lt;/p&gt;

&lt;p&gt;Once installed, the agent automatically records execution traces and can replay workflows later.&lt;/p&gt;

&lt;p&gt;Install:&lt;/p&gt;

&lt;p&gt;openclaw plugins install @aionis/openclaw&lt;br&gt;
openclaw aionis-memory bootstrap&lt;br&gt;
openclaw aionis-memory selfcheck&lt;/p&gt;

&lt;p&gt;Final Thoughts&lt;/p&gt;

&lt;p&gt;Large context windows and powerful models are making agents increasingly capable.&lt;/p&gt;

&lt;p&gt;But reasoning alone does not create reliable automation.&lt;/p&gt;

&lt;p&gt;Execution memory allows agents to gradually accumulate procedural knowledge.&lt;/p&gt;

&lt;p&gt;Instead of solving the same problem repeatedly, agents can reuse workflows that have already succeeded.&lt;/p&gt;

&lt;p&gt;That’s the idea behind Aionis.&lt;/p&gt;

&lt;p&gt;If you’re building agent systems, AI copilots, MCP tools, or LangGraph pipelines, we’d love to hear your feedback.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/Cognary/Aionis" rel="noopener noreferrer"&gt;GitHub:https://github.com/Cognary/Aionis&lt;/a&gt;&lt;br&gt;
&lt;a href="https://doc.aionisos.com" rel="noopener noreferrer"&gt;Docs:https://doc.aionisos.com&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>opensource</category>
      <category>programming</category>
    </item>
  </channel>
</rss>
