<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Praveen Govindaraj</title>
    <description>The latest articles on Forem by Praveen Govindaraj (@praveen_govi_ai).</description>
    <link>https://forem.com/praveen_govi_ai</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3848446%2F83ea5a8b-0dee-4f86-bb8b-965a86031f2f.png</url>
      <title>Forem: Praveen Govindaraj</title>
      <link>https://forem.com/praveen_govi_ai</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/praveen_govi_ai"/>
    <language>en</language>
    <item>
      <title>Build a Multi-Agent Data Pipeline in 50 Lines of Neam</title>
      <dc:creator>Praveen Govindaraj</dc:creator>
      <pubDate>Wed, 01 Apr 2026 12:38:13 +0000</pubDate>
      <link>https://forem.com/praveen_govi_ai/build-a-multi-agent-data-pipeline-in-50-lines-of-neam-58g3</link>
      <guid>https://forem.com/praveen_govi_ai/build-a-multi-agent-data-pipeline-in-50-lines-of-neam-58g3</guid>
      <description>&lt;p&gt;In this tutorial, you'll build a working multi-agent data pipeline using Neam, an agentic AI programming language. By the end, you'll have a DIO orchestrating five agents through a churn prediction workflow.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step 1: Define Your Infrastructure Profile. This tells every agent where data lives and what compliance rules apply:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight hcl"&gt;&lt;code&gt;&lt;span class="nx"&gt;infrastructure_profile&lt;/span&gt; &lt;span class="nx"&gt;MyInfra&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="nx"&gt;data_warehouse&lt;/span&gt;&lt;span class="err"&gt;:&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
        &lt;span class="nx"&gt;platform&lt;/span&gt;&lt;span class="err"&gt;:&lt;/span&gt; &lt;span class="s2"&gt;"postgres"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="nx"&gt;connection&lt;/span&gt;&lt;span class="err"&gt;:&lt;/span&gt; &lt;span class="nx"&gt;env&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="s2"&gt;"DB_URL"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
    &lt;span class="p"&gt;},&lt;/span&gt;
    &lt;span class="nx"&gt;governance&lt;/span&gt;&lt;span class="err"&gt;:&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt; &lt;span class="nx"&gt;regulations&lt;/span&gt;&lt;span class="err"&gt;:&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"GDPR"&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="p"&gt;}&lt;/span&gt;
&lt;span class="p"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Step 2: Declare Your Agents. Each agent is a specialist. Note the budget constraints:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;budget B { cost: 50.00, tokens: 500000 }

databa agent MyBA { provider: "openai",
    model: "gpt-4o", budget: B }
datascientist agent MyDS { provider: "openai",
    model: "gpt-4o", budget: B }
datatest agent MyDT { provider: "openai",
    model: "gpt-4o", budget: B }
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Step 3: Wire Up the DIO. The orchestrator coordinates everything:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="nx"&gt;budget&lt;/span&gt; &lt;span class="nx"&gt;DioBudget&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt; &lt;span class="nl"&gt;cost&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="mf"&gt;500.00&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nx"&gt;tokens&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="mi"&gt;2000000&lt;/span&gt; &lt;span class="p"&gt;}&lt;/span&gt;

&lt;span class="nx"&gt;dio&lt;/span&gt; &lt;span class="nx"&gt;agent&lt;/span&gt; &lt;span class="nx"&gt;MyDIO&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="nl"&gt;mode&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;hybrid&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="nx"&gt;task&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;Predict customer churn, identify drivers&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="nx"&gt;infrastructure&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="nx"&gt;MyInfra&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="nx"&gt;agent_md&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;./my_domain.agent.md&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="nx"&gt;provider&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;openai&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nx"&gt;model&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;gpt-4o&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="nx"&gt;budget&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="nx"&gt;DioBudget&lt;/span&gt;
&lt;span class="p"&gt;}&lt;/span&gt;

&lt;span class="kd"&gt;let&lt;/span&gt; &lt;span class="nx"&gt;result&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;dio_solve&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;MyDIO&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nx"&gt;task&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;result&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Step 4: Create Your Agent.MD. This is the secret weapon — encode domain knowledge:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## @organization-context&lt;/span&gt;
Company: My E-Commerce Co
Scale: 500K customers, 5M orders

&lt;span class="gu"&gt;## @known-data-issues&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; signup_date timezone drift before 2024-03
&lt;span class="p"&gt;-&lt;/span&gt; Product ratings skew positive (self-reported)

&lt;span class="gu"&gt;## @agent-preferences&lt;/span&gt;
DataScientist: XGBoost for tabular, AUC-ROC metric
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Run it: neam-cli run my_pipeline.neam&lt;/p&gt;

&lt;p&gt;Welcome to Neam Ecosystem&lt;/p&gt;

&lt;p&gt;&lt;a href="https://neam.dev/" rel="noopener noreferrer"&gt;Neam Landing Page&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/neam-lang/Data-Sims" rel="noopener noreferrer"&gt;Neam DIO Data Sims&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/neam-lang/Neam" rel="noopener noreferrer"&gt;Neam Repo&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://neam-lang.github.io/The-Intelligent-Data-Organization-with-Neam/index.html" rel="noopener noreferrer"&gt;Neam DIO EBook&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://neam-lang.github.io/Neam-The-AI-Native-Programming-Language/index.html" rel="noopener noreferrer"&gt;Programming Neam&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>opensource</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Neam DIO: Orchestrate 14 AI Agents for Your Data Lifecycle</title>
      <dc:creator>Praveen Govindaraj</dc:creator>
      <pubDate>Tue, 31 Mar 2026 06:30:49 +0000</pubDate>
      <link>https://forem.com/praveen_govi_ai/neam-dio-orchestrate-14-ai-agents-for-your-data-lifecycle-42oc</link>
      <guid>https://forem.com/praveen_govi_ai/neam-dio-orchestrate-14-ai-agents-for-your-data-lifecycle-42oc</guid>
      <description>&lt;h2&gt;
  
  
  85% of ML Projects Fail.
&lt;/h2&gt;

&lt;p&gt;We Built 14 AI Agents to Fix That.&lt;/p&gt;

&lt;p&gt;How the Neam Data Intelligent Orchestrator manages the entire data lifecycle — from requirements to production — with spec-driven agent coordination.&lt;/p&gt;

&lt;p&gt;The Number That Should Trouble Every Data Leader&lt;br&gt;
Here is a statistic that should keep every VP of Data awake at night: 85% of machine learning projects never reach production. Not 85% that deliver poor results. Eighty-five percent that never ship at all.&lt;/p&gt;

&lt;p&gt;For every six ML initiatives your organization launches, five will consume budget, occupy engineers, generate excitement in steering committees — and then quietly die. The models sit in notebooks. The pipelines rot. The business case gets revisited “next quarter,” which is corporate shorthand for never.&lt;/p&gt;

&lt;p&gt;This is not a technology problem. The algorithms work. The cloud scales. The tooling has never been better. The problem is organizational. It lives in the gaps between the business analyst who writes the requirements and the data engineer who builds the pipeline. Between the data scientist who trains the model and the MLOps engineer who deploys it.&lt;/p&gt;

&lt;p&gt;These gaps have a name: handoff failures. And they are where data projects go to die.&lt;/p&gt;

&lt;p&gt;The Shrimp Tank Insight&lt;br&gt;
In mid-2019, a $70 shrimp tank in a Singapore shop made me rethink how systems should be designed. The shopkeeper explained: no water changes, no filter cleaning. The shrimp eat the vegetation, the vegetation grows back. A self-sustaining ecosystem. You buy it, you keep your hands clean. It just… lives.&lt;/p&gt;

&lt;p&gt;💡 Key Insight&lt;/p&gt;

&lt;p&gt;That question became the design philosophy behind the Neam DIO: 14 agents, each with a distinct role, each producing outputs that others consume, each making the system stronger simply by doing their job. A data ecosystem that, like that $70 shrimp tank, just… works.&lt;/p&gt;

&lt;p&gt;Introducing the Data Intelligent Orchestrator (DIO)&lt;br&gt;
The DIO is the central coordination layer of Neam’s Intelligent Data Organization. It is not a chatbot. It is not a prompt chain. It is a compiled, spec-driven orchestrator that coordinates 14 specialist AI agents across the complete data lifecycle.&lt;/p&gt;

&lt;p&gt;The Four-Layer Architecture&lt;br&gt;
LayerAgentsWhat They DoInfrastructureData Agent, ETL Agent, Migration AgentSource discovery, SQL-first warehousing, zero-downtime platform movesPlatformDataOps, Governance, Modeling, AnalystSRE for data, compliance enforcement, architecture intelligence, NL-to-SQLAnalyticalData-BA, DataScientist, Causal, DataTest, MLOpsRequirements, EDA-to-AutoML, causal reasoning, quality validation, production opsOrchestrationDIODynamic crew formation, RACI assignment, 8 auto-patterns, error recovery&lt;/p&gt;

&lt;p&gt;Each agent has a defined personality, authority boundary, and trait-based capabilities. The Data-BA Agent is “inquisitive and traceability-obsessed.” The DataTest Agent is “skeptical, adversarial, never rubber-stamps.” The Causal Agent is “correlation-is-not-causation embodied.”&lt;/p&gt;

&lt;p&gt;These are not marketing descriptions. They are system prompts compiled into bytecode.&lt;/p&gt;

&lt;p&gt;How the DIO Actually Works&lt;/p&gt;

&lt;p&gt;Step 1: Task Understanding&lt;br&gt;
When a task arrives — say, “Predict which customers will churn in 90 days and identify the causal drivers” — the DIO classifies the intent and matches it against 8 pre-defined auto-patterns.&lt;/p&gt;

&lt;p&gt;Step 2: Crew Formation&lt;/p&gt;

&lt;p&gt;Not every task needs all 14 agents. The DIO scores each agent on four dimensions:&lt;/p&gt;

&lt;p&gt;Capability match (40%) — Can this agent do the required work?&lt;br&gt;
Cost efficiency (20%) — How much budget does it consume?&lt;br&gt;
Infrastructure compatibility (20%) — Does it work with the declared platform?&lt;br&gt;
Historical performance (20%) — How well has it performed on similar tasks?&lt;br&gt;
For churn prediction, the DIO forms a crew of 7 agents and skips DataOps, Analyst, Modeling, and Migration entirely.&lt;/p&gt;

&lt;p&gt;Step 3: RACI Delegation&lt;/p&gt;

&lt;p&gt;Every sub-task gets a RACI assignment: who is Responsible (does the work), Accountable (owns the outcome — always the DIO), Consulted (provides input), and Informed (receives results).&lt;/p&gt;

&lt;p&gt;Step 4: Execute with Quality Gates&lt;/p&gt;

&lt;p&gt;The DataTest Agent — architecturally separated from all builder agents — must approve artifacts before they flow downstream. The agent that trains the model cannot be the agent that validates it. This is a trust boundary.&lt;/p&gt;

&lt;p&gt;Step 5: Error Recovery&lt;/p&gt;

&lt;p&gt;Retry → Fallback → Graceful Degradation → Human Escalation. Exhaust automated options before involving humans, but involve humans before producing incorrect results.&lt;/p&gt;

&lt;p&gt;The Trait System&lt;/p&gt;

&lt;p&gt;TraitWhat It MeansAgentsDataProducerCreates data artifactsData Agent, ETL, Migration, Data-BA, DataScientist, DeployDataConsumerReads artifacts from other agentsETL, Modeling, Analyst, DataScientist, Causal, MLOpsCausalReasonerPerforms causal inferenceCausal Agent (exclusively)QualityGatekeeperCan block downstream progressData Agent, DataOps, Governance, DataTest, MLOps&lt;/p&gt;

&lt;p&gt;The Causal Agent: The Missing Role&lt;/p&gt;

&lt;p&gt;SHAP values tell you which features were important to the model’s prediction. They do not tell you which features cause the outcome. The Causal Agent reveals that “support_ticket_resolution_time” is the actual driver — not “days_since_last_order.” One is chasing symptoms. The other identifies the lever you can actually pull.&lt;/p&gt;

&lt;p&gt;The Evidence: DataSims&lt;br&gt;
MetricTraditional TeamNeam Agent StackCost$548,000$34,700Phases completedVaries (often incomplete)7/7Model AUCVaries0.847Test coverageVaries94%ReproducibilityLow100% (50/50 runs)Cost reduction — 93.7%&lt;/p&gt;

&lt;p&gt;Welcome to Neam Ecosystem&lt;/p&gt;

&lt;p&gt;&lt;a href="https://neam.dev/" rel="noopener noreferrer"&gt;Neam Landing Page&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/neam-lang/Data-Sims" rel="noopener noreferrer"&gt;Neam DIO Data Sims&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/neam-lang/Neam" rel="noopener noreferrer"&gt;Neam Repo&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://neam-lang.github.io/The-Intelligent-Data-Organization-with-Neam/index.html" rel="noopener noreferrer"&gt;Neam DIO EBook&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://neam-lang.github.io/Neam-The-AI-Native-Programming-Language/index.html" rel="noopener noreferrer"&gt;Programming Neam&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://discord.com/invite/mjrmVqCh" rel="noopener noreferrer"&gt;Join to Discord&lt;/a&gt;&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
