<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Prateek YJ</title>
    <description>The latest articles on Forem by Prateek YJ (@prateek_yj_15603f79cafabc).</description>
    <link>https://forem.com/prateek_yj_15603f79cafabc</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3562520%2F6b0fdfdb-a0f8-4068-ae3a-fb1917bd1f5f.jpg</url>
      <title>Forem: Prateek YJ</title>
      <link>https://forem.com/prateek_yj_15603f79cafabc</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/prateek_yj_15603f79cafabc"/>
    <language>en</language>
    <item>
      <title>🚀 We Open-Sourced XAI's Macrohard: Meet Open Computer Use - Autonomous AI Agents That Actually Control Computers</title>
      <dc:creator>Prateek YJ</dc:creator>
      <pubDate>Wed, 15 Oct 2025 13:06:19 +0000</pubDate>
      <link>https://forem.com/prateek_yj_15603f79cafabc/we-open-sourced-xais-macrohard-meet-open-computer-use-autonomous-ai-agents-that-actually-ked</link>
      <guid>https://forem.com/prateek_yj_15603f79cafabc/we-open-sourced-xais-macrohard-meet-open-computer-use-autonomous-ai-agents-that-actually-ked</guid>
      <description>&lt;p&gt;Remember when XAI teased "Macrohard" - their vision of AI agents that could actually control computers? Well, the open-source community just said: &lt;strong&gt;"Hold our coffee."&lt;/strong&gt; ☕&lt;/p&gt;

&lt;h2&gt;
  
  
  Introducing Open Computer Use 🎯
&lt;/h2&gt;

&lt;p&gt;Today, we're thrilled to share &lt;strong&gt;&lt;a href="https://github.com/LLmHub-dev/open-computer-use" rel="noopener noreferrer"&gt;Open Computer Use&lt;/a&gt;&lt;/strong&gt; - a fully open-source platform that gives AI agents &lt;em&gt;real&lt;/em&gt; computer control. Not just chat. Not just suggestions. &lt;strong&gt;Actual automation.&lt;/strong&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  What Can It Do? 🚀
&lt;/h3&gt;

&lt;p&gt;✅ &lt;strong&gt;Browse the web&lt;/strong&gt; like a human - search, click, fill forms, extract data&lt;br&gt;
✅ &lt;strong&gt;Run terminal commands&lt;/strong&gt; - file management, script execution, package installation&lt;br&gt;&lt;br&gt;
✅ &lt;strong&gt;Control desktop apps&lt;/strong&gt; - full UI automation with computer vision&lt;br&gt;
✅ &lt;strong&gt;Multi-agent orchestration&lt;/strong&gt; - break down complex tasks across specialized agents&lt;br&gt;
✅ &lt;strong&gt;Real-time streaming&lt;/strong&gt; - watch your agents work with live feedback&lt;br&gt;
✅ &lt;strong&gt;100% open-source&lt;/strong&gt; - Apache 2.0 license, self-hostable, fully auditable&lt;/p&gt;

&lt;p&gt;Think Anthropic's Claude Computer Use capabilities, but &lt;strong&gt;completely open&lt;/strong&gt;, &lt;strong&gt;extensible&lt;/strong&gt;, and &lt;strong&gt;production-ready&lt;/strong&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Matters 💡
&lt;/h2&gt;

&lt;p&gt;For too long, "computer use" capabilities have been locked behind closed APIs and proprietary systems. Open Computer Use changes that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;🔓 &lt;strong&gt;Truly Open&lt;/strong&gt;: Apache 2.0 licensed - fork it, modify it, deploy it anywhere&lt;/li&gt;
&lt;li&gt;🔒 &lt;strong&gt;Safe by Design&lt;/strong&gt;: Isolated Docker VMs, sandboxed execution, no data persistence&lt;/li&gt;
&lt;li&gt;🎯 &lt;strong&gt;Production Ready&lt;/strong&gt;: Real-time streaming, multi-provider AI support (OpenAI, Anthropic, Google, xAI, Mistral, and more)&lt;/li&gt;
&lt;li&gt;🛠️ &lt;strong&gt;Developer First&lt;/strong&gt;: Built with Next.js 15, FastAPI, TypeScript - modern stack you already know&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  See It In Action 🎬
&lt;/h2&gt;

&lt;p&gt;Our agents can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Research and summarize&lt;/strong&gt; - "Find the latest AI research papers and create a summary report"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Automate workflows&lt;/strong&gt; - "Deploy this app to production and run the test suite"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data extraction&lt;/strong&gt; - "Scrape competitor pricing and build a comparison dashboard"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Complex tasks&lt;/strong&gt; - "Build a quantitative trading dashboard using QuantConnect data"&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All executed autonomously. All streaming in real-time. All running in isolated, secure environments.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Tech Stack 🏗️
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Frontend&lt;/strong&gt;: Next.js 15 (App Router), React 19, TypeScript, Tailwind CSS 4&lt;br&gt;
&lt;strong&gt;Backend&lt;/strong&gt;: FastAPI, Python 3.10+, asyncio, websockets&lt;br&gt;
&lt;strong&gt;AI Providers&lt;/strong&gt;: OpenAI, Anthropic, Google, xAI, Mistral, Perplexity, OpenRouter&lt;br&gt;
&lt;strong&gt;Infrastructure&lt;/strong&gt;: Docker, Ubuntu 22.04 + XFCE, Selenium, Playwright&lt;br&gt;
&lt;strong&gt;Database&lt;/strong&gt;: Supabase (Auth + Postgres)&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture Highlights 🎯
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;User Request → AI Planner → Specialized Agents → Isolated VM
                    ↓
            [Browser | Terminal | Desktop]
                    ↓
          Real-time Streaming Feedback
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Each session runs in an &lt;strong&gt;isolated Docker container&lt;/strong&gt; with:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Sandboxed execution environment&lt;/li&gt;
&lt;li&gt;Ephemeral containers (no data persistence)&lt;/li&gt;
&lt;li&gt;Network isolation options
&lt;/li&gt;
&lt;li&gt;Resource limits and monitoring&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Quick Start 🏃‍♂️
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Clone the repo&lt;/span&gt;
git clone https://github.com/LLmHub-dev/open-computer-use.git
&lt;span class="nb"&gt;cd &lt;/span&gt;open-computer-use

&lt;span class="c"&gt;# Set up Supabase, add your API keys&lt;/span&gt;
&lt;span class="nb"&gt;cp&lt;/span&gt; .env.example .env
&lt;span class="c"&gt;# Edit .env with your configuration&lt;/span&gt;

&lt;span class="c"&gt;# Start with Docker&lt;/span&gt;
docker-compose up &lt;span class="nt"&gt;--build&lt;/span&gt;

&lt;span class="c"&gt;# Access at http://localhost:3000&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Bring your own API keys (BYOK) for any AI provider. All keys are encrypted and stored securely.&lt;/p&gt;

&lt;h2&gt;
  
  
  Use Cases 💼
&lt;/h2&gt;

&lt;p&gt;✨ &lt;strong&gt;Research &amp;amp; Data Gathering&lt;/strong&gt; - Web scraping, competitive analysis, market research&lt;br&gt;
✨ &lt;strong&gt;Testing &amp;amp; QA&lt;/strong&gt; - Automated UI testing, E2E test generation, regression testing&lt;br&gt;&lt;br&gt;
✨ &lt;strong&gt;DevOps &amp;amp; Automation&lt;/strong&gt; - Server configuration, deployment automation, log analysis&lt;br&gt;
✨ &lt;strong&gt;E-commerce Operations&lt;/strong&gt; - Price monitoring, product research, inventory tracking&lt;br&gt;
✨ &lt;strong&gt;Content Creation&lt;/strong&gt; - Screenshot documentation, tutorial generation, demo creation&lt;/p&gt;

&lt;h2&gt;
  
  
  Join the Movement 🌟
&lt;/h2&gt;

&lt;p&gt;This is just the beginning. We're building the future of autonomous computer agents - &lt;strong&gt;openly, safely, and together&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What we need from you:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;⭐ &lt;strong&gt;Star the repo&lt;/strong&gt; - &lt;a href="https://github.com/LLmHub-dev/open-computer-use" rel="noopener noreferrer"&gt;github.com/LLmHub-dev/open-computer-use&lt;/a&gt;&lt;br&gt;
💬 &lt;strong&gt;Join our Discord&lt;/strong&gt; - Share ideas, get help, contribute&lt;br&gt;
🔧 &lt;strong&gt;Contribute&lt;/strong&gt; - PRs welcome! Check out our Contributing Guide&lt;br&gt;
📢 &lt;strong&gt;Spread the word&lt;/strong&gt; - Help us show what open-source can do&lt;/p&gt;

&lt;h2&gt;
  
  
  Roadmap Ahead 🗺️
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Q1 2026:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Multi-VM orchestration (parallel agents)&lt;/li&gt;
&lt;li&gt;Visual workflow builder&lt;/li&gt;
&lt;li&gt;Windows and macOS VM support&lt;/li&gt;
&lt;li&gt;Mobile apps (iOS/Android)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Q2 2026:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Plugin system for custom tools&lt;/li&gt;
&lt;li&gt;Collaborative agent sessions&lt;/li&gt;
&lt;li&gt;Enterprise SSO support&lt;/li&gt;
&lt;li&gt;Advanced analytics dashboard&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  The Big Picture 🌍
&lt;/h2&gt;

&lt;p&gt;"Computer use" capabilities shouldn't be locked behind proprietary APIs. They should be:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;✅ &lt;strong&gt;Open&lt;/strong&gt; - Auditable, modifiable, yours to control&lt;/li&gt;
&lt;li&gt;✅ &lt;strong&gt;Safe&lt;/strong&gt; - Isolated, sandboxed, transparent&lt;/li&gt;
&lt;li&gt;✅ &lt;strong&gt;Accessible&lt;/strong&gt; - Self-hostable, BYOK, no vendor lock-in&lt;/li&gt;
&lt;li&gt;✅ &lt;strong&gt;Collaborative&lt;/strong&gt; - Built by the community, for the community&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We're not just building an alternative to closed-source solutions. &lt;strong&gt;We're building the foundation for a new era of autonomous agents that developers can trust, extend, and deploy anywhere.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Ready to Give Your AI Agents Real Power? 💪
&lt;/h2&gt;

&lt;p&gt;⭐ &lt;strong&gt;Star us on GitHub&lt;/strong&gt;: &lt;a href="https://github.com/LLmHub-dev/open-computer-use" rel="noopener noreferrer"&gt;LLmHub-dev/open-computer-use&lt;/a&gt;&lt;br&gt;
💬 &lt;strong&gt;Join Discord&lt;/strong&gt;: &lt;a href="https://discord.gg/llmhub" rel="noopener noreferrer"&gt;discord.gg/llmhub&lt;/a&gt;&lt;br&gt;&lt;br&gt;
🐦 &lt;strong&gt;Follow on X&lt;/strong&gt;: &lt;a href="https://twitter.com/llmhub_dev" rel="noopener noreferrer"&gt;@llmhub_dev&lt;/a&gt;&lt;br&gt;
📧 &lt;strong&gt;Contact&lt;/strong&gt;: &lt;a href="mailto:prateek@llmhub.dev"&gt;prateek@llmhub.dev&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Built with ❤️ by the Open Computer Use community. Apache 2.0 licensed. 97 stars (and counting!)&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Let's show the world what open-source can do. Star the repo, join the Discord, and help us shape the future of autonomous computer agents! 🚀&lt;/strong&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>python</category>
    </item>
    <item>
      <title>We just open-sourced XAI’s Macrohard, an autonomous computer-using agent</title>
      <dc:creator>Prateek YJ</dc:creator>
      <pubDate>Mon, 13 Oct 2025 13:39:08 +0000</pubDate>
      <link>https://forem.com/prateek_yj_15603f79cafabc/we-just-open-sourced-xais-macrohard-an-autonomous-computer-using-agent-56bj</link>
      <guid>https://forem.com/prateek_yj_15603f79cafabc/we-just-open-sourced-xais-macrohard-an-autonomous-computer-using-agent-56bj</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Ever imagined an AI that could &lt;em&gt;actually use&lt;/em&gt; your computer  open apps, type, click, deploy virtual machines, and run workflows safely and autonomously?&lt;/p&gt;

&lt;p&gt;We’re open-sourcing &lt;strong&gt;Open Computer Use&lt;/strong&gt;, a fully transparent, open-stack system for autonomous computer control.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h3&gt;
  
  
  🚀 What it does
&lt;/h3&gt;

&lt;p&gt;Open Computer Use lets AI agents go beyond APIs they can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Deploy and manage &lt;strong&gt;virtual machines&lt;/strong&gt; (Docker or full VMs)&lt;/li&gt;
&lt;li&gt;Execute &lt;strong&gt;CLI commands&lt;/strong&gt; or control &lt;strong&gt;desktops and browsers&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Automate software installs, builds, and tests&lt;/li&gt;
&lt;li&gt;Stream logs, screenshots, and progress in real time&lt;/li&gt;
&lt;li&gt;Run in sandboxed, permission-based environments for safety&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Everything  frontend, backend, orchestration, sandbox, and agents  is &lt;strong&gt;open source&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;🧩 &lt;strong&gt;Repo:&lt;/strong&gt; &lt;a href="https://github.com/LLmHub-dev/open-computer-use" rel="noopener noreferrer"&gt;github.com/LLmHub-dev/open-computer-use&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  💡 Why this matters
&lt;/h3&gt;

&lt;p&gt;Most “AI agents” today stop at the text layer  they talk about what they &lt;em&gt;would&lt;/em&gt; do.&lt;br&gt;
We wanted something that can &lt;em&gt;actually do it&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;Think of it like &lt;strong&gt;XAI’s Macrohard&lt;/strong&gt;, but:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;100% open-source&lt;/li&gt;
&lt;li&gt;Self-hostable and transparent&lt;/li&gt;
&lt;li&gt;Sandbox-safe&lt;/li&gt;
&lt;li&gt;Built with a modular architecture anyone can extend&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We’re releasing it so devs, researchers, and companies can &lt;strong&gt;run, study, and improve autonomous computer agents safely&lt;/strong&gt;  without depending on closed systems.&lt;/p&gt;




&lt;h3&gt;
  
  
  ⚙️ How to try it
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/LLmHub-dev/open-computer-use.git
&lt;span class="nb"&gt;cd &lt;/span&gt;open-computer-use
docker compose up
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Then launch the web interface → create an agent session → watch it deploy a VM, run commands, and stream live feedback.&lt;br&gt;
You can even write your own plugins to extend its capabilities.&lt;/p&gt;




&lt;h3&gt;
  
  
  🧠 Under the hood
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Frontend:&lt;/strong&gt; Next.js + Tailwind&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Backend:&lt;/strong&gt; FastAPI + Python&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Orchestration:&lt;/strong&gt; Docker + sandboxed VMs&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agent core:&lt;/strong&gt; modular planners + multi-process action engine&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Safety:&lt;/strong&gt; permission gating, audit logs, container isolation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It’s built for &lt;strong&gt;scalability&lt;/strong&gt;, so you can run many agents concurrently or integrate your own LLM router.&lt;/p&gt;




&lt;h3&gt;
  
  
  🔒 Safety first
&lt;/h3&gt;

&lt;p&gt;This kind of agent is powerful  we’ve made security a first-class feature:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Runs in sandboxed environments&lt;/li&gt;
&lt;li&gt;Requires explicit permission for file/system access&lt;/li&gt;
&lt;li&gt;Full audit trail of every action&lt;/li&gt;
&lt;li&gt;No network or credential sharing unless explicitly allowed&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Transparency builds trust  that’s why every component is open.&lt;/p&gt;




&lt;h3&gt;
  
  
  🌍 What’s next
&lt;/h3&gt;

&lt;p&gt;We’re working on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Multi-VM orchestration&lt;/li&gt;
&lt;li&gt;Windows/macOS support&lt;/li&gt;
&lt;li&gt;Plugin marketplace&lt;/li&gt;
&lt;li&gt;Custom LLM routing support via LLmHUB&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you want to build or contribute, &lt;strong&gt;we’d love your help&lt;/strong&gt; check out the repo and open a PR!&lt;/p&gt;




&lt;h3&gt;
  
  
  ❤️ Join us
&lt;/h3&gt;

&lt;p&gt;This is an open project built for the developer community.&lt;br&gt;
If you find it exciting star the repo, share feedback, or build your own extensions.&lt;/p&gt;

&lt;p&gt;👉 &lt;a href="https://github.com/LLmHub-dev/open-computer-use" rel="noopener noreferrer"&gt;https://github.com/LLmHub-dev/open-computer-use&lt;/a&gt;&lt;/p&gt;

</description>
      <category>automation</category>
      <category>ai</category>
      <category>machinelearning</category>
      <category>opensource</category>
    </item>
  </channel>
</rss>
