<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Johnson</title>
    <description>The latest articles on Forem by Johnson (@johnsonbuilds).</description>
    <link>https://forem.com/johnsonbuilds</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3939238%2F4f29d5ce-3d03-468e-959d-e0d7ed61ac47.jpg</url>
      <title>Forem: Johnson</title>
      <link>https://forem.com/johnsonbuilds</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/johnsonbuilds"/>
    <language>en</language>
    <item>
      <title>I Gave an AI Agent a Telegram Bot and It Started Editing Videos</title>
      <dc:creator>Johnson</dc:creator>
      <pubDate>Tue, 19 May 2026 03:45:33 +0000</pubDate>
      <link>https://forem.com/johnsonbuilds/i-gave-an-ai-agent-a-telegram-bot-and-it-started-editing-videos-55gm</link>
      <guid>https://forem.com/johnsonbuilds/i-gave-an-ai-agent-a-telegram-bot-and-it-started-editing-videos-55gm</guid>
      <description>&lt;p&gt;I wanted to test something simple:&lt;/p&gt;

&lt;p&gt;Could an autonomous AI agent receive a video from Telegram, process it automatically, write its own Python code, and send the result back to me?&lt;/p&gt;

&lt;p&gt;Turns out:&lt;/p&gt;

&lt;p&gt;Yes.&lt;/p&gt;

&lt;p&gt;And surprisingly, it worked better than I expected.&lt;/p&gt;




&lt;h1&gt;
  
  
  The Setup
&lt;/h1&gt;

&lt;p&gt;I deployed an OpenClaw agent on &lt;a href="https://getclawcloud.com/" rel="noopener noreferrer"&gt;GetClawCloud&lt;/a&gt; and connected it to a Telegram bot.&lt;/p&gt;

&lt;p&gt;The task sounded straightforward:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;I send a video to Telegram&lt;/li&gt;
&lt;li&gt;The AI agent receives the file&lt;/li&gt;
&lt;li&gt;It extracts the last frame from the video&lt;/li&gt;
&lt;li&gt;Sends the image back to me automatically&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;But what made this interesting was:&lt;/p&gt;

&lt;p&gt;I didn’t manually write the processing script.&lt;/p&gt;

&lt;p&gt;The agent generated it by itself.&lt;/p&gt;




&lt;h1&gt;
  
  
  What the Agent Actually Did
&lt;/h1&gt;

&lt;p&gt;After receiving the video, the agent:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;analyzed the task&lt;/li&gt;
&lt;li&gt;decided it needed Python video processing&lt;/li&gt;
&lt;li&gt;generated a script&lt;/li&gt;
&lt;li&gt;installed dependencies&lt;/li&gt;
&lt;li&gt;extracted the final frame&lt;/li&gt;
&lt;li&gt;saved the image&lt;/li&gt;
&lt;li&gt;sent the image back through Telegram&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The entire workflow was autonomous.&lt;/p&gt;

&lt;p&gt;No manual scripting.&lt;br&gt;
No SSH session.&lt;br&gt;
No intervention.&lt;/p&gt;

&lt;p&gt;Just a Telegram message triggering an AI workflow.&lt;/p&gt;




&lt;h1&gt;
  
  
  The Surprising Part
&lt;/h1&gt;

&lt;p&gt;The most interesting thing wasn’t the frame extraction itself.&lt;/p&gt;

&lt;p&gt;It was that the agent could reliably operate across multiple steps:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;receive external input&lt;/li&gt;
&lt;li&gt;reason about the task&lt;/li&gt;
&lt;li&gt;generate code&lt;/li&gt;
&lt;li&gt;execute code&lt;/li&gt;
&lt;li&gt;manage files&lt;/li&gt;
&lt;li&gt;return results&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is where autonomous AI agents start feeling less like chatbots and more like runtime workers.&lt;/p&gt;




&lt;h1&gt;
  
  
  Then I Tried Something More Advanced
&lt;/h1&gt;

&lt;p&gt;Next, I gave the agent a Wavespeed.ai API key and a simple instruction:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Generate a cinematic video of a spaceship landing in the desert.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The agent:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;searched the API documentation itself&lt;/li&gt;
&lt;li&gt;figured out the request format&lt;/li&gt;
&lt;li&gt;called the API&lt;/li&gt;
&lt;li&gt;waited for generation&lt;/li&gt;
&lt;li&gt;downloaded the final video&lt;/li&gt;
&lt;li&gt;sent the result back to Telegram&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That was the moment it started feeling genuinely autonomous.&lt;/p&gt;

&lt;p&gt;Not just “AI chat”.&lt;/p&gt;

&lt;p&gt;An actual AI worker.&lt;/p&gt;




&lt;h1&gt;
  
  
  Why Hosting Matters More Than People Think
&lt;/h1&gt;

&lt;p&gt;A lot of AI agent demos look impressive in short clips.&lt;/p&gt;

&lt;p&gt;But running agents continuously is a completely different problem.&lt;/p&gt;

&lt;p&gt;Long-running autonomous workflows require:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;persistent storage&lt;/li&gt;
&lt;li&gt;stable execution&lt;/li&gt;
&lt;li&gt;background processing&lt;/li&gt;
&lt;li&gt;reliable networking&lt;/li&gt;
&lt;li&gt;restart handling&lt;/li&gt;
&lt;li&gt;runtime isolation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That infrastructure layer is usually where things break.&lt;/p&gt;

&lt;p&gt;Especially when agents start:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;writing files&lt;/li&gt;
&lt;li&gt;generating code&lt;/li&gt;
&lt;li&gt;calling APIs&lt;/li&gt;
&lt;li&gt;handling async tasks&lt;/li&gt;
&lt;/ul&gt;




&lt;h1&gt;
  
  
  Why I Built GetClawCloud
&lt;/h1&gt;

&lt;p&gt;I mainly built &lt;a href="https://getclawcloud.com/" rel="noopener noreferrer"&gt;GetClawCloud&lt;/a&gt; because I wanted a simpler way to run OpenClaw agents reliably without constantly managing VPS infrastructure.&lt;/p&gt;

&lt;p&gt;For workflows like this, it handles:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;persistent runtime&lt;/li&gt;
&lt;li&gt;always-on execution&lt;/li&gt;
&lt;li&gt;file storage&lt;/li&gt;
&lt;li&gt;autonomous task execution&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;without me needing to manually babysit servers.&lt;/p&gt;

&lt;p&gt;I also started publishing reusable OpenClaw workflow ideas and prompt templates here:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://getclawcloud.com/blog/" rel="noopener noreferrer"&gt;https://getclawcloud.com/blog/&lt;/a&gt;&lt;/p&gt;




&lt;h1&gt;
  
  
  Final Thoughts
&lt;/h1&gt;

&lt;p&gt;The interesting part of AI agents is no longer conversation.&lt;/p&gt;

&lt;p&gt;It’s execution.&lt;/p&gt;

&lt;p&gt;Once agents can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;interact with APIs&lt;/li&gt;
&lt;li&gt;generate code&lt;/li&gt;
&lt;li&gt;process media&lt;/li&gt;
&lt;li&gt;manage files&lt;/li&gt;
&lt;li&gt;communicate externally&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;they start behaving more like autonomous software workers.&lt;/p&gt;

&lt;p&gt;This Telegram experiment was one of the first times that actually felt real to me.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>openclaw</category>
      <category>agents</category>
    </item>
  </channel>
</rss>
