<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: 大阪のUber配達員</title>
    <description>The latest articles on Forem by 大阪のUber配達員 (@uber_95d81a06792ac5).</description>
    <link>https://forem.com/uber_95d81a06792ac5</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3501162%2F6fd14760-f853-4e02-9ca2-29ae03861336.png</url>
      <title>Forem: 大阪のUber配達員</title>
      <link>https://forem.com/uber_95d81a06792ac5</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/uber_95d81a06792ac5"/>
    <language>en</language>
    <item>
      <title>Sora 2 Watermark Remover Free</title>
      <dc:creator>大阪のUber配達員</dc:creator>
      <pubDate>Fri, 03 Oct 2025 09:31:58 +0000</pubDate>
      <link>https://forem.com/uber_95d81a06792ac5/sora-2-watermark-remover-free-58gm</link>
      <guid>https://forem.com/uber_95d81a06792ac5/sora-2-watermark-remover-free-58gm</guid>
      <description>&lt;p&gt;Source code&lt;br&gt;
&lt;a href="https://github.com/GitHub30/Sora2WatermarkRemover" rel="noopener noreferrer"&gt;https://github.com/GitHub30/Sora2WatermarkRemover&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Let's try&lt;br&gt;
&lt;a href="https://colab.research.google.com/drive/1Iqu4RZ9WAhcbO1Jn0wCkMOsw2l1p6z62?usp=sharing" rel="noopener noreferrer"&gt;https://colab.research.google.com/drive/1Iqu4RZ9WAhcbO1Jn0wCkMOsw2l1p6z62?usp=sharing&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  what is Watermark
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdl372slv3qfshjugtco1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdl372slv3qfshjugtco1.png" alt=" " width="787" height="306"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Before you try any “free watermark remover,” read this. This post explains &lt;strong&gt;why visible watermarks exist on Sora 2 videos&lt;/strong&gt;, the &lt;strong&gt;legal and ethical risks of removing them&lt;/strong&gt;, and &lt;strong&gt;legit ways to get clean exports&lt;/strong&gt; without breaking rules.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  TL;DR
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Sora 2 adds both a visible watermark and C2PA Content Credentials&lt;/strong&gt; (provenance metadata). These are there to help viewers distinguish AI media. (&lt;a href="https://openai.com/index/launching-sora-responsibly/" rel="noopener noreferrer"&gt;OpenAI&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Removing or helping others remove watermarks/CMI can be illegal in many places&lt;/strong&gt; (e.g., US DMCA §1201 and rules against stripping copyright management information). None of this is legal advice—when in doubt, talk to a lawyer. (&lt;a href="https://www.law.cornell.edu/uscode/text/17/1201?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;法律情報研究所&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Legit path to clean exports:&lt;/strong&gt; OpenAI’s &lt;strong&gt;ChatGPT Pro&lt;/strong&gt; tier explicitly includes the ability to &lt;strong&gt;download videos without a watermark&lt;/strong&gt;. If you need unwatermarked Sora footage for permitted use, upgrade rather than hacking around it. (&lt;a href="https://help.openai.com/en/articles/10245774-sora-billing-credits-faq" rel="noopener noreferrer"&gt;OpenAI Help Center&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;You’ll see many “3-minute remover” posts and tools online; this article &lt;strong&gt;does not endorse them&lt;/strong&gt;. It explains risks and safer alternatives. (&lt;a href="https://qiita.com/7mpy/items/9bf1d9bf90e583f8611d?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Qiita&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What the watermark (and C2PA) are for
&lt;/h2&gt;

&lt;p&gt;Sora 2 ships with &lt;strong&gt;visible watermarks&lt;/strong&gt; and embeds &lt;strong&gt;C2PA “Content Credentials.”&lt;/strong&gt; C2PA is an industry standard for attaching a tamper-evident provenance “nutrition label” that can show where media came from and how it was edited. These signals help platforms, journalists, and everyday viewers evaluate AI-generated video. (&lt;a href="https://openai.com/index/launching-sora-responsibly/" rel="noopener noreferrer"&gt;OpenAI&lt;/a&gt;)&lt;/p&gt;

&lt;h2&gt;
  
  
  Is removing the watermark legal?
&lt;/h2&gt;

&lt;p&gt;Short answer: &lt;strong&gt;often no.&lt;/strong&gt;&lt;br&gt;
In the US, &lt;strong&gt;DMCA §1201&lt;/strong&gt; prohibits circumventing technological protection measures. Related rules also prohibit removing or altering &lt;strong&gt;copyright management information (CMI)&lt;/strong&gt;—which can include watermarks and provenance metadata. Other jurisdictions have similar restrictions. Again, this is &lt;strong&gt;general information, not legal advice.&lt;/strong&gt; (&lt;a href="https://www.law.cornell.edu/uscode/text/17/1201?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;法律情報研究所&lt;/a&gt;)&lt;/p&gt;

&lt;h2&gt;
  
  
  The responsible way to get clean Sora 2 video
&lt;/h2&gt;

&lt;p&gt;If your use case legitimately requires a clean export (client delivery, broadcast graphics, etc.), the &lt;strong&gt;official route is to use a plan that allows watermark-free downloads.&lt;/strong&gt; OpenAI’s &lt;strong&gt;Sora/ChatGPT Pro&lt;/strong&gt; includes &lt;strong&gt;“Download videos without a watermark.”&lt;/strong&gt; That’s the safe, policy-compliant solution. (&lt;a href="https://help.openai.com/en/articles/10245774-sora-billing-credits-faq" rel="noopener noreferrer"&gt;OpenAI Help Center&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;Also note: &lt;strong&gt;even when a visible watermark is off, C2PA credentials may remain&lt;/strong&gt; to help preserve provenance. Many platforms are beginning to surface these labels. Don’t use tools or transcoders that strip Content Credentials if you want to keep trust signals intact. (&lt;a href="https://openai.com/index/launching-sora-responsibly/" rel="noopener noreferrer"&gt;OpenAI&lt;/a&gt;)&lt;/p&gt;

&lt;h2&gt;
  
  
  “But there are free watermark removers…”
&lt;/h2&gt;

&lt;p&gt;Yes—blogs, videos, and posts claim they can erase Sora logos quickly. That doesn’t make it wise (or lawful). If you encounter those tutorials, understand you may be:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Violating terms of use and possibly the law&lt;/strong&gt; (anti-circumvention / CMI removal).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Misrepresenting provenance&lt;/strong&gt; by breaking C2PA chains.
An example of such “how-to” material exists online; this post &lt;strong&gt;won’t replicate or endorse it.&lt;/strong&gt; (&lt;a href="https://qiita.com/7mpy/items/9bf1d9bf90e583f8611d?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Qiita&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  If you’re the rights holder (or you have explicit permission)
&lt;/h2&gt;

&lt;p&gt;When you own the content or have documented permission to deliver clean footage:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Use Sora/ChatGPT Pro’s watermark-free download&lt;/strong&gt; option. (&lt;a href="https://help.openai.com/en/articles/10245774-sora-billing-credits-faq" rel="noopener noreferrer"&gt;OpenAI Help Center&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Keep Content Credentials intact&lt;/strong&gt; for trust and audit trails; confirm your editing pipeline preserves C2PA metadata. (&lt;a href="https://c2pa.org/specifications/specifications/2.2/explainer/Explainer.html?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;C2PA&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Document rights &amp;amp; disclosures&lt;/strong&gt; (e.g., “AI-generated” disclaimers) to meet platform or client requirements. Many platforms are moving toward clearer labeling. (&lt;a href="https://www.theverge.com/2024/8/21/24223932/c2pa-standard-verify-ai-generated-images-content-credentials?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;The Verge&lt;/a&gt;)&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Ethics in practice
&lt;/h2&gt;

&lt;p&gt;Watermarks and Content Credentials aren’t just roadblocks; they’re &lt;strong&gt;transparency tools&lt;/strong&gt;. If your creative or commercial goal conflicts with them, the fix isn’t evasion—it’s to &lt;strong&gt;use proper licensing, the correct product tier, and honest labeling&lt;/strong&gt; that respects audiences and platforms. (&lt;a href="https://openai.com/index/launching-sora-responsibly/" rel="noopener noreferrer"&gt;OpenAI&lt;/a&gt;)&lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Q: Is there any truly “free” way to get Sora output without a watermark?&lt;/strong&gt;&lt;br&gt;
A: If you’re seeing a watermark under your current access level, the correct path is &lt;strong&gt;upgrading to a plan that allows clean downloads&lt;/strong&gt;—not stripping it with third-party tools. (&lt;a href="https://help.openai.com/en/articles/10245774-sora-billing-credits-faq" rel="noopener noreferrer"&gt;OpenAI Help Center&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Q: Do all Sora outputs have provenance metadata?&lt;/strong&gt;&lt;br&gt;
A: OpenAI states that videos include &lt;strong&gt;visible and invisible provenance signals&lt;/strong&gt; and &lt;strong&gt;embed C2PA metadata&lt;/strong&gt;. Some tiers/features may affect visibility of the watermark, but provenance aims to persist. (&lt;a href="https://openai.com/index/launching-sora-responsibly/" rel="noopener noreferrer"&gt;OpenAI&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Q: What happens if I transcode or edit a Sora video?&lt;/strong&gt;&lt;br&gt;
A: Depending on your software, you might inadvertently &lt;strong&gt;strip Content Credentials&lt;/strong&gt;. Use tools that preserve C2PA if you need verifiable provenance downstream. (&lt;a href="https://c2pa.org/specifications/specifications/2.2/explainer/Explainer.html?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;C2PA&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  Resources &amp;amp; further reading
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;OpenAI – Launching Sora responsibly&lt;/strong&gt; (watermarks + C2PA). (&lt;a href="https://openai.com/index/launching-sora-responsibly/" rel="noopener noreferrer"&gt;OpenAI&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;OpenAI Help Center – Sora Billing FAQ&lt;/strong&gt; (Pro plan includes &lt;strong&gt;download without watermark&lt;/strong&gt;). (&lt;a href="https://help.openai.com/en/articles/10245774-sora-billing-credits-faq" rel="noopener noreferrer"&gt;OpenAI Help Center&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;C2PA Explainer – Content Credentials&lt;/strong&gt; (how provenance works). (&lt;a href="https://c2pa.org/specifications/specifications/2.2/explainer/Explainer.html?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;C2PA&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;US DMCA §1201&lt;/strong&gt; (anti-circumvention) + guidance on removing CMI. (&lt;a href="https://www.law.cornell.edu/uscode/text/17/1201?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;法律情報研究所&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Example of online “removal” tutorials (not endorsed). (&lt;a href="https://qiita.com/7mpy/items/9bf1d9bf90e583f8611d?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Qiita&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;




&lt;h3&gt;
  
  
  A practical checklist (for creators &amp;amp; teams)
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;□ Use the &lt;strong&gt;official watermark-free export&lt;/strong&gt; when permitted by your plan. (&lt;a href="https://help.openai.com/en/articles/10245774-sora-billing-credits-faq" rel="noopener noreferrer"&gt;OpenAI Help Center&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;□ &lt;strong&gt;Preserve C2PA&lt;/strong&gt; in your edit pipeline; avoid tools that strip Content Credentials. (&lt;a href="https://c2pa.org/specifications/specifications/2.2/explainer/Explainer.html?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;C2PA&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;□ Maintain &lt;strong&gt;disclosures&lt;/strong&gt; where required by clients/platforms. (&lt;a href="https://www.theverge.com/2024/8/21/24223932/c2pa-standard-verify-ai-generated-images-content-credentials?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;The Verge&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;□ Avoid tutorials that promote &lt;strong&gt;circumvention&lt;/strong&gt; or &lt;strong&gt;CMI removal&lt;/strong&gt;. (&lt;a href="https://www.law.cornell.edu/uscode/text/17/1201?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;法律情報研究所&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;em&gt;Author’s note:&lt;/em&gt; This article focuses on &lt;strong&gt;responsible, policy-compliant&lt;/strong&gt; workflows. It does &lt;strong&gt;not&lt;/strong&gt; provide instructions to remove watermarks or bypass provenance. If you need clean Sora 2 footage, &lt;strong&gt;use the official, supported route.&lt;/strong&gt; (&lt;a href="https://help.openai.com/en/articles/10245774-sora-billing-credits-faq" rel="noopener noreferrer"&gt;OpenAI Help Center&lt;/a&gt;)&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.reuters.com/business/media-telecom/openai-launches-new-ai-video-app-spun-copyrighted-content-2025-09-30/?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Reuters&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.theverge.com/ai-artificial-intelligence/789126/openai-made-a-tiktok-for-deepfakes-and-its-getting-hard-to-tell-whats-real?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;The Verge&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>sora</category>
      <category>openai</category>
      <category>chatgpt</category>
    </item>
    <item>
      <title>Meet “Awesome Gemini Robotics” — Copy-Paste Prompts &amp; Demos for Gemini Robotics-ER 1.5</title>
      <dc:creator>大阪のUber配達員</dc:creator>
      <pubDate>Fri, 26 Sep 2025 16:03:53 +0000</pubDate>
      <link>https://forem.com/uber_95d81a06792ac5/meet-awesome-gemini-robotics-copy-paste-prompts-demos-for-gemini-robotics-er-15-2bko</link>
      <guid>https://forem.com/uber_95d81a06792ac5/meet-awesome-gemini-robotics-copy-paste-prompts-demos-for-gemini-robotics-er-15-2bko</guid>
      <description>&lt;p&gt;&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;https://github.com/GitHub30/Awesome-Gemini-Robotics&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  TL;DR
&lt;/h2&gt;

&lt;p&gt;If you’re exploring &lt;strong&gt;Gemini Robotics-ER 1.5&lt;/strong&gt; (Google/DeepMind’s embodied-reasoning model for robots), this community repo collects &lt;strong&gt;ready-to-run prompts&lt;/strong&gt;, &lt;strong&gt;reference images&lt;/strong&gt; and &lt;strong&gt;usage patterns&lt;/strong&gt; you can drop straight into your own projects. It tracks what ER 1.5 is good at right now—&lt;strong&gt;pointing/boxes/trajectories, long-horizon planning, temporal reasoning on video, tool-use (e.g., Search)&lt;/strong&gt;—and shows practical prompt shapes for each. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  Why this repo matters (and why now)
&lt;/h2&gt;

&lt;p&gt;On &lt;strong&gt;Sept. 25, 2025&lt;/strong&gt;, Google announced that &lt;strong&gt;Gemini Robotics-ER 1.5&lt;/strong&gt; is available to developers via the Gemini API in Google AI Studio. ER 1.5 acts as a &lt;strong&gt;“high-level brain”&lt;/strong&gt; for robots: it plans multi-step tasks, performs spatial/temporal reasoning, and can &lt;strong&gt;natively call tools&lt;/strong&gt; (like Google Search) to ground its plans in the real world. ([Google Developers Blog][2])&lt;/p&gt;

&lt;p&gt;The official docs summarize what you can build today: unified object finding/labeling (points &amp;amp; boxes), trajectory planning, video understanding, and orchestration of long-horizon tasks—with the model currently in &lt;strong&gt;preview&lt;/strong&gt;. (&lt;a href="https://ai.google.dev/gemini-api/docs/robotics-overview" rel="noopener noreferrer"&gt;Google AI for Developers&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Awesome Gemini Robotics&lt;/strong&gt; rides that momentum by curating concrete, reproducible prompts and screenshots so you don’t have to start from a blank page. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  What’s inside
&lt;/h2&gt;

&lt;p&gt;The repo is a &lt;strong&gt;gallery of 10+ use cases&lt;/strong&gt; (and growing), each with an image and a copy-friendly prompt. Examples include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Pointing to undefined/defined objects&lt;/strong&gt; (single or multiple categories)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Bounding boxes&lt;/strong&gt; and &lt;strong&gt;part localization&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Counting by pointing&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Trajectory generation&lt;/strong&gt; (e.g., moving a pen to an organizer, brushing particles)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Obstacle-aware paths&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Video analysis&lt;/strong&gt; (step-by-step breakdowns and frame tracking)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Orchestrating tasks&lt;/strong&gt; (e.g., “make room for a laptop,” “pack a lunch”)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each card shows the expected output structure (normalized coordinates; JSON-friendly answers) to plug into your controller or VLA. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;It also includes &lt;strong&gt;tips &amp;amp; patterns&lt;/strong&gt;—like normalizing point/box coordinates to a 0–1000 range, tuning the &lt;strong&gt;thinking budget&lt;/strong&gt; for latency vs. accuracy, and interleaving text with spatial outputs to create “&lt;strong&gt;spatially grounded&lt;/strong&gt;” plans. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;Bonus: the README is available in &lt;strong&gt;multiple languages&lt;/strong&gt; (JA/KR/VI/ZH/EN), making it easier to share across global teams. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  How it aligns with the official guidance
&lt;/h2&gt;

&lt;p&gt;The &lt;strong&gt;AI for Developers&lt;/strong&gt; robotics overview shows the same pillars this repo foregrounds—&lt;strong&gt;points, boxes, trajectories, video, and orchestration&lt;/strong&gt;—and provides minimal client snippets for ER 1.5 (currently &lt;strong&gt;&lt;code&gt;gemini-robotics-er-1.5-preview&lt;/code&gt;&lt;/strong&gt;). If you’re new, start there to wire up your SDK, then use the repo’s prompts as drop-in tests. (&lt;a href="https://ai.google.dev/gemini-api/docs/robotics-overview" rel="noopener noreferrer"&gt;Google AI for Developers&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;The &lt;strong&gt;launch blog&lt;/strong&gt; reinforces the model’s role as an &lt;strong&gt;agentic planner&lt;/strong&gt; that can call tools (e.g., Search) and sequence your robot APIs—exactly the workflows many cards in the repo illustrate. ([Google Developers Blog][2])&lt;/p&gt;




&lt;h2&gt;
  
  
  Quick look: a reusable prompt pattern
&lt;/h2&gt;

&lt;p&gt;A recurring pattern in the repo (and docs) is to &lt;strong&gt;ask for structured spatial outputs&lt;/strong&gt; you can feed into your planner:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Point to no more than N items in the image.
Return [{"point":[y,x],"label":"&amp;lt;name&amp;gt;"}] with y/x normalized to 0–1000.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This shape keeps responses machine-friendly and model-agnostic, and it scales to parts, boxes, or trajectories with minimal edits. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  Who should use this
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Robotics researchers &amp;amp; builders&lt;/strong&gt; prototyping perception-to-action loops and benchmarking prompt formats&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Product teams&lt;/strong&gt; validating UX for natural-language robot tasks (clean-up, organizing, sorting, etc.)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Educators &amp;amp; hobbyists&lt;/strong&gt; looking for clear examples of &lt;strong&gt;agentic&lt;/strong&gt; multimodal reasoning grounded in spatial outputs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For deeper context on the two-model system (ER 1.5 as the planner + the action-oriented VLA model), see DeepMind’s overview of &lt;strong&gt;Gemini Robotics 1.5&lt;/strong&gt; and &lt;strong&gt;ER 1.5&lt;/strong&gt;. (&lt;a href="https://deepmind.google/discover/blog/gemini-robotics-15-brings-ai-agents-into-the-physical-world/?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Google DeepMind&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  Getting started
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Skim the official docs&lt;/strong&gt; to confirm the current model name, preview status, and input limits. (&lt;a href="https://ai.google.dev/gemini-api/docs/robotics-overview" rel="noopener noreferrer"&gt;Google AI for Developers&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Run a minimal image-prompt call&lt;/strong&gt; in your environment (Python or REST). (&lt;a href="https://ai.google.dev/gemini-api/docs/robotics-overview" rel="noopener noreferrer"&gt;Google AI for Developers&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Pick a use case&lt;/strong&gt; from the repo (e.g., “2D bounding boxes” or “Simple trajectory planning”) and paste the prompt as-is. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Wire responses&lt;/strong&gt; (points/boxes/paths) into your controller or VLA for execution; increase the &lt;strong&gt;thinking budget&lt;/strong&gt; for harder tasks like counting or affordance reasoning. (&lt;a href="https://ai.google.dev/gemini-api/docs/robotics-overview" rel="noopener noreferrer"&gt;Google AI for Developers&lt;/a&gt;)&lt;/li&gt;
&lt;/ol&gt;




&lt;h2&gt;
  
  
  Contributing
&lt;/h2&gt;

&lt;p&gt;The repo welcomes PRs. Add a new folder under &lt;code&gt;cases/&amp;lt;short-name&amp;gt;/&lt;/code&gt; with a short README, your prompt, and an image (or link). Keep prompts &lt;strong&gt;copy-runnable&lt;/strong&gt;, &lt;strong&gt;JSON-friendly&lt;/strong&gt;, and &lt;strong&gt;cite primary sources&lt;/strong&gt; (docs/blogs/videos). (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  License &amp;amp; image attribution
&lt;/h2&gt;

&lt;p&gt;Text is MIT-licensed. Demo images marked in the repo reference Google’s public docs/blog; check source licenses before redistributing and replace placeholders with your own captures when in doubt. (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;




&lt;h2&gt;
  
  
  Further reading
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Launch/How-to&lt;/strong&gt;: Building the Next Generation of Physical Agents with Gemini Robotics-ER 1.5 (availability, capabilities, thinking budget, safety). ([Google Developers Blog][2])&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Docs&lt;/strong&gt;: Gemini Robotics-ER 1.5 overview (object finding, trajectories, video, orchestration; example code). (&lt;a href="https://ai.google.dev/gemini-api/docs/robotics-overview" rel="noopener noreferrer"&gt;Google AI for Developers&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model pages&lt;/strong&gt;: DeepMind’s Gemini Robotics 1.5 and ER 1.5 (system roles, roadmaps, trusted-tester context). (&lt;a href="https://deepmind.google/models/gemini-robotics/gemini-robotics/?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Google DeepMind&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you build something cool with these prompts—&lt;strong&gt;open a PR&lt;/strong&gt; and share it with the community! (&lt;a href="https://github.com/GitHub30/Awesome-Gemini-Robotics" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;[2]: &lt;a href="https://developers.googleblog.com/en/building-the-next-generation-of-physical-agents-with-gemini-robotics-er-15/" rel="noopener noreferrer"&gt;https://developers.googleblog.com/en/building-the-next-generation-of-physical-agents-with-gemini-robotics-er-15/&lt;/a&gt; "&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;        Building the Next Generation of Physical Agents with Gemini Robotics-ER 1.5


        \- Google Developers Blog

    "
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

</description>
      <category>ai</category>
      <category>robotics</category>
      <category>gemini</category>
      <category>deepmind</category>
    </item>
    <item>
      <title>Handle WebDAV as JSON: A One-File Self-Hosted API WebDAVJSON (PHP/Node.js)</title>
      <dc:creator>大阪のUber配達員</dc:creator>
      <pubDate>Sun, 14 Sep 2025 10:02:22 +0000</pubDate>
      <link>https://forem.com/uber_95d81a06792ac5/handle-webdav-as-json-a-one-file-self-hosted-api-webdavjson-phpnodejs-19do</link>
      <guid>https://forem.com/uber_95d81a06792ac5/handle-webdav-as-json-a-one-file-self-hosted-api-webdavjson-phpnodejs-19do</guid>
      <description>&lt;p&gt;&lt;a href="https://github.com/GitHub30/WebDAVJSON" rel="noopener noreferrer"&gt;https://github.com/GitHub30/WebDAVJSON&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What is it?
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;WebDAVJSON&lt;/strong&gt; lets you drop a &lt;strong&gt;single PHP or Node.js file&lt;/strong&gt; on your server and immediately get a &lt;strong&gt;JSON API for file operations&lt;/strong&gt; (list, upload, download, delete). It supports &lt;strong&gt;CORS&lt;/strong&gt;, &lt;strong&gt;custom API key (Bearer) auth&lt;/strong&gt;, and an &lt;strong&gt;extension allow-list&lt;/strong&gt;, making it easy to call from front-ends or automation scripts.&lt;/p&gt;




&lt;h2&gt;
  
  
  Key features
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;CORS&lt;/strong&gt; support&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;API key (Bearer) authentication&lt;/strong&gt; (optional)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;File listing in JSON&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Upload (multipart/PUT), download, delete&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Extension allow-list&lt;/strong&gt; for basic safety&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Single-file&lt;/strong&gt; PHP/Node implementation&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Endpoints (HTTP methods)
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;GET /&lt;/code&gt; — List files (JSON)&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;GET /?filename=abc.txt&lt;/code&gt; — Download&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;POST/PUT /&lt;/code&gt; — Upload (multipart or PUT)&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;POST/PUT /?filename=abc.txt&lt;/code&gt; — Binary upload to a specific name&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;DELETE /?filename=abc.txt&lt;/code&gt; — Delete&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Quick setup
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Node.js (Windows example)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;winget &lt;span class="nb"&gt;install &lt;/span&gt;FiloSottile.mkcert Node.js &lt;span class="nt"&gt;--silent&lt;/span&gt;
mkcert &lt;span class="nt"&gt;-install&lt;/span&gt;
mkcert localhost
node index.mjs
&lt;span class="c"&gt;# Verify:&lt;/span&gt;
&lt;span class="c"&gt;# fetch('https://localhost:8443/').then(r=&amp;gt;r.json())&lt;/span&gt;
&lt;span class="c"&gt;# fetch('https://localhost:8443/?filename=foo.txt',{method:'PUT',body:'foobar'})&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  PHP (Linux example)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Get the single file&lt;/span&gt;
wget https://raw.githubusercontent.com/GitHub30/WebDAVJSON/refs/heads/main/index.php
&lt;span class="c"&gt;# Start with PHP’s built-in server (choose any port)&lt;/span&gt;
php &lt;span class="nt"&gt;-S&lt;/span&gt; 0.0.0.0:8000
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  Usage (cURL recipes)
&lt;/h2&gt;

&lt;h3&gt;
  
  
  List files (JSON)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl http://localhost:8000/
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Partial match search (&lt;code&gt;q&lt;/code&gt;)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="s2"&gt;"http://localhost:8000/?q=report"&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Download
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-O&lt;/span&gt; &lt;span class="s2"&gt;"http://localhost:8000/?filename=abc.txt"&lt;/span&gt;
&lt;span class="c"&gt;# Save as attachment with original name&lt;/span&gt;
curl &lt;span class="nt"&gt;-OJ&lt;/span&gt; &lt;span class="s2"&gt;"http://localhost:8000/?download&amp;amp;filename=abc.txt"&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Upload (multipart/form-data)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-F&lt;/span&gt; &lt;span class="s2"&gt;"file=@abc.txt"&lt;/span&gt; http://localhost:8000/
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Upload (PUT, binary)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-X&lt;/span&gt; PUT &lt;span class="nt"&gt;--data-binary&lt;/span&gt; @abc.txt &lt;span class="s2"&gt;"http://localhost:8000/?filename=abc.txt"&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Delete
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-X&lt;/span&gt; DELETE &lt;span class="s2"&gt;"http://localhost:8000/?filename=abc.txt"&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  Authentication (optional)
&lt;/h2&gt;

&lt;p&gt;If you set &lt;code&gt;$api_key&lt;/code&gt; (PHP) or the equivalent in the Node file, the API will require the header &lt;strong&gt;&lt;code&gt;Authorization: Bearer &amp;lt;API_KEY&amp;gt;&lt;/code&gt;&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-H&lt;/span&gt; &lt;span class="s2"&gt;"Authorization: Bearer your_api_key"&lt;/span&gt; http://localhost:8000/
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  Extension allow-list
&lt;/h2&gt;

&lt;p&gt;Example of commonly allowed extensions:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;txt, jpg, png, webp, heic, gif, pdf, docx, xlsx, zip, mp4, gz
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Tightening this list is recommended when exposing the API.&lt;/p&gt;




&lt;h2&gt;
  
  
  When it’s handy
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;You want a &lt;strong&gt;minimal self-hosted storage API&lt;/strong&gt; callable from a front-end&lt;/li&gt;
&lt;li&gt;You need a &lt;strong&gt;quick drop-zone for CI/CD artifacts&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;You prefer a &lt;strong&gt;no/low-code backend&lt;/strong&gt; you can spin up in seconds&lt;/li&gt;
&lt;li&gt;You’re standing up a &lt;strong&gt;temporary sharing server&lt;/strong&gt; with basic guards (CORS + key)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Notes &amp;amp; best practices
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Serve over &lt;strong&gt;HTTPS&lt;/strong&gt; and enable an &lt;strong&gt;API key&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Keep the &lt;strong&gt;exposure surface minimal&lt;/strong&gt; and restrict the &lt;strong&gt;allowed extensions&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Set correct permissions on the &lt;strong&gt;writable directory&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;If usage grows, consider &lt;strong&gt;rate limiting&lt;/strong&gt; and &lt;strong&gt;audit logging&lt;/strong&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  License
&lt;/h2&gt;

&lt;p&gt;MIT License.&lt;/p&gt;




&lt;h2&gt;
  
  
  Wrap-up
&lt;/h2&gt;

&lt;p&gt;WebDAVJSON is a &lt;strong&gt;“smallest-viable API”&lt;/strong&gt; you can deploy in minutes. Despite its tiny footprint, it covers practical needs like CORS, API keys, and extension filtering. When you need a lightweight, self-hosted file API, this is a great first choice.&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>php</category>
      <category>api</category>
      <category>javascript</category>
    </item>
  </channel>
</rss>
