<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: DataHive AI</title>
    <description>The latest articles on Forem by DataHive AI (@maksim_n_69ebb06c2a0af970).</description>
    <link>https://forem.com/maksim_n_69ebb06c2a0af970</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3867951%2F858d93e4-9679-4791-89c4-70d544cab0e4.png</url>
      <title>Forem: DataHive AI</title>
      <link>https://forem.com/maksim_n_69ebb06c2a0af970</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/maksim_n_69ebb06c2a0af970"/>
    <language>en</language>
    <item>
      <title>The New Era of Private Data</title>
      <dc:creator>DataHive AI</dc:creator>
      <pubDate>Fri, 10 Apr 2026 11:04:24 +0000</pubDate>
      <link>https://forem.com/maksim_n_69ebb06c2a0af970/the-new-era-of-private-data-1gdd</link>
      <guid>https://forem.com/maksim_n_69ebb06c2a0af970/the-new-era-of-private-data-1gdd</guid>
      <description>&lt;p&gt;DataHive AI released a new &lt;a href="https://clawhub.ai/datahiveai/datahive-ride-insights" rel="noopener noreferrer"&gt;OpenClaw skill&lt;/a&gt; called Ride Receipts (DataHive Ride Insights). The skill extracts structured ride data from Gmail receipts for Uber, Bolt, Yandex, Lyft, Free Now, Curb, Via and other providers. All processing occurs locally on the user’s machine. It stores the data in SQLite and produces an anonymized CSV for optional upload to DataHive missions.&lt;/p&gt;

&lt;p&gt;Technical Architecture and Data FlowThe skill uses this pipeline:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Email Ingestion 
The gog CLI fetches ride receipts from Gmail. Raw emails are saved &lt;code&gt;todata/ride-insights/emails.json&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;LLM Extraction
The &lt;code&gt;scriptextract_rides_gateway.py&lt;/code&gt; sends each email JSON (including HTML) to a local OpenClaw Gateway at &lt;code&gt;/v1/responses&lt;/code&gt;.
The LLM parses the receipt and outputs structured ride records to &lt;code&gt;data/ride-insights/rides.json&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;SQLite Storage
The &lt;code&gt;scriptinsert_rides_json_sqlite.py&lt;/code&gt; loads the extracted data into &lt;code&gt;data/ride-insights/rides.sqlite&lt;/code&gt;. The database follows the schema in &lt;code&gt;references/schema_rides.sql&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;Anonymized Export
The &lt;code&gt;scriptexport_anonymized_rides_csv.py&lt;/code&gt; generates &lt;code&gt;data/ride-insights/anonymized_rides.csv&lt;/code&gt; with only de-identified fields.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The skill requires a local OpenClaw Gateway (&lt;code&gt;http://127.0.0.1:port&lt;/code&gt; or ::1) . It refuses any non-localhost URL.&lt;/p&gt;

&lt;h2&gt;
  
  
  Privacy Implementation
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Raw emails and full receipt JSON stay on the device.&lt;/li&gt;
&lt;li&gt;The anonymized CSV contains only: provider, email_month (YYYY-MM), start_time_15m, end_time_15m, currency, amount, distance_km, duration_min, pickup_city, pickup_country, dropoff_city, dropoff_country.&lt;/li&gt;
&lt;li&gt;No addresses, payment details, driver names, message IDs or raw email content are exported.&lt;/li&gt;
&lt;li&gt;Users review the report before any upload.&lt;/li&gt;
&lt;li&gt;The gateway enforces localhost-only connections.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  OutputAfter installation the skill provides:
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Personal ride history in SQLite&lt;/li&gt;
&lt;li&gt;Total cost of rides in a single currency&lt;/li&gt;
&lt;li&gt;Summaries of spending, habits, repeated routes, anchor locations and time-of-day patterns&lt;/li&gt;
&lt;li&gt;Ready-to-upload anonymized CSV&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Installation
&lt;/h2&gt;

&lt;p&gt;Run the command: &lt;code&gt;openclaw skills install ride-insights&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Then start a new OpenClaw session and talk to the agent, or execute the bundled scripts directly:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;fetch_emails_json.py&lt;/li&gt;
&lt;li&gt;extract_rides_gateway.py&lt;/li&gt;
&lt;li&gt;insert_rides_json_sqlite.py&lt;/li&gt;
&lt;li&gt;export_anonymized_rides_csv.py&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The skill is available on ClawHub:&lt;br&gt;
Ride Receipts:&lt;a href="https://clawhub.ai/datahiveai/datahive-ride-insights" rel="noopener noreferrer"&gt;https://clawhub.ai/datahiveai/datahive-ride-insights&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;A &lt;a href="https://dashboard.datahive.ai/missions/96e5660d-26bc-4182-ae04-b22362ca1d37" rel="noopener noreferrer"&gt;mission in the DataHive AI&lt;/a&gt; dashboard accepts the anonymized CSV for users who want to contribute to the shared data pool. The skill demonstrates local agent processing for personal data extraction and optional anonymized contribution to decentralized AI training data.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>data</category>
      <category>openclaw</category>
      <category>web3</category>
    </item>
  </channel>
</rss>
