<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Lakshay Garg</title>
    <description>The latest articles on Forem by Lakshay Garg (@lagarg7).</description>
    <link>https://forem.com/lagarg7</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3828075%2F223e260d-db5a-4139-9861-08d36dd0bf24.png</url>
      <title>Forem: Lakshay Garg</title>
      <link>https://forem.com/lagarg7</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/lagarg7"/>
    <language>en</language>
    <item>
      <title>Building MediSense: A Real-Time Emergency Healthcare Co-Pilot with Gemini Live API on Google Cloud</title>
      <dc:creator>Lakshay Garg</dc:creator>
      <pubDate>Mon, 16 Mar 2026 20:37:35 +0000</pubDate>
      <link>https://forem.com/lagarg7/building-medisense-a-real-time-emergency-healthcare-co-pilot-with-gemini-live-api-on-google-cloud-5h87</link>
      <guid>https://forem.com/lagarg7/building-medisense-a-real-time-emergency-healthcare-co-pilot-with-gemini-live-api-on-google-cloud-5h87</guid>
      <description>&lt;p&gt;Healthcare workers in rural clinics and home-care settings often face critical situations without immediate access to specialist guidance. MediSense is an AI-powered remote emergency co-pilot that provides real-time voice, video, and screen analysis to support clinical decision-making — built entirely on Google AI and Google Cloud.&lt;/p&gt;

&lt;p&gt;Live Demo: &lt;a href="https://medisense-130810972151.us-central1.run.app" rel="noopener noreferrer"&gt;https://medisense-130810972151.us-central1.run.app&lt;/a&gt;&lt;br&gt;
GitHub: &lt;a href="https://github.com/lakshay0007/MediSense" rel="noopener noreferrer"&gt;https://github.com/lakshay0007/MediSense&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The Problem&lt;br&gt;
A junior nurse in a remote clinic encounters a wound she hasn't seen before. There's no senior doctor on-site. She needs guidance — right now — not in 30 minutes when someone calls back. MediSense bridges that gap with an AI co-pilot that can see what she sees and talk her through it in real time.&lt;/p&gt;

&lt;p&gt;Google AI Models Used&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Gemini Live 2.5 Flash (Native Audio): Real-Time Multimodal Streaming&lt;br&gt;
The core of MediSense is the Gemini Multimodal Live API (gemini-live-2.5-flash-native-audio). This enables:&lt;/li&gt;
&lt;/ol&gt;

&lt;ul&gt;
&lt;li&gt;Live voice conversation: the nurse speaks naturally and gets spoken responses&lt;/li&gt;
&lt;li&gt;Real-time camera analysis: point a phone camera at a wound, equipment, or patient and Gemini analyzes the video feed live&lt;/li&gt;
&lt;li&gt;Screen sharing analysis: share an EHR screen or vital signs monitor for AI interpretation&lt;/li&gt;
&lt;/ul&gt;
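&lt;p&gt;A minimal sketch of one voice turn over the Live API, assuming the google-genai Python SDK pointed at Vertex AI. The helper names, the project and region arguments, and the 16 kHz PCM input format are illustrative assumptions, not MediSense's actual code.&lt;/p&gt;

```python
# Model name as given in this post.
LIVE_MODEL = "gemini-live-2.5-flash-native-audio"


def live_config():
    """Session config asking the model to answer with spoken audio."""
    return {"response_modalities": ["AUDIO"]}


async def stream_one_turn(pcm_chunks, project, location):
    """Send microphone chunks and collect the model's audio reply.

    pcm_chunks: iterable of raw 16-bit PCM byte strings (assumed 16 kHz).
    """
    # Imported lazily so the helpers above work without the SDK installed.
    from google import genai
    from google.genai import types

    client = genai.Client(vertexai=True, project=project, location=location)
    reply = bytearray()
    async with client.aio.live.connect(
        model=LIVE_MODEL, config=live_config()
    ) as session:
        for chunk in pcm_chunks:
            await session.send_realtime_input(
                audio=types.Blob(data=chunk, mime_type="audio/pcm;rate=16000")
            )
        # Signal that the microphone stream has ended so the model can respond.
        await session.send_realtime_input(audio_stream_end=True)
        # receive() yields server messages until the current turn completes.
        async for message in session.receive():
            if message.data:
                reply.extend(message.data)
    return bytes(reply)
```

&lt;p&gt;Calling it would look like asyncio.run(stream_one_turn(chunks, "my-project", "us-central1")), where both arguments are placeholders. Camera and screen frames would travel over the same session alongside the audio.&lt;/p&gt;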

&lt;ol start="2"&gt;
&lt;li&gt;Gemini 2.0 Flash (Image Generation): Visual Aid Generation&lt;br&gt;
MediSense also uses gemini-2.0-flash-preview-image-generation to generate visual aids on demand, such as anatomical diagrams, procedure illustrations, or reference images that help guide clinical procedures.&lt;/li&gt;
&lt;/ol&gt;
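&lt;p&gt;Generating a visual aid might look roughly like this, again assuming the google-genai SDK on Vertex AI. Both function names and the project and location arguments are hypothetical; the response handling reflects how the SDK returns images as inline data parts.&lt;/p&gt;

```python
# Model name as given in this post.
IMAGE_MODEL = "gemini-2.0-flash-preview-image-generation"


def extract_images(response):
    """Return the raw bytes of every image part in a generate_content response."""
    images = []
    for candidate in response.candidates or []:
        content = candidate.content
        if content is None:
            continue
        for part in content.parts or []:
            inline = getattr(part, "inline_data", None)
            if inline is not None and inline.data:
                images.append(inline.data)
    return images


def generate_visual_aid(prompt, project, location):
    """Ask the image model for an illustration (hypothetical helper)."""
    # Imported lazily so extract_images can be used without the SDK installed.
    from google import genai
    from google.genai import types

    client = genai.Client(vertexai=True, project=project, location=location)
    response = client.models.generate_content(
        model=IMAGE_MODEL,
        contents=prompt,
        # Request both text and image output from the image generation model.
        config=types.GenerateContentConfig(response_modalities=["TEXT", "IMAGE"]),
    )
    return extract_images(response)
```

&lt;p&gt;Keeping the part extraction separate from the API call makes it easy to check the parsing logic against a stubbed response.&lt;/p&gt;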

&lt;p&gt;Google Cloud Architecture&lt;br&gt;
Vertex AI&lt;br&gt;
All Gemini API calls go through Vertex AI, Google Cloud's managed ML platform.&lt;/p&gt;
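&lt;p&gt;As a sketch, pointing the google-genai SDK at Vertex AI comes down to three client arguments. The helper names and the us-central1 fallback are assumptions made for illustration; credentials are expected to come from Application Default Credentials rather than an API key.&lt;/p&gt;

```python
import os


def vertex_client_kwargs(project=None, location=None):
    """Keyword arguments that point the google-genai client at Vertex AI."""
    return {
        "vertexai": True,
        "project": project or os.environ.get("GOOGLE_CLOUD_PROJECT"),
        # Fallback region is an assumption; use whichever region you deploy to.
        "location": location or os.environ.get("GOOGLE_CLOUD_LOCATION", "us-central1"),
    }


def make_client(project=None, location=None):
    """Create a Vertex AI backed client (requires the google-genai package)."""
    # Imported lazily so vertex_client_kwargs works without the SDK installed.
    from google import genai

    return genai.Client(**vertex_client_kwargs(project, location))
```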

&lt;p&gt;Using Vertex AI gives us enterprise-grade authentication (OAuth 2.0), regional endpoints, and production-ready reliability.&lt;/p&gt;

&lt;p&gt;Cloud Run&lt;br&gt;
The app is containerized with Docker and deployed to Google Cloud Run for serverless, auto-scaling hosting.&lt;/p&gt;
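&lt;p&gt;The main thing a container has to do for Cloud Run is listen on the port passed in the PORT environment variable. The sketch below shows that contract with the standard library only; HealthHandler is a hypothetical stand-in for the real MediSense server, which is not shown in this post.&lt;/p&gt;

```python
import os
from http.server import BaseHTTPRequestHandler, HTTPServer


def resolve_port(environ=None):
    """Read the listening port from PORT, falling back to 8080."""
    environ = os.environ if environ is None else environ
    return int(environ.get("PORT", "8080"))


class HealthHandler(BaseHTTPRequestHandler):
    """Hypothetical stand-in for the real app: answers every GET with 'ok'."""

    def do_GET(self):
        body = b"ok"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve():
    """Start the server; blocks until the process is stopped."""
    # Bind to 0.0.0.0 so the container accepts traffic from outside itself.
    HTTPServer(("0.0.0.0", resolve_port()), HealthHandler).serve_forever()
```

&lt;p&gt;The container's entrypoint would call serve(). Running the same code locally without PORT set falls back to 8080.&lt;/p&gt;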

&lt;p&gt;Cloud Run handles scaling automatically — zero instances when idle, scaling up under load — perfect for a healthcare tool that needs to be always available but cost-efficient.&lt;/p&gt;

&lt;p&gt;Cloud Build&lt;br&gt;
CI/CD is handled by Google Cloud Build, which builds the Docker image directly from source.&lt;/p&gt;
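&lt;p&gt;One way to script the build and deploy steps is to assemble the two gcloud invocations in Python. This is a sketch under assumptions: the image path, service name, and region are placeholders you would supply, and the project's actual pipeline may be configured differently.&lt;/p&gt;

```python
import subprocess


def build_command(image, source_dir="."):
    """gcloud call that has Cloud Build build source_dir and push it as image."""
    return ["gcloud", "builds", "submit", "--tag", image, source_dir]


def deploy_command(service, image, region):
    """gcloud call that deploys an already built image to Cloud Run."""
    return [
        "gcloud", "run", "deploy", service,
        "--image", image,
        "--region", region,
    ]


def build_and_deploy(service, image, region):
    """Run both steps in order, stopping at the first failure."""
    for command in (build_command(image), deploy_command(service, image, region)):
        subprocess.run(command, check=True)
```

&lt;p&gt;Keeping the commands as argument lists avoids shell quoting problems and makes each step easy to log or dry-run before executing it.&lt;/p&gt;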

&lt;p&gt;What I Learned&lt;br&gt;
The Multimodal Live API is a game-changer. Being able to stream video + audio bidirectionally opens up use cases that weren't possible with traditional request-response APIs. Healthcare is one of the most impactful.&lt;/p&gt;

&lt;p&gt;Vertex AI simplifies production deployment. OAuth-based auth, regional endpoints, and the google-genai SDK made it straightforward to go from prototype to production.&lt;/p&gt;

&lt;p&gt;Cloud Run + Cloud Build = fast iteration. Push code, build container, deploy — all in under 2 minutes. For a hackathon, this speed is essential.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>gemini</category>
      <category>googlecloud</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
