<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Sagar Joshi</title>
    <description>The latest articles on Forem by Sagar Joshi (@sagar_joshi_31ac4ef39367a).</description>
    <link>https://forem.com/sagar_joshi_31ac4ef39367a</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3626673%2Ff6ae7d4a-458c-4a17-bebd-3bd024379c9b.jpg</url>
      <title>Forem: Sagar Joshi</title>
      <link>https://forem.com/sagar_joshi_31ac4ef39367a</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/sagar_joshi_31ac4ef39367a"/>
    <language>en</language>
    <item>
      <title>We Solved the Recording Problem. The Playback Problem Is Still Broken.</title>
      <dc:creator>Sagar Joshi</dc:creator>
      <pubDate>Sun, 22 Mar 2026 20:54:00 +0000</pubDate>
      <link>https://forem.com/sagar_joshi_31ac4ef39367a/we-solved-the-recording-problem-the-playback-problem-is-still-broken-5ffj</link>
      <guid>https://forem.com/sagar_joshi_31ac4ef39367a/we-solved-the-recording-problem-the-playback-problem-is-still-broken-5ffj</guid>
      <description>&lt;p&gt;You joined a new company. Someone shared a link — a 2-hour product walkthrough, detailed, important. You opened it. You watched 20 minutes. You closed it and never went back.&lt;/p&gt;

&lt;p&gt;Nobody told you which 15 minutes actually mattered.&lt;/p&gt;

&lt;p&gt;We solved recording. We solved search. We solved summaries. But somewhere along the way, we optimized recordings for machines to understand — not for humans to consume.&lt;/p&gt;

&lt;p&gt;That is the playback problem. And it is not a new problem — it is an unsolved one.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Format Problem — Mid-2010s
&lt;/h2&gt;

&lt;p&gt;In the mid-2010s, enterprise platforms stored recordings in a fragmented landscape of proprietary formats. Adobe Connect used Flash-based FLV. WebEx had ARF. GoToMeeting used G2M. For platforms built on VNC-based screen sharing, the format was FBS — Free Buffer Stream. Not a mainstream format. A niche protocol dump with no standard player, no indexing, and tooling its own developers described as old and barely maintained.&lt;/p&gt;

&lt;p&gt;Fuze — a major enterprise UCaaS platform serving 400,000+ users — had accumulated thousands of recordings in this format without a viable conversion path.&lt;/p&gt;

&lt;p&gt;There was no off-the-shelf solution. We built one from scratch using open source tools — no commercial SDK, no vendor API.&lt;/p&gt;

&lt;p&gt;The approach: extend the RFB player code to dump a JPEG per frame, concatenate via ffmpeg at 30fps, solve sync drift between screen share and audio, handle resolution distortion using a Black Image Padding Technique.&lt;/p&gt;

&lt;p&gt;Result: hundreds of recordings rescued from obsolescence. A format problem solved.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Delivery Problem — 2017 to 2018
&lt;/h2&gt;

&lt;p&gt;Even converted recordings remained tethered to the company network. Petabytes of content — meetings, training, product walkthroughs — locked behind a network connection.&lt;/p&gt;

&lt;p&gt;Exporting the three components (audio, video, screen share) separately caused sync drift. The approach that worked: record the screen while the content played, using entirely open source tools.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Xvfb&lt;/strong&gt; — headless virtual framebuffer, ran the platform URL on Chrome&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;ScreenCastify&lt;/strong&gt; — captured Xvfb output&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;ffmpeg&lt;/strong&gt; — converted WebM to MP4&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;chrome.runtime APIs&lt;/strong&gt; — detected buffering, paused capture to prevent sync errors&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;RabbitMQ&lt;/strong&gt; — async task queue so users weren't blocked waiting for long exports&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Result: a download button that actually worked. The delivery problem solved.&lt;/p&gt;




&lt;h2&gt;
  
  
  What the Industry Actually Solved — And What It Didn't
&lt;/h2&gt;

&lt;p&gt;Tools like Otter.ai, Fireflies.ai, Read AI, and tl;dv transformed recordings into structured data. Teams Intelligent Recap added chapters. Panopto enabled dual-stream playback. Zoom added clips.&lt;/p&gt;

&lt;p&gt;This progress is real and useful.&lt;/p&gt;

&lt;p&gt;But it solved a different problem.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The industry optimized recordings for machines to understand — not for humans to consume.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you missed a meeting today, you have three options: read the summary, search the transcript, or watch the full recording. What you still cannot do is watch the right version of the recording.&lt;/p&gt;

&lt;p&gt;A 2-hour recording is still a 2-hour recording — just with better indexing.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Four Gaps
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. No viewer-controlled dynamic view switching&lt;/strong&gt;&lt;br&gt;
Zoom's multi-view recording requires host pre-configuration. Panopto's dual-stream requires editor intervention — not a viewer action. No platform offers audio-only mode or true viewer-controlled layout at replay time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. No downloadable intelligent highlight packages&lt;/strong&gt;&lt;br&gt;
Online highlight reels exist (Read AI, tl;dv) — but they don't travel offline. You can download the full 2-hour recording. You cannot download the intelligent 29-minute version.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Personalization exists at the summary layer — not the playback layer&lt;/strong&gt;&lt;br&gt;
Every tool sends the new joiner and the senior engineer the same MP4. Role-aware playback packaging does not exist.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. The root cause is invisible — recordings are permanently flattened at capture time&lt;/strong&gt;&lt;br&gt;
When a meeting ends, audio, video, and screen share are merged into a single MP4 and the streams are discarded. Automatically. Silently. Irreversibly. This single architectural decision forecloses every intelligent replay option downstream. Most users never know it happened.&lt;/p&gt;


&lt;h2&gt;
  
  
  A Missing Category: Intelligent Replay Systems
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Intelligent Replay is the ability to generate a personalized, context-aware version of a recording at playback time.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What makes this achievable now is multimodal AI. Models like Gemini and GPT-4o can watch a video stream and understand it visually — detecting when a presenter shifts to demonstrating, when a slide changes, when a live demo begins. They can decide, second by second, which stream carries the most relevant signal.&lt;/p&gt;

&lt;p&gt;The shift:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Recording → file → dataset → generated experience
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This is not an extension of existing tools — it is orthogonal to them.&lt;/p&gt;

&lt;p&gt;We would not accept a document that can only be read top to bottom with no ability to skip, restructure, or personalize. That is exactly how we still treat video.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why Nobody Has Built This Yet
&lt;/h2&gt;

&lt;p&gt;Three structural constraints:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Storage and processing cost&lt;/strong&gt; — Multi-stream recording means 2-3x storage. Per-user packaging multiplies compute.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Architectural inertia&lt;/strong&gt; — Most platforms flatten at capture time. Changing this requires product conviction most teams haven't developed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Implicit demand&lt;/strong&gt; — The signal was always there in unopened recording links, in "can you summarize this?" messages, in training videos unwatched for months. The industry prescribed a painkiller — summaries and transcripts. Nobody diagnosed the underlying condition: that playback itself was broken.&lt;/p&gt;

&lt;p&gt;But those constraints are weakening. Storage is cheaper. AI is significantly better. The cost of wasted time is becoming measurable.&lt;/p&gt;




&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;We solved recording. We solved summaries. But we never solved playback.&lt;/p&gt;

&lt;p&gt;The infrastructure exists. The AI exists. Multimodal models can already watch a video and decide which stream matters at every second. What is missing is the decision to treat playback as a product — not a file.&lt;/p&gt;

&lt;p&gt;To the startup ecosystem: the Big Three have optimized for the summary. The playback layer is uncontested.&lt;/p&gt;

&lt;p&gt;That category is Intelligent Replay. It is waiting to be claimed.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Full article with diagrams on Medium:&lt;/em&gt;&lt;br&gt;
&lt;a href="https://medium.com/@jo.sagar/we-solved-the-recording-problem-the-playback-problem-is-still-broken-1768038911b3" rel="noopener noreferrer"&gt;https://medium.com/@jo.sagar/we-solved-the-recording-problem-the-playback-problem-is-still-broken-1768038911b3&lt;/a&gt;&lt;/p&gt;

</description>
      <category>video</category>
      <category>ai</category>
      <category>productivity</category>
      <category>distributedsystems</category>
    </item>
    <item>
      <title>The Missing Protocol: How BFCP Unlocked Dual-Monitor Conferencing for Enterprise Room Systems</title>
      <dc:creator>Sagar Joshi</dc:creator>
      <pubDate>Sat, 14 Mar 2026 22:57:05 +0000</pubDate>
      <link>https://forem.com/sagar_joshi_31ac4ef39367a/the-missing-protocol-how-bfcp-unlocked-dual-monitor-conferencing-for-enterprise-room-systems-27af</link>
      <guid>https://forem.com/sagar_joshi_31ac4ef39367a/the-missing-protocol-how-bfcp-unlocked-dual-monitor-conferencing-for-enterprise-room-systems-27af</guid>
      <description>&lt;p&gt;Enterprise room systems like Polycom and Lifesize support dual monitors — one for video, one for screen share. But when they joined a web conference through a SIP gateway, both monitors showed the same combined feed.&lt;/p&gt;

&lt;p&gt;The hardware was capable. OpalVoIP had no BFCP support. So screen share got injected into the video feed.&lt;/p&gt;

&lt;p&gt;I added BFCP to OpalVoIP from scratch in 2018 — integrating libbfcp via a C++ wrapper, extending SDP offer/answer processing, and handling a dual-role design where the gateway acts as BFCP client on outbound calls and BFCP server on inbound calls.&lt;/p&gt;

&lt;p&gt;The full write-up covers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The SDP negotiation details (with before/after examples)&lt;/li&gt;
&lt;li&gt;The dual-role client/server architecture&lt;/li&gt;
&lt;li&gt;Why pre-negotiation beats mid-call renegotiation&lt;/li&gt;
&lt;li&gt;How WebRTC handles this differently today — and why SIP gateways still need BFCP&lt;/li&gt;
&lt;li&gt;Lessons learned from production deployment with real hardware endpoints 
&lt;a href="https://medium.com/@jo.sagar/the-missing-protocol-how-bfcp-unlocked-dual-monitor-conferencing-for-enterprise-room-systems-36e61b336fbf" rel="noopener noreferrer"&gt;Medium Article&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Open source contribution credited at open.gslab.com. The BFCP gap still exists in OpalVoIP's main codebase today — sharing in case it's useful to anyone hitting the same problem.&lt;/p&gt;

</description>
      <category>opensource</category>
      <category>voip</category>
      <category>distributedsystems</category>
      <category>softwaredevelopment</category>
    </item>
    <item>
      <title>The UX Problem Isn’t Speed — It’s Intent</title>
      <dc:creator>Sagar Joshi</dc:creator>
      <pubDate>Mon, 26 Jan 2026 16:24:13 +0000</pubDate>
      <link>https://forem.com/sagar_joshi_31ac4ef39367a/the-ux-problem-isnt-speed-its-intent-39ek</link>
      <guid>https://forem.com/sagar_joshi_31ac4ef39367a/the-ux-problem-isnt-speed-its-intent-39ek</guid>
      <description>&lt;p&gt;Modern software isn’t slow — it’s cognitively heavy.&lt;/p&gt;

&lt;p&gt;We’ve built fast systems, powerful features, and rich interfaces. Yet users still spend a surprising amount of effort translating what they want to do into clicks, menus, and workflows.&lt;/p&gt;

&lt;p&gt;The bottleneck isn’t performance.&lt;br&gt;
It’s that software lacks a shared, adaptive grammar for user intent.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Translation Gap: Intent vs Interface
&lt;/h2&gt;

&lt;p&gt;Users don’t think in UI elements.&lt;br&gt;
They think in goals.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“Reply quickly.”&lt;/li&gt;
&lt;li&gt;“Escalate this issue.”&lt;/li&gt;
&lt;li&gt;“Share this with the team.”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But software forces users to perform a translation step:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Find the right menu&lt;/li&gt;
&lt;li&gt;Recall where an action lives&lt;/li&gt;
&lt;li&gt;Learn different interaction patterns for every product&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This repeated translation creates cognitive overload — not because tools are complex, but because each tool speaks a different interaction language.&lt;/p&gt;

&lt;p&gt;Humans adapt to computers.&lt;br&gt;
Computers rarely adapt to humans.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Keyboard Shortcuts Don’t Fully Solve This
&lt;/h2&gt;

&lt;p&gt;Keyboard shortcuts were an early attempt to compress intent into speed. They help — but they have structural limits:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Invisible: Users can’t discover what they don’t know&lt;/li&gt;
&lt;li&gt;Inconsistent: Same shortcut, different behavior across apps&lt;/li&gt;
&lt;li&gt;High memory cost: Each tool demands a new mental vocabulary&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Most users don’t avoid shortcuts because they dislike efficiency — they avoid them because software rarely teaches shortcuts at the moment intent is expressed.&lt;/p&gt;

&lt;h2&gt;
  
  
  Adaptive UX: Learning at Runtime
&lt;/h2&gt;

&lt;p&gt;Static onboarding tours don’t work. Users skip them.&lt;/p&gt;

&lt;p&gt;What does work is just-in-time guidance — adapting while the user is already working.&lt;/p&gt;

&lt;p&gt;With modern telemetry and AI, this is finally practical.&lt;/p&gt;

&lt;p&gt;Examples:&lt;/p&gt;

&lt;h3&gt;
  
  
  Fade-in shortcuts
&lt;/h3&gt;

&lt;p&gt;If a user manually clicks “Export to PDF” multiple times, subtly surface the shortcut only when the pattern appears.&lt;/p&gt;

&lt;h3&gt;
  
  
  Behavioral macros
&lt;/h3&gt;

&lt;p&gt;If a user consistently mutes audio and disables video before a recurring meeting, the system can suggest automating it.&lt;/p&gt;

&lt;h3&gt;
  
  
  Gesture hints
&lt;/h3&gt;

&lt;p&gt;When repeated multi-step actions are detected, lightweight visual cues can introduce a gesture or touchpad action to replace them.&lt;/p&gt;

&lt;p&gt;This isn’t about adding features — it’s about teaching efficiency at the moment of intent.&lt;/p&gt;

&lt;h2&gt;
  
  
  Beyond the Keyboard: A Shared Interaction Vocabulary
&lt;/h2&gt;

&lt;p&gt;Intent doesn’t have to be typed.&lt;/p&gt;

&lt;p&gt;Modern devices already support:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;multi-finger gestures&lt;/li&gt;
&lt;li&gt;touchpad zones&lt;/li&gt;
&lt;li&gt;pressure or context-aware inputs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The real opportunity isn’t inventing new gestures — it’s making them consistent.&lt;/p&gt;

&lt;p&gt;If the same gesture meant “Send to team” across chat, email, and issue trackers, users would develop muscle memory for intent, not for menus.&lt;/p&gt;

&lt;p&gt;That’s what a shared interaction grammar enables.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Matters for Product Teams
&lt;/h2&gt;

&lt;p&gt;From a product and UX perspective, intent-driven interaction offers clear business value:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Faster onboarding: Users become effective through usage, not training&lt;/li&gt;
&lt;li&gt;Lower support load: Many “how do I” tickets are really “where is” problems&lt;/li&gt;
&lt;li&gt;Higher flow state: Reduced friction keeps users focused on outcomes&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;When software understands what the user is trying to do, it can minimize the effort required to do it.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Comes Next
&lt;/h2&gt;

&lt;p&gt;The industry doesn’t need more features.&lt;/p&gt;

&lt;p&gt;It needs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;shared patterns for expressing common user intents&lt;/li&gt;
&lt;li&gt;adaptive interfaces that learn from real behavior&lt;/li&gt;
&lt;li&gt;interaction models that prioritize goals over UI mechanics&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We’ve already personalized content.&lt;br&gt;
Now it’s time to personalize interaction.&lt;/p&gt;

&lt;h2&gt;
  
  
  Closing Thought
&lt;/h2&gt;

&lt;p&gt;Good software doesn’t ask users to think harder.&lt;/p&gt;

&lt;p&gt;It understands what they’re trying to achieve — and clears the shortest path to get there.&lt;/p&gt;

&lt;h2&gt;
  
  
  Reference
&lt;/h2&gt;

&lt;p&gt;For an earlier exploration of structured, intent-driven interaction patterns:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://priorart.ip.com/IPCOM/000175764" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>uxdesign</category>
      <category>softwaredesign</category>
      <category>productdesign</category>
      <category>humancomputerinteraction</category>
    </item>
    <item>
      <title>Why Collaboration Still Breaks Context — And What Comes Next</title>
      <dc:creator>Sagar Joshi</dc:creator>
      <pubDate>Mon, 05 Jan 2026 19:24:47 +0000</pubDate>
      <link>https://forem.com/sagar_joshi_31ac4ef39367a/why-collaboration-still-breaks-context-and-what-comes-next-3gbi</link>
      <guid>https://forem.com/sagar_joshi_31ac4ef39367a/why-collaboration-still-breaks-context-and-what-comes-next-3gbi</guid>
      <description>&lt;p&gt;Even in 2026, collaboration still feels oddly fragile.&lt;/p&gt;

&lt;p&gt;We jump between chat, calls, email, documents, and wikis all day — yet the context rarely follows. A discussion starts in chat, escalates to a call, spills into documents, and somehow ends up summarized (poorly) in email.&lt;/p&gt;

&lt;p&gt;The tools are powerful.&lt;br&gt;
The experience is fragmented.&lt;/p&gt;

&lt;p&gt;This isn’t a UX nit — it’s a systems gap.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Real Problem Isn’t Channels — It’s Context
&lt;/h2&gt;

&lt;p&gt;Most modern collaboration platforms are optimized around mediums, not conversations.&lt;/p&gt;

&lt;p&gt;What breaks during transitions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;participants change&lt;/li&gt;
&lt;li&gt;intent gets diluted&lt;/li&gt;
&lt;li&gt;decisions lose traceability&lt;/li&gt;
&lt;li&gt;historical “why” disappears&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Even with AI summaries, context is usually reconstructed after the fact, inside a single tool.&lt;/p&gt;

&lt;p&gt;What’s missing is a continuous conversation model that survives medium changes.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Context-Preserving Collaboration Should Feel Like
&lt;/h2&gt;

&lt;p&gt;Imagine moving fluidly across:&lt;/p&gt;

&lt;p&gt;Chat → Call → Doc → Email → Wiki → Chat&lt;/p&gt;

&lt;p&gt;…and carrying forward:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;participants&lt;/li&gt;
&lt;li&gt;shared artifacts&lt;/li&gt;
&lt;li&gt;conversation history&lt;/li&gt;
&lt;li&gt;decisions&lt;/li&gt;
&lt;li&gt;intent&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No re-explaining.&lt;br&gt;
No restarting.&lt;br&gt;
No “what did we decide again?”&lt;/p&gt;

&lt;p&gt;This is not a feature problem — it’s an architecture problem.&lt;/p&gt;

&lt;h2&gt;
  
  
  An Old Idea That’s Suddenly Relevant Again
&lt;/h2&gt;

&lt;p&gt;Years ago, an IBM defensive disclosure (IPCOM/000193228) explored a simple but powerful idea:&lt;/p&gt;

&lt;p&gt;Dynamically switch between collaboration mediums while preserving the full interaction context.&lt;/p&gt;

&lt;p&gt;The concept introduced a collaboration switch that:&lt;/p&gt;

&lt;p&gt;captured parameters from the current medium (participants, content, resources)&lt;/p&gt;

&lt;p&gt;mapped them into the next medium&lt;/p&gt;

&lt;p&gt;preserved continuity across transitions&lt;/p&gt;

&lt;p&gt;At the time, it felt ahead of the ecosystem.&lt;/p&gt;

&lt;p&gt;Today, it feels like the missing layer.&lt;/p&gt;

&lt;p&gt;Where Today’s Tools Still Fall Short&lt;/p&gt;

&lt;h3&gt;
  
  
  Microsoft Teams
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Strength: Strong in-app context&lt;/li&gt;
&lt;li&gt;What’s missing: Email ↔ Teams continuity still feels fragmented&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Slack
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Strength: Excellent chat threads&lt;/li&gt;
&lt;li&gt;What’s missing: Context often breaks when switching to meetings&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Zoom
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Strength: Great meeting capture&lt;/li&gt;
&lt;li&gt;What’s missing: Weak pre- and post-meeting continuity&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Google Workspace
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Strength: Smart Canvas shows promise&lt;/li&gt;
&lt;li&gt;What’s missing: No unified memory across chat, mail, docs, and meetings
📌 Despite different strengths, none provide a single conversation graph across mediums.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;There is still no unified conversation graph.&lt;/p&gt;

&lt;h2&gt;
  
  
  Three Gaps That Matter in 2026
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;Context Is Still Siloed&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;AI summaries live inside tools, not across them.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;No Intelligent Medium Switching&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Tools don’t suggest when to switch:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;heated debate → call&lt;/li&gt;
&lt;li&gt;long call → document&lt;/li&gt;
&lt;li&gt;stalled thread → async summary&lt;/li&gt;
&lt;/ul&gt;

&lt;ol&gt;
&lt;li&gt;No Unified Timeline&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;There’s no single narrative stitching together:&lt;br&gt;
chat + calls + docs + edits + decisions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Contextual Search Changes Everything
&lt;/h2&gt;

&lt;p&gt;Once context is unified, search becomes transformational.&lt;/p&gt;

&lt;p&gt;Instead of keywords, you ask:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“What concerns came up before Feature X was approved?”&lt;/li&gt;
&lt;li&gt;“Show the full incident timeline across tools.”&lt;/li&gt;
&lt;li&gt;“Summarize discussions involving these people last month.”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Search becomes intent-aware, narrative-aware, and cross-channel.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Is Finally Possible
&lt;/h2&gt;

&lt;p&gt;The technology stack has caught up:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;LLMs for semantic mapping&lt;/li&gt;
&lt;li&gt;embeddings + graph databases for conversation modeling&lt;/li&gt;
&lt;li&gt;identity resolution across tools&lt;/li&gt;
&lt;li&gt;APIs that expose collaboration state&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The pieces exist.&lt;br&gt;
What’s missing is the unifying layer.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Comes Next
&lt;/h2&gt;

&lt;p&gt;The future likely includes:&lt;/p&gt;

&lt;p&gt;persistent context profiles that follow work, not tools&lt;/p&gt;

&lt;p&gt;AI-driven medium recommendations&lt;/p&gt;

&lt;p&gt;unified conversation timelines&lt;/p&gt;

&lt;p&gt;portable context across platforms&lt;/p&gt;

&lt;p&gt;Collaboration shouldn’t belong to an app.&lt;br&gt;
It should belong to the work itself.&lt;/p&gt;

&lt;h2&gt;
  
  
  Closing Thought
&lt;/h2&gt;

&lt;p&gt;We’ve built excellent channels.&lt;/p&gt;

&lt;p&gt;Now it’s time to build the intelligent connective tissue between them.&lt;/p&gt;

&lt;p&gt;If you work on collaboration platforms, distributed systems, or developer productivity:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Where does context break most for you?&lt;/li&gt;
&lt;li&gt;Would automatic medium switching help — or feel intrusive?&lt;/li&gt;
&lt;li&gt;What would a unified timeline change in your day?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Curious to hear how others think about this problem.&lt;/p&gt;

</description>
      <category>distributedsystems</category>
      <category>ai</category>
      <category>productivity</category>
      <category>softwareengineering</category>
    </item>
    <item>
      <title>Why Real-Time Communication Still Breaks — And What a 2014 Idea Got Right</title>
      <dc:creator>Sagar Joshi</dc:creator>
      <pubDate>Fri, 05 Dec 2025 17:32:34 +0000</pubDate>
      <link>https://forem.com/sagar_joshi_31ac4ef39367a/why-real-time-communication-still-breaks-and-what-a-2014-idea-got-right-1b40</link>
      <guid>https://forem.com/sagar_joshi_31ac4ef39367a/why-real-time-communication-still-breaks-and-what-a-2014-idea-got-right-1b40</guid>
      <description>&lt;p&gt;Every time a Teams, Zoom, or Webex call freezes during an important moment, we’re reminded of something uncomfortable:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Real-time communication is still too fragile.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Even in 2025, we continue to see:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Regional outages
&lt;/li&gt;
&lt;li&gt;Overloaded media servers
&lt;/li&gt;
&lt;li&gt;Capacity limits blocking new joiners
&lt;/li&gt;
&lt;li&gt;Users forced to rejoin calls
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Years ago at IBM, we explored a different path — one that didn’t depend on running idle backup servers.&lt;/p&gt;




&lt;h2&gt;
  
  
  A Simple but Powerful Idea (US Patent 10,051,235)
&lt;/h2&gt;

&lt;p&gt;Instead of keeping mirrored servers on standby, the system could:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Monitor performance using SDP thresholds
&lt;/li&gt;
&lt;li&gt;Dynamically elect an active participant as a temporary media relay
&lt;/li&gt;
&lt;li&gt;Switch over seamlessly when the primary server failed
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No reconnects.&lt;br&gt;&lt;br&gt;
No cold backups.&lt;br&gt;&lt;br&gt;
No duplicated cloud cost.&lt;/p&gt;




&lt;h2&gt;
  
  
  Traditional Redundancy vs. Our 2014 Approach
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Traditional Redundancy
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Requires mirrored idle servers
&lt;/li&gt;
&lt;li&gt;High cloud/infra cost
&lt;/li&gt;
&lt;li&gt;Failover takes seconds
&lt;/li&gt;
&lt;li&gt;Participants must reconnect
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Our Approach
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Elect from active participants
&lt;/li&gt;
&lt;li&gt;Zero idle infrastructure
&lt;/li&gt;
&lt;li&gt;Instant failover
&lt;/li&gt;
&lt;li&gt;Session stays alive
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  A 2025 Application: Scaling When Servers Hit Capacity
&lt;/h2&gt;

&lt;p&gt;One of today’s biggest challenges:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Live events and webinars hitting concurrency limits.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;When a media server is full, instead of rejecting viewers:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Let an existing viewer temporarily proxy the stream.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This gives:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Graceful scaling during spikes
&lt;/li&gt;
&lt;li&gt;Fewer “room full” errors
&lt;/li&gt;
&lt;li&gt;Continuity during partial outages
&lt;/li&gt;
&lt;li&gt;Better reliability in poor networks
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  The Hard Part: Graceful Failback
&lt;/h2&gt;

&lt;p&gt;A modern design would need to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Sync media + session state
&lt;/li&gt;
&lt;li&gt;Avoid ping-pong failovers
&lt;/li&gt;
&lt;li&gt;Time the switch during quiet windows
&lt;/li&gt;
&lt;li&gt;Possibly split media vs signaling paths
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;With today’s telemetry (latency, jitter, CPU), this can even be predictive.&lt;/p&gt;




&lt;h2&gt;
  
  
  Quiet Validation Over the Years
&lt;/h2&gt;

&lt;p&gt;This idea was later granted as U.S. Patent &lt;strong&gt;10,051,235&lt;/strong&gt;, now cited by:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Unify
&lt;/li&gt;
&lt;li&gt;Vonage
&lt;/li&gt;
&lt;li&gt;VMware
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;(Full citation list available on Google Patents.)&lt;/p&gt;




&lt;h2&gt;
  
  
  Closing Thought
&lt;/h2&gt;

&lt;p&gt;Failure in real-time systems is inevitable.&lt;br&gt;&lt;br&gt;
But failure &lt;strong&gt;doesn’t need to reach the user&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;If you're working on real-time media, cloud systems, or distributed comms, I’d love to hear your perspective:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What does “never drop the call” look like in your world today?&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  References
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://www.linkedin.com/pulse/from-cloud-outages-built-in-continuity-rethinking-sagar-joshi-8xogc" rel="noopener noreferrer"&gt;https://www.linkedin.com/pulse/from-cloud-outages-built-in-continuity-rethinking-sagar-joshi-8xogc&lt;/a&gt;&lt;br&gt;
&lt;a href="https://patents.google.com/patent/US10051235B2" rel="noopener noreferrer"&gt;https://patents.google.com/patent/US10051235B2&lt;/a&gt;&lt;br&gt;
&lt;a href="https://medium.com/@jo.sagar/from-access-cards-to-ai-presence-the-evolution-of-intelligent-communication-routing-5ad39dea698e" rel="noopener noreferrer"&gt;https://medium.com/@jo.sagar/from-access-cards-to-ai-presence-the-evolution-of-intelligent-communication-routing-5ad39dea698e&lt;/a&gt;&lt;/p&gt;

</description>
      <category>webrtc</category>
      <category>distributedsystems</category>
      <category>architecture</category>
      <category>cloud</category>
    </item>
    <item>
      <title>A 2009 IBM Patent That Solved Indoor Location Without GPS — And Got Cited 64 Times by Cisco, Microsoft, Avaya…</title>
      <dc:creator>Sagar Joshi</dc:creator>
      <pubDate>Mon, 24 Nov 2025 07:00:06 +0000</pubDate>
      <link>https://forem.com/sagar_joshi_31ac4ef39367a/a-2009-ibm-patent-that-solved-indoor-location-without-gps-and-got-cited-64-times-by-cisco-3pnn</link>
      <guid>https://forem.com/sagar_joshi_31ac4ef39367a/a-2009-ibm-patent-that-solved-indoor-location-without-gps-and-got-cited-64-times-by-cisco-3pnn</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Futdpt1hq23zlfl3okc90.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Futdpt1hq23zlfl3okc90.png" alt="UX Diagram" width="800" height="777"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9ok3l2jizuu4zk673uvq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9ok3l2jizuu4zk673uvq.png" alt="Location Update Flow Chart" width="800" height="1143"&gt;&lt;/a&gt;&lt;br&gt;
In 2009 most location patents were about GPS.&lt;br&gt;
We took a different route: use the badge-swipe data enterprises already had, infer where someone probably is, and route calls accordingly — no tracking, no battery drain, no extra hardware.&lt;br&gt;
Patent US8635366B2 was born.&lt;br&gt;
Fifteen years and 64 forward citations later (including Cisco, Microsoft, Avaya, RingCentral, Mitel), it feels worth revisiting.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://patents.google.com/patent/US8635366" rel="noopener noreferrer"&gt;Click here to open patent link&lt;/a&gt; &lt;/p&gt;

&lt;p&gt;More than a decade ago, while leading a team at IBM, I co-authored and successfully prosecuted U.S. Patent 8,635,366 — “Communication Routing” — a foundational invention in context-aware communication systems.&lt;/p&gt;

&lt;p&gt;The goal was simple yet visionary: connect people through the right device at the right time — without GPS or invasive tracking.&lt;/p&gt;

&lt;p&gt;Back then, GPS was expensive, unreliable indoors, and privacy concerns were rising. Our patented method used existing enterprise access control data (badge swipes) to infer location and route calls — a low-cost, privacy-first alternative that predated modern AI presence systems by nearly a decade.&lt;/p&gt;

&lt;p&gt;Key Innovation (Claim 1): “Routing communication to an individual by identifying current location from access control information.” This was a new paradigm for location inference without geolocation hardware.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why It Still Matters in 2025&lt;/strong&gt;&lt;br&gt;
Despite billions invested in collaboration tools, the core problem remains unsolved: systems still don’t truly know where you are or how best to reach you.&lt;/p&gt;

&lt;p&gt;Today, most platforms default to “ring everything” — creating noise, not clarity.&lt;/p&gt;

&lt;p&gt;But what if communication could be predictive, adaptive, and human-centered?&lt;/p&gt;

&lt;p&gt;With cloud, AI, and behavioral signals now mature, we can reimagine my original patent as an AI-driven presence engine — one that learns from patterns while preserving privacy.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Smarter Device Routing Through AI (Next-Gen Vision)&lt;/strong&gt;&lt;br&gt;
Context SignalIntelligent ActionActive laptop on corporate Wi-Fi→ Ring desktop app firstCommute pattern (mobile + Bluetooth)→ Route to phone, suppress desktopCalendar focus block→ Auto-deflect or notify laterBadge swipe at cafeteria→ Update IM status: “In cafeteria — back at 1 PM”&lt;/p&gt;

&lt;p&gt;Become a member&lt;br&gt;
This isn’t science fiction — it’s the natural evolution of my 2009 patent, now supercharged with AI.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Proven Impact: Cited by 64 Patents&lt;/strong&gt;&lt;br&gt;
My invention has been cited 64 times — including by Cisco, Microsoft, Avaya, RingCentral, and Mitel — in systems for:&lt;/p&gt;

&lt;p&gt;Location-based call routing (Cisco)&lt;br&gt;
Presence aggregation (Microsoft)&lt;br&gt;
Dynamic device selection (Avaya)&lt;br&gt;
Click here for full citation list&lt;/p&gt;

&lt;p&gt;This sustained influence proves the patent’s role as a foundational building block in enterprise unified communications.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Looking Ahead: From Patent to Platform&lt;/strong&gt;&lt;br&gt;
What began as a GPS alternative is now a blueprint for AI-native communication.&lt;/p&gt;

&lt;p&gt;Today, I’m exploring how generative AI + privacy-preserving inference can power the next leap:&lt;/p&gt;

&lt;p&gt;“Presence that predicts, protects, and personalizes — without ever tracking.”&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Let’s Build the Future of Communication&lt;/strong&gt;&lt;br&gt;
How would you design an intelligent routing system today?&lt;/p&gt;

&lt;p&gt;Should AI infer availability from calendar + badge + typing patterns?&lt;br&gt;
How do we balance prediction with privacy?&lt;br&gt;
Can we make “do not disturb” truly intelligent?&lt;br&gt;
Drop your thoughts below — I read every comment.&lt;/p&gt;

&lt;p&gt;Originally shared on LinkedIn and Medium.&lt;br&gt;&lt;br&gt;
Patent: &lt;a href="https://patents.google.com/patent/US8635366B2" rel="noopener noreferrer"&gt;https://patents.google.com/patent/US8635366B2&lt;/a&gt;&lt;br&gt;&lt;br&gt;
64-citation CSV: &lt;a href="https://patents.google.com/patent/US8635366B2#patentCitations" rel="noopener noreferrer"&gt;https://patents.google.com/patent/US8635366B2#patentCitations&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Curious how you’d extend this idea with today’s telemetry + ML? Let’s talk in the comments.&lt;/p&gt;

&lt;h1&gt;
  
  
  AI #CloudComputing #VoIP #Innovation #SoftwareArchitecture #PrivacyTech #Patents #EB1A #TechLeadership #Presence #LocationAwareness
&lt;/h1&gt;

</description>
      <category>discuss</category>
      <category>privacy</category>
      <category>ai</category>
      <category>iot</category>
    </item>
  </channel>
</rss>
