<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Naman Dhakad</title>
    <description>The latest articles on Forem by Naman Dhakad (@naman_dhakad).</description>
    <link>https://forem.com/naman_dhakad</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3803074%2Ffa660fda-157a-4983-bb0b-2c5140811f26.gif</url>
      <title>Forem: Naman Dhakad</title>
      <link>https://forem.com/naman_dhakad</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/naman_dhakad"/>
    <language>en</language>
    <item>
      <title>🚀 From Paper to Pixels: How I Built an AI-Powered Exam Platform with Google Gemini</title>
      <dc:creator>Naman Dhakad</dc:creator>
      <pubDate>Tue, 03 Mar 2026 05:53:05 +0000</pubDate>
      <link>https://forem.com/naman_dhakad/from-paper-to-pixels-how-i-built-an-ai-powered-exam-platform-with-google-gemini-46mh</link>
      <guid>https://forem.com/naman_dhakad/from-paper-to-pixels-how-i-built-an-ai-powered-exam-platform-with-google-gemini-46mh</guid>
      <description>&lt;p&gt;&lt;code&gt;Transforming PDF question papers into interactive CBT experiences - a journey through hackathons, AI, and building for millions of students&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  The Spark ✨
&lt;/h2&gt;

&lt;p&gt;It was 2 AM in the hostel room. My friend was frustrated—again. He'd solved a 180-question JEE mock test on paper, only to realize he'd made a calculation error on question #45 that threw off his entire ranking. No instant results. No analytics. No way to know his weak areas.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"If only there was a way to just... upload the PDF and take it like the real exam."&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That question changed everything.&lt;/p&gt;




&lt;h2&gt;
  
  
  🎯 What I Built: Rankify
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Rankify&lt;/strong&gt; is an AI-powered platform that transforms any PDF question paper into a fully interactive computer-based test (CBT). No more paper. No more manual checking. Just upload, practice, and analyze.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Magic Flow ✨
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;graph LR
    A[📄 PDF Upload] --&amp;gt; B[🤖 AI Extraction]
    B --&amp;gt; C[👁️ Review &amp;amp; Edit]
    C --&amp;gt; D[⚙️ Configure Test]
    D --&amp;gt; E[🎮 Take the Test]
    E --&amp;gt; F[📊 Analyze Results]
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Features That Actually Matter 🎮
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Feature&lt;/th&gt;
&lt;th&gt;What It Does&lt;/th&gt;
&lt;th&gt;Why It Matters&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;AI Extraction&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Gemini reads your PDF and extracts questions&lt;/td&gt;
&lt;td&gt;No manual typing&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Confidence Scores&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Each question gets a 1-5 confidence rating&lt;/td&gt;
&lt;td&gt;Know what to review&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Diagram Detection&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;AI finds and maps diagrams to questions&lt;/td&gt;
&lt;td&gt;Visual questions handled&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Multi-PDF Batch&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Process 10 PDFs at once&lt;/td&gt;
&lt;td&gt;Save time&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Offline Mode&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Works without internet&lt;/td&gt;
&lt;td&gt;Study anywhere&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  🧠 The Hackathon Journey
&lt;/h2&gt;

&lt;p&gt;This wasn't a weekend project. It started at an MLH hackathon where I had &lt;strong&gt;24 hours&lt;/strong&gt; to prove the concept worked.&lt;/p&gt;

&lt;h3&gt;
  
  
  Day 1: The Chaos 🌪️
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Me at 6 PM: "I'll use Gemini to extract questions from PDFs!"
Me at 8 PM: *Reading Gemini documentation*
Me at 10 PM: *Making API calls that fail*
Me at midnight: *First successful extraction! 🎉*
Me at 2 AM: *It works but the JSON is garbage*
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Day 2: The Pivot 🔄
&lt;/h3&gt;

&lt;p&gt;I realized the hard way: &lt;strong&gt;JSON output from LLMs is unpredictable&lt;/strong&gt;. So I built:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Zero-tolerance prompts&lt;/strong&gt; - Instructions so strict Gemini had no choice but to behave&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Fallback chains&lt;/strong&gt; - If one model fails, automatically try the next&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Confidence scoring&lt;/strong&gt; - A multi-factor algorithm to rate extraction quality
&lt;/li&gt;
&lt;/ol&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight typescript"&gt;&lt;code&gt;&lt;span class="c1"&gt;// The fallback chain that saved my hackathon&lt;/span&gt;
&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;modelFallback&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;gemini-2.5-flash&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;      &lt;span class="c1"&gt;// Best quality&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;gemini-2.5-flash-lite&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="c1"&gt;// Quality + speed  &lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;gemini-2.0-flash&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;      &lt;span class="c1"&gt;// Reliable backup&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;gemini-2.0-flash-lite&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;  &lt;span class="c1"&gt;// Last resort&lt;/span&gt;
&lt;span class="p"&gt;]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;
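
&lt;p&gt;For anyone wondering how a list like that gets used, here is a minimal sketch of walking it. It is not Rankify's actual code: &lt;code&gt;fakeCallModel&lt;/code&gt; and &lt;code&gt;extractWithFallback&lt;/code&gt; are hypothetical names, and the fake caller stands in for the real Gemini request so the example runs offline.&lt;/p&gt;

```typescript
// Model order copied from the list above.
const modelFallback = [
  'gemini-2.5-flash',
  'gemini-2.5-flash-lite',
  'gemini-2.0-flash',
  'gemini-2.0-flash-lite',
]

// Hypothetical stand-in for the real Gemini request.
// The two 2.5 models always fail here, to show the chain advancing.
async function fakeCallModel(model: string, prompt: string) {
  if (model.startsWith('gemini-2.5')) {
    throw new Error(`${model}: rate limited`)
  }
  return `[${model}] extracted: ${prompt}`
}

// Try each model in order and return the first success.
// If every model fails, throw one error that lists each failure.
async function extractWithFallback(
  prompt: string,
  callModel: typeof fakeCallModel,
  models: string[] = modelFallback,
) {
  const failures: string[] = []
  for (const model of models) {
    try {
      const text = await callModel(model, prompt)
      return { model, text }
    } catch (err) {
      failures.push(err instanceof Error ? err.message : String(err))
    }
  }
  throw new Error(`All models failed: ${failures.join('; ')}`)
}

extractWithFallback('Q1 text', fakeCallModel)
  .then((result) => console.log(result.model)) // gemini-2.0-flash
```

&lt;p&gt;Passing the caller in as a parameter keeps the loop testable without network access. In a real app it would be swapped for a function that sends the request.&lt;/p&gt;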



&lt;h3&gt;
  
  
  The Result? 🏆
&lt;/h3&gt;

&lt;p&gt;A working prototype that extracted 85 questions from a JEE sample paper in 3 minutes. Not perfect—but proof it was possible.&lt;/p&gt;




&lt;h2&gt;
  
  
  🔥 What I Learned
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Technical Wins 💻
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Gemini's vision capabilities are underutilized&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Most people use it for images. But PDFs? That's where the magic happens for educational tech.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Prompt engineering &amp;gt; Fine-tuning&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;I spent zero dollars on training. Just better prompts.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Client-side AI is viable&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Privacy-first architecture actually became a selling point.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  The Hard Lessons 😵
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Problem&lt;/th&gt;
&lt;th&gt;Solution&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Rate limits&lt;/td&gt;
&lt;td&gt;Implemented exponential backoff + model fallback&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;PDF parsing issues&lt;/td&gt;
&lt;td&gt;Added pdf.js preprocessing&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;JSON hallucinations&lt;/td&gt;
&lt;td&gt;Built a validation layer with confidence scores&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cost tracking&lt;/td&gt;
&lt;td&gt;Created real-time usage dashboard&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;
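
&lt;p&gt;The exponential backoff from the first row can be sketched roughly like this. The function names, the 1-second base, and the 30-second cap are illustrative assumptions, not values taken from Rankify.&lt;/p&gt;

```typescript
// Delay before retry number `attempt` (0-based): baseMs * 2^attempt, capped at maxMs.
// baseMs and maxMs are illustrative defaults, not values from Rankify.
function backoffDelayMs(attempt: number, baseMs = 1000, maxMs = 30000): number {
  return Math.min(maxMs, baseMs * 2 ** attempt)
}

// "Full jitter": wait a random time in [0, delayMs) so that many clients
// hitting the same rate limit do not all retry at the same moment.
function withJitter(delayMs: number, random: () => number = Math.random): number {
  return Math.floor(random() * delayMs)
}

const schedule = [0, 1, 2, 3, 4, 5].map((attempt) => backoffDelayMs(attempt))
console.log(schedule) // [1000, 2000, 4000, 8000, 16000, 30000]
```

&lt;p&gt;Each failed attempt doubles the wait until the cap is reached. Once the retries for one model run out, a fallback list can move on to the next model.&lt;/p&gt;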

&lt;h3&gt;
  
  
  Soft Skills That Stuck 🎯
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MVP-first thinking&lt;/strong&gt;: Ship fast, iterate faster&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;User empathy&lt;/strong&gt;: Students don't care about AI—they care about results&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Failure tolerance&lt;/strong&gt;: 70% of my experiments failed. That's okay.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  📝 My Google Gemini Feedback: The Good, The Bad, The Ugly
&lt;/h2&gt;

&lt;h3&gt;
  
  
  ✅ What Worked Brilliantly
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Multi-modal = Game changer&lt;/strong&gt; 🎨&lt;br&gt;
Processing text + images together meant diagrams weren't a separate problem.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Context caching&lt;/strong&gt; 💾&lt;br&gt;
Reduced costs by 60%. Essential for production apps.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Large context windows&lt;/strong&gt; 📏&lt;br&gt;
1M tokens = entire question papers in one go. Unreal.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;JSON mode&lt;/strong&gt; 📋&lt;br&gt;
Structured output that parses without cleanup, though the content still needs validation. So good.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  ⚠️ Where Google Could Improve
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;PDF parsing is inconsistent&lt;/strong&gt;&lt;br&gt;
Rotated pages, watermarks, and scanned docs still cause headaches.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Rate limits are tight&lt;/strong&gt;&lt;br&gt;
For a free tier, it's restrictive. I had to build a queuing system.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Error messages are cryptic&lt;/strong&gt;&lt;br&gt;
"Internal error" tells me nothing. Help us help you!&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Cost visibility&lt;/strong&gt;&lt;br&gt;
Real-time spend tracking would prevent surprises.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  😬 The Ugly Truth
&lt;/h3&gt;

&lt;p&gt;Sometimes Gemini &lt;strong&gt;hallucinates&lt;/strong&gt;. It sees a question it can't parse and just... makes up options. That's why confidence scoring exists now.&lt;/p&gt;
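
&lt;p&gt;To give a feel for what a 1-5 confidence rating could look like, here is a small sketch. The &lt;code&gt;ExtractedQuestion&lt;/code&gt; shape, the factors, and the penalties are all assumptions made for illustration, not Rankify's real algorithm. Structural checks like these cannot prove an option was invented, but they can flag the malformed output that deserves a manual look.&lt;/p&gt;

```typescript
// Hypothetical shape for one extracted question; field names are assumptions.
interface ExtractedQuestion {
  text: string
  options: string[]
}

// Rate a question from 1 (likely broken) to 5 (looks clean).
// The factors and penalties are illustrative only.
function confidenceScore(q: ExtractedQuestion, expectedOptions = 4): number {
  let score = 5
  const options = q.options.map((o) => o.trim())

  if (q.text.trim().length === 0) score -= 2               // missing question body
  if (options.length !== expectedOptions) score -= 2        // wrong number of options
  if (options.some((o) => o.length === 0)) score -= 1       // blank option
  if (new Set(options).size !== options.length) score -= 1  // duplicated option text

  return Math.max(1, score)
}

const clean = { text: 'What is 2 + 2?', options: ['3', '4', '5', '6'] }
const suspect = { text: 'What is 2 + 2?', options: ['4', '4', ''] }

console.log(confidenceScore(clean))   // 5
console.log(confidenceScore(suspect)) // 1
```

&lt;p&gt;Low scores are the ones worth surfacing in a review step before the test is generated.&lt;/p&gt;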




&lt;h2&gt;
  
  
  🚀 Where I'm Going Next
&lt;/h2&gt;

&lt;p&gt;The journey continues:&lt;/p&gt;

&lt;h3&gt;
  
  
  Phase 1: Cloud Run Deployment
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;gcloud run deploy rankify-ai \
  --source . \
  --region us-central1 \
  --no-allow-unauthenticated \
  --labels dev-tutorial=blog-devcommunity2026
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Phase 2: Agent Development Kit (ADK)
&lt;/h3&gt;

&lt;p&gt;Building specialized agents for different subjects:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Physics Agent 🧪&lt;/li&gt;
&lt;li&gt;Chemistry Agent ⚗️&lt;/li&gt;
&lt;li&gt;Math Agent 📐&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Phase 3: A2A Protocol
&lt;/h3&gt;

&lt;p&gt;Agent-to-agent communication for complex multi-step reasoning.&lt;/p&gt;




&lt;h2&gt;
  
  
  💡 To Anyone Building with Gemini
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Start with Vision&lt;/strong&gt; - It's more mature than you think&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Build for failure&lt;/strong&gt; - Every API call can fail&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Hide the AI&lt;/strong&gt; - Users want results, not technology&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Trust but verify&lt;/strong&gt; - Confidence scores are your friend&lt;/li&gt;
&lt;/ol&gt;




&lt;h2&gt;
  
  
  🎤 Final Thoughts
&lt;/h2&gt;

&lt;p&gt;Building Rankify taught me that AI isn't about replacing humans; it's about amplifying what we can do. A student in a small town with a cracked phone can now practice with the same quality as someone with expensive coaching.&lt;/p&gt;

&lt;p&gt;That's what keeps me building.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Thanks for reading! If you want to try Rankify, check out the live demo. And if you're entering the Built with Google Gemini: Writing Challenge—good luck! 🎉&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;#MLH #DEVChallenge #GoogleGemini #Vue #Nuxt #AI #WebDev&lt;/p&gt;

</description>
      <category>vue</category>
      <category>nuxt</category>
      <category>javascript</category>
      <category>gemini</category>
    </item>
  </channel>
</rss>
