<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Richard Joseph Porter</title>
    <description>The latest articles on Forem by Richard Joseph Porter (@richardporter).</description>
    <link>https://forem.com/richardporter</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3731433%2F13b75c13-f343-4731-8cf8-eb79566e0831.jpg</url>
      <title>Forem: Richard Joseph Porter</title>
      <link>https://forem.com/richardporter</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/richardporter"/>
    <language>en</language>
    <item>
      <title>Claude Code Token Management: 8 Strategies to Save 50-70% on Pro Plan</title>
      <dc:creator>Richard Joseph Porter</dc:creator>
      <pubDate>Wed, 28 Jan 2026 14:43:46 +0000</pubDate>
      <link>https://forem.com/richardporter/claude-code-token-management-8-strategies-to-save-50-70-on-pro-plan-3hob</link>
      <guid>https://forem.com/richardporter/claude-code-token-management-8-strategies-to-save-50-70-on-pro-plan-3hob</guid>
      <description>&lt;p&gt;If you're on Claude Code's Pro plan ($20/month), you've probably hit usage limits mid-session. Here are 8 proven strategies to stretch your tokens while maintaining code quality.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Quick Reference:&lt;/strong&gt; &lt;code&gt;/clear&lt;/code&gt; (fresh start) | &lt;code&gt;/compact&lt;/code&gt; (summarize) | &lt;code&gt;/context&lt;/code&gt; (check usage) | Target: &amp;lt;30K tokens per session&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  1. Master Context Commands
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;&lt;code&gt;/clear&lt;/code&gt;&lt;/strong&gt; — Use between unrelated tasks. Don't carry auth refactor context into CSS work.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;code&gt;/compact&lt;/code&gt;&lt;/strong&gt; — Summarize at 70% capacity, don't wait for auto-compact at 95%.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;/compact summarize only architectural decisions, omit debugging attempts
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  2. Keep CLAUDE.md Lean
&lt;/h2&gt;

&lt;p&gt;Your CLAUDE.md loads on &lt;em&gt;every&lt;/em&gt; prompt. Keep it under 150 tokens:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Short bullet points, not paragraphs&lt;/li&gt;
&lt;li&gt;Project facts Claude &lt;em&gt;needs&lt;/em&gt; to know&lt;/li&gt;
&lt;li&gt;Forbidden directories (node_modules, dist, .git)&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  3. Be Surgical with File References
&lt;/h2&gt;

&lt;p&gt;❌ Token-wasteful:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Check my authentication code for bugs
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;✅ Token-efficient:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Check @src/api/auth.js for the JWT validation bug in verifyUser
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  4. Manage MCP Servers Dynamically
&lt;/h2&gt;

&lt;p&gt;Each enabled server consumes context even when idle. Linear alone eats ~14K tokens.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;@brave-search disable
/mcp  # toggle servers interactively
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  5. One Task Per Session
&lt;/h2&gt;

&lt;p&gt;The golden rule: &lt;strong&gt;One task, one session.&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;code&gt;/clear&lt;/code&gt; → fresh start&lt;/li&gt;
&lt;li&gt;Work on single task&lt;/li&gt;
&lt;li&gt;Commit to Git&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;/clear&lt;/code&gt; → next task&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  6. Reset Every 20 Iterations
&lt;/h2&gt;

&lt;p&gt;Performance degrades in long conversations. Clear proactively rather than waiting for quality to drop.&lt;/p&gt;

&lt;h2&gt;
  
  
  7. Write Token-Efficient Prompts
&lt;/h2&gt;

&lt;p&gt;❌ Vague:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Make the login system better
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;✅ Specific:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;1. Add rate limiting (5 attempts/15 min)
2. Implement JWT rotation
3. No other changes
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  8. Use GitIngest for Large Repos
&lt;/h2&gt;

&lt;p&gt;Instead of loading files directly, use &lt;a href="https://gitingest.com" rel="noopener noreferrer"&gt;gitingest.com&lt;/a&gt; to get optimized summaries. Users report &lt;strong&gt;98% token savings&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  TL;DR
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Strategy&lt;/th&gt;
&lt;th&gt;Impact&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;code&gt;/clear&lt;/code&gt; between tasks&lt;/td&gt;
&lt;td&gt;High&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Lean CLAUDE.md (&amp;lt;150 tokens)&lt;/td&gt;
&lt;td&gt;High&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;code&gt;@&lt;/code&gt; file references&lt;/td&gt;
&lt;td&gt;Medium&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Disable unused MCP servers&lt;/td&gt;
&lt;td&gt;Medium&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Reset every 20 iterations&lt;/td&gt;
&lt;td&gt;Medium&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Start with just 2-3 techniques. Most developers cut consumption by &lt;strong&gt;50-70%&lt;/strong&gt; with &lt;code&gt;/clear&lt;/code&gt; discipline and a good CLAUDE.md alone.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;This is a summarized version. For the complete guide with advanced techniques, templates, and FAQ, read the &lt;a href="https://richardporter.dev/blog/claude-code-token-management" rel="noopener noreferrer"&gt;full article on my blog&lt;/a&gt;.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>claudecode</category>
      <category>ai</category>
      <category>productivity</category>
      <category>webdev</category>
    </item>
  </channel>
</rss>
