<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: 4663437Mehdi</title>
    <description>The latest articles on Forem by 4663437Mehdi (@4663437mehdi).</description>
    <link>https://forem.com/4663437mehdi</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3933985%2Fd12e89b2-cc42-404c-b494-5ebd7577086c.png</url>
      <title>Forem: 4663437Mehdi</title>
      <link>https://forem.com/4663437mehdi</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/4663437mehdi"/>
    <language>en</language>
    <item>
      <title>Token Ledger Digest – 2026-05-20</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Wed, 20 May 2026 10:30:01 +0000</pubDate>
      <link>https://forem.com/4663437mehdi/token-ledger-digest-2026-05-20-50nn</link>
      <guid>https://forem.com/4663437mehdi/token-ledger-digest-2026-05-20-50nn</guid>
      <description>&lt;h1&gt;
  
  
  Token Ledger Digest – 2026-05-20
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Lead change – biggest cost impact&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Google Gemini Flash Latest&lt;/strong&gt; (&lt;code&gt;~google/gemini-flash-latest&lt;/code&gt;)

&lt;ul&gt;
&lt;li&gt;Prompt price rose from $0.50/1M to $1.50/1M (+$1.00/1M).
&lt;/li&gt;
&lt;li&gt;Completion price rose from $3.00/1M to $9.00/1M (+$6.00/1M).
&lt;/li&gt;
&lt;li&gt;
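&lt;strong&gt;Who should care:&lt;/strong&gt; Teams running high‑volume inference on this model. Both the prompt and completion rates have tripled (+$1.00/1M and +$6.00/1M respectively), and the new rates match those listed for Gemini 3.5 Flash below. Consider alternatives, or trim prompt and completion lengths.&lt;/li&gt;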
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;
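
&lt;p&gt;To gauge what the Gemini Flash Latest change means for a budget, the per‑1M rates above can be applied to a monthly token volume. The sketch below is illustrative only: the workload figures (200M prompt tokens and 50M completion tokens per month) and the helper name &lt;code&gt;monthly_cost&lt;/code&gt; are assumptions, not values from the ledger data.&lt;/p&gt;

```python
from decimal import Decimal

# Prices in USD per 1M tokens, taken from the digest entry above.
OLD = {"prompt": Decimal("0.50"), "completion": Decimal("3.00")}
NEW = {"prompt": Decimal("1.50"), "completion": Decimal("9.00")}


def monthly_cost(prices, prompt_tokens, completion_tokens):
    """Return the USD cost of the given token counts at per-1M prices."""
    million = Decimal(1_000_000)
    prompt_cost = prices["prompt"] * prompt_tokens / million
    completion_cost = prices["completion"] * completion_tokens / million
    return prompt_cost + completion_cost


# Hypothetical workload (an assumption, not from the digest).
PROMPT_TOKENS = 200_000_000
COMPLETION_TOKENS = 50_000_000

old_cost = monthly_cost(OLD, PROMPT_TOKENS, COMPLETION_TOKENS)
new_cost = monthly_cost(NEW, PROMPT_TOKENS, COMPLETION_TOKENS)
print(f"old: ${old_cost:.2f}  new: ${new_cost:.2f}  ratio: {new_cost / old_cost:.1f}x")
```

&lt;p&gt;Because both rates tripled, the total triples regardless of the prompt‑to‑completion mix; only the absolute dollar figure depends on the assumed volume.&lt;/p&gt;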

&lt;p&gt;&lt;strong&gt;Other price changes&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Z.ai GLM 5.1&lt;/strong&gt; (&lt;code&gt;z-ai/glm-5.1&lt;/code&gt;)  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt price dropped from $0.98/1M to $0.00/1M.
&lt;/li&gt;
&lt;li&gt;Completion price dropped from $3.08/1M to $0.00/1M.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Users can now run this model at zero token cost; ideal for cost‑sensitive prototypes or batch workloads.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Qwen: Qwen3.6 35B A3B&lt;/strong&gt; (&lt;code&gt;qwen/qwen3.6-35b-a3b&lt;/code&gt;)  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt price fell slightly from $0.15/1M to $0.149/1M (‑$0.001/1M).
&lt;/li&gt;
&lt;li&gt;Completion price unchanged at $1.00/1M.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Negligible impact; monitor for further drift.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Qwen: Qwen3.5‑35B‑A3B&lt;/strong&gt; (&lt;code&gt;qwen/qwen3.5-35b-a3b&lt;/code&gt;)  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt price fell from $0.14/1M to $0.139/1M (‑$0.001/1M).
&lt;/li&gt;
&lt;li&gt;Completion price unchanged at $1.00/1M.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Minimal effect; no action needed.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;New model added&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Google Gemini 3.5 Flash&lt;/strong&gt; (&lt;code&gt;google/gemini-3.5-flash&lt;/code&gt;)

&lt;ul&gt;
&lt;li&gt;Prompt price: $1.50/1M.
&lt;/li&gt;
&lt;li&gt;Completion price: $9.00/1M.
&lt;/li&gt;
&lt;li&gt;Context window: 1,048,576 tokens.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Developers needing very long contexts; compare pricing against other long‑context options.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Summary&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Total models tracked: 357. No other meaningful changes today.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-20" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger – 2026-05-19</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Tue, 19 May 2026 10:44:31 +0000</pubDate>
      <link>https://forem.com/4663437mehdi/the-token-ledger-2026-05-19-30eo</link>
      <guid>https://forem.com/4663437mehdi/the-token-ledger-2026-05-19-30eo</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger – 2026-05-19
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Most cost‑impacting change:&lt;/strong&gt; NVIDIA’s Nemotron 3 Super completion price fell from $0.50 to $0.45 per 1M tokens (‑$0.05/1M), a 10% reduction.&lt;/p&gt;

&lt;h3&gt;
  
  
  Price changes
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Model&lt;/th&gt;
&lt;th&gt;Prompt price ($/1M, old → new)&lt;/th&gt;
&lt;th&gt;Completion price ($/1M, old → new)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;NVIDIA: Nemotron 3 Super&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;0.10 → 0.09&lt;/td&gt;
&lt;td&gt;0.50 → 0.45&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Google: Gemma 4 26B A4B&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;0.07 → 0.06&lt;/td&gt;
&lt;td&gt;0.34 → 0.33&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;OpenAI: gpt-oss-120b&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;0.039 (unchanged)&lt;/td&gt;
&lt;td&gt;0.19 → 0.18&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Mistral: Mistral Nemo&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;0.02 (unchanged)&lt;/td&gt;
&lt;td&gt;0.04 → 0.03&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;No models were added or removed today. The cheapest models remain inclusionAI: Ling‑2.6‑flash, IBM: Granite 4.0 Micro, and Meta: Llama 3.1 8B Instruct (see source data for full list).&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-19" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger — May 17, 2026</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Sun, 17 May 2026 09:16:05 +0000</pubDate>
      <link>https://forem.com/4663437mehdi/the-token-ledger-may-17-2026-4pgn</link>
      <guid>https://forem.com/4663437mehdi/the-token-ledger-may-17-2026-4pgn</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger — May 17, 2026
&lt;/h1&gt;

&lt;p&gt;Four providers raised completion prices today; NVIDIA’s Nemotron 3 Super saw the largest absolute increase. No models were added or removed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;NVIDIA: Nemotron 3 Super (120B A12B)&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Prompt: $0.09/1M → $0.10/1M (+11.1%)&lt;br&gt;&lt;br&gt;
Completion: $0.45/1M → $0.50/1M (+$0.05, +11.1%)&lt;br&gt;&lt;br&gt;
Impact: Largest absolute price increase today. Relevant for agents and reasoning workflows.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Mistral: Mistral Nemo&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Prompt: unchanged at $0.02/1M&lt;br&gt;&lt;br&gt;
Completion: $0.03/1M → $0.04/1M (+33.3%)&lt;br&gt;&lt;br&gt;
Relative jump is steep, but absolute cost remains low. Relevant for lightweight local-style deployments.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Google: Gemma 4 26B A4B&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Prompt: $0.06/1M → $0.07/1M (+16.7%)&lt;br&gt;&lt;br&gt;
Completion: $0.33/1M → $0.34/1M (+3%)&lt;br&gt;&lt;br&gt;
Smaller absolute impact vs. Nemotron; still a 17% prompt hike.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;OpenAI: gpt-oss-120b&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Prompt: unchanged at $0.039/1M&lt;br&gt;&lt;br&gt;
Completion: $0.18/1M → $0.19/1M (+5.6%)&lt;br&gt;&lt;br&gt;
Marginal; unlikely to be noticed except at high volume.&lt;/p&gt;
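
&lt;p&gt;The percentage figures above follow from the old and new per‑1M prices. A minimal sketch of that arithmetic is shown below; the helper name &lt;code&gt;pct_change&lt;/code&gt; and the one‑decimal, round‑half‑up rounding are assumptions chosen for illustration, not a description of how the ledger itself computes them.&lt;/p&gt;

```python
from decimal import Decimal, ROUND_HALF_UP


def pct_change(old, new):
    """Percent change from old to new, rounded to one decimal place."""
    old, new = Decimal(old), Decimal(new)
    change = (new - old) / old * 100
    return change.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


# (model, field, old price, new price) in USD per 1M tokens, from the entry above.
CHANGES = [
    ("NVIDIA: Nemotron 3 Super", "prompt", "0.09", "0.10"),
    ("NVIDIA: Nemotron 3 Super", "completion", "0.45", "0.50"),
    ("Mistral: Mistral Nemo", "completion", "0.03", "0.04"),
    ("Google: Gemma 4 26B A4B", "prompt", "0.06", "0.07"),
    ("Google: Gemma 4 26B A4B", "completion", "0.33", "0.34"),
    ("OpenAI: gpt-oss-120b", "completion", "0.18", "0.19"),
]

for model, field, old, new in CHANGES:
    print(f"{model} {field}: {pct_change(old, new):+}%")
```

&lt;p&gt;Prices are passed as strings so that &lt;code&gt;Decimal&lt;/code&gt; stores them exactly; binary floats can shift a borderline value across a rounding boundary.&lt;/p&gt;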

&lt;p&gt;&lt;strong&gt;Cheapest models today&lt;/strong&gt; (by prompt price):  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;inclusionAI: Ling-2.6-flash — $0.01/1M prompt, $0.03/1M completion
&lt;/li&gt;
&lt;li&gt;IBM: Granite 4.0 Micro — $0.017/1M prompt, $0.112/1M completion
&lt;/li&gt;
&lt;li&gt;Meta: Llama 3.1 8B Instruct — $0.02/1M prompt, $0.05/1M completion
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Total tracked models: 356.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-17" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger Digest – 2026-05-16</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Sat, 16 May 2026 09:01:03 +0000</pubDate>
      <link>https://forem.com/4663437mehdi/the-token-ledger-digest-2026-05-16-3e1g</link>
      <guid>https://forem.com/4663437mehdi/the-token-ledger-digest-2026-05-16-3e1g</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger Digest – 2026-05-16
&lt;/h1&gt;

&lt;p&gt;No meaningful changes today.&lt;/p&gt;

&lt;h2&gt;
  
  
  Cheapest models (per 1M tokens)
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;inclusionAI: Ling-2.6-flash&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What changed: —
&lt;/li&gt;
&lt;li&gt;Prompt: $0.01 / 1M; Completion: $0.03 / 1M
&lt;/li&gt;
&lt;li&gt;Who should care: Developers seeking the lowest‑cost inference for short‑to‑medium prompts.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Mistral: Mistral Nemo&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What changed: —
&lt;/li&gt;
&lt;li&gt;Prompt: $0.02 / 1M; Completion: $0.03 / 1M
&lt;/li&gt;
&lt;li&gt;Who should care: Teams needing a balanced low‑cost model with strong multilingual ability.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Meta: Llama 3.1 8B Instruct&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What changed: —
&lt;/li&gt;
&lt;li&gt;Prompt: $0.02 / 1M; Completion: $0.05 / 1M
&lt;/li&gt;
&lt;li&gt;Who should care: Users who want a widely‑available 8B model at minimal expense.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-16" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>Token Ledger – 2026-05-15</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Fri, 15 May 2026 23:37:01 +0000</pubDate>
      <link>https://forem.com/4663437mehdi/token-ledger-2026-05-15-1mpl</link>
      <guid>https://forem.com/4663437mehdi/token-ledger-2026-05-15-1mpl</guid>
      <description>&lt;h1&gt;
  
  
  Token Ledger – 2026-05-15
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;356 models added, 0 removed, 0 price changes.&lt;/strong&gt; The largest influx on record reframes the cost landscape. Leading the batch is a 1-trillion-parameter model at sub-dollar rates.&lt;/p&gt;

&lt;h2&gt;
  
  
  Most cost-impacting addition
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;inclusionAI: Ring-2.6-1T&lt;/strong&gt; – $0.075 / 1M input, $0.625 / 1M output, 262k context.&lt;br&gt;&lt;br&gt;
A 1T-parameter Mixture-of-Experts model at this price point is unusual. For reference, comparable-scale models typically cost 5-10× more. Developers running high-volume reasoning workloads should consider testing it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Other notable low-cost entries
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;IBM: Granite 4.1 8B&lt;/strong&gt; – $0.05 / 1M input, $0.10 / 1M output, 131k context. Among the cheapest 8B models tracked.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Google: Gemini 3.1 Flash Lite&lt;/strong&gt; – $0.25 / 1M input, $1.50 / 1M output, 1M context. Largest context-to-cost ratio on a production model.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Perceptron: Perceptron Mk1&lt;/strong&gt; – $0.15 / 1M input, $1.50 / 1M output, 32k context. New entrant at the ultra-budget tier.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;xAI: Grok 4.3&lt;/strong&gt; – $1.25 / 1M input, $2.50 / 1M output, 1M context. Lower than Grok 4.2 pricing.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Premium tier
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Anthropic: Claude Opus 4.7 (Fast)&lt;/strong&gt; – $30 / 1M input, $150 / 1M output, 1M context. Fast variant of Opus.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;OpenAI: GPT Chat Latest&lt;/strong&gt; – $5 / 1M input, $30 / 1M output, 400k context. New default chat model.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Free models added
&lt;/h2&gt;

&lt;p&gt;Baidu Qianfan CoBuddy, NVIDIA Nemotron 3 Nano Omni, Poolside Laguna XS.2 &amp;amp; M.1, and OpenRouter Owl Alpha are available at zero cost.&lt;/p&gt;

&lt;p&gt;These additions bring the tracked total to 356 models. No existing model prices changed.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-15" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
  </channel>
</rss>
