DEV Community

Cover image for 2025s Best AI Coding Tools: Real Cost, Geeky Value & Honest Comparison
Steven Gonsalvez
Steven Gonsalvez

Posted on • Edited on

3

2025s Best AI Coding Tools: Real Cost, Geeky Value & Honest Comparison

Missed Part 1? This piece builds on Beyond the Hype: What Truly Makes an AI a Great Coding Partner.

Contents

Best AI Coding Tools in 2025: Cost vs Value Showdown {#best-ai-coding-tools-in-2025-cost-vs-value-showdown}

If our first article asked "What makes a great AI coding partner?" this follow‑up is more of "Cool, but how much will it cost me and is it worth it?"

Developers are living inside Ferris Bueller's Law of Software: "Code moves pretty fast. If you don't stop and priceshop once in a while, you could blow your entire budget."

In this guide we map the free to premium landscape of AI development tools, spotlight the quirks that make each product lovable (or rage‑quit inducing) and will try to wrap with a monster comparison table you can attempt to make sense of.


Free AI Coding Tools That Punch Above Their (Skint) Weight {#free-ai-coding-tools-that-punch-above-their-skint-weight}

1. Gemini AI Studio with Gemini 2.5 pro(1 M Token Context) {#1-gemini-ai-studio-with-gemini-2-5-pro-1-m-token-context}

Want to spin up things fast with a 1 million token context window (soon to be 2 million)? This is your jam. You can practically get small sized apps fully built in here ...completely free!!

📚 Geek Corner cheeky hack to build an MCP server
Recipe Card: Instant  build an MCP Server
1. gitingestTypeScript SDK
2. Paste the output into Gemini chat
3. Add prompt + rules + Open API spec of the tool
4. Hit ↵ and snag server.ts
5. Boom! Bob's your uncle
📚 Geek Corner
Big‑O Budgeting: A 1million context sounds infinite, yet it's still (O(n)) paste‑work. Remember: context ≠ competence.

Gemini AI Studio hands you a one‑million token context on the 2 .5 Pro model, genuinely generous!. The flip side is that it remains pure chat with no agentic capabilities. So obviously, you'll be doing a bit of copy-pasting and some back and forth. It supports a few integrations (YouTube, Drive - classic Google), but it's nowhere near truly "agentic." This is pure augmentation for engineers who know their onions. Non-engineers? They'd probably give up faster than a cat in a bathtub. It's not quite "vibe coding"; you need a plan.

1.1. Gemini APIs {#1-1-gemini-apis}

The free tier here is ridiculously generous. We're talking 10-30 requests per minute (depending on the model) for their SOTA models like Gemini 1.5 Pro (1 million context, soon 2 million!) and the newer Gemini 1.5 Flash Preview (1 million context), and maybe soon 2.5 Flash as well. This means a whole universe of AI apps and agentic coding clients (Cursor, Cline, Roocode, Aider, Goose, Trae, Windsurf ... you name it) can run on these APIs with BYO-AI. Sweet!

📚 Geek Corner
BYO-AI (Bring Your Own API key): Many AI tools offer a subscription but also let you plug in your own API key from a provider like Google or OpenAI. This can sometimes be more cost-effective or give you access to models the tool doesn't offer by default. It's like unlocking cheat codes in a game where everyone else is still reading the manual.

1.2. Other Google Goodies {#1-2-other-google-goodies}

  • Google Code Assist (Workspace Add-on/IDE extension): Free for a whopping 180,000 lines a day (or thereabouts). It's not the an agentic beast, but think of it as autocomplete on steroids: function completion, interactive chat with code files, the works. Useful for that everyday grind.
  • Google Code Assist GitHub Bot for PR Reviews: This thingie is smashing. Absolutely free PR reviews (summaries, inline suggestions and the whole lot), and I haven't hit any limits yet. Works for private and public repos. The reviews are solid, especially for JavaScript and Python. And get this – it even adds in some trivia in the reviews. I'm a sucker for that kind of fun. A definite recommend, especially when other PR review tools (Coderabbit, Bito,sourcery) cost an arm and a leg.

gemini code assist trivia

2. GitHub Copilot (Free Tier) {#2-github-copilot-free-tier}

(Now practically baked into VSCode). The free version of Copilot? Meh. It's good, but fairly limited. You get about 50 "agentic" requests per month. Really? Fifty? That's not "free," that's a "please buy me" sample. It's the AI equivalent of Costco giving you half a sausage on a toothpick and calling it lunch. It's barely enough to explore the tool, let alone get any serious work done. The code completion (the "autocomplete on steroids" part) is a bit more generous with around 2000 completions, which might last you a week if you're frugal.


Affordable AI Code Assistants-Where to Spend Your First $10? {#affordable-ai-code-assistants-where-to-spend-your-first-10}

1. GitHub Copilot Pro ~£8/mo {#1-github-copilot-pro-8-mo}

Continuing with GitHub, the Pro version is probably the cheapest of the dedicated coding assistants at $10/month for 300 "advanced" requests. From my usage, it's still a bit subpar to the likes of Cursor and Windsurf when it comes to precision, usage flow and "agentic" IDE capabilities, and definitely a notch below opensource plugins like Cline or Roocode.

However, two features are genuinely impressive and pretty unique right now:

  • Integrated Voice Plugin: You can literally talk to your code editor chat. The speech recognition, which uses a local model, is brilliant. Seriously, it's top-notch.
  • Accessibility - Text-to-Speech: VSCode can read out explanations from the chat (vscode://settings/accessibility.voice.autoSynthesize). This is fantastic if you don't want to squint at your screen or if you're multitasking. Bonus: you can set up a keyword kickoff like "Hey Code," and its pickup is often better than "OK Google" or "Hey Siri."

And a second brilliant thing: it automatically picks up my MCP server config that I've set up for Claude Desktop. Boom! That's some seriously improved developer experience (DevX) compared to fiddling with different MCP JSON configs for every darn tool. Before you know it, you've got MCP JSON sprawl – it's a real condition, look it up (okay, don't, I made it up, but it feels real).

📚 Geek Corner
MCP (Model Context Protocol): A standardized way for AI tools to discover and interact with capabilities offered by other tools or services. Think of it as a universal translator and remote control for AI agents, allowing them to use external "limbs" like web browsers, file systems, or even other AIs. It's key for building more complex agentic systems. Find out More on the MCP series

2. Cursor / Windsurf {#2-cursor-windsurf}

I'm lumping these two together because they occupy a similar space in my mental AI map.

  • Cursor: A bit superior in my tests and overall feel. It's slightly pricier at ~£15/month for 500 "premium" requests (GPT-4 , claude sonnet3.7 and similar level).
  • Windsurf: Costs ~£11/month for a similar 500 requests.

I felt that doing roughly the same agentic tasks (Windsurf calls this "cascade"), Windsurf burned through requests faster. It's a bit chattier and tends to over-explain basic stuff, which is sometimes welcome, but often you just want it to get on with it. So, the pricing might even out in the end.

Beyond the initial quota, usage is priced similarly. Cursor has per-call overages, which feels more transparent. Windsurf offers slabs like $10 for an extra 250 requests. And, of course, if you're hitting top-tier reasoning models like GPT-4.5 Turbo or O3, those chew through your request credits faster (like 2 or 3 "requests" per actual call).

A word of warning on Cursor's "slow requests": Once you burn through your 500 premium requests, you get relegated to "slow" mode. It's not too bad if you're a night owl, an early bird, or coding during a siesta. But during peak times? The throttling on slow requests is unbearable. There was this one time, in a fit of situational anxiety and slow-request-induced frustration, I accidentally snipped my headphone wires. Turns out, using wire cutters as a fidget toy is a terrible idea. Don't be like me.


The Chat Platform Contenders: More Than Just Talk? {#the-chat-platform-contenders-more-than-just-talk}

Let's look at the big chat platforms and how they fare as coding sidekicks.

Microsoft Copilot (the general one): {#microsoft-copilot-the-general-one}

Honestly, it's a bit of an afterthought and so janky for serious coding unless you're deep in the Microsoft 365 ecosystem with Copilot Studio, which grants agentic powers within the Office suite. For standalone coding? Nah, give it a miss. It's like bringing a spork to a sword fight.

💰 £20/mo for the full suite, but coding features are more of a bonus than a core offering.

Gemini Chat (Advanced/Pro versions): {#gemini-chat-advanced-pro-versions}

Pretty much mirrors what the AI Studio offers in terms of coding grunt. The "Canvas" feature is cool for iterative code writing, maybe a bit cleaner than ChatGPT or Claude for previewing. That 1 million token context is a beast for prototyping decently complex web apps. But, it can't really integrate outside its sandbox (no backend for frontend calls to live APIs, for example). Still, solid for prototyping.

It also has other brilliant features like DeepResearch (Google's research capabilities are hands-down the best) and audio overviews with NotebookLM. You can do some fascinating stuff like creating your own agentic podcast or audiobooking your notes. Not strictly coding, but cool nonetheless. For coding, Gemini 2.5 Pro's reasoning is arguably more advanced than Claude 3.7 Sonnet's (Anthropic's "thinking" model). That said, Open AI o3 (and maybe GPT-4.5) still pip Gemini 2.5 Pro in some hardcore coding benchmarks (SWE-Bench, HLEval) – but the price difference is also monstrous. And this is not mentioning the 1 TB of storage you get with the pro offering. The pro from google packs a punch

💰 £19/mo for Gemini Advanced, which includes the full suite of features including the 1TB storage, advanced reasoning capabilities, and all the research tools. The free tier is quite generous too, with access to Gemini 1.5 Pro , a decent amount of 2.5 pro and other features.

ChatGPT (Plus) {#chatgpt-plus}

The context window is smaller (around 128k for GPT-4 Turbo, though it feels like it performs best with less), but it compensates with a broader range of integrations. The biggest downer? Neither ChatGPT nor Gemini Chat (the web UIs) properly support MCP out of the box.

ChatGPT Pro lets you build Custom GPTs, which is a somewhat horrible, janky way to integrate with external tools via APIs. It kinda works like MCP if you set it up meticulously, but you can only integrate one main tool per Custom GPT. So, you'll spend an eternity setting them up, with no versioning or proper management. You'll age faster than milk in the sun.

The standout feature? Integration from ChatGPT into apps. Relevant to coding, it can connect to VSCode/Cursor/Windsurf, text editors, terminals/iTerm, Android Studio/Xcode. The gotcha with IDEs: it can only access one file at a time. Yes, you read that right. One. Single. File. So, its context is limited, but at least changes can be propagated back to the tool (e.g., VSCode). The iTerm integration is one-way: ChatGPT can read everything in iTerm but can't execute commands. That would have been smashing, but alas.

💰 £20/mo for ChatGPT Plus

⭐⭐⭐ Claude Desktop: The Reigning Champion (In My Book) {#claude-desktop-the-reigning-champion-in-my-book}

This is my top dog, my go-to. It holds its own even against purpose-built IDEs. From "vibe coding" completely new projects agentically to making surgical patches in relatively large codebases (think ~200K lines of code), Claude Desktop is where it's at. Most agentic tools turn stupid or non-agentic real fast with codebases over 500K lines, or you just get AI slop if you don't switch from vibing to using it as a glorified assistant.

📚 Geek Corner
Vibe Coding: Coding by feeling and letting AI guess along with you. It's fun - until it isn't, and the AI starts hallucinating features you never asked for, like a rogue interior decorator suddenly deciding your app needs more glitter.

It has "full" MCP support (I say "full," but as an MCP client, it still doesn't support crucial parts like sampling, discovery, or notifications, which is a bit sad considering Anthropic pioneered MCP). Yet, it's still the best implementation I've used. Bolster it with context, rules, state, task management, and efficient prompting, it's probably as good as, if not better than, any on this list for complex, iterative development.

💰 Pricing: The pricing model is straightforward - £18/month for unlimited access to Claude 3.7 Sonnet (and few other integrations and usability features)

Things that Stand Out (Claude Desktop):

For 18 quid a month get a ridiculous amount of usage compared to other tools. It's not measured in tokens directly, but in "messages" based on context window usage, resetting every 5 hours. If you're vibing hard in a single chat window, the entire history gets added to the 200K context window of Claude. So, you'll hit the message limit pretty quickly – maybe within 40 minutes of intense agentic use with Sonnet.

Pro-Tip for Optimizing Claude Usage: Keep conversations small. Start new chats frequently, letting it re-read project state if needed, rather than continuing one massive thread. You can stretch your usage to almost twice or thrice that window – so, 2-3 hours of solid work within a 5-hour block ain't bad. When throttled on Sonnet, you can still use Haiku. Haiku is surprisingly solid for churn tasks: fixing TypeScript errors, making GitHub Actions pipelines, extracting code into common functions. Just don't ask Haiku to make changes to code related to state management; it gets lost worse than Windows searching for printer drivers.

📚 Geek Corner
Claude vs Cursor: Token Math
To put this into perspective with Cursor: Cursor's $20 gets you 500 premium requests. If each request uses Claude's max token size (let's say for a big operation, though it's usually less, but for argument, assume 10K tokens for a Claude 3.7 Sonnet equivalent call via API that Cursor might make for complex tasks), that's 500 × 10,000 = 5 million tokens for the entire month.

With Claude Desktop, I'd estimate I'm getting something closer to 2–3 million tokens per day with smart usage. It's not a direct comparison, I know, but the cost-value proposition for producing working software is just miles ahead with Claude Desktop.
  • Moderation: The level of moderation in Claude Desktop is impressive. As I wil cover in the next part of the MCP series, MCP can be a massive attack vector. Pretty much every other tool that supports MCP falls on its face and is ridiculously easy to compromise. Claude Desktop holds its own; there's some solid moderation happening under the hood.
  • Claudesync: This is a super handy companion tool. It helps save a lot of time from MCP reading and making sense of your project by offering compression and other smarts.

The BYO-API Crew: Maximum Control, Maximum Tweaking {#the-byo-api-crew-maximum-control-maximum-tweaking}

These tools generally don't have their own models; you plug in your API key (OpenAI, Anthropic, Google, etc.), often via OpenRouter for more flexibility.

Continue.dev: {#continue-dev}

Haven't used this one recently enough to rank it on current efficiency or cost. But it was one of the first semi-agentic tools I had in my VSCode. It supported lots of integrations with function calling (like browser fetching, Jira integration – key SDLC stuff) even before MCP was mainstream. I'll probably give it another shot and update my thoughts.

Roocode & Cline {#roocode-cline}

Roocode was a fork of Cline, born out of community demand (the power of open source, eh? Like when a popular mod becomes its own game). The core mechanics are similar. Cline is rock-solid, but Roocode has enhanced system management and DevX.

  • Roocode's Boomerang Task Management: This is superb. Saves you from needing yet another tool like taskmaster or managing tasks externally for MCP. Roocode also makes tweaking system prompts dead easy, which is crucial because, let's be honest, many of these IDE tools are just fancy prompt engineering and context fetching wrapped around VSCode.
  • Cline: Being open source, it's also fairly easy to tinker with system prompts (something you can't easily do in Cursor/Windsurf).

Standouts for Cline & Roocode:

  • Pair Programming Feel: These tools, especially with their different interaction modes, genuinely feel more like pair programming. They ask intelligent questions back, making you feel they understand the code structure and composition. No sudden "let's go nuclear and rewrite everything!" suggestions that you sometimes get from Cursor or even Claude Desktop when they're having a moment.
  • Controllable Context: There's no hard context limitation, or rather, it's controllable. This applies to both reasoning and non-reasoning models. You can even mix and match models for different tasks (e.g., DeepSeek for reasoning/architecture and Claude Haiku for dev/debug modes) to slash costs. Because of better context handling, you often get fewer hallucinations and more precise suggestions.
  • Cline's MCP Marketplace: Cline has a slight edge with its MCP marketplace. That said, there are MCPs to install MCPs these days, so it's a bit like Inception.

💰 Pricing: Since you bring your own API key, the cost is entirely usage-based. Cline and Roocode are highly optimized for context stuffing and iterative problem-solving, but that means you can easily burn through tokens at a rapid pace. If you're not careful, you could be spending at the rate of £20 an hour (or more) during heavy, agentic sessions-especially with premium models like Claude or GPT-4. The upside: you have maximum control and flexibility. The downside: your bill can spike fast if you let the models chew through large contexts or run lots of multi-step tasks. For most devs, it's wise to keep an eye on your API dashboard and set usage caps if possible.

📚 Geek Corner
The Billion-Dollar Occam's Razor: Ever wonder what's really under the hood of your favorite "AI-powered" dev tool? Spoiler: it's mostly a glorified Mad Libs for nerds. Take a peek at this repo and you'll find that the secret sauce is just a pile of elaborate system prompts (with a bit of extra packaging and polish). Billions in valuation, and the magic is… really, really good prompt engineering. Somewhere, a prompt engineer is cackling while investors nod sagely at a 200-line YAML file that says "Act like a helpful coding assistant, but with more emojis."

Aider: The CLI Powerhouse {#aider-the-cli-powerhouse}

This is my close second favorite, maybe even my first on some days. Aider is not agentic. It's a pure, unadulterated CLI-based augmented AI coder. No bells, no whistles, just brilliant execution. And it's a CLI! Who doesn't love a CLI that actually works and makes you feel like a wizard?

📚 Geek Corner
Agentic vs. Augmented AI: Augmented AI helps you with specific tasks (e.g., "write this function," "find this bug"). Agentic AI can take broader goals ("refactor this module for performance," "build a user auth system"), break them down into steps, and execute them, often interacting with tools and your codebase more autonomously.

Things that Stand Out (Aider):

  • Context Fetching: Easily the best of the bunch. Cline and Roocode use similar methods treesitter and ripgrep, but Aider really gets it right. In comparison, Cursor, Windsurf, and other VS Code-based tools rely on VectorDBs and perform NN style searches on vectors. From experience, I can confidently say that treesitter combined with some form of fuzzy search consistently outperforms vector search approaches.
  • CLI FTW!: Being a CLI, you can wrap it, automate the automation, build bots – the sky's the limit. You could build your own auto-PR bot for small bug fixes. Imagine: Aider fixes a bug, then Google Code Assist Bot reviews it... evil laugh. (Here's one I whipped up earlier: https://github.com/stevengonsalvez/patchmycode - user to fill in)
  • Aider Benchmarks: The benchmarks Aider uses are a solid standard for ranking how well different models perform for coding tasks. From my experience using various models via OpenRouter with different tools, these benchmarks are scarily accurate to real-world experience.

💰 Pricing: Aider stands out as the best value among these tools. Because it isn't agentic and is highly optimized for context fetching, it uses far fewer tokens per session than agentic tools that chew through context windows and run multi-step tasks. With Aider, your costs are almost entirely determined by the efficiency of your prompts and the model you choose-no hidden overhead, no runaway token usage. In practice, this means you can get hours of productive coding for just a few pounds or dollars, especially if you use cost-effective models like deep-seek and/or qwen.


Other Players on the Field {#other-players-on-the-field}

A quick look at a few others making waves, ripples, or let's be honest, barely noticeable bubbles no one asked for.

Trae (from ByteDance) {#trae-from-bytedance}

Imagine if Cursor had a long-lost cousin who showed up uninvited to the family reunion, wearing knockoff shoes and bragging about their "innovative" ideas. That's Trae. It's like someone at ByteDance saw Cursor, squinted, and said, "Yeah, we can copy that badly and in a hurry!" But hey, at least Trae's got one thing going for it: you can use it all day long because apparently, ByteDance forgot to install a proper throttle. If you're tired of burning through your Cursor or Windsurf premium requests on actual work, just dump your busywork on Trae and let it fumble through. Is it as good as Cursor or Windsurf? Absolutely not. But it's free, and sometimes you just need a tool that's good enough to get the job done… or at least make you appreciate the tools you're actually paying for.

Amazon Q {#amazon-q}

Oh, Amazon Q. Until recently, it was a big, sad dud. It lagged so far behind in augmented coding that CodeWhisperer (its predecessor) barely managed decent autocomplete. Single file edits, generating docs, and chatting with your current file – that was pretty much its resume. Then, the name change to Q must have brought some good fortune (or a kick up the AWS backside).

The IDE plugin is still, frankly, rubbish. But the Q CLI (my love for CLIs is showing again!) is starting to get interesting. Amazon absorbed Fig a few years back (one of the best terminal helpers ever, written in Rust, super slick – RIP original Fig) and relaunched parts of it. The Q CLI was good for autocomplete, but that was it. Then, boom, out of nowhere, Q CLI became an AI CLI with agentic capabilities. The latest release even supports MCP! It feels surprisingly agentic when it gets going.

There's still a lot of room for improvement. Much of it is manual (setting context, rules, system prompts). It feels like a basic AI agent plugged into an LLM API with an MCP client, some pre-provisioned S3 location, and a BuilderID auth system. But it's really fast and crisp to work with. Plus, it's open source, so there's potential.

It still can't hold a candle to Aider for context handling or code fixing, but Aider isn't agentic and doesn't do MCP. So, Q CLI has its niche. There's a bit of a free tier (I think there are limits). A downer is that we don't know what models are powering it (for reasoning/planning vs. execution). All other products are fairly open about this. (Peeking at their code, Amazon uses codewhisperer and amazonqdeveloper backend APIs. codewhisperer seems to be for local cli use , qdeveloper for cloudshell). I haven't tested the difference extensively; it's likely more about context added in the backend than model differences. But I get a sneaking suspicion that some reasoning might be happening with a Claude model, and some dev tasks with a lower-capability model (Claude Haiku, maybe?). This is just observational, especially when using MCPs.

💸 Price: £19/month for Pro - which is bonkers considering it's pretty half-baked compared to the competition. The free tier is a paltry 50 chats per month or something equally stingy.

That said, Q CLI is still probably the best suggestive autocomplete for the terminal out there; and that part is free.

Claude Code and OpenAI Codex CLI Tools {#claude-code-and-openai-codex-cli-tools}

Both of these tools are highly experimental, coming from the two big players in the space: Anthropic (Claude Code) and OpenAI (Codex CLI). From my brief trials, both are in such an early beta phase that they frequently veer off in odd directions, hallucinate or end up with shlop. They lack even basic parity with tools like Aider. Codex doesn't even support tools (MCP) yet.

Both do use a version of tree-sitter and some extra mechanisms to fetch context when none is specified - and that is pretty good(credit where it's due), but the rest feels very janky. I moved away from both quickly.

Worse, they seem to be much more expensive than any of the above, just to reach similar outputs (by the time you get a usable result).

One credit to OpenAI: Codex is open source. Claude Code is not (booo!).

GitHub Copilot CLI {#github-copilot-cli}

Needs a soft mention here. For free, it's a lovely little helper utility in your terminal for complex bash completions and simple queries. Super handy.

> ?? fetch me all kubernetes pods in the namespace that is using memory more than 10GB

Welcome to GitHub Copilot in the CLI!
version 1.1.0 (2025-02-10)

I'm powered by AI, so surprises and mistakes are possible. Make sure to verify any generated code or suggestions, and share feedback so that we can learn and improve. For more information, see https://gh.io/gh-copilot-transparency

Suggestion:

  kubectl get pods --namespace=<namespace> --field-selector=status.phase=Running -o=jsonpath='{range .items[?(@.status.containerStatuses[0].usage.memory > "10Gi")]}{.metadata.name}{"\n"}{end}'

? Select an option  [Use arrows to move, type to filter]
> Copy command to clipboard
  Explain command
  Execute command
  Revise command
  Rate response
  Exit
Enter fullscreen mode Exit fullscreen mode

The "Pure Vibe" AI Tools: Shiny, Pricey, and Mostly for Non-Coders {#the-pure-vibe-ai-tools-shiny-pricey-and-mostly-for-non-coders}

Now, let's talk about the "pure vibe" tools-the ones for people who don't care about code, just want to see something shiny on the screen, and whose only requirement is that the UX makes them feel like a tech visionary (or at least, not bored).

There's a tidal wave of these things flooding the market. I swear, a new one is born every time someone runs sudo apt update. Why? Because it's ridiculously easy to spin up a web app with Supabase, Convex, NocoDB, Appwrite, or whatever the latest "backend in a box" is. The result? Apps so minimal and generic, you'll half-expect them to greet you with "It works!" like a fresh Apache install.

But wait, there's more! Every single one of these tools comes with a catch: either the code is locked away in a vault, or you're chained to their backend ecosystem, or plot twist, you never even see the code at all. If you're an engineer or coder, run. Seriously, you'll hit a wall of frustration so hard you'll wish you were stuck in a never-ending Zoom call about quarterly KPIs instead.

And the price? Oh, they're expensive for what they do. Here are some of the ones I've tried briefly : lovable.dev, bolt.new, codev, softgen, replit, manus and hope (which, fittingly, is something you'll not have left after using it).
I've actually spent the most time with Replit-yes, I paid for a whole month, five months ago. At least Replit is a real dev ecosystem (cloud IDE and all that jazz). But the "agentic" building experience? It moves at the speed of continental drift, is impossible to steer, and the hallucinations are so wild you'll start to think they're a feature, not a bug. You might set out to build a food delivery app and end up with a blog about why Coke zero is superior to Pepsi Max. (Spoiler: it's not.)

So, TL;DR: If you're a coder, avoid these like you'd avoid a production deploy on a Friday. But if you're a non-coder or just here for the vibes, this is your playground. Enjoy the chaos!

👀 Lookout: The Rise of True Background Agents
One tool I'm seriously hyped to try is Augment - it's building a cult following for a reason. Augment was the first to unleash "background agents": real, persistent AI helpers you can assign low-value, high-churn tasks (think: squashing TypeScript errors, refactoring repetitive code, or migrating logic to a common function) - all grinding away in the background while you keep coding, reviewing, or just vibing in your flow.

Cursor is rolling out a similar feature in preview (haven't tried it yet), and this is where the next generation of tools will truly break away from the pack. This isn't just "agentic" as a buzzword - it's the literal, living embodiment: AI that works with you, not just for you, and never interrupts your vibe.

Bottom line: The era of "set it and forget it" background AI agents is here, and it's about to get wildly productive (and fun). Strap in! 🚀

PS: There are tons of other AI thingies I use around my coding/building/dev workflow (Pieces.db is one – check it out, especially if you're an Obsidian user!), but I'll cover those in my upcoming productivity setup series.


My main setup around these tools. {#my-main-setup-around-these-tools}

📚 Geek Corner
The Secret Sauce: Context, Rules, Prompts, Tools, Task Management

If you want to get the real power out of any AI coding tool, it's not about the tool itself, it's about how you set up your context, define your rules, craft your prompts, pick your tools, and manage your tasks. These are the levers that make the difference between "just using AI" and absolutely crushing it with AI; by a huge margin.

In the next post Productivity Series: My AI Dev Stack, Unleashed I'll break down exactly how I set up and combine these elements (with real setup variations), and how to use them across different tools for maximum effect. Stay tuned!

Tool-wise, here's the snapshot:

  1. small single scripts, simple MCP servers, prototypes:** Gemini AI Studio. Quick, easy, free.

  2. Any project starting from scratch (new codebases): Claude Desktop. (Obviously with a prep and wrappers, which I'll detail in the next blog in this series).

  3. Medium-sized codebase work (enhancing, debugging, refactoring) : - Aider or Cursor. I love a good CLI, so Aider is often my pick. But for simple to medium complexity, Cursor's agentic capabilities and integrated MCP client can save time over Aider.

  4. When I'm truly stuck in a rut, or Claude (in Cursor or Desktop) has started to suggest "lets go nuclear" (i.e., suggest crazy rewrites), or the codebase is large (150K+ lines): * **Roocode with OpenRouter* (using a combination of deepseek and Claude 3.7 - recently Gemini 2.5 pro).
    Be prepared: value your time saved against the money you'll spend, as this combo can get expensive quickly. I've had a couple of days where you could burn over 25 quid an hour. But when you need that breakthrough, it's often worth it.

💥 Chaos Corner: The "Let's Go Nuclear" Award
Ever innocently ask Claude for a little help with your useState logic? Next thing you know KABOOM 💣💥 !! it's pitching a full React state overhaul, TanStack, Redux, and maybe a GraphQL layer for good measure. It's like ordering a splash of oat milk and getting a cow, a pasture, and a subscription to the farm management monthly.

letsgonuclear.jpeg

Its true, was not joking
|

XKCD #303: Compiling


The Big Comparison Table: AI Coding Tool Showdown {#the-big-comparison-table-ai-coding-tool-showdown}

Okay, Ill try my best to distill some of this into a handy table. Prices are approximate and can change. "Requests" are a fuzzy metric, so take with a grain of salt.

Warning: This table is so wide it needs its own zip code. Scroll horizontally, hydrate, and maybe stretch first.

Tool / Platform Price / Free Tier Best Use Case / User Vibe Context Window Agentic / Augmented MCP Support Model Access Geek Corner / Meme 📚 Notable Quirk / Funniest Bit 😅 Downsides / 🚧 Symbols
Gemini AI Studio Free 🥇💸 Prototyping, students, hobbyists 1M+ tokens (soon 2M) Augmented No Gemini 2.5 Pro/Flash O(n) paste-work "So fast, it finished your code before you even thought of it. Also, now your toaster is sentient." No agentic, manual, copy-paste 🥇💸⚡
Gemini APIs Generous Free Tier 🥇🗝️ Backend for agentic tools, APIs 1M+ tokens API (for tools) N/A Gemini SOTA models Cheat code unlocked 📚 "BYO-API, but if you forget your key, it just sits there judging you." Need to integrate API yourself 🗝️⚡
Google Code Assist Free, 180k lines/day 💸 Autocomplete, PR reviews N/A Single purposed agent No Gemini/Google models Trivia in PRs! "Adds more fun facts to your PRs than Wikipedia on a sugar rush." No customisation, github only 💸
Copilot (Free) Free, 2k completions, 50/mo Sampling Copilot Limited Augmented (low) No GPT on the free "Costco sausage sample" "Just enough to taste, not enough to code. Like a demo disc from 1998." Very limited, instant paywall 😅
Copilot Pro £8/mo (~$10), 300/mo 🥇 Voice coding, VSCode warriors Low (~8k tokens) Augmented/Agentic (med), multi-modal input Rudimentary Most SOTA models MCP "auto-detect" superpower "Voice chat so good, you'll start apologizing to your computer when you make a typo." Agentic use is just fair (becoming better) 🥇
Cursor £15/mo, 500/mo, slow after 🔥 IDE agentic power-users ~10k tokens (has a max mode), uses vectorDB Agentic (high) 🤖, multi-modal input Yes All SOTA models Constant fights to make it read its own rules "Changes files like a toddler with a box of crayons: unpredictable and everywhere." Slow/annoying after quota, pricey 🤖🔥
Windsurf £11/mo, 500/mo Cursor alternative, chatty users ~8k tokens, uses vectorDB Agentic (medium-high), multi-modal input Yes GPT-4, Claude, etc. Capricious - can get moody "Talks so much, you'll wish for a mute button. Eats quota like popcorn." Chattier, eats quota quickly 🤖
Claude Desktop £18/mo unlimited, resets 🔥🥇 Big codebases, "vibe coding" 200k tokens/chat, manual context Agentic (high) 🤖 - with MCP, multi-modal input Yes Claude Sonnet 3.7/Haiku Vibe coding Kaboom! "Let's Go Nuclear" award. "Ask for a bug fix, get a new programming language." Message limit fast, restart threads 🤖🔥🥇
Aider (BYO-API) API cost only 💸🗝️🔥🥇, opensource 👐 CLI power-users, cost-conscious Full context length of models, uses treesitter Augmented No Any via OpenRouter CLI King, automation "Hours of coding for a few quid. Also, it's the only tool that will never ask you to 'try the GUI'." Not agentic, CLI only 🗝️🛠️💸
Cline/Roocode (BYO) API cost only 🗝️, Opensource 👐 Pair programming, MCP tinkerers Flexible to models max, uses treesitter and ripgrep Agentic (high) 🤖 , multi-modal input Yes Any via OpenRouter Closest to a pair programmer "Marketplace for MCPs-so meta, you'll need a prompt to find your prompts." Can eat into LLM API usage so fast you'll swear your credits just vanished like socks in a dryer 🗝️🤖🛠️
Amazon Q CLI/Pro £19/mo Pro, free 50/mo 🚧 , only cli is opensource 👐 Terminal autocomplete, CLI fans Unsure 🤷‍♂️ Agentic (CLI, low) 🤖, Not multimodal input Yes (CLI) Unclear (Claude/CodeWhisperer?) Resurrects "Fig" "Best terminal autocomplete, but the IDE experience is like using Notepad on a potato." Pricey for what you get 🤖🛠️🚧🤷‍♂️
Continue.dev Free / BYO-API 🗝️❓ IDE agentic integrations Unsure 🤷‍♂️ Agentic (varies) 🤖 Any "Oldest with MCP-like powers. Still waiting for its midlife crisis." 🤖❓
Trae Free, unlimited (sorta) 💸🚧 Busywork dumping, Cursor clone Unsure 🤷‍♂️ Agentic (med), but awful No Unknown "Cursor knockoff" "You'll appreciate the other tools you pay for! Also, sometimes it just gives up and takes a nap." Mediocre results, lacks polish, awful lot of failures 💸🚧
Claude Code/Codex CLI Experimental, BYO-API, Opensource 🗝️❓👐 Early stages Unsure 🤷‍♂️ Agentic (experimental) Claude code - yes, Codex - No Claude or GPT respectively Claude Code is closed source ... Boo!! "Open source (Codex), but so very janky" Unstable, buggy, pricey ❓👐
Copilot CLI Free 💸🛠️ Terminal helpers, Bash lovers N/A Augmented (low) No GitHub "??" for magic Bash "Explains CLI commands so well, I would need to retire my rubber duck." Not agentic, limited 💸🛠️
Pure Vibe Builders Pricey, often £15–30/mo 🚧 Non-coders, "just for vibes" Varies (often low) Minimal/None No Varies "Is it worth the time?" "So generic, you'll get a certificate of participation and a sticker that says 'I used an app!'" Pricey, code locked, not for coders 🚧

Table Footnotes & Symbol Guide

BYO-API Symbol Explained:

🗝️ = Bring Your Own API Key : Tools that let you connect your own LLM API (Google, OpenAI, Anthropic, Openrouter etc.) for ultimate cost control and flexibility. Great for tinkerers, but watch your token bill!

Most of these tools also support running local models via Ollama, LM Studio, or similar solutions. However, unless you have extremely powerful hardware, the quantized versions of even the best open models (like DeepSeek, Qwen, Llama, or Mistral) tend to deliver underwhelming results-usually only suitable for basic tasks like documentation generation. Notably, among these, only Mistral currently supports function calling, which is essential for true agentic workflows; the rest are limited to simple chat or completion modes. Even with the full version of Mistral, the performance still falls short-it's not even close to the top SOTA models in terms of coding ability or reasoning. For most serious development or agentic use, cloud-based SOTA models remain far ahead.

Agentic vs Augmented:

• Agentic (🤖): Can autonomously plan, break down, and execute multi-step coding tasks, sometimes even integrating tools, plugins, or APIs.

• Augmented: Helps you with specific tasks or code completions, but you still drive the workflow.


🚀 Super Tip: Voice Dictation & Next-Level Pairing
Supercharge your coding with dictation. Try SuperWhisper or Wispr Flow - both are fully free for dictation, run locally, and work with nearly every coding tool in this article. Dictation lets you code almost hands-free and take notes quickly, giving you the same voice advantage as premium features in other tools, but without the cost.
For tool creators reading: Imagine a dedicated, always-on voice agent-separate from your main coding assistant-just for brainstorming, rubber ducking, or talking through ideas while your main agent is busy fixing code. This is not just multitasking; it is about having a true pairing experience, where you can keep the conversation flowing and context-rich even as your tools work in the background. Raising the bar for collaborative and creative coding.


TL;DR {#tldr}

Students and hobbyists should milk Gemini AI Studio alongside Google Code Assist; everyday professionals will cover ninety percent of their work with Cursor/Copilot/Windsurf; big‑code wranglers swear by Claude Desktop and/or Roocode, where the price of admission is smaller than the therapy bill; and CLI devotees will find bliss pairing Aider with Q.

Keep an eye out for the next article in this series! I'll be sharing a deep dive into my personal AI dev stack, including not just the tools I use, but also the context, rules, and workflows that make them effective. Expect practical setup tips, real-world examples, and the "why" behind my choices: how I set boundaries for AI, what rules I follow to keep code maintainable, and how I integrate these tools into my daily development flow.

Parkinson's Law says work expands to fill the time available. Trust me, LLM bills expand to fill the credit limit available. Choose wisely, code boldly, and keep a wire cutter far from your headphones. 🎧✂️

Top comments (0)

👋 Kindness is contagious

Explore this insightful write-up, celebrated by our thriving DEV Community. Developers everywhere are invited to contribute and elevate our shared expertise.

A simple "thank you" can brighten someone’s day—leave your appreciation in the comments!

On DEV, knowledge-sharing fuels our progress and strengthens our community ties. Found this useful? A quick thank you to the author makes all the difference.

Okay