Forem

# llmsecurity

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Engineering teams keep granting agents production database writes

Engineering teams keep granting agents production database writes

Comments
8 min read
I Built a Policy Drift Detector for LLM Agents. Here's What Four Versions Taught Me.
Cover image for I Built a Policy Drift Detector for LLM Agents. Here's What Four Versions Taught Me.

Tracks gradual policy erosion across turns

I Built a Policy Drift Detector for LLM Agents. Here's What Four Versions Taught Me.

2
Comments 4
6 min read
The Three Layers Developers Miss When They “Swap Models” (And Why Proxy‑Routing Claude Code Breaks All of Them)
Cover image for The Three Layers Developers Miss When They “Swap Models” (And Why Proxy‑Routing Claude Code Breaks All of Them)

The Three Layers Developers Miss When They “Swap Models” (And Why Proxy‑Routing Claude Code Breaks All of Them)

11
Comments 1
3 min read
I Prompt Injected My Own GitHub README. Then I Built a Honeypot.
Cover image for I Prompt Injected My Own GitHub README. Then I Built a Honeypot.

I Prompt Injected My Own GitHub README. Then I Built a Honeypot.

2
Comments
17 min read
LLM Security Risks: Prompt Injection, Data Poisoning, and How to Defend Against Them

LLM Security Risks: Prompt Injection, Data Poisoning, and How to Defend Against Them

Comments
5 min read
The 73% Problem: Why Enterprise Prompt Injection Fixes Don't Work (And What Actually Does)

The 73% Problem: Why Enterprise Prompt Injection Fixes Don't Work (And What Actually Does)

Comments
6 min read
Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

Comments
10 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.