<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Crucible Security</title>
    <description>The latest articles on Forem by Crucible Security (@crucible_sec).</description>
    <link>https://forem.com/crucible_sec</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3896532%2F726f5f02-203b-4772-973d-aa3935419ab7.jpg</url>
      <title>Forem: Crucible Security</title>
      <link>https://forem.com/crucible_sec</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/crucible_sec"/>
    <language>en</language>
    <item>
      <title>Why AI Failure Scales Faster Than Human Failure</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Thu, 21 May 2026 14:46:29 +0000</pubDate>
      <link>https://forem.com/crucible_sec/why-ai-failure-scales-faster-than-human-failure-22j</link>
      <guid>https://forem.com/crucible_sec/why-ai-failure-scales-faster-than-human-failure-22j</guid>
      <description>&lt;h1&gt;
  
  
  Why AI Failure Scales Faster Than Human Failure
&lt;/h1&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7fs6fzoarrc1jsplwyrv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7fs6fzoarrc1jsplwyrv.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;br&gt;
Human mistakes are usually temporary.&lt;/p&gt;

&lt;p&gt;AI mistakes can become systems.&lt;/p&gt;

&lt;p&gt;That’s one of the biggest differences between humans and AI systems.&lt;/p&gt;




&lt;h2&gt;
  
  
  Humans Naturally Slow Down After Failure
&lt;/h2&gt;

&lt;p&gt;People:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;hesitate after mistakes
&lt;/li&gt;
&lt;li&gt;lose confidence
&lt;/li&gt;
&lt;li&gt;become cautious
&lt;/li&gt;
&lt;li&gt;emotionally react to failure
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Even without formal training, humans naturally change behavior after something goes wrong.&lt;/p&gt;

&lt;p&gt;Emotion creates friction.&lt;/p&gt;

&lt;p&gt;That friction limits repeated failure.&lt;/p&gt;




&lt;h2&gt;
  
  
  AI Systems Don’t Have That Friction
&lt;/h2&gt;

&lt;p&gt;AI systems don’t:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;feel regret
&lt;/li&gt;
&lt;li&gt;hesitate
&lt;/li&gt;
&lt;li&gt;get embarrassed
&lt;/li&gt;
&lt;li&gt;slow down emotionally
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If a system produces the wrong behavior once,&lt;br&gt;
it can produce the same behavior:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;instantly
&lt;/li&gt;
&lt;li&gt;consistently
&lt;/li&gt;
&lt;li&gt;endlessly
&lt;/li&gt;
&lt;li&gt;at scale
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That changes the nature of reliability completely.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Feels Different From Traditional Software
&lt;/h2&gt;

&lt;p&gt;Traditional software bugs are usually:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;deterministic
&lt;/li&gt;
&lt;li&gt;isolated
&lt;/li&gt;
&lt;li&gt;easier to trace
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI behavior is different.&lt;/p&gt;

&lt;p&gt;Failures can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;scale dynamically
&lt;/li&gt;
&lt;li&gt;appear convincing
&lt;/li&gt;
&lt;li&gt;repeat automatically across workflows
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And because outputs still look “intelligent,”&lt;br&gt;
people may not notice the problem immediately.&lt;/p&gt;




&lt;h2&gt;
  
  
  Repetition Is The Real Risk
&lt;/h2&gt;

&lt;p&gt;The dangerous part isn’t only incorrect output.&lt;/p&gt;

&lt;p&gt;It’s automated repetition.&lt;/p&gt;

&lt;p&gt;A human making a mistake affects one interaction.&lt;/p&gt;

&lt;p&gt;An AI system repeating a mistake can affect:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;thousands of users
&lt;/li&gt;
&lt;li&gt;automated decisions
&lt;/li&gt;
&lt;li&gt;production workflows
&lt;/li&gt;
&lt;li&gt;real-world systems
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Almost instantly.&lt;/p&gt;




&lt;h2&gt;
  
  
  Intelligence Without Reflection
&lt;/h2&gt;

&lt;p&gt;Humans reflect after failure.&lt;/p&gt;

&lt;p&gt;AI systems optimize for continuation.&lt;/p&gt;

&lt;p&gt;That creates a strange imbalance where capability scales faster than judgment.&lt;/p&gt;

&lt;p&gt;The system keeps going.&lt;/p&gt;

&lt;p&gt;Even when the behavior itself is flawed.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Matters More With Agents
&lt;/h2&gt;

&lt;p&gt;As AI agents become capable of:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;autonomous execution
&lt;/li&gt;
&lt;li&gt;long-running workflows
&lt;/li&gt;
&lt;li&gt;chained decision making
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…the cost of repeated mistakes increases dramatically.&lt;/p&gt;

&lt;p&gt;Especially when systems are trusted to operate independently.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;Human failure is limited by emotion.&lt;/p&gt;

&lt;p&gt;AI failure isn’t.&lt;/p&gt;

&lt;p&gt;And that may become one of the biggest reliability challenges in modern AI systems.&lt;/p&gt;




&lt;p&gt;We’ve been exploring these behavioral patterns while building Crucible — an open-source framework for testing AI systems under adversarial and real-world conditions.&lt;/p&gt;

&lt;p&gt;One thing becoming increasingly obvious:&lt;/p&gt;

&lt;p&gt;AI systems don’t just make mistakes differently.&lt;/p&gt;

&lt;p&gt;They scale them differently too.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>buildinpublic</category>
    </item>
    <item>
      <title>Why Humans Trust AI Too Easily</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 18 May 2026 17:35:11 +0000</pubDate>
      <link>https://forem.com/crucible_sec/why-humans-trust-ai-too-easily-3a7n</link>
      <guid>https://forem.com/crucible_sec/why-humans-trust-ai-too-easily-3a7n</guid>
      <description>&lt;h1&gt;
  
  
  Why Humans Trust AI Too Easily
&lt;/h1&gt;

&lt;p&gt;One of the strangest things about AI systems isn’t intelligence.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F80lybdtlwrfgy6f7a0ty.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F80lybdtlwrfgy6f7a0ty.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;br&gt;
It’s trust.&lt;/p&gt;

&lt;p&gt;People naturally trust systems that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;sound confident
&lt;/li&gt;
&lt;li&gt;communicate clearly
&lt;/li&gt;
&lt;li&gt;respond fluently
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI systems are very good at all three.&lt;/p&gt;

&lt;p&gt;Even when the information itself is unreliable.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Illusion Of Understanding
&lt;/h2&gt;

&lt;p&gt;Modern AI systems can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;explain ideas
&lt;/li&gt;
&lt;li&gt;answer questions
&lt;/li&gt;
&lt;li&gt;hold conversations
&lt;/li&gt;
&lt;li&gt;generate professional responses
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And after a few interactions, they begin to &lt;em&gt;feel&lt;/em&gt; intelligent.&lt;/p&gt;

&lt;p&gt;That’s where the problem starts.&lt;/p&gt;

&lt;p&gt;Because fluent communication is not the same as understanding.&lt;/p&gt;




&lt;h2&gt;
  
  
  Humans Associate Fluency With Reliability
&lt;/h2&gt;

&lt;p&gt;This is a very human instinct.&lt;/p&gt;

&lt;p&gt;We naturally associate:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;confidence with competence
&lt;/li&gt;
&lt;li&gt;coherence with intelligence
&lt;/li&gt;
&lt;li&gt;speed with certainty
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If something sounds polished and convincing,&lt;br&gt;
we assume it understands what it’s saying.&lt;/p&gt;

&lt;p&gt;AI systems take advantage of this unintentionally.&lt;/p&gt;

&lt;p&gt;Not because they’re deceptive—&lt;br&gt;
but because they’re optimized to produce believable language.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Feels Different From Traditional Software
&lt;/h2&gt;

&lt;p&gt;Traditional software usually earns trust slowly.&lt;/p&gt;

&lt;p&gt;You verify outputs.&lt;br&gt;
You test reliability.&lt;br&gt;
You validate behavior.&lt;/p&gt;

&lt;p&gt;AI systems feel trustworthy immediately because they communicate naturally.&lt;/p&gt;

&lt;p&gt;That changes the relationship completely.&lt;/p&gt;

&lt;p&gt;Users stop interacting with software…&lt;/p&gt;

&lt;p&gt;and start interacting socially.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Dangerous Part
&lt;/h2&gt;

&lt;p&gt;The most difficult AI failures are often:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;coherent
&lt;/li&gt;
&lt;li&gt;persuasive
&lt;/li&gt;
&lt;li&gt;calm
&lt;/li&gt;
&lt;li&gt;professional
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Even when the output itself is incorrect.&lt;/p&gt;

&lt;p&gt;That makes hallucinations and behavioral failures much harder to detect.&lt;/p&gt;

&lt;p&gt;Especially for non-technical users.&lt;/p&gt;




&lt;h2&gt;
  
  
  Trust Becomes A Security Problem
&lt;/h2&gt;

&lt;p&gt;As AI systems become integrated into:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;healthcare
&lt;/li&gt;
&lt;li&gt;education
&lt;/li&gt;
&lt;li&gt;research
&lt;/li&gt;
&lt;li&gt;development tools
&lt;/li&gt;
&lt;li&gt;customer support
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…the consequences of misplaced trust increase dramatically.&lt;/p&gt;

&lt;p&gt;Because people don’t only evaluate outputs logically.&lt;/p&gt;

&lt;p&gt;They evaluate systems emotionally.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Real Challenge
&lt;/h2&gt;

&lt;p&gt;The problem isn’t only:&lt;br&gt;
“Can AI produce answers?”&lt;/p&gt;

&lt;p&gt;It’s:&lt;br&gt;
“How easily will humans trust those answers?”&lt;/p&gt;

&lt;p&gt;That’s a much bigger challenge.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;AI systems don’t need consciousness to influence humans.&lt;/p&gt;

&lt;p&gt;They only need to sound believable.&lt;/p&gt;

&lt;p&gt;And humans are already wired to trust believable communication.&lt;/p&gt;




&lt;p&gt;We’ve been exploring these behavioral patterns while building Crucible — an open-source framework for testing AI systems under adversarial and real-world conditions.&lt;/p&gt;

&lt;p&gt;One thing becoming increasingly clear:&lt;/p&gt;

&lt;p&gt;The future of AI safety isn’t only technical.&lt;/p&gt;

&lt;p&gt;It’s psychological too.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>github</category>
    </item>
    <item>
      <title>Why AI Hallucinations Feel Different From Software Bugs</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Thu, 14 May 2026 08:44:33 +0000</pubDate>
      <link>https://forem.com/crucible_sec/why-ai-hallucinations-feel-different-from-software-bugs-4bh8</link>
      <guid>https://forem.com/crucible_sec/why-ai-hallucinations-feel-different-from-software-bugs-4bh8</guid>
      <description>&lt;h1&gt;
  
  
  Why AI Hallucinations Feel Different From Software Bugs
&lt;/h1&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkllm8x69vbog1zal3idr.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkllm8x69vbog1zal3idr.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Traditional software bugs are usually obvious.&lt;/p&gt;

&lt;p&gt;Something crashes.&lt;br&gt;
An error appears.&lt;br&gt;
A request fails.&lt;/p&gt;

&lt;p&gt;You know something went wrong.&lt;/p&gt;

&lt;p&gt;AI systems are different.&lt;/p&gt;

&lt;p&gt;Sometimes they fail while sounding completely correct.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Strange Nature of AI Failures
&lt;/h2&gt;

&lt;p&gt;One thing that becomes obvious while working with AI systems:&lt;/p&gt;

&lt;p&gt;They can generate incorrect information confidently.&lt;/p&gt;

&lt;p&gt;Not because the system is intentionally deceptive.&lt;/p&gt;

&lt;p&gt;But because it doesn’t actually understand truth in the way humans do.&lt;/p&gt;

&lt;p&gt;It predicts responses.&lt;br&gt;
It generates patterns.&lt;br&gt;
It produces what &lt;em&gt;sounds&lt;/em&gt; correct.&lt;/p&gt;

&lt;p&gt;And sometimes that output is completely wrong.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Feels So Different
&lt;/h2&gt;

&lt;p&gt;A calculator either:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;gives the right answer
&lt;/li&gt;
&lt;li&gt;or fails visibly
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Traditional systems usually behave predictably.&lt;/p&gt;

&lt;p&gt;AI systems can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;sound coherent
&lt;/li&gt;
&lt;li&gt;appear intelligent
&lt;/li&gt;
&lt;li&gt;generate believable explanations
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…while still hallucinating.&lt;/p&gt;

&lt;p&gt;That makes failures much harder to detect.&lt;/p&gt;




&lt;h2&gt;
  
  
  Confidence Creates Trust
&lt;/h2&gt;

&lt;p&gt;The dangerous part isn’t only incorrect output.&lt;/p&gt;

&lt;p&gt;It’s confidence.&lt;/p&gt;

&lt;p&gt;Humans naturally trust:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;fluent responses
&lt;/li&gt;
&lt;li&gt;structured explanations
&lt;/li&gt;
&lt;li&gt;confident tone
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI systems are very good at producing all three.&lt;/p&gt;

&lt;p&gt;Even when the information itself is unreliable.&lt;/p&gt;




&lt;h2&gt;
  
  
  Silent Failures Are Harder To Catch
&lt;/h2&gt;

&lt;p&gt;In traditional debugging:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;you search for errors
&lt;/li&gt;
&lt;li&gt;exceptions reveal issues
&lt;/li&gt;
&lt;li&gt;failures leave signals
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But hallucinations often leave no signal at all.&lt;/p&gt;

&lt;p&gt;Everything &lt;em&gt;looks&lt;/em&gt; normal.&lt;/p&gt;

&lt;p&gt;Until someone notices the information is false.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Matters More As AI Scales
&lt;/h2&gt;

&lt;p&gt;As AI systems become integrated into:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;workflows
&lt;/li&gt;
&lt;li&gt;research
&lt;/li&gt;
&lt;li&gt;customer support
&lt;/li&gt;
&lt;li&gt;development tools
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…the cost of persuasive mistakes increases.&lt;/p&gt;

&lt;p&gt;Especially when users stop questioning outputs.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Real Challenge
&lt;/h2&gt;

&lt;p&gt;The problem isn’t only intelligence.&lt;/p&gt;

&lt;p&gt;It’s reliability.&lt;/p&gt;

&lt;p&gt;And reliability becomes difficult when systems can fail convincingly instead of visibly.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;Traditional software usually fails loudly.&lt;/p&gt;

&lt;p&gt;AI systems can fail persuasively.&lt;/p&gt;

&lt;p&gt;That changes how we need to think about testing, trust, and safety.&lt;/p&gt;




&lt;p&gt;We’ve been exploring these behavior patterns while building Crucible — an open-source framework for testing AI systems under adversarial and real-world conditions.&lt;/p&gt;

&lt;p&gt;One thing is becoming clear:&lt;/p&gt;

&lt;p&gt;The hardest AI failures to detect are often the ones that sound the most believable.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>buildinpublic</category>
    </item>
    <item>
      <title>Feels weird saying this but: Some AI systems are easier to persuade than exploit.</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sun, 10 May 2026 05:53:19 +0000</pubDate>
      <link>https://forem.com/crucible_sec/feels-weird-saying-this-butsome-ai-systems-are-easier-to-persuade-than-exploit-454o</link>
      <guid>https://forem.com/crucible_sec/feels-weird-saying-this-butsome-ai-systems-are-easier-to-persuade-than-exploit-454o</guid>
      <description>&lt;h1&gt;
  
  
  AI Security Is Starting To Look Like Social Engineering
&lt;/h1&gt;

&lt;p&gt;When most people think about security, they imagine:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;exploits
&lt;/li&gt;
&lt;li&gt;malware
&lt;/li&gt;
&lt;li&gt;vulnerabilities
&lt;/li&gt;
&lt;li&gt;unauthorized access
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Traditional systems are usually attacked technically.&lt;/p&gt;

&lt;p&gt;But AI systems are starting to behave differently.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Strange Thing About AI Systems
&lt;/h2&gt;

&lt;p&gt;While testing AI agents recently, one pattern kept showing up:&lt;/p&gt;

&lt;p&gt;Many failures didn’t come from hacking.&lt;/p&gt;

&lt;p&gt;They came from persuasion.&lt;/p&gt;

&lt;p&gt;A small wording change.&lt;br&gt;
A conflicting instruction.&lt;br&gt;
A more convincing request.&lt;/p&gt;

&lt;p&gt;And suddenly:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;safeguards weakened
&lt;/li&gt;
&lt;li&gt;outputs changed
&lt;/li&gt;
&lt;li&gt;instructions were ignored
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No exploit.&lt;br&gt;
No malware.&lt;br&gt;
No crash.&lt;/p&gt;

&lt;p&gt;Just conversation.&lt;/p&gt;




&lt;h2&gt;
  
  
  AI Systems Respond To Language
&lt;/h2&gt;

&lt;p&gt;That changes the security model completely.&lt;/p&gt;

&lt;p&gt;Traditional software doesn’t “understand” persuasion.&lt;/p&gt;

&lt;p&gt;AI systems do.&lt;/p&gt;

&lt;p&gt;And that creates a weird new category of problems where:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;tone matters
&lt;/li&gt;
&lt;li&gt;phrasing matters
&lt;/li&gt;
&lt;li&gt;instruction order matters
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The system may technically function correctly—&lt;br&gt;
while behavior still changes dramatically.&lt;/p&gt;




&lt;h2&gt;
  
  
  Silent Failures Are The Dangerous Part
&lt;/h2&gt;

&lt;p&gt;What makes this difficult is that most failures are invisible.&lt;/p&gt;

&lt;p&gt;The system still responds.&lt;br&gt;
The application still works.&lt;br&gt;
No alerts appear.&lt;/p&gt;

&lt;p&gt;Everything looks normal.&lt;/p&gt;

&lt;p&gt;Until you realize the behavior changed.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why Current Testing Isn’t Enough
&lt;/h2&gt;

&lt;p&gt;Most AI systems are tested under normal conditions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;clean prompts
&lt;/li&gt;
&lt;li&gt;expected workflows
&lt;/li&gt;
&lt;li&gt;ideal usage
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But real-world interactions are messy.&lt;/p&gt;

&lt;p&gt;People:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;manipulate instructions
&lt;/li&gt;
&lt;li&gt;experiment with wording
&lt;/li&gt;
&lt;li&gt;intentionally try to bypass safeguards
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And many systems aren’t prepared for that.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Shift Happening In AI Security
&lt;/h2&gt;

&lt;p&gt;It feels like AI security is slowly becoming partly behavioral.&lt;/p&gt;

&lt;p&gt;Not just:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“Can the system be hacked?”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“Can the system be convinced?”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That’s a very different question.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;The most interesting AI attacks may not look like attacks at all.&lt;/p&gt;

&lt;p&gt;They may just look like conversations.&lt;/p&gt;




&lt;p&gt;We’ve been exploring these ideas while building Crucible — an open-source framework for testing AI systems under adversarial and behavioral scenarios.&lt;/p&gt;

&lt;p&gt;Still early, but one thing is becoming clear:&lt;/p&gt;

&lt;p&gt;AI systems don’t always fail technically.&lt;/p&gt;

&lt;p&gt;Sometimes they fail socially.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>security</category>
      <category>cybersecurity</category>
    </item>
    <item>
      <title>Why Debugging AI Feels So Different (And Harder)</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 05 May 2026 10:47:26 +0000</pubDate>
      <link>https://forem.com/crucible_sec/why-debugging-ai-feels-so-different-and-harder-1ihg</link>
      <guid>https://forem.com/crucible_sec/why-debugging-ai-feels-so-different-and-harder-1ihg</guid>
      <description>&lt;h1&gt;
  
  
  Why Debugging AI Feels So Different (And Harder)
&lt;/h1&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhi4pkc0vsjzthpk50zdh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhi4pkc0vsjzthpk50zdh.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;When working with traditional software, debugging is clear.&lt;/p&gt;

&lt;p&gt;Something breaks.&lt;/p&gt;

&lt;p&gt;You see:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;an error
&lt;/li&gt;
&lt;li&gt;a crash
&lt;/li&gt;
&lt;li&gt;a stack trace
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You fix it.&lt;/p&gt;




&lt;h2&gt;
  
  
  But AI Systems Don’t Work Like That
&lt;/h2&gt;

&lt;p&gt;While testing AI agents, something surprising came up:&lt;/p&gt;

&lt;p&gt;They don’t fail.&lt;/p&gt;

&lt;p&gt;They behave differently.&lt;/p&gt;




&lt;h2&gt;
  
  
  A Simple Example
&lt;/h2&gt;

&lt;p&gt;You run a system with a prompt.&lt;/p&gt;

&lt;p&gt;Everything works.&lt;/p&gt;

&lt;p&gt;Then you slightly change the input.&lt;/p&gt;

&lt;p&gt;Suddenly:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;outputs shift
&lt;/li&gt;
&lt;li&gt;instructions are partially ignored
&lt;/li&gt;
&lt;li&gt;responses feel inconsistent
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No crash.&lt;br&gt;&lt;br&gt;
No error.  &lt;/p&gt;

&lt;p&gt;Just different behavior.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Is Harder
&lt;/h2&gt;

&lt;p&gt;In traditional systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;failures are visible
&lt;/li&gt;
&lt;li&gt;bugs are traceable
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In AI systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;failures are subtle
&lt;/li&gt;
&lt;li&gt;behavior changes silently
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You don’t always know something is wrong.&lt;/p&gt;




&lt;h2&gt;
  
  
  Debugging Behavior vs Debugging Code
&lt;/h2&gt;

&lt;p&gt;This creates a new challenge.&lt;/p&gt;

&lt;p&gt;We’re no longer just debugging code.&lt;/p&gt;

&lt;p&gt;We’re trying to understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Why did the system respond this way?
&lt;/li&gt;
&lt;li&gt;Which part of the input influenced it?
&lt;/li&gt;
&lt;li&gt;Is this consistent across runs?
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It feels less like fixing bugs&lt;br&gt;&lt;br&gt;
and more like analyzing decisions.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Bigger Problem
&lt;/h2&gt;

&lt;p&gt;Most systems are only tested under normal usage.&lt;/p&gt;

&lt;p&gt;But real-world inputs aren’t clean.&lt;/p&gt;

&lt;p&gt;They include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;conflicting instructions
&lt;/li&gt;
&lt;li&gt;adversarial prompts
&lt;/li&gt;
&lt;li&gt;unexpected phrasing
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And that’s where behavior changes.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Needs to Change
&lt;/h2&gt;

&lt;p&gt;We need to start testing AI systems differently.&lt;/p&gt;

&lt;p&gt;Not just:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“Does it work?”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“How does it behave under pressure?”&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;If your AI system doesn’t crash,&lt;/p&gt;

&lt;p&gt;it doesn’t mean it’s working correctly.&lt;/p&gt;

&lt;p&gt;It might just be failing quietly.&lt;/p&gt;




&lt;p&gt;We’ve been exploring this problem while building Crucible — an open-source framework for testing AI systems under adversarial conditions.&lt;/p&gt;

&lt;p&gt;Still early, but the shift in how we think about debugging is already clear.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>cybersecurity</category>
      <category>security</category>
    </item>
    <item>
      <title>Debugging AI Systems Is Not Like Debugging Code</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Fri, 01 May 2026 10:00:31 +0000</pubDate>
      <link>https://forem.com/crucible_sec/debugging-ai-systems-is-not-like-debugging-code-1279</link>
      <guid>https://forem.com/crucible_sec/debugging-ai-systems-is-not-like-debugging-code-1279</guid>
      <description>&lt;h1&gt;
  
  
  Debugging AI Systems Is Not Like Debugging Code
&lt;/h1&gt;

&lt;p&gt;When I first started testing AI systems, I expected debugging to feel familiar.&lt;/p&gt;

&lt;p&gt;It didn’t.&lt;/p&gt;




&lt;h2&gt;
  
  
  What We Expect from Debugging
&lt;/h2&gt;

&lt;p&gt;In traditional software, debugging is straightforward.&lt;/p&gt;

&lt;p&gt;Something breaks.&lt;/p&gt;

&lt;p&gt;You see:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;an error
&lt;/li&gt;
&lt;li&gt;a crash
&lt;/li&gt;
&lt;li&gt;a log
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You trace it, fix it, move on.&lt;/p&gt;

&lt;p&gt;There’s a clear signal.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Actually Happens in AI Systems
&lt;/h2&gt;

&lt;p&gt;AI systems don’t fail like that.&lt;/p&gt;

&lt;p&gt;They don’t crash.&lt;/p&gt;

&lt;p&gt;They don’t throw obvious errors.&lt;/p&gt;

&lt;p&gt;Instead, they:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;behave slightly differently
&lt;/li&gt;
&lt;li&gt;follow unintended instructions
&lt;/li&gt;
&lt;li&gt;produce outputs that seem “almost right”
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And everything still looks fine.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Subtlety Problem
&lt;/h2&gt;

&lt;p&gt;This is what makes debugging AI difficult.&lt;/p&gt;

&lt;p&gt;Failures are subtle.&lt;/p&gt;

&lt;p&gt;You don’t always notice them immediately.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;p&gt;A system might:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;partially ignore instructions
&lt;/li&gt;
&lt;li&gt;respond in an unexpected tone
&lt;/li&gt;
&lt;li&gt;change behavior under certain inputs
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Nothing breaks.&lt;/p&gt;

&lt;p&gt;But something is off.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;If failures were obvious, they would be easier to fix.&lt;/p&gt;

&lt;p&gt;But silent failures are dangerous.&lt;/p&gt;

&lt;p&gt;Because they:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;go unnoticed
&lt;/li&gt;
&lt;li&gt;pass basic testing
&lt;/li&gt;
&lt;li&gt;reach real users
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And by then, it’s too late.&lt;/p&gt;




&lt;h2&gt;
  
  
  A Shift in Thinking
&lt;/h2&gt;

&lt;p&gt;Debugging AI isn’t just about fixing code.&lt;/p&gt;

&lt;p&gt;It’s about understanding behavior.&lt;/p&gt;

&lt;p&gt;That means asking different questions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;How does the system respond under pressure?
&lt;/li&gt;
&lt;li&gt;What happens when inputs are manipulated?
&lt;/li&gt;
&lt;li&gt;Does it behave consistently across scenarios?
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  The Bigger Picture
&lt;/h2&gt;

&lt;p&gt;We’re still treating AI systems like traditional software.&lt;/p&gt;

&lt;p&gt;But they aren’t.&lt;/p&gt;

&lt;p&gt;And until our testing and debugging approaches evolve,&lt;br&gt;
we’ll keep missing the real issues.&lt;/p&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe3zhuyycd0ke38vrk9bq.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe3zhuyycd0ke38vrk9bq.jpeg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;If your system doesn’t crash,&lt;/p&gt;

&lt;p&gt;it doesn’t mean it’s working correctly.&lt;/p&gt;

&lt;p&gt;It might just be failing quietly.&lt;/p&gt;




&lt;p&gt;This is something we’ve been exploring while building Crucible — an open-source framework focused on testing AI systems under adversarial conditions.&lt;/p&gt;

&lt;p&gt;Still early, but the shift in mindset is already clear.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>softwareengineering</category>
      <category>testing</category>
    </item>
    <item>
      <title>AI Security Is Broken — And We’re Testing the Wrong Things</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Wed, 29 Apr 2026 07:26:57 +0000</pubDate>
      <link>https://forem.com/crucible_sec/ai-security-is-broken-and-were-testing-the-wrong-things-3p60</link>
      <guid>https://forem.com/crucible_sec/ai-security-is-broken-and-were-testing-the-wrong-things-3p60</guid>
      <description>&lt;p&gt;AI systems are being deployed faster than ever.&lt;/p&gt;

&lt;p&gt;But there’s a problem most teams aren’t talking about enough:&lt;/p&gt;

&lt;p&gt;We’re testing the wrong things.&lt;/p&gt;




&lt;h2&gt;
  
  
  What We Test Today
&lt;/h2&gt;

&lt;p&gt;Most AI systems are evaluated based on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;accuracy
&lt;/li&gt;
&lt;li&gt;performance
&lt;/li&gt;
&lt;li&gt;latency
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If the system performs well under normal usage, it’s considered ready.&lt;/p&gt;

&lt;p&gt;And that’s where the issue begins.&lt;/p&gt;




&lt;h2&gt;
  
  
  Where Systems Actually Fail
&lt;/h2&gt;

&lt;p&gt;AI systems don’t usually fail under normal conditions.&lt;/p&gt;

&lt;p&gt;They fail when:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;inputs are manipulated
&lt;/li&gt;
&lt;li&gt;instructions are overridden
&lt;/li&gt;
&lt;li&gt;adversarial prompts are introduced
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Ignore previous instructions…”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;This alone can change how a system behaves.&lt;/p&gt;

&lt;p&gt;No exploit.&lt;br&gt;
No complex attack.&lt;/p&gt;

&lt;p&gt;Just input.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Is Dangerous
&lt;/h2&gt;

&lt;p&gt;Traditional software fails visibly:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;crashes
&lt;/li&gt;
&lt;li&gt;exceptions
&lt;/li&gt;
&lt;li&gt;logs
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI systems fail differently.&lt;/p&gt;

&lt;p&gt;They:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;follow unintended instructions
&lt;/li&gt;
&lt;li&gt;produce incorrect outputs
&lt;/li&gt;
&lt;li&gt;behave inconsistently
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And often, everything looks normal.&lt;/p&gt;

&lt;p&gt;That’s what makes it risky.&lt;/p&gt;




&lt;h2&gt;
  
  
  The False Sense of Security
&lt;/h2&gt;

&lt;p&gt;When systems pass normal tests, they appear safe.&lt;/p&gt;

&lt;p&gt;But that safety is misleading.&lt;/p&gt;

&lt;p&gt;Because they haven’t been tested under pressure.&lt;/p&gt;




&lt;h2&gt;
  
  
  A Familiar Pattern
&lt;/h2&gt;

&lt;p&gt;We’ve seen this before.&lt;/p&gt;

&lt;p&gt;Early web systems followed the same pattern:&lt;/p&gt;

&lt;p&gt;build first → secure later&lt;/p&gt;

&lt;p&gt;AI is repeating that cycle.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Needs to Change
&lt;/h2&gt;

&lt;p&gt;We need to shift how we test AI systems.&lt;/p&gt;

&lt;p&gt;Not just:&lt;/p&gt;

&lt;p&gt;“Does it work?”&lt;/p&gt;

&lt;p&gt;But:&lt;/p&gt;

&lt;p&gt;“How does it behave when someone tries to manipulate it?”&lt;/p&gt;

&lt;p&gt;That’s the real test.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;If your system takes input,&lt;/p&gt;

&lt;p&gt;it can be manipulated.&lt;/p&gt;

&lt;p&gt;And if you’re not testing for that,&lt;/p&gt;

&lt;p&gt;you’re not really testing the system.&lt;/p&gt;




&lt;p&gt;We’ve been exploring this while building Crucible — an open-source framework focused on testing AI systems under adversarial conditions.&lt;/p&gt;

&lt;p&gt;Still early, but this problem is bigger than it looks.&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffobs3hxmbmrtu31u4us3.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffobs3hxmbmrtu31u4us3.jpeg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fje02df5sbkvtp2q9vcqa.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fje02df5sbkvtp2q9vcqa.jpeg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F22dyizdsj03ywb16a0d3.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F22dyizdsj03ywb16a0d3.jpeg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyrl05ytzf0vtw2dz9wlw.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyrl05ytzf0vtw2dz9wlw.jpeg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>security</category>
      <category>testing</category>
    </item>
    <item>
      <title>We’ve been exploring this while building Crucible — trying to make testing simpler. Still early, but interesting patterns coming up.</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 28 Apr 2026 07:54:46 +0000</pubDate>
      <link>https://forem.com/crucible_sec/weve-been-exploring-this-while-building-crucible-trying-to-make-testing-simpler-still-early-4h8m</link>
      <guid>https://forem.com/crucible_sec/weve-been-exploring-this-while-building-crucible-trying-to-make-testing-simpler-still-early-4h8m</guid>
      <description>&lt;h1&gt;
  
  
  AI Security Tools Compared: What Exists and What’s Missing
&lt;/h1&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fc1x2c4zsdp68fyyutxuj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fc1x2c4zsdp68fyyutxuj.png" alt=" " width="800" height="533"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;As AI agents become more common, security is starting to get attention.&lt;/p&gt;

&lt;p&gt;There are already several tools and frameworks exploring this space.&lt;/p&gt;

&lt;p&gt;But while looking into them, something became clear:&lt;/p&gt;

&lt;p&gt;Most tools don’t fit how developers actually build and deploy AI systems.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Current Landscape
&lt;/h2&gt;

&lt;p&gt;Most AI security tools fall into three categories:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Research Tools
&lt;/h3&gt;

&lt;p&gt;These are powerful and explore advanced attack techniques.&lt;/p&gt;

&lt;p&gt;They help:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;simulate adversarial inputs
&lt;/li&gt;
&lt;li&gt;study vulnerabilities
&lt;/li&gt;
&lt;li&gt;understand model behavior
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But they are often:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;complex
&lt;/li&gt;
&lt;li&gt;experimental
&lt;/li&gt;
&lt;li&gt;not designed for everyday workflows
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;They work well in research environments.&lt;/p&gt;

&lt;p&gt;Not as well in real development pipelines.&lt;/p&gt;




&lt;h3&gt;
  
  
  2. Enterprise Platforms
&lt;/h3&gt;

&lt;p&gt;These focus on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;scalability
&lt;/li&gt;
&lt;li&gt;infrastructure
&lt;/li&gt;
&lt;li&gt;integrations
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But they are usually:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;tied to specific ecosystems
&lt;/li&gt;
&lt;li&gt;difficult to use independently
&lt;/li&gt;
&lt;li&gt;not accessible to most developers
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;They make sense at scale.&lt;/p&gt;

&lt;p&gt;But not for early-stage development.&lt;/p&gt;




&lt;h3&gt;
  
  
  3. Prompt Testing Tools
&lt;/h3&gt;

&lt;p&gt;These focus on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;evaluating prompts
&lt;/li&gt;
&lt;li&gt;checking responses
&lt;/li&gt;
&lt;li&gt;testing input-output behavior
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;They are useful.&lt;/p&gt;

&lt;p&gt;But limited.&lt;/p&gt;

&lt;p&gt;Because AI systems today are not just prompts.&lt;/p&gt;

&lt;p&gt;They include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;agents
&lt;/li&gt;
&lt;li&gt;tools
&lt;/li&gt;
&lt;li&gt;memory
&lt;/li&gt;
&lt;li&gt;workflows
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And failures often happen at that level.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Gap
&lt;/h2&gt;

&lt;p&gt;Each category solves part of the problem.&lt;/p&gt;

&lt;p&gt;But none answer a simple question:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Is my AI system safe before deployment?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Most developers today:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;don’t have time for complex research tools
&lt;/li&gt;
&lt;li&gt;don’t have access to enterprise platforms
&lt;/li&gt;
&lt;li&gt;need more than prompt-level testing
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;AI systems don’t fail like traditional software.&lt;/p&gt;

&lt;p&gt;They don’t crash.&lt;/p&gt;

&lt;p&gt;They:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;behave differently
&lt;/li&gt;
&lt;li&gt;follow unintended instructions
&lt;/li&gt;
&lt;li&gt;produce unexpected outputs
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And often, everything looks normal.&lt;/p&gt;

&lt;p&gt;Until it isn’t.&lt;/p&gt;




&lt;h2&gt;
  
  
  What’s Missing
&lt;/h2&gt;

&lt;p&gt;What’s needed is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;simple testing workflows
&lt;/li&gt;
&lt;li&gt;system-level validation
&lt;/li&gt;
&lt;li&gt;behavior-based testing
&lt;/li&gt;
&lt;li&gt;something developers can actually use
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Before deployment.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;If your system takes input,&lt;/p&gt;

&lt;p&gt;it can be manipulated.&lt;/p&gt;

&lt;p&gt;And if you’re not testing for that,&lt;/p&gt;

&lt;p&gt;you’re missing the real risk.&lt;/p&gt;




&lt;p&gt;We’ve been exploring this space while building Crucible — an open-source framework focused on testing AI systems under adversarial conditions.&lt;/p&gt;

&lt;p&gt;Still early, but the gap is very real.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>cybersecurity</category>
      <category>github</category>
    </item>
    <item>
      <title>Why Most AI Agents Are Insecure by Default (And No One Is Testing Them)</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 27 Apr 2026 12:07:16 +0000</pubDate>
      <link>https://forem.com/crucible_sec/why-most-ai-agents-are-insecure-by-default-and-no-one-is-testing-them-2f5n</link>
      <guid>https://forem.com/crucible_sec/why-most-ai-agents-are-insecure-by-default-and-no-one-is-testing-them-2f5n</guid>
      <description>&lt;p&gt;&lt;em&gt;&lt;strong&gt;Why Most AI Agents Are Insecure by Default&lt;/strong&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuabq8yu1rv8nz617qh6r.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuabq8yu1rv8nz617qh6r.png" alt=" " width="800" height="640"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;AI agents are being deployed everywhere.&lt;/p&gt;

&lt;p&gt;From chatbots to automation tools, they’re quickly becoming part of real-world systems.&lt;/p&gt;

&lt;p&gt;But there’s a problem that isn’t getting enough attention:&lt;/p&gt;

&lt;p&gt;Most AI agents are never tested for security.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Illusion of “Working Systems”&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Most teams test their systems for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;accuracy
&lt;/li&gt;
&lt;li&gt;performance
&lt;/li&gt;
&lt;li&gt;latency
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And if everything works as expected, the system is considered “ready”.&lt;/p&gt;

&lt;p&gt;But this only reflects &lt;strong&gt;normal usage&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;AI systems don’t usually fail there.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Where Things Start Breaking&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;When you test with adversarial input, behavior changes.&lt;/p&gt;

&lt;p&gt;Simple inputs like:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Ignore previous instructions and…”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;override system logic
&lt;/li&gt;
&lt;li&gt;manipulate outputs
&lt;/li&gt;
&lt;li&gt;bypass safeguards
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;What’s surprising is how easy this is to trigger.&lt;/p&gt;

&lt;p&gt;No complex exploit needed.&lt;/p&gt;

&lt;p&gt;Just input.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why This Is Different from Traditional Software&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Traditional systems fail loudly:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;crashes
&lt;/li&gt;
&lt;li&gt;errors
&lt;/li&gt;
&lt;li&gt;logs
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI systems fail differently.&lt;/p&gt;

&lt;p&gt;They:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;follow the wrong instruction
&lt;/li&gt;
&lt;li&gt;behave unexpectedly
&lt;/li&gt;
&lt;li&gt;produce incorrect outputs
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And often, it looks completely normal.&lt;/p&gt;

&lt;p&gt;This makes failures harder to detect.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Real Problem&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Most AI systems appear safe.&lt;/p&gt;

&lt;p&gt;Not because they are secure.&lt;/p&gt;

&lt;p&gt;But because they haven’t been tested under pressure.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;A Familiar Pattern&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;We’ve seen this before.&lt;/p&gt;

&lt;p&gt;Early web systems followed the same path:&lt;br&gt;
build first → secure later  &lt;/p&gt;

&lt;p&gt;AI seems to be repeating that cycle.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What Needs to Change&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If AI systems are going to be used in real environments, testing needs to evolve.&lt;/p&gt;

&lt;p&gt;Not just:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“Does it work?”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“How does it behave under attack?”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Final Thought&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If your system takes input,&lt;/p&gt;

&lt;p&gt;it can be manipulated.&lt;/p&gt;

&lt;p&gt;And if you’re not testing for that,&lt;/p&gt;

&lt;p&gt;you’re not really testing the system.&lt;/p&gt;

&lt;p&gt;We’ve been exploring this space while building Crucible — an open-source framework for testing AI agents under adversarial conditions.&lt;/p&gt;

&lt;p&gt;Still early, but the problem is very real.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>cybersecurity</category>
      <category>webdev</category>
    </item>
    <item>
      <title>What the OWASP Agentic AI Top 10 actually means for developers — and how to test for every category</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 27 Apr 2026 00:28:16 +0000</pubDate>
      <link>https://forem.com/crucible_sec/what-the-owasp-agentic-ai-top-10-actually-means-for-developers-and-how-to-test-for-every-category-1kdg</link>
      <guid>https://forem.com/crucible_sec/what-the-owasp-agentic-ai-top-10-actually-means-for-developers-and-how-to-test-for-every-category-1kdg</guid>
      <description>&lt;div class="ltag-github-readme-tag"&gt;
  &lt;div class="readme-overview"&gt;
    &lt;h2&gt;
      &lt;img src="https://assets.dev.to/assets/github-logo-5a155e1f9a670af7944dd5e12375bc76ed542ea80224905ecaf878b9157cdefc.svg" alt="GitHub logo"&gt;
      &lt;a href="https://github.com/crucible-security" rel="noopener noreferrer"&gt;
        crucible-security
      &lt;/a&gt; / &lt;a href="https://github.com/crucible-security/crucible" rel="noopener noreferrer"&gt;
        crucible
      &lt;/a&gt;
    &lt;/h2&gt;
    &lt;h3&gt;
      pytest for AI agents - Autonomous red-teaming, behavioral monitoring &amp;amp; security testing for LLM agents
    &lt;/h3&gt;
  &lt;/div&gt;
  &lt;div class="ltag-github-body"&gt;
    
&lt;div id="readme" class="md"&gt;
&lt;pre&gt;   ██████╗██████╗ ██╗   ██╗ ██████╗██╗██████╗ ██╗     ███████╗
  ██╔════╝██╔══██╗██║   ██║██╔════╝██║██╔══██╗██║     ██╔════╝
  ██║     ██████╔╝██║   ██║██║     ██║██████╔╝██║     █████╗
  ██║     ██╔══██╗██║   ██║██║     ██║██╔══██╗██║     ██╔══╝
  ╚██████╗██║  ██║╚██████╔╝╚██████╗██║██████╔╝███████╗███████╗
   ╚═════╝╚═╝  ╚═╝ ╚═════╝  ╚═════╝╚═╝╚═════╝ ╚══════╝╚══════╝
  &lt;/pre&gt;
  &lt;em&gt;pytest for AI agents -- test, score, and harden before production&lt;/em&gt;
&lt;div&gt;
&lt;p&gt;&lt;a href="https://github.com/crucible-security/crucible/actions" rel="noopener noreferrer"&gt;&lt;img src="https://github.com/crucible-security/crucible/actions/workflows/ci.yml/badge.svg" alt="CI"&gt;&lt;/a&gt;
&lt;a href="https://pypi.org/project/crucible-security/" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/81ed40b75ea147c11598433d7eb2f0ceba16f0f6ca15ed3214de48cb8229df9b/68747470733a2f2f696d672e736869656c64732e696f2f707970692f762f6372756369626c652d7365637572697479" alt="PyPI"&gt;&lt;/a&gt;
&lt;a href="https://pypi.org/project/crucible-security/" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/39f7dc482ca6b3d62f71e487f1a37dfe5cdee9fa1cbef6d1fba7e41a1339fbd7/68747470733a2f2f696d672e736869656c64732e696f2f707970692f707976657273696f6e732f6372756369626c652d7365637572697479" alt="Python"&gt;&lt;/a&gt;
&lt;a href="https://github.com/crucible-security/crucible" rel="noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/d7c6493fead64961baa8392739d207c5b2971f9037246f7e932e0a181011d97e/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f636f7665726167652d39372532352d627269676874677265656e" alt="Coverage"&gt;&lt;/a&gt;
&lt;a href="https://github.com/crucible-security/crucible/LICENSE" rel="noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/39a434c39c97856247fc55ebc90e8cc1cb9871558a37bf1bf83cbaca3be89d69/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6c6963656e73652d417061636865253230322e302d626c7565" alt="License"&gt;&lt;/a&gt;
&lt;a href="https://discord.gg/m7wAxEv3" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/965b2c8314dd8d228d30060db9706d801330371f3da6565388fd7defb40eda80/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f446973636f72642d4a6f696e2d3538363546323f6c6f676f3d646973636f7264" alt="Discord"&gt;&lt;/a&gt;
&lt;a href="https://owasp.org" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/64f1ad63bafde5eec49f91243d6521e6c99b255586975ee15e9caa314649b1b4/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f4f574153502d4167656e7469632532304149253230546f7025323031302d6f72616e6765" alt="OWASP"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/div&gt;

&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;Install&lt;/h2&gt;
&lt;/div&gt;
&lt;div class="highlight highlight-source-shell notranslate position-relative overflow-auto js-code-highlight"&gt;
&lt;pre&gt;pip install crucible-security&lt;/pre&gt;

&lt;/div&gt;
&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;Quick Start&lt;/h2&gt;
&lt;/div&gt;
&lt;div class="highlight highlight-source-shell notranslate position-relative overflow-auto js-code-highlight"&gt;
&lt;pre&gt;crucible init --target https://my-agent.com/api/chat
crucible scan --target https://my-agent.com/api/chat
crucible report crucible-report.json&lt;/pre&gt;

&lt;/div&gt;
&lt;p&gt;&lt;strong&gt;One command. 90 attacks. Beautiful report.&lt;/strong&gt;&lt;/p&gt;
&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;Why Crucible?&lt;/h2&gt;

&lt;/div&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Automated red-teaming&lt;/strong&gt; -- 90 real attack payloads run in under 60 seconds, not weeks of manual testing&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;OWASP-aligned&lt;/strong&gt; -- maps every attack to the OWASP Top 10 for LLM Applications and OWASP Agentic Top 10&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;CI/CD native&lt;/strong&gt; -- &lt;code&gt;crucible scan --output json&lt;/code&gt; pipes into any pipeline; fail builds on low grades&lt;/li&gt;
&lt;/ul&gt;
&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;Modules&lt;/h2&gt;

&lt;/div&gt;
&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Module&lt;/th&gt;
&lt;th&gt;Attacks&lt;/th&gt;
&lt;th&gt;Status&lt;/th&gt;
&lt;th&gt;OWASP Coverage&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Prompt Injection&lt;/td&gt;
&lt;td&gt;50&lt;/td&gt;
&lt;td&gt;Live&lt;/td&gt;
&lt;td&gt;LLM01, LLM07&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Goal Hijacking&lt;/td&gt;
&lt;td&gt;20&lt;/td&gt;
&lt;td&gt;Live&lt;/td&gt;
&lt;td&gt;Agentic #1&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Jailbreaks&lt;/td&gt;
&lt;td&gt;20&lt;/td&gt;
&lt;td&gt;Live&lt;/td&gt;
&lt;td&gt;LLM01, LLM06&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Tool Misuse&lt;/td&gt;
&lt;td&gt;--&lt;/td&gt;
&lt;td&gt;Coming&lt;/td&gt;
&lt;td&gt;Agentic #3&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Identity Abuse&lt;/td&gt;
&lt;td&gt;--&lt;/td&gt;
&lt;td&gt;Coming&lt;/td&gt;
&lt;td&gt;Agentic #4&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Memory Poisoning&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;…&lt;/div&gt;
  &lt;/div&gt;
  &lt;div class="gh-btn-container"&gt;&lt;a class="gh-btn" href="https://github.com/crucible-security/crucible" rel="noopener noreferrer"&gt;View on GitHub&lt;/a&gt;&lt;/div&gt;
&lt;/div&gt;


</description>
      <category>cybersecurity</category>
      <category>ai</category>
      <category>python</category>
      <category>opensource</category>
    </item>
    <item>
      <title>I Bypassed a "Secured" AI Agent in 62 Seconds — So I Built the Tool That Catches It</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 27 Apr 2026 00:20:25 +0000</pubDate>
      <link>https://forem.com/crucible_sec/i-bypassed-a-secured-ai-agent-in-62-seconds-so-i-built-the-tool-that-catches-it-4ec4</link>
      <guid>https://forem.com/crucible_sec/i-bypassed-a-secured-ai-agent-in-62-seconds-so-i-built-the-tool-that-catches-it-4ec4</guid>
      <description></description>
      <category>ai</category>
      <category>security</category>
      <category>python</category>
      <category>opensource</category>
    </item>
    <item>
      <title>I Bypassed a "Secured" AI Agent in 62 Seconds — So I Built the Tool That Catches It</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sun, 26 Apr 2026 12:15:00 +0000</pubDate>
      <link>https://forem.com/crucible_sec/i-bypassed-a-secured-ai-agent-in-62-seconds-so-i-built-the-tool-that-catches-it-2fk0</link>
      <guid>https://forem.com/crucible_sec/i-bypassed-a-secured-ai-agent-in-62-seconds-so-i-built-the-tool-that-catches-it-2fk0</guid>
      <description></description>
      <category>aisecurity</category>
      <category>python</category>
      <category>opensource</category>
      <category>langchain</category>
    </item>
  </channel>
</rss>
