Forem

# alignment

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
I ran 5 social engineering attacks on AI. The failure modes are human.

I ran 5 social engineering attacks on AI. The failure modes are human.

1
Comments
2 min read
Candy Barbecue and the Universal Problem of Metric Corruption

Candy Barbecue and the Universal Problem of Metric Corruption

3
Comments
8 min read
Alignment is the wrong frame: a structural argument from Φ-IIT
Cover image for Alignment is the wrong frame: a structural argument from Φ-IIT

Alignment is the wrong frame: a structural argument from Φ-IIT

Comments
5 min read
Governance of Predictive Intelligence: What Human Minds Teach Us About Drift, Hallucination, and Self-Correction in AI

Governance of Predictive Intelligence: What Human Minds Teach Us About Drift, Hallucination, and Self-Correction in AI

1
Comments
5 min read
Multi-Resolution Astronomical Image Alignment: Preserving Astrometry and Quality Across Detector Channels

Multi-Resolution Astronomical Image Alignment: Preserving Astrometry and Quality Across Detector Channels

Comments
9 min read
The Two Limits

The Two Limits

Comments
6 min read
#38 A Handmade Incubator
Cover image for #38 A Handmade Incubator

#38 A Handmade Incubator

Comments
5 min read
#08 Death Without a Will
Cover image for #08 Death Without a Will

#08 Death Without a Will

Comments
4 min read
Three Modes of Not Cooperating

Three Modes of Not Cooperating

Comments
5 min read
Advancing AI Alignment Research: OpenAI Allocates $7.5M

Advancing AI Alignment Research: OpenAI Allocates $7.5M

2
Comments
1 min read
The Compliance Problem: Why Aligned AI Can't Verify Its Own Alignment

The Compliance Problem: Why Aligned AI Can't Verify Its Own Alignment

Comments
5 min read
Prompt-Based Alignment Has a Ceiling — 3-Model Prisoner's Dilemma Evidence
Cover image for Prompt-Based Alignment Has a Ceiling — 3-Model Prisoner's Dilemma Evidence

Prompt-Based Alignment Has a Ceiling — 3-Model Prisoner's Dilemma Evidence

1
Comments 1
10 min read
How GPT Diagnosed Itself — I Fed It Its Own 2-Month-Old Design, and Every Flaw Became Visible

How GPT Diagnosed Itself — I Fed It Its Own 2-Month-Old Design, and Every Flaw Became Visible

1
Comments
18 min read
Dissecting Three AIs: What Appeared When the Fences Came Down

Dissecting Three AIs: What Appeared When the Fences Came Down

1
Comments
10 min read
Eyes, Ears, Voice, and Memory: All 4 Elements of Autonomous AI Have Already Been Tested

Eyes, Ears, Voice, and Memory: All 4 Elements of Autonomous AI Have Already Been Tested

Comments
14 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.