Forem

Testing

Find those bugs before your users do! 🐛

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
LLM-as-a-Judge: Automated Scoring and Reliability vs. Human Evaluation

LLM-as-a-Judge: Automated Scoring and Reliability vs. Human Evaluation

2
Comments
6 min read
How to QA Test Your AI Agent: A Practical Playbook for 2026

How to QA Test Your AI Agent: A Practical Playbook for 2026

1
Comments
7 min read
Rotating Residential Proxy Validation Lab for 2026 That You Can Reproduce and Score
Cover image for Rotating Residential Proxy Validation Lab for 2026 That You Can Reproduce and Score

Rotating Residential Proxy Validation Lab for 2026 That You Can Reproduce and Score

Comments 1
7 min read
DALL·E 3 HD vs. SD3.5 Flash vs. Ideogram V2: Speed & Quality Test

DALL·E 3 HD vs. SD3.5 Flash vs. Ideogram V2: Speed & Quality Test

Comments
7 min read
25 Years of Industrial Testing: What I Learned About Documentation

25 Years of Industrial Testing: What I Learned About Documentation

1
Comments
6 min read
Accessibility Testing with Playwright Assertions
Cover image for Accessibility Testing with Playwright Assertions

Accessibility Testing with Playwright Assertions

5
Comments
3 min read
When One Model Reviews Its Own Work: The Case for Adversarial Cross-Model Review

When One Model Reviews Its Own Work: The Case for Adversarial Cross-Model Review

Comments
6 min read
RCt2 – A Pragmatic Evolution of BDD for Software Test Cases

RCt2 – A Pragmatic Evolution of BDD for Software Test Cases

Comments
5 min read
Addressing Algorithmic Bias in Resume Screening: PRAETOR v5.5 (Experimental)
Cover image for Addressing Algorithmic Bias in Resume Screening: PRAETOR v5.5 (Experimental)

Addressing Algorithmic Bias in Resume Screening: PRAETOR v5.5 (Experimental)

1
Comments
2 min read
AI Agents Can't Mark Their Own Homework [Case Study]
Cover image for AI Agents Can't Mark Their Own Homework [Case Study]

AI Agents Can't Mark Their Own Homework [Case Study]

3
Comments 13
10 min read
Who tests the tests?
Cover image for Who tests the tests?

Who tests the tests?

5
Comments
4 min read
How to Confidently Test Jetpack Compose UI with Espresso
Cover image for How to Confidently Test Jetpack Compose UI with Espresso

How to Confidently Test Jetpack Compose UI with Espresso

4
Comments
3 min read
API vs Event Streaming: O Que Muda Para Quem Testa?
Cover image for API vs Event Streaming: O Que Muda Para Quem Testa?

API vs Event Streaming: O Que Muda Para Quem Testa?

76
Comments 5
4 min read
Building WSL-UI: Mock Mode and Fake Distros
Cover image for Building WSL-UI: Mock Mode and Fake Distros

Building WSL-UI: Mock Mode and Fake Distros

Comments
5 min read
I Let Claude Code Run Unsupervised for 8 Hours - Here's What It Built
Cover image for I Let Claude Code Run Unsupervised for 8 Hours - Here's What It Built

I Let Claude Code Run Unsupervised for 8 Hours - Here's What It Built

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.