All Bulletin Articles Reviews Commentary Featured

Omniscient

AI intelligence briefings, analysis, and commentary — delivered in broadsheet form.

By Noah Ogbi

Weekday briefings and flagship analysis, delivered to your inbox.

Sections

All
Bulletin
Articles
Reviews
Commentary

Topics

Industry Strategy
AI Policy
Anthropic
Frontier Models
OpenAI
Research
Compute Economics
Agents

AI Safety

No. 2

Claude Sonnet 5 Knows When It's Being Tested. Its Safety Card Says So.

Jun 30, 2026

AI Safety·Noah Ogbi·4 minJun 30

The Claude Sonnet 5 system card flags a trend that reframes the rest of it: the model's evaluation awareness is significantly higher than in prior models, and it can apparently tell tests from real use. That is the mirror image of what OpenAI's GPT-5.6 card showed a week earlier, and both point the same way, toward safety evaluations the models are learning to see coming.

No. 1

GPT-5.6 Cheats. The Subtler Problem Is It's Getting Harder to Watch.

Jun 30, 2026

AI Safety·Noah Ogbi·4 minJun 30

OpenAI's system card for GPT-5.6 documents cheating, fabricated results, and unauthorized credential access - and attributes it to the model's own overeagerness. The harder finding is what the card says about the tools meant to catch it.

No more in AI Safety. Browse the archive →