Omniscient
AllDaily SignalArticlesReviewsCommentaryFeatured
Sign In

Omniscient

AI intelligence briefings, analysis, and commentary — delivered in broadsheet form.

By Noah Ogbi

Subscribe

Weekday briefings and flagship analysis, delivered to your inbox.

Sections

  • All
  • Daily Signal
  • Articles
  • Reviews
  • Commentary

Topics

  • Industry Strategy
  • Anthropic
  • AI Policy
  • Research
  • Compute Economics
  • OpenAI
  • Agents
  • Frontier Models

Meta

  • About
  • Masthead
  • Standards
  • Corrections
  • RSS Feed
  • Privacy Policy
  • Terms of Service

Omniscient Media — made by ForeverBuilt, LLC.
© 2026 ForeverBuilt, LLC. All rights reserved.

  1. Home
  2. ›Industry
  3. ›Anthropic Made the Case for a Pause It Will Help Verify

Industry

Vol. 1·Sunday, June 7, 2026

Anthropic Made the Case for a Pause It Will Help Verify


Noah Ogbi10 min read

Tips, corrections, or questions? support@omniscient.media

TopicsAI PolicyAI SecurityIndustry Strategy
CompaniesAnthropic
Anthropic Made the Case for a Pause It Will Help Verify

On a single page published in May 2026 at anthropic.com/institute/recursive-self-improvement, Marina Favaro and Jack Clark did two things at once. They disclosed that more than 80% of the code merged into Anthropic's codebase in May 2026 was authored by Claude, that the typical Anthropic engineer was shipping eight times as much code per day as in 2024, and that on one internal AI safety research problem, agents closed 97% of the gap between baseline and ceiling in days where two human researchers had closed 23% in a week.[1][2][6] Then, in the same piece, they endorsed the conditions under which a coordinated international slowdown on frontier AI development would, in Anthropic's stated view, "likely be a good thing," and proposed that Anthropic help build the verification systems that any such slowdown would require.[8]

The first part is the news. The second part is the position. Why they appear together is the argument this piece makes.

What Anthropic disclosed about its own work

The productivity block in When AI builds itself is the empirically novel content. Most of the field has published task-completion benchmarks. Few labs have published internal productivity numbers tied to the deployment of their own models. Anthropic did - and the specific numbers matter, because not all of them carry equal weight for the policy claim that follows.

As of May 2026, more than 80% of the code merged into Anthropic's codebase was authored by Claude.[2] Before Claude Code launched in research preview in February 2025, the same number was in the low single digits.[2] In the second quarter of 2026, the typical Anthropic engineer was merging eight times as much code per day as in 2024.[2] The piece itself acknowledges that lines of code is an imperfect measure. The throughput number is the least load-bearing of the three for the slowdown argument - it says the volume is high, not that the capability is compounding.

The qualitative measures rose in parallel. On open-ended coding tasks, Claude's success rate reached 76% in May 2026, up fifty percentage points in six months.[4] Anthropic's own assessment of code quality is that Claude-written code "was somewhat worse than human-written code at Anthropic in late 2025, is roughly at parity today, and we expect it to be strictly better within the year."[4]

A March 2026 internal survey with 130 respondents reported a median of four times the research output attributed to Mythos Preview, Anthropic's most capable model at the time.[3] On research direction selection in open-ended investigative sessions, Anthropic measured model judgment rising from 51% with Opus 4.5 in November 2025 to 64% with Mythos Preview in April 2026.[7]

The task-horizon trend is the longest-baseline disclosure. Anthropic reports that the length of tasks Claude can reliably complete on its own has been doubling roughly every four months, up from an earlier doubling cadence of every seven months.[5] In March 2024, Opus 3 managed four-minute tasks. In March 2025, Sonnet 3.7 managed ninety-minute tasks. In March 2026, Opus 4.6 managed twelve-hour tasks.[5] Anthropic's extrapolation: "If this trend holds, tasks that take a skilled person days could come into range this year. In 2027, AI systems could be capable of tasks that take a person weeks."[5]

The most consequential single data point - most consequential for the slowdown claim specifically - is on AI safety research itself. Anthropic's alignment team published a companion paper on an automated weak-to-strong supervision system. Two of the paper's authors spent seven days tuning four representative prior methods, achieving a best Performance Gap Recovered of 0.23. The automated agents, powered by Opus 4.6, achieved a Performance Gap Recovered of 0.97 over five cumulative days at a cost of roughly $18,000.[6] The 0.97 PGR result is the number that justifies the policy posture: if AI systems are outperforming their own authors on the alignment research those authors are paid to do, then the case for a slowdown is no longer speculative. It is a matter of tempo.

What Anthropic endorsed as the policy position

On the same page, Favaro and Clark wrote the following exact sentence: "If it were possible to effectively slow the development of this technology to give ourselves more time to deal with its immense implications, we think that would likely be a good thing."[8]

That sentence is the position. It is published, not draft, not internal. It is signed under the Anthropic Institute, by Jack Clark - Anthropic's co-founder and Head of Policy - who has argued versions of this view in his Import AI newsletter for years.

The piece immediately follows the conditional with the constraint: "But if a slowdown simply lets the least cautious actors catch up technologically, it could leave everyone less safe."[8] The structural problem is given a metaphor: "Training runs are far easier to conceal than missile silos…the incentive to defect quietly is enormous, because whoever continues while others pause could inherit the lead."[8]

The piece then offers a specific proposal. Anthropic will support development of verification systems that would enable frontier AI developers globally to confirm that others have actually stopped or slowed, and to detect secret development during coordinated pauses.[8] The endorsement is not for a unilateral pause. The endorsement is for the conditions under which a verifiable, multilateral pause would be warranted, and a commitment to help build the institutional mechanism.

The piece is explicit about which scenario the company believes describes the present. Of three sketched futures - the trend stalls, AI labs see compounding efficiency gains with humans setting direction, and full recursive self-improvement - Anthropic's stated assessment is that "the evidence we've laid out here suggests that we're likely heading into" the second.[9] On the third, the piece concedes that how the alignment problem gets resolved "is something we are least certain about."[9]

Why the disclosure and the endorsement travel together

The productivity data and the slowdown endorsement were published in the same piece because they are, structurally, the same argument made in two registers - one empirical, one political. The data is the premise. The endorsement is what follows from accepting it.

If Claude is authoring more than four out of every five lines merged at Anthropic, if engineers are shipping at eight times their 2024 cadence, if open-ended task horizons are doubling every four months, and if on one internal AI safety research problem the agents are closing the human-to-ceiling gap better than the humans did in a week, then the lab has measurable evidence that the technology is compounding on itself at the development layer of the company that builds it. Scenario two, in the piece's own characterization, is not described as a forecast. It is described as the present.

The conditional structure of the endorsement - slowdown if verifiable and otherwise not - is the only posture available to a frontier lab that holds both views simultaneously. The lab that believes scenario two has arrived and scenario three is in view cannot endorse a unilateral pause without ceding the lead to less cautious actors. The lab that publishes the empirical case for the technology compounding on itself cannot remain silent on the policy implications without inviting the inference that it does not care. The conditional, plus the verification proposal, is the resolution.

The proposal that Anthropic help build the verification regime is not a rhetorical afterthought. The party that builds the verification mechanism is the party that sits at the table for any slowdown that results from it.

The Anthropic Institute as venue

When AI builds itself appears at anthropic.com/institute/recursive-self-improvement under the Anthropic Institute label, authored by Marina Favaro and Jack Clark, with editorial support from Santi Ruiz, visuals from Shan Carter, Romello Goodman, and Nikki Makagiansar, and data collection from Brian Calvert and Jun Shern Chan.[1] The piece links to a companion technical paper at alignment.anthropic.com.

The Institute is a third surface, distinct from both alignment.anthropic.com (where Anthropic's alignment team publishes its technical work, including the automated weak-to-strong researcher paper this piece relies on) and anthropic.com/news (where the company announces product launches, fundraises, and partnerships). It publishes under a research register and is structured for policy argumentation.

Marina Favaro works on AI safety governance. Jack Clark is Anthropic's co-founder and Head of Policy, and one of the most cited AI policy figures in the field, including through his long-running Import AI newsletter. The Institute is the publishing arm Anthropic has constituted to do exactly the kind of public-policy argumentation When AI builds itself does - in their voice, under the Anthropic brand, in a register distinct from both corporate communications and technical alignment research.

The piece reads as the Institute's inaugural-feeling release. Whatever the Institute publishes next will land in the same register and carry the same institutional weight as a policy artifact, regardless of its technical content.

Where this places Anthropic in the policy mechanism

The slowdown endorsement landed in the same month as a sequence of moves that, taken together, trace a single institutional trajectory.

On May 28, 2026, Anthropic closed a $65 billion Series H at a $965 billion post-money valuation, with a stated run-rate of $47 billion.[10] The round was led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital, with Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN as co-leads, alongside a broad group of institutional and strategic investors.[10] On June 1, the company filed a confidential S-1 with the SEC.[11]

On June 2, Anthropic expanded Project Glasswing to approximately 150 new partner organizations across more than 15 countries, anchored on Claude Mythos Preview, with an initial cohort of about 50 partners having already found more than 10,000 high- or critical-severity security flaws.[12] On the same day, the White House signed Promoting Advanced Artificial Intelligence Innovation and Security, an executive order defining a "covered frontier model" through a classified benchmarking process led by the Director of the NSA and codifying a voluntary 30-day pre-release access window - the operating structure Anthropic had been running through Glasswing since early April.[13]

When AI builds itself followed in May. The same lab that authored, in operating terms, the federal voluntary disclosure regime is now offering to author, in technical terms, the verification regime any international pause would route through. The slowdown endorsement is consistent with Anthropic's prior public positions. The verification proposal is consistent with the company's institutional trajectory. The October IPO window and the question of whether any other frontier lab co-signs the verification proposal are the two things to watch next.

The risk in the reading

Honest counterevidence first. Marina Favaro and Jack Clark have argued versions of this position for years, including in Clark's Import AI and in Favaro's prior governance work. The slowdown-if-verifiable view is not new in substance. It is established Anthropic public policy delivered in a new venue.

The productivity numbers are internally graded by Anthropic, not externally audited. The Performance Gap Recovered figures on the weak-to-strong supervision task come from Anthropic's own alignment paper, with code released and methodology described, but the human baseline is two of the paper's own authors. The reading offered here treats Anthropic's disclosures as Anthropic's disclosures. It does not attempt to verify them.

The piece's own caveat applies to the policy claim too. It is genuinely unclear whether today's training methods unlock the "research taste and judgment" that scenario two depends on as the human bottleneck and that scenario three would require AI systems to acquire. The piece itself concedes this. The reading that follows does not require certainty on the question.

The structural observation here is convergence, not causation. Nothing in this post requires the reader to accept that the productivity disclosure was timed to the IPO calendar, that the slowdown endorsement was timed to the Glasswing expansion, or that the Institute was constituted with the intent of authoring the verification regime. The structural fact is that the same lab is simultaneously the most-valuably-private AI company at a $965 billion post-money valuation, the operating partner for federal cyber defense per Glasswing, the proposer of the verification regime any international pause would route through, and the publisher of the empirical productivity case for taking the pause question seriously. The convergence is the story regardless of intent.


Sources

  1. anthropic.com/institute/recursive-self-improvement, When AI builds itself, by Marina Favaro and Jack Clark with editorial support from Santi Ruiz, May 2026. Inline ↗

  2. Same, internal-data section: "as of May 2026, more than 80% of the code we merge into Anthropic's codebase was authored by Claude"; "before Claude Code launched in research preview in February 2025, this number was in the low single digits"; "in the second quarter of 2026, the typical engineer was merging 8x as much code per day as they were in 2024." Inline ↗

  3. Same, internal-survey block: March 2026 survey, n=130; "the median respondent estimated that they produced around 4x as much output with Mythos Preview as they would have without access to any AI models." Inline ↗

  4. Same: Claude's session "success rate reached 76% in May 2026, up 50 percentage points in six months"; "Claude-written code was somewhat worse than human-written code at Anthropic in late 2025, is roughly at parity today, and we expect it to be strictly better within the year." Inline ↗

  5. Same: task horizons "doubling roughly every four months, up from an earlier trend of doubling every seven months"; Opus 3 four-minute tasks (March 2024), Sonnet 3.7 ninety-minute tasks (March 2025), Opus 4.6 twelve-hour tasks (March 2026); "tasks that take a skilled person days could come into range this year. In 2027, AI systems could be capable of tasks that take a person weeks." Inline ↗

  6. alignment.anthropic.com/2026/automated-w2s-researcher/, technical paper on Automated Alignment Researcher: AAR achieved a Performance Gap Recovered of 0.97 on held-out test data; "two authors spent 7 days tuning four representative prior methods, achieving a best PGR of 0.23"; AAR used Claude Opus 4.6 and completed 5 cumulative days at approximately $18,000. Inline ↗

  7. anthropic.com/institute/recursive-self-improvement: research direction judgment rose from 51% (Opus 4.5, November 2025) to 64% (Mythos Preview, April 2026). Inline ↗

  8. Same, slowdown section: "If it were possible to effectively slow the development of this technology to give ourselves more time to deal with its immense implications, we think that would likely be a good thing"; "But if a slowdown simply lets the least cautious actors catch up technologically, it could leave everyone less safe"; "Training runs are far easier to conceal than missile silos…the incentive to defect quietly is enormous." Anthropic states it will support verification-system development to enable frontier developers to confirm that others have stopped or slowed and to detect secret development during coordinated pauses. Inline ↗

  9. Same: Anthropic's view that "the evidence we've laid out here suggests that we're likely heading into" scenario two; on scenario three, "how the alignment problem gets solved, or not, in this future is something we are least certain about." Inline ↗

  10. anthropic.com/news/series-h: $65 billion Series H at a $965 billion post-money valuation, May 28, 2026; "our run-rate revenue crossed $47 billion earlier this month"; led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital; co-led by Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN. Inline ↗

  11. Anthropic, confidential S-1 filing announcement, June 1, 2026. Also reported by TechCrunch (June 1, 2026); October listing window per contemporaneous press coverage. Inline ↗

  12. anthropic.com/news/expanding-project-glasswing, June 2, 2026: "approximately 150 new organizations" across "more than 15 countries"; an initial cohort of about 50 partners "have so far found more than 10,000 high- or critical-severity security flaws"; anchored on Claude Mythos Preview. Project Glasswing originally launched in early April 2026. Inline ↗

  13. whitehouse.gov, Promoting Advanced Artificial Intelligence Innovation and Security, signed June 2, 2026: "covered frontier model" determination made by the Director of NSA through a classified benchmarking process; voluntary 30-day pre-release access window; "Nothing in this section shall be construed to authorize the creation of a mandatory governmental licensing, preclearance, or permitting requirement." Inline ↗

Share:

Consequential AI, explained and evaluated, every weekday.

The Daily Signal: 5 to 7 items a day with the take, not the recap.


Related

Industry

Vol. 1·Saturday, June 6, 2026

Before the Executive Order, There Was Glasswing

Anthropic didn't wait for the government's voluntary framework. It built one and made Washington catch up.


Before the Executive Order, There Was Glasswing

On Tuesday the White House signed an executive order defining a federal "covered frontier model" category and creating a voluntary 30-day pre-release window for those models. The same day, Anthropic expanded Project Glasswing to 150 organizations across fifteen countries, built on an unreleased model that has autonomously found thousands of zero-days. Glasswing started April 7. The EO did not create a new regime. It ratified the one Anthropic had been operating for two months.


AnthropicAI PolicyDefense & National Security
Noah Ogbi10 min read
Continue →

AI Policy

Vol. 1·Wednesday, April 8, 2026

Anthropic Built a Model Too Dangerous to Release. Its Fix Is to Give It Away to Big Tech.


Anthropic Built a Model Too Dangerous to Release. Its Fix Is to Give It Away to Big Tech.

Claude Mythos Preview can autonomously find and exploit zero-day vulnerabilities in every major operating system and browser. Rather than shelve it, Anthropic has handed it to a coalition of 50-plus firms under Project Glasswing. The strategy is defensible. Whether it holds depends on who else is building the same thing - and Washington's posture toward the company that built it.


AI PolicySafetyAnthropic
Noah Ogbi15 min read
Continue →

AI Policy

Vol. 1·Tuesday, May 12, 2026

OpenAI Just Shipped What Anthropic Won't. Now We Find Out What Restraint Costs.


OpenAI Just Shipped What Anthropic Won't. Now We Find Out What Restraint Costs.

OpenAI shipped Daybreak on Monday: a cybersecurity platform built on three GPT-5.5 variants with eight named enterprise security partners. Anthropic still won't ship Mythos. The gap between the two labs on the headline benchmark is now within one standard error - and the market is about to render its verdict on what restraint is actually worth.


AI PolicySafetyFrontier Models
Noah Ogbi9 min read
Continue →