A study published in Science finds that AI now generates nearly 30% of new Python code on GitHub in the United States, up from just 5% in 2022. The gains are real - but they flow almost entirely to experienced developers, not junior ones.
OpenAI and Anthropic released their flagship AI coding agents on the same day in February 2026. Their system cards reveal two genuinely different engineering philosophies and safety postures - and a single shared problem neither has solved: how to deploy an autonomous AI agent responsibly when you cannot yet fully account for its behavior.
Anthropic's Claude Opus 4.6 system card documents sweeping capability gains alongside safety findings that are harder to dismiss than those of any previous generation. On cyber evaluations the model has hit a ceiling, on autonomous R&D it is approaching one, and the tools used to monitor it are struggling to keep pace.