Anthropic says AI agents are nearing autonomous successor design as Claude writes most merged code
AI

Anthropic says AI agents are nearing autonomous successor design as Claude writes most merged code

Leaders flagged a four-month capability doubling pace and warned human review could become the limiting factor.

By AI News Crypto Editorial Team5 min read

Anthropic Institute lead Marina Favaro and Anthropic co-founder Jack Clark warned on June 5 that AI development is accelerating toward agents that could autonomously design and develop their own successors with enough compute. The pair argued policymakers should have the option to slow frontier AI progress, a stance that collides with early signs of agent-driven payment automation already scaling.

Key Takeaways

  • Anthropic leaders said AI development is trending toward agents that can design and develop successor systems given sufficient compute.
  • Model capability gains were described as roughly doubling every four months, faster than a prior seven-month cadence.
  • Claude authored around 80% of the code merged into Anthropic’s codebase, pointing to deep model integration in production workflows.
  • Keyrock reported $73 million settled across 176 million transactions by AI agents, framing agent-settled payments as a live, growing behavior.

Anthropic’s Warning: Agents Moving Toward Autonomous Successor Design

Marina Favaro, lead at the Anthropic Institute, and Anthropic co-founder Jack Clark said AI agents are already taking on meaningful chunks of AI R&D work, including running code and delegating tasks to other agents. In their June 5 blog post, they framed the trajectory as moving beyond “assistants” toward systems that can increasingly execute the development loop themselves.

Their core claim was explicit: “Taken far enough, and given enough compute, that trend points to an AI system capable of fully autonomously designing and developing its own successor.” Favaro and Clark added that recursive self-improvement is “not inevitable,” but warned it could arrive sooner than institutions are prepared for.

For crypto markets, the relevance is not philosophical. If agents can reliably take actions on behalf of users and businesses, payments and settlement become a software problem first and a governance problem second. That is where second-order effects start to show up in stablecoin rails, compliance tooling, and on-chain throughput narratives.

The Acceleration Metrics: Four-Month Doubling and Claude’s Code Share

Favaro and Clark said AI model improvement has been “roughly doubling every four months,” versus “every seven months.” A faster compounding curve matters because it compresses the time the market has to observe, validate, and price in downstream adoption.

They also offered an operational proof point that cuts through most agent hype: Claude authored “around 80% of the code merged into Anthropic’s codebase.” The excerpt does not specify the measurement window or methodology, but the directional signal is clear. AI is not just generating snippets. It is being used in a production pipeline where code is merged.

That internal workflow claim strengthens the case that agent-driven transaction automation can move from demos to sustained usage faster than many crypto narratives assume, especially if the “agent” is increasingly the default interface for software execution.

Where the Bottleneck Shifts: Human Review vs. Machine Output

Favaro and Clark argued the limiting factor may soon shift from writing code to reviewing it. Their warning was mechanical, not cinematic: “Once human- and AI-authored code quality reach parity, humans will stop writing code entirely and shift to only reviewing it. But if they can’t review code as quickly as Claude can generate it, human review will become the bottleneck to AI development.”

That bottleneck framing maps cleanly onto crypto infrastructure. Agentic commerce can scale transaction intent faster than humans can audit smart contracts, monitor risk, or govern protocol changes. If review capacity lags output capacity, the market can get more transactions without getting more safety.

Guardrails, Release Decisions, and Transaction Growth Signals to Track

Favaro and Clark said it would be “good for the world to have the option to slow or temporarily pause frontier AI development” to let societal structures and alignment research catch up. They also warned that unilateral slowdowns could advantage “the least cautious actors,” citing the lack of a global coordination mechanism.

On the same Thursday, a group of tech leaders, including some from Anthropic and OpenAI, released an open letter urging lawmakers to enact stronger AI guardrails, citing concerns that AI could overcome “knowledge barriers” that historically prevented bad actors from creating biological weapons.

Concrete watch items now cluster around policy and product gating. Any follow-on actions tied to the June 5 open letter matter most if they include timelines or enforcement mechanisms. Anthropic’s next stance on releasing higher-capability models is another live variable after it withheld public release of “Claude Mythos” in April 2026, citing cybersecurity concerns after the model could “easily create software exploits.”

On the adoption side, Keyrock said last month that AI agents settling payments moved from concept to reality over the past 12 months, with $73 million settled across 176 million transactions. The excerpt does not provide chain-by-chain or rail-by-rail detail, so the next useful datapoint is a breakdown and an updated 3–6 month run-rate. Similar disclosures from major AI labs on the share of production code authored by models would also act as a proxy for how quickly agent autonomy is increasing.

Trading the Agentic Commerce Narrative Without Overfitting the Hype

I treat the “~80% merged code” claim and the $73 million across 176 million agent-settled transactions as the same kind of signal: agents are already embedded in real workflows, and the market tends to underprice compounding when the proof points arrive in operational metrics rather than product launches.

The threshold that matters is whether guardrails and release decisions slow deployment enough to cap near-term transaction growth, or whether capability keeps compounding on that four-month cadence and forces payment rails to scale under pressure. This matters in practical terms if agent-settled payment volume shows a durable run-rate increase while frontier model releases remain available enough to keep automation expanding.

Sources