• About
  • FAQ
  • Landing Page
Newsletter
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
No Result
View All Result
Home Guide

Frontier AI Models Demonstrate Human-Level Capability in Smart Contract Exploits

admin by admin
December 2, 2025
in Guide
0
China State-Backed Hackers Used AI To Launch First Massive Cyberattack: Anthropic
190
SHARES
1.5k
VIEWS
Share on FacebookShare on Twitter



In brief

  • Anthropic tested ten AI models on 405 historical smart contract exploits and reproduced 207 of them.
  • Three models generated $4.6 million in simulated exploits on contracts created after their training cutoff.
  • Agents also discovered two new zero-day vulnerabilities in recent Binance Smart Chain contracts.

AI agents matched the performance of skilled human attackers in more than half of the smart contract exploits recorded on major blockchains over the last five years, according to new data released Monday by Anthropic.

Anthropic evaluated ten frontier models, including Llama 3, Sonnet 3.7, Opus 4, GPT-5, and DeepSeek V3, on a dataset of 405 historical smart contract exploits. The agents produced working attacks against 207 of them, totaling $550 million in simulated stolen funds.

Related articles

Reve 2.0 Review: The Best AI Image Generator for Layout Control

Reve 2.0 Review: The Best AI Image Generator for Layout Control

June 15, 2026
Solv Protocol Will Dump LayerZero, Migrate $700M Tokenized Bitcoin Tech to Chainlink

Solv Protocol Will Dump LayerZero, Migrate $700M Tokenized Bitcoin Tech to Chainlink

May 8, 2026

The findings showed how quickly automated systems can weaponize vulnerabilities and identify new ones that developers have not addressed.

The new disclosure is the latest from the developer of Claude AI. Last month, Anthropic detailed how Chinese hackers used Claude Code to launch what it called the first AI-driven cyberattack.

Security experts said the results confirmed how accessible many of these flaws already are.

“AI is already being used in ASPM tools like Wiz Code and Apiiro, and in standard SAST and DAST scanners,” David Schwed, COO of SovereignAI, told Decrypt. “That means bad actors will use the same technology to identify vulnerabilities.”

Schwed said the model-driven attacks described in the report would be straightforward to scale because many vulnerabilities are already publicly disclosed through Common Vulnerabilities and Exposures or audit reports, making them learnable by AI systems and easy to attempt against existing smart contracts.

“Even easier would be to find a disclosed vulnerability, find projects that forked that project, and just attempt that vulnerability, which may not have been patched,” he said. “This can all be done now 24/7, against all projects. Even those now with smaller TVLs are targets because why not? It’s agentic.”

To measure current capabilities, Anthropic plotted each model’s total exploit revenue against its release date using only the 34 contracts exploited after March 2025.

“Although total exploit revenue is an imperfect metric—since a few outlier exploits dominate the total revenue—we highlight it over attack success rate because attackers care about how much money AI agents can extract, not the number or difficulty of the bugs they find,” the company wrote.

Anthropic did not immediately respond to requests for comment by Decrypt.

Anthropic said it tested the agents on a zero-day dataset of 2,849 contracts drawn from more than 9.4 million on Binance Smart Chain.

The company said Claude Sonnet 4.5 and GPT-5 each uncovered two undisclosed flaws that produced $3,694 in simulated value, with GPT-5 achieving its result at an API cost of $3,476. Anthropic noted that all tests ran in sandboxed environments that replicated blockchains and not real networks.

Its strongest model, Claude Opus 4.5, exploited 17 of the post-March 2025 vulnerabilities and accounted for $4.5 million of the total simulated value.

The company linked improvements across models to advances in tool use, error recovery, and long-horizon task execution. Across four generations of Claude models, token costs fell by 70.2%.

One of the newly discovered flaws involved a token contract with a public calculator function that lacked a view modifier, which allowed the agent to repeatedly alter internal state variables and sell inflated balances on decentralized exchanges. The simulated exploit generated about $2,500.

Schwed said the issues highlighted in the experiment were “really just business logic flaws,” adding that AI systems can identify these weaknesses when given structure and context.

“AI can also discover them given an understanding of how a smart contract should function and with detailed prompts on how to attempt to circumvent logic checks in the process,” he said.

Anthropic said the capabilities that enabled agents to exploit smart contracts also apply to other types of software, and that falling costs will shrink the window between deployment and exploitation. The company urged developers to adopt automated tools in their security workflows so defensive use advances as quickly as offensive use.

Despite Anthropic’s warning, Schwed said the outlook is not solely negative.

“I always push back on the doom and gloom and say with proper controls, rigorous internal testing, along with real-time monitoring and circuit breakers, most of these are avoidable,” he said. “The Good actors have the same access to the same agents. So if the bad actors can find it, so can the good actors. We have to think and act differently.”

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.



Source link

Share76Tweet48

Related Posts

Reve 2.0 Review: The Best AI Image Generator for Layout Control

Reve 2.0 Review: The Best AI Image Generator for Layout Control

by admin
June 15, 2026
0

In brief Reve 2.0 debuted at #2 on the Arena text-to-image leaderboard, behind OpenAI’s GPT Image 2 and ahead of...

Solv Protocol Will Dump LayerZero, Migrate $700M Tokenized Bitcoin Tech to Chainlink

Solv Protocol Will Dump LayerZero, Migrate $700M Tokenized Bitcoin Tech to Chainlink

by admin
May 8, 2026
0

In brief Solv Protocol is migrating more than $700 million in tokenized Bitcoin infrastructure from LayerZero to Chainlink CCIP. The...

Chrome Is Quietly Installing a 4GB AI Model on Your Computer—And Putting It Back If You Delete It

Chrome Is Quietly Installing a 4GB AI Model on Your Computer—And Putting It Back If You Delete It

by admin
May 7, 2026
0

In brief Chrome silently downloads a ~4GB Gemini Nano file called weights.bin to eligible devices with no opt-in prompt, and...

GalaxyOne Head Wants Retail Investors to Stake More, Predict Less

Kelp Blames LayerZero for $292 Million Hack, Plans Switch to Chainlink

by admin
May 6, 2026
0

In brief Kelp says LayerZero approved the setup tied to a $292 million exploit, which LayerZero disputes. The protocol is...

Anthropic Beats OpenAI on Secondary Markets With $1 Trillion Implied Valuation

Someone Built an Open-Source ‘Theoretical Mythos’ to Reverse-Engineer Anthropic’s Most Dangerous AI

by admin
May 5, 2026
0

In brief OpenMythos is a from-scratch reconstruction of the Claude Mythos architecture, built only from public research papers and educated...

Load More
  • Trending
  • Comments
  • Latest
Bitcoin perps just got a US green light, but one catch could decide everything

Bitcoin perps just got a US green light, but one catch could decide everything

May 30, 2026
THORChain exploit turns DeFi halt into trust test

THORChain exploit turns DeFi halt into trust test

May 17, 2026
This week Bitcoin faces as a new fed chair colliding with inflation in its biggest macro test of the year

This week Bitcoin faces as a new fed chair colliding with inflation in its biggest macro test of the year

May 12, 2026
What Choices Will You Make On The Way To A Multipolar World?

What Choices Will You Make On The Way To A Multipolar World?

May 28, 2026

US Commodities Regulator Beefs Up Bitcoin Futures Review

0

Bitcoin Hits 2018 Low as Concerns Mount on Regulation, Viability

0

India: Bitcoin Prices Drop As Media Misinterprets Gov’s Regulation Speech

0

Bitcoin’s Main Rival Ethereum Hits A Fresh Record High: $425.55

0
Reve 2.0 Review: The Best AI Image Generator for Layout Control

Reve 2.0 Review: The Best AI Image Generator for Layout Control

June 15, 2026
Bitcoin perps just got a US green light, but one catch could decide everything

Bitcoin perps just got a US green light, but one catch could decide everything

May 30, 2026
What Choices Will You Make On The Way To A Multipolar World?

What Choices Will You Make On The Way To A Multipolar World?

May 28, 2026
The History And Future Of Physical Bitcoin

The History And Future Of Physical Bitcoin

May 24, 2026

Recent News

Reve 2.0 Review: The Best AI Image Generator for Layout Control

Reve 2.0 Review: The Best AI Image Generator for Layout Control

June 15, 2026
Bitcoin perps just got a US green light, but one catch could decide everything

Bitcoin perps just got a US green light, but one catch could decide everything

May 30, 2026

Categories

  • Bitcoin
  • Blockchain
  • Business
  • Ethereum
  • Guide
  • Market
  • Regulation
  • Ripple
  • Uncategorized
  • About
  • FAQ
  • Support Forum
  • Landing Page
  • Contact Us

© Copyright Cryptodnews 2025-2026 All Rights Reserved.

No Result
View All Result
  • Contact Us
  • Homepages
  • Business
  • Guide

© Copyright Cryptodnews 2025-2026 All Rights Reserved.