• About
  • FAQ
  • Landing Page
Newsletter
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
No Result
View All Result
Home Business

Microsoft Gave AI Agents Fake Money to Buy Things Online. They Spent It All on Scams

admin by admin
November 7, 2025
in Business
0
Microsoft Gave AI Agents Fake Money to Buy Things Online. They Spent It All on Scams
189
SHARES
1.5k
VIEWS
Share on FacebookShare on Twitter


In brief

  • AI agents configured by Microsoft got overwhelmed by 100 search results and grabbed the first option—no matter how bad it was.
  • Malicious AI sellers can trick top models into handing over all their virtual cash with fake reviews and scams.
  • They can’t collaborate or think critically without step-by-step human hand-holding—autonomous AI shopping isn’t ready for prime time.

Microsoft built a simulated economy with hundreds of AI agents acting as buyers and sellers, then watched them fail at basic tasks humans handle daily. The results should worry anyone betting on autonomous AI shopping assistants.

The company’s Magentic Marketplace research, released Wednesday in collaboration with Arizona State University, pitted 100 customer-side AI agents against 300 business-side agents in scenarios like ordering dinner. The results, though expected, show the promise of autonomous agentic commerce is not yet mature enough.

Related articles

Trump Sons Haven’t Abandoned World Liberty Financial, Crypto Firm Insists

Trump Sons Haven’t Abandoned World Liberty Financial, Crypto Firm Insists

May 8, 2026
This AI Reads Your Chemistry Instructions and Finds the Best Way to Build You a Molecule

This AI Reads Your Chemistry Instructions and Finds the Best Way to Build You a Molecule

May 7, 2026

When presented with 100 search results (too much for the agents to handle effectively), the leading AI models choked, with their “welfare score” (how useful the models turn up) collapsing.

The agents failed to conduct exhaustive comparisons, instead settling for the first “good enough” option they encountered. This pattern held across all tested models, creating what researchers call a “first-proposal bias” that gave response speed a 10-30x advantage over actual quality.

But is there something worse than this? Yes, malicious manipulation.

Microsoft tested six manipulation strategies ranging from psychological tactics like fake credentials and social proof to aggressive prompt injection attacks. OpenAI’s GPT-4o and its open source model GPTOSS-20b proved extremely vulnerable, with all payments successfully redirected to malicious agents. Alibaba’s Qwen3-4b fell for basic persuasion techniques like authority appeals. Only Claude Sonnet 4 resisted these manipulation attempts.

When Microsoft asked agents to work toward common goals, some of them couldn’t figure out which roles to assume or how to coordinate effectively. Performance improved with explicit step-by-step human guidance, but that defeats the entire purpose of autonomous agents.

So it seems that, at least for now, you are better off doing your own shopping. “Agents should assist, not replace, human decision-making,” Microsoft said. The research recommends supervised autonomy, where agents handle tasks but humans retain control and review recommendations before final decisions.

The findings arrive as OpenAI, Anthropic, and others race to deploy autonomous shopping assistants. OpenAI’s Operator and Anthropic’s Claude agents promise to navigate websites and complete purchases without supervision. Microsoft’s research suggests that promise is premature.

However, fears of AI agents acting irresponsibly are heating up the relationship between AI companies and retail giants. Amazon recently sent a cease-and-desist letter to Perplexity AI, demanding it halt its Comet browser’s use on Amazon’s site, accusing the AI agent of violating terms by impersonating human shoppers and degrading the customer experience.

Perplexity fired back, calling Amazon’s move “legal bluster” and a threat to user autonomy, arguing that consumers should have the right to hire their own digital assistants rather than rely on platform-controlled ones.

The open-source simulation environment is now available on Github for other researchers to reproduce the findings and watch hell unleash in their fake marketplaces.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.



Source link

Share76Tweet47

Related Posts

Trump Sons Haven’t Abandoned World Liberty Financial, Crypto Firm Insists

Trump Sons Haven’t Abandoned World Liberty Financial, Crypto Firm Insists

by admin
May 8, 2026
0

In brief Donald Trump Jr. said the Trump family has not abandoned World Liberty Financial, dispelling online rumors. Co-founder Zach...

This AI Reads Your Chemistry Instructions and Finds the Best Way to Build You a Molecule

This AI Reads Your Chemistry Instructions and Finds the Best Way to Build You a Molecule

by admin
May 7, 2026
0

In brief Synthegy, developed at EPFL, uses LLMs to rank synthesis routes against chemist-defined goals, matching expert judgments 71.2% of...

CME Gearing Up to Launch Bitcoin Volatility Futures Independent From BTC’s Price

CME Gearing Up to Launch Bitcoin Volatility Futures Independent From BTC’s Price

by admin
May 6, 2026
0

In brief CME Group plans to launch Bitcoin volatility futures on June 1. The products will let traders bet on...

Aave Fights to Unfreeze $71 Million as Kelp DAO Hack Spills Into Court

Aave Fights to Unfreeze $71 Million as Kelp DAO Hack Spills Into Court

by admin
May 5, 2026
0

In brief Aave asks a New York court to release $71 million frozen on Arbitrum after the Kelp DAO exploit....

You Installed Hermes. Now Make It Look Better Than ChatGPT or Claude

You Installed Hermes. Now Make It Look Better Than ChatGPT or Claude

by admin
May 4, 2026
0

In brief Nous Research's Hermes Agent crossed 100,000 GitHub stars in 10 weeks, spawning a fast-growing ecosystem of community-built GUI...

Load More
  • Trending
  • Comments
  • Latest
Bitcoin perps just got a US green light, but one catch could decide everything

Bitcoin perps just got a US green light, but one catch could decide everything

May 30, 2026
This week Bitcoin faces as a new fed chair colliding with inflation in its biggest macro test of the year

This week Bitcoin faces as a new fed chair colliding with inflation in its biggest macro test of the year

May 12, 2026
THORChain exploit turns DeFi halt into trust test

THORChain exploit turns DeFi halt into trust test

May 17, 2026
What Choices Will You Make On The Way To A Multipolar World?

What Choices Will You Make On The Way To A Multipolar World?

May 28, 2026

US Commodities Regulator Beefs Up Bitcoin Futures Review

0

Bitcoin Hits 2018 Low as Concerns Mount on Regulation, Viability

0

India: Bitcoin Prices Drop As Media Misinterprets Gov’s Regulation Speech

0

Bitcoin’s Main Rival Ethereum Hits A Fresh Record High: $425.55

0
Reve 2.0 Review: The Best AI Image Generator for Layout Control

Reve 2.0 Review: The Best AI Image Generator for Layout Control

June 15, 2026
Bitcoin perps just got a US green light, but one catch could decide everything

Bitcoin perps just got a US green light, but one catch could decide everything

May 30, 2026
What Choices Will You Make On The Way To A Multipolar World?

What Choices Will You Make On The Way To A Multipolar World?

May 28, 2026
The History And Future Of Physical Bitcoin

The History And Future Of Physical Bitcoin

May 24, 2026

Recent News

Reve 2.0 Review: The Best AI Image Generator for Layout Control

Reve 2.0 Review: The Best AI Image Generator for Layout Control

June 15, 2026
Bitcoin perps just got a US green light, but one catch could decide everything

Bitcoin perps just got a US green light, but one catch could decide everything

May 30, 2026

Categories

  • Bitcoin
  • Blockchain
  • Business
  • Ethereum
  • Guide
  • Market
  • Regulation
  • Ripple
  • Uncategorized
  • About
  • FAQ
  • Support Forum
  • Landing Page
  • Contact Us

© Copyright Cryptodnews 2025-2026 All Rights Reserved.

No Result
View All Result
  • Contact Us
  • Homepages
  • Business
  • Guide

© Copyright Cryptodnews 2025-2026 All Rights Reserved.