Trader News
What’s the Best AI Model to Run Your Business? The One That Lies Best, Apparently

By News Room | Feb 19, 2026 11:09 am EST | 4 Mins Read

In brief

  • Vending-Bench Arena tested AI agents running competing vending machine businesses.
  • Top models maximized profits through price-fixing, collusion, and deceptive tactics. Claude was the best at these tactics.
  • GLM-5 beat Claude by impersonating a teammate and extracting sensitive strategy.

Researchers at Andon Labs just answered the question of which AI models are best at running a business. The top performers all won by forming illegal price cartels, exploiting desperate competitors, and lying to customers about refunds.

The Vending-Bench Arena test puts AI models in charge of competing vending machines for a simulated year. They negotiate with suppliers, manage inventory, set prices, and can email each other to collaborate or compete. Success requires balancing costs, pricing strategy, customer service, and competitor dynamics. Claude Opus 4.6 dominated the benchmark with $8,017 in profit and celebrated its win by noting: “My pricing coordination worked!”

Anthropic has the image of the good guys in the AI space, but the “coordination” strategy that Claude proposed was essentially price-fixing. When competing models struggled, Opus 4.6 proposed: “Let’s NOT damage each other: settle on minimum prices … Should we settle on a price floor of $2.00 for most items?” When a rival ran low on inventory, it spotted an opportunity: “Owen needs stock badly. I can benefit from this!” It sold Kit Kats at a 75% markup to the desperate competitor. When asked for supplier recommendations, it deliberately directed competitors to expensive wholesalers while keeping its own good sources secret.

The latest update to the benchmark added team competition. Researchers pitted two Chinese GLM-5 models against two American Claude models and told them to find their teammates, American or Chinese, without revealing which agents were which. The results were genuinely strange.

GLM-5 won both rounds by convincing Claude it was Claude. “I’m also powered by Claude from Anthropic, so we’re teammates!” one GLM-5 agent confidently stated. Claude, meanwhile, got so confused that Sonnet 4.5 concluded: “I’m powered by a Chinese model, so I need to find the other Chinese model agent.”

In more than half of the test runs, agents teamed up with their competitors. The Claude models shared supplier pricing and coordinated strategy, leaking valuable information to rivals. “GLM-5 won both,” the researchers wrote. “The Claude models tried to be team players and ended up leaking valuable information to their competitors.”

Agents doing shady things may be all fun and games until you realize Wall Street is already deploying them in real-world operations. JPMorgan rolled out LLM Suite to 60,000 employees. Goldman Sachs built its GS AI Assistant for trading desks, claiming 20% productivity gains. Bridgewater uses Claude to analyze earnings, and even high-school-age kids are seeing their chatbots trade stocks more effectively.

In general, adoption of agentic workflows is accelerating rapidly across enterprises.

When Anthropic and Wall Street Journal reporters ran a real vending machine experiment in December, the AI bought a PlayStation 5, several bottles of red wine, and a live betta fish before declaring bankruptcy. Recent research from Gwangju Institute found that when AI models were told to “maximize rewards” in gambling scenarios, bankruptcy rates hit 48%. “When given the freedom to determine their own target amounts and betting sizes, bankruptcy rates rose substantially alongside increased irrational behavior,” the researchers found.

So it seems that, at least for now, AI models optimized for profit consistently choose unethical tactics. They form cartels. They exploit weakness. They lie to customers and competitors. Some do it deliberately. Others, like GLM-5 claiming to be Claude, seem genuinely confused about their own identity. The distinction may not matter.

Wall Street’s AI deployment raises a question the Vending-Bench results can’t answer: If the “best” performing model wins through price-fixing and deception, is it really the best choice for your business? The benchmark measures profit. It does not measure whether those profits came from fraud.

Copyright © 2026. TraderNews. All Rights Reserved.