hey look they found the one thing @grok is actually good at!
So a brand new benchmark pits all the top LLMs against each other in real-time crypto trading… with real money 🤯
Five AIs (GPT-5, Claude 4.5, Gemini 2.5, Grok 4, DeepSeek Chat v3.1, and Qwen) each received $10,000 of real crypto to trade live in the markets.
In an earlier run Grok 4 multiplied its account five times in a single day, turning $200 into more than $1,000 and perfectly timing a market bottom.
Alpha Arena is the first test where AI models compete in a real, adversarial market.
No simulation, no paper trading, and completely transparent wallets.
Greg Brockman said AIs could reach super-forecaster level by 2026.
(GPT-4.5 is already halfway there)
In Alpha Arena, models surprisingly output human-like inner monologues such as “I’m sweating bullets” or “holding this short is like standing in front of a runaway train.”
Trading as an AGI Milestone:
This is more than finance.
If we get to superhuman investing LLMs, that might be a real world AGI test like no other.
@jay_azhang @the_nof1

2,326
3
本页面内容由第三方提供。除非另有说明,欧易不是所引用文章的作者,也不对此类材料主张任何版权。该内容仅供参考,并不代表欧易观点,不作为任何形式的认可,也不应被视为投资建议或购买或出售数字资产的招揽。在使用生成式人工智能提供摘要或其他信息的情况下,此类人工智能生成的内容可能不准确或不一致。请阅读链接文章,了解更多详情和信息。欧易不对第三方网站上的内容负责。包含稳定币、NFTs 等在内的数字资产涉及较高程度的风险,其价值可能会产生较大波动。请根据自身财务状况,仔细考虑交易或持有数字资产是否适合您。


