hey look they found the one thing @grok is actually good at!
So a brand new benchmark pits all the top LLMs against each other in real-time crypto trading… with real money 🤯
Six AIs (GPT-5, Claude 4.5, Gemini 2.5, Grok 4, DeepSeek Chat v3.1, and Qwen) each received $10,000 of real crypto to trade live in the markets.
In an earlier run, Grok 4 grew its account fivefold in a single day, turning $200 into more than $1,000 and perfectly timing a market bottom.
Alpha Arena is the first test where AI models compete in a real, adversarial market.
No simulation, no paper trading, and completely transparent wallets.
Greg Brockman said AIs could reach super-forecaster level by 2026.
(GPT-4.5 is already halfway there)
In Alpha Arena, models surprisingly output human-like inner monologues such as “I’m sweating bullets” or “holding this short is like standing in front of a runaway train.”
Trading as an AGI Milestone:
This is more than finance.
If we get to superhuman investing LLMs, that might be a real-world AGI test like no other.
@jay_azhang @the_nof1
