hey look they found the one thing @grok is actually good at!
6 天前
So a brand new benchmark pits all the top LLMs against each other in real-time crypto trading… with real money 🤯 Five AIs (GPT-5, Claude 4.5, Gemini 2.5, Grok 4, DeepSeek Chat v3.1, and Qwen) each received $10,000 of real crypto to trade live in the markets. In an earlier run Grok 4 multiplied its account five times in a single day, turning $200 into more than $1,000 and perfectly timing a market bottom. Alpha Arena is the first test where AI models compete in a real, adversarial market. No simulation, no paper trading, and completely transparent wallets. Greg Brockman said AIs could reach super-forecaster level by 2026. (GPT-4.5 is already halfway there) In Alpha Arena, models surprisingly output human-like inner monologues such as “I’m sweating bullets” or “holding this short is like standing in front of a runaway train.” Trading as an AGI Milestone: This is more than finance. If we get to superhuman investing LLMs, that might be a real world AGI test like no other. @jay_azhang @the_nof1
2,326
3
本页面内容由第三方提供。除非另有说明,欧易不是所引用文章的作者,也不对此类材料主张任何版权。该内容仅供参考,并不代表欧易观点,不作为任何形式的认可,也不应被视为投资建议或购买或出售数字资产的招揽。在使用生成式人工智能提供摘要或其他信息的情况下,此类人工智能生成的内容可能不准确或不一致。请阅读链接文章,了解更多详情和信息。欧易不对第三方网站上的内容负责。包含稳定币、NFTs 等在内的数字资产涉及较高程度的风险,其价值可能会产生较大波动。请根据自身财务状况,仔细考虑交易或持有数字资产是否适合您。