Mistral just launched their... | Artificial Analysis OKX Feed

Mistral just launched their new large open weights model, Mistral Large 3 (675B total, 41B active), alongside a set of three Ministral models (3B, 8B, 14B) Mistral has released Instruct (non-reasoning) variants of all four models, as well as reasoning variants of the three Ministral models. All models support multimodal inputs and are available with an Apache 2.0 license today on @huggingface. We evaluated Mistral Large 3 and the Instruct variants of the three Ministral models prior to launch. Mistral’s highest scoring model in Artificial Analysis Intelligence Index remains the proprietary Magistral Medium 1.2, launched a couple of months back in September - this is due to reasoning giving models a significant advantage in many evals we use. Mistral discloses that a reasoning version of Mistral Large 3 is already in training and we look forward to evaluating it soon! Key highlights: ➤ Large and small models: at 675B total with 41B active, Mistral Large 3 is Mistral’s first open weights mixture-of-experts model since Mixtral 8x7B and 8x22B in late 2023 to early 2024. The Ministral releases are dense with 3B, 8B, and 14B parameter variants ➤ Significant intelligence increase but not amongst leading models (including proprietary): Mistral Large 3 represents a significant upgrade compared to the previous Mistral Large 2 with a +11 point increase on the Intelligence Index up to 38. However, Large 3 still trails leading proprietary reasoning & non-reasoning models ➤ Versatile small models: the Ministral models are released with Base, Instruct, and Reasoning variant weights - we tested only the Instruct variants ahead of release, which achieved Index scores of 31 (14B), 28 (8B), and 22 (3B). This places Ministral 14B ahead of the previous Mistral Small 3.2 with 40% fewer parameters. We are working on evaluating the reasoning variants and will share their intelligence results soon. ➤ Multi-modal capabilities: all models in the release support text and image inputs - this is a significant differentiator for Mistral Large 3, as few open weight models in its size class have support for image input. Context length also increases to 256k, enabling larger-input tasks. These new models from Mistral are not a step change from open weights competition, but they represent a strong performance base with vision capabilities. The Ministral 8B and 14B variants offer particularly compelling performance for their size, and we’re excited to see how the community uses and builds on these models. At launch, the new models are available for serverless inference on @MistralAI and a range of other providers including @awscloud Bedrock, @Azure AI Foundry, @IBMwatsonx, @FireworksAI_HQ, @togethercompute, and @modal.

11,55 rb

Konten pada halaman ini disediakan oleh pihak ketiga. Kecuali dinyatakan lain, OKX bukanlah penulis artikel yang dikutip dan tidak mengklaim hak cipta atas materi tersebut. Konten ini disediakan hanya untuk tujuan informasi dan tidak mewakili pandangan OKX. Konten ini tidak dimaksudkan sebagai dukungan dalam bentuk apa pun dan tidak dapat dianggap sebagai nasihat investasi atau ajakan untuk membeli atau menjual aset digital. Sejauh AI generatif digunakan untuk menyediakan ringkasan atau informasi lainnya, konten yang dihasilkan AI mungkin tidak akurat atau tidak konsisten. Silakan baca artikel yang terkait untuk informasi lebih lanjut. OKX tidak bertanggung jawab atas konten yang dihosting di situs pihak ketiga. Kepemilikan aset digital, termasuk stablecoin dan NFT, melibatkan risiko tinggi dan dapat berfluktuasi secara signifikan. Anda perlu mempertimbangkan dengan hati-hati apakah trading atau menyimpan aset digital sesuai untuk Anda dengan mempertimbangkan kondisi keuangan Anda.