Mistral Magistral Medium 1.1: AI for Complex Reasoning

Q: Who developed Magistral Medium 1.1?

Magistral Medium 1.1 was developed by Mistral AI (France).

Q: What tasks does Magistral Medium 1.1 handle?

Text generation, question answering, quantitative reasoning, code generation, machine translation

// tasks

Text generation, Question answering, Quantitative reasoning, Code generation, Machine translation

// description

Magistral Medium 1.1 is the first dedicated reasoning model from Mistral AI, built with the company's own reinforcement learning (RL) training pipeline. It shows strong capabilities in code writing, mathematical computation, and complex logical inference, pushing the boundaries of what LLMs can do.

// abstract

We introduce Magistral, Mistral’s first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a simple method to force the reasoning language of the model, and show that RL on text data alone maintains most of the initial checkpoint’s capabilities. We find that RL on text maintains or improves multimodal understanding, instruction following and function calling. We present Magistral Medium, trained for reasoning on top of Mistral Medium 3 with RL alone, and we open-source Magistral Small (Apache 2.0) which further includes cold-start data from Magistral Medium.

// similar models