DeepSeek-V3.1: гибридная ИИ-модель с режимом мышления

Q: Кто разработал DeepSeek-V3.1?

Модель DeepSeek-V3.1 разработана компанией DeepSeek (China).

Q: Какие задачи решает DeepSeek-V3.1?

Генерация текста, Генерация кода, Количественные рассуждения, Ответы на вопросы, Search, Управление системами, Следование инструкциям

// задачи

Генерация текстаГенерация кодаКоличественные рассужденияОтветы на вопросыSearchУправление системамиСледование инструкциям

// описание

Гибридная нейросеть, которая позволяет переключаться между быстрым ответом и режимом глубоких размышлений. DeepSeek-V3.1 значительно прибавила в качестве работы с инструментами и ИИ-агентами, показывая отличные результаты в математике и программировании.

// abstract

DeepSeek-V3.1 is a hybrid model that supports both thinking mode and non-thinking mode. Compared to the previous version, this upgrade brings improvements in multiple aspects: Hybrid thinking mode: One model supports both thinking mode and non-thinking mode by changing the chat template. Smarter tool calling: Through post-training optimization, the model's performance in tool usage and agent tasks has significantly improved. Higher thinking efficiency: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly. DeepSeek-V3.1 is post-trained on the top of DeepSeek-V3.1-Base, which is built upon the original V3 base checkpoint through a two-phase long context extension approach, following the methodology outlined in the original DeepSeek-V3 report. We have expanded our dataset by collecting additional long documents and substantially extending both training phases. The 32K extension phase has been increased 10-fold to 630B tokens, while the 128K extension phase has been extended by 3.3x to 209B tokens.

// faq

Что такое DeepSeek-V3.1?+

Кто разработал DeepSeek-V3.1?+

Какие задачи решает DeepSeek-V3.1?+

// похожие модели