Яндекс Метрика
Мультимодальная модель, Языковая модель, Видео, Компьютерное зрение

SenseNova V6

SenseTime
Генерация текстаКоличественные рассужденияОтветы на вопросыВизуальные ответы на вопросыОписание видеоCharacter recognition (OCR)Генерация кодаЧат-бот

SenseNova V6 от SenseTime устанавливает новые стандарты в мультимодальных вычислениях, используя продвинутые цепочки рассуждений (CoT) и глобальную память. Эта AI-система мастерски справляется с анализом видео, сложным OCR и количественными задачами, обеспечивая человекоподобный уровень взаимодействия.

HONG KONG, April 12, 2025 /PRNewswire/ -- SenseTime launched its newly upgraded large model series, SenseNova V6, at its Tech Day event held in several locations, including Shanghai and Shenzhen. Leveraging advances in the training of multimodal long chain-of-thought (CoT), global memory, and reinforcement learning, the model delivers industry-leading multimodal reasoning capabilities while setting a new benchmark for cost efficiency. The capabilities of the SenseNova V6 model have been greatly enhanced, with strong advantages in long CoT, reasoning, mathematical capabilities, and global memory. Its multimodal reasoning capabilities ranked first in China when benchmarked against GPT-o1, while its data analysis performance outpaced GPT-4o. It also combines high performance with cost efficiency. Its multimodal training efficiency is aligned with that of language models, providing the lowest training costs in the industry. Its reasoning costs are also the lowest in the industry. The new lightweight full-modal interactive model, SenseNova V6 Omni, delivers the most advanced multimodal interactive capabilities in China. It is China's first large model that supports in-depth analysis of 10-minute mid-to-long form videos, benchmarked against Gemini 2.5 Turbo to be among the strongest in its class

Что такое SenseNova V6?+
Кто разработал SenseNova V6?+
Какие задачи решает SenseNova V6?+