Яндекс Метрика
Мультимодальная модель, Языковая модель, Компьютерное зрение

o4-mini

OpenAI
Генерация текстаSearchОтветы на вопросыКоличественные рассужденияЧат-ботМашинный переводГенерация кодаВизуальные ответы на вопросыСледование инструкциямVisual puzzles

o4-mini — новейшая мультимодальная модель от OpenAI, способная к глубоким рассуждениям перед ответом. Этот мощный ИИ-агент мастерски использует инструменты ChatGPT, справляясь с программированием, математикой и сложными визуальными головоломками.

Today, we’re releasing OpenAI o3 and o4-mini, the latest in our o-series of models trained to think for longer before responding. These are the smartest models we’ve released to date, representing a step change in ChatGPT's capabilities for everyone from curious users to advanced researchers. For the first time, our reasoning models can agentically use and combine every tool within ChatGPT—this includes searching the web, analyzing uploaded files and other data with Python, reasoning deeply about visual inputs, and even generating images. Critically, these models are trained to reason about when and how to use tools to produce detailed and thoughtful answers in the right output formats, typically in under a minute, to solve more complex problems. This allows them to tackle multi-faceted questions more effectively, a step toward a more agentic ChatGPT that can independently execute tasks on your behalf. The combined power of state-of-the-art reasoning with full tool access translates into significantly stronger performance across academic benchmarks and real-world tasks, setting a new standard in both intelligence and usefulness. <..> OpenAI o4-mini is a smaller model optimized for fast, cost-efficient reasoning—it achieves remarkable performance for its size and cost, particularly in math, coding, and visual tasks. It is the best-performing benchmarked model on AIME 2024 and 2025. Although access to a computer meaningfully reduces the difficulty of the AIME exam, we also found it notable that o4-mini achieves 99.5% pass@1 (100% consensus@8) on AIME 2025 when given access to a Python interpreter. While these results should not be compared to the performance of models without tool access, they are one example of how effectively o4-mini leverages available tools; o3 shows similar improvements on AIME 2025 from tool use (98.4% pass@1, 100% consensus@8).

Что такое o4-mini?+
Кто разработал o4-mini?+
Какие задачи решает o4-mini?+