OCTAVE 8B from Hume is an advanced engine that turns text into lifelike speech with a distinct character. From just a 5-second sample or a text description, the AI generates not merely a voice but a full digital personality with expressive emotions and intonation.
We’re introducing OCTAVE (Omni-Capable Text and Voice Engine), a next-generation speech-language model that combines the capabilities of our EVI 2 speech-language model with those of systems like OpenAI’s Voice Engine, ElevenLabs’ TTS Voice Design, and Google DeepMind’s NotebookLM. From descriptive prompts or recordings as brief as 5 seconds, OCTAVE generates not just voices but personalities (language, accent, expressions, underlying disposition, etc.) that can talk to you. It can also generate multiple interacting AI personalities and voices within a single real-time response. Because it maintains the capabilities of a similar-sized frontier LLM, OCTAVE is well-suited to power AI systems that communicate richly with humans while following detailed instructions, using tools, or controlling an interface.