Chameleon-34B: A Multimodal Neural Network from Meta (FAIR)

Q: Who developed Chameleon-34B?

Chameleon-34B was developed by Facebook AI Research (United States of America, France).

Q: What tasks does Chameleon-34B handle?

Text generation, vision-language generation, visual question answering, text-to-image

// tasks

Text generation, vision-language generation, visual question answering, text-to-image

// description

Chameleon-34B from Meta is a multimodal AI system that can process and generate text and images in any order within a single sequence. The model uses "early fusion": images and text are both converted to tokens and modeled together in one stream, which makes it well suited to visual question answering and creative content generation.
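As a rough illustration of what token-level early fusion means, the sketch below flattens interleaved text and image segments into one token sequence over a shared id space. It is a toy example under stated assumptions, not Chameleon's actual implementation: the vocabulary sizes, the `BOI`/`EOI` sentinel ids, and the `Text`, `Image`, and `fuse` names are all hypothetical, and the text tokenizer and image quantizer are assumed to have already produced integer ids.

```python
from dataclasses import dataclass
from typing import List, Union

# All sizes and ids below are toy values chosen for illustration,
# not Chameleon's actual configuration.
TEXT_VOCAB_SIZE = 1000                        # hypothetical text vocabulary size
IMAGE_CODEBOOK_SIZE = 256                     # hypothetical number of image codes
BOI = TEXT_VOCAB_SIZE + IMAGE_CODEBOOK_SIZE   # hypothetical "begin image" sentinel
EOI = BOI + 1                                 # hypothetical "end image" sentinel


@dataclass
class Text:
    token_ids: List[int]   # ids already produced by some text tokenizer


@dataclass
class Image:
    codes: List[int]       # codebook indices from some image quantizer


def fuse(segments: List[Union[Text, Image]]) -> List[int]:
    """Flatten interleaved text and image segments into one token sequence."""
    sequence: List[int] = []
    for seg in segments:
        if isinstance(seg, Text):
            sequence.extend(seg.token_ids)
        else:
            # Shift image codes past the text range so both modalities
            # share a single id space without collisions.
            sequence.append(BOI)
            sequence.extend(TEXT_VOCAB_SIZE + c for c in seg.codes)
            sequence.append(EOI)
    return sequence


# A short "document": text, then an image, then more text.
doc = [Text([5, 17]), Image([0, 255, 3]), Text([42])]
tokens = fuse(doc)
print(tokens)  # [5, 17, 1256, 1000, 1255, 1003, 1257, 42]
```

Because the result is a single flat sequence, one autoregressive model can in principle be trained on it and can emit text or image tokens at any position, which is the property the description above refers to.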

// abstract

We present Chameleon, a family of early-fusion token-based mixed-modal models capable of understanding and generating images and text in any arbitrary sequence. We outline a stable training approach from inception, an alignment recipe, and an architectural parameterization tailored for the early-fusion, token-based, mixed-modal setting. The models are evaluated on a comprehensive range of tasks, including visual question answering, image captioning, text generation, image generation, and long-form mixed modal generation. Chameleon demonstrates broad and general capabilities, including state-of-the-art performance in image captioning tasks, outperforms Llama-2 in text-only tasks while being competitive with models such as Mixtral 8x7B and Gemini-Pro, and performs non-trivial image generation, all in a single model. It also matches or exceeds the performance of much larger models, including Gemini Pro and GPT-4V, according to human judgments on a new long-form mixed-modal generation evaluation, where either the prompt or outputs contain mixed sequences of both images and text. Chameleon marks a significant step forward in a unified modeling of full multimodal documents.
