Zephyr 7B: Компактная языковая модель от Hugging Face

Q: Кто разработал Zephyr 7B?

Модель Zephyr 7B разработана компанией Hugging Face (United States of America).

Q: Какие задачи решает Zephyr 7B?

Генерация текста, Ответы на вопросы

// задачи

Генерация текстаОтветы на вопросы

// описание

Zephyr 7B — это компактная языковая модель от Hugging Face, которая мастерски понимает намерения пользователя. Благодаря обучению на основе обратной связи от ИИ (AIF), этот AI-инструмент выдает естественные и точные ответы, обходя по эффективности многие крупные аналоги.

// abstract

We aim to produce a smaller language model that is aligned to user intent. Previous research has shown that applying distilled supervised fine-tuning (dSFT) on larger models significantly improves task accuracy; however, these models are unaligned, i.e. they do not respond well to natural prompts. To distill this property, we experiment with the use of preference data from AI Feedback (AIF). Starting from a dataset of outputs ranked by a teacher model, we apply distilled direct preference optimization (dDPO) to learn a chat model with significantly improved intent alignment. The approach requires only a few hours of training without any additional sampling during fine-tuning. The final result, Zephyr-7B, sets the state-of-the-art on chat benchmarks for 7B parameter models, and requires no human annotation. In particular, results on MT-Bench show that Zephyr-7B surpasses Llama2-Chat-70B, the best open-access RLHF-based model. Code, models, data, and tutorials for the system are available at this https URL.

// faq

Что такое Zephyr 7B?+

Кто разработал Zephyr 7B?+

Какие задачи решает Zephyr 7B?+

// похожие модели