Llama 3.1-8B: быстрая и эффективная ИИ-модель от Meta

Q: Кто разработал Llama 3.1-8B?

Модель Llama 3.1-8B разработана компанией Meta AI (United States of America).

Q: Какие задачи решает Llama 3.1-8B?

Генерация текста

// задачи

Генерация текста

// описание

Llama 3.1-8B — самая компактная и быстрая нейросеть в обновленной линейке Meta, оптимизированная для работы на локальных устройствах. Несмотря на малый размер, этот AI отлично справляется с рассуждениями и использованием внешних инструментов, обеспечивая высокую производительность.

// abstract

Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.

// faq

Что такое Llama 3.1-8B?+

Кто разработал Llama 3.1-8B?+

Какие задачи решает Llama 3.1-8B?+

// похожие модели