T0-XXL: Языковая модель ИИ для Zero-Shot задач

Q: Кто разработал T0-XXL?

Модель T0-XXL разработана компанией Hugging Face,Brown University (United States of America,United States of America).

// задачи

Языковое моделирование

// описание

T0-XXL — это мощная языковая модель от Hugging Face, которая выводит zero-shot обучение на новый уровень. Благодаря обучению на огромном наборе разнообразных задач, этот ИИ способен мгновенно адаптироваться к новым вызовам без дополнительной настройки.

// abstract

Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale, we develop a system for easily mapping any natural language tasks into a human-readable prompted form. We convert a large set of supervised datasets, each with multiple prompts with diverse wording. These prompted datasets allow for benchmarking the ability of a model to perform completely held-out tasks. We fine-tune a pretrained encoder-decoder model (Raffel et al., 2020; Lester et al., 2021) on this multitask mixture covering a wide variety of tasks. The model attains strong zero-shot performance on several standard datasets, often outperforming models up to 16x its size. Further, our approach attains strong performance on a subset of tasks from the BIG-bench benchmark, outperforming models up to 6x its size. All trained models are available at this https URL and all prompts are available at this https URL.

// faq

Что такое T0-XXL?+

Кто разработал T0-XXL?+

Какие задачи решает T0-XXL?+

// похожие модели