Jurassic-1-Jumbo — это колоссальная языковая модель на 178 миллиардов параметров, созданная как прямой конкурент GPT-3. Этот ИИ превосходит многие аналоги в задачах zero-shot обучения, обеспечивая невероятно естественную генерацию текста и работу чат-ботов.
Jurassic-1 is a pair of auto-regressive language models recently released by AI21 Labs, consisting of J1-Jumbo, a 178B-parameter model, and J1-Large, a 7B-parameter model. We describe their architecture and training, and evaluate their performance relative to GPT-3. The evaluation is in terms of perplexity, as well as zero-shot and few-shot learning. To that end, we developed a zeroshot and few-shot test suite, which we made publicly available (https://github.com/ai21labs/ lm-evaluation) as a shared resource for the evaluation of mega language models.