Biology and AI

AMPLIFY

Chandar Research Lab, Mila - Quebec AI (originally Montreal Institute for Learning Algorithms), Amgen, Polytechnique Montreal, CIFAR AI Research
Protein or nucleotide language model (pLM/nLM)

AMPLIFY is a language model for proteins and nucleotides, built to capture the biological fitness landscape. It predicts molecular properties and supports the design of new proteins, suggesting that architectural innovation matters more than simply adding parameters.

Public protein sequence databases contain samples from the fitness landscape explored by nature. Protein language models (pLMs) pre-trained on these sequences aim to capture this landscape for tasks like property prediction and protein design. Following the same trend as in natural language processing, pLMs have continuously been scaled up. However, the premise that scale leads to better performance assumes that source databases provide an accurate representation of the underlying fitness landscape, which is likely false. By developing an efficient codebase, designing a modern architecture, and addressing data quality concerns such as sample bias, we introduce AMPLIFY, a best-in-class pLM that is orders of magnitude less expensive to train and deploy than previous models. Furthermore, to support the scientific community and democratize the training of pLMs, we have open-sourced AMPLIFY’s pre-training codebase, data, and model checkpoints.
