Яндекс Метрика
Биология и ИИ

ESM2-3B

Meta AI,New York University (NYU),Stanford University,Massachusetts Institute of Technology (MIT)
ProteinsProtein or nucleotide language model (pLM/nLM)Protein folding prediction

ESM2-3B представляет собой сбалансированную версию нейросети от Meta AI, предназначенную для глубокого анализа биологических последовательностей. Модель использует 3 миллиарда параметров, чтобы эффективно декодировать «язык» белков и предсказывать их трехмерную форму.

"Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins."

Что такое ESM2-3B?+
Кто разработал ESM2-3B?+
Какие задачи решает ESM2-3B?+