Яндекс Метрика
Биология и ИИ

ProtENN2

European Bioinformatics Institute,University of Cambridge,Google Research
Protein classification

ProtENN2 — это продвинутая нейросеть для глубокого анализа белков, разработанная Google Research и Кембриджем. Модель использует векторные представления (embeddings) для поиска скрытых сходств между семействами белков, открывая новые горизонты в биоинформатике и структурной биологии.

Motivation In this article, we propose a method for finding similarities between Pfam families based on the pre-trained neural network ProtENN2. We use the model ProtENN2 per-residue embeddings to produce new high-dimensional per-family embeddings and develop an approach for calculating inter-family similarity scores based on these embeddings, and evaluate its predictions using structure comparison. Results We apply our method to Pfam annotation by refining clan membership for Pfam families, suggesting both new members of existing clans and potential new clans for future Pfam releases. We investigate some of the failure modes of our approach, which suggests directions for future improvements. Our method is relatively simple with few parameters and could be applied to other protein family classification models. Overall, our work suggests potential benefits of employing deep learning for improving our understanding of protein family relationships and functions of previously uncharacterized families. Availability and implementation github.com/iponamareva/ProtCNNSim, 10.5281/zenodo.10091909.

Что такое ProtENN2?+
Кто разработал ProtENN2?+
Какие задачи решает ProtENN2?+