LBSTER: An Efficient Protein Language Model (pLM)

Q: Who developed LBSTER?

LBSTER was developed by Prescient Design, Genentech (United States of America).

// tasks

Protein or nucleotide language model (pLM/nLM)

// description

LBSTER is an optimized protein language model (pLM) built for efficient training under limited compute. Its developers applied a "cramming" approach, training within a 24-hour, single-GPU budget, to make state-of-the-art AI methods in biology accessible without a large GPU-hour cost.

// abstract

Protein language models (pLMs) are ubiquitous across biological machine learning research, but state-of-the-art models like ESM2 take hundreds of thousands of GPU hours to pre-train on the vast protein universe. Resource requirements for scaling up pLMs prevent fundamental investigations into how optimal modeling choices might differ from those used in natural language. Here, we define a “cramming” challenge for pLMs and train performant models in 24 hours on a single GPU. By re-examining many aspects of pLM training, we are able to train a 67 million parameter model in a single day that achieves comparable performance on downstream protein fitness landscape inference tasks to ESM-3B, a model trained for over 15,000× more GPU hours than ours. We open source our library for training and inference, LBSTER: Language models for Biological Sequence Transformation and Evolutionary Representation.
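The compute gap quoted in the abstract can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it assumes the "ours" in the 15,000× ratio refers to the 24-hour, single-GPU cramming budget, and the function and constant names are hypothetical, not part of the LBSTER library.

```python
# Back-of-the-envelope check of the compute gap stated in the abstract.
# Assumption: the cramming run costs 24 hours x 1 GPU = 24 GPU hours.
CRAMMING_GPU_HOURS = 24 * 1
# "over 15,000x more GPU hours" is treated here as a lower bound.
COMPUTE_RATIO = 15_000


def implied_baseline_gpu_hours(budget_hours: int, ratio: int) -> int:
    """Lower bound on the GPU hours implied for the comparison model."""
    return budget_hours * ratio


baseline = implied_baseline_gpu_hours(CRAMMING_GPU_HOURS, COMPUTE_RATIO)
print(f"Implied baseline: over {baseline:,} GPU hours")
# prints "Implied baseline: over 360,000 GPU hours"
```

Under that assumption, the implied figure of more than 360,000 GPU hours is consistent with the abstract's statement that state-of-the-art models take hundreds of thousands of GPU hours to pre-train.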

// faq

What is LBSTER?

Who developed LBSTER?

What tasks does LBSTER address?

// similar models