DataRater 1B: smart data filtering from Google DeepMind

Q: Who developed the DataRater test model (1B)?

The DataRater test model (1B) was developed by Google DeepMind (United States of America).

// tasks

Language modeling

// description

DataRater from Google DeepMind is a compact model with 1 billion parameters, built to rate the quality of training data automatically. Instead of relying on manual filtering, it learns to determine which content will be most useful for training future neural networks.
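To make the filtering idea concrete, here is a minimal sketch of how per-example ratings could be used to thin out a batch. It is an illustration only, not DeepMind's implementation: `filter_batch`, `keep_fraction`, and `toy_rater` are hypothetical names, and the toy rater (fraction of alphabetic characters) merely stands in for a learned model.

```python
from typing import Callable, List, Sequence


def filter_batch(
    batch: Sequence[str],
    rate: Callable[[str], float],
    keep_fraction: float = 0.5,
) -> List[str]:
    """Keep the highest-rated fraction of a batch, preserving original order."""
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    if not batch:
        return []
    n_keep = max(1, round(len(batch) * keep_fraction))
    scores = [rate(x) for x in batch]
    # Rank indices by descending score; ties are broken by position.
    ranked = sorted(range(len(batch)), key=lambda i: (-scores[i], i))
    keep = sorted(ranked[:n_keep])
    return [batch[i] for i in keep]


def toy_rater(text: str) -> float:
    """Hypothetical stand-in for a learned rater: share of alphabetic characters."""
    if not text:
        return 0.0
    return sum(ch.isalpha() for ch in text) / len(text)


batch = [
    "The cat sat on the mat.",
    "$$$ 1234 ####",
    "click here >>>>",
    "Gradient descent minimizes a loss.",
]
print(filter_batch(batch, toy_rater, keep_fraction=0.5))
# ['The cat sat on the mat.', 'Gradient descent minimizes a loss.']
```

In a real pipeline the `rate` callable would be the trained rater model, and the kept examples would be passed on to the training step.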

// abstract

The quality of foundation models depends heavily on their training data. Consequently, great efforts have been put into dataset curation. Yet most approaches rely on manual tuning of coarse-grained mixtures of large buckets of data, or filtering by hand-crafted heuristics. An approach that is ultimately more scalable (let alone more satisfying) is to learn which data is actually valuable for training. This type of meta-learning could allow more sophisticated, fine-grained, and effective curation. Our proposed DataRater is an instance of this idea. It estimates the value of training on any particular data point. This is done by meta-learning using ‘meta-gradients’, with the objective of improving training efficiency on held out data. In extensive experiments across a range of model scales and datasets, we find that using our DataRater to filter data is highly effective, resulting in significantly improved compute efficiency.
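The meta-gradient idea in the abstract can be illustrated with a toy scalar problem. The sketch below is not the authors' code and makes several assumptions: the inner model is a single weight `theta` fitted to `y = theta * x`, the rater is a single weight `phi` that scores each example from one hand-made feature, scores are turned into per-batch weights with a softmax, and the meta-gradient is taken through exactly one inner SGD step. All function names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Example = Tuple[float, float, float]  # (x, y, feature seen by the rater)
HeldOut = Tuple[float, float]         # (x, y)


def softmax(scores: Sequence[float]) -> List[float]:
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def inner_step(theta: float, phi: float, batch: Sequence[Example], lr: float) -> float:
    """One SGD step on the rater-weighted loss sum_i w_i * 0.5 * (theta*x_i - y_i)^2."""
    weights = softmax([phi * f for _, _, f in batch])
    grads = [(theta * x - y) * x for x, y, _ in batch]
    return theta - lr * sum(w * g for w, g in zip(weights, grads))


def val_loss(theta: float, held_out: Sequence[HeldOut]) -> float:
    return sum(0.5 * (theta * x - y) ** 2 for x, y in held_out) / len(held_out)


def meta_gradient(theta: float, phi: float, batch: Sequence[Example],
                  held_out: Sequence[HeldOut], lr: float) -> float:
    """d val_loss(inner_step(theta, phi)) / d phi, derived by the chain rule."""
    feats = [f for _, _, f in batch]
    weights = softmax([phi * f for f in feats])
    grads = [(theta * x - y) * x for x, y, _ in batch]
    mean_f = sum(w * f for w, f in zip(weights, feats))
    # Softmax derivative: d w_i / d phi = w_i * (f_i - mean_f).
    dtheta_dphi = -lr * sum(w * (f - mean_f) * g
                            for w, f, g in zip(weights, feats, grads))
    theta_new = theta - lr * sum(w * g for w, g in zip(weights, grads))
    dval_dtheta = sum((theta_new * x - y) * x for x, y in held_out) / len(held_out)
    return dval_dtheta * dtheta_dphi


def meta_train(batch: Sequence[Example], held_out: Sequence[HeldOut],
               steps: int = 200, lr: float = 0.1, meta_lr: float = 0.5
               ) -> Tuple[float, float]:
    """Alternate a rater update (outer) with a model update (inner)."""
    theta, phi = 0.0, 0.0
    for _ in range(steps):
        phi -= meta_lr * meta_gradient(theta, phi, batch, held_out, lr)
        theta = inner_step(theta, phi, batch, lr)
    return theta, phi


# Clean examples follow y = 2x (feature 1.0); corrupted ones follow y = -2x (feature 0.0).
train = [(1.0, 2.0, 1.0), (2.0, 4.0, 1.0), (1.0, -2.0, 0.0), (2.0, -4.0, 0.0)]
held_out = [(1.0, 2.0), (3.0, 6.0)]
theta, phi = meta_train(train, held_out)
print(f"theta={theta:.3f}, phi={phi:.3f}")
```

In this setup the held-out loss falls when clean examples get more weight, so the meta-gradient pushes `phi` upward and the rater learns to prefer examples with feature 1.0. A learned rater at realistic scale would replace the hand-made feature with a model over the raw data and would typically rely on automatic differentiation rather than a hand-derived gradient.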
