π0.6 (pi-0.6): Революция в управлении роботами через ИИ

Q: Кто разработал π0.6 (pi-0.6)?

Модель π0.6 (pi-0.6) разработана компанией Physical Intelligence (United States of America).

// задачи

Robotic manipulation

// описание

Робототехническая модель π0.6 (pi-0.6) использует инновационный метод RECAP для обучения роботов через взаимодействие с реальным миром. Этот ИИ класса VLA (Vision-Language-Action) умеет самосовершенствоваться, объединяя визуальные данные и текстовые команды для выполнения сложнейших манипуляций.

// abstract

We study how vision-language-action (VLA) models can improve through real-world deployments via reinforcement learning (RL). We present a general-purpose method, RL with Experience and Corrections via Advantage-conditioned Policies (RECAP), that provides for RL training of VLAs via advantage conditioning. Our method incorporates heterogeneous data into the self-improvement process, including demonstrations, data from on-policy collection, and expert teleoperated interventions provided during autonomous execution. RECAP starts by pretraining a generalist VLA with offline RL, which we call π∗0.6, that can then be specialized to attain high performance on downstream tasks through on-robot data collection. We show that the π∗0.6 model trained with the full RECAP method can fold laundry in real homes, reliably assemble boxes, and make espresso drinks using a professional espresso machine. On some of the hardest tasks, RECAP more than doubles task throughput and roughly halves the task failure rate.

// faq

Что такое π0.6 (pi-0.6)?+

Кто разработал π0.6 (pi-0.6)?+

Какие задачи решает π0.6 (pi-0.6)?+

// похожие модели

π0.7 (pi-0.7)

Physical Intelligence