Robot Learning from Any Images

Published in CoRL 2025, 2025

RoLA is a framework that transforms any in-the-wild image into an interactive, physics-enabled robotic environment. It operates directly on a single image without requiring additional hardware or digital assets. RoLA democratizes robotic data generation by producing massive visuomotor robotic demonstrations within minutes from a wide range of image sources.

Recommended citation: Zhao, S., Mao, J., Chow, W., Shangguan, Z., Shi, T., Xue, R., Zheng, Y., Weng, Y., You, Y., Seita, D., Guibas, L., Zakharov, S., Guizilini, V., & Wang, Y. (2025). Robot Learning from Any Images. CoRL 2025.