Open X-Embodiment: Robotic Learning Datasets and RT-X Models
AI-generated Key Points
- Shift towards using large, high-capacity models trained on diverse datasets in robotics
- Development of pretrained models serving as general backbones for different robotic tasks
- Growing interest in developing generalist X-robot policies adaptable to new robots, tasks, and environments
- Recent study assembling a dataset from 22 different robots demonstrating 527 skills across 160266 tasks
- Training of high-capacity model RT-X on this data showing positive transferability among multiple robots
- Consideration of two model architectures in experiments: RT-1 (Transformer-based) and RT-2 (vision-language model)
- Both models take visual input and natural language instructions to output tokenized actions
- Exploration of how X-embodiment training can enhance the performance of learned policies on individual robots
- Research aims to explore benefits of consolidating pretrained models in robotics for efficient adaptation across various platforms
- Experimental results showcasing effective X-robot policies with positive transferability and improved performance across multiple robotic platforms
Authors: Dorsa Sadigh, Danny Driess, Fei Xia, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Yevgen Chebotar, Pierre Sermanet, Sergey Levine, Vincent Vanhoucke, Karol Hausman, Igor Mordatch, Dinesh Jayaraman, Wenxuan Zhou, Nicolas Heess, Tianli Ding, Trevor Darrell, Chelsea Finn, Yifan Zhou, Deepak Pathak, Bernhard Schölkopf, Tony Z. Zhao, Vikash Kumar, Wolfram Burgard, Xinyang Geng, Yunfan Jiang, Pieter Abbeel, Guanzhi Wang, Ajay Mandlekar, Yuke Zhu, Jitendra Malik, Chen Wang, Jiajun Wu, Jeffrey Wu, Hao Su, Rafael Rafailov, Archit Sharma, Ken Goldberg, Jaehyung Kim, Ning Liu, Kevin Lin, Xiaolong Wang, Xi Chen, Zhuo Xu, Peng Xu, Michael C. Yip, Mateo Guaman Castro, Danfei Xu, Soroush Nasiriany, Li Fei-Fei, Yutaka Matsuo, Yusuke Iwasawa, Abdul Rehman, Xuanlin Li, Animesh Garg, Yilin Wu, Abhishek Gupta, Yecheng Jason Ma, Osbert Bastani, Ted Xiao, Jacky Liang, Felipe Vieira Frujeri, Dhruv Shah, Todor Davchev, Stefan Schaal, Subramanian Ramamoorthy, Agrim Gupta, Mingyu Ding, Jiankai Sun, Chenfeng Xu, Masayoshi Tomizuka, Wei Zhan, Zipeng Fu, Glen Berseth, Jeannette Bohg, Lisa Lee, Nur Muhammad Mahi Shafiullah, Anant Rai, Lerrel Pinto, Shikhar Bahl, Russell Mendonca, Yonatan Bisk, Yujin Tang, Kyle Hsu, Siddharth Karamcheti, Suvir Mirchandani, Suraj Nair, Krishnan Srinivasan, Annie Xie, Sean Kirmani, Jimmy Wu, Shuran Song, Cheng Chi, Anthony Brohan, Chuyuan Fu, Keerthana Gopalakrishnan, Jasmine Hsu, Alex Irpan, Ryan Julian, Dmitry Kalashnikov, Isabel Leal, Yao Lu, Karl Pertsch, Kanishka Rao, Anikait Singh, Stefan Welker, Paul Wohlhart, Jialin Wu, Sichun Xu, Ilija Radosavovic, Yue Cao, Abhinav Gupta, Yixuan Wang, Cewu Lu, Hiroki Furuta, Open X-Embodiment Collaboration, Abby O'Neill, Abhiram Maddukuri, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alexander Khazatsky, Anchit Gupta, Andrew Wang, Andrey Kolobov, Aniruddha Kembhavi, Antonin Raffin, Arefeh Yavary, Arhan Jain, Ashwin Balakrishna, Ben Burgess-Limerick, Beomjoon Kim, Blake Wulfe, Charles Xu, Charlotte Le, Chenguang Huang, Christine Chan, Christopher Agia, Chuer Pan, Coline Devin, Daniel Morton, Daphne Chen, Dieter Büchler, Edward Johns, Ethan Foster, Fangchen Liu, Federico Ceola, Feiyu Zhao, Freek Stulp, Gaoyue Zhou, Gaurav S. Sukhatme, Gautam Salhotra, Ge Yan, Gilbert Feng, Giulio Schiavi, Gregory Kahn, Guangwen Yang, Hao-Shu Fang, Haochen Shi, Henghui Bao, Heni Ben Amor, Henrik I Christensen, Homanga Bharadhwaj, Homer Walke, Hongjie Fang, Huy Ha, Jad Abou-Chakra, Jaimyn Drake, Jan Peters, Jan Schneider, Jay Vakil, Jeffrey Bingham, Jensen Gao, Jiaheng Hu, Jianlan Luo, Jiayuan Gu, Jie Tan, Jihoon Oh, Jingpei Lu, Jingyun Yang, João Silvério, Joey Hejna, Jonathan Booher, Jonathan Yang, Jordi Salvador, Joseph J. Lim, Junhyek Han, Kaiyuan Wang, Keegan Go, Kendra Byrne, Kenneth Oslund, Kento Kawaharazuka, Kevin Black, Kevin Zhang, Kiana Ehsani, Kiran Lekkala, Kirsty Ellis, Krishan Rana, Kuan Fang, Kunal Pratap Singh, Kuo-Hao Zeng, Kyle Hatch, Laurent Itti, Lawrence Yunliang Chen, Liam Tan, Linxi "Jim" Fan, Lionel Ott, Luca Weihs, Magnum Chen, Marion Lepert, Marius Memmel, Masha Itkina, Max Spero, Maximilian Du, Michael Ahn, Mingtong Zhang, Minho Heo, Mohan Kumar Srirama, Mohit Sharma, Moo Jin Kim, Naoaki Kanazawa, Nicklas Hansen, Nikhil J Joshi, Niko Suenderhauf, Norman Di Palo, Oier Mees, Oliver Kroemer, Pannag R Sanketi, Patrick "Tree" Miller, Patrick Yin, Peter David Fagan, Peter Mitrano, Priya Sundaresan, Qiuyu Chen, Ran Tian, Ria Doshi, Roberto Mart'in-Mart'in, Rohan Baijal, Rosario Scalise, Rose Hendrix, Roy Lin, Runjia Qian, Ruohan Zhang, Rutav Shah, Ryan Hoque, Samuel Bustamante, Shan Lin, Sherry Moore, Shivin Dass, Shubham Sonawani, Shubham Tulsiani, Siddhant Haldar, Simeon Adebola, Simon Guist, Stephen Tian, Sudeep Dasari, Suneel Belkhale, Sungjae Park, Takayuki Osa, Tanmay Gupta, Tatsuya Harada, Tatsuya Matsushima, Thomas Kollar, Travis Armstrong, Trinity Chung, Vidhi Jain, Xiangyu Chen, Xinghao Zhu, Xiyuan Liu, Xu Liangwei, Yansong Pang, Yejin Kim, Yifeng Zhu, Ying Xu, Yongqiang Dou, Yoonyoung Cho, Youngwoon Lee, Yuchen Cui, Yueh-Hua Wu, Yunchu Zhang, Yunshuang Li, Yunzhu Li, Zehan Ma, Zichen Jeff Cui, Zichen Zhang, Zipeng Lin
Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning methods train a separate model for every application, every robot, and even every environment. Can we instead train generalist X-robot policy that can be adapted efficiently to new robots, tasks, and environments? In this paper, we provide datasets in standardized data formats and models to make it possible to explore this possibility in the context of robotic manipulation, alongside experimental results that provide an example of effective X-robot policies. We assemble a dataset from 22 different robots collected through a collaboration between 21 institutions, demonstrating 527 skills (160266 tasks). We show that a high-capacity model trained on this data, which we call RT-X, exhibits positive transfer and improves the capabilities of multiple robots by leveraging experience from other platforms. More details can be found on the project website https://robotics-transformer-x.github.io.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.