Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

AI-generated keywords: Robotics

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Imitation learning from human demonstrations has shown remarkable performance in table-top manipulation tasks
  • Most existing results focus on table-top manipulation and lack the mobility and dexterity needed for generally useful tasks
  • Mobile ALOHA is a system developed by Zipeng Fu, Tony Z. Zhao, and Chelsea Finn to enable imitation of bimanual mobile manipulation tasks with whole-body control
  • The system incorporates a low-cost whole-body teleoperation interface and adds a mobile base to enhance capabilities
  • Co-training with existing static ALOHA datasets significantly improves performance on mobile manipulation tasks
  • With 50 demonstrations for each task, co-training can increase success rates by up to 90%
  • Mobile ALOHA can autonomously complete complex mobile manipulation tasks such as sautéing, serving, opening cabinets, calling an elevator, and rinsing pans using a kitchen faucet
  • Presents an innovative solution for imitating bimanual mobile manipulation tasks that require whole-body control
  • Opens up possibilities for more advanced robotic applications in various domains

Authors: Zipeng Fu, Tony Z. Zhao, Chelsea Finn

Project website: https://mobile-aloha.github.io (Zipeng Fu and Tony Z. Zhao are project co-leads, Chelsea Finn is the advisor)

Abstract: Imitation learning from human demonstrations has shown impressive performance in robotics. However, most results focus on table-top manipulation, lacking the mobility and dexterity necessary for generally useful tasks. In this work, we develop a system for imitating mobile manipulation tasks that are bimanual and require whole-body control. We first present Mobile ALOHA, a low-cost and whole-body teleoperation system for data collection. It augments the ALOHA system with a mobile base, and a whole-body teleoperation interface. Using data collected with Mobile ALOHA, we then perform supervised behavior cloning and find that co-training with existing static ALOHA datasets boosts performance on mobile manipulation tasks. With 50 demonstrations for each task, co-training can increase success rates by up to 90%, allowing Mobile ALOHA to autonomously complete complex mobile manipulation tasks such as sauteing and serving a piece of shrimp, opening a two-door wall cabinet to store heavy cooking pots, calling and entering an elevator, and lightly rinsing a used pan using a kitchen faucet. Project website: https://mobile-aloha.github.io

Submitted to arXiv on 04 Jan. 2024

Results of the summarizing process for the arXiv paper: 2401.02117v1

The paper's license does not allow us to build upon its content, so this summary was generated from the paper's metadata rather than the full article.

In robotics, imitation learning from human demonstrations has shown impressive performance, but most results focus on table-top manipulation and lack the mobility and dexterity needed for generally useful tasks. To address this limitation, Zipeng Fu, Tony Z. Zhao, and Chelsea Finn developed Mobile ALOHA, a system for imitating bimanual mobile manipulation tasks that require whole-body control. It augments the existing ALOHA system with a mobile base and a low-cost whole-body teleoperation interface, which is used for data collection. Using data collected with this system, the researchers perform supervised behavior cloning and find that co-training with existing static ALOHA datasets boosts performance on mobile manipulation tasks: with 50 demonstrations for each task, co-training can increase success rates by up to 90%. This allows Mobile ALOHA to autonomously complete complex mobile manipulation tasks such as sautéing and serving a piece of shrimp, opening a two-door wall cabinet to store heavy cooking pots, calling and entering an elevator, and lightly rinsing a used pan using a kitchen faucet. Overall, Mobile ALOHA offers a low-cost approach to imitating bimanual mobile manipulation tasks that require whole-body control.
Created on 08 Jan. 2024
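
The co-training idea in the summary above (mixing newly collected mobile demonstrations with existing static ALOHA data during supervised behavior cloning) can be illustrated with a small sketch. This is not the authors' implementation: the function names, the `static_fraction` mixing parameter, the float-valued observations and actions, and the mean-squared-error loss are all assumptions chosen for illustration, since the abstract does not specify the sampling ratio, the policy architecture, or the loss.

```python
import random
from typing import List, Sequence, Tuple

# Hypothetical stand-in for one demonstration step: (observation, action).
# A real system would use camera images and arm/base commands instead of floats.
Step = Tuple[float, float]


def sample_cotraining_batch(
    mobile_data: Sequence[Step],
    static_data: Sequence[Step],
    batch_size: int,
    static_fraction: float,
    rng: random.Random,
) -> List[Tuple[str, Step]]:
    """Draw one training batch that mixes the two datasets.

    Each element is taken from the static dataset with probability
    `static_fraction` (an assumed hyperparameter) and from the mobile
    dataset otherwise. The source tag is kept only for inspection.
    """
    if not 0.0 <= static_fraction <= 1.0:
        raise ValueError("static_fraction must be in [0, 1]")
    batch: List[Tuple[str, Step]] = []
    for _ in range(batch_size):
        if rng.random() < static_fraction:
            batch.append(("static", rng.choice(static_data)))
        else:
            batch.append(("mobile", rng.choice(mobile_data)))
    return batch


def behavior_cloning_loss(predicted: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between predicted and demonstrated actions.

    An illustrative supervised loss; the paper's actual loss may differ.
    """
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(target)


if __name__ == "__main__":
    rng = random.Random(0)
    mobile = [(0.1, 1.0), (0.2, 2.0)]   # toy mobile demonstrations
    static = [(0.3, 3.0), (0.4, 4.0)]   # toy static demonstrations
    batch = sample_cotraining_batch(mobile, static, 8, 0.5, rng)
    print([source for source, _ in batch])
```

Setting `static_fraction` to 0.0 in this sketch corresponds to training on mobile data alone, which is the baseline that co-training is compared against in the summary.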
