COCOI: Contact-aware Online Context Inference for Generalizable Non-planar Pushing

Abstract

General contact-rich manipulation problems are long-standing challenges in robotics due to the difficulty of understanding complicated contact physics. Deep reinforcement learning (RL) has shown great potential in solving robot manipulation tasks. However, existing RL policies have limited adaptability to environments with diverse dynamics properties, and such adaptability is pivotal in solving many contact-rich manipulation tasks. In this work, we propose Contact-aware Online COntext Inference (COCOI), a deep RL method that encodes a context embedding of dynamics properties online using contact-rich interactions. We sample sensor data using a novel contact-aware strategy and formulate an interpretable dynamics transition module. We study this method on a novel and challenging non-planar pushing task, where the robot uses a monocular camera image and a wrist force-torque sensor reading to push an object to a goal location while keeping it upright. We run extensive experiments to demonstrate the capability of COCOI in a wide range of settings and dynamics properties in simulation, and also in a sim-to-real transfer scenario on a real robot.


Citation

For attribution in academic contexts, please cite this work as:

Xu et al., "COCOI: Contact-aware Online Context Inference for Generalizable Non-planar Pushing", 2021.

BibTeX citation

@misc{xu2020cocoi,
      title={COCOI: Contact-aware Online Context Inference for Generalizable Non-planar Pushing}, 
      author={Zhuo Xu and Wenhao Yu and Alexander Herzog and Wenlong Lu and Chuyuan Fu and Masayoshi Tomizuka and Yunfei Bai and C. Karen Liu and Daniel Ho},
      year={2020},
      eprint={2011.11270},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}