看不见的对象6D姿势估计：基准和基线

论文标题

看不见的对象6D姿势估计：基准和基线

Unseen Object 6D Pose Estimation: A Benchmark and Baselines

论文作者

Gou, Minghao, Pan, Haolin, Fang, Hao-Shu, Liu, Ziyuan, Lu, Cewu, Tan, Ping

论文摘要

估计看不见的对象的6D姿势对许多现实世界的应用非常有需求。但是，当前的最新姿势估计方法只能处理以前训练的对象。在本文中，我们提出了一项新任务，以使算法能够估计测试过程中新颖对象的6D姿势估计。我们收集一个具有真实图像和合成图像的数据集，并且在测试集中最多可见48个看不见的对象。同时，我们提出了一个名为invimum add（IADD）的新公制，这是对具有不同类型姿势歧义的对象的不变测量。还提供了针对此任务的两个阶段基线解决方案。通过训练端到端的3D对应网络，我们的方法可以准确有效地找到看不见的对象和部分视图RGBD图像之间的相应点。然后，它使用算法鲁棒从对应关系中计算出6D姿势。广泛的实验表明，我们的方法的表现优于几个直观基线，从而验证其有效性。所有数据，代码和模型都将公开可用。项目页面：www.graspnet.net/unseen6d

Estimating the 6D pose for unseen objects is in great demand for many real-world applications. However, current state-of-the-art pose estimation methods can only handle objects that are previously trained. In this paper, we propose a new task that enables and facilitates algorithms to estimate the 6D pose estimation of novel objects during testing. We collect a dataset with both real and synthetic images and up to 48 unseen objects in the test set. In the mean while, we propose a new metric named Infimum ADD (IADD) which is an invariant measurement for objects with different types of pose ambiguity. A two-stage baseline solution for this task is also provided. By training an end-to-end 3D correspondences network, our method finds corresponding points between an unseen object and a partial view RGBD image accurately and efficiently. It then calculates the 6D pose from the correspondences using an algorithm robust to object symmetry. Extensive experiments show that our method outperforms several intuitive baselines and thus verify its effectiveness. All the data, code and models will be made publicly available. Project page: www.graspnet.net/unseen6d

下载PDF全文

下载文献需遵守相关版权规定

论文标题