论文标题
用LiDAR线索无监督的对象检测
Unsupervised Object Detection with LiDAR Clues
论文作者
论文摘要
尽管据我们所知,但无监督的对象检测很重要,但以前没有解决此问题的工作。社区广为人知的一个主要问题是,仅从2D图像外观中得出的对象边界是模棱两可和不可靠的。为了解决这个问题,我们利用激光雷达线索来帮助无监督的对象检测。通过利用3D场景结构,可以大大减轻本地化问题。我们进一步确定了社区很少注意到的另一个主要问题,应容纳长尾和开放式(子)类别的分布。在本文中,我们提出了第一种实用方法,可借助LiDAR线索进行无监督的对象检测。在我们的方法中,首先生成基于3D点云的候选对象段。然后,进行了一个迭代段标记过程,以分配段标签并训练段标记网络,该网络基于2D图像和3D点云的功能。标签过程经过精心设计,以减轻长尾和开放式分布的问题。最终部分标签设置为用于对象检测网络训练的伪注释。大规模Waymo开放数据集的广泛实验表明,与LIDAR可见范围内的强制监督相比,派生的无监督对象检测方法具有合理的精度。代码应发布。
Despite the importance of unsupervised object detection, to the best of our knowledge, there is no previous work addressing this problem. One main issue, widely known to the community, is that object boundaries derived only from 2D image appearance are ambiguous and unreliable. To address this, we exploit LiDAR clues to aid unsupervised object detection. By exploiting the 3D scene structure, the issue of localization can be considerably mitigated. We further identify another major issue, seldom noticed by the community, that the long-tailed and open-ended (sub-)category distribution should be accommodated. In this paper, we present the first practical method for unsupervised object detection with the aid of LiDAR clues. In our approach, candidate object segments based on 3D point clouds are firstly generated. Then, an iterative segment labeling process is conducted to assign segment labels and to train a segment labeling network, which is based on features from both 2D images and 3D point clouds. The labeling process is carefully designed so as to mitigate the issue of long-tailed and open-ended distribution. The final segment labels are set as pseudo annotations for object detection network training. Extensive experiments on the large-scale Waymo Open dataset suggest that the derived unsupervised object detection method achieves reasonable accuracy compared with that of strong supervision within the LiDAR visible range. Code shall be released.