用于解析桥梁元素和分割桥梁检查图像中的多任务深度学习模型

论文标题

用于解析桥梁元素和分割桥梁检查图像中的多任务深度学习模型

A Multitask Deep Learning Model for Parsing Bridge Elements and Segmenting Defect in Bridge Inspection Images

论文作者

Zhang, Chenyu, Karim, Muhammad Monjurul, Qin, Ruwen

论文摘要

美国庞大的桥梁网络提出了对维护和康复的高度要求。在某种程度上，手动视觉检查评估桥梁条件的大量费用是一个负担。先进的机器人已被利用以自动化检查数据收集。自动化大量检查图像数据中多类元素和表面缺陷的分割将有助于对桥梁条件进行有效评估。培训单独的单任务网络以进行元素解析（即多类元素的语义分割）和缺陷分段无法整合这两个任务之间的密切连接。检查图像中都存在可识别的结构元素和明显的表面缺陷。本文的动机是开发多任务深度学习模型，该模型充分利用了桥梁元素和缺陷之间的这种相互依赖性，以提高模型的任务性能和概括。此外，该研究还研究了提出的模型设计在改善任务性能的有效性，包括特征分解，串扰共享和多目标损失函数。开发了具有桥梁元素和腐蚀的像素级标签的数据集用于模型训练和测试。评估开发的多任务深模型的定量和定性结果不仅在基于单任务的模型（在桥梁解析上高2.59％，在腐蚀细分方面提高1.65％）的优势，而且在计算时间和实现能力中也是如此。

The vast network of bridges in the United States raises a high requirement for maintenance and rehabilitation. The massive cost of manual visual inspection to assess bridge conditions is a burden to some extent. Advanced robots have been leveraged to automate inspection data collection. Automating the segmentations of multiclass elements and surface defects on the elements in the large volume of inspection image data would facilitate an efficient and effective assessment of the bridge condition. Training separate single-task networks for element parsing (i.e., semantic segmentation of multiclass elements) and defect segmentation fails to incorporate the close connection between these two tasks. Both recognizable structural elements and apparent surface defects are present in the inspection images. This paper is motivated to develop a multitask deep learning model that fully utilizes such interdependence between bridge elements and defects to boost the model's task performance and generalization. Furthermore, the study investigated the effectiveness of the proposed model designs for improving task performance, including feature decomposition, cross-talk sharing, and multi-objective loss function. A dataset with pixel-level labels of bridge elements and corrosion was developed for model training and testing. Quantitative and qualitative results from evaluating the developed multitask deep model demonstrate its advantages over the single-task-based model not only in performance (2.59% higher mIoU on bridge parsing and 1.65% on corrosion segmentation) but also in computational time and implementation capability.

下载PDF全文

下载文献需遵守相关版权规定

论文标题