• CSCD核心中文核心科技核心
  • RCCSE(A+)公路运输高质量期刊T1
  • Ei CompendexScopusWJCI
  • EBSCOPж(AJ)JST
二维码

隧道建设(中英文) ›› 2026, Vol. 46 ›› Issue (3): 478-490.DOI: 10.3973/j.issn.2096-4498.2026.03.003

• 研究与探索 • 上一篇    下一篇

基于实例分割的盾构渣石几何特征识别

刘泓志1, 2, 3, 郝淑宁1, 2, 郭易东1, 2, 李兴高1, 2, *   

  1. (1. 北京交通大学 城市地下工程教育部重点实验室, 北京 100044; 2. 北京交通大学土木建筑工程学院, 北京 100044; 3. 中交隧道工程局有限公司, 北京 100088)
  • 出版日期:2026-03-20 发布日期:2026-03-20
  • 作者简介:刘泓志(1989—),男,辽宁庄河人,2016年毕业于吉林大学,水利工程专业,硕士,工程师,现从事盾构施工技术管理与研究工作。E-mail:liuhz1316@qq.com。*通信作者:李兴高,E-mail: lxg_njtu@163.com。

Rock Chip Geometric Feature Recognition in Shield Tunneling Based on Instance Segmentation

LIU Hongzhi1, 2, 3, HAO Shuning1, 2, GUO Yidong1, 2, LI Xinggao1, 2, *   

  1. (1. Key Laboratory of Urban Underground Engineering, the Ministry of Education, Beijing Jiaotong University, Beijing 100044, China; 2. School of Civil Engineering, Beijing Jiaotong University, Beijing 100044, China; 3. CCCC Tunnel Engineering Bureau Co., Ltd., Beijing 100088, China)
  • Online:2026-03-20 Published:2026-03-20

摘要: 针对盾构施工中渣石图像存在颗粒密集堆叠、尺寸差异大、边缘遮挡严重,导致语义分割方法在个体分割精度上受限的问题,提出一种基于改进Mask R-CNN的渣石几何特征识别模型。首先,构建包含实验室分散/堆叠场景与北京市南水北调配套工程团城湖至第九水厂(二期)工程不同地层场景的渣石图像数据集,采用数据增强扩充样本; 然后,通过对比试验确定以Mask R-CNN为基准模型,并引入高效多尺度注意力机制(EMA)增强骨干网络对多尺度特征的提取能力,同时采用Confluence-NMS算法替代传统非极大值抑制,降低密集场景下的误检与漏检; 最后,利用工程数据集进行消融试验。结果表明: 1)引入EMA与Confluence-NMS后,模型AP0.50、AP0.75、AP0.50∶0.95分别达到82.70%、73.69%、59.13%,较原始模型分别提升4.93%、3.19%和2.60%,F1-score提升3.63%,显著性检验证实改进效果显著; 2)基于分割掩膜,结合旋转卡壳算法与参照物换算,自动提取渣石的长短轴、面积、周长等几何特征; 3)在工程测试集上,模型预测的粒径分布与人工标注结果高度一致,长轴尺寸集中分布在25~75 mm,短轴尺寸集中分布在15~45 mm,各区间占比最大误差小于7.53%,仅在小粒径区间(<25 mm)因密集遮挡导致漏检而使误差略高; 4)泛化性能验证中,模型对青岛某隧道渣石图像的AP0.50∶0.95达57.62%,表明其具备跨工程适用性。

关键词: 盾构隧道, 渣石形态分布, 深度学习, 实例分割, Mask R-CNN

Abstract: Semantic segmentation methods exhibit limited accuracy in the individual segmentation of rock chips due to dense stacking, large size variations, and severe edge occlusion in rock chip images collected during shield tunneling. To address these challenges, a rock chip geometric feature recognition model based on an improved mask region-based convolutional neural network (Mask R-CNN) is proposed. First, a rock chip image dataset is constructed, including laboratory (dispersed and stacked) and field scenarios from different strata of a tunneling project in Beijing, and data augmentation is applied to expand the sample size. Through comparative experiments, Mask R-CNN is selected as the baseline model. An efficient multiscale attention (EMA) mechanism is introduced to enhance the backbone network′s ability to extract multiscale features. Additionally, the Confluence-non-maximum suppression (Confluence-NMS) algorithm replaces the traditional non-maximum suppression method to reduce false detections and missed detections in dense scenarios. Ablation experiments conducted on the engineering dataset show that (1) after introducing EMA and Confluence-NMS, the model achieves AP0.50, AP0.75, and AP0.50∶0.95 values of 82.70%, 73.69%, and 59.13%, respectively, representing improvements of 4.93%, 3.19%, and 2.60% over the baseline model, while the F1-score increases by 3.63%; (2) significance tests confirm the statistical effectiveness of the proposed improvements. Based on the segmentation masks, the rotating calipers algorithm, combined with a reference-object conversion method, is used to automatically extract geometric features of rock chips, including the major and minor axis lengths, area, and perimeter. On the engineering test set, the particle size distribution predicted by the model is highly consistent with that obtained from manual annotations: the major axis lengths are primarily concentrated in the 25-75 mm range, while the minor axis lengths are mainly distributed within the 15-45 mm range. The maximum error in the proportion of each interval is less than 7.53%, with slightly higher errors occurring only in the small-size interval (<25 mm) due to missed detections caused by dense occlusion. In the evaluation of generalization performance, the model achieves an AP0.50∶0.95 of 57.62% on rock chip images from a tunnel project in Qingdao, demonstrating its cross-project applicability.

Key words: shield tunnel, rock chip morphology distribution, deep learning, instance segmentation, mask region-based convolutional neural network