ISSN 2096-4498

   CN 44-1745/U

二维码

Tunnel Construction ›› 2024, Vol. 44 ›› Issue (5): 1056-1067.DOI: 10.3973/j.issn.2096-4498.2024.05.013

Previous Articles     Next Articles

Ensemble Convolutional Neural Networks and Visual Transformers: Research on Tunnel Face Rock Identification

XIANG Lulu1, 2, TONG Jianjun1, 2, *, WANG Mingnian1, 2, MIAO Xingwang1, 2, YE Pei1, 2   

  1. (1. Key Laboratory of Transportation Tunnel Engineering, the Ministry of Education, Southwest  Jiaotong University, Chengdu 610031, Sichuan, China; 2. School of Civil Engineering, Southwest Jiaotong University, Chengdu 610031, Sichuan, China)
  • Online:2024-05-20 Published:2024-06-22

Abstract: The application of an increasing number of new technologies in tunnel construction is a consequence of the rapid development of intelligent tunnel construction. Among these, the use of machine vision instead of human eyes for geological information recognition in tunnels has become increasingly widespread. The authors build on this trend by using machine vision to investigate intelligent classification methods for the lithology of the tunnel face. First, four types of lithology (limestone, mudstone, sandstone, and basalt) are screened and counted by collecting highdefinition tunnel face photos, geological sketches, and engineering geological descriptions along the highspeed railway. On this basis, image enhancement techniques are used to expand the number of samples and construct a lithology sample set. Then, based on the above sample sets, the ResNet50V2 lithology classification transfer learning model and the VIT lithology classification model are constructed. The lithology classification effects of the two models are compared, and the Stacking method is used to integrate the classification characteristics of the two models. The authors select the most suitable metalearner by comparing the integrated fusion effects of three metalearners, logistic regression(LR), support vector machine, and decision tree, on the two models. The results reveal that the Stacking model constructed by LR integrated with ResNet50V2 and VIT has the best classification effect on lithology. Apart from having an accuracy rate of 93.8%, it can also fully integrate the overall and local features of the lithology of the tunnel face for better classification.

Key words:

tunnel, lithology of tunnel face, convolutional neural network, visual Transformer, ensemble learning, Stacking method