基于改进U−Net网络的路面裂缝检测方法

项先隆; 刘磊

doi:10.12415/j.issn.1671-7872.25069

基于改进U−Net网络的路面裂缝检测方法

项先隆,
刘磊

Pavement Crack Detection Method Based on Improved U−Net Network

摘要

摘要: 针对路面裂缝检测任务中精度与计算效率难以兼顾的的问题，提出一种基于改进U−Net的轻量化网络MSAL−Unet。首先构建轻量化编码器并集成卷积块注意力模块(CBAM)，以动态增强裂缝关键区域的特征响应，提升模型对低对比度目标的感知能力；其次设计多尺度空洞卷积模块(MSDC)，在不降低空间分辨率的前提下扩展感受野，有效捕获从细小裂缝到宽域上下文的多尺度信息，显著抑制误检现象；进一步引入特征金字塔网络(FPN)，实现跨层级的语义对齐与多尺度特征融合，提升裂缝定位的连贯性与边界平滑性。在公开数据集CRACK500上开展对比与消融实验，验证本文所提方法的性能。结果表明：本文方法在保持34.45 frames/s实时推理速度的同时，平均交并比(mIoU)达0.832 0，精确率提升至0.918 0(为所有对比模型最高)，F1−score达0.906 3，综合性能显著优于BiSeNet、Fast−SCNN、SegFormer和TransUNet等主流轻量化网络；尤其在浅裂缝与复杂裂缝场景下，MSAL−Unet的分割结果在完整性和边界准确性方面均表现最优。本研究通过协同优化多尺度特征融合、注意力机制与轻量化结构设计，有效解决了路面裂缝检测中精度与效率的平衡难题，为实际道路智能巡检系统提供了高可靠性与低计算开销的技术支撑。

Abstract: A lightweight network named MSAL−Unet, based on an improved U−Net, was proposed to address the challenge of balancing accuracy and computational efficiency in pavement crack detection. First, a lightweight encoder was constructed and integrated with the convolutional block attention module (CBAM) to dynamically enhance feature responses in key crack regions, thereby improving the model’s perception capability for low-contrast targets. Second, a Multi-Scale Dilated Convolution (MSDC) module was designed to expand the receptive field without reducing spatial resolution, effectively capturing multi-scale information from fine cracks to broad contextual features and significantly suppressing false positives. Furthermore, a feature pyramid network (FPN) was introduced to achieve cross-level semantic alignment and multi-scale feature fusion, enhancing the coherence of crack localization and boundary smoothness. Comparative and ablation experiments were conducted on the public dataset CRACK500 to validate the performance of the proposed method. The results demonstrate that the proposed method achieves a mean intersection over union (mIoU) of 0.8320 and a precision of 0.9180 (the highest among all compared models), with an F1−score of 0.9063, while maintaining a real-time inference speed of 34.45 frames per second. The comprehensive performance is significantly superior to mainstream lightweight networks such as BiSeNet, Fast−SCNN, SegFormer, and TransUNet. Particularly in scenarios involving faint and complex cracks, the segmentation results of MSAL−Unet exhibit optimal performance in both completeness and boundary accuracy. By synergistically optimizing multi-scale feature fusion, attention mechanisms, and lightweight structural design, this study effectively addresses the challenging balance between accuracy and efficiency in pavement crack detection, thereby providing highly reliable technical support with low computational overhead for practical road intelligent inspection systems.

HTML全文

参考文献(27)

施引文献

资源附件(0)