基于多尺度特征感知与模糊边界建模的息肉分割网络

王孙宾; 石刘伟; 黄俊

doi:10.12415/j.issn.1671-7872.25009

基于多尺度特征感知与模糊边界建模的息肉分割网络

Polyp Segmentation Network Based on Multi-Scale Feature Perception and Fuzzy Boundary Modeling

摘要

摘要: 内窥镜图像分割技术作为常规临床诊断手段，其分割精度直接影响医生对病变区域的诊断和治疗决策。针对现有方法在图像质量不佳、病变区域边界模糊等复杂场景下的局限性，提出一种融合多尺度特征感知与模糊边界建模的息肉分割网络。首先，通过离散小波变换将图像分解为不同尺度和频率的子带，分别提取图像全局结构和局部细节特征，并结合自适应注意力机制动态调整各子带特征权重，实现多尺度特征感知；其次采用变分多采样模块将特征映射至潜在空间进行概率分布建模，通过多次重参数化采样生成多样化潜在空间表示，有效平滑模糊区域并提高边界分割准确性。在CVC−300，CVC−ClinicDB，Kvasir−SEG，CVC−ColonDB，ETIS−LaribPolyDB等5个公开数据集和USTCAI非公开数据集上进行实验，验证本文方法的性能。结果表明：本文方法在Dice系数和mIoU指标上均优于现有方法，特别是在ETIS−LaribPolyDB数据集上以57.54%的Dice系数超越现有最优方法7.1%，在CVC−ClinicDB数据集上更是达到91.88%的Dice系数，展现出优异的复杂场景分割性能和泛化能力。本文方法通过结合多尺度特征感知与模糊边界建模技术，有效解决了内窥镜图像分割中的关键难题，为临床诊断提供了更精准可靠的技术支持。

Abstract: Endoscopic image segmentation technology as a routine clinical diagnostic method, whose segmentation accuracy directly affects physicians’ diagnosis and treatment decisions of lesion areas. In view of the limitations of existing methods in challenging scenarios such as poor image quality and blurred lesion area boundaries, a polyp segmentation network integrating multi-scale feature perception and fuzzy boundary modeling was proposed. Firstly, the image was decomposed into sub-bands of different scales and frequencies through discrete wavelet transform to extract global structural and local detailed features, while an adaptive attention mechanism was employed to dynamically adjust the weights of each sub-band feature, achieving multi-scale feature perception. Secondly, a variational multi-sampling module was utilized to map features into latent space for probability distribution modeling, where diversified latent space representations were generated through multiple reparameterized samplings, effectively smoothing blurred regions and improving boundary segmentation accuracy. Experiments were conducted on five public datasets (CVC−300, CVC−ClinicDB, Kvasir−SEG, CVC−ColonDB, ETIS−LaribPolyDB) and the non-public USTCAI dataset to validate the performance of the proposed method. The results demonstrate that the proposed method outperforms existing methods in both Dice coefficient and mIoU metrics. Particularly on the ETIS−LaribPolyDB dataset, a Dice coefficient of 57.54% is achieved, surpassing the state-of-the-art method by 7.1%, while on the CVC−ClinicDB dataset, an outstanding Dice coefficient of 91.88% is attained, exhibiting excellent segmentation performance and generalization capability in complex scenarios.By combining multi-scale feature perception with fuzzy boundary modeling techniques, the proposed method effectively addresses key challenges in endoscopic image segmentation, providing more accurate and reliable technical support for clinical diagnosis.

HTML全文

参考文献(25)

施引文献

资源附件(0)