融合几何注意力与边缘卷积的零件点云分割与高精度定位

牟星达; 张龙; 陈子豪; 缪剑晖; 洪军

doi:10.12415/j.issn.1671-7872.25126

融合几何注意力与边缘卷积的零件点云分割与高精度定位

Part Point Cloud Segmentation and High-Precision Localization via Geometric Attention and Edge Convolution

摘要

摘要: 针对工业场景下零件点云存在的噪声干扰、遮挡缺失以及前景与背景类别极度不平衡等问题，这些因素常导致语义分割边界模糊、小目标漏检及三维定位精度下降。为此，本文提出一种融合几何注意力与边缘卷积的增强型PointNet++网络。首先，在编码阶段引入局部几何精炼模块EdgeConv，显式建模邻域拓扑关系，以增强边缘细节的表征能力；同时，设计一维多尺度高效注意力模块EMA_1D，通过动态特征加权抑制背景噪声响应，突出关键结构特征；此外，在优化目标方面，采用Lovász-Softmax与负对数似然联合的混合损失函数，直接优化交并比指标，有效缓解类别不平衡带来的模型偏差。实验结果表明，所提方法在工业零件分割任务上达到93.13%的平均交并比(mIoU)，同时保持了较高的整体准确率与类别准确率。进一步，在端到端定位流程中，结合分割结果、配准与姿态规范化策略，实现了毫米级三维定位精度，平均定位误差为1.24 mm，验证了该方法在复杂工业点云条件下进行高精度三维定位的有效性与实用潜力。本研究为解决强干扰、弱特征场景下的三维感知难题提供了有效技术支撑，对提升工业机器人自主作业能力具有重要意义。

Abstract: To address issues such as noise interference, occlusion-induced incompleteness, and extreme foreground-background class imbalance in industrial part point clouds—factors that often lead to blurred semantic segmentation boundaries, missed detection of small objects, and reduced 3D localization accuracy—an enhanced PointNet++ network integrating geometric attention and edge convolution was proposed. First, a local geometric refinement module, EdgeConv, was introduced in the encoding stage to explicitly model neighborhood topological relationships and enhance the representation of edge details. Meanwhile, a one-dimensional multi-scale efficient attention module, EMA_1D, was designed to dynamically weight features, suppressing background noise responses and highlighting key structural characteristics. Furthermore, regarding the optimization objective, a hybrid loss function combining Lovász-Softmax and negative log-likelihood was adopted to directly optimize the intersection over union metric, effectively alleviating model bias caused by class imbalance. Experimental results show that a mean intersection over union (mIoU) of 93.13% is achieved by the proposed method on the industrial part segmentation task, while high overall accuracy and per-class accuracy are maintained. Furthermore, in the end-to-end localization pipeline, millimeter-level 3D localization accuracy is achieved by integrating segmentation results, registration, and pose normalization strategies, with an average localization error of 1.24 mm, and the effectiveness and practical potential of the proposed method for high-precision 3D localization under complex industrial point cloud conditions are validated. In this study, effective technical support for solving the 3D perception challenges in scenarios with strong interference and weak features is provided, which is of great significance for enhancing the autonomous operation capability of industrial robots.

HTML全文

参考文献(27)

施引文献

资源附件(0)