CN118898718B - A semantic segmentation method with enhanced boundary perception - Google Patents

A semantic segmentation method with enhanced boundary perception

Info

Publication number
CN118898718B
Authority
CN
China
Prior art keywords
feature
module
convolution
representing
feature map
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202411004607.7A
Other languages
Chinese (zh)
Other versions
CN118898718A (en)
Inventor
焦文华
田玉宇
周旭
蔡晓异
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Mining and Technology Beijing CUMTB
Original Assignee
China University of Mining and Technology Beijing CUMTB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Mining and Technology Beijing CUMTB
Priority to CN202411004607.7A
Publication of CN118898718A
Application granted
Publication of CN118898718B
Legal status: Active (current)
Anticipated expiration

Abstract

Translated from Chinese

The present invention discloses a semantic segmentation method with enhanced boundary perception, belonging to the technical field of semantic segmentation. The method mainly comprises an encoding path and a decoding path. The encoding path is composed of five encoding modules: each encoder encodes the multi-level semantic information of the target area in the whole image, convolution operations of different scales within each encoding module obtain multi-scale information of the target area, and pooling operations effectively aggregate contextual semantics. The decoding path is mainly composed of four modules: each decoding module, under the guidance of the attention embedding module AEM, aggregates and refines the information flow of the different branches, the graph convolution module captures the feature information of large-scale irregular areas, and the attention embedding module generates complementary spatial details to better model the encoded features. By adopting the above semantic segmentation method with enhanced boundary perception, the present invention achieves a more accurate segmentation effect.

Description

Semantic segmentation method for enhancing boundary perception
Technical Field
The invention relates to the technical field of semantic segmentation, in particular to a semantic segmentation method for enhancing boundary perception.
Background
Semantic segmentation in computer vision aims to correctly classify the targets present in an image and segment them to obtain accurate outline shapes. It is a classical task in the field of computer vision, has important application value in fields such as the intelligent information industry, industrial intelligence and autonomous driving, and is an important prerequisite for subsequent vision tasks. With the rapid development of deep learning technology, the semantic segmentation task has progressively broken through into new fields. In reality, however, acquired images generally suffer from complex structure, insufficient target saliency and low image quality, so extracting rich and accurate target features from the whole image is crucial.
With the development of image processing technology, image analysis has gone through three development stages: image processing techniques, machine learning and deep learning. Methods based on image processing techniques typically include three steps: image preprocessing, feature extraction and target region extraction. Image preprocessing includes graying, filtering, binarization and the like, and aims to remove noise in the image and enhance image contrast, so that the segmented target area can then be obtained with methods such as threshold analysis, edge detection and Hough transformation. Machine-learning-based methods, on the other hand, achieve target segmentation by training a classifier. These methods typically include three steps: collecting a dataset, extracting features and training the classifier. Common classifiers are the support vector machine (Support Vector Machine, SVM) and the convolutional neural network (Convolutional Neural Network, CNN), and common feature extraction methods are image segmentation and feature selection. However, given the inherent difficulty of pixel-level image labeling, and the complex semantic structure and class imbalance exhibited by most images, detailed analysis of images by conventional data-driven deep learning methods remains a challenge.
Most existing deep-learning-based semantic segmentation methods are suited to natural images, but for images with low illumination and weak contrast they cannot accurately separate the boundaries between targets and background, resulting in poor segmentation accuracy. Furthermore, the underlying CNN can only operate within fixed-size square regions of structured features, whereas most target areas exhibit irregular, unstructured shapes. Effectively extracting foreground objects from such images therefore remains a difficult task.
Disclosure of Invention
The invention aims to provide a semantic segmentation method for enhancing boundary perception, which has a more accurate segmentation effect.
In order to achieve the above object, the present invention provides a semantic segmentation method for enhancing boundary perception, including an encoding path and a decoding path;
Wherein the encoding path comprises the sub-steps of:
S1, extracting features from the whole image with an initial feature extraction module to obtain a convolution feature map;
S2, sequentially inputting the convolution feature map obtained in step S1 into four coding modules to obtain the module feature maps of the four coding modules;
S3, sequentially embedding the convolution feature map obtained in step S1 into the four module feature maps to obtain four module total feature maps;
The decoding path comprises the following sub-steps:
S4, inputting the total feature map of the 4th module output in step S3 into the 3rd decoding module and obtaining a refined extraction feature map through convolution, wherein the pixel points of the refined extraction feature map form the nodes of a topology graph and the connections between them form an adjacency matrix;
S5, reconstructing and refining the feature maps generated by the attention embedding module AEM into a group of dimension-reduction feature maps and a group of projection matrices, so as to meet the input requirements of the multi-scale graph reasoning module MsGRM and the requirements of feature aggregation over the graph nodes;
S6, using the topology graph and adjacency matrix obtained in step S4 and the group of dimension-reduction feature maps and group of projection matrices obtained in step S5 as inputs, and reconstructing the feature maps with the graph convolution and attention embedding module AEM;
S7, the reconstructed feature maps obtained in step S6 are transmitted in turn into decoding module 2, decoding module 1 and decoding module 0, the other decoding modules working in the same way.
The invention comprises a 3rd decoding module, a 2nd decoding module, a 1st decoding module and a 0th decoding module; in each decoding module, a reconstructed feature map is obtained by the method of step S6 and transmitted in a densely connected manner.
Preferably, the coding path consists of an initial feature extraction module and four identical coding modules;
The initial feature extraction module comprises a convolution module with a convolution kernel of 7×7 and two convolution modules with a convolution kernel of 3×3.
Preferably, in step S1, the initial feature extraction module extracts features in the whole image to obtain a convolution feature map, and the calculation process is shown in formula (1):
Wherein f0 represents the features after the first layer of the initial feature extraction module, f1 represents the features after the second and third layers, and f′0 represents the features after the fourth layer; σ(·) represents the activation function and σs(·) the ReLU activation function; BN(·) represents the batch normalization layer; Conv3×3(·) and Conv7×7(·) represent 3×3 and 7×7 convolution operations, respectively; the addition operator represents element-wise matrix addition; and MaxPool3×3(·) represents a 3×3 max pooling operation.
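Formula (1) itself is not reproduced here; purely as an illustration, a minimal PyTorch-style sketch of such a stem (one 7×7 convolution, two 3×3 convolutions, element-wise addition and a 3×3 max pooling) is given below, with all layer names, channel counts and strides being assumptions rather than values taken from the patent:

```python
import torch
import torch.nn as nn

class InitialFeatureExtraction(nn.Module):
    """Illustrative stem: one 7x7 conv, two 3x3 convs, element-wise addition, 3x3 max pooling."""
    def __init__(self, in_ch=3, out_ch=64):
        super().__init__()
        self.conv7 = nn.Sequential(nn.Conv2d(in_ch, out_ch, 7, stride=2, padding=3),
                                   nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True))
        self.conv3a = nn.Sequential(nn.Conv2d(out_ch, out_ch, 3, padding=1),
                                    nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True))
        self.conv3b = nn.Sequential(nn.Conv2d(out_ch, out_ch, 3, padding=1),
                                    nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True))
        self.pool = nn.MaxPool2d(3, stride=2, padding=1)

    def forward(self, x):
        f0 = self.conv7(x)                  # features after the first (7x7) layer
        f1 = self.conv3b(self.conv3a(f0))   # features after the second and third (3x3) layers
        f0p = self.pool(f0 + f1)            # fourth layer: element-wise addition, then 3x3 max pooling
        return f0, f0p
```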
Preferably, the specific operation of step S2 is as follows:
Inputting the convolution feature map obtained in step S1 into the first coding module; each coding module then inputs the feature map output by the previous layer into a group of consecutive 3×3 convolutions to obtain a first-scale feature map of the image;
The feature map output by the previous layer is input in parallel into a 1×1 convolution to obtain a feature map of a second scale, and the feature maps of the two different scales are then fused. The fusion method is as follows:
wherein ffus represents the fused feature map; UP2(·) represents a bilinear interpolation up-sampling operation; FC(·) represents the fully connected layer; a 1×1 convolution is applied to fv, the feature map from the previous layer; σr(·) represents the Leaky ReLU activation function; a further 1×1 convolution performs dimension reduction; θρ represents the attention coefficient; H′ represents the reconstructed height, W′ the reconstructed width, H the original height, Hρ the squeezed height dimension and Wρ the squeezed width dimension; v represents the scale set, l the low scale and h the high scale;
After the fused feature map ffus passes through an activation layer, the information flow is transmitted to the next layer through a skip connection in residual form, as calculated in formula (3):
Wherein f′fus represents the squeeze-fusion feature map obtained by a convolution operation on ffus, fs represents the module feature map obtained by combining f′fus and ffus in a residual structure, and δ(·) represents two consecutive 3×3 convolution operations.
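As an illustrative sketch only, an encoding module with the two parallel branches and the residual skip described above might be written as follows in PyTorch; the fusion of formula (2) is replaced by a simple addition here, and all channel counts are assumed:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class EncodingModule(nn.Module):
    """Illustrative encoding module: a 3x3-conv branch and a parallel 1x1-conv branch,
    fused and passed on through a residual skip connection (wiring assumed)."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.branch3x3 = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True))
        self.branch1x1 = nn.Conv2d(in_ch, out_ch, 1)
        self.squeeze = nn.Sequential(
            nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch))

    def forward(self, fv):
        a = self.branch3x3(fv)                       # first-scale (local) features
        b = self.branch1x1(fv)                       # second-scale (detail-preserving) features
        f_fus = F.relu(a + b)                        # simple fusion stand-in for formula (2)
        return F.relu(self.squeeze(f_fus) + f_fus)   # residual skip connection to the next layer
```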
Preferably, the specific operation of step S3 is as follows:
Embedding f0 extracted by the initial feature extraction module in step S1 into each coding module, the module total feature maps are finally generated in turn through the four coding modules. The embedding process of f0 is shown in formula (4):
Wherein the left-hand term represents the module total feature map transformed by the feature transformation module ConvTM, Cat(·) represents feature concatenation, and s indexes the current coding module.
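For illustration, the embedding of f0 into a coding module by concatenation (Cat) followed by a ConvTM-style transform could be sketched as below; the resizing step and the channel counts are assumptions, not values taken from formula (4):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def embed_initial_features(fs, f0, conv_tm):
    """Illustrative stand-in for formula (4): resize the stem features f0 to the spatial size of
    the module feature map fs, concatenate them, and let a small ConvTM-style convolution
    produce the module total feature map (all details assumed)."""
    f0_resized = F.interpolate(f0, size=fs.shape[-2:], mode="bilinear", align_corners=False)
    return conv_tm(torch.cat([fs, f0_resized], dim=1))  # Cat(.) followed by a feature transform

# usage sketch with placeholder channel counts
conv_tm = nn.Conv2d(64 + 128, 128, kernel_size=1)
fs = torch.randn(1, 128, 64, 64)
f0 = torch.randn(1, 64, 256, 256)
f_total = embed_initial_features(fs, f0, conv_tm)
```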
Preferably, the decoding path is composed of four decoding modules with the same structure, and each decoding module comprises a feature transformation module ConvTM composed of convolution, an attention embedding module AEM and a multi-scale graph reasoning module MsGRM;
The feature transformation module ConvTM transforms and up- or down-samples the module total feature maps of different scales output by the corresponding encoding modules, ensuring that the feature maps input to the decoder are of consistent size; the transformation is shown in formula (5):
Wherein the four transformed terms represent, respectively, the first-scale feature map after down-sampling, the second-scale feature map, the third-scale feature map after down-sampling, and the fourth-scale feature map after convolution, activation and up-sampling; DownS(·) represents a down-sampling operation and UP(·) represents a nearest-neighbor interpolation up-sampling operation.
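A minimal sketch of such size alignment is given below for illustration; the choice of adaptive max pooling for DownS(·) is an assumption, while nearest-neighbor interpolation for UP(·) follows the text:

```python
import torch
import torch.nn.functional as F

def conv_tm_align(feature_maps, target_hw):
    """Illustrative ConvTM-style size alignment (assumed): bring encoder outputs of different
    scales to a common spatial size so a decoding module can consume them together."""
    aligned = []
    for f in feature_maps:
        if f.shape[-2:] == tuple(target_hw):
            aligned.append(f)                                          # already the target size
        elif f.shape[-2] > target_hw[0]:
            aligned.append(F.adaptive_max_pool2d(f, target_hw))        # DownS(.): down-sample larger maps
        else:
            aligned.append(F.interpolate(f, size=target_hw, mode="nearest"))  # UP(.): nearest-neighbor up-sampling
    return aligned

maps = [torch.randn(1, 64, 128, 128), torch.randn(1, 128, 64, 64), torch.randn(1, 256, 32, 32)]
aligned = conv_tm_align(maps, target_hw=(64, 64))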
The attention embedding module AEM is used to establish interaction between high-level and low-level semantic features. The input feature set is {f1, …, fN}, where N is the number of features, and the attention embedding module AEM obtains the attention fusion feature fa as shown in formula (6):
Wherein α represents the attention mechanism, Att(·) represents the attention operation, fa represents the generated attention fusion feature, n represents the current feature index, fn represents the current feature, and fN represents all features.
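As a sketch only, one possible form of such an attention fusion is shown below; the per-feature gating used here (channel attention followed by summation) is an assumed stand-in for the Att(·) operation of formula (6):

```python
import torch
import torch.nn as nn

class AttentionEmbedding(nn.Module):
    """Illustrative AEM-style fusion (assumed form): per-feature attention weights gate each
    input, and the gated features are summed into one attention fusion feature f_a."""
    def __init__(self, channels, num_inputs):
        super().__init__()
        self.gates = nn.ModuleList(
            [nn.Sequential(nn.AdaptiveAvgPool2d(1),
                           nn.Conv2d(channels, channels, 1),
                           nn.Sigmoid()) for _ in range(num_inputs)])

    def forward(self, feats):
        # feats: list of N feature maps with identical shape (B, C, H, W)
        fa = torch.zeros_like(feats[0])
        for f, gate in zip(feats, self.gates):
            fa = fa + gate(f) * f        # Att(f_n) applied to f_n, accumulated over n
        return fa

aem = AttentionEmbedding(channels=64, num_inputs=3)
fa = aem([torch.randn(1, 64, 32, 32) for _ in range(3)])
```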
the multi-scale graph inference module MsGRM obtains global semantics and spatial details of the target using node information transfer and convergence functions, and the multi-scale graph inference module MsGRM collects and transfers node features of different scales using a topology graph method, forms interactions between the nodes, and builds long-term dependencies to strengthen the features.
Preferably, in step S4, the topology graph is defined by an edge set ε = {ε1, …, εM}, representing the edges between nodes, and a set of graph nodes; the adjacency matrix A describes the connections of the constructed topology graph, as shown in formula (7):
Where vi and vj represent graph nodes. When nodes vi and vj are associated, or vi and vj are the same node (a self-loop), the adjacency matrix entry aij is defined as 1; when there is no association between the nodes, the edge weight is set to 0, i.e. aij = 0.
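For illustration, formula (7) can be realized as in the following sketch, where the edge list and node count are hypothetical inputs:

```python
import torch

def build_adjacency(edges, num_nodes):
    """Illustrative construction of the adjacency matrix of formula (7) (interface assumed):
    a_ij = 1 if nodes v_i and v_j are associated or i == j (self-loop), else 0."""
    A = torch.eye(num_nodes)       # self-loops: a_ii = 1
    for i, j in edges:             # edges: iterable of associated node index pairs
        A[i, j] = 1.0
        A[j, i] = 1.0              # undirected association
    return A

A = build_adjacency(edges=[(0, 1), (1, 2)], num_nodes=4)
```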
Preferably, the specific operation of step S5 is:
Reconstructing and refining the feature maps generated by the attention embedding module AEM: the two-scale feature maps generated by the AEM are operated on with convolutions of different scales to reconstruct a group of dimension-reduction feature maps and a group of projection matrices. The reconstruction process is shown in formula (8):
Wherein the four sets denote, respectively, the dimension-reduction feature maps of the first scale, the dimension-reduction feature maps of the second scale, the reconstructed projection feature maps of the first scale and those of the second scale; δ(·) represents the dimension-reduction reconstruction function, and the projection function produces the projection matrices. Each dimension-reduction feature map is matrix-multiplied with the corresponding projection matrix to obtain the node feature map. Meanwhile, the adjacency matrix A of step S4 is optimized using Laplace regularization so as to match the node feature mappings of different scales; the adjacency matrix A is optimized as follows:
Wherein the first term represents the feature-mapping set of nodes at different scales, I is an identity matrix, and the last term represents the adjacency matrix after Laplace regularization.
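A minimal sketch of the Laplace regularization step is given below; it assumes the standard renormalization with self-loops and the degree matrix, which the text does not spell out here:

```python
import torch

def laplace_regularize(A):
    """Illustrative Laplace-style renormalization of the adjacency matrix (assumed to follow the
    standard GCN trick): add self-loops with the identity I, then symmetrically normalize with
    the degree matrix, A_hat = D^{-1/2} (A + I) D^{-1/2}."""
    A_tilde = A + torch.eye(A.size(0))
    deg = A_tilde.sum(dim=1)                   # node degrees of the self-looped graph
    d_inv_sqrt = torch.diag(deg.pow(-0.5))
    return d_inv_sqrt @ A_tilde @ d_inv_sqrt

A = torch.tensor([[0., 1., 0.], [1., 0., 1.], [0., 1., 0.]])
A_hat = laplace_regularize(A)
```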
Preferably, the specific operation of step S6 is:
For the generated node feature maps, a graph convolution (GCN) module is provided to realize node feature aggregation;
First, the multi-scale graph reasoning module MsGRM reconstructs again the two-scale feature maps generated by the attention embedding module AEM in step S5 to obtain a back-projection matrix;
Second, the back-projection matrix is multiplied with the graph-convolution feature mapping, converting it back to the original hidden space, and the result is fused by a convolution layer to obtain a feature map;
Finally, the generated feature map is added, with a pixel-by-pixel addition strategy, to the feature map initially generated by the attention embedding module AEM, producing a new graph-convolution feature; the calculation is shown in formula (10):
Wherein the terms represent, respectively, the features corresponding to different scales, the features generated by the AEM, the adjacency matrix after Laplace adjustment and the degree matrix of the adjacency matrix A; Θs represents the weighting matrix. The resulting new graph-convolution feature is generated as shown in formula (11):
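Formulas (10) and (11) are not reproduced here; purely for illustration, the projection, graph convolution, back-projection and pixel-wise addition described above could be sketched as follows, with the projection mechanism and all shapes being assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GraphReasoning(nn.Module):
    """Illustrative MsGRM-style step (assumed form): project pixels to graph nodes, run one
    GCN layer Z = LeakyReLU(A_hat @ X @ Theta), back-project to the pixel grid, fuse with a
    1x1 convolution, and add pixel-wise to the AEM feature."""
    def __init__(self, channels, num_nodes):
        super().__init__()
        self.project = nn.Conv2d(channels, num_nodes, 1)          # projection / back-projection weights
        self.theta = nn.Linear(channels, channels, bias=False)    # weighting matrix Theta_s
        self.fuse = nn.Conv2d(channels, channels, 1)

    def forward(self, f_aem, A_hat):
        B, C, H, W = f_aem.shape
        P = torch.softmax(self.project(f_aem).flatten(2), dim=-1)        # (B, K, H*W) projection
        X = torch.bmm(P, f_aem.flatten(2).transpose(1, 2))               # node features (B, K, C)
        Z = F.leaky_relu(torch.bmm(A_hat.expand(B, -1, -1), self.theta(X)))  # graph convolution
        back = torch.bmm(P.transpose(1, 2), Z).transpose(1, 2).reshape(B, C, H, W)  # back-projection
        return f_aem + self.fuse(back)                                   # pixel-wise addition to the AEM feature

grm = GraphReasoning(channels=64, num_nodes=16)
out = grm(torch.randn(2, 64, 32, 32), A_hat=torch.eye(16))
```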
Preferably, in step S7, a priori knowledge is introduced to strengthen the representation of low-level semantics. On the decoding path, the information flow of the decoding modules is not only connected layer by layer but also provided with cross-layer connections, and the cross-layer transmission of the information flow adopts residual connections. The total feature map fMsGRM finally generated by the multi-scale graph reasoning module MsGRM is expressed as:
Wherein the first two terms represent the feature maps of the first and second scales, and the total feature map of the decoding modules is assembled from the four scales generated by the four decoding modules.
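As an illustrative sketch of the cross-layer residual fusion, the outputs of the four decoding modules could be combined as below; the resolutions and the simple additive fusion are assumptions:

```python
import torch
import torch.nn.functional as F

def fuse_decoder_outputs(decoder_feats, target_hw):
    """Illustrative cross-layer fusion (assumed): the feature maps produced by the four decoding
    modules are resized to a common resolution and combined residually into f_MsGRM."""
    fused = None
    for f in decoder_feats:
        f = F.interpolate(f, size=target_hw, mode="bilinear", align_corners=False)
        fused = f if fused is None else fused + f   # residual-style accumulation across layers
    return fused

feats = [torch.randn(1, 64, s, s) for s in (16, 32, 64, 128)]
f_msgrm = fuse_decoder_outputs(feats, target_hw=(128, 128))
```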
Therefore, the semantic segmentation method for enhancing boundary perception has the following technical effects:
(1) The U-shaped network structure allows feature extraction at more scales, and the carefully constructed graph convolution makes the network friendlier to irregular target areas. When the framework is applied to image analysis, the unstructured features in the image can be well accommodated, so that relevant targets are conveniently extracted and irrelevant background is eliminated;
(2) In the model, richer image features can be extracted through the scale transformation of the coding modules and transmitted to each decoding module through dense connections, enhancing the reusability of the features;
(3) The invention deploys a graph convolution network in the decoding path to analyse large irregular areas and combines it with SCSE (Concurrent Spatial and Channel Squeeze and Excitation) attention mechanisms to capture complementary details, thereby producing a more robust and accurate decoder;
(4) Experiments prove that the model maintains state-of-the-art results on the planned datasets while remaining as lightweight as possible;
(5) Compared with existing advanced segmentation algorithms on the same datasets, the method achieves more accurate segmentation on images with insufficient target saliency and low image quality.
The technical scheme of the invention is further described in detail through the drawings and the embodiments.
Drawings
FIG. 1 is a network overall structure diagram of a semantic segmentation method for enhancing boundary perception, wherein (a) in FIG. 1 is an algorithm overall structure diagram, (b) in FIG. 1 is a coding module structure diagram, and (c) in FIG. 1 is a decoding module structure diagram;
Fig. 2 is a view showing the results of algorithm reasoning.
Detailed Description
The technical scheme of the invention is further described below through the attached drawings and the embodiments.
Example 1
The invention provides a semantic segmentation method with enhanced boundary perception. It takes U-Net as the backbone structure, improves feature reuse and propagation, and uses convolution operations of different sizes in the encoding part to enrich the feature scales; notably, the network adds a graph convolution to replace one convolution layer in the decoding path. An Attention Embedding Module (AEM) is added to each decoding module to aggregate and refine the information streams of the different branches. Without any pre-training or post-processing, the network model still shows excellent performance and great potential.
The method mainly comprises an encoding path and a decoding path, as shown in fig. 1 (a). The method specifically comprises the following steps:
s1, the coding path consists of 5 coding modules, each coder codes multi-level semantic information of a target area in the whole image as shown in fig. 1 (b), and convolution operations of different scales in the coding modules obtain multi-scale information of the target area, namely a characteristic diagram. The encoding path effectively aggregates context semantics using pooling operations. The decoding path is mainly composed of four modules (as shown in detail in fig. 1 (c)).
S11, the coding path consists of 5 coding modules and is composed of an initial feature extraction module and four identical encoder group layers, wherein the initial feature module comprises a convolution module with a convolution kernel of 7 multiplied by 7 and two convolution modules with convolution kernels of 3 multiplied by 3, a convolution feature diagram is obtained, and the specific calculation process is shown in a formula (1):
Wherein f0 represents features of a first layer through the initial feature extraction module, f1 represents features of a second and third layers through the initial feature extraction module, f'0 represents features of a fourth layer through the initial feature extraction module, σ (·) represents an activation function, σs (·) represents a ReLU activation function, BN (·) represents a normalization layer of the batch process, conv3×3 ()) and Conv7×7 ()) represent convolution operations of 3×3 and 7×7, respectively; representing the addition of the matrix and MaxPool3×3 ()'s representing a maximum pooling operation of 3 x 3.
S12, inputting the convolution feature map obtained in step S11 into the first coding module. As shown in the internal structure of each coding module in FIG. 1 (b), the feature map output by the previous layer is then input into a group of consecutive 3×3 convolutions to obtain local features of the image and highlight the differences between the target area and the background; meanwhile, to prevent finer-detail features from being ignored, the feature map output by the previous layer is input in parallel into a 1×1 convolution, and the features of the two different scales are then fused. The fusion method is shown in the following formula:
After the feature map output by the fusion of the different-scale features passes through an activation layer, in order to prevent gradient explosion or vanishing caused by an excessively deep network structure, the information stream is transmitted to the next layer through a skip connection in residual form; the specific calculation is shown in formula (3), where δ(·) represents a group of consecutive 3×3 convolution operations and σ(·) represents the ReLU activation function:
S13, considering that abundant bottom semantic information is favorable for representing physical attributes such as target size and appearance, embedding the large-scale features f0 extracted by the initial feature module into each coding module, prompting the network to acquire more detailed coding features, and finally sequentially generating useful coding features fs through four coding modules. The embedding process of the large-scale features is shown as a formula (4):
Wherein the left-hand term represents the module total feature map transformed by the feature transformation module ConvTM, Cat(·) represents feature concatenation, and s indexes the current coding module.
S2, each decoding module aggregates and refines the multi-scale information under the guidance of the Attention Embedding Module (AEM). The graph convolution module captures the feature information of large-scale irregular areas, and the attention embedding module generates complementary spatial details to better model the encoded features.
The decoding path consists of four decoders with the same structure, each comprising a feature transformation module (Convolution Transform Module, ConvTM) composed of convolutions, an attention embedding module (Attention Embedding Module, AEM) and a multi-scale graph reasoning module (Multi-scale Graph Reasoning Module, MsGRM).
(1) ConvTM: ConvTM is mainly used for transforming and up- or down-sampling the different-scale features fs output by the corresponding coding modules, ensuring that the feature maps input to the decoder are of consistent size and effectively preventing the loss of detailed information and the gradient explosion or vanishing caused by excessive up- and down-sampling in the network. For example, the transformation of the third decoding module is shown in the following formula (5), and the detailed structure of the third decoding module is shown in fig. 1 (c):
Wherein the four transformed terms represent, respectively, the first-scale feature map after down-sampling, the second-scale feature map, the third-scale feature map after down-sampling, and the fourth-scale feature map after convolution, activation and up-sampling; DownS(·) represents a down-sampling operation and UP(·) represents a nearest-neighbor interpolation up-sampling operation;
(2) AEM is used to establish interactions between high-level and low-level semantic features. In short, low-level semantic information is embedded in high-level features, and high-level discrimination features are embedded in the low-level features. Let the input feature set be { f1…fN }, where N is the number of features. The fusion fa of the AEM provided by the invention is characterized by the following formula (6):
Where α represents the attention mechanism, att (·) represents the attention operation, fa represents the generated attention fusion feature, n represents the current feature index, fn represents the current feature, and fN represents all features. In this way, the underlying features embed context information while preserving spatial detail, improving feature representation and facilitating segmentation of target regions in the image.
(3) MsGRM: MsGRM mainly uses node information transfer and aggregation functions to obtain the global semantics and spatial details of the target. A traditional convolution algorithm can only operate in a square area of fixed size and cannot effectively extract the features of an irregular target in an image. MsGRM uses the topology-graph method to collect and transfer node features of different scales, form interactions between the nodes, and establish long-term dependencies to strengthen the features. The module adopts topology reconstruction and a method of embedding multi-scale features into the nodes, which helps improve the network's ability to decide the attributes of the boundary pixels of the target region and makes learning through the graph nodes more effective. Global and contextual spatial details further improve the feature representation.
The specific decoding path includes:
S21, nodes of constructed topological graph
The feature map output in step S13 is input into the decoder and convolved to obtain a smaller feature map; each pixel point of this smaller feature map forms a node of the topology graph. ε = {ε1, …, εM} represents the set of edges between nodes, i.e. the relationships between elements in the image, and the node set collects the graph nodes. The adjacency matrix A describes the connections of the constructed topology graph, as shown in formula (7):
Where vi and vj represent graph nodes. When nodes vi and vj are associated, or vi and vj are the same node (a self-loop), the adjacency matrix entry aij is defined as 1. When there is no association between the nodes, the edge weight is set to 0, i.e. aij = 0.
S22, reconstructing and refining the feature map generated by the AEM to meet the input requirement of MsGRM and the requirement of feature aggregation of the graph nodes
MsGRM implements global semantic extraction by aggregating and passing these node features. The feature maps generated by the AEM are reconstructed and refined to meet the input requirements of MsGRM and the requirements of feature aggregation over the graph nodes. As shown in FIG. 1 (c), the set of feature maps generated by the AEM is operated on with convolutions of different scales to reconstruct a group of dimension-reduction feature maps and a group of projection matrices. The reconstruction process is shown in the following formula (8):
Wherein the four sets denote, respectively, the dimension-reduction feature maps of the first scale, the dimension-reduction feature maps of the second scale, the reconstructed projection feature maps of the first scale and those of the second scale; δ(·) represents the dimension-reduction reconstruction function, and the projection function produces the projection matrices. Each dimension-reduction feature map is matrix-multiplied with the corresponding projection matrix to obtain the node feature map. Meanwhile, the adjacency matrix A of step S4 is optimized using Laplace regularization so as to match the node feature mappings of different scales; the adjacency matrix A is optimized as follows:
Wherein the first term represents the feature-mapping set of nodes at different scales, I is an identity matrix, and the last term represents the adjacency matrix after Laplace regularization.
S23, reconstructing the feature map by using the map convolution and AEM
For the generated node feature maps, a graph convolution (GCN) module is provided to realize node feature aggregation and obtain better global and contextual semantic details, comprising the following steps:
First, the feature maps generated by the AEM in step S22 are reconstructed again to obtain a back-projection matrix;
Second, the back-projection matrix is multiplied with the graph-convolution feature mapping, converting it back to the original hidden space, and the result is fused with a convolution layer to obtain a feature map;
Finally, to obtain a better feature representation, the generated feature map is added, with a pixel-by-pixel addition strategy, to the feature map initially generated by the AEM, producing a new multi-scale graph-convolution feature; the calculation is shown in formula (10):
Wherein the activation function is LeakyReLU, which adds a small negative slope to ReLU so that, for negative inputs, the activation is not 0 and has a small derivative, avoiding dead neurons and accelerating the convergence of the neural network; the degree matrix corresponds to the adjacency matrix A, and Θ represents the weighting matrix. The final new graph-convolution feature is generated as shown in the following formula (11):
S24, information flow cross-layer connection of decoding module
MsGRM obtains multi-scale node features by constructing topology graphs of different scales, makes full use of the multi-scale information, and better handles global and contextual details through the node transfer function. Furthermore, introducing a priori knowledge strengthens the representation of low-level semantics. On the decoding path, the information flow of the decoding modules can be connected across layers, ensuring stable transmission of detailed information to the greatest extent. However, in order to reduce redundant information and alleviate problems such as gradient vanishing, the cross-layer transmission of the information stream adopts residual connections. The feature map fMsGRM finally generated by MsGRM can be expressed as:
Wherein the first two terms represent the feature maps of the first and second scales, and the total feature map of the decoding modules is assembled from the four scales generated by the four decoding modules.
Table 1 comparison of different segmentation methods on the same dataset
Method                Backbone network   FLOPs↓   aACC↑   mIOU↑
APCNet                ResNet101          5.1      75.22   13.90
DeepLabv3+            ResNet101          1.8      81.17   14.46
MobileNetv3           MobileNetV3        2.1      77.24   16.54
UNet                  FCN                1.0      82.42   17.41
OCRNet                HRNetV2p-W48       4.7      77.32   29.25
UperNet               Swin-B             6.9      80.04   27.23
SETR Naive            ViT-L              3.8      78.66   18.65
Segformer             MIT-B5             13.9     80.78   16.06
U-GRNets (invention)  ResNet34           6.8      93.68   64.16
Therefore, the semantic segmentation method with enhanced boundary perception achieves a more accurate segmentation effect. The final effect of the algorithm is shown in fig. 2, which presents, from left to right, original image I, result I, original image II, result II, original image III and result III.
It should be noted that the above-mentioned embodiments are merely for illustrating the technical solution of the present invention and not for limiting the same, and although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that the technical solution of the present invention may be modified or substituted by the same, and the modified or substituted technical solution may not deviate from the spirit and scope of the technical solution of the present invention.

Claims (10)

CN202411004607.7A · Priority 2024-07-25 · Filed 2024-07-25 · A semantic segmentation method with enhanced boundary perception · Active · CN118898718B (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN202411004607.7A (CN118898718B) | 2024-07-25 | 2024-07-25 | A semantic segmentation method with enhanced boundary perception

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN202411004607.7A (CN118898718B) | 2024-07-25 | 2024-07-25 | A semantic segmentation method with enhanced boundary perception

Publications (2)

Publication Number | Publication Date
CN118898718A (en) | 2024-11-05
CN118898718B (en) | 2025-04-18

Family

ID=93264585

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN202411004607.7A (CN118898718B, Active) | A semantic segmentation method with enhanced boundary perception | 2024-07-25 | 2024-07-25

Country Status (1)

Country | Link
CN (1) | CN118898718B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN116205927A* | 2023-02-24 | 2023-06-02 | 西安电子科技大学 | Image segmentation method based on boundary enhancement
CN117237645A* | 2023-11-15 | 2023-12-15 | 中国农业科学院农业资源与农业区划研究所 | Training method, device and equipment of semantic segmentation model based on boundary enhancement

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
EP3171297A1* | 2015-11-18 | 2017-05-24 | CentraleSupélec | Joint boundary detection image segmentation and object recognition using deep learning
CN113221969A* | 2021-04-25 | 2021-08-06 | 浙江师范大学 | Semantic segmentation system and method based on Internet of things perception and based on dual-feature fusion
CN116681983B* | 2023-06-02 | 2024-06-11 | 中国矿业大学 | A narrow and long target detection method based on deep learning
CN117078930B* | 2023-08-11 | 2025-07-22 | 河南大学 | Medical image segmentation method based on boundary sensing and attention mechanism
CN117274608B* | 2023-11-23 | 2024-02-06 | 太原科技大学 | Semantic segmentation method of remote sensing images based on spatial detail perception and attention guidance


Also Published As

Publication number | Publication date
CN118898718A (en) | 2024-11-05

Similar Documents

Publication | Publication Date | Title
Li et al. Multitask semantic boundary awareness network for remote sensing image segmentation
CN110428428B (en) An image semantic segmentation method, electronic device and readable storage medium
Zhou et al. Contextual ensemble network for semantic segmentation
CN109859190B (en) Target area detection method based on deep learning
CN112101410B (en) A method and system for image pixel semantic segmentation based on multimodal feature fusion
CN110276765B (en) Image panorama segmentation method based on multi-task learning deep neural network
CN110929665B (en) Natural scene curve text detection method
CN110619369A (en) Fine-grained image classification method based on feature pyramid and global average pooling
WO2022000426A1 (en) Method and system for segmenting moving target on basis of twin deep neural network
Seyedhosseini et al. Semantic image segmentation with contextual hierarchical models
CN108509978A (en) The multi-class targets detection method and model of multi-stage characteristics fusion based on CNN
CN111242288A (en) A multi-scale parallel deep neural network model building method for lesion image segmentation
CN118781596A (en) Semantic segmentation method of remote sensing images based on semantic adaptive edge enhancement network
CN115631369A (en) A fine-grained image classification method based on convolutional neural network
CN111583285A (en) A Semantic Segmentation Method of Liver Image Based on Edge Attention Strategy
Jiang et al. Forest-CD: Forest change detection network based on VHR images
CN113657560A (en) Weak supervision image semantic segmentation method and system based on node classification
Ma et al. An attention-based progressive fusion network for pixelwise pavement crack detection
CN110059769A (en) The semantic segmentation method and system rebuild are reset based on pixel for what streetscape understood
CN110866938B (en) A fully automatic video moving object segmentation method
Wang et al. CWC-transformer: a visual transformer approach for compressed whole slide image classification
CN107133579A (en) Face identification method based on CSGF(2D)2-PCANet convolutional networks
CN118840553A (en) Weak annotation remote sensing image semantic segmentation method based on double learning mechanism
CN116012658B (en) A self-supervised pre-training target detection method, system, device and storage medium
CN117746045A (en) A medical image segmentation method and system based on Transformer and convolution fusion

Legal Events

Date | Code | Title | Description
PB01 | Publication
SE01 | Entry into force of request for substantive examination
GR01 | Patent grant
