Disclosure of Invention
The invention aims to provide an interactive object segmentation method based on bounding box input and gaze point assistance, which performs interactive segmentation using an interactive form that combines an input box with an estimated gaze point map.
The technical solution for achieving this aim is an interactive object segmentation method based on bounding box input and gaze point assistance, comprising the following steps:
Step 1: obtain the bounding box of the object to be segmented in an image I and convert it into a binary bounding box map B; simultaneously obtain the gaze point map FM of the target image and erase gaze point information outside the input box to obtain a processed gaze point map;
Step 2: input the image I and the bounding box map B into an initial segmentation network, Coarse U-Net, to generate a coarse segmentation result MC and box-based multi-scale features;
Step 3: compute the similarity between the coarse segmentation result MC and the processed gaze point map, adjust the gaze point map according to this similarity, and obtain an adjusted gaze point map FM';
Step 4: concatenate the image I, the adjusted gaze point map FM', and the coarse segmentation result MC along the channel dimension, input the result to a refinement segmentation network, Refinement U-Net, extract refinement features, and fuse the box-based features extracted by Coarse U-Net layer by layer during this process;
Step 5: input the refinement features into the decoder of Refinement U-Net for decoding to obtain the final segmentation result M.
Further, step 1 (obtaining the bounding box of the object to be segmented in the image I, converting it into a binary bounding box map B, simultaneously obtaining the gaze point map FM of the target image, and erasing gaze point information outside the input box to obtain a processed gaze point map) comprises the following steps:
Step 1-1: label the object to be segmented in the image I by drawing a rectangular box, and record the lower-left corner coordinate (x_min, y_min) and the upper-right corner coordinate (x_max, y_max) of the box;
Step 1-2: compute the bounding box map B, in which the pixels inside the box (x_min ≤ x ≤ x_max, y_min ≤ y ≤ y_max) are set to 1 and all other pixels are set to 0 (a reconstruction of this formula is given after step 1-4);
Step 1-3: using the trained gaze point prediction model TranSalNet, input the image I and pass it sequentially through the CNN encoder, Transformer encoder, and CNN decoder of TranSalNet to obtain an estimated gaze point map FM;
Step 1-4: set all pixels of the estimated gaze point map FM that lie outside the input box to 0 by pixel-wise multiplication with B, so as to erase the gaze point information outside the input box and obtain the processed gaze point map.
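The formulas for steps 1-2 and 1-4 are not reproduced in the text above; the following is a reconstruction consistent with the surrounding description, where the hat notation for the processed gaze point map is introduced here only for readability and is not a symbol from the original:

```latex
B(x, y) =
\begin{cases}
1, & x_{\min} \le x \le x_{\max} \ \text{and} \ y_{\min} \le y \le y_{\max}, \\
0, & \text{otherwise},
\end{cases}
\qquad
\widehat{FM} = FM \odot B,
```

where ⊙ denotes pixel-wise multiplication.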
Further, step 2 (inputting the image I and the bounding box map B into the initial segmentation network Coarse U-Net to generate a coarse segmentation result MC and box-based multi-scale features) comprises the following steps:
Step 2-1: concatenate the image I with the bounding box map B along the channel dimension to obtain an input tensor Input;
Step 2-2: input Input into the encoder of Coarse U-Net for feature extraction. Each encoder layer applies a convolution block, consisting of three 3×3 convolution layers with ReLU activations, to the max-pooled (MaxPooling) output of the preceding layer; five features of different scales are finally obtained, one for each encoder layer i = 1, 2, ..., 5;
Step 2-3: input the encoder features into the decoder for decoding. Each decoder layer applies a convolution block, consisting of three 3×3 convolution layers with ReLU activations, to the channel-wise concatenation (Concat) of the upsampled (Upsample) output of the preceding decoder layer and the corresponding encoder feature; finally, the output of the last decoder layer is reduced to a single channel by a 3×3 convolution layer, and the preliminary segmentation result MC is obtained after a Sigmoid operation.
Further, step 3 (calculating the similarity between the coarse segmentation result MC and the processed gaze point map, and adjusting the gaze point map according to this similarity to obtain the adjusted gaze point map FM') is specifically as follows:
Step 3-1: compute the similarity α between the coarse segmentation result MC and the processed gaze point map as their intersection-over-union (IoU), using pixel-wise multiplication to form the intersection and pixel-wise summation over each map to form the union;
Step 3-2: globally adjust the processed gaze point map using α, multiplying each of its pixel values by α to obtain FM'.
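The similarity and adjustment formulas of steps 3-1 and 3-2 are likewise not reproduced; based on the description of α as an IoU-style measure and of the adjustment as a global scaling, a plausible reconstruction (using the hat notation introduced after step 1-4) is:

```latex
\alpha = \frac{\left| M_C \odot \widehat{FM} \right|}
             {\left| M_C \right| + \left| \widehat{FM} \right| - \left| M_C \odot \widehat{FM} \right|},
\qquad
FM' = \alpha \cdot \widehat{FM},
```

where ⊙ denotes pixel-wise multiplication and |·| denotes pixel-wise summation.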
Further, step 4 (concatenating the image I, the adjusted gaze point map FM', and the coarse segmentation result MC along the channel dimension, inputting the result to the refinement segmentation network Refinement U-Net, extracting refinement features, and fusing the box-based features extracted by Coarse U-Net layer by layer during this process) comprises the following steps:
Step 4-1: concatenate the original image I, the adjusted gaze point map FM', and the coarse segmentation result MC along the channel dimension to obtain an input tensor Input2;
Step 4-2: input Input2 into the encoder of Refinement U-Net to extract features layer by layer; at the same time, the cross skip connection module fuses the feature of the corresponding Coarse U-Net layer with the current feature. The convolution blocks extracting features in the Refinement U-Net encoder each consist of three 3×3 convolution layers with ReLU activations, and the fused feature at each layer is processed by two criss-cross attention operations (denoted Φ_r) to give the refinement feature of the i-th encoder layer.
Further, step 5 (inputting the refinement features into the decoder of Refinement U-Net for decoding to obtain the final segmentation result M) is specifically as follows: each decoder layer of Refinement U-Net applies a convolution block, consisting of three 3×3 convolution layers with ReLU activations, to the upsampled (Upsample) output of the preceding decoder layer; finally, the output of the last decoder layer is reduced to a single channel by a 3×3 convolution layer, and the final segmentation result M is obtained through a Sigmoid operation.
An interactive object segmentation system based on bounding box input and gaze point assistance implements the above interactive object segmentation method based on bounding box input and gaze point assistance, thereby realizing interactive object segmentation based on bounding box input and gaze point assistance.
A computer device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the computer program, implements the above interactive object segmentation method based on bounding box input and gaze point assistance, thereby realizing interactive object segmentation based on bounding box input and gaze point assistance.
A computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the above interactive object segmentation method based on bounding box input and gaze point assistance, thereby realizing interactive object segmentation based on bounding box input and gaze point assistance.
Compared with the prior art, the invention has the following remarkable advantages:
1) The invention uses an interaction mode that combines the input box with a gaze point map, making full use of implicit interaction information to assist segmentation and thereby improving segmentation quality.
2) The dual U-Net segmentation network structure, the gaze point map adjustment module, and the cross skip connections with a self-attention mechanism proposed by the invention can effectively alleviate the discrepancy problem caused by the estimated gaze point map and further improve segmentation quality.
3) The method can be used directly as a plug-in optimization tool to improve the segmentation quality of other input-box-based interactive object segmentation models, allowing any such model to conveniently benefit from the estimated gaze point map.
Detailed Description
The present application will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present application more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.
Referring to Figs. 1 and 2, the interactive object segmentation method based on bounding box input and gaze point assistance of the present invention comprises the following steps:
Step 1: acquire the object to be segmented and its bounding box in the target image, and estimate the gaze point map of the target image. Specifically:
For a given target image, a rectangular box is first drawn on the image to mark the target object. The box is then converted into a binary image in which pixels inside the box are set to 255 (representing the target region) and pixels outside the box are set to 0 (representing the background). Next, using the trained gaze point prediction model TranSalNet, the input image I passes sequentially through the CNN encoder, Transformer encoder, and CNN decoder of TranSalNet to produce an estimated gaze point map. Since the estimated gaze point map is based on free viewing, it must be processed in this step to erase the gaze points outside the box. Specifically, all pixels of the estimated gaze point map that lie outside the input box are set to 0 to avoid interference with subsequent processing, thereby ensuring that only gaze points within the target region are considered.
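A minimal sketch of this preprocessing, assuming PyTorch tensors; the helper names are hypothetical, a 0/1 box map is assumed so that multiplication acts as a pure mask, and the TranSalNet call in the usage comment stands in for whatever inference code is actually used:

```python
import torch

def build_box_map(h, w, xmin, ymin, xmax, ymax):
    """Binary bounding box map B: 1 inside the box, 0 outside.

    The patent text describes 255 for the foreground of the visualized box
    image; a 0/1 mask is assumed here so that pixel-wise multiplication
    erases gaze information outside the box without rescaling it.
    """
    box = torch.zeros(1, h, w)
    box[:, ymin:ymax + 1, xmin:xmax + 1] = 1.0
    return box

def mask_gaze_map(fm, box):
    """Erase gaze information outside the input box (step 1-4)."""
    return fm * box  # pixel-wise multiplication

# Usage sketch (TranSalNet inference assumed to be available elsewhere):
# fm = transalnet(image.unsqueeze(0))          # 1 x 1 x H x W saliency map
# b  = build_box_map(320, 320, 40, 60, 280, 300)
# fm_in_box = mask_gaze_map(fm.squeeze(0), b)  # processed gaze point map
```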
Step 2: input the target image and the input box into the initial segmentation network Coarse U-Net to obtain a coarse segmentation result and box-based multi-scale features. Specifically:
The target image is combined with the input box information to form a four-channel tensor as input. In Coarse U-Net, the encoder is responsible for extracting multi-scale features of the input data at five scales (i = 1, 2, ..., 5), each corresponding to a particular level of feature representation. The decoder then fuses the multi-scale features extracted by the encoder through skip connections and generates an initial coarse segmentation result MC.
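A condensed sketch of the Coarse U-Net structure described above, assuming PyTorch; the class name, channel widths, and upsampling mode are illustrative assumptions, while the three 3×3 convolutions per block, the max-pooled encoder, the concatenating decoder, and the 1-channel Sigmoid head follow the text:

```python
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    """Three 3x3 convolutions with ReLU, as described for each block."""
    layers = []
    for i in range(3):
        layers += [nn.Conv2d(in_ch if i == 0 else out_ch, out_ch, 3, padding=1),
                   nn.ReLU(inplace=True)]
    return nn.Sequential(*layers)

class CoarseUNet(nn.Module):
    def __init__(self, in_ch=4, widths=(64, 128, 256, 512, 1024)):
        super().__init__()
        self.enc = nn.ModuleList()
        prev = in_ch
        for w in widths:
            self.enc.append(conv_block(prev, w))
            prev = w
        self.pool = nn.MaxPool2d(2)
        self.dec = nn.ModuleList(
            [conv_block(widths[i + 1] + widths[i], widths[i]) for i in range(4)])
        self.up = nn.Upsample(scale_factor=2, mode='bilinear', align_corners=False)
        self.head = nn.Conv2d(widths[0], 1, 3, padding=1)  # reduce to 1 channel

    def forward(self, x):
        feats = []                        # box-based multi-scale features
        for i, block in enumerate(self.enc):
            x = block(x if i == 0 else self.pool(x))
            feats.append(x)
        d = feats[-1]
        for i in range(3, -1, -1):        # decode from deepest to shallowest
            d = self.dec[i](torch.cat([self.up(d), feats[i]], dim=1))
        mc = torch.sigmoid(self.head(d))  # coarse segmentation result MC
        return mc, feats
```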
Step 3: input the initial segmentation result and the gaze point map into the gaze point map adjustment module, which computes their similarity and adjusts the gaze point map accordingly.
As previously described, there may be a discrepancy between the estimated gaze area and the user's real target object. Accordingly, a gaze point map adjustment module is developed to estimate the reliability of the gaze point map and then adjust it. As described in step 1, only the part of the gaze point map within the input box is considered; its correlation with the coarse segmentation result MC is then measured.
In practice, this correlation α is computed as the intersection-over-union (IoU) between the processed gaze point map and MC: the pixel-wise product of the two maps is summed to give the intersection, and the pixel-wise sums of the two maps minus this intersection give the union. A higher α means higher reliability, i.e., better agreement between the gaze area in the estimated gaze point map and the user's real target object, and vice versa. α is therefore used to globally adjust the processed gaze point map, multiplying each of its pixel values by α to obtain the adjusted gaze point map FM'.
When α is low, the gaze point map is suppressed, which limits its influence. In the extreme case where α is zero, FM' is set entirely to zero, meaning that the gaze area is completely unrelated to the user's real target object and the segmentation network should rely entirely on the information provided by the input box.
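A minimal sketch of the adjustment computation, assuming soft maps with values in [0, 1]; the eps term is a numerical-stability assumption not mentioned in the text:

```python
import torch

def adjust_gaze_map(mc, fm_in_box, eps=1e-6):
    """Gaze point map adjustment module (step 3).

    mc:        coarse segmentation result MC, values in [0, 1]
    fm_in_box: gaze point map already masked by the input box (step 1-4)
    Returns the reliability alpha and the globally adjusted map FM'.
    """
    inter = (mc * fm_in_box).sum()
    union = mc.sum() + fm_in_box.sum() - inter
    alpha = inter / (union + eps)        # IoU-style similarity
    fm_adjusted = alpha * fm_in_box      # global scaling; zero when alpha == 0
    return alpha, fm_adjusted
```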
Step 4: concatenate the image I, the adjusted gaze point map FM', and the coarse segmentation result MC along the channel dimension, input the result to the refinement segmentation network Refinement U-Net, extract refinement features, and fuse the box-based features extracted by Coarse U-Net layer by layer during this process.
Specifically, the original image I, the adjusted gaze point map FM', and the coarse segmentation result MC are first concatenated along the channel dimension to obtain the input tensor Input2.
Then, Input2 is fed to the encoder of Refinement U-Net to extract features layer by layer; at the same time, the cross skip connection module fuses the features of the corresponding Coarse U-Net layers.
At each encoder layer of Refinement U-Net, a convolution block consisting of three 3×3 convolution layers with ReLU activations extracts the current feature; this feature is connected with the feature of the corresponding Coarse U-Net layer through the cross skip connection, and the connected features are processed by two criss-cross attention operations (denoted Φ_r) to produce the refinement feature of that layer.
Although the estimated gaze point map has been globally adjusted by the preceding gaze point map adjustment module, the adverse effects it may cause have not been completely eliminated. Compared with the estimated gaze point map, the input box provides more reliable interaction information about the target object. Therefore, during feature extraction in the Refinement U-Net encoder, the features extracted by Coarse U-Net are cross-connected with the current features at different levels, and these connected features are fused efficiently by the two criss-cross attention operations. In this way, the features extracted from the input box are further exploited to strengthen their dominant role, and the connected features are better fused through the self-attention mechanism, so that the gaze point information is incorporated appropriately.
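A structural sketch of the cross skip connection, assuming a CrissCrossAttention module (as in CCNet) is available from elsewhere; the 1×1 reduction used to match channel counts after concatenation is an assumption, since the text does not state how the fusion is dimensioned:

```python
import torch
import torch.nn as nn

class CrossSkipFusion(nn.Module):
    """Fuse a Refinement U-Net encoder feature with the corresponding
    Coarse U-Net feature, then apply two criss-cross attention operations.

    `attention_cls` is assumed to be an existing CrissCrossAttention
    implementation (e.g., from CCNet); it is not defined in the patent text.
    """
    def __init__(self, refine_ch, coarse_ch, attention_cls):
        super().__init__()
        self.reduce = nn.Conv2d(refine_ch + coarse_ch, refine_ch, 1)  # assumption
        self.att1 = attention_cls(refine_ch)
        self.att2 = attention_cls(refine_ch)

    def forward(self, refine_feat, coarse_feat):
        fused = self.reduce(torch.cat([refine_feat, coarse_feat], dim=1))
        return self.att2(self.att1(fused))  # two criss-cross attention passes
```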
Step 5: the fused features are decoded by the decoder of Refinement U-Net to obtain the final segmentation result M. Specifically:
Each decoder layer of Refinement U-Net applies a convolution block, consisting of three 3×3 convolution layers with ReLU activations, to the upsampled features from the preceding layer; finally, the output of the last decoder layer is reduced to a single channel by a 3×3 convolution layer and passed through a Sigmoid operation to obtain the final segmentation result M.
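A compact sketch of this decoder path; the class name and channel widths are assumptions, and because the text does not state whether encoder features are concatenated in this decoder, the sketch keeps the plain upsample-and-convolve form described above:

```python
import torch
import torch.nn as nn

def conv_block3(in_ch, out_ch):
    """Three 3x3 convolutions with ReLU, matching the block layout in the text."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True))

class RefinementDecoder(nn.Module):
    """Sketch of the Refinement U-Net decoder described in step 5."""
    def __init__(self, widths=(64, 128, 256, 512, 1024)):
        super().__init__()
        self.dec = nn.ModuleList(
            [conv_block3(widths[i + 1], widths[i]) for i in range(4)])
        self.up = nn.Upsample(scale_factor=2, mode='bilinear', align_corners=False)
        self.head = nn.Conv2d(widths[0], 1, 3, padding=1)  # reduce to 1 channel

    def forward(self, deepest_feat):
        d = deepest_feat                    # refinement feature of the deepest layer
        for i in range(3, -1, -1):          # decode from deep to shallow
            d = self.dec[i](self.up(d))
        return torch.sigmoid(self.head(d))  # final segmentation result M
```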
Examples
In order to verify the effectiveness of the inventive scheme, the following simulation experiments were performed.
The interactive object segmentation model W-Net based on bounding box input and gaze point assistance is implemented as follows:
Step 1: given the target image I of size 3×320×320 in Fig. 2, the bounding box of the object to be segmented in the image is acquired and converted into a binary bounding box map B of size 1×320×320. At the same time, the gaze point map of the target image is acquired and processed.
Step 1-1: label the object to be segmented in the image I by drawing a rectangular box, and record the lower-left corner coordinate (x_min, y_min) and the upper-right corner coordinate (x_max, y_max) of the box.
Step 1-2: compute the bounding box map B by setting the pixels inside the box (x_min ≤ x ≤ x_max, y_min ≤ y ≤ y_max) to 1 and all other pixels to 0.
Step 1-3: select a trained gaze point prediction model, input the image I, and obtain an estimated gaze point map FM of size 1×320×320 by inference.
Step 1-4: set all pixels of the estimated gaze point map FM that lie outside the input box to 0 through pixel-wise multiplication with B, so as to erase the gaze point information outside the input box and obtain the processed gaze point map.
Step 2: generate the initial segmentation result. The image I and the bounding box map B are input into the initial segmentation network Coarse U-Net to generate the coarse segmentation result and box-based multi-scale features.
Step 2-1: concatenate the image I with the bounding box map B along the channel dimension to obtain an input tensor Input of size 4×320×320.
Step 2-2: input Input into the encoder of Coarse U-Net to extract features. Each encoder layer applies a convolution block, consisting of three 3×3 convolution layers with ReLU activations, to the max-pooled output of the preceding layer, finally yielding five features of different scales.
Step 2-3: input the encoder features into the decoder for decoding. Each decoder layer applies a convolution block, consisting of three 3×3 convolution layers with ReLU activations, to the concatenation of the upsampled output of the preceding decoder layer and the corresponding encoder feature; finally, the output of the last decoder layer is reduced to a single channel by a 3×3 convolution layer, and the preliminary segmentation result MC is obtained after a Sigmoid operation.
Step 3: adjust the gaze point map. The initial segmentation result MC and the processed gaze point map are input into the gaze point map adjustment module, which computes their similarity and adjusts the gaze point map accordingly. The specific steps are as follows:
Step 3-1: compute the similarity α between the initial segmentation result MC and the processed gaze point map as their intersection-over-union, using pixel-wise multiplication and summation.
Step 3-2: globally adjust the processed gaze point map by multiplying each of its pixel values by α, obtaining the adjusted gaze point map FM'.
Step 4: concatenate the image, the adjusted gaze point map, and the coarse segmentation result along the channel dimension, input the result to the refinement segmentation network Refinement U-Net, extract refinement features, and fuse the box-based multi-scale features layer by layer through the cross skip connection module. Specifically:
Step 4-1: concatenate the original image, the adjusted gaze point map, and the coarse segmentation result along the channel dimension to obtain an input tensor Input2 of size 5×320×320.
Step 4-2: input Input2 into the encoder of Refinement U-Net to extract features layer by layer, and fuse the features of the corresponding Coarse U-Net layers through the cross skip connection module. Each convolution block extracting features consists of three 3×3 convolution layers with ReLU activations, and the connected features at each layer are processed by two criss-cross attention operations (Criss-Cross Attention, denoted Φ_r) to give the refinement feature of that layer.
Step 5: input the refinement features into the decoder of Refinement U-Net for decoding. Each decoder layer applies a convolution block, consisting of three 3×3 convolution layers with ReLU activations, to the upsampled features from the preceding layer; finally, the output of the last decoder layer is reduced to a single channel by a 3×3 convolution layer, and the final segmentation result M is obtained through a Sigmoid operation.
In summary, the present invention exploits the complementarity between the input box and the estimated gaze point map of an object to improve input-box-based interactive object segmentation. The proposed W-Net framework ensures that segmentation is guided mainly by features extracted from the input box and assisted by auxiliary information extracted from the estimated gaze point map, improving segmentation accuracy.
1. Experimental setup
1.1. Data set
In the training phase, the W-Net network is trained on the augmented Pascal VOC dataset, which provides 10582 images for training and 1449 for validation; each object in an image is treated as a training sample. In the test phase, the performance of the proposed method is evaluated on three popular interactive object segmentation benchmarks: GrabCut, Berkeley, and DAVIS. The GrabCut dataset contains 50 images used to evaluate interactive segmentation models. The Berkeley dataset contains 96 images with a total of 100 target object masks for testing. The DAVIS dataset contains 50 videos, evaluated on 3440 frames containing the target objects. The GrabCut dataset provides its own input boxes; for the Berkeley and DAVIS datasets, the ground-truth bounding box of each object is used as the input box.
1.2. Implementation details
All input tensors are sized 320×320. First, Coarse U-Net is pre-trained for 50 epochs; its parameters are then frozen, and Refinement U-Net is trained for a further 50 epochs. Both networks are trained with binary cross-entropy loss. The learning rate is set to 10^-5. The batch size is 8 during Coarse U-Net training and 2 during Refinement U-Net training. In all experiments, the Adam optimizer with β1 = 0.9 and β2 = 0.999 is used. The method is implemented in the PyTorch framework, and training and testing are performed on an NVIDIA RTX 3080 Ti GPU.
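A schematic sketch of this two-stage schedule, assuming the CoarseUNet sketched earlier (returning the coarse mask and multi-scale features) and a RefinementUNet with a hypothetical forward signature; the dataloader names and what they yield are illustrative assumptions:

```python
import torch
import torch.nn as nn

def train_two_stage(coarse_net, refine_net, coarse_loader, refine_loader, device="cuda"):
    """Pre-train Coarse U-Net for 50 epochs, freeze it, then train
    Refinement U-Net for 50 epochs, both with BCE loss (as described above)."""
    bce = nn.BCELoss()

    opt_c = torch.optim.Adam(coarse_net.parameters(), lr=1e-5, betas=(0.9, 0.999))
    for _ in range(50):                           # stage 1; batch size 8 assumed in loader
        for img_box, gt in coarse_loader:         # image + box map, ground-truth mask
            mc, _ = coarse_net(img_box.to(device))
            loss = bce(mc, gt.to(device))
            opt_c.zero_grad(); loss.backward(); opt_c.step()

    for p in coarse_net.parameters():             # freeze Coarse U-Net parameters
        p.requires_grad = False

    opt_r = torch.optim.Adam(refine_net.parameters(), lr=1e-5, betas=(0.9, 0.999))
    for _ in range(50):                           # stage 2; batch size 2 assumed in loader
        for input2, coarse_feats, gt in refine_loader:
            coarse_feats = [f.to(device) for f in coarse_feats]
            m = refine_net(input2.to(device), coarse_feats)  # hypothetical signature
            loss = bce(m, gt.to(device))
            opt_r.zero_grad(); loss.backward(); opt_r.step()
```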
1.3. Evaluation index
The error rate E is used as the evaluation index, i.e., the percentage of misclassified pixels within the input box. Here P and G denote the predicted segmentation mask and the ground-truth mask, respectively, S denotes the area of the input box, and |·| denotes pixel-wise summation; E is the number of misclassified pixels inside the box divided by S, expressed as a percentage. The error rate accounts not only for the misclassified pixels but also for the size of the input box: the same number of misclassified pixels yields a lower error rate for a loose input box than for a tight one, which is appropriate because a loose input box generally makes the segmentation task harder. A lower average error rate over a dataset indicates better overall performance of the segmentation method.
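A small sketch of this metric; since the formula image is not reproduced in the text, the implementation below assumes E = (pixels inside the box where P and G disagree) / S × 100 for binary masks:

```python
import numpy as np

def error_rate(pred, gt, box):
    """Error rate E: percentage of misclassified pixels within the input box.

    pred, gt: binary masks (H x W); box: binary input-box map (H x W).
    """
    inside = box.astype(bool)                                    # region S
    wrong = np.logical_xor(pred.astype(bool), gt.astype(bool)) & inside
    return 100.0 * wrong.sum() / inside.sum()
```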
2. Ablation experiments
The core idea of the method is to use the estimated gaze point map to assist input-box-based interactive object segmentation. Therefore, an ablation study was first conducted to investigate the effectiveness of the estimated gaze point map and of the various components in the proposed W-Net framework. The training procedure for the ablation experiments was the same as in the experimental setup above. Results were evaluated on the GrabCut, Berkeley, and DAVIS datasets.
2.1 Gaze point map analysis
For the estimated gaze point map, the sensitivity of the W-Net model to different gaze point prediction models needs to be evaluated on the one hand, and on the other hand the effectiveness of the estimated gaze point map should be studied by comparing segmentation performance with and without the gaze point map input. The specific procedure and results are as follows.
2.1.1 Sensitivity of the framework to gaze point map prediction models
To evaluate the sensitivity of the W-Net model to different gaze point prediction models, two additional models, TempSAL and RINet, were selected besides the TranSalNet model used in the present method. Each model generates its own gaze point maps, and W-Net is retrained and tested accordingly. The test results are shown in Table 1.
Table 1. Error rate (%) of W-Net with different gaze point prediction models.
With TranSalNet, W-Net achieves the best performance on all datasets, while RINet provides slightly better assistance than TempSAL. Their behavior reflects the influence of the gaze point prediction model in different situations. The three sets of results differ only slightly, indicating that the W-Net model is not very sensitive to the particular gaze point prediction model, although a higher-quality gaze point prediction model still yields better refinement. The TranSalNet gaze point prediction model is used in all subsequent experiments.
2.1.2 Effectiveness of the gaze point map
To verify the contribution of the estimated gaze point map to refinement, the estimated gaze point map is excluded from Refinement U-Net while the other components of the W-Net model are kept unchanged, i.e., the input box is the only interactive input.
Table 2. Error rate comparison with and without gaze point map input.
As shown in Table 2, the model without the assistance of the estimated gaze point map exhibits a higher error rate. After the estimated gaze point map is added, segmentation performance improves noticeably and the error rate decreases. This shows that segmentation quality can be improved if the estimated gaze point map is used appropriately in Refinement U-Net.
In addition, some segmentation results are shown in Fig. 3 to illustrate the assistance provided by the estimated gaze point map. The estimated gaze point map helps the model not only recover missed object areas (e.g., the petals in the first row and the person's arms in the second row) but also eliminate redundant, erroneously segmented areas (e.g., the trunks in the third row and the roofs in the fourth row).
2.2 Model structural analysis
In this section, ablation experiments are performed to verify the effectiveness of the components in the proposed framework, including Refinement U-Net, the gaze point map adjustment module, and the cross skip connection module with the self-attention mechanism.
2.2.1 Effectiveness of Refinement U-Net
To demonstrate that Refinement U-Net improves the coarse segmentation results of Coarse U-Net, two comparative experiments were performed. First, a plain Coarse U-Net, whose input consists of the image and the bounding box map, was retrained and taken as the baseline. In the second experiment, the estimated gaze point map was added as an extra channel to the Coarse U-Net input, and this single U-Net was retrained; this variant is denoted Coarse U-Net+FM.
Table 3. Error rate (%) comparison for different combinations of the gaze point map, Coarse U-Net, and Refinement U-Net.
The experimental results in Table 3 show that the W-Net model using Refinement U-Net performs best. Meanwhile, the segmentation results of Coarse U-Net+FM are even worse than the baseline (Coarse U-Net). This means that simple integration of the estimated gaze point map, such as concatenation at the input, does not improve segmentation quality; the discrepancy problem caused by the estimated gaze point map may even reduce it. Because Refinement U-Net can constrain the estimated gaze point map, its auxiliary effect is fully exploited while its adverse effects are suppressed as much as possible. Thus, compared with the baseline, Refinement U-Net refines the coarse segmentation results.
The role of Refinement U-Net in refining the coarse segmentation results is also illustrated by an example. In Fig. 4, if an estimated gaze point map with partial discrepancies is simply input into Coarse U-Net, the segmentation result becomes worse (Coarse U-Net+FM). In contrast, Refinement U-Net significantly improves the segmentation result by imposing constraints on the gaze point map.
2.2.2 Effectiveness of the gaze point map adjustment module
In the W-Net framework, the gaze point map adjustment module explicitly adjusts the estimated gaze point map, alleviating its adverse effects to some extent. To verify its effectiveness, a comparison is made with the gaze point map adjustment module removed, i.e., the estimated gaze point map restricted to the input box is directly concatenated with the image and the coarse segmentation result of Coarse U-Net as the input to Refinement U-Net.
Table 4. Error rate (%) comparison of the model with and without the gaze point map adjustment module.
As shown in Table 4, explicit adjustment of the gaze point map is a necessary step for ensuring segmentation quality. Specifically, the correlation between the coarse result output by Coarse U-Net and the estimated gaze point map is a reliable indicator for controlling the adverse effects of the estimated gaze point map.
2.2.3 Effectiveness of the cross skip connection module
In addition to the gaze point map adjustment module, a cross skip connection module with a self-attention mechanism was developed in the Refinement U-Net encoder to strengthen the dominance of the features extracted from the input box. To demonstrate the effectiveness of establishing cross skip connections between the corresponding layers of Coarse U-Net and Refinement U-Net, a comparative experiment was performed in which the cross skip connections with the self-attention mechanism were removed from the W-Net framework (i.e., the two U-Nets run independently).
Table 5. Error rate (%) comparison of the model with and without the cross skip connection module.
As shown in Table 5, after the cross skip connections are added, W-Net strengthens the dominant role of the features extracted from the input box and fuses these complementary features together, further alleviating the discrepancy problem in feature extraction and producing finer segmentation results. Tables 4 and 5 show that the gaze point map adjustment module and the cross skip connection module are both important and indispensable for Refinement U-Net to improve segmentation quality.
3. Comparative experiments
In addition to the ablation studies, comparisons were made with several state-of-the-art methods, namely DGC, SAM, and IOG, on the GrabCut, Berkeley, and DAVIS datasets. Meanwhile, W-Net is used as an optimization tool to refine SAM and IOG: within W-Net, the coarse segmentation result fed to Refinement U-Net is directly replaced by the segmentation result of SAM or IOG, so that SAM, IOG, and similar methods can be optimized. These two combinations are denoted SAM+W-Net and IOG+W-Net, respectively. In addition, the center of each original input box is kept fixed while its size is enlarged by different ratios (1.1, 1.2, 1.3, 1.4). All methods were tested on these loose input boxes to evaluate their ability to handle different input box conditions. Since SAM, as a generic segmentation model, only considers objects of appropriate size within a small range of the input box during training, it performs very poorly when the input box is loose or the object is too large; therefore, only the default output performance of SAM is reported for the cases it can handle. IOG requires one additional point inside the object area besides the input box; in the tests, this point is derived from the ground-truth segmentation result, as determined by its source code.
Table 6. Error rate (%) of the different segmentation methods on the GrabCut, Berkeley, and DAVIS datasets. The best results are shown in bold.
The comparison results are shown in Table 6. In the single-method comparison, W-Net achieves better performance on the GrabCut dataset, while SAM performs better than the other methods on the Berkeley and DAVIS datasets when the input box is tight. This reflects the strong capability of SAM as a generic segmentation model.
Some segmentation examples are shown in Fig. 5. With the aid of the gaze point map, W-Net can cover areas that other models miss, such as the hammer in the second column and the human bodies in the third and last columns, while also excluding non-object areas such as the trunks in the first column.
In the case of loose input boxes, W-Net performs worse than IOG. The main reason is that a loose input box degrades the performance of Coarse U-Net and worsens the adverse effect of the estimated gaze point map, so the coarse result cannot be refined well by Refinement U-Net. For IOG, the extra point that explicitly indicates the object region plays an important role in mitigating the adverse effects of a loose input box. However, as shown in Fig. 6, which compares the segmentation of W-Net and IOG under input boxes of different scales, W-Net still shows its advantages, and in some cases obtains stable segmentation results even as the input box size increases.
When W-Net is used as a segmentation optimization tool, the error rate of IOG is further reduced, and IOG+W-Net achieves the best performance in all cases. This shows that W-Net adds gaze point assistance on top of IOG and is an effective segmentation optimization tool. At the same time, SAM+W-Net reduces SAM's performance in some cases with tighter input boxes. The reason for these failure cases is that, although SAM achieves better overall quality, its variance in segmentation quality is larger than IOG's: some SAM segmentation results are of poor quality and cannot be used to constrain the gaze point map, and may even exacerbate its adverse effects, preventing Refinement U-Net from working well. It follows that W-Net also relies on the quality of the coarse segmentation results themselves to extract better features during refinement.
Figs. 7 and 8 illustrate the effects of SAM+W-Net and IOG+W-Net, respectively. In particular, the athlete in the first row and the woman in the last row of Fig. 7 show that the present method can help SAM complete the segmentation of the target object area, while the building in the second row and the railing in the third row show that it can exclude erroneous areas from the SAM segmentation result. Likewise, in Fig. 8, the gaze point distribution provides richer target object information than the limited range of the click used in IOG. Combining IOG with W-Net therefore optimizes the segmentation results.
The technical features of the above embodiments may be combined arbitrarily. For brevity, not all possible combinations of these technical features are described; however, as long as a combination of technical features contains no contradiction, it should be considered within the scope of this description.
The above examples merely represent a few embodiments of the present application and are described in detail, but they are not to be construed as limiting the scope of the application. It should be noted that those skilled in the art may make several variations and modifications without departing from the spirit of the application, and these all fall within the scope of the application. Accordingly, the scope of protection of the application shall be determined by the appended claims.