Authors:Zhengyang Lyu;Pierre Beauseroy andAlexandre Baussard
Affiliation:Université de Technologie de Troyes, LIST3N, 12 Rue Marie Curie, 10300 Troyes, France
Keyword(s):Semantic Segmentation, Weak Supervision, Convolutional Neural Network, Support Vector Machine.
Abstract:In this paper, we propose a study of an expansion method based on an image-specific classifier and multi-features for Weakly Supervised Semantic Segmentation (WSSS) with only image-level labels. Recent WSSS methods focus mainly on enhancing the pseudo masks to improve the segmentation performance by obtaining improved Class Activation Maps (CAM) or by applying post-process methods that combine expansion and refinement. Most of these methods either lack of consideration for the balance between resolution and semantics in the used features, or are carried out globally for the whole data set, without taking into account potential additional improvements based on the specific content of the image. Previously, we proposed an image-specific expansion method using multi-features to alleviate these limitations. This new study aims firstly at determining the upper performance limit of the proposed method using the ground truth masks, and secondly at analysing this performance limit in relation with the features chosen. Experiments show that our expansion method can achieve promising results, when used with the ground truth (upper performance) and the features that strike a balance between semantics and resolution.(More)
In this paper, we propose a study of an expansion method based on an image-specific classifier and multi-features for Weakly Supervised Semantic Segmentation (WSSS) with only image-level labels. Recent WSSS methods focus mainly on enhancing the pseudo masks to improve the segmentation performance by obtaining improved Class Activation Maps (CAM) or by applying post-process methods that combine expansion and refinement. Most of these methods either lack of consideration for the balance between resolution and semantics in the used features, or are carried out globally for the whole data set, without taking into account potential additional improvements based on the specific content of the image. Previously, we proposed an image-specific expansion method using multi-features to alleviate these limitations. This new study aims firstly at determining the upper performance limit of the proposed method using the ground truth masks, and secondly at analysing this performance limit in relation with the features chosen. Experiments show that our expansion method can achieve promising results, when used with the ground truth (upper performance) and the features that strike a balance between semantics and resolution.