Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration

Authors

  • Hyeonseok KimSeoul National University of Science and Technology
  • Byeongkeun KangSeoul National University of Science and Technology
  • Yeejin LeeSeoul National University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v39i4.32446

Abstract

Generalized zero-shot semantic segmentation of 3D point clouds aims to classify each point into both seen and unseen classes. A significant challenge with these models is their tendency to make biased predictions, often favoring the classes encountered during training. This problem is more pronounced in 3D applications, where the scale of the training data is typically smaller than in image-based tasks. To address this problem, we propose a novel method called E3DPC-GZSL, which reduces overconfident predictions towards seen classes without relying on separate classifiers for seen and unseen data. E3DPC-GZSL tackles the overconfidence problem by integrating an evidence-based uncertainty estimator into a classifier. This estimator is then used to adjust prediction probabilities using a dynamic calibrated stacking factor that accounts for pointwise prediction uncertainty. In addition, E3DPC-GZSL introduces a novel training strategy that improves uncertainty estimation by refining the semantic space. This is achieved by merging learnable parameters with text-derived features, thereby improving model optimization for unseen data. Extensive experiments demonstrate that the proposed approach achieves state-of-the-art performance on generalized zero-shot semantic segmentation datasets, including ScanNet v2 and S3DIS.
AAAI-25 / IAAI-25 / EAAI-25 Proceedings Cover

Downloads

Published

2025-04-11

How to Cite

Kim, H., Kang, B., & Lee, Y. (2025). Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration.Proceedings of the AAAI Conference on Artificial Intelligence,39(4), 4248-4256. https://doi.org/10.1609/aaai.v39i4.32446

Issue

Section

AAAI Technical Track on Computer Vision III