Consistent-Separable Feature Representation for Semantic Segmentation

Authors

Xingjian He

School of Artificial Intelligence, University of Chinese Academy of Sciences; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Jing Liu

School of Artificial Intelligence, University of Chinese Academy of Sciences; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Jun Fu

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Xinxin Zhu

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Jinqiao Wang

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Hanqing Lu

School of Artificial Intelligence, University of Chinese Academy of Sciences; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Proceedings:

Proceedings of the AAAI Conference on Artificial Intelligence, 35

Issue:

No. 2: AAAI-21 Technical Tracks 2

Track:

AAAI Technical Track on Computer Vision I


Abstract:

Cross-entropy loss combined with softmax is among the most commonly used supervision components in existing segmentation methods. The softmax loss is typically good at optimizing inter-class differences but not at reducing intra-class variation, which can be suboptimal for the semantic segmentation task. In this paper, we propose a Consistent-Separable Feature Representation Network to model Consistent-Separable (C-S) features, which are intra-class consistent and inter-class separable, improving the discriminative power of the deep features. Specifically, we develop a Consistent-Separable Feature Learning Module to obtain C-S features through a new loss, called the Class-Aware Consistency loss. This loss forces the deep features to be consistent within the same class and far apart between different classes. Moreover, we design an Adaptive feature Aggregation Module to fuse the C-S features with the original backbone features for better semantic prediction. We show that the proposed method brings consistent performance improvements over various baselines. Our approach achieves state-of-the-art performance on Cityscapes (82.6% mIoU on the test set), ADE20K (46.65% mIoU on the validation set), COCO Stuff (41.3% mIoU on the validation set) and PASCAL Context (55.9% mIoU on the test set).
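
The abstract does not give the formula for the Class-Aware Consistency loss, so the following is only a minimal sketch of the general idea it describes: pulling features of the same class together and pushing different classes apart. It is a generic center-based pull/push loss, not the authors' formulation. The function name `class_consistency_sketch`, the `margin` parameter, and the use of class mean vectors as centers are all assumptions made for illustration.

```python
import numpy as np


def class_consistency_sketch(features, labels, margin=1.0):
    """Illustrative intra-class pull / inter-class push loss.

    NOT the paper's Class-Aware Consistency loss; a generic stand-in.

    features: (N, D) array of per-pixel feature vectors.
    labels:   (N,) integer class label for each pixel.
    margin:   hypothetical minimum distance wanted between class centers.
    Returns (pull, push) as two floats.
    """
    classes = np.unique(labels)  # sorted unique class ids
    # Mean feature vector (center) of each class present in the input.
    centers = np.stack([features[labels == c].mean(axis=0) for c in classes])

    # Pull term: mean squared distance from each pixel to its class center.
    idx = np.searchsorted(classes, labels)
    pull = float(np.mean(np.sum((features - centers[idx]) ** 2, axis=1)))

    # Push term: squared hinge on pairs of centers closer than the margin.
    push = 0.0
    k = len(classes)
    if k > 1:
        diff = centers[:, None, :] - centers[None, :, :]
        dist = np.sqrt(np.sum(diff ** 2, axis=2))
        off_diag = ~np.eye(k, dtype=bool)
        push = float(np.mean(np.maximum(0.0, margin - dist[off_diag]) ** 2))

    return pull, push
```

In a training setup, a term of this kind would typically be added to the usual cross-entropy loss with a weighting factor; the weighting and the exact distance measure used in the paper are not stated in the abstract.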

DOI:

10.1609/aaai.v35i2.16244

