DF2Net: Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification

Authors

Yabei Li

Institute of Automation, Chinese Academy of Sciences (CASIA)

Junge Zhang

Institute of Automation, Chinese Academy of Sciences (CASIA)

Yanhua Cheng

Tencent

Kaiqi Huang

Institute of Automation, Chinese Academy of Sciences (CASIA)

Tieniu Tan

Institute of Automation, Chinese Academy of Sciences (CASIA)

Published:

2018-02-08

Proceedings:

Proceedings of the AAAI Conference on Artificial Intelligence, 32

Volume

Issue:

Thirty-Second AAAI Conference on Artificial Intelligence 2018

Track:

AAAI Technical Track: Vision

Abstract:

This paper focuses on the task of RGB-D indoor scene classification. The task is challenging for two reasons. 1) Learning a robust representation for indoor scenes is difficult because of the variety of objects and layouts. 2) Fusing the complementary cues in RGB and depth is nontrivial since there are large semantic gaps between the two modalities. Most existing works learn a representation for classification by training a deep network with a softmax loss, and fuse the two modalities by simply concatenating their features. However, these pipelines do not explicitly consider intra-class and inter-class similarity, nor the intrinsic relationships between modalities. To address these problems, this paper proposes a Discriminative Feature Learning and Fusion Network (DF2Net) with two-stage training. In the first stage, to better represent the scene in each modality, a deep multi-task network is constructed to simultaneously minimize a structured loss and the softmax loss. In the second stage, we design a novel discriminative fusion network that learns both the correlative features shared across modalities and the distinctive features of each modality. Extensive analysis and experiments on the SUN RGB-D Dataset and the NYU Depth Dataset V2 show the superiority of DF2Net over other state-of-the-art methods in RGB-D indoor scene classification.
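
The first-stage objective described above, a structured loss minimized jointly with the softmax loss, can be illustrated with a minimal sketch. The abstract does not specify the form of the structured loss, so the triplet-style hinge below is an assumption chosen only because it penalizes intra-class distance relative to inter-class distance; the function names, the `weight` balancing term, and the `margin` value are likewise hypothetical and are not taken from the paper.

```python
import math

def softmax_cross_entropy(logits, label):
    """Softmax cross-entropy for a single sample, using the log-sum-exp trick."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]

def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Assumed stand-in for the structured loss: a triplet hinge.

    It is zero once the different-class sample is farther from the anchor
    than the same-class sample by at least `margin`.
    """
    gap = squared_distance(anchor, positive) - squared_distance(anchor, negative)
    return max(0.0, gap + margin)

def multi_task_loss(logits, label, anchor, positive, negative,
                    weight=0.5, margin=1.0):
    """Softmax loss plus a weighted structured term (weight is hypothetical)."""
    classification = softmax_cross_entropy(logits, label)
    structured = triplet_loss(anchor, positive, negative, margin)
    return classification + weight * structured
```

In this sketch the anchor, positive, and negative vectors would be features from one modality's network for a sample, a same-class sample, and a different-class sample; how the paper actually samples or structures these groups is not stated in the abstract.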

DOI:

10.1609/aaai.v32i1.12292

ISSN 2374-3468 (Online) ISSN 2159-5399 (Print)
