Probabilistic Non-Negative Matrix Factorization and Its Robust Extensions for Topic Modeling

Authors

Minnan Luo

Xi'an Jiaotong University

Feiping Nie

Northwestern Polytechnical University

Xiaojun Chang

University of Technology Sydney

Yi Yang

University of Technology Sydney

Alexander Hauptmann

Carnegie Mellon University

Qinghua Zheng

Xi'an Jiaotong University

Proceedings:

No. 1: Thirty-First AAAI Conference On Artificial Intelligence

Volume

Issue:

Proceedings of the AAAI Conference on Artificial Intelligence, 31

Track:

Machine Learning Methods

Downloads:

Download PDF

Abstract:

Traditional topic model with maximum likelihood estimate inevitably suffers from the conditional independence of words given the documentÕs topic distribution. In this paper, we follow the generative procedure of topic model and learn the topic-word distribution and topics distribution via directly approximating the word-document co-occurrence matrix with matrix decomposition technique. These methods include: (1) Approximating the normalized document-word conditional distribution with the documents probability matrix and words probability matrix based on probabilistic non-negative matrix factorization (NMF); (2) Since the standard NMF is well known to be non-robust to noises and outliers, we extended the probabilistic NMF of the topic model to its robust versions using l21-norm and capped l21-norm based loss functions, respectively. The proposed framework inherits the explicit probabilistic meaning of factors in topic models and simultaneously makes the conditional independence assumption on words unnecessary. Straightforward and efficient algorithms are exploited to solve the corresponding non-smooth and non-convex problems. Experimental results over several benchmark datasets illustrate the effectiveness and superiority of the proposed methods.

DOI:

10.1609/aaai.v31i1.10832

AAAI

Proceedings of the AAAI Conference on Artificial Intelligence, 31

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.