Part of the book series:Lecture Notes in Computer Science ((LNAI,volume 10235))
Included in the following conference series:
3262Accesses
Abstract
Many web items contain different types of information resources such as user profile, comments, users preference and so on. All these aspects can be seen as different views of real-world datasets and often admit same underlying clustering of the data. However, each view of dataset forming a huge sparse matrix results in the non-robust characteristic during matrix decomposition process, and further influences the accuracy of clustering results. In this paper, we attempt to use rating value given by the users as latent semantic information to handle those features that are unobserved in each data point so as to resolve the sparseness problem in all views matrices. To combine multiple views in our constructed corpusDoucom, we present WScoNMF (Weighted similarity co-regularized Non-negative Matrix Factorization), which provides an efficient weighted matrix factorization framework to further explore the sparseness problem in semantic space of data. The overall objective function is to minimize the loss function of weighted NMF under the\(l _{2,1}\)-norm and the co-regularized constraint under theF-norm. Experimental results on all datasets demonstrate the effectiveness of the proposed method.
This is a preview of subscription content,log in via an institution to check access.
Access this chapter
Subscribe and save
- Get 10 units per month
- Download Article/Chapter or eBook
- 1 Unit = 1 Article or 1 Chapter
- Cancel anytime
Buy Now
- Chapter
- JPY 3498
- Price includes VAT (Japan)
- eBook
- JPY 11439
- Price includes VAT (Japan)
- Softcover Book
- JPY 14299
- Price includes VAT (Japan)
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
In this paper, we use ‘data point’ and ‘item’ exchangeable.
- 2.
\(\odot \) in matrix denote element-wise multiplication.
- 3.
- 4.
- 5.
- 6.
References
Akata, Z., Thurau, C., Bauckhage, C.: Nonnegative matrix factorization in multimodality data for segmentation and label prediction. In: Computer Vision Winter Workshop, pp. 1–8 (2011)
Cai, X., Nie, F., Huang, H.: Multi-view K-means clustering on big data. In: International Joint Conference on Artificial Intelligence, pp. 2598–2604 (2013)
Cheng, W., Zhang, X., Guo, Z., Wu, Y.: Flexible and robust co-regularized multi-domain graph clustering. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2013, vol. 1, pp. 320–328 (2013)
Ding, C., Li, T., Jordan, M.I.: Convex and semi-nonnegative matrix factorizations. IEEE Trans. Pattern Anal. Mach. Intell.32(1), 45–55 (2010)
Ding, C., Li, T., Peng, W., Park, H.: Orthogonal nonnegative matrix t-factorization for clustering. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 126–135 (2006)
Ding, C.H., He, X., Simon, H.D.: On the equivalence of nonnegative matrix factorization and spectral clustering. In: SDM, vol. 5, pp. 606–610 (2005)
Du, L., Li, X., Shen, Y.D.: Robust nonnegative matrix factorization via half-quadratic minimization. In: Proceedings of the 12th International Conference on Data Mining, pp. 201–210 (2012)
Eaton, E., desJardins, M., Jacob, S.: Multi-view constrained clustering with an incomplete mapping between views. Knowl. Inf. Syst.38(1), 231–257 (2012)
He, X., Kan, M.Y., Xie, P., Chen, X.: Comment-based multi-view clustering of web 2.0 items. In: International Conference on World Wide Web, pp. 771–782 (2014)
Huang, J., Nie, F., Huang, H., Ding, C.: Robust manifold non-negative matrix factorization. ACM Trans. Knowl. Discov. Data (TKDD)8(3), 11:1–11:21 (2013)
Kumar, A., Iii, H.D.: A co-training approach for multi-view spectral clustering. In: ICML 2011, pp. 393–400 (2011)
Kumar, A., Rai, P., Iii, H.D.: Co-regularized multi-view spectral clustering. In: Proceedings of NIPS 2011, pp. 1413–1421 (2011)
Lee, L., Seung, D.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, vol. 13, pp. 556–562 (2001)
Liu, J., Wang, C., Gao, J., Han, J.: Multi-view clustering via joint nonnegative matrix factorization. In: Proceedings of SDM 2013, pp. 252–260 (2013)
Nie, F.P., Huang, H., Cai, X., Ding, C.: Efficient and robust feature selection via joint\(l_{2,1}\)-norms minimization. In: NIPS 2010 (2010)
Sun, J.W., Lu, J., Xu, T.Y., Bi, J.B.: Multi-view sparse co-clustering via proximal alternating. In: Proceedings of the 32th International Conference on Machine Learning, Lille, France, vol. 37 (2015)
Wang, H., Huang, H., Ding, C.: Cross-language web page classification via joint nonnegative matrix tri-factorization based dyadic knowledge transfer. In: Annual ACM SIGIR Conference, pp. 933–942 (2011)
Wang, H., Huang, H., Ding, C.: Simultaneous clustering of multi-type relational data via symmetric nonnegative matrix tri-factorization. In: CIKM 2011, pp. 279–284 (2011)
Wang, H., Nie, F.P., Huang, H.: Multi-view clustering and feature learning via structured sparsity. In: Proceedings of the 30th International Conference on Machine Learning, Atlanta, Georgia, USA, vol. 28 (2013)
Xu, W., Liu, X., Gong, Y.: Document clustering based on non-negative matrix factorization. In: SIGIR, pp. 267–273 (2003)
Zhang, X.C., Zong, L.L., Liu, X.Y., Yu, H.: Constrained nmf-based multi-view clustering on unmapped data. In: AAAI 2015, pp. 3174–3180 (2015)
Acknowledgements
This work is supported by the National Natural Science Foundation of China (No. 61472241) and the National High Technology Research and Development Program of China (No. 2015AA015303).
Author information
Authors and Affiliations
Department of Computer Science and Technology, Shanghai Jiao Tong University, Shanghai, China
Xiaolong Gong, Fuwei Wang & Linpeng Huang
- Xiaolong Gong
You can also search for this author inPubMed Google Scholar
- Fuwei Wang
You can also search for this author inPubMed Google Scholar
- Linpeng Huang
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toLinpeng Huang.
Editor information
Editors and Affiliations
Kangwon National University, Chuncheon, Korea (Republic of)
Jinho Kim
Seoul National University, Seoul, Korea (Republic of)
Kyuseok Shim
University of Technology Sydney, Sydney, New South Wales, Australia
Longbing Cao
KAIST, Daejeon, Korea (Republic of)
Jae-Gil Lee
University of New South Wales, Sydney, New South Wales, Australia
Xuemin Lin
Kangwon National University, Chuncheon, Korea (Republic of)
Yang-Sae Moon
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Gong, X., Wang, F., Huang, L. (2017). Weighted NMF-Based Multiple Sparse Views Clustering for Web Items. In: Kim, J., Shim, K., Cao, L., Lee, JG., Lin, X., Moon, YS. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science(), vol 10235. Springer, Cham. https://doi.org/10.1007/978-3-319-57529-2_33
Download citation
Published:
Publisher Name:Springer, Cham
Print ISBN:978-3-319-57528-5
Online ISBN:978-3-319-57529-2
eBook Packages:Computer ScienceComputer Science (R0)
Share this paper
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative