
Data Reduction for Noisy Data Classification Using Semi-supervised Manifold-Preserving Graph Reduction

  • Conference paper

Abstract

This paper investigates data reduction for noisy data classification in semi-supervised learning. A novel method, semi-supervised manifold-preserving graph reduction (Semi-MPGR), is proposed for this setting. In Semi-MPGR, the adjacency graph consists of three sub-graphs, built from the labeled samples, the unlabeled samples, and both together, which strengthens the role of label information. On the basis of this graph, Semi-MPGR selects data points according to their connection strength. The retained data can maintain the manifold structure of the original data and be handled efficiently by semi-supervised classifiers. Experimental results on several real-world data sets indicate the feasibility and validity of Semi-MPGR.
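
The abstract gives only the outline of the procedure, so the following is a rough Python sketch of the general idea and not the authors' implementation. It assumes Gaussian-weighted k-nearest-neighbour sub-graphs, an unweighted sum of the three sub-graphs, same-class-only edges in the labeled sub-graph, and a greedy rule that keeps the vertex with the largest degree and then deletes its edges. The function names, the `-1` marker for unlabeled samples, and the parameters `k` and `sigma` are all hypothetical choices made for illustration.

```python
import numpy as np


def knn_graph(X, k, sigma=1.0):
    """Symmetric k-nearest-neighbour graph with Gaussian edge weights."""
    n = X.shape[0]
    # Pairwise squared Euclidean distances (fine for small n).
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    W = np.zeros((n, n))
    for i in range(n):
        order = np.argsort(d2[i])
        nbrs = order[order != i][:k]  # nearest neighbours, excluding i itself
        W[i, nbrs] = np.exp(-d2[i, nbrs] / (2.0 * sigma ** 2))
    return np.maximum(W, W.T)  # keep an edge if either endpoint chose it


def semi_supervised_graph(X, y, k=3, sigma=1.0):
    """Sum three sub-graphs: all samples, labeled only, unlabeled only.

    y holds class labels; -1 marks an unlabeled sample (a convention
    chosen for this sketch, not taken from the paper).
    """
    labeled = np.flatnonzero(y != -1)
    unlabeled = np.flatnonzero(y == -1)

    W = knn_graph(X, k, sigma)  # sub-graph over labeled and unlabeled together

    if labeled.size > 1:
        W_l = knn_graph(X[labeled], k, sigma)
        # Assumption: labeled samples are linked only within the same class.
        same_class = y[labeled][:, None] == y[labeled][None, :]
        W[np.ix_(labeled, labeled)] += W_l * same_class

    if unlabeled.size > 1:
        W_u = knn_graph(X[unlabeled], k, sigma)
        W[np.ix_(unlabeled, unlabeled)] += W_u

    return W


def greedy_degree_selection(W, n_keep):
    """Keep the vertex with the largest degree, delete its edges, repeat."""
    W = W.copy()
    n = W.shape[0]
    alive = np.ones(n, dtype=bool)
    kept = []
    for _ in range(min(n_keep, n)):
        degree = W.sum(axis=1)      # connection strength of each vertex
        degree[~alive] = -np.inf    # never pick a vertex twice
        i = int(np.argmax(degree))
        kept.append(i)
        alive[i] = False
        W[i, :] = 0.0
        W[:, i] = 0.0
    return kept
```

Under these assumptions, an isolated noisy point has almost no connection strength and is unlikely to be retained, while points in dense regions are picked first. How the paper actually weights the three sub-graphs and defines connection strength is not stated in the abstract.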

Supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China under Grant No. 19KJA550002, the Six Talent Peak Project of Jiangsu Province of China under Grant No. XYDXX-054, and the Priority Academic Program Development of Jiangsu Higher Education Institutions.


Author information

Authors and Affiliations

  1. School of Computer Science and Technology & Joint International Research Laboratory of Machine Learning and Neuromorphic Computing, Soochow University, Suzhou 215006, Jiangsu, China

    Li Zhang, Qingqing Pang, Zhiqiang Xu & Xiaohan Zheng

  2. Provincial Key Laboratory for Computer Information Processing Technology, Soochow University, Suzhou 215006, Jiangsu, China

    Li Zhang

Corresponding author

Correspondence to Li Zhang.

Editor information

Editors and Affiliations

  1. Department of AI, Ping An Life, Shenzhen, China

    Haiqin Yang

  2. Faculty of Information Technology, King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand

    Kitsuchart Pasupa

  3. City University of Hong Kong, Kowloon, Hong Kong

    Andrew Chi-Sing Leung

  4. Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, Hong Kong

    James T. Kwok

  5. School of Information Technology, King Mongkut’s University of Technology Thonburi, Bangkok, Thailand

    Jonathan H. Chan

  6. The Chinese University of Hong Kong, New Territories, Hong Kong

    Irwin King

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Zhang, L., Pang, Q., Xu, Z., Zheng, X. (2020). Data Reduction for Noisy Data Classification Using Semi-supervised Manifold-Preserving Graph Reduction. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Communications in Computer and Information Science, vol 1333. Springer, Cham. https://doi.org/10.1007/978-3-030-63823-8_34
