Part of the book series:Communications in Computer and Information Science ((CCIS,volume 310))
Included in the following conference series:
1527Accesses
6Citations
Abstract
Traditional machine learning has been largely concerned with developing techniques for small or modestly sized datasets. These techniques fail to scale up well for large data, a situation becoming increasingly common in today’s world. Furthermore most of the machine learning classifiers are trained in a batch way. Under this model, all training data is given a priori and training is performed in one batch. If more training data is later obtained the classifier must be re-trained from scratch. Re-solving the problem from scratch seems computationally wasteful. In this research we will focus on developing classifier for big data sets and incremental way of learning for dealing with real world problem. In this paper we propose a feature extraction and classification algorithm for big data which is based on incremental kernel PCA and conjugate gradient based LS-SVM. Through experimental results on big data in UCI machine learning repository we can show that proposed classification algorithm enables solving large scale classification problems.
This study was supported by a grant of Youngsan University in 2012.
This is a preview of subscription content,log in via an institution to check access.
Access this chapter
Subscribe and save
- Starting from 10 chapters or articles per month
- Access and download chapters and articles from more than 300k books and 2,500 journals
- Cancel anytime
Buy Now
- Chapter
- JPY 3498
- Price includes VAT (Japan)
- eBook
- JPY 11439
- Price includes VAT (Japan)
- Softcover Book
- JPY 14299
- Price includes VAT (Japan)
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, books and news in related subjects, suggested using machine learning.References
Vapnik, V.N.: Statistical learning theory. John Wiley & Sons, New York (1998)
Gupta, H., Agrawal, A.K., Pruthi, T., Shekhar, C., Chellappa, R.: An Experimental Evaluation of Linear and Kernel-Based Methods for Face Recognition,http://citeseer.nj.nec.com
Hall, P., Marshall, D., Martin, R.: Incremental eigenalysis for classification. In: British Machine Vision Conference, vol. 1, pp. 286–295 (September 1998)
Winkeler, J., Manjunath, B.S., Chandrasekaran, S.: Subset selection for active object recognition. In: CVPR, vol. 2, pp. 511–516. IEEE Computer Society Press (June 1999)
Murakami, H., Kumar, B.V.K.V.: Efficient calculation of primary images from a set of images. IEEE PAMI 4(5), 511–515 (1982)
Kim, B.J., Shim, J.Y., Hwang, C.H., Kim, I.K., Song, J.H.: Incremental Feature Extraction Based on Empirical Kernel Map. In: Zhong, N., Raś, Z.W., Tsumoto, S., Suzuki, E. (eds.) ISMIS 2003. LNCS (LNAI), vol. 2871, pp. 440–444. Springer, Heidelberg (2003)
Suykens, J.A.K., Vandewalle, J.: Least squares support vector machine classifiers. Neural Processing Letters 9, 293–300 (1999)
Golub, G.H., Van Loan, C.F.: Large scale LS_SVM Matrix Computations. Johns Hopkins University, Baltimore
Van Gestel, T., Suykens, J.A.K., Lanckriet, G., Lambrechts, A., De Moor, B., Vandewalle, J.: A Bayesian Framework for Least Squares Support Vector Machine Classifiers. Internal Report 00-65, ESAT-SISTA, K.U. Leuven
Accessible athttp://archive.ics.uci.edu/ml/
Author information
Authors and Affiliations
Department of Computer Engineering, Youngsan University, Korea
Byung Joo Kim
- Byung Joo Kim
Search author on:PubMed Google Scholar
Editor information
Editors and Affiliations
Department of Computer Engineering, Hannam University, 70 Hannamro, Daedeuk-gu, Daejeon, Korea
Geuk Lee
Computer Science and Information Systems, University of Limerick, Limerick, Ireland
Daniel Howard
University of Warsaw and Infobright Inc., Poland
Dominik Ślęzak
School of Information and Communication Engineering, Sang JI University, Wonju, Kangwon, Korea
You Sik Hong
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, B.J. (2012). A Classifier for Big Data. In: Lee, G., Howard, D., Ślęzak, D., Hong, Y.S. (eds) Convergence and Hybrid Information Technology. ICHIT 2012. Communications in Computer and Information Science, vol 310. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32692-9_63
Download citation
Publisher Name:Springer, Berlin, Heidelberg
Print ISBN:978-3-642-32691-2
Online ISBN:978-3-642-32692-9
eBook Packages:Computer ScienceComputer Science (R0)
Share this paper
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative