Part of the book series:Lecture Notes in Computer Science ((LNAI,volume 9799))
Included in the following conference series:
3124Accesses
Abstract
In this paper, with regards to aspect-based sentiment classification accuracy problem, we propose a Principal Component Analysis (PCA) feature selection method that can determine the most relevant set of features for aspect-based sentiment classification. Feature selection helps to reduce redundant features and remove irrelevant features which affect classifier accuracy. In this paper we present a method for feature selection for twitter aspect-based sentiment classification based on Principal Component Analysis (PCA). PCA is combined with Sentiwordnet lexicon-based method which is incorporated with Support Vector Machine (SVM) learning framework to perform the classification. Experiments on our own Hate Crime Twitter Sentiment (HCTS) and benchmark Stanford Twitter Sentiment (STS) datasets yields accuracies of 94.53 % and 97.93 % respectively. The comparisons with other statistical feature selection methods shows that our proposed approach shows promising results in improving aspect-based sentiment classification performance.
This is a preview of subscription content,log in via an institution to check access.
Access this chapter
Subscribe and save
- Get 10 units per month
- Download Article/Chapter or eBook
- 1 Unit = 1 Article or 1 Chapter
- Cancel anytime
Buy Now
- Chapter
- JPY 3498
- Price includes VAT (Japan)
- eBook
- JPY 11439
- Price includes VAT (Japan)
- Softcover Book
- JPY 14299
- Price includes VAT (Japan)
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC 2010), (Valletta, Malta), European Language Resources Association (ELRA), May 2010
Go, A., Bhayani, R.: Twitter sentiment classificationusing distant supervision. CS224N Project Rep. Stanford1, 1–12 (2009)
Niu, Z., Yin, Z., Kong, X.: Sentiment classification formicroblog by machine learning. In: 2012 Fourth International Conference on Computational and Information Sciences (ICCIS), pp. 286–289, August 2012
Lek, H.H., Poo, D.: Aspect-based twitter sentimentclassification. In: 2013 IEEE 25th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 366–373, November 2013
Ghiassi, M., Skinner, J., Zimbra, D.: Twitter brand sentiment analysis: a hybrid system using n-gram analysis and dynamic artificial neural network. Expert Syst. Appl.40(16), 6266–6282 (2013)
Feldman, R.: Techniques and applications for sentiment analysis. Commun. ACM56, 82–89 (2013)
Zhang, Y., Dang, Y., Chen, H.: Research note: examining gender emotional differences in web forum communication. Decis. Support Syst.55(3), 851–860 (2013)
Bhuta, S., Doshi, A., Doshi, U., Narvekar, M.: A review oftechniques for sentiment analysis of twitter data. In: 2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT), pp. 583–591, February 2014
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, EMNLP 2002, vol. 10, pp. 79–86, Association for Computational Linguistics, Stroudsburg, PA, USA (2002)
Brychcin, T., Konkol, M., Steinberger, J.: UWB: machine learning approach to aspect-based sentiment analysis, SemEval 2014, p. 817 (2014)
Liu, K.L., Li, W.J., Guo, M.: Emoticon smoothed language modelsfor twitter sentiment analysis, vol. 2, pp. 1678–1684, 2012. cited By (since 1996)
Kansal, H., Toshniwal, D.: Aspect based summarization of context dependent opinion words. Procedia Comput. Sci.35, 166–175 (2014). 2014 Proceedings of 18th Annual Conference on Knowledge-Based and Intelligent Information and amp; Engineering Systems, KES-2014 Gdynia, Poland, September
Zhang, W., Xu, H., Wan, W.: Weakness finder: find product weakness from chinese reviews by using aspects based sentiment analysis. Expert Syst. Appl.39(11), 10283–10291 (2012)
Jmal, J., Faiz, R.: Customer review summarization approach using twitter and sentiwordnet. In: Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics, WIMS 2013, pp. 33:1–33:8, New York, NY, USA. ACM (2013)
Marrese-Taylor, E., Velsquez, J.D., Bravo-Marquez, F.: A novel deterministic approach for aspect-based opinion mining in tourism products reviews. Expert Syst. Appl.41(17), 7764–7775 (2014)
Baccianella, S., Esuli, A., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) LREC, European Language Resources Association (2010)
Zainuddin, N., Selamat, A., Ibrahim, R.: Improving twitter aspect-based sentiment analysis using hybrid approach. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, T.P. (eds.) ACIIDS 2016, Part I. LNCS, vol. 9621, pp. 151–160. Springer, Heidelberg (2016)
Liu, B.: Sentiment Analysis and Subjectivity: Handbook of Natural Language Processing, 2nd edn. CRC Press, Boca Raton (2010)
Esuli, A., Sebastiani, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of LREC, vol. 6, pp. 417–422 (2006)
Selamat, A., Omatu, S.: Web page feature selection and classification using neural networks. Inf. Sci.158, 69–88 (2004)
Vinodhini, G., Chandrasekaran, M.R.: Opinion mining using principal component analysis based ensemble model for e-commerce application. CSI Trans. ICT2(3), 169–179 (2014)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn.20(3), 273–297 (1995)
Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998)
Agarwal, A., Xie, B., Vovsha, I., Rambow, O., Passonneau, R.: Sentiment analysis of twitter data. In: Proceedings of the Workshop on Languages in Social Media, LSM 2011, pp. 30–38, Association for Computational Linguistics, Stroudsburg, PA, USA (2011)
Acknowledgments
The authors wish to thank Universiti Teknologi Malaysia (UTM) under Research University Grant Vot- 02G31 and Ministry of Higher Education Malaysia (MOHE) under the Fundamental Research Grant Scheme (FRGS Vot-4F551) for the completion of the research.
Author information
Authors and Affiliations
Faculty of Computing, Universiti Teknologi Malaysia, 81310, Johor Bahru, Johor, Malaysia
Nurulhuda Zainuddin, Ali Selamat & Roliana Ibrahim
- Nurulhuda Zainuddin
You can also search for this author inPubMed Google Scholar
- Ali Selamat
You can also search for this author inPubMed Google Scholar
- Roliana Ibrahim
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toAli Selamat.
Editor information
Editors and Affiliations
Iwate Prefectural University , Iwate, Japan
Hamido Fujita
Department Computer Science, Texas State University, San Marcos, Texas, USA
Moonis Ali
Universiti Teknologi Malaysis (UTM), Bahru, Malaysia
Ali Selamat
Iwate Prefectural University , Iwate, Japan
Jun Sasaki
Iwate Prefectural University , Iwate, Japan
Masaki Kurematsu
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Zainuddin, N., Selamat, A., Ibrahim, R. (2016). Twitter Feature Selection and Classification Using Support Vector Machine for Aspect-Based Sentiment Analysis. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_23
Download citation
Published:
Publisher Name:Springer, Cham
Print ISBN:978-3-319-42006-6
Online ISBN:978-3-319-42007-3
eBook Packages:Computer ScienceComputer Science (R0)
Share this paper
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative