Movatterモバイル変換

Part of the book series:Lecture Notes in Computer Science ((LNISA,volume 8797))

Included in the following conference series:

International Semantic Web Conference

2042Accesses
1Citations
2Altmetric

Abstract

Social media has become an effective channel for communicating both trends and public opinion on current events. However the automatic topic classification of social media content pose various challenges. Topic classification is a common technique used for automatically capturing themes that emerge from social media streams. However, such techniques are sensitive to the evolution of topics when new event-dependent vocabularies start to emerge (e.g., Crimea becoming relevant to War_Conflict during the Ukraine crisis in 2014). Therefore, traditional supervised classification methods which rely on labelled data could rapidly become outdated. In this paper we propose a novel transfer learning approach to address the classification task of new data when the only available labelled data belong to a previous epoch. This approach relies on the incorporation of knowledge from DBpedia graphs. Our findings show promising results in understanding how features age, and how semantic features can support the evolution of topic classifiers.

Download to read the full chapter text

Chapter PDF

Classification, Identification, and Analysis of Events on Twitter Through Data Mining

Tracking Dengue Epidemics Using Twitter Content Classification and Topic Modelling

On Dynamic Topic Models for Mining Social Media

Keywords

References

Blitzer, J., McDonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: Proc. Conf. on EMNLP (2006)
Google Scholar
Cano, E., He, Y., Liu, K., Zhao, J.: A weakly supervised bayesian model for violence detection in social media. In: Proc. 6th IJCNLP 2013 (2013)
Google Scholar
Cano, A.E., Varga, A., Rowe, M., Ciravegna, F., He, Y.: Harnessing linked knowledge source for topic classification in social media. In: Proc. 24th ACM Conf. on Hypertext and Social Media, Paris, France (2013)
Google Scholar
Caruana, R.: Multitask learning. 28(1), 41–75 (1997)
Google Scholar
Chen, G.H., Nikolov, S., Shah, D.: A latent source model for nonparametric time series classification. In: Advances in Neural Information Processing Systems (2013)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines: And Other Kernel-Based Learning Methods. Cambridge University Press (2000)
Google Scholar
Daumé, I.: Frustratingly easy domain adaptation. In: Proceedings of the 2007 ACL (2007)
Google Scholar
Diao, Q., Jiang, J., Zhu, F., Lim, E.-P.: Finding bursty topics from microblogs. In: Proc. 50th Annual Meeting of the ACL, Jeju Island, Korea (2012)
Google Scholar
Dries, A., Rückert, U.: Adaptive concept drift detection. Stat. Anal. Data Min. 2(56) (2009)
Google Scholar
Genc, Y., Sakamoto, Y., Nickerson, J.V.: Discovering context: Classifying tweets through a semantic transform based on wikipedia. In: Schmorrow, D.D., Fidopiastis, C.M. (eds.) FAC 2011. LNCS, vol. 6780, pp. 484–492. Springer, Heidelberg (2011)
Chapter Google Scholar
He, Y.: Incorporating sentiment prior knowledge for weakly supervised sentiment analysis. ACM Transactions on Asian Language Information Processing 11(2), 4:1–4:19 (2012)
Google Scholar
He, Y., Lin, C., Gao, W., Wong, K.-F.: Tracking sentiment and topic dynamics from social media. In: Proc. of the Sixth Int. Conf. on Weblogs and Social Media, Dublin, Ireland (2012)
Google Scholar
Jones, K.S., Walker, S., Robertson, S.E.: A probabilistic model of information retrieval: Development and comparative experiments. Inf. Process. Manage. 36(6), 779–808 (2000)
Article Google Scholar
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proc. 14th IJCAI, vol. 2 (1995)
Google Scholar
Lv, W., Xu, W., Guo, J.: Transfer learning in classification based on semantic analysis. In: 2nd Int. Conf. on ICCSNT (2012)
Google Scholar
Milikic, N., Jovanovic, J., Stankovic, M.: Discovering the dynamics of terms semantic relatedness through twitter. In: Proceedings, 1st Workshop on #MSM 2011 (2011)
Google Scholar
Muñoz García, O., García-Silva, A., Corcho, O., de la Higuera Hernández, M., Navarro, C.: Identifying Topics in Social Media Posts using DBpedia. In: Proc. of the NEM Summit (2011)
Google Scholar
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. on Knowl. and Data Eng. 22(10), 1345–1359 (2010)
Article Google Scholar
Phan, X.-H., Nguyen, L.-M., Horiguchi, S.: Learning to classify short and sparse text & web with hidden topics from large-scale data collections. In: Proc. 17th Int. Conf. on World Wide Web, WWW 2008, Beijing, China (2008)
Google Scholar
Porter, M.: An algorithm for suffix stripping. Program 14(3) (1980)
Google Scholar
Saif, H., Fernandez, M., He, Y., Alani, H.: Senticircles for contextual and conceptual semantic sentiment analysis of twitter. In: Presutti, V., d’Amato, C., Gandon, F., d’Aquin, M., Staab, S., Tordai, A. (eds.) ESWC 2014. LNCS, vol. 8465, pp. 83–98. Springer, Heidelberg (2014)
Google Scholar
Salzberg, S.L., Fayyad, U.: On comparing classifiers: Pitfalls to avoid and a recommended approach. In: Data Mining and Knowledge Discovery, pp. 317–328 (1997)
Google Scholar
Song, Y., Wang, H., Wang, Z., Li, H., Chen, W.: Short text conceptualization using a probabilistic knowledgebase. In: Int. Joint. Conf. of AI (IJCAI). IJCAI/AAAI (2011)
Google Scholar
Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering. In: Proc. of the Int. ACM SIGIR (2010)
Google Scholar
Thrun, S.: Is learning the n-th thing any easier than learning the first? In: Advances in Neural Information Processing Systems (1996)
Google Scholar
Varga, A., Cano, A., Rowe, M., Ciravegna, F., He, Y.: Linked knowledge sources for topic classification of microposts: A semantic graph-based approach. In: JWS: Science, Services and Agents on the WWW (2014)
Google Scholar
Zhao, X., Shu, B., Jiang, J., Song, Y., Yan, H., Li, X.: Identifying event-related bursts via social media activities. In: Proc. of the Joint Conference on EMNLP, Jeju Island, Korea (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Knowledge Media Institute, Open University, UK
Amparo Elizabeth Cano & Harith Alani
School of Engineering and Applied Science, Aston University, UK
Yulan He

Authors

Amparo Elizabeth Cano
View author publications
You can also search for this author inPubMed Google Scholar
Yulan He
View author publications
You can also search for this author inPubMed Google Scholar
Harith Alani
View author publications
You can also search for this author inPubMed Google Scholar

Editor information

Editors and Affiliations

Yahoo Labs, Diagonal 177, 08018, Barcelona, Spain
Peter Mika
Stanford University, 1265 Welch Road, 94305, Stanford, CA, USA
Tania Tudorache
DDIS, University of Zurich, Zurich, Switzerland
Abraham Bernstein
IBM Research, Yorktown Heights, NY, USA
Chris Welty
Information Sciences Institute and Department of Computer Science, University of Southern California, Los Angeles, CA, USA
Craig Knoblock
Google, USA
Denny Vrandečić & Natasha Noy &
VU University Amsterdam, The Netherlands
Paul Groth
Department of Geography, University of California, Santa Barbara, CA, USA
Krzysztof Janowicz
School of Computer Science, The University of Manchester, Manchester, UK
Carole Goble

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cano, A.E., He, Y., Alani, H. (2014). Stretching the Life of Twitter Classifiers with Time-Stamped Semantic Graphs. In: Mika, P.,et al. The Semantic Web – ISWC 2014. ISWC 2014. Lecture Notes in Computer Science, vol 8797. Springer, Cham. https://doi.org/10.1007/978-3-319-11915-1_22

Download citation

DOI:https://doi.org/10.1007/978-3-319-11915-1_22
Publisher Name:Springer, Cham
Print ISBN:978-3-319-11914-4
Online ISBN:978-3-319-11915-1
eBook Packages:Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics