Movatterモバイル変換


[0]ホーム

URL:


Skip to main content

Advertisement

Springer Nature Link
Log in

Gender opposition recognition method fusing emojis and multi-features in Chinese speech

  • Neural Networks and Intelligent Systems
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Speech with gender opposition on the internet have been causing antagonism, gamophobia, and pregnancy phobia among young groups. Recognizing gender opposition speech contributes to maintaining a healthy online environment and security in cyberspace. Traditional recognition model ignores the Chinese-owned features and emojis, which inevitably affects the recognition accuracy of gender opposition. To tackle this issue, a gender opposition recognition method fusing emojis and multi-features in Chinese speech(GOR-CS) is proposed. Firstly, the exBERT method is employed to expand the encoding of emojis into the BERT vocabulary, which can ensure BERT to extract the basis vectors containing characters and emojis information. Then, the feature vectors containing Wubi, Zhengma, and Pinyin information are extracted by Word2Vec to obtain the Chinese-owned features of gender opposition text. Further, the proposed basis vector and feature vectors are fused and then fed into the Bi-GRU network to extract deeper semantics from input sentences. Finally, to determine whether the speech are related to gender opposition, the sentiment polarities are calculated with the fully connected layer and SoftMax function. Experimental results show that the proposed method can effectively improve the accuracy of gender opposition recognition.

This is a preview of subscription content,log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Algorithm 1
Fig. 7
Fig. 8

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The datasets generated during and/or analysed during the current study are not publicly available due to Protect data security but are available from the corresponding author on reasonable request.

References

  • Al-Garadi MA, Kim S, Guo Y et al (2022) Natural language model for automatic identification of intimate partner violence reports from twitter. Array 15:100217

    Article  Google Scholar 

  • Attili VSP, Mathew SK, Sugumaran V (2022) Information privacy assimilation in IT organizations. Inf Syst Front 24(5):1497–1513

    Article MATH  Google Scholar 

  • Ayo FE, Folorunso O, Ibharalu FT et al (2020) Machine learning techniques for hate speech classification of twitter data: state-of-the-art, future challenges and research directions. Comput Sci Rev 38:100311

    Article  Google Scholar 

  • Balakrishnan V, Khan S, Arabnia HR (2020) Improving cyberbullying detection using Twitter users’ psychological features and machine learning. Comput Secur 90:101710

    Article MATH  Google Scholar 

  • Burnap P, Williams ML (2016) Us and them: identifying cyber hate on Twitter across multiple protected characteristics. EPJ Data Sci 5:1–15

    Article MATH  Google Scholar 

  • Cho K, Van M B, Bahdanau D et al (2014) On the properties of neural machine translation: encoder–decoder approaches. In: 8th workshop on syntax, semantics and structure in statistical translation, SSST 2014. Association for Computational Linguistics (ACL), pp 103–111

  • Devlin J, Chang M, Lee K et al (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, pp 4171–4186

  • Diesenreiter C, Krauss O, Sandler S et al (2022) ProperBERT-proactive recognition of offensive phrasing for effective regulation. In: 2022 international conference on electrical, computer, communications and mechatronics engineering (ICECCME). IEEE, pp 1–6

  • Frenda S, Ghanem B, Montes-y-Gómez M et al (2019) Online hate speech against women: automatic identification of misogyny and sexism on twitter. J Intell Fuzzy Syst 36(5):4743–4752

    Article MATH  Google Scholar 

  • Garcia-diaz JA, Canovas-Garcia M, Colomo-Palacios R et al (2021) Detecting misogyny in Spanish tweets. An approach based on linguistics features and word embeddings. Future Gener Comput Syst 114:506–518

    Article MATH  Google Scholar 

  • Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article MATH  Google Scholar 

  • Jha A, Mamidi R (2017) When does a compliment become sexist? analysis and classification of ambivalent sexism using twitter data. In: Proceedings of the second workshop on NLP and computational social science, pp 7–16

  • Jiang A, Yang X, Liu Y et al (2022) SWSR: a Chinese dataset and lexicon for online sexism detection. Online Soc Netw Media 27:100182

    Article  Google Scholar 

  • Karlekar S, Bansal M (2018) Safecity: understanding diverse forms of sexual harassment personal stories. arXiv preprinthttp://arxiv.org/abs/1809.04739

  • Khanday AMUD, Rabani ST, Khan QR et al (2022) Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques. Int J Inf Manag Data Insights 2(2):100120

    Google Scholar 

  • Li L, Wang XT (2023) Nonverbal communication with emojis in social media: dissociating hedonic intensity from frequency. Lang Resour Eval 57(1):323–342

    Article MATH  Google Scholar 

  • Mikolov T, Chen K, Corrado G et al (2013) Efficient estimation of word representations in vector space. arXiv preprinthttp://arxiv.org/abs/1301.3781

  • Mozafari M, Farahbakhsh R, Crespi N (2020) Hate speech detection and racial bias mitigation in social media based on BERT model. PLoS ONE 15(8):e0237861

    Article MATH  Google Scholar 

  • Pamungkas EW, Basile V, Patti V (2020) Misogyny detection in twitter: a multilingual and cross-domain study. Inf Process Manag 57(6):102360

    Article  Google Scholar 

  • Parikh P, Abburi H, Chhaya N et al (2021) Categorizing sexism and misogyny through neural approaches. ACM Trans Web (TWEB) 15(4):1–31

    Article MATH  Google Scholar 

  • Pitsilis GK, Ramampiaro H, Langseth H (2018) Effective hate-speech detection in Twitter data using recurrent neural networks. Appl Intell 48(12):4730–4742

    Article MATH  Google Scholar 

  • Plaza L, Carrillo-de-Albornoz J, Morante R et al (2023) Overview of EXIST 2023: sEXism Identification in Social NeTworks. In: Advances in information retrieval: 45th European conference on information retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part III. Springer Nature Switzerland, Cham, pp 593–599

  • Rodriguez-Sanchez F, Carrillo-De-Albornoz J, Plaza L (2020) Automatic classification of sexism in social networks: an empirical study on twitter data. IEEE Access 8:219563–219576

    Article  Google Scholar 

  • Sharif O, Hoque MM (2022) Tackling cyber-aggression: identification and fine-grained categorization of aggressive texts on social media using weighted ensemble of transformers. Neurocomputing 490:462–481

    Article MATH  Google Scholar 

  • Sharifirad S, Matwin S (2019) When a tweet is actually sexist. A more comprehensive classification of different online harassment categories and the challenges in NLP. arXiv preprinthttp://arxiv.org/abs/1902.10584

  • Sreedevi AG, Harshitha TN, Sugumaran V et al (2022) Application of cognitive computing in healthcare, cybersecurity, big data and IoT: a literature review. Inf Process Manag 59(2):102888

    Article MATH  Google Scholar 

  • Sugumaran V, Ibrahim SJA (2022) Rough set based on least dissimilarity normalized index for handling uncertainty during E-learners learning pattern recognition. Int J Intell Netw 3:133–137

    MATH  Google Scholar 

  • Sundaramurthy S, Sugumaran V, Thangavelu A et al (2023) Predicting rheumatoid arthritis from the biomarkers of clinical trials using improved harmony search optimization with adaptive neuro-fuzzy inference system. J Intell Fuzzy Syst 44(1):125–137

    Article  Google Scholar 

  • Tai W, Kung HT, Dong XL et al (2020) exBERT: extending pre-trained models with domain-specific vocabulary under constrained training resources. In: Findings of the association for computational linguistics: EMNLP 2020, pp 1433–1439

  • Wang YT (2021) Analysis on marriage practice of the new generation youth and its influencing factors. China Youth Study 12:15

    MATH  Google Scholar 

  • Wang X, Kou L, Sugumaran V et al (2020) Emotion correlation mining through deep learning models on natural language text. IEEE Trans Cybern 51(9):4400–4413

    Article  Google Scholar 

  • Xie S, Pan Q, Wang X et al (2024) Combining prompt learning with contextual semantics for inductive relation prediction. Expert Syst Appl 238:121669

    Article  Google Scholar 

  • Yan P, Li L, Chen W et al (2019) Quantum-inspired density matrix encoder for sexual harassment personal stories classification. In: 2019 IEEE international conference on intelligence and security informatics (ISI). IEEE, pp 218–220

  • Zheng XX, Liu L, Hu D et al (2018) Influence of micro-blog’s cyberbullying on mental health of college students in Hefei City. Med Soc 31(09):63–65+84

    MATH  Google Scholar 

  • Zhu Z, Ke Z, Cui J et al (2018) The construction of Chinese microblog gender-specific thesauruses and user gender classification. Appl Netw Sci 3(1):1–17

    Article MATH  Google Scholar 

Download references

Funding

This work was supported by the National Natural Science Foundation of China (Grant no. 62076006), the Opening Foundation of the State Key Laboratory of Cognitive Intelligence, iFLYTEK (Grant no. COGOS-2023HE02), and the University Synergy Innovation Program of Anhui Province (Grant no GXXT-2021-008).

Author information

Authors and Affiliations

  1. School of Computer Science and Engineering, Anhui University of Science and Technology, Huainan, Anhui, China

    Shunxiang Zhang, Zichen Ma, Hanchen Li & Yunduo Liu

  2. School of Computer, Huainan Normal University, Huainan, Anhui, China

    Shunxiang Zhang & Lei Chen

  3. Department of Computer Science and Information Engineering (CSIE), Providence University, Taichung, Taiwan, Republic of China

    Kuan-Ching Li

Authors
  1. Shunxiang Zhang

    You can also search for this author inPubMed Google Scholar

  2. Zichen Ma

    You can also search for this author inPubMed Google Scholar

  3. Hanchen Li

    You can also search for this author inPubMed Google Scholar

  4. Yunduo Liu

    You can also search for this author inPubMed Google Scholar

  5. Lei Chen

    You can also search for this author inPubMed Google Scholar

  6. Kuan-Ching Li

    You can also search for this author inPubMed Google Scholar

Contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Shunxiang Zhang, Zichen Ma and Hanchen Li. The first draft of the manuscript was written by Shunxiang Zhang and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence toShunxiang Zhang.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, S., Ma, Z., Li, H.et al. Gender opposition recognition method fusing emojis and multi-features in Chinese speech.Soft Comput29, 2379–2390 (2025). https://doi.org/10.1007/s00500-025-10492-4

Download citation

Keywords

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Advertisement


[8]ページ先頭

©2009-2025 Movatter.jp