Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Arabic NLP tool used to perform Text Search, POS tagging, Translation, auto-diacritization, etc..

NotificationsYou must be signed in to change notification settings

mohabmes/Arabycia

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 

Repository files navigation

Arabic NLP tool Built using NLTK, Pyaramorph, and Sinai-corpus to perform:

  • Tokenization
  • Lemmatization
  • Segmentation
  • Transliteration
  • Reverse Transliteration
  • Sentence diacritization
  • Text Search
  • POS tagging
  • Translation
  • Find ambiguity

Usage

Input
text = 'يستعيد الكاتب في هذه الرواية كيف تحولت من مدينة للانوار الي مدينة للاشباح'arabycia = Arabycia()arabycia.set_raw_text(text)arabycia.analyze()
Output
Sentence :يستعيد الكاتب في هذه الرواية كيف تحولت من مدينة للانوار الي مدينة للاشباحWith Diacritics :يَسْتَعِيد الكاتِب فِي هٰذِهِ الرِوايَة كَيْفَ تَحَوَّلْتُ مِن مَدِينَة لِلأَنْوار إِلَى مَدِينَة لِلأَشْباحPOS :sotaEiyd/VERB_IMPERFECT kAtib/NOUN fiy/PREP h`*ihi/DEM_PRON_F riwAy/NOUN kayofa/REL_PRON taHaw~al/VERB_PERFECT min/PREP madiyn/NOUN >anowAr/NOUN <ilaY/PREP madiyn/NOUN >a$obAH/NOUNWord  : يَسْتَعِيدyasotaEiyd{isotaEAd_1 POS   : ya/IV3MS+sotaEiyd/VERB_IMPERFECT Gloss : recover;regain;reclaimWord  : هٰذِهِh`*ihih`*A_1 POS   : h`*ihi/DEM_PRON_F Gloss : this/theseWord  : لِلأَنْوارlil>anowArnuwr_2 POS   : li/PREP+Al/DET+>anowAr/NOUN Gloss : lightsWord  : لِلأَشْباحlil>a$obAH$abaH_1 POS   : li/PREP+Al/DET+>a$obAH/NOUN Gloss : specters;shapesWord  : الكاتِبAlkAtibkAtib_1 POS   : Al/DET+kAtib/NOUN Gloss : writer;authorWord  : فِيfiyfiy_1 POS   : fiy/PREP Gloss : inWord  : الرِوايَةAlriwAyapriwAyap_1 POS   : Al/DET+riwAy/NOUN+ap/NSUFF_FEM_SG Gloss : story;novelWord  : كَيْفَkayofakayofa_1 POS   : kayofa/REL_PRON Gloss : howWord  : تَحَوَّلْتُtaHaw~alotutaHaw~al_1 POS   : taHaw~al/VERB_PERFECT+tu/PVSUFF_SUBJ:1S Gloss : be changed;be transformedWord  : مِنminmin_1 POS   : min/PREP Gloss : fromWord  : مَدِينَةmadiynapmadiynap_1 POS   : madiyn/NOUN+ap/NSUFF_FEM_SG Gloss : cityWord  : إِلَى<ilaY<ilaY_1 POS   : <ilaY/PREP Gloss : to;towardsWord  : مَدِينَةmadiynapmadiynap_1  POS   : madiyn/NOUN+ap/NSUFF_FEM_SG Gloss : city
Input
text = 'يستجمع المؤرخ أفكاره'arabycia = Arabycia()arabycia.set_raw_text(text)search_result = arabycia.text_search("جمع")print(search_result)
Output
['يستجمع']

Notes

Requirement

  • NLTK
  • Sinai-corpus

About

Arabic NLP tool used to perform Text Search, POS tagging, Translation, auto-diacritization, etc..

Topics

Resources

Stars

Watchers

Forks

Languages


[8]ページ先頭

©2009-2025 Movatter.jp