- Notifications
You must be signed in to change notification settings - Fork6
ejtaal/jsastem
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This little project aims to create a simple arabic stemmerimplemented in JavaScript.
Good question. Because I need one for my other project,Mabhathu Tulab (A student's place of research) whichis an Arabic - Arabic dictionary, similar tohttp://baheth.infobut better of course ;) Its main feature is that theuser may click on any word of a dictionary entry that may thenbe further explained by an overlay.
It seems no-one has implemented one in JavaScript yet, forgood reason I imagine. The ISRI stemmer as implemented by theNLTK project seems like a very straight forward stemmer, nothingtoo complicated and time consuming for my needs. This projecttherefore aims to port the python script into javascript.
Hopefully I can find some time to increase accuracy. Some wordscan be derived from multiple possible theoretical roots so thesecould be enumerated in the return value. Prior to returning themthey could be compared against some known lists of existing rootsto filter out unknown roots.
License: GPLCopyright: Erik Taalejtaal@gmail.com ((http://ejtaal.net)