🪼 a python library for doing approximate and phonetic matching of strings.
jamesturk/jellyfish
jellyfish is a library for approximate & phonetic matching of strings.
Source: https://github.com/jamesturk/jellyfish
Documentation: https://jamesturk.github.io/jellyfish/
Issues: https://github.com/jamesturk/jellyfish/issues
String comparison:
- Levenshtein Distance
- Damerau-Levenshtein Distance
- Jaccard Index
- Jaro Distance
- Jaro-Winkler Distance
- Match Rating Approach Comparison
- Hamming Distance
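
To give a sense of what the first metric in this list measures, here is a minimal pure-Python sketch of Levenshtein distance using the textbook dynamic-programming recurrence. The function name `levenshtein_sketch` is hypothetical and this is not jellyfish's own implementation; it only illustrates counting the minimum number of single-character insertions, deletions, and substitutions.

```python
def levenshtein_sketch(a: str, b: str) -> int:
    """Illustrative edit distance; hypothetical helper, not part of jellyfish."""
    # previous[j] is the distance between the already-processed prefix of `a`
    # and the first j characters of `b`.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (or match)
            ))
        previous = current
    return previous[-1]


print(levenshtein_sketch("jellyfish", "smellyfish"))  # prints 2
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second string rather than with the product of both lengths.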
Phonetic encoding:
- American Soundex
- Metaphone
- NYSIIS (New York State Identification and Intelligence System)
- Match Rating Codex
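
As an illustration of what a phonetic encoding does, the sketch below implements the commonly described American Soundex rules in plain Python. The name `soundex_sketch` is hypothetical, the code assumes ASCII letters, and it is not the library's implementation, so edge-case behavior (empty input, non-ASCII characters) may differ from `jellyfish.soundex`.

```python
# Consonant groups used by American Soundex; vowels, Y, H and W have no code.
_CODES = {}
for _digit, _letters in (("1", "BFPV"), ("2", "CGJKQSXZ"), ("3", "DT"),
                         ("4", "L"), ("5", "MN"), ("6", "R")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex_sketch(name: str) -> str:
    """Illustrative Soundex encoder; hypothetical helper, not part of jellyfish."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0]                # the first letter is kept as-is
    last = _CODES.get(letters[0], "")  # its code still suppresses a repeat
    for ch in letters[1:]:
        if ch in "HW":
            continue                   # H and W do not separate equal codes
        code = _CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code                    # a vowel resets `last` to ""
    return result.ljust(4, "0")        # pad short codes with zeros


print(soundex_sketch("Jellyfish"))  # prints J412
```

Names that sound alike tend to share a code, which is what makes the encoding useful for matching records with inconsistent spelling.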
    >>> import jellyfish
    >>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
    2
    >>> jellyfish.jaro_similarity('jellyfish', 'smellyfish')
    0.89629629629629637
    >>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
    1
    >>> jellyfish.metaphone('Jellyfish')
    'JLFX'
    >>> jellyfish.soundex('Jellyfish')
    'J412'
    >>> jellyfish.nysiis('Jellyfish')
    'JALYF'
    >>> jellyfish.match_rating_codex('Jellyfish')
    'JLLFSH'
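
The Jaro similarity value in the example above can be reproduced with the standard textbook definition. The following is a sketch under that assumption; `jaro_sketch` is a hypothetical name, not the library's code, and it may not match jellyfish on every edge case.

```python
def jaro_sketch(s1: str, s2: str) -> float:
    """Illustrative Jaro similarity; hypothetical helper, not part of jellyfish."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters count as matching only if they are equal and close enough.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0

    # Walk both match sequences in order; mismatched pairs are half-transpositions.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    transpositions = out_of_order // 2

    return (matches / len1
            + matches / len2
            + (matches - transpositions) / matches) / 3


print(round(jaro_sketch("jellyfish", "smellyfish"), 6))  # prints 0.896296
```

For this pair, eight characters match with no transpositions, giving (8/9 + 8/10 + 8/8) / 3.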