Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Russian/English/Estonian/Finnish/Swedish phonetic algorithm based on Soundex and Metaphone

License

NotificationsYou must be signed in to change notification settings

roddar92/russian_soundex

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Russian, English, Sweden, Estonian and Finnish Phonetic algorithm based on Soundex/Metaphone.

Package has both implemented phoneme transformation into letter-number sequence and distance engine for comparison of phonetic sequences (based on Levenstein and Hamming distances).

Furthermore, both Russian phonetic algorithms supports preprocessing for specific phoneme cases.

Quick start

  1. Install this package viapip
pipinstallfonetika
  1. Import Soundex algorithm.

Package supports a lot of opportunities, it's possible to cut a result sequence (like in the original Soundex version) or also code vowels.

fromfonetika.soundeximportRussianSoundexsoundex=RussianSoundex(delete_first_letter=True)soundex.transform('ёлочка')...J070530soundex=RussianSoundex(delete_first_letter=True,code_vowels=True)soundex.transform('ёлочка')...JA7A53A

A structure of the library is scalable,RussianSoundex class inherits basic classSoundex (original for English language). In order to extend our algorithm, you need just inherit own class fromSoundex and override methods.

  1. Import Soundex distance for usage of string comparision
fromfonetika.distanceimportPhoneticsInnerLanguageDistancesoundex=RussianSoundex(delete_first_letter=True)phon_distance=PhoneticsInnerLanguageDistance(soundex)phon_distance.distance('ёлочка','йолочка')...0
  1. You can also calculate distance between words of two languages. It would be useful for working with one language family group.
fromfonetika.distanceimportPhoneticsBetweenLanguagesDistancem1=FinnishMetaphone(reduce_word=False)m2=EstonianMetaphone(reduce_word=False)phon_distance=PhoneticsBetweenLanguagesDistance(m1,m2)phon_distance.distance('yö','öö')...1

About

Russian/English/Estonian/Finnish/Swedish phonetic algorithm based on Soundex and Metaphone

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp