Skip to content

Natural language alphabet detection library

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE-APACHE
MIT
LICENSE-MIT
Notifications You must be signed in to change notification settings

RoDmitry/alphabet_detector

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Alphabet Detector

Crate API

Detects 388 alphabets in 170 scripts

one spoken language can be written in multiple scripts, so it will be detected as a different alphabet/language

look at the alphabet.rs to understand what languages have already defined alphabets. Some of them need validation

Separates words in text (from iterator CharIndices), and detects language of words by used alphabets (chars).

Warning: can return words with chars from the Unicode private area (for Yoruba or Nuer language)

Alpha version (Work In Progress)

About

Natural language alphabet detection library

Resources

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE-APACHE
MIT
LICENSE-MIT

Stars

Watchers

Forks

Languages