
This project provides spell-check support for Urdu-to-Hindi transliteration. The spelling errors in this setting are mostly errors in matras (dependent vowel signs).


rohithkodali/Hindi-Spell-Check-Using-Language-Modelling

 
 


1) Tokenise the Hindi corpus.
python indic_tokenizer.py --i hindi_corpus_5lac.txt --o hindi_5lac_tokenised.txt --l hin
2) Run the spell checker. Give an input sentence together with the index of the word to be spell-checked.
python BigramModelSpellCheck.py

The code returns a ranked list of word suggestions, with the most probable word first.
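
The ranking step can be illustrated with a minimal sketch, assuming a bigram model with add-one smoothing and candidates generated by swapping or dropping a single matra. The names `train_bigrams`, `matra_variants`, `suggest`, the `MATRAS` list, and the toy corpus are all hypothetical and are not taken from BigramModelSpellCheck.py, whose actual candidate generation and scoring may differ.

```python
from collections import Counter

# Assumed set of Devanagari matras (dependent vowel signs), U+093E..U+094C:
# aa, i, ii, u, uu, e, ai, o, au. The real script may use a different set.
MATRAS = ["\u093e", "\u093f", "\u0940", "\u0941", "\u0942",
          "\u0947", "\u0948", "\u094b", "\u094c"]


def train_bigrams(sentences):
    """Count unigrams and bigrams over whitespace-tokenised sentences."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def matra_variants(word, vocab):
    """Return in-vocabulary words reachable by swapping or dropping one matra."""
    candidates = {word}
    for i, ch in enumerate(word):
        if ch in MATRAS:
            candidates.add(word[:i] + word[i + 1:])          # drop the matra
            for matra in MATRAS:
                candidates.add(word[:i] + matra + word[i + 1:])  # swap it
    return {c for c in candidates if c in vocab}


def suggest(tokens, index, unigrams, bigrams):
    """Rank candidates for tokens[index] by smoothed bigram context score."""
    vocab_size = len(unigrams)
    left = tokens[index - 1] if index > 0 else "<s>"
    right = tokens[index + 1] if index + 1 < len(tokens) else "</s>"

    def prob(prev, cur):
        # Add-one (Laplace) smoothed estimate of P(cur | prev).
        return (bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size)

    scored = [(prob(left, cand) * prob(cand, right), cand)
              for cand in matra_variants(tokens[index], unigrams)]
    # Highest score first; ties are broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [cand for _, cand in scored]


# Toy corpus standing in for the tokenised Hindi corpus (illustrative only).
corpus = ["मैं पानी पीता हूँ", "वह पानी लाया", "उसे पाने दो"]
unigrams, bigrams = train_bigrams(corpus)

# "पानि" is a misspelling (short i matra) at index 1 of the input sentence.
print(suggest(["मैं", "पानि", "पीता", "हूँ"], 1, unigrams, bigrams))
```

In this sketch a candidate is scored by the product of the probabilities of the two bigrams it forms with its left and right neighbours, so a word that fits the surrounding context outranks one that only resembles the misspelling.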
