Achilles

Achilles is the name of the project using different n-grams to guess the language of the input givent by the user. It uses bigrams or quatergrams most of the time to build a probability distribution model. The model presents the probability of getting certain combinations of characters depending on the language. The dataset of books is freely provided by https://www.gutenberg.org/. The learning algorithm generates first the probability of getting an n-gram depending on the language and store it in a file. When asked about the language of a text, the algorithm will compare the input with the set of n-grams in the file and choose the language with the highest probability. The more dataset you give to the algorithm the better it will get. With 20 books per language we can achieve up to 94% with quatergrams.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
EnglishBooks		EnglishBooks
FrenchBooks		FrenchBooks
GermanBooks		GermanBooks
SpanishBooks		SpanishBooks
Achilles.py		Achilles.py
ChainGenerator.py		ChainGenerator.py
Englishblabla_quagrams.txt		Englishblabla_quagrams.txt
Frankeshtein.txt		Frankeshtein.txt
Frenchblabla_quagrams.txt		Frenchblabla_quagrams.txt
Germanblabla_quagrams.txt		Germanblabla_quagrams.txt
HW4.pdf		HW4.pdf
HW4.txt		HW4.txt
README.md		README.md
Spanishblabla_quagrams.txt		Spanishblabla_quagrams.txt
english_bigrams.txt		english_bigrams.txt
eshu.py		eshu.py
ham.txt.txt		ham.txt.txt
hw4.py		hw4.py
hw4completed.py		hw4completed.py
liste_english.txt		liste_english.txt
saver.py		saver.py
sentences.txt		sentences.txt
spanishblabla_bigrams.txt		spanishblabla_bigrams.txt
test.txt		test.txt
test1.txt		test1.txt
test3.txt		test3.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Achilles

About

Releases

Packages

Languages

allarassemjonathan/Achilles

Folders and files

Latest commit

History

Repository files navigation

Achilles

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages