
TextClassificationWeka

This project is composed of two parts:

The first part is light stemming. In the file "param.txt", specify the file to stem and the file that will receive the stemmed output. To stem a file, run the class main in the package stemmer.Analysis.
The second part is text classification using machine learning algorithms from the Weka toolkit. In the file "param.txt", specify the ARFF file to classify and the classifier to use, either SMO or NaiveBayes.

We used the project "https://github.com/motazsaad/arabic-light-stemmer" as the basis for the stemming part and modified it to suit our needs.
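
For readers unfamiliar with light stemming, here is a minimal sketch of the general idea in plain Java: strip at most one common prefix and one common suffix from each word, and skip stop words. It is not the code of this project or of arabic-light-stemmer. The class name LightStemmerSketch, the affix lists, the minimum stem length of 3, and the stop-word handling are all assumptions made for illustration. The source file is assumed to be compiled as UTF-8.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Set;

// Hypothetical illustration of light stemming; not the project's stemmer.
final class LightStemmerSketch {
    // Assumed affix lists. Longer prefixes come first so that "وال" is
    // tried before "ال" and "و".
    private static final List<String> PREFIXES =
        Arrays.asList("وال", "بال", "كال", "فال", "ال", "لل", "و");
    private static final List<String> SUFFIXES =
        Arrays.asList("ها", "ان", "ات", "ون", "ين", "يه", "ية", "ه", "ة", "ي");

    // Assumed lower bound on what must remain after removing an affix.
    private static final int MIN_STEM_LENGTH = 3;

    // Removes at most one prefix, then at most one suffix.
    static String stem(String word) {
        String w = word;
        for (String p : PREFIXES) {
            if (w.startsWith(p) && w.length() - p.length() >= MIN_STEM_LENGTH) {
                w = w.substring(p.length());
                break;
            }
        }
        for (String s : SUFFIXES) {
            if (w.endsWith(s) && w.length() - s.length() >= MIN_STEM_LENGTH) {
                w = w.substring(0, w.length() - s.length());
                break;
            }
        }
        return w;
    }

    // Splits a line on whitespace, drops stop words, and stems the rest.
    static String stemLine(String line, Set<String> stopWords) {
        StringBuilder out = new StringBuilder();
        for (String token : line.trim().split("\\s+")) {
            if (token.isEmpty() || stopWords.contains(token)) {
                continue;
            }
            if (out.length() > 0) {
                out.append(' ');
            }
            out.append(stem(token));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        Set<String> stopWords = Collections.singleton("في");
        System.out.println(stemLine("ذهب المعلمون في المدرسة", stopWords));
    }
}
```

The sketch ignores diacritics and letter normalization, which a real Arabic stemmer would normally handle before affix removal.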

Repository contents:
src
README.md
in.txt
manifest.mf
out.txt
param.txt
stopwords.txt
test.arff
weka.jar