umberto-sonnino/GalicianNLP

GalicianNLP

The repository for the Galician part of the NLP First Homework

The overall objective of this homework is to create a multilingual morphological parser based on Wiktionary. Each team (one or two students) works on its assigned language(s). The team's objective is to deliver Java code that extracts a full morphological table for the assigned language(s), together with the table itself. The information has to be extracted from the Wiktionary dump of the given language and/or the English Wiktionary dump, which contains information for all languages.

What has been implemented

The minimum classes to be implemented are:
– An extension of our MorphoEntryIterator called <Language>MorphoEntryIterator, which iterates over the entries in the language-specific raw dump and/or the English (multilingual) dump, where <Language> is your language
– A class for each morphological rule, as an implementation of MorphoRule, for the given language. Naming convention: <Language>MorphoRule
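The iteration step described above can be sketched as follows. This is a minimal illustration, not the course framework: the real MorphoEntryIterator base class and its signatures are not shown in this README, so `PageSketch` and `GalicianEntryIteratorSketch` are hypothetical names, and the input is assumed to be a list of pages already parsed out of the dump. It also assumes the English Wiktionary convention of marking a language section with a level-2 heading such as `==Galician==`.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;
import java.util.regex.Pattern;

// Hypothetical stand-in for one dump page: its title and raw wikitext.
final class PageSketch {
    final String title;
    final String wikitext;

    PageSketch(String title, String wikitext) {
        this.title = title;
        this.wikitext = wikitext;
    }
}

// Hypothetical iterator that yields only the titles of pages
// containing a level-2 "Galician" section.
final class GalicianEntryIteratorSketch implements Iterator<String> {
    // Matches a line that is exactly a level-2 heading, e.g. "==Galician==".
    private static final Pattern GALICIAN_SECTION =
            Pattern.compile("(?m)^==\\s*Galician\\s*==\\s*$");

    private final Iterator<PageSketch> pages;
    private String nextTitle;

    GalicianEntryIteratorSketch(List<PageSketch> pages) {
        this.pages = pages.iterator();
        advance();
    }

    // Skip forward to the next page that has a Galician section.
    private void advance() {
        nextTitle = null;
        while (pages.hasNext()) {
            PageSketch page = pages.next();
            if (GALICIAN_SECTION.matcher(page.wikitext).find()) {
                nextTitle = page.title;
                return;
            }
        }
    }

    @Override
    public boolean hasNext() {
        return nextTitle != null;
    }

    @Override
    public String next() {
        if (nextTitle == null) {
            throw new NoSuchElementException();
        }
        String result = nextTitle;
        advance();
        return result;
    }

    // Convenience helper: collect every Galician title into a list.
    static List<String> galicianTitles(List<PageSketch> pages) {
        List<String> titles = new ArrayList<>();
        Iterator<String> it = new GalicianEntryIteratorSketch(pages);
        while (it.hasNext()) {
            titles.add(it.next());
        }
        return titles;
    }

    public static void main(String[] args) {
        List<PageSketch> pages = new ArrayList<>();
        pages.add(new PageSketch("casa", "==Galician==\n===Noun===\n"));
        pages.add(new PageSketch("house", "==English==\n===Noun===\n"));
        System.out.println(galicianTitles(pages));
    }
}
```

Looking one element ahead in `advance()` keeps `hasNext()` free of side effects, which is a common way to write a filtering iterator over a large dump without loading it all at once.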

Optionally, you can create <Language>MorphoForm, your extension of MorphoForm, should you need to save more information. For instance, you will deliver:
– it/uniroma1/lcl/wimmp/FrenchMorphoEntryIterator.java
– it/uniroma1/lcl/wimmp/FrenchMorphoRules.java
– it/uniroma1/lcl/wimmp/FrenchMorphoForm.java
– it/uniroma1/lcl/wimmp/FrenchMorphoEntryIterator.class
– it/uniroma1/lcl/wimmp/FrenchMorphoRules.class
– it/uniroma1/lcl/wimmp/FrenchMorphoForm.class
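To illustrate what "a class for each morphological rule" might look like, here is a small sketch of suffix-based plural rules for Galician nouns. The interface `MorphoRuleSketch` and the class `GalicianPluralSketch` are hypothetical: the actual MorphoRule interface from the course framework is not shown here and may have a different shape. The rules cover only a few regular cases (final unaccented vowel or -n adds -s, final -r adds -es, final -z becomes -ces). Other endings, such as -l or -s, are deliberately left unhandled.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical rule contract; the course's real MorphoRule may differ.
interface MorphoRuleSketch {
    boolean appliesTo(String lemma);
    String apply(String lemma);
}

final class GalicianPluralSketch {

    // A rule that fires on a set of final characters, drops some
    // trailing characters, and appends a suffix.
    private static final class SuffixRule implements MorphoRuleSketch {
        private final String finalChars;
        private final int dropped;
        private final String suffix;

        SuffixRule(String finalChars, int dropped, String suffix) {
            this.finalChars = finalChars;
            this.dropped = dropped;
            this.suffix = suffix;
        }

        @Override
        public boolean appliesTo(String lemma) {
            if (lemma.isEmpty()) {
                return false;
            }
            char last = lemma.charAt(lemma.length() - 1);
            return finalChars.indexOf(last) >= 0;
        }

        @Override
        public String apply(String lemma) {
            return lemma.substring(0, lemma.length() - dropped) + suffix;
        }
    }

    // Rules are tried in order; the first applicable one wins.
    private static final List<MorphoRuleSketch> RULES = Arrays.asList(
            new SuffixRule("aeioun", 0, "s"),  // casa -> casas, can -> cans
            new SuffixRule("r", 0, "es"),      // muller -> mulleres
            new SuffixRule("z", 1, "ces"));    // luz -> luces

    // Returns the plural if some rule applies, otherwise an empty Optional.
    static Optional<String> pluralize(String lemma) {
        for (MorphoRuleSketch rule : RULES) {
            if (rule.appliesTo(lemma)) {
                return Optional.of(rule.apply(lemma));
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        for (String lemma : Arrays.asList("casa", "can", "muller", "luz", "animal")) {
            System.out.println(lemma + " -> " + pluralize(lemma).orElse("(no rule)"));
        }
    }
}
```

Returning an empty `Optional` for unhandled endings makes the gaps in the rule set visible instead of silently producing a wrong form. In the actual homework, the forms would come from the Wiktionary inflection data rather than from hand-written suffix rules alone.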
