
A repository for Quechua to Spanish scripts and corpora


johneortega/mt_quechua_spanish


This is the project for a neural machine translation system from Quechua to Spanish. The code and corpora are part of a paper authored by John E. Ortega (NYU), Richard Castro Mamani (Cuzco), and Kyunghyun Cho (NYU).

The annotations folder contains the human subject evaluations. The corpora folder contains the new corpora commissioned by us at NYU; in particular, the "magazine" folder under corpora has 100 translations done by a professional Quechua->Spanish translator.

The Opus folder contains files obtained from the JW300 corpus on http://opus.nlpl.eu/

There is a train, validate, test split for training OpenNMT (or any MT) models...
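For anyone who wants to build a comparable split from their own parallel data, the following is a minimal sketch and not the procedure used to produce the split in this repository. The function name `split_parallel`, the 10%/10% validate/test fractions, and the fixed seed are all assumptions chosen for illustration. The one property it is meant to show is that source and target sentences are shuffled together, so each Quechua line stays aligned with its Spanish translation.

```python
import random


def split_parallel(src_lines, tgt_lines, valid_frac=0.1, test_frac=0.1, seed=0):
    """Split aligned sentence pairs into train / validate / test parts.

    Hypothetical helper: the fractions and the seed are illustrative
    assumptions, not values taken from this repository.
    """
    if len(src_lines) != len(tgt_lines):
        raise ValueError("source and target must have the same number of lines")

    # Pair each source sentence with its translation before shuffling,
    # so the alignment survives the shuffle.
    pairs = list(zip(src_lines, tgt_lines))
    random.Random(seed).shuffle(pairs)

    n_valid = int(len(pairs) * valid_frac)
    n_test = int(len(pairs) * test_frac)

    return {
        "validate": pairs[:n_valid],
        "test": pairs[n_valid:n_valid + n_test],
        "train": pairs[n_valid + n_test:],
    }
```

Each part is a list of `(source, target)` tuples. Writing the two sides of each part to separate line-aligned text files gives the plain-text layout that most MT toolkits accept as input.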
