
A repository for Quechua-to-Spanish scripts and corpora


johneortega/mt_quechua_spanish


Latest commit: 254cea0 · Feb 18, 2020 (4 commits)


This project contains a neural machine translation system from Quechua to Spanish. The code and corpora are part of a paper authored by John E. Ortega (NYU), Richard Castro Mamani (Cuzco), and Kyunghyun Cho (NYU).

The annotations folder contains the human-subject evaluations, and the corpora folder contains the new corpora commissioned by us at NYU. Specifically, the "magazine" folder under corpora has 100 translations done by a professional Quechua->Spanish translator.

The Opus folder contains files obtained from the JW300 corpus on http://opus.nlpl.eu/

There is a train, validate, test split for training OpenNMT (or any MT) models...
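The split in this repository is already prepared. For readers who want to build a comparable split from their own aligned Quechua/Spanish lines, here is a minimal sketch. The function name `split_parallel`, the 10%/10% fractions, and the fixed seed are assumptions for illustration only; they are not the procedure used to produce the files here.

```python
import random


def split_parallel(src_lines, tgt_lines, valid_frac=0.1, test_frac=0.1, seed=0):
    """Shuffle aligned sentence pairs and split them into train/validate/test.

    src_lines and tgt_lines are lists of sentences where line i of the source
    is the translation of line i of the target (hypothetical inputs).
    """
    if len(src_lines) != len(tgt_lines):
        raise ValueError("source and target must have the same number of lines")

    # Keep each source sentence attached to its translation while shuffling.
    pairs = list(zip(src_lines, tgt_lines))
    random.Random(seed).shuffle(pairs)

    n_test = int(len(pairs) * test_frac)
    n_valid = int(len(pairs) * valid_frac)

    test = pairs[:n_test]
    valid = pairs[n_test:n_test + n_valid]
    train = pairs[n_test + n_valid:]
    return train, valid, test
```

Each returned list holds `(source, target)` tuples, so the two sides can be written to separate files line by line for whichever MT toolkit is used.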
