antonisa/griko-italian-parallel-corpus

Griko Italian Parallel Corpus

This repository contains a (very) small parallel speech corpus pairing the endangered language Griko with Italian. It consists of 330 sentences with the following levels of information: speech, machine-extracted pseudo-phones, transcriptions, translations, and sentence alignments. A reference for evaluation following Track 2 of the Zero Resource Challenge 2017 is available in two formats, with and without silence-mark information.

The dataset is made available to the community for reproducible computational language documentation experiments and their evaluation.

  • Reference: "A small Griko-Italian speech translation corpus", Marcely ZANON BOITO, Antonios ANASTASOPOULOS, Marika LEKAKOU, Aline VILLAVICENCIO, Laurent BESACIER, SLTU 2018, Gurgaon, India.
