Skip to content

sinaahmadi/KurdishMT

Repository files navigation

Kurdish Machine Translation

This repository contains the data and the models described in the paper "Towards Machine Translation for the Kurdish Language". Please note that ku and en refer to Sorani Kurdish (=ckb in ISO 639-3) and English, respectively.


We share technical details of our models here so that future systems can be compared with the current project as follows:

See also

Shortly after this project, a set of parallel corpora containing Sorani-Kurmanji, Sorani-English and Kurmanji-English sentences was published. Check it out at https://github.com/KurdishBLARK/InterdialectCorpus.

Cite this paper

If you use any part of the data, please consider citing this paper as follows:

@inproceedings{ahmadi-masoud-2020-towards,
    title = "Towards Machine Translation for the {K}urdish Language",
    author = "Ahmadi, Sina  and
      Masoud, Maraim",
    booktitle = "Proceedings of the 3rd Workshop on Technologies for MT of Low Resource Languages",
    month = dec,
    year = "2020",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.loresmt-1.12",
    pages = "87--98"
}

License

Apache License