Skip to content

Latest commit

 

History

History
56 lines (41 loc) · 1.74 KB

README.md

File metadata and controls

56 lines (41 loc) · 1.74 KB

izindaba zesiZulu ngezigaba zakhona

Categorised isiZulu News. Source data is the isiZulu news from the SABC social media posts.

Give Feedback 📑: DSFSI Resource Feedback Form

Dataset Information

Columns

  • source
  • utime
  • text
  • category

example below

image

Online Repository link

The dataset has also been added to the South African News Data repository.

Authors

See also the list of contributors who participated in this project.

Citing the dataset

@inproceedings{ngomane-etal-2023-unsupervised,
    title = "Unsupervised Cross-lingual Word Embedding Representation for {E}nglish-isi{Z}ulu",
    author = "Ngomane, Derwin  and
      Mabuya, Rooweither  and
      Abbott, Jade  and
      Marivate, Vukosi",
    booktitle = "Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023)",
    month = may,
    year = "2023",
    address = "Dubrovnik, Croatia",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.rail-1.2",
    doi = "10.18653/v1/2023.rail-1.2",
    pages = "11--17",
}

License

Dataset license is CC BY SA