    Repositories list

    • lacuna

      Public
      This repository is dedicated to the data engineering and data pipelines of the Lacuna project, which aims to build a "Household Electricity Consumption Dataset" in Sri Lanka.
      Python
      0000 Updated Oct 21, 2024
    • HTML
      0000 Updated Aug 18, 2023
    • A dataset of 3576 documents in Sinhala, drawn from Sri Lankan news websites and fact-checking operations, annotated as CREDIBLE, FALSE, PARTIAL or UNCERTAIN. The dataset has markers for the content of the document, the classification, the web domain from which each document was retrieved, and the date on which the document was publis…
      2400 Updated Nov 23, 2022
    • Scala
      1000 Updated Jun 8, 2022
    • HTML
      0100 Updated May 3, 2022
    • Backend server for media monitoring automation (discordance)
      TypeScript
      MIT License
      1000 Updated Nov 5, 2021
    • Two large language corpora extracted from Facebook, focused primarily on Sinhala text. Statuses are timestamped and carry origin markers. A rudimentary stopword list is included.
      5900 Updated Jan 16, 2021
    • R
      1000 Updated Apr 28, 2020
    • A dataset of millions of news articles scraped from a curated list of data sources.
      Apache License 2.0
      97100 Updated Jan 25, 2020
    • thirdeye

      Public
      Deepfake detection
      Python
      1000 Updated Sep 5, 2019
    • Jupyter Notebook
      Other
      3000 Updated Jan 17, 2017