DataExtractor A tool for extracting datasets from popular datasources and uploading them into a cloud-hosted file storage leveraging Apache Parquet as file format