Skip to content

Latest commit

 

History

History
16 lines (12 loc) · 1.02 KB

Datasets.md

File metadata and controls

16 lines (12 loc) · 1.02 KB

Datasets

This page lists a few large log datasets you can use to try out CLP and evaluate its compression ratio against other tools. Each dataset is gzipped for more efficient downloads. We will be uploading more datasets over time.

For evaluation results comparing CLP and other tools, see our paper.

Dataset Uncompressed size Download size
hadoop-14TB-part1 428.94 GB 20.33 GB
openstack-24hr 33.00 GB 2.06 GB
hive-24hr 2.07 GB 122.54 MB

We will upload the other parts soon.