diff --git a/README.md b/README.md index 0852d23..37e55e4 100644 --- a/README.md +++ b/README.md @@ -60,6 +60,8 @@ Big Data Stream processing engines, exemplified by tools like Apache Flink, empl #### Datasets * All datasets used are in this [link](https://drive.google.com/drive/folders/1F3ageBfsfOXqHKrk0H0ItqkJ4WJr_lQd?usp=sharing). +* The main dataset used contains around 6M records. +* The secondary dataset used contains around 49M records. #### Out Of Order Data Generator * Code could be found [here](https://drive.google.com/drive/folders/1Hkza13L3HfT8U7eVvLBOLnXrxN8r6Zhr?usp=sharing).