Releases: yutkin/Lenta.Ru-News-Dataset
Releases · yutkin/Lenta.Ru-News-Dataset
Dataset updated
Errors fixed. Dataset expanded.
Relase notes
- Shuffle of URLs in a batch has been fixed
- Overall speed up due to using of
ProcessPoolExecutor
for parsing - Small fixes and improvements in a code base
- Added new articles until 15 December 2018
N.B.
Python 3.7 required!
Fix errors when URLs in one batch have been shuffled
0.2 update README