Download any one of the text datasets mentioned in the previous lecture.
- Compute the number of tokens, the number of types, and the type-token ratio (TTR).
- Plot Zipf’s law and check whether it also holds for word meanings and word lengths. When does it hold, and when does it not?
- Plot Heaps’ law. Fit a curve and report the estimated K and β values.
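
As a starting point, the quantities behind the tasks above can be sketched in plain Python. This is a minimal sketch under several assumptions, not a reference solution: the regex tokenizer (lowercased runs of letters and apostrophes) is only one possible choice, all function names are hypothetical, and the Heaps' law fit of V(N) = K · N^β is done by least squares in log-log space rather than by a nonlinear curve fit, so the estimates may differ slightly from other fitting methods.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumption: a token is a lowercased run of letters/apostrophes.
    # Other tokenizers will give different token and type counts.
    return re.findall(r"[a-z']+", text.lower())


def type_token_stats(tokens):
    # Tokens = all word occurrences, types = distinct words,
    # TTR = types / tokens.
    n_tokens = len(tokens)
    n_types = len(set(tokens))
    ttr = n_types / n_tokens if n_tokens else 0.0
    return n_tokens, n_types, ttr


def zipf_rank_frequency(tokens):
    # (rank, frequency) pairs; rank 1 is the most frequent type.
    freqs = sorted(Counter(tokens).values(), reverse=True)
    return list(enumerate(freqs, start=1))


def heaps_curve(tokens):
    # (N, V) pairs: vocabulary size V after reading the first N tokens.
    seen = set()
    curve = []
    for n, tok in enumerate(tokens, start=1):
        seen.add(tok)
        curve.append((n, len(seen)))
    return curve


def fit_power_law(points):
    # Fit y = K * x**beta via ordinary least squares on
    # log y = log K + beta * log x. Needs at least two distinct x values.
    xs = [math.log(x) for x, _ in points]
    ys = [math.log(y) for _, y in points]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    k = math.exp(mean_y - beta * mean_x)
    return k, beta
```

To use it, read the downloaded dataset into a string, call `tokenize`, and pass the token list to the other functions; `fit_power_law(heaps_curve(tokens))` returns the estimated K and β. The pairs from `zipf_rank_frequency` and `heaps_curve` can be plotted on log-log axes with any plotting library, for example matplotlib's `loglog`.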