On this page, I describe some parsing errors that I came across while working with Gentzkow et al.'s dataset of parsed speeches and phrase counts from the United States Congressional Record [1].
Overall, parsing errors are remarkably rare in the files I have worked with (the Daily Editions of the 110th through 114th Congresses).
[1] Gentzkow, Matthew, Jesse M. Shapiro, and Matt Taddy. Congressional Record for the 43rd-114th Congresses: Parsed Speeches and Phrase Counts. Palo Alto, CA: Stanford Libraries [distributor], 2018-01-16. https://data.stanford.edu/congress_text