OpenDemocracyManitoba/Winnipeg-Hansard-Parser

Winnipeg Council Hansard Parser

This tool is a collection of Ruby and JavaScript programs that convert the council meeting hansards posted on the City of Winnipeg website from PDF to JSON to HTML.

The Process Described

  1. Download a hansard PDF from the City of Winnipeg website.
  2. Convert the PDF to an HTML file using HTML Publish*.
  3. Run parse_handsard.rb with the name of the converted HTML file from step 2 as its only argument.
  4. Load the generated HTML file in a web browser and fix any parsing errors.
  5. Grab the resulting JSON.
  6. Run the visualization script to convert the JSONified hansard into HTML meeting minutes.

* If this service ever goes down, it should be possible to use the BCL easyConverter SDK to perform this conversion.
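
To make the final JSON-to-HTML step more concrete, here is a minimal Ruby sketch of the idea. It is not the visualization script from this repository. The JSON layout (an array of entries with "speaker" and "text" keys) and the method name hansard_json_to_html are assumptions made purely for illustration; the real JSON produced in step 5 may be structured differently.

```ruby
require "json"
require "erb"

# Illustrative only: assumes a hypothetical schema where the hansard JSON is
# an array of objects, each with a "speaker" and a "text" key.
def hansard_json_to_html(json_string)
  entries = JSON.parse(json_string)

  items = entries.map do |entry|
    # Escape both fields so stray markup in the transcript cannot break the page.
    speaker = ERB::Util.html_escape(entry.fetch("speaker"))
    text    = ERB::Util.html_escape(entry.fetch("text"))
    "  <li><strong>#{speaker}:</strong> #{text}</li>"
  end

  "<ul>\n#{items.join("\n")}\n</ul>"
end

# Made-up sample data, not taken from a real hansard.
sample = '[{"speaker": "Madam Speaker", "text": "Order & decorum, please."}]'
puts hansard_json_to_html(sample)
```

Using fetch instead of [] makes the sketch fail loudly with a KeyError when an entry is missing a field, which can be useful while parsing errors are still being fixed by hand in step 4.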

License

This is free and unencumbered software released into the public domain. See UNLICENSE for details.

About

Extracting Data from City of Winnipeg Council Meeting Docs.