Skip to content

The scraper/parser that produces data for TheyWorkForYou, PublicWhip, etc

License

Notifications You must be signed in to change notification settings

datarelish/parlparse

 
 

Repository files navigation

parlparse

Prerequisites

You should probably install the following tools before attempting anything in this README, otherwise things may not work as expected (or at all):

Scrape data from data.parliament.uk

The source of data for the 2010 election onwards is data.parliament.uk. This data is scraped and stored locally for parsing, but needs updating manually using the following process.

  1. Change to the scripts/datadotparl directory.
  2. Run ./crawl-members

Update from data.parliament.uk

The data.parliament data is used to generate a list of members positions (in ministers-2010.json) which includes government posts, opposition posts, parliamentary posts and committee memberships from the 2010 general election onwards.

Before updating, you should run the data.parliament.uk scraper.

  1. Change to the scripts/datadotparl directory.
  2. Run ./update-members to parse the XML and update things.

About

The scraper/parser that produces data for TheyWorkForYou, PublicWhip, etc

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 54.5%
  • HTML 42.6%
  • Perl 1.3%
  • CSS 0.6%
  • PHP 0.4%
  • Shell 0.4%
  • XSLT 0.2%