Replies: 3 comments
-
I don't think anything like this exists right now -- there's sort of two different use cases:
|
Beta Was this translation helpful? Give feedback.
-
@showerst Thanks. I've done similar manual things, but of course I'm looking to scale up. I was thinking of two parsing passes. Using the maps from the first pass, a second pass would diff each pair of pseudo-id-matching files and report differences, ignoring timestamps. Diffing could be done by something like deepdiff. Does that make sense? |
Beta Was this translation helpful? Give feedback.
-
hoping to integrate this as part of openstates/issues#85 |
Beta Was this translation helpful? Give feedback.
-
In working on scrapers I often find that they could use some serious refactoring, but I'm too scared of breaking something to take it on. It seems like some kind of regression testing framework might help.
I'm thinking of a framework that would run a current subscraper, then run the new subscraper, and compare for unexpected differences. By "subscraper" I mean e.g. a bill scraper, vote scraper, people scraper, or whatever.
This would obviously be for manual use in the development process, not e.g. from Travis CI testing.
Does such a thing exist already somewhere?
This may belong better in pupa; I think it makes sense to rough it out here, first, though.
Beta Was this translation helpful? Give feedback.
All reactions