Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Identify PrimeGov subdomains in AHP Parser #8

Open
krammy19 opened this issue Feb 24, 2021 · 2 comments
Open

Identify PrimeGov subdomains in AHP Parser #8

krammy19 opened this issue Feb 24, 2021 · 2 comments
Assignees
Labels

Comments

@krammy19
Copy link
Collaborator

krammy19 commented Feb 24, 2021

This is borrowed from biglocalnews/civic-scraper#54

A number of local governments in the Bay Area and in other parts of the country post their meeting minutes, agendas, etc. on websites on the *primegov.com subdomain. These websites typically look something like this and follow the web address convention PLACE.primegov.com/public/portal, where PLACE is a custom field.

Your task is to add a primegov function to the html-request scraper2 so that it also grabs *primegov.com subdomains as possible. This will allow us to evaluate how many government agencies are using this website format, which, in turn, will help us to decide which scrapers to build next.

@xconnieex xconnieex added the good first issue Good for newcomers label Feb 24, 2021
@dineshkumar-23
Copy link

Hello,
Is it fine if we had all the places in a text file and checked the response for each? Like substituting the subdomain in the URL with a place in the text file.

@krammy19
Copy link
Collaborator Author

It seems like what you're describing is more appropriate for this issue: #11

@krammy19 krammy19 self-assigned this Feb 25, 2021
@krammy19 krammy19 changed the title Identify PrimeGov subdomains in html-request scraper2 Identify PrimeGov subdomains in AHP Parser Mar 12, 2021
@krammy19 krammy19 added On Hold and removed good first issue Good for newcomers labels Sep 6, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants