Skip to content
This repository has been archived by the owner on Feb 20, 2023. It is now read-only.

Improve data extraction #3

Open
annajung opened this issue Dec 2, 2021 · 0 comments
Open

Improve data extraction #3

annajung opened this issue Dec 2, 2021 · 0 comments
Assignees

Comments

@annajung
Copy link
Contributor

annajung commented Dec 2, 2021

Description

There is a limitation with GitHub API rate limit that makes data extraction hard for medium to large size projects.

The code currently handles API rate limits by waiting an hour (time required to reset) which does not scale.

Improve current implementation of data extraction to work around the API limit

Possible Solution

  • Leverage GraphQL API to lower API calls

Steps to Reproduce

To reproduce the error, run the following with a medium to large size repo

python runDataExtraction.py <organization> <repo>
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant