This repository is dedicated to the collection and analysis of public information from Terraform GitHub issues, comments and events. The data is collected from GitHub APIs and stored in a MongoDB database for further analysis.
You can collect the data in the following ways:
Run mongorestore
to load data from dump files in the repository.
- Set the
GITHUB_TOKEN
environment. Unauth API calls can easily reach the GitHub API limit. - Run a mongodb locally, e.g.,
docker run -v mongodbdata:/data/db -p 27017:27017 -d mongo
.
Run the collector.py
to collect data from GitHub APIs.
Once the data set is ready, you can run each script to do analysis. The script file names should be descriptive.