Skip to content

Software-Engineering-Group-UP/potsdam-research-repos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

Introduction

This repository contains the data and findings for paper -

Contents

  1. data/: This directory contains the raw data collected from the research repositories. The all_research_repos.csv file specifically contains data used in this analysis, which is limited to research repositories. (Note: The files all_org_repos.csv and all_user_repos.csv include repositories labeled as research/non_research)
  2. analysis/: This folder contains plot_analysis.ipynb file with graphs/plots used in paper. The analysis and result graphs can be found in the plot_analysis.ipynb Jupyter notebook.
  3. scripts/: This directory contains the code code used for plotting graphs in jupyternotebook.

Methodology

SWORDS-template-UP v1.0.0 have been used for gathering GitHub profiles, repositories and additional software developement variables. This is an exteded version of SWORDS-template

Data Collection

To collect the necessary data for our analysis, follow these steps:

  1. Collect GitHub profiles of users and organizations by using the SWORDS-template-UP collect_users script.
  2. Collect repositories of GitHub profiles using the SWORDS-template-UP collect_repositories script.
  3. Collect additional variables by running specific scripts, such as:

Usage

To reproduce our analysis, follow these steps:

  1. Clone this repository.
  2. Navigate to the analysis/ directory.
  3. Run the analysis script.

License

The data is licensed under Creative Commons Attribution 4.0 International License Creative Commons License

Citation

Please cite it as described in the CITATION.cff file.