The influence of active learning model and prior knowledge choice on how long it takes to find hard-to-find relevant papers: Examining the variability of the time to discovery and the stability of its rank-orders

Description

The purpose of this project was to explore how the choice of active learning (AL) model and prior knowledge influences the time to discovery (TD) of the hard-to-find relevant papers in a dataset while using an AL-aided screening tool in the context of systematic reviewing (e.g., ASReview)

This repository contains the files to reproduce the simulation study and the subsequent analysis of the data for this project (which was conducted for a Master's thesis in Applied Data Science from Utrecht University).

How to reproduce the project

1. Access and preprocess the data

Please refer to the readme from the data folder

2. Install ASReview (and Makita)

Please refer to the readme in the scripts folder.

3. Run the jobs.bat files in the simulation folders

Run the jobs.bat files from the arfi_simulation and the multiple_models simulation folders (when the data has been accessed and prepocessed).

OR

3. Download the modified version of Makita and then run the simulations

PLease refer to the readme in the scripts folder.

4. Run the analysis notebook to generate the results

Open analysis_notebook and run the scripts (making sure to change the directory to where you have the hard_to_find_papers_project repo stored on your local computer).

License

This project is published under the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
arfi_simulation		arfi_simulation
data		data
modified_scripts		modified_scripts
multiple_models_simulation		multiple_models_simulation
.gitignore		.gitignore
DOIs, Titles and Abstracts of Hard-To-Find Papers.txt		DOIs, Titles and Abstracts of Hard-To-Find Papers.txt
LICENSE		LICENSE
README.md		README.md
analysis_notebook.ipynb		analysis_notebook.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The influence of active learning model and prior knowledge choice on how long it takes to find hard-to-find relevant papers: Examining the variability of the time to discovery and the stability of its rank-orders

Description

Table of Contents

`data`

`modified_scripts`

`multiple_models_simulation`

`arfi_simulation`

`analysis_notebook`

How to reproduce the project

1. Access and preprocess the data

2. Install ASReview (and Makita)

3. Run the jobs.bat files in the simulation folders

3. Download the modified version of Makita and then run the simulations

4. Run the analysis notebook to generate the results

License

About

Releases

Packages

Languages

License

FioByr/hard_to_find_papers_project

Folders and files

Latest commit

History

Repository files navigation

The influence of active learning model and prior knowledge choice on how long it takes to find hard-to-find relevant papers: Examining the variability of the time to discovery and the stability of its rank-orders

Description

Table of Contents

data

modified_scripts

multiple_models_simulation

arfi_simulation

analysis_notebook

How to reproduce the project

1. Access and preprocess the data

2. Install ASReview (and Makita)

3. Run the jobs.bat files in the simulation folders

3. Download the modified version of Makita and then run the simulations

4. Run the analysis notebook to generate the results

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`data`

`modified_scripts`

`multiple_models_simulation`

`arfi_simulation`

`analysis_notebook`

Packages