Provides access to given target and disease data via FTP within Dask ensuring performace through efficient parallelization. Data is quickly integrated, processed, and analyzed.
Install the Python dependencies using:
pip install -r requirements.txt
To access all given data and parse it, please run:
python target-disease.py
Two outputs will be generated in JSON format:
This is a pseudocode which still need adaptation to ensure efficient performance and reproducibility.