dcoxph

About

This repository hosts the R implementation of the distributed Cox Proportional Hazards algorithm as described by Lu et al. that can be used with the Personal Health Train distributed learning infrastructure.

The repository has been forked from the master branch. This repository has been tested with the new ppDLI infrastructure.

How to use this algorithm?

1. Getting the code

First, clone the repository and enter the directory:

git clone https://github.com/IKNL/dcoxph.git
cd dcoxph

2. Run the code in R-Studio

In order to run the algorithm, the following two approaches can be taken The following steps assume you have R (and RStudio) and git installed. If you run into trouble, please create an issue in the tracker.

2.1 Installing dependencies

Next, install the required packages in R. Either run the following in bash:

RScript install_packages.R

or run the following in R:

packages <- c(
  "abind",
  "dplyr",
  "httr",
  "rjson"
)

install.packages(packages)

2.2 Running the algorithm

This step assumes you have access to a central server and know your username, password and collaboration id.

A researcher then runs the analysis by:
i. Creating a client that communicates with the distributed learning infrastructure
ii.Calling the method dcoxph with the appropriate parameters

This is illustrated by the following R code: ```R source("Client.R") source("dl_coxph.R")

      # Create a client object to communicate with the server.
      #Example :  client <- Client("http://137.117.138.98:5000/api", "myname", "my_password", 2)

      client <- Client(host, username, password, collaboration_id)   
      client$authenticate()

      # Parameters used to interpret the hub's datastore
      expl_vars <- c("Age","Race2","Race3","Mar2","Mar3","Mar4","Mar5","Mar9",
                   "Hist8520","hist8522","hist8480","hist8501","hist8201",
                   "hist8211","grade","ts","nne","npn","er2","er4")
      time_col <- "Time"
      censor_col <- "Censor"

      results <- dcoxph(client, expl_vars, time_col, censor_col)
      ```

or, you can run the test_maastro.R script with correct username, password and collaboration id.

3. Run the code in Jupyter Notebook

2.1 Installing dependencies

Install [anaconda distribution] (https://www.anaconda.com/distribution/)
Install r package in anaconda run jupyter notebook in anaconda

4. Optional: encapsulating the distributed code in a Docker image

The code is split into a local and a distributed part. Both parts are implemented in the same R script dl_coxph.R. The Docker registry at https://docker-registry.distributedlearning.ai already hosts an image with the distributed code. If you are using your own installation of the infrastructure, you/the researcher should create a Docker image that holds the distributed code (see also build_docker.sh) and push the image to a (private) Docker registry. This requires Docker to be installed on the machine.

If you do not have access rights to https://docker-registry.distributedlearning.ai, a similar image can be extracted from here

For an overview of the working of the algorithm, see the figure below:

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.ipynb_checkpoints		.ipynb_checkpoints
img		img
.gitignore		.gitignore
Client.R		Client.R
Cox - Combined_Radiomics.txt		Cox - Combined_Radiomics.txt
Dockerfile		Dockerfile
Dockerfile.custom-r-base		Dockerfile.custom-r-base
LICENSE		LICENSE
README.md		README.md
SeerMetHeader.csv		SeerMetHeader.csv
SurvCurv.R		SurvCurv.R
UMASS-p.csv		UMASS-p.csv
UMASS-uni.csv		UMASS-uni.csv
Untitled.ipynb		Untitled.ipynb
build_docker.sh		build_docker.sh
d_coxph.Rproj		d_coxph.Rproj
dl_coxph.R		dl_coxph.R
forestplot_clin.R		forestplot_clin.R
forestplot_lp.R		forestplot_lp.R
forestplot_rad.R		forestplot_rad.R
forestplot_radReduced.R		forestplot_radReduced.R
hi.txt		hi.txt
install_packages.R		install_packages.R
requirements.txt		requirements.txt
run_coxph.R		run_coxph.R
run_sparql_for_cox.py		run_sparql_for_cox.py
run_tests.R		run_tests.R
split_SEER_dataset.R		split_SEER_dataset.R
test.R		test.R
test_coxph.R		test_coxph.R
test_maastro.R		test_maastro.R
validate_task.R		validate_task.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dcoxph

About

How to use this algorithm?

1. Getting the code

2. Run the code in R-Studio

2.1 Installing dependencies

2.2 Running the algorithm

3. Run the code in Jupyter Notebook

2.1 Installing dependencies

4. Optional: encapsulating the distributed code in a Docker image

About

Releases

Packages

Languages

License

VarshaGouthamchand/d_coxph_SPARQL_combined

Folders and files

Latest commit

History

Repository files navigation

dcoxph

About

How to use this algorithm?

1. Getting the code

2. Run the code in R-Studio

2.1 Installing dependencies

2.2 Running the algorithm

3. Run the code in Jupyter Notebook

2.1 Installing dependencies

4. Optional: encapsulating the distributed code in a Docker image

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages