SBCDataAnalysis

1. Abstract

Artificial intelligence (AI) is currently revolutionizing countless domains in our daily lives. In many domains like medicine data required for building AI is interconnected (e.g., sequential measurements). However, current AI algorithms cannot utilize connections between data which limits their learning capabilities. A promising technology for exploiting those connections is Graph Neural Networks. In this study, we evaluated when Graph Neural Networks represent a valuable alternative to current AI algorithms and what limitations this new technology has exemplified on the classification of blood measurements as septic or not. Finally, we reveal the underlying mechanisms of Graph Neural Networks and current AI approaches for the prediction.

2. Publication

The link to the publication will be added as soon as the manuscript is accepted.

3. Installation

Unzip the CSV in extdata

Install packages using conda:

conda create -n myenv --file package-list.txt

Some packages were easier to install using pip (e.g., sklearn), so why those are included in the requirements.txt. Install them using
```
pip install -r requirements.txt
```

Note: We have used Conda version 11.7 with the following hardware setup: • Mainboard Supermicro X12SPA-TF • CPU: Intel® Xeon® Scalable Processor “Ice Lake” Gold 6338, 2.0 GHz, 32- Core • GPU: NVIDIA® RTX A6000 (48 GB GDDR6) • RAM: 8x32 GB DDR4-3200 • ROM: 2TB Samsung SSD 980 PRO, M.2

If you have any issues or problems in reproducing, do not hesitate to create an Issue or directly contact me using the following e-mail: [email protected]

4. Description

I have created multiple directories each containing different parts relevant for the SBC analysis

extdata - containing the original dataset from Steinbach et al. (https://github.com/ampel-leipzig/sbcdata/tree/main)
dataAnalysis - Contains all python scripts for pre-processing the data (reading, filtering and transforming the data) (DataAnalysis.py) and some scripts for metric evaluations (Metrics.py) and constructing feature importance and slope (FeatureImportance.py)
noise - contains scripts for writing noisy features to the original dataset
machine_learning - contains all jupyter notebooks for analyzing the dataset using machine learning algorithms (logistic regression, decision tree, random forest, XGBoost, RUSBoost) and creates/writes the feature variation graphs
neural_network - contains the jupyter notebook for analyzing the dataset using the proposed neural network
graph_learning - contains all jupyter notebooks for analyzing the dataset as similarity graphs (heterogeneous & homogeneous) and as patient-centric graphs (patient_centric) using graph learning and evaluations for the resulting attention scores
feature_variation - contains jupyter notebooks for writing feature variation graphs for the graph learning algorithms and the neural network and the jupyter notebook for visualizing all graphs for each algorithm in feature variation graphs
cuda_test - checks the cuda version and availability
keep_ssh - to ensure that the ssh tunnel does not break for a defined inactive period

If you have any questions regarding the structure or specific implementation details, do not hesitate to contact me using the following e-mail: [email protected]

5. Fundings

This research was funded by the German Research Foundation (DFG) under the project ‘Optimizing graph databases focusing on data processing and integration of machine learning for large clinical and biological datasets’ [grant numbers HE 8077/2-1, SA 465/53-1]).

6. Competing interests

The authors declare that they have no competing interests.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
cuda_test		cuda_test
dataAnalysis		dataAnalysis
extdata		extdata
feature_variation		feature_variation
graph_learning		graph_learning
keep_ssh		keep_ssh
machine_learning		machine_learning
models		models
neural_network		neural_network
noise		noise
old_experiments		old_experiments
partial_dependence		partial_dependence
src		src
statistics		statistics
time_series		time_series
.gitignore		.gitignore
Cluster.ipynb		Cluster.ipynb
LICENSE		LICENSE
README.md		README.md
package-list.txt		package-list.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SBCDataAnalysis

Table of Contents

1. Abstract

2. Publication

3. Installation

4. Description

5. Fundings

6. Competing interests

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

danielwalke/SBCDataAnalysis

Folders and files

Latest commit

History

Repository files navigation

SBCDataAnalysis

Table of Contents

1. Abstract

2. Publication

3. Installation

4. Description

5. Fundings

6. Competing interests

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages