Benchmarking multi-omics integration approaches for the study of cancer

Introduction

This repository represents a substantial extension and refinement of the benchmarking framework originally established by Cantini et al.. Our repository builds upon their foundational work in benchmarking multi-omics integration methods, introducing the Group Aggregation via UMAP Data Integration (GAUDI) approach.

In our study, we have meticulously selected methods for benchmarking based on their distinguished performance as identified in the research by Cantini et al.. This selection enabled a robust and meaningful comparison within the realm of state-of-the-art multi-omics integration techniques. Our approach, GAUDI, was rigorously tested against these well-established methods in a series of comprehensive benchmarks. By doing so, we aimed to not only validate GAUDI's efficacy but also to provide an in-depth, comparative perspective that could potentially open new avenues in multi-omics research.

Benchmark Scenarios

Comparison on artificial datasets: 01_comparisons_on_simulated_data.qmd
Comparison on TCGA cancer datasets:
- Survival, biological, and clinical annotations: 02_comparison_on_cancer_data.qmd
- Clusters: 03_comparison_on_cancer_data_clusters.qmd
Comparison on single-cell datasets: 04_comparison_on_single_cell_data.qmd
Comparison on DepMap datasets: 05_comparison_on_DepMap_data.qmd
Benchmark of different UMAP integration methods (not included in the paper): umap_integration_methods.py

Benchmark Methods

`gaudi` R Package

We have developed a comprehensive R package for the GAUDI method, making it accessible and user-friendly for the research community. This package encapsulates all the functionalities of GAUDI, providing an efficient and streamlined way to apply our method to various multi-omics datasets.

Key Features

Easy integration of diverse omic data types
Built-in functions for clustering and feature extraction
Built-in functions for visualization of latent factors and metagenes

Citation

@article {Castellano-Escuder2024.10.07.617035,
	author = {Castellano-Escuder, Pol and Zachman, Derek K and Han, Kevin and Hirschey, Matthew D},
	title = {Interpretable multi-omics integration with UMAP embeddings and density-based clustering},
	elocation-id = {2024.10.07.617035},
	year = {2024},
	doi = {10.1101/2024.10.07.617035},
	publisher = {Cold Spring Harbor Laboratory},
	URL = {https://www.biorxiv.org/content/early/2024/10/11/2024.10.07.617035},
	eprint = {https://www.biorxiv.org/content/early/2024/10/11/2024.10.07.617035.full.pdf},
	journal = {bioRxiv}
}

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
data		data
results20240108123556		results20240108123556
results20240109194923		results20240109194923
results_DepMap		results_DepMap
results_single_cell		results_single_cell
results_umap_integration_methods		results_umap_integration_methods
scripts		scripts
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
readme_example.png		readme_example.png
umap_multiomics_integration.Rproj		umap_multiomics_integration.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Benchmarking multi-omics integration approaches for the study of cancer

Introduction

Benchmark Scenarios

Benchmark Methods

`gaudi` R Package

Key Features

Citation

About

Releases

Packages

Languages

License

hirscheylab/umap_multiomics_integration

Folders and files

Latest commit

History

Repository files navigation

Benchmarking multi-omics integration approaches for the study of cancer

Introduction

Benchmark Scenarios

Benchmark Methods

gaudi R Package

Key Features

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`gaudi` R Package

Packages