Skip to content

Commit

Permalink
update results
Browse files Browse the repository at this point in the history
rcannood committed Jan 22, 2025

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
1 parent 43e2ea2 commit a78daa2
Showing 7 changed files with 3,014 additions and 2,336 deletions.
54 changes: 27 additions & 27 deletions results/label_projection/data/dataset_info.json
Original file line number Diff line number Diff line change
@@ -6,28 +6,8 @@
"dataset_description": "Despite their crucial role in health and disease, our knowledge of immune cells within human tissues remains limited. We surveyed the immune compartment of 16 tissues from 12 adult donors by single-cell RNA sequencing and VDJ sequencing generating a dataset of ~360,000 cells. To systematically resolve immune cell heterogeneity across tissues, we developed CellTypist, a machine learning tool for rapid and precise cell type annotation. Using this approach, combined with detailed curation, we determined the tissue distribution of finely phenotyped immune cell types, revealing hitherto unappreciated tissue-specific features and clonal architecture of T and B cells. Our multitissue approach lays the foundation for identifying highly resolved immune cell types by leveraging a common reference dataset, tissue-integrated expression analysis, and antigen receptor sequencing.",
"data_reference": "dominguez2022crosstissue",
"data_url": "https://cellxgene.cziscience.com/collections/62ef75e4-cbea-454e-a0ce-998ec40223d3",
"date_created": "13-01-2025",
"file_size": 341174505
},
{
"dataset_id": "cellxgene_census/dkd",
"dataset_name": "Diabetic Kidney Disease",
"dataset_summary": "Multimodal single cell sequencing implicates chromatin accessibility and genetic background in diabetic kidney disease progression",
"dataset_description": "Multimodal single cell sequencing is a powerful tool for interrogating cell-specific changes in transcription and chromatin accessibility. We performed single nucleus RNA (snRNA-seq) and assay for transposase accessible chromatin sequencing (snATAC-seq) on human kidney cortex from donors with and without diabetic kidney disease (DKD) to identify altered signaling pathways and transcription factors associated with DKD. Both snRNA-seq and snATAC-seq had an increased proportion of VCAM1+ injured proximal tubule cells (PT_VCAM1) in DKD samples. PT_VCAM1 has a pro-inflammatory expression signature and transcription factor motif enrichment implicated NFkB signaling. We used stratified linkage disequilibrium score regression to partition heritability of kidney-function-related traits using publicly-available GWAS summary statistics. Cell-specific PT_VCAM1 peaks were enriched for heritability of chronic kidney disease (CKD), suggesting that genetic background may regulate chromatin accessibility and DKD progression. snATAC-seq found cell-specific differentially accessible regions (DAR) throughout the nephron that change accessibility in DKD and these regions were enriched for glucocorticoid receptor (GR) motifs. Changes in chromatin accessibility were associated with decreased expression of insulin receptor, increased gluconeogenesis, and decreased expression of the GR cytosolic chaperone, FKBP5, in the diabetic proximal tubule. Cleavage under targets and release using nuclease (CUT&RUN) profiling of GR binding in bulk kidney cortex and an in vitro model of the proximal tubule (RPTEC) showed that DAR co-localize with GR binding sites. CRISPRi silencing of GR response elements (GRE) in the FKBP5 gene body reduced FKBP5 expression in RPTEC, suggesting that reduced FKBP5 chromatin accessibility in DKD may alter cellular response to GR. We developed an open-source tool for single cell allele specific analysis (SALSA) to model the effect of genetic background on gene expression. Heterozygous germline single nucleotide variants (SNV) in proximal tubule ATAC peaks were associated with allele-specific chromatin accessibility and differential expression of target genes within cis-coaccessibility networks. Partitioned heritability of proximal tubule ATAC peaks with a predicted allele-specific effect was enriched for eGFR, suggesting that genetic background may modify DKD progression in a cell-specific manner.",
"data_reference": "wilson2022multimodal",
"data_url": "https://cellxgene.cziscience.com/collections/b3e2c6e3-9b05-4da9-8f42-da38a664b45b",
"date_created": "13-01-2025",
"file_size": 86763866
},
{
"dataset_id": "cellxgene_census/mouse_pancreas_atlas",
"dataset_name": "Mouse Pancreatic Islet Atlas",
"dataset_summary": "Mouse pancreatic islet scRNA-seq atlas across sexes, ages, and stress conditions including diabetes",
"dataset_description": "To better understand pancreatic β-cell heterogeneity we generated a mouse pancreatic islet atlas capturing a wide range of biological conditions. The atlas contains scRNA-seq datasets of over 300,000 mouse pancreatic islet cells, of which more than 100,000 are β-cells, from nine datasets with 56 samples, including two previously unpublished datasets. The samples vary in sex, age (ranging from embryonic to aged), chemical stress, and disease status (including T1D NOD model development and two T2D models, mSTZ and db/db) together with different diabetes treatments. Additional information about data fields is available in anndata uns field 'field_descriptions' and on https://github.com/theislab/mm_pancreas_atlas_rep/blob/main/resources/cellxgene.md.",
"data_reference": "hrovatin2023delineating",
"data_url": "https://cellxgene.cziscience.com/collections/296237e2-393d-4e31-b590-b03f74ac5070",
"date_created": "13-01-2025",
"file_size": 133936661
"date_created": "21-01-2025",
"file_size": 123436095
},
{
"dataset_id": "cellxgene_census/gtex_v9",
@@ -36,18 +16,28 @@
"dataset_description": "Understanding the function of genes and their regulation in tissue homeostasis and disease requires knowing the cellular context in which genes are expressed in tissues across the body. Single cell genomics allows the generation of detailed cellular atlases in human tissues, but most efforts are focused on single tissue types. Here, we establish a framework for profiling multiple tissues across the human body at single-cell resolution using single nucleus RNA-Seq (snRNA-seq), and apply it to 8 diverse, archived, frozen tissue types (three donors per tissue). We apply four snRNA-seq methods to each of 25 samples from 16 donors, generating a cross-tissue atlas of 209,126 nuclei profiles, and benchmark them vs. scRNA-seq of comparable fresh tissues. We use a conditional variational autoencoder (cVAE) to integrate an atlas across tissues, donors, and laboratory methods. We highlight shared and tissue-specific features of tissue-resident immune cells, identifying tissue-restricted and non-restricted resident myeloid populations. These include a cross-tissue conserved dichotomy between LYVE1- and HLA class II-expressing macrophages, and the broad presence of LAM-like macrophages across healthy tissues that is also observed in disease. For rare, monogenic muscle diseases, we identify cell types that likely underlie the neuromuscular, metabolic, and immune components of these diseases, and biological processes involved in their pathology. For common complex diseases and traits analyzed by GWAS, we identify the cell types and gene modules that potentially underlie disease mechanisms. The experimental and analytical frameworks we describe will enable the generation of large-scale studies of how cellular and molecular processes vary across individuals and populations.",
"data_reference": "eraslan2022singlenucleus",
"data_url": "https://cellxgene.cziscience.com/collections/a3ffde6c-7ad2-498a-903c-d58e732f7470",
"date_created": "13-01-2025",
"date_created": "21-01-2025",
"file_size": 206108150
},
{
"dataset_id": "cellxgene_census/dkd",
"dataset_name": "Diabetic Kidney Disease",
"dataset_summary": "Multimodal single cell sequencing implicates chromatin accessibility and genetic background in diabetic kidney disease progression",
"dataset_description": "Multimodal single cell sequencing is a powerful tool for interrogating cell-specific changes in transcription and chromatin accessibility. We performed single nucleus RNA (snRNA-seq) and assay for transposase accessible chromatin sequencing (snATAC-seq) on human kidney cortex from donors with and without diabetic kidney disease (DKD) to identify altered signaling pathways and transcription factors associated with DKD. Both snRNA-seq and snATAC-seq had an increased proportion of VCAM1+ injured proximal tubule cells (PT_VCAM1) in DKD samples. PT_VCAM1 has a pro-inflammatory expression signature and transcription factor motif enrichment implicated NFkB signaling. We used stratified linkage disequilibrium score regression to partition heritability of kidney-function-related traits using publicly-available GWAS summary statistics. Cell-specific PT_VCAM1 peaks were enriched for heritability of chronic kidney disease (CKD), suggesting that genetic background may regulate chromatin accessibility and DKD progression. snATAC-seq found cell-specific differentially accessible regions (DAR) throughout the nephron that change accessibility in DKD and these regions were enriched for glucocorticoid receptor (GR) motifs. Changes in chromatin accessibility were associated with decreased expression of insulin receptor, increased gluconeogenesis, and decreased expression of the GR cytosolic chaperone, FKBP5, in the diabetic proximal tubule. Cleavage under targets and release using nuclease (CUT&RUN) profiling of GR binding in bulk kidney cortex and an in vitro model of the proximal tubule (RPTEC) showed that DAR co-localize with GR binding sites. CRISPRi silencing of GR response elements (GRE) in the FKBP5 gene body reduced FKBP5 expression in RPTEC, suggesting that reduced FKBP5 chromatin accessibility in DKD may alter cellular response to GR. We developed an open-source tool for single cell allele specific analysis (SALSA) to model the effect of genetic background on gene expression. Heterozygous germline single nucleotide variants (SNV) in proximal tubule ATAC peaks were associated with allele-specific chromatin accessibility and differential expression of target genes within cis-coaccessibility networks. Partitioned heritability of proximal tubule ATAC peaks with a predicted allele-specific effect was enriched for eGFR, suggesting that genetic background may modify DKD progression in a cell-specific manner.",
"data_reference": "wilson2022multimodal",
"data_url": "https://cellxgene.cziscience.com/collections/b3e2c6e3-9b05-4da9-8f42-da38a664b45b",
"date_created": "21-01-2025",
"file_size": 159201224
},
{
"dataset_id": "cellxgene_census/tabula_sapiens",
"dataset_name": "Tabula Sapiens",
"dataset_summary": "A multiple-organ, single-cell transcriptomic atlas of humans",
"dataset_description": "Tabula Sapiens is a benchmark, first-draft human cell atlas of nearly 500,000 cells from 24 organs of 15 normal human subjects. This work is the product of the Tabula Sapiens Consortium. Taking the organs from the same individual controls for genetic background, age, environment, and epigenetic effects and allows detailed analysis and comparison of cell types that are shared between tissues. Our work creates a detailed portrait of cell types as well as their distribution and variation in gene expression across tissues and within the endothelial, epithelial, stromal and immune compartments.",
"data_reference": "consortium2022tabula",
"data_url": "https://cellxgene.cziscience.com/collections/e5f58829-1a66-40b5-a624-9046778e74f5",
"date_created": "13-01-2025",
"file_size": 1727821930
"date_created": "21-01-2025",
"file_size": 24760512
},
{
"dataset_id": "cellxgene_census/hypomap",
@@ -56,7 +46,17 @@
"dataset_description": "The hypothalamus plays a key role in coordinating fundamental body functions. Despite recent progress in single-cell technologies, a unified catalogue and molecular characterization of the heterogeneous cell types and, specifically, neuronal subtypes in this brain region are still lacking. Here we present an integrated reference atlas “HypoMap” of the murine hypothalamus consisting of 384,925 cells, with the ability to incorporate new additional experiments. We validate HypoMap by comparing data collected from SmartSeq2 and bulk RNA sequencing of selected neuronal cell types with different degrees of cellular heterogeneity.",
"data_reference": "steuernagel2022hypomap",
"data_url": "https://cellxgene.cziscience.com/collections/d86517f0-fa7e-4266-b82e-a521350d6d36",
"date_created": "13-01-2025",
"file_size": 23568346
"date_created": "21-01-2025",
"file_size": 35356034
},
{
"dataset_id": "cellxgene_census/mouse_pancreas_atlas",
"dataset_name": "Mouse Pancreatic Islet Atlas",
"dataset_summary": "Mouse pancreatic islet scRNA-seq atlas across sexes, ages, and stress conditions including diabetes",
"dataset_description": "To better understand pancreatic β-cell heterogeneity we generated a mouse pancreatic islet atlas capturing a wide range of biological conditions. The atlas contains scRNA-seq datasets of over 300,000 mouse pancreatic islet cells, of which more than 100,000 are β-cells, from nine datasets with 56 samples, including two previously unpublished datasets. The samples vary in sex, age (ranging from embryonic to aged), chemical stress, and disease status (including T1D NOD model development and two T2D models, mSTZ and db/db) together with different diabetes treatments. Additional information about data fields is available in anndata uns field 'field_descriptions' and on https://github.com/theislab/mm_pancreas_atlas_rep/blob/main/resources/cellxgene.md.",
"data_reference": "hrovatin2023delineating",
"data_url": "https://cellxgene.cziscience.com/collections/296237e2-393d-4e31-b590-b03f74ac5070",
"date_created": "21-01-2025",
"file_size": 453031863
}
]
164 changes: 90 additions & 74 deletions results/label_projection/data/method_info.json

Large diffs are not rendered by default.

1,576 changes: 830 additions & 746 deletions results/label_projection/data/metric_execution_info.json

Large diffs are not rendered by default.

32 changes: 16 additions & 16 deletions results/label_projection/data/metric_info.json
Original file line number Diff line number Diff line change
@@ -8,10 +8,10 @@
"metric_description": "The percentage of correctly predicted labels.",
"references_doi": "10.48550/arxiv.2008.05756",
"references_bibtex": null,
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/7dc145b8f5d3a63fa4c7502f017733e28a0616c2/src/metrics/accuracy",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/accuracy:build_multiple_updates",
"code_version": "build_multiple_updates",
"commit_sha": "7dc145b8f5d3a63fa4c7502f017733e28a0616c2",
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/b922d8db85e11cc822442fb7f028e2ead1b52060/src/metrics/accuracy",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/accuracy:build_main",
"code_version": "build_main",
"commit_sha": "b922d8db85e11cc822442fb7f028e2ead1b52060",
"maximize": true
},
{
@@ -23,10 +23,10 @@
"metric_description": "Calculates the F1 score for each label, and find their average weighted by support (the number of true instances for each label). This alters 'macro' to account for label imbalance; it can result in an F-score that is not between precision and recall.",
"references_doi": "10.48550/arxiv.2008.05756",
"references_bibtex": null,
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/7dc145b8f5d3a63fa4c7502f017733e28a0616c2/src/metrics/f1",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/f1:build_multiple_updates",
"code_version": "build_multiple_updates",
"commit_sha": "7dc145b8f5d3a63fa4c7502f017733e28a0616c2",
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/b922d8db85e11cc822442fb7f028e2ead1b52060/src/metrics/f1",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/f1:build_main",
"code_version": "build_main",
"commit_sha": "b922d8db85e11cc822442fb7f028e2ead1b52060",
"maximize": true
},
{
@@ -38,10 +38,10 @@
"metric_description": "Calculates the F1 score for each label, and find their unweighted mean. This does not take label imbalance into account.",
"references_doi": "10.48550/arxiv.2008.05756",
"references_bibtex": null,
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/7dc145b8f5d3a63fa4c7502f017733e28a0616c2/src/metrics/f1",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/f1:build_multiple_updates",
"code_version": "build_multiple_updates",
"commit_sha": "7dc145b8f5d3a63fa4c7502f017733e28a0616c2",
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/b922d8db85e11cc822442fb7f028e2ead1b52060/src/metrics/f1",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/f1:build_main",
"code_version": "build_main",
"commit_sha": "b922d8db85e11cc822442fb7f028e2ead1b52060",
"maximize": true
},
{
@@ -53,10 +53,10 @@
"metric_description": "Calculates the F1 score globally by counting the total true positives, false negatives and false positives.",
"references_doi": "10.48550/arxiv.2008.05756",
"references_bibtex": null,
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/7dc145b8f5d3a63fa4c7502f017733e28a0616c2/src/metrics/f1",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/f1:build_multiple_updates",
"code_version": "build_multiple_updates",
"commit_sha": "7dc145b8f5d3a63fa4c7502f017733e28a0616c2",
"implementation_url": "https://github.com/openproblems-bio/task_label_projection/blob/b922d8db85e11cc822442fb7f028e2ead1b52060/src/metrics/f1",
"image": "https://ghcr.io/openproblems-bio/task_label_projection/metrics/f1:build_main",
"code_version": "build_main",
"commit_sha": "b922d8db85e11cc822442fb7f028e2ead1b52060",
"maximize": true
}
]
924 changes: 547 additions & 377 deletions results/label_projection/data/quality_control.json

Large diffs are not rendered by default.

2,598 changes: 1,503 additions & 1,095 deletions results/label_projection/data/results.json

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion results/label_projection/data/task_info.json
Original file line number Diff line number Diff line change
@@ -31,6 +31,6 @@
}
}
],
"version": "build_multiple_updates",
"version": "build_main",
"license": "MIT"
}

0 comments on commit a78daa2

Please sign in to comment.