Link to Cell Reports Medicine article
This Jupyter Notebook performs the initial steps in the analysis of aberrant bowel movement frequencies (BMF) and their correlation with gut microbiome and organ function. It includes the following tasks:
Imports raw data from Arivale snapshots and performs necessary cleaning, filtering, and transformation steps.
Defines criteria for BMF and identifies cohorts of individuals based on their bowel movement patterns.
Saves BMF cohort data into separate CSV files (e.g., asvs.csv
) for subsequent analyses.
Calculates and reports basic descriptive statistics for each BMF cohort, such as mean age, gender distribution, and other relevant metrics.
Raw data from Arivale snapshots, containing relevant health and lifestyle information for each individual.
CSV files containing data for each identified BMF cohort.
- gender,age,BMI_CALC,vendor_dashboard,eGFR,CRP,A1C,LDL,PC[1-3],taxa_[taxonomic_classification],[metabolite_IDs],[clinical_chemistries],etc.
:
List of parameters spanning the multi-omic data analyses needs and their BMF subcohorts (e.g., metabolomics).
- Python
- pandas
- numpy
- seaborn
Ensure that the required Python packages (pandas, numpy, etc.) and dependencies are installed and loaded from the first few cells.
Load the arivale_snapshot
path within the notebook.
Execute the notebook cells sequentially or in desired chunks to complete the data import, preprocessing, cohort identification, and descriptive statistics calculation.
Examine the generated CSV files and descriptive statistics summary.
This Jupyter Notebook investigates the relationship between BMF-associated metabolites and kidney function (estimated glomerular filtration rate - eGFR). It utilizes data from Arivale snapshots and the results of LIMMA regressions to perform an OLS regression analysis, outputting statistical summaries and plots.**
- See base_final.ipynb for similar instructions and dependencies. Run the notebook to execute the analysis and generate the output files.
This collection of R scripts performs statistical analyses on the preprocessed data generated by the Jupyter Notebooks. They leverage various R packages (e.g., bioconductor, tidyverse) to conduct regressions, including CORNCOB, LIMMA, and POLR. The scripts output graphical visualizations and summary statistics to aid in interpretation of the results.
- CSV files generated by the Jupyter Notebooks (e.g., BMF cohort data, metabolomics data, eGFR data).
- R
- bioconductor
- tidyverse
- CORNCOB
- LIMMA
- polr
Ensure that the required R packages are installed and loaded.
Adjust file paths in the scripts to match your directory structure.
Execute the scripts in R or RStudio to perform the analyses and generate the outputs.