Code for major analyses performed in population genomic analyses and genome-environment association (GEA) analyses on Drosophila suzukii.
Related publication:
Genomic Diversity Illuminates the Environmental Adaptation of Drosophila suzukii
Siyuan Feng, Samuel P. DeGrey, Christelle Guédot, Sean D. Schoville, John E. Pool
bioRxiv 2023.07.03.547576; doi: https://doi.org/10.1101/2023.07.03.547576
- Identify autosomal and X-linked contigs
- Annotate genomic features
- Calculate sequence divergence between two species
- Estimate genome-wide and synonymous nucleotide diversity, and illustrate window diversity across chromosome arms
- Estimate genome-wide and synonymous FST
- Estimate genome-wide and synonymous DXY
- Perform PCA on allele frequencies
- Build population tree and infer admixture from allele frequency
- Perform genome-environment association (GEA) analysis
- Identify environment-associated genes
- Perform GO enrichment of the top environment-associated genes
To set up the environment for above analyses, you could use conda:
conda env create -n environemnt_name --file environemnt_name.yml
Environment needed for each analysis is indicated in documentation of each analysis. Environmental package lists (environemnt_name.yml
) for cross-platform install could be found under this directory.
If you have not installed conda, run the following command:
# download miniconda
curl -sL \
"https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh" > \
"Miniconda3.sh"
# install miniconda
bash Miniconda3.sh