Modules to perform genetic testing for human genomes or SNPs data.
Ancestry test is done based on 2 approaches: admix and AncestryMapper.
admix calculates the ancestry composition
(e.g. 48.08% North European, 17.77% Mediterranean, 10.21% East African, etc.) for input genome based on different public admixture models. The algorithm is based on the maximum likelihood estimation.
AncestryMapper estimates the similarity of the input genome to several reference ethnic groups by computing the genetic distances
based on the Principal component analysis (PCA) procedure. This apporach will not give the ancestry composition as the output like admix, but a sorted list from the most similar ethnic group to the most distance one together with the calculated genetic distances.
(Definition from wiki: Imputation in genetics refers to the statistic reference of unobserved genotypes. It is achieved by using known haplotypes in a population, for instance from the HapMap or the 1000 Genomes Project in humans, thereby allowing to test for association between a trait of interest (e.g. a disease) and experimentally untyped genetic variants, but whose genotypes have been statistically inferred ("imputed"). Genotype imputation is usually performed on SNP, the most common kind of genetic variation.)
This imputation module is adapted and modified from the source code of www.impute.me. The imputation is carried out using the tool impute2. After that, genetic tests are performed by calculating the genetic risk scores for the individual with different traits including several complex diseases, autoimmune diseases, breast cancer, drug response, precision medicine (personal relevant diseases), rare diseases, leukemia, public traits from the UK biobank, as well as the ethnicity, genetic height and hair color, traits related to intelligence, athletics, microbiome and gene mutations.