Identifying regional relatedness of African-Americans and European-Americans across the United States
MATHEMATICA code to
- read, efficiently store, analyze, and visualize genomic relatedness of African-Americans and European-Americans across the United States
- read, store, model, and visualize relatedness of individuals across the United States based on the data from US Census Bureau (data avilable from IPUMS)
- assess the correlation between genomic data and census data
Results are available in a preprint article.
- I have removed the (hard-coded) paths to the input data files (which are hosted on our clusters) from this project for security reasons. So, this code, in its current state on GitHub, will not work! However, it is shared publicly to show the steps I took to solve the specific problems discussed in our publication.
- The code is not the most elegantly written code, and there are some parts that intentionally have not been refactored to keep different parts of the project physically separate from each other.
- I will add my other scripts (corresponding to other parts of the project) in the near future.