Skip to content

bethan-mallabar-rimmer/epicv2refine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 

Repository files navigation

epicv2refine

This function deals with replicate/duplicate probes on the EPICv2 array in two ways. For each group of replicates:

  1. Probes mapped to "chromosome 0" in the Illumina manifest, and probes labelled as inferior (due to reduced sensitivity and precision) by Peters et al., BMC Genomics 2024, are removed. Often this leaves only one 'superior' replicate probe in the group.
  2. If there is more than one probe left in the group after removing inferior probes*, mean and variance are calculated across the remaining probes. For each sample, beta values of remaining replicate probes are replaced by mean of beta values. E.g.

    Before:

Probe Name Sample 1 Sample 2
cg00002033_TC11 0.6533183 0.4650523
cg00002033_TC12 0.6440894 0.4638761

After:
Probe Name Sample 1 Sample 2
cg00002033_TC11 0.6487038 0.4644642

High-variance probes
By default, if probes in the replicate group have unusually high variance in beta values compared to other replicate groups (outliers defined as variance > Q3 + 1.5*IQR, same as in a box plot) then all probes in the replicate group will be removed. Alternatively, user can set remove_high_var = FALSE, to keep and average these high-variance replicates as shown above.

*Reasons for this include that one probe may have the best sensitivity while another has the best precision, group mean may be superior to any one probe, or probes may have no inferior/superior label due to insufficient evidence.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages