Update configure maf docstring #578

Merged
merged 2 commits into from
Oct 2, 2024
Changes from 1 commit
9 changes: 9 additions & 0 deletions genie/database_to_staging.py
@@ -272,6 +272,15 @@ def get_whitelist_variants_idx(mafdf):
# TODO: Add to transform.py
def configure_maf(mafdf, remove_variants, flagged_variants):
    """Configures each maf dataframe, does germline filtering

    Germline filtering for MAF files uses the gnomAD columns that hold the
    allele frequencies (AF) of variants in different population groups
    from gnomAD (the Genome Aggregation Database). This filter removes
    variants with a maximum AF > 0.05% across all populations, which are typically
    common germline variants.

    Germline filtering for MAF files occurs during release instead of during processing
    because the MAF file gets re-annotated during processing via Genome Nexus annotation.

    Args:
        mafdf: Maf dataframe
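
The filter described in the docstring can be illustrated with a small pandas sketch. This is not the code from `configure_maf`. The helper name `flag_common_germline`, the column-name pattern (`gnomAD_*_AF`), and the example rows are assumptions made for illustration. The only detail taken from the docstring is the 0.05% threshold, written here as the fraction 0.0005.

```python
import pandas as pd

# 0.05% expressed as a fraction (threshold taken from the docstring).
MAX_AF_THRESHOLD = 0.0005


def flag_common_germline(
    mafdf: pd.DataFrame, threshold: float = MAX_AF_THRESHOLD
) -> pd.Series:
    """Return a boolean mask, True where the maximum gnomAD AF exceeds threshold.

    Hypothetical helper: assumes population AF columns are named like
    "gnomAD_<POP>_AF". The real column names may differ.
    """
    af_cols = [
        col
        for col in mafdf.columns
        if col.startswith("gnomAD_") and col.endswith("_AF")
    ]
    if not af_cols:
        # No gnomAD columns present, so nothing can be flagged.
        return pd.Series(False, index=mafdf.index)
    # Coerce to numeric so blank or malformed values become NaN.
    af = mafdf[af_cols].apply(pd.to_numeric, errors="coerce")
    # max(axis=1) skips NaN; a row with no AF values yields NaN,
    # and NaN > threshold evaluates to False, so such rows are kept.
    return af.max(axis=1) > threshold


# Made-up example rows, for illustration only.
example = pd.DataFrame(
    {
        "Hugo_Symbol": ["TP53", "KRAS", "BRCA2"],
        "gnomAD_AFR_AF": [0.0001, 0.02, None],
        "gnomAD_NFE_AF": [0.0002, 0.0001, None],
    }
)
mask = flag_common_germline(example)
filtered = example[~mask]
```

In this sketch the second row is dropped because its largest population AF (0.02) is above the threshold, while a row with no gnomAD annotation is kept. Whether unannotated variants should be kept is a design choice of the sketch and is not stated in the docstring.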