All piplines will require a GWAS object or list of objects, they have the following properties
file
: Full path to file. Mandatoryancestry
: one ofAFR, AMR, EAS, EUR, SAS
. Mandatory for some pipelinesbuild
: one ofGRCh36, GRCh37, GRCh38
. Default:GRCh37
columns
: object of column name maps (or string of predefined map). Explained belowpopulate_rsid
: boolean value. Populates RSID column if it doesn't exist. Defaultfalse
GWAS Columns:
With each GWAS file, you can specify column names ex. {"P":"pval", ...}
, if you do not specify header names it will assume your GWAS has default names
Default names : SNP, CHR, BP, EA, OA, EAF, BETA, SE, OR, P, LOG_P, Z, OR_LB, OR_UB, RSID, N, N_CASES, ENSEMBL_ID, GENE_NAME
- Mandatory Columns:
CHR, BP, EA, OA
Effect Column Options. One of these sets are mandatory:BETA, SE
OR, OR_LB, OR_UB
Z
- P-value Column:
P or LOG_P
Alternatively, columns
accepts a string of some of the more common output formats (ex: metal
, gwama
). There are also a list of predefined common column maps here.
All pipelines will standardise each GWAS before running the subsequent steps. The SNP
field will be recalculated as CHR:POS_EA_OA
, where EA and OA are ordered alphabetically, and the subsequent BETA and EAF will be adjusted accordingly
n GWAS objects
: See above for GWAS Object explanationoutput
:build
: one of the supported reference builds. Default:GRCh37
columns
: same format as input columns. Either a object {} or predefined map string
2 GWAS objects
: Incident and Subsequent GWAS. See above for GWAS Object explanationplink_clump_arguments
: arguments that are fed into theplink --clump
call. Options hereoutput
: 2 fields inadjusted_gwas
, wheretype
can beslopehunter
,cwls
, ormr_ivw
, andp_val
can be any of0.1, 0.01, 0.001, 1e-5
n GWAS objects
: See above for GWAS Object explanation- Please also explicitly include
N
in the GWAS object, for use in the LDSC tool
- Please also explicitly include
plink_clump_arguments
: arguments that are fed into theplink --clump
call. Options here
1 GWAS object
: See above for GWAS Object explanationqtl
:dataset
: see below for optionssubcategories
: list of subcategories to run, see below for options- If left empty, all subcategories will be run
exposures
: list of specific exposures to be run. If left empty all exposures will be run in the dataset- example:
["IL6", "CCL2"]
- example:
Available Datasets and Subcategories:
- dataset:
metabrain
- subcategories:
basalganglia, cerebellum, cortex, hippocampus, spinalcord
- subcategories:
- dataset:
eqtlgen
- subcategories:
cis
(future:trans
)
- subcategories: