R/imputationPipeline.R
postImputeFilter.Rd
Remove markers from the imputed VCF based on adjustable thresholds. For VCFs imputed by Beagle5
postImputeFilter(
inPath = NULL,
inName,
outPath = NULL,
outName,
DR2thresh = 0.75,
HWEthresh = 1e-20,
MAFthresh = 0.005
)
path to input VCFs
imputed input VCF name
path for output to be written to
name desired, don't add file suffix, will be *.vcf.gz
remove if imputation quality score DR2
supplied in the VCF INFO field by Beagle is less than or equal to the threshold
remove if p-value HWE from vcftools --hardy
less than threshold, smaller p-value means more departure from HWE
remove if minor allele frequency less than threshold
Uses vcftools and R.
of minor allee frequency MAFthresh
, p-value measuring likelihood of divergence from hardy-weinberg equilibrium ()
NOTICE: This function is part of a family of functions ("imputation_functions"
) developed as part of the NextGen Cassava Breeding Project genomic selection pipeline.
For some examples of their useage:
Other imputation_functions:
convertDart2vcf()
,
convertVCFtoDosage()
,
createGenomewideDosage()
,
filter_positions()
,
mergeVCFs()
,
postImputeFilterBeagle4pt1()
,
runBeagle4pt1GL()
,
runBeagle5()
,
splitVCFbyChr()