R/imputationPipeline.R
postImputeFilter.RdRemove markers from the imputed VCF based on adjustable thresholds. For VCFs imputed by Beagle5
postImputeFilter(
inPath = NULL,
inName,
outPath = NULL,
outName,
DR2thresh = 0.75,
HWEthresh = 1e-20,
MAFthresh = 0.005
)path to input VCFs
imputed input VCF name
path for output to be written to
name desired, don't add file suffix, will be *.vcf.gz
remove if imputation quality score DR2 supplied in the VCF INFO field by Beagle is less than or equal to the threshold
remove if p-value HWE from vcftools --hardy less than threshold, smaller p-value means more departure from HWE
remove if minor allele frequency less than threshold
Uses vcftools and R.
of minor allee frequency MAFthresh, p-value measuring likelihood of divergence from hardy-weinberg equilibrium ()
NOTICE: This function is part of a family of functions ("imputation_functions") developed as part of the NextGen Cassava Breeding Project genomic selection pipeline.
For some examples of their useage:
Other imputation_functions:
convertDart2vcf(),
convertVCFtoDosage(),
createGenomewideDosage(),
filter_positions(),
mergeVCFs(),
postImputeFilterBeagle4pt1(),
runBeagle4pt1GL(),
runBeagle5(),
splitVCFbyChr()