R/imputationPipeline.R
runBeagle4pt1GL.Rd
Beagle4.1 is slower than Beagle5 by far. However, it can use genotype-likelihoods (the GL VCF field) for its algorithm, which is potentially more accurate. In cases like imputing GBS data, or if you have time to wait around, this should be better than Beagle5.0 for a first pass imputation of observed sites. Impute with Beagle V5.0. Use an "imputation reference panel". Refer to Beagle documentation for meaning of arguments passed.
runBeagle4pt1GL(
targetVCF,
refVCF,
mapFile,
outName,
nthreads,
maxmem = "500g",
impute = TRUE,
ne = 1e+05,
samplesToExclude = NULL,
niter = 10
)
passes to Beagle `gl=targetVCF
`
passes to Beagle `ref=targetVCF
`
passes to Beagle `map=mapFile
`
passes to Beagle `out=outName
(don't put file
suffix, Beagle adds *.vcf.gz
).
passes to Beagle `nthreads=nthreads
`
passes to java `-Xmx<maxmem>
`
passes to Beagle `impute=TRUE
`
passes to Beagle `ne=ne
`
passes to Beagle `niterations=niter
`
NOTICE: This function is part of a family of functions ("imputation_functions"
) developed as part of the NextGen Cassava Breeding Project genomic selection pipeline.
For some examples of their useage:
Other imputation_functions:
convertDart2vcf()
,
convertVCFtoDosage()
,
createGenomewideDosage()
,
filter_positions()
,
mergeVCFs()
,
postImputeFilterBeagle4pt1()
,
postImputeFilter()
,
runBeagle5()
,
splitVCFbyChr()