Long-running jobs are vulnerable to early termination from
maintanance or power outages. We recommend chopping your analyses
into smaller chunks. This also offers the advantage of running jobs
in parallel. This function builds a plan that roughly splits the
whole analysis into equal amounts of work.
Usage
buildAnalysesPlan(snpData, sliceSize)
Arguments
snpData
a pathway to a file containing GWAS data. The data can be
in a variety of forms, such as standard PLINK format (bed/bim/fam),
PLINK2 format (pgen/pvar/psam), Oxford format (bgen/sample), or CSV
format (csv format in much slower due to the lack of compression
for non-binary files).
sliceSize
number of SNPs to analyze per job
Value
Returns a data.frame with one job specification per row with the following columns: