github.com/matthewse19/cSplotch-workflow/runcSplotch

This workflow runs the cSplotch model independently for each gene. The model is parallelized by running multiple VMs independetly, set by the max_concurrent_vms input parameter. It is recommendend to run a few genes first, (i.e. set splotch_gene_idxs to [1, 2, 3, 4, 5]) to estimate the minimum amount of memory and disk size required, and the amount of time each gene will take. The amount of time for each gene can be reduced be lowering the number of samples (175 is the lowest one should go)


This is a companion discussion topic for the original entry at github.com/matthewse19/cSplotch-workflow/runcSplotch