github.com/DataBiosphere/topmed-workflows/CCDG_aligner_functional_equivalent_cwl

system · October 3, 2019, 4:03pm

This workflow prepares high-throughput sequencing data for downstream processing.

Requirements/expectations:

Human whole-genome paired-end sequencing data in unmapped BAM (uBAM) format
One or more read groups, one per uBAM file, all belonging to a single sample (SM)
Input uBAM files must additionally comply with the following requirements:
- filenames all have the same suffix (we use “.unmapped.bam”)
- files must pass validation by ValidateSamFile
- reads are provided in query-sorted order
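
Two of the requirements above can be checked before submitting a run. The following is a minimal sketch, not part of the workflow itself: the function names are hypothetical, and it assumes you have already obtained each file's header as text (for example with `samtools view -H`). It checks only the filename suffix and the sort order declared on the `@HD` header line; it does not replace ValidateSamFile.

```python
from pathlib import PurePath

# Assumption: the suffix quoted in the requirements above.
EXPECTED_SUFFIX = ".unmapped.bam"


def files_missing_suffix(paths, suffix=EXPECTED_SUFFIX):
    """Return the input paths whose filename does not end with `suffix`."""
    return [p for p in paths if not PurePath(p).name.endswith(suffix)]


def header_is_query_sorted(header_text):
    """Return True if the SAM header's @HD line declares SO:queryname."""
    for line in header_text.splitlines():
        if line.startswith("@HD"):
            # Header fields are tab-separated TAG:VALUE pairs.
            fields = line.split("\t")[1:]
            return "SO:queryname" in fields
    # No @HD line means the sort order is undeclared.
    return False


if __name__ == "__main__":
    # Hypothetical inputs, for illustration only.
    inputs = ["sample_rg1.unmapped.bam", "sample_rg2.bam"]
    print(files_missing_suffix(inputs))
    print(header_is_query_sorted("@HD\tVN:1.6\tSO:queryname"))
```

Note that the header only declares a sort order; whether the reads actually follow it is something a validator such as ValidateSamFile has to confirm.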