This workflow processes high-throughput sequencing data for downstream analysis.
Requirements/expectations:
- Human whole-genome paired-end sequencing data in unmapped BAM (uBAM) format
- One or more read groups, one per uBAM file, all belonging to a single sample (SM)
- Input uBAM files must additionally comply with the following requirements:
  - filenames all have the same suffix (we use “.unmapped.bam”)
  - files must pass validation by ValidateSamFile
  - reads are provided in query-sorted order
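
As a rough illustration of the requirements above, here is a minimal Python sketch of a pre-flight check. It is not part of the workflow and does not replace ValidateSamFile; it only inspects SAM header text for the sort order, the read-group count per file, and the sample name. The function names (parse_header, check_inputs) are hypothetical, and the sketch assumes you have already extracted each file's header as text (for example with `samtools view -H`).

```python
EXPECTED_SUFFIX = ".unmapped.bam"  # suffix mentioned above; adjust if yours differs


def parse_header(header_text):
    """Return (sort_order, read_groups) parsed from SAM header text.

    sort_order is the SO value of the @HD line (or None if absent);
    read_groups is a list of dicts, one per @RG line.
    """
    sort_order = None
    read_groups = []
    for line in header_text.splitlines():
        fields = line.split("\t")
        # Header fields after the record type look like TAG:VALUE.
        tags = dict(f.split(":", 1) for f in fields[1:] if ":" in f)
        if fields[0] == "@HD":
            sort_order = tags.get("SO")
        elif fields[0] == "@RG":
            read_groups.append(tags)
    return sort_order, read_groups


def check_inputs(headers, suffix=EXPECTED_SUFFIX):
    """Check a dict of {uBAM filename: header text}; return a list of problems."""
    problems = []
    samples = set()
    for name, text in headers.items():
        if not name.endswith(suffix):
            problems.append(f"{name}: filename does not end with {suffix}")
        sort_order, read_groups = parse_header(text)
        if sort_order != "queryname":
            problems.append(f"{name}: sort order is {sort_order}, expected queryname")
        if len(read_groups) != 1:
            problems.append(f"{name}: expected 1 read group, found {len(read_groups)}")
        samples.update(rg.get("SM") for rg in read_groups)
    if len(samples) > 1:
        problems.append(f"inputs span more than one sample: {sorted(map(str, samples))}")
    return problems


if __name__ == "__main__":
    # Made-up header for demonstration only.
    demo = "@HD\tVN:1.6\tSO:queryname\n@RG\tID:rg1\tSM:sample1\n"
    print(check_inputs({"rg1.unmapped.bam": demo}))
```

An empty list means none of these particular checks failed; the files still need to pass ValidateSamFile separately.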
This is a companion discussion topic for the original entry at github.com/DataBiosphere/topmed-workflows/CCDG_aligner_functional_equivalent_cwl