This workflow processes high-throughput sequencing data for downstream processing

Requirements/expectations :

  • Human whole-genome pair-end sequencing data in unmapped BAM (uBAM) format
  • One or more read groups, one per uBAM file, all belonging to a single sample (SM)
  • Input uBAM files must additionally comply with the following requirements:
    • filenames all have the same suffix (we use “.unmapped.bam”)
    • files must pass validation by ValidateSamFile
    • reads are provided in query-sorted

This is a companion discussion topic for the original entry at