Description

Locate and tag duplicate reads in a BAM file

Input

name:type
description
pattern

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

reads

:file

Sequence reads file, can be SAM/BAM/CRAM format

*.{bam,cram,sam}

meta2

:map

Groovy Map containing reference information e.g. [ id:‘genome’ ]

fasta

:file

Reference genome fasta file, required for CRAM input

*.{fasta,fa}

fai

:file

Reference genome fasta index

*.{fai}

Output

name:type
description
pattern

bam

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bam

:file

BAM file with duplicate reads marked/removed

*.{bam}

bai

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bai

:file

An optional BAM index file. If desired, —CREATE_INDEX must be passed as a flag

*.{bai}

cram

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.cram

:file

Output CRAM file

*.{cram}

metrics

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.metrics.txt

:file

Duplicate metrics file generated by picard

*.{metrics.txt}

versions_picard

${task.process}

:string

The process the versions were collected from

picard

:string

The tool name

picard MarkDuplicates --version 2>&1 | sed -n 's/.*Version://p'

:string

The command used to generate the version of the tool

Topics

name:type
description
pattern

versions

${task.process}

:string

The process the versions were collected from

picard

:string

The tool name

picard MarkDuplicates --version 2>&1 | sed -n 's/.*Version://p'

:string

The command used to generate the version of the tool

Tools

picard
MIT

A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.