| This script removes duplicate paired-end reads from a pair of input fastq files and prints out unique reads to a pair of output fastq files. Read-pairs must have the exact same sequence to be called duplicates, quality scores are ignored. The top N (default 20) most duplicated sequences are printed out in fasta format, making it convenient for using BLAST to identify them. |
hg clone https://toolshed.g2.bx.psu.edu/repos/jgarbe/redup
| Name | Description | Version | Minimum Galaxy Version |
|---|---|---|---|
| Remove exact duplicate reads from paired-end fastq files | 1.0 | any | |