Mercurial > repos > peterjc > fastq_filter_by_id
comparison tools/fastq/fastq_filter_by_id.xml @ 1:b79caa511ba2
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
author | peterjc |
---|---|
date | Tue, 07 Jun 2011 17:23:49 -0400 |
parents | 10e963c79a45 |
children | d570cc324779 |
comparison
equal
deleted
inserted
replaced
0:10e963c79a45 | 1:b79caa511ba2 |
---|---|
1 <tool id="fastq_filter_by_id" name="Filter FASTQ by ID" version="0.0.2"> | 1 <tool id="fastq_filter_by_id" name="Filter FASTQ by ID" version="0.0.3"> |
2 <description>from a tabular file</description> | 2 <description>from a tabular file</description> |
3 <command interpreter="python"> | 3 <command interpreter="python"> |
4 fastq_filter_by_id.py $input_tabular $columns $input_fastq | 4 fastq_filter_by_id.py $input_tabular $columns $input_fastq |
5 #if $output_choice_cond.output_choice=="both" | 5 #if $output_choice_cond.output_choice=="both" |
6 $output_pos $output_neg | 6 $output_pos $output_neg |
58 | 58 |
59 By default it divides a FASTQ file in two, those sequences with or without an | 59 By default it divides a FASTQ file in two, those sequences with or without an |
60 ID present in the tabular file column(s) specified. You can opt to have a | 60 ID present in the tabular file column(s) specified. You can opt to have a |
61 single output file of just the matching records, or just the non-matching ones. | 61 single output file of just the matching records, or just the non-matching ones. |
62 | 62 |
63 Note that the order of sequences in the original FASTA file is preserved. | 63 Note that the order of sequences in the original FASTQ file is preserved. |
64 Also, if any sequences share an identifier, duplicates are not removed. | 64 Also, if any sequences share an identifier, duplicates are not removed. |
65 | 65 |
66 **Example Usage** | 66 **Example Usage** |
67 | 67 |
68 You may have performed some kind of contamination search, for example running | 68 You may have mapped your reads against a reference genome, and thus generated |
69 BLASTN against a database of cloning vectors or bacteria, giving you a tabular | 69 a tabular file of the mapped reads. You could use this tool to divide the reads |
70 file containing read identifiers. You could use this tool to extract only the | 70 into those which map onto the genome, and those which don't. |
71 reads without BLAST matches (i.e. those which do not match your contaminant | |
72 database). | |
73 | 71 |
74 </help> | 72 </help> |
75 </tool> | 73 </tool> |