annotate squirrel-qc.xml @ 3:d8d5380422c1 draft default tip

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/main/tools/squirrel commit 68ea06281ba78b828b8d1947745161697dc0fbc5
author iuc
date Tue, 10 Jun 2025 13:32:36 +0000
parents cd668d67431e
children
rev   line source
0
f63d24309f49 planemo upload for repository https://github.com/aineniamh/squirrel commit ed19e40212d1e6651efb3a032d1170f4fd03b989
iuc
parents:
diff changeset
1 <tool id="squirrel_qc" name="Squirrel QC" version="@TOOL_VERSION@+galaxy@VERSION_SUFFIX@" profile="21.05">
2 <description>QC of MPXV (Mpox virus) sequences</description>
3 <macros>
4 <import>macros.xml</import>
5 </macros>
1
cd668d67431e planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/main/tools/squirrel commit d684b71bf5129645fe8eb349a56fcb29c321a7ab
iuc
parents: 0
diff changeset
6 <expand macro="xrefs"/>
0
7 <expand macro="requirements"/>
8 <expand macro="version_command"/>
9
10 <command detect_errors="exit_code"><![CDATA[
11 #set $mask_output = 'input.suggested_mask.csv'
12 #set $exclude_output = 'suggested_to_exclude.csv'
13
14 ln -s '${sequences}' input.fasta &&
15
16 squirrel
17 --seq-qc
18 --clade '$clade'
19
20 --threads \${GALAXY_SLOTS:-1}
21
22 input.fasta &&
23
24 mv '${mask_output}' '$mask' &&
25 mv '${exclude_output}' '$exclude'
26 ]]></command>
27
28 <inputs>
29 <param name="sequences"
30 type="data"
31 format="fasta"
32 label="Sequences in FASTA format" />
33 <param name="clade"
34 type="select"
35 label="Select MPXV Clade">
36 <option value="cladei">Clade I</option>
37 <option value="cladeia">Clade Ia</option>
38 <option value="cladeib">Clade Ib</option>
39 <option value="cladeii">Clade II</option>
40 <option value="cladeiia">Clade IIa</option>
41 <option value="cladeiib">Clade IIb</option>
42 </param>
43
44 </inputs>
45
46 <outputs>
47 <!-- standard outputs-->
48 <data name="mask" format="csv" label="${tool.name} - flagged mutations to mask" />
49 <data name="exclude" format="csv" label="${tool.name} - flagged sequences to exclude" />
50 </outputs>
51
52 <tests>
53 <test expect_num_outputs="2">
54 <param name="sequences" value="test-sequences.fasta" />
55 <param name="clade" value="cladeii" />
56 <output name="mask" file="sequences.suggested_mask.csv" />
57 <output name="exclude" file="suggested_to_exclude.csv" />
58 </test>
59 </tests>
60 <help><![CDATA[
61 In QC mode, squirrel runs quality control (QC) on the alignment and flags sites that may need to be masked. It can flag potential issues in the MPXV sequences provided for alignment (e.g. SNPs near tracts of N, clusters of unique SNPs, reversions to reference alleles and convergent mutations) and writes them to a mask file for investigation.
62
63 It is recommended that the user inspects these sites in an alignment viewer to judge whether they should be masked.
64
65 Squirrel will check the alignment for:
66
67 - Mutations that are adjacent to N bases
68
69 The rationale is that N sites are usually a product of low-coverage regions. Mutations that occur directly adjacent to low-coverage regions may result from misalignment prior to the low-coverage masking and may not be real SNPs. In squirrel, non-majority alleles that are present next to an N are flagged as potential sites for masking.
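
As an illustration only, the check described above could be sketched in Python as follows. This is not squirrel's implementation: the function names, the dictionary-based alignment, and the reading of "next to an N" as "the immediately neighbouring column is N in the same sequence" are all assumptions made for this example.

```python
from collections import Counter

def majority_alleles(alignment):
    """Most common unambiguous base per column (None if a column has no A/C/G/T)."""
    length = len(next(iter(alignment.values())))
    majority = []
    for i in range(length):
        counts = Counter(seq[i] for seq in alignment.values() if seq[i] in "ACGT")
        majority.append(counts.most_common(1)[0][0] if counts else None)
    return majority

def flag_mutations_next_to_n(alignment):
    """Return sorted (name, 1-based position) pairs for non-majority bases touching an N."""
    majority = majority_alleles(alignment)
    flagged = []
    for name, seq in alignment.items():
        for i, base in enumerate(seq):
            # Skip ambiguous bases and bases that agree with the majority allele.
            if base not in "ACGT" or base == majority[i]:
                continue
            left_is_n = i > 0 and seq[i - 1] == "N"
            right_is_n = i + 1 < len(seq) and seq[i + 1] == "N"
            if left_is_n or right_is_n:
                flagged.append((name, i + 1))
    return sorted(flagged)

# Hypothetical toy alignment: seq3 has a T right after a run of N,
# seq4 has a mutation that is not next to any N.
toy_alignment = {
    "seq1": "ACGTACGT",
    "seq2": "ACGTACGT",
    "seq3": "ACNNTCGT",
    "seq4": "ACGTACTT",
}
print(flag_mutations_next_to_n(toy_alignment))
```

In this toy alignment only the seq3 mutation is reported, because the seq4 mutation has no neighbouring N.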
70
71 - Unique mutations that clump together
72
73 Mutations that are observed in only a single sequence are classed as unique mutations. Mutations do not usually clump closely together, so a clump may suggest an alignment or assembly issue. If the clumped mutations are not shared with any other sequences, they are flagged for masking.
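
A rough Python sketch of this idea is given below. It is not squirrel's code: the `window` and `min_cluster` parameters, their default values, and the definition of "unique" as "a base carried by exactly one sequence in a variable column" are hypothetical choices for illustration.

```python
from collections import Counter

def unique_mutations(alignment):
    """Map each sequence name to the 0-based columns where it alone carries a base."""
    names = list(alignment)
    length = len(alignment[names[0]])
    unique = {name: [] for name in names}
    for i in range(length):
        column = {name: alignment[name][i] for name in names}
        counts = Counter(b for b in column.values() if b in "ACGT")
        if len(counts) < 2:
            continue  # invariant (or fully ambiguous) column
        for name, base in column.items():
            if base in "ACGT" and counts[base] == 1:
                unique[name].append(i)
    return unique

def clumped_unique_mutations(alignment, window=10, min_cluster=3):
    """Return {name: 1-based positions} for unique mutations that sit in a clump.

    A clump is assumed to be at least `min_cluster` unique mutations of one
    sequence that all fall within `window` columns of the first of them.
    """
    flagged = {}
    for name, positions in unique_mutations(alignment).items():
        clumped = set()
        for start in range(len(positions)):
            end = start
            while end + 1 < len(positions) and positions[end + 1] - positions[start] < window:
                end += 1
            if end - start + 1 >= min_cluster:
                clumped.update(positions[start:end + 1])
        if clumped:
            flagged[name] = sorted(p + 1 for p in clumped)
    return flagged

# Hypothetical toy alignment: seqD has three unique mutations close together
# (positions 3, 5, 7) and one isolated unique mutation (position 19).
toy_alignment = {
    "seqA": "ACGTACGTACGTACGTACGT",
    "seqB": "ACGTACGTACGTACGTACGT",
    "seqC": "ACGTACGTACGTACGTACGT",
    "seqD": "ACTTCCATACGTACGTACCT",
}
print(clumped_unique_mutations(toy_alignment))
```

The isolated mutation at position 19 is left alone, since a single unique mutation is not evidence of an alignment or assembly problem under these assumptions.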
74
75 - Sequences with a high N content
76
77 Sequences that contain many ambiguous bases are flagged as candidates for exclusion from further analysis. Exclusion is not always appropriate: genomes with a lot of ambiguity can often still be informative. However, if there is something unusual about a sequence, a large number of ambiguities can be a flag for wider problems (such as a low read count during assembly).
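
For illustration, a minimal Python sketch of an N-content filter is shown here. The `max_n_fraction` threshold of 0.2 is a made-up value, not squirrel's default, and the sketch counts only N rather than every IUPAC ambiguity code.

```python
def n_fraction(sequence):
    """Fraction of bases in the sequence that are N (0.0 for an empty sequence)."""
    seq = sequence.upper()
    return seq.count("N") / len(seq) if seq else 0.0

def flag_high_n(sequences, max_n_fraction=0.2):
    """Return the sorted names of sequences whose N fraction exceeds the threshold."""
    return sorted(
        name for name, seq in sequences.items()
        if n_fraction(seq) > max_n_fraction
    )

# Hypothetical toy input: "patchy" is 30% N, "good" has no N at all.
toy_sequences = {
    "good": "ACGTACGTAC",
    "patchy": "ACNNNACGTA",
}
print(flag_high_n(toy_sequences))
```

Flagged names are only suggestions for exclusion; as noted above, a sequence with many ambiguous bases can still be informative.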
78 ]]></help>
79 <expand macro="citations"/>
80 </tool>