Repository 'flye'
hg clone https://toolshed.g2.bx.psu.edu/repos/bgruening/flye

Changeset 11:291923e6f276 (2024-03-18)
Previous changeset 10:cb8dfd28c16f (2022-10-26) Next changeset 12:3e4f8642c77e (2024-08-26)
Commit message:
planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/flye commit acf41fab409bef4882d5d12cbf991452b408076e
modified:
flye.xml
macros.xml
b
diff -r cb8dfd28c16f -r 291923e6f276 flye.xml
--- a/flye.xml Wed Oct 26 13:37:35 2022 +0000
+++ b/flye.xml Mon Mar 18 12:44:09 2024 +0000
[
b'@@ -3,8 +3,9 @@\n     <macros>\n         <import>macros.xml</import>\n     </macros>\n+    <expand macro="edam_ontology"/>\n+    <expand macro="xrefs"/>\n     <expand macro="requirements" />\n-    <expand macro="edam_ontology"/>\n     <version_command>flye --version</version_command>\n     <command detect_errors="exit_code"><![CDATA[\n         #for $counter, $input in enumerate($inputs):\n@@ -17,7 +18,7 @@\n             #elif $input.is_of_type(\'fasta\'):\n                 #set $ext = \'fasta\'\n             #end if\n-            ln -s \'$input\' ./input_${counter}.${ext} &&\n+            ln -sf \'$input\' ./input_${counter}.${ext} &&\n         #end for\n         flye\n         $mode_conditional.mode\n@@ -228,12 +229,12 @@\n             </output>\n             <output name="assembly_gfa" ftype="txt">\n                 <assert_contents>\n-                    <has_size value="420252" delta="100"/>\n+                    <has_size value="419414" delta="100"/>\n                </assert_contents>\n             </output>\n             <output name="consensus" ftype="fasta">\n                 <assert_contents>\n-                    <has_size value="427129" delta="100"/>\n+                    <has_size value="426277" delta="100"/>\n                </assert_contents>\n             </output>\n         </test>\n@@ -252,17 +253,17 @@\n             </output>\n             <output name="assembly_graph" ftype="graph_dot">\n                 <assert_contents>\n-                    <has_size value="1273" delta="100"/>\n+                    <has_size value="1500" delta="100"/>\n                </assert_contents>\n             </output>\n             <output name="assembly_gfa" ftype="txt">\n                 <assert_contents>\n-                    <has_size value="420252" delta="100"/>\n+                    <has_size value="418422" delta="100"/>\n                </assert_contents>\n             </output>\n             <output name="consensus" ftype="fasta">\n                 <assert_contents>\n-                    <has_size value="427129" delta="100"/>\n+                    <has_size value="425147" delta="200"/>\n                </assert_contents>\n             </output>\n         </test>\n@@ -287,12 +288,12 @@\n             </output>\n             <output name="assembly_gfa" ftype="txt">\n                 <assert_contents>\n-                    <has_size value="420252" delta="100"/>\n+                    <has_size value="418511" delta="100"/>\n                </assert_contents>\n             </output>\n             <output name="consensus" ftype="fasta">\n                 <assert_contents>\n-                    <has_size value="427129" delta="100"/>\n+                    <has_size value="425267" delta="100"/>\n                </assert_contents>\n             </output>\n         </test>\n@@ -301,7 +302,7 @@\n             <param name="inputs" ftype="fastq.gz" value="ecoli_hifi_01.fastq.gz,ecoli_hifi_02.fastq.gz,ecoli_hifi_03.fastq.gz,ecoli_hifi_04.fastq.gz,ecoli_hifi_05.fastq.gz,ecoli_hifi_06.fastq.gz,ecoli_hifi_07.fastq.gz,ecoli_hifi_08.fastq.gz,ecoli_hifi_09.fastq.gz"/>\n             <param name="mode" value="--nano-hq"/>\n             <param name="min_overlap" value="1000"/>\n-            <param name="scaffolding" value="true"/>\n+            <param name="scaffold" value="true"/>\n             <output name="assembly_info" ftype="tabular">\n                 <assert_contents>\n                     <has_size value="286" delta="100"/>\n@@ -314,12 +315,12 @@\n             </output>\n             <output name="assembly_gfa" ftype="txt">\n                 <assert_contents>\n-                    <has_size value="420252" delta="100"/>\n+                    <has_size value="419414" delta="1000"/>\n                </assert_contents>\n             </output>\n             <output name="consensus" ftype="fasta">\n                 <assert_contents>\n-                    <has_size value="427129" delta="100"/>\n+                    <has_size value="426277" delta="1000"/>\n                </assert_contents>\n             </output>\n         </test>\n@@ -353,8 +35'..b'ue contigs are extended with the sequence from flanking unresolved repeats on the graph. Thus, \n a contig fully contains the corresponding graph edge (with the same id), but might be longer then this edge. This is somewhat similar to unitig-contig relation in \n@@ -402,53 +395,42 @@\n \n Extra information about contigs/scaffolds is output into the assembly_info.txt file. It is a tab-delimited table with the columns as follows:\n \n-    ::\n+*  Contig/scaffold id\n+*  Length\n+*  Coverage\n+*  Is circular, (Y)es or (N)o\n+*  Is repetitive, (Y)es or (N)o\n+*  Multiplicity (based on coverage)\n+*  Alternative group\n+*  Graph path (graph path corresponding to this contig/scaffold).\n \n-        - Contig/scaffold id\n-        - Length\n-        - Coverage\n-        - Is circular, (Y)es or (N)o\n-        - Is repetitive, (Y)es or (N)o\n-        - Multiplicity (based on coverage)\n-        - Alternative group\n-        - Graph path (graph path corresponding to this contig/scaffold).\n-\n-Scaffold gaps are marked with ?? symbols, and * symbol denotes a terminal graph node. Alternative contigs (representing alternative haplotypes) will have the same alt. \n-group ID. Primary contigs are marked by *.\n+Scaffold gaps are marked with `??` symbols, and `*` symbol denotes a terminal graph node. Alternative contigs (representing alternative haplotypes) will have the same alt. \n+group ID. Primary contigs are marked by `*`.\n \n ----\n \n-.. class:: infomark\n-\n **Algorithm Description**\n \n This is a brief description of the Flye algorithm. Please refer to the manuscript for more detailed information. The draft contig extension is organized as follows:\n \n-    ::\n-\n-        - K-mer counting / erroneous k-mer pre-filtering\n-        - Solid k-mer selection (k-mers with sufficient frequency, which are unlikely to be erroneous)\n-        - Contig extension. The algorithm starts from a single read and extends it with a next overlapping read (overlaps are dynamically detected using the selected solid k-mers).\n+*  K-mer counting / erroneous k-mer pre-filtering\n+*  Solid k-mer selection (k-mers with sufficient frequency, which are unlikely to be erroneous)\n+*  Contig extension. The algorithm starts from a single read and extends it with a next overlapping read (overlaps are dynamically detected using the selected solid k-mers).\n \n Note that we do not attempt to resolve repeats at this stage, thus the reconstructed contigs might contain misassemblies. Flye then aligns the reads on these draft \n contigs using minimap2 and calls a consensus. Afterwards, Flye performs repeat analysis as follows:\n \n-    ::\n-\n-        - Repeat graph is constructed from the (possibly misassembled) contigs\n-        - In this graph all repeats longer than minimum overlap are collapsed\n-        - The algorithm resolves repeats using the read information and graph structure\n-        - The unbranching paths in the graph are output as contigs\n+*  Repeat graph is constructed from the (possibly misassembled) contigs\n+*  In this graph all repeats longer than minimum overlap are collapsed\n+*  The algorithm resolves repeats using the read information and graph structure\n+*  The unbranching paths in the graph are output as contigs\n \n If enabled, after resolving bridged repeats, Trestle module attempts to resolve simple unbridged repeats (of multiplicity 2) using the heterogeneities between repeat copies. \n Finally, Flye performs polishing of the resulting assembly to correct the remaining errors:\n \n-    ::\n-\n-        - Alignment of all reads to the current assembly using minimap2\n-        - Partition the alignment into mini-alignments (bubbles)\n-        - Error correction of each bubble using a maximum likelihood approach\n-\n+*  Alignment of all reads to the current assembly using minimap2\n+*  Partition the alignment into mini-alignments (bubbles)\n+*  Error correction of each bubble using a maximum likelihood approach\n \n The polishing steps could be repeated, which might slightly increase quality for some datasets.\n \n'
b
diff -r cb8dfd28c16f -r 291923e6f276 macros.xml
--- a/macros.xml Wed Oct 26 13:37:35 2022 +0000
+++ b/macros.xml Mon Mar 18 12:44:09 2024 +0000
b
@@ -1,5 +1,5 @@
 <macros>
-    <token name="@TOOL_VERSION@">2.9.1</token>
+    <token name="@TOOL_VERSION@">2.9.3</token>
     <token name="@SUFFIX_VERSION@">0</token>
     <xml name="requirements">
         <requirements>
@@ -14,6 +14,11 @@
             <edam_operation>operation_0525</edam_operation>
         </edam_operations>
     </xml>
+    <xml name="xrefs">
+        <xrefs>
+            <xref type="bio.tools">Flye</xref>
+        </xrefs>
+    </xml>
     <xml name="citations">
         <citations>
             <citation type="doi">10.1073/pnas.1604560113</citation>