0
|
1 <tool id="metaphlan" name="MetaPhlAn" version="1.0">
|
|
2 <requirements>
|
|
3 <requirement type="package">metaphlan</requirement>
|
|
4 <requirement type="package" version="2.2.25+">blast</requirement>
|
|
5 </requirements>
|
|
6 <description>Metagenomic Phylogenetic Analysis</description>
|
|
7 <command>
|
|
8 metaphlan.py
|
|
9 #if str($source.type) == "fasta":
|
|
10 ${source.fasta_input}
|
|
11 #else:
|
|
12 ${source.blast_input}
|
|
13 #end if
|
|
14 ${metaphlan_out}
|
|
15 --nproc 4
|
|
16 #if str($source.type) == "fasta":
|
|
17 --blastout metagenome.outfmt6.txt
|
|
18 --evalue ${source.evalue}
|
|
19 #end if
|
|
20 --lib_dir ${GALAXY_DATA_INDEX_DIR}/shared/metaphlan
|
|
21 --min_cu_len ${min_cu_len}
|
|
22 --min_nreads ${min_nreads}
|
|
23 </command>
|
|
24 <inputs>
|
|
25
|
|
26 <conditional name="source">
|
|
27 <param name="type" type="select" label="Input Type">
|
|
28 <option value="fasta">multi-fasta file containing metagenomic reads</option>
|
|
29 <option value="blast">NCBI BLAST output file</option>
|
|
30 </param>
|
|
31 <when value="fasta">
|
|
32 <param format="fasta" name="fasta_input" type="data" label="from"/>
|
|
33 <param name="evalue" type="float" size="15" value="0.00001" label="evalue threshold for the blasting" />
|
|
34 </when>
|
|
35 <when value="blast">
|
|
36 <param format="tabular" name="blast_input" type="data" label="from"/>
|
|
37 </when>
|
|
38 </conditional>
|
|
39
|
|
40 <param name="tax_lev" type="select" label="Taxonomic Level" help="The taxonomic level for the relative abundance output">
|
|
41 <option value="a">All taxonomic levels</option>
|
|
42 <option value="k">Kingdoms (Bacteria and Archaea) only</option>
|
|
43 <option value="p">Phyla only</option>
|
|
44 <option value="c">Classes only</option>
|
|
45 <option value="o">Orders only</option>
|
|
46 <option value="f">Families only</option>
|
|
47 <option value="g">Genera only</option>
|
|
48 <option value="s">Species only</option>
|
|
49 </param>
|
|
50 <param name="min_cu_len" type="integer" value="10000" help="min_cu_len" label="Minimum total nucleotide length for the unique markers for estimating the abundance without considering children clade abundances" />
|
|
51 <param name="min_nreads" type="integer" value="5" help="min_nreads" label="minimum total reads assigned to a clade for estimating the abundance without considering children clade abundances" />
|
|
52 </inputs>
|
|
53 <outputs>
|
|
54 <data format="tabular" name="metaphlan_out" label="MetaPhlAn on ${on_string}" />
|
|
55 <data format="tabular" name="blast_out" from_work_dir="metagenome.outfmt6.txt" label="MetaPhlAn BLAST on ${on_string}">
|
|
56 <filter>source['type'] == "fasta"</filter>
|
|
57 </data>
|
|
58 </outputs>
|
|
59 <tests>
|
|
60 </tests>
|
|
61 <help>
|
|
62
|
|
63 **What it does**
|
|
64 MetaPhlAn is a computational tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data. MetaPhlAn relies on unique clade-specific marker genes identified from reference genomes, allowing orders of magnitude speedups and unambiguous taxonomic assignments.
|
|
65
|
|
66 MetaPhlAn main features are:
|
|
67
|
|
68 More than 100x computational speedup compared to Blast-based approaches or other available methods with species level resolution
|
|
69 Higher accuracy in estimating the true composition of microbial communities in terms of organismal relative abundance
|
|
70 Unambiguous read-to-taxa assignments as conserved inter-clade sequences are removed from the reference sequence data
|
|
71
|
|
72 ---------
|
|
73
|
|
74 **Inputs**
|
|
75
|
|
76 The input file can be a multi-fasta file containing metagenomic reads OR a NCBI BLAST output file (-outfmt 6 format) of the metagenomic read fasta file against the metaflan database.
|
|
77
|
|
78 **outputs**
|
|
79
|
|
80 The output is a tab-separated output file of the predicted taxon relative abundances.
|
|
81 If the input is a multi-fasta file then the output from the BLAST operation is also provided as an additional output.
|
|
82
|
|
83 ---------
|
|
84
|
|
85 **Settings**::
|
|
86
|
|
87 --tax_lev TAXONOMIC_LEVEL
|
|
88 The taxonomic level for the relative abundance
|
|
89 output:
|
|
90 'a' : all taxonomic levels
|
|
91 'k' : kingdoms (Bacteria and Archaea) only
|
|
92 'p' : phyla only
|
|
93 'c' : classes only
|
|
94 'o' : orders only
|
|
95 'f' : families only
|
|
96 'g' : genera only
|
|
97 's' : species only
|
|
98 [default 'a']
|
|
99 --evalue evalue threshold for the blasting
|
|
100 [default 1e-6]
|
|
101 --min_cu_len minimum total nucleotide lenght for the unique
|
|
102 markers for estimating the abundance without
|
|
103 considering children clade abundances
|
|
104 [default 10000]
|
|
105 --min_nreads minimum total reads assigned to a clade for
|
|
106 estimating the abundance without considering
|
|
107 children clade abundances
|
|
108 [default 5]
|
|
109
|
|
110 -----
|
|
111
|
|
112 **Citation**
|
|
113
|
|
114 If you find MetaPhlAn useful in your research, please cite our paper:
|
|
115 Nicola Segata, Levi Waldron, Annalisa Ballarini, Vagheesh Narasimhan, Olivier Jousson, Curtis Huttenhower.
|
|
116 "Fast and accurate metagenomic profiling of microbial community composition using unique clade-specific marker genes"
|
|
117 ***in review***
|
|
118
|
|
119 </help>
|
|
120 </tool>
|
|
121
|