+--readlength INT | Read length for the circularity prediction (default: 101 bp) |
+--windowsize INT | Window length used to determine the origin of replication in circular contigs according to the cumulative GC skew(default: 100 bp) |
+--slidingsize INT | Sliding window length used to determine the origin of replication in circular contigs according to the cumulative GC skew(default: 10 bp) |
+--out OUTPUTNAME | Name of the outputs files without extensions, as the program will add them automatically. By default, the program will use the input name as the output. |
+--locus STRING | Name of the contigs. If the input is a multiFASTA file, please put a general name as the program will add the number of the contig at the end of the name. By default, the name of the contigs will be "LOC". |
+--threads INT | Number of threads/CPUs. By default, the program will use 1 CPU. |
+--gff | Printing the output as a General Feature Format (GFF) version 3. It is a flat table file with contains 9 columns of data (see http://www.ensembl.org/info/website/upload/gff3.html for more information). By default, the program will not print the GFF3 file (--gff False). |
+--blastdb BLASTDB | BLAST database that will be used for the protein function prediction. The database MUST be for amino acids. This is only mandatory if the "ultrafast" mode is not active |
+--diamonddb DIAMONDDB | DIAMOND database that will be used for the protein function prediction. The database MUST be for amino acids. This is only mandatory when "ultrafast" mode is active |
+--blastevalue FLOAT | BLAST e-value threshold. By default, the threshold will be 1e-05. |
+
+--nohmmer | Running the program without using PHMMER to predict protein function. In this case, the program will be as fast as Prokka (Seemann 2014) but the annotations will not be accurate. By default, this program had this flag disabled. |
+--noblast | Running the program replacing BLAST by DIAMOND. In this case, the program will be fast but the annotations will not be accurate. By default, this program had this flag disabled. |
+--hmmdb HMMDB | PHMMER Database that will be used for the protein function prediction according to Hidden Markov Models. In this case, HMMDB must be in FASTA format (e.g. UniProt). This parameter is mandatory if the "--fast" option is disabled. " |
+--hmmerevalue FLOAT | PHMMER e-value threshold. By default, the threshold is 1e-03. |
+--typedata BCT|CON|VRL|PHG | GenBank Division: One of the following codes:
+
+BCT | Prokaryotic chromosome |
+VRL | Eukaryotic/Archaea virus |
+PHG | Phages |
+CON | Contig |
+
+By default, the program will consider every sequence as a contig (CON) |
+--gcode NUMBER | Number of GenBank translation table. At this moment, the available options are:
+
+1 | Standard genetic code [Eukaryotic] |
+2 | Vertebrate mitochondrial code |
+3 | Yeast mitochondrial code |
+4 | Mycoplasma/Spiroplasma and Protozoan/mold/coelenterate mitochondrial code |
+5 | Invertebrate mitochondrial code |
+6 | Ciliate/dasycladacean/Hexamita nuclear code |
+9 | Echinoderm/flatworm mitochondrial code |
+10 | Euplotid nuclear code |
+11 | Bacteria/Archaea/Phages/Plant plastid |
+12 | Alternative yeast nuclear code |
+13 | Ascidian mitochondrial code |
+14 | Alternative flatworm mitochondrial code |
+16 | Chlorophycean mitochondrial code |
+21 | Trematode mitochondrial code |
+22 | Scedenesmus obliquus mitochondrial code |
+23 | Thraustochytrium mitochondrial code |
+24 | Pterobranquia mitochondrial code |
+25 | Gracilibacteria and Candidate division SR1 |
+26 | Pachysolen tannophilus nuclear code |
+27 | Karyorelict nuclear code |
+28 | Condylostoma nuclear code |
+29 | Mesodinium nuclear code |
+30 | Peritrich nuclear code |
+31 | Blastocrithidia nuclear code |
+
+By default, the program will use the translation table no. 11 |
+--mincontigsize INT | Minimum contig length to be considered in the final files. By default, the program only consider from 200 bp. |
+--idthr FLOAT | Identity threshold to consider that a protein belong to a specific hit. By default, the threshold is 50.0 % |
+--coverthr FLOAT | Coverage threshold to consider that a protein belong to a specific hit. By default, the threshold is 50.0 % |
+--diffid FLOAT (>0.01) | Max allowed difference between the ID percentages of BLAST and HMMER. By default, the allowed difference is 5.00 % and we do not recommended to change such value. |
+--minrepeat INT | Minimum repeat length for CRISPR detection (Default: 16) |
+--maxrepeat INT | Maximum repeat length for CRISPR detection (Default: 64) |
+--minspacer INT | Minimum spacer length for CRISPR detection (Default: 8) |
+--maxspacer INT | Maximum spacer length for CRISPR detection (Default: 64) |
+--blastexh | Use of exhaustive BLAST to predict the proteins by homology according to Fozo et al. (2010). In this case, the search will be done using a word size of 2, a gap open penalty of 8, a gap extension penalty of 2, the PAM70 matrix instead of the BLOSUM62 and no compositional based statistics. This method is more accurate to predict the functions of the proteins but it is slower than BLAST default parameters. By default, exhaustive BLAST is disabled. |
+
+
+## Examples
+An example of execution (using BLAST and HMMER) is:
+
+ python VIGA.py --input eukarya.fasta --blastdb databases/blast/nr/nr --hmmdb databases/UniProt/uniprot_trembl.fasta --rfamdb databases/rfam/Rfam.cm --gcode 1 --out eukarya_BENCHMARK --modifiers ../modifiers.txt --threads 10
+
+Another example (but this time using BLAST but not HMMER - "fast mode") is:
+
+ python VIGA.py --input bacteria.fasta --blastdb databases/blast/nr/nr --nohmmer --rfamdb databases/rfam/Rfam.cm --out bacteria_BENCHMARK --modifiers ../modifiers.txt --threads 10
+
+Finally, an example using DIAMOND and not HMMER ("ultrafast mode") is:
+
+ python VIGA.py --input archaea.fasta --noblast --diamonddb databases/diamond/nr --nohmmer --rfamdb databases/rfam/Rfam.cm --out archaea_BENCHMARK --modifiers ../modifiers.txt --threads 10
+
+## Galaxy wrapper
+VIGA can be integrated into [Galaxy](https://galaxyproject.org) using the wrapper included in this repository.
+
+### Requirements
+
+[Docker](https://www.docker.com) should first be installed and working on the
+server where this Galaxy instance is setup. The user running Galaxy should be
+part of the **docker** user group.
+
+#### Manual installation of the wrapper from Github
+
+1. Download or clone this repository (as a submodule) in the **tools**
+ directory of the Galaxy installation.
+
+2. Update **config/tool_conf.xml** to add the VIGA wrapper in a relevant section of the tool panel. For example, "Annotation".
+
+