Galaxy | Tool Preview

Orthogroups_Tool (version 1.0.2)
The output from OrthoFinder, 'Orthogroups.txt'.
Contains the nucleic sequences to retrieve.
Drop orthogroups with less than x sequences :
Turn on this paramater to display an additional summary table (computed before the pralogous filtering)
Turn on this parameter to obtain an additional output (orthogroups files before paralogous filtering)

Authors Eric Fontanillas created the version 1 of this pipeline. Victor Mataigne developped version 2.

Galaxy integration Julie Baffard and ABiMS TEAM, Roscoff Marine Station

Contact support.abims@sb-roscoff.fr for any questions or concerns about the Galaxy implementation of this tool.
Credits : Gildas le Corguillé, Misharl Monsoor

Description

This tool takes Orthogroups found by OrthoFinder and proceeds to retrieve nucleic sequences back, then write each orthogroups in its own fasta file.


Step 1 : re-writing headers

This tool is configured to work within the AdaptSearch toolsuite, which implies a specific format of headers. Indeed, due to the use of external tools (like TransDecoder), the genes IDs might have been modified. A regular expression rewrites all the genes IDs in order to have, for instance, this format : >Pf1004_1/1_1.000_369


Step 2 : reading an re-writing orthoGroups from OrthoFinder

OrthoFinder (Emms, D.M. and Kelly, S., 2015) is a fast, accurate and comprehensive analysis tool for comparative genomics. It finds orthologues and orthogroups infers gene trees for all orthogroups and infers a rooted species tree for the species being analysed. OrthoFinder also provides comprehensive statistics for comparative genomic analyses.

Our tool focus on the orthogroups.txt file created before gene trees.The script proceeds to split each orthogroup in its own fasta file and, with the use of the output of Filter_Assemblies, to re-associate each ID with its sequence.


Inputs and parameters


Outputs


The AdaptSearch Pipeline

/repository/static/images/c563a21643779552/adaptsearch_picture_helps.png

Changelog

Version 1.0 - 11/01/2018