primo_multiomics: results2o.xml annotate

annotate results2o.xml @ 6:8fa07f40d2eb

added more parameters/options and made Ontology observations file optional;

author	pieter.lukasse@wur.nl
date	Fri, 01 Aug 2014 17:21:30 +0200
parents	eb0b7889dd08
children

rev	line source
0 5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	1 <tool name="Results2O" id="results2o1" version="0.0.1">
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	2 <description>use ontology mapping to annotate results (e.g. annotate protein identifications with Gene Ontology[GO] terms)</description>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	3 <!--
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	4 For remote debugging start your listener on port 8000 and use the following as command interpreter:
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	5 java -jar -Xdebug -Xrunjdwp:transport=dt_socket,address=D0100564.wurnet.nl:8000
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	6 -->
5 eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	7 <!-- similar to the "join two datasets" tool http://galaxy.wur.nl/galaxy_production/root?tool_id=join1
eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	8 but this one probably has more powerful features, such as supporting multiple ';'-separated codes in key fields
eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	9 and the ontologyTermColName(s) feature supporting direct hierarchy-like annotation -->
0 5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	10 <command interpreter="java -jar ">
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	11 Results2O.jar
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	12 -inputFileName $inputFileName
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	13 -inputIdColumnName "$inputIdColumnName"
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	14 -inputIdPrefix "$inputIdPrefix"
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	15 -quantifColumn "$quantifColumn"
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	16
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	17 -ontologyMappingFileName $ontologyMappingFileName
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	18 -mappingFileIdColName "$mappingFileIdColName"
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	19 -mappingIdPrefix "$mappingIdPrefix"
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	20 -mappingFileOntologyTermColName "$mappingFileOntologyTermColName"
5 eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	21 -removeWhiteSpacesFromOterms $removeWhiteSpacesFromOterms
0 5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	22
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	23 -outputFileName $outputFileName
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	24 -outputObservationsFileName $outputObservationsFileName
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	25
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	26 </command>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	27
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	28 <inputs>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	29
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	30 <param name="inputFileName" type="data" format="tabular,csv" label="Input file (TSV/CSV)" />
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	31 <param name="inputIdColumnName" type="text" size="50" value="" label="ID column name" help="Name of the column containing the identification codes (in the given input file)"/>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	32 <param name="inputIdPrefix" type="text" size="50" value="" label="(Optional) Prefix in ID column"
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	33 help="Fill in if the ID column values carry a fixed prefix (e.g. in some
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	34 files the value appears as 'lipidmaps:LMFA00000007' instead of just 'LMFA00000007'; in this
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	35 example one would fill in 'lipidmaps:' as the prefix)"/>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	36 <param name="quantifColumn" type="text" size="50" value="" label="(Optional) Values column name" help="Name of the column containing the quantification values (in the given input file)"/>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	37
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	38 <!-- =================== ONTOLOGY part ============== -->
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	39 <param name="ontologyMappingFileName" type="data" format="tabular,csv" label="ID to Ontology mapping file (TSV/CSV)" help="Simple file linking the coding scheme used for the identifications in the given input file to one or more ontology terms."/>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	40 <param name="mappingFileIdColName" type="text" size="50" value="" label="ID column name (in ontology mapping file)" help="Name of the column containing the identification codes (which will in fact link the input file records to the ontology records)"/>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	41 <param name="mappingIdPrefix" type="text" size="50" value="" label="(Optional) Prefix in mapping ID column"
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	42 help="Fill in if the ID column values carry a fixed prefix (e.g. in some
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	43 files the value appears as 'lipidmaps:LMFA00000007' instead of just 'LMFA00000007'; in this
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	44 example one would fill in 'lipidmaps:' as the prefix)"/>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	45
5 eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	46 <param name="mappingFileOntologyTermColName" type="text" size="50" value="" label="Ontology term column name(s)"
eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	47 help="Name(s) of the column(s) containing the ontology terms in the ontology mapping file (which will be transferred to the input file).
eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	48 To use multiple columns, give the names separated by commas (,). If multiple columns are specified, the algorithm looks for an annotation in the first one; if none
eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	49 is found it tries the second one, and so forth."/>
0 5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	50
5 eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	51 <param name="removeWhiteSpacesFromOterms" type="boolean" checked="false"
eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	52 label="Remove white spaces from ontology terms"
eb0b7889dd08 added more parameters/options and made Ontology observations file optional; pieter.lukasse@wur.nl parents: 0 diff changeset	53 help="This could be needed for some ontologies, like the current custom one for Lipidmaps."/>
0 5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	54
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	55 </inputs>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	56 <outputs>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	57 #if isinstance( $inputFileName.datatype, $__app__.datatypes_registry.get_datatype_by_extension('tabular').__class__):
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	58 <data name="outputFileName" format="tabular" label="${tool.name} on ${on_string}: annotated file " ></data>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	59 #else:
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	60 <data name="outputFileName" format="csv" label="${tool.name} on ${on_string}: annotated file " ></data>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	61 #end if
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	62
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	63 <data name="outputObservationsFileName" format="tabular" label="${tool.name} on ${on_string}: ontology observations file (TSV)"></data>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	64 </outputs>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	65 <tests>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	66 <!-- find out how to use -->
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	67 <test>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	68 </test>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	69 </tests>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	70 <help>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	71
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	72 .. class:: infomark
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	73
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	74 This tool annotates a quantification results file
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	75 with the ontology terms given in a mapping file. This mapping file links the items found in the results file
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	76 (e.g. protein identifications coded in common protein coding formats such as UniProt)
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	77 to their respective ontology terms (e.g. GO terms). It enables users to use the cross-reference
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	78 information now available in different repositories (such as UniProt and KEGG; see for example
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	79 http://www.uniprot.org/taxonomy/ or http://www.genome.jp/linkdb/ )
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	80 to map their results to other useful coding schemes such as ontologies for functional annotations.
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	81
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	82 As an example for transcripts and proteins, users can check http://www.uniprot.org/taxonomy/ to
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	83 see if their organism has been mapped to GO terms by UniProt. For example, the link
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	84 http://www.uniprot.org/uniprot/?query=taxonomy:2850 will show the UniProt repository and cross-references
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	85 for taxonomy 2850.
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	86 When the organism being studied is not available, other strategies
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	87 could be tried (Blast2GO, for example).
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	88
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	89
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	90 Despite the specific examples above, this tool is generic and can be used to map any
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	91 results file to an ontology according to a given mapping file. One example would be mapping metabolomics
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	92 identifications to the ChEBI ontology.
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	93
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	94
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	95 -----
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	96
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	97 Output
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	98
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	99 The tool reads the given input file and, for each line, adds a new column
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	100 containing the ontology terms found for the ID in that line. The output file is therefore the
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	101 input file plus an extra ontology terms column (terms separated by ';').
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	102
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	103 A second summarized "ontology observations" file is also generated which can be used for visualizing the results
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	104 in an ontology viewer (e.g. see OntologyAndObservationsViewer).
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	105
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	106 </help>
5bc82cf5439b Initial commit pieter.lukasse@wur.nl parents: diff changeset	107 </tool>
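
The annotation step described in the tool's help text can be sketched roughly as follows. This is not the code inside Results2O.jar, only a minimal Python illustration of the documented behaviour: optional ID prefixes, multiple ';'-separated codes in a key field, fallback across several ontology term columns, optional white space removal, and a ';'-joined output column. The function names, the output column name "OntologyTerms", the assumption that a mapping cell may itself hold several ';'-separated terms, and the toy IDs and terms are all made up for the example.

```python
import csv
import io


def normalize_id(raw, prefix):
    """Strip surrounding blanks and an optional fixed prefix (e.g. 'lipidmaps:')."""
    raw = raw.strip()
    if prefix and raw.startswith(prefix):
        return raw[len(prefix):]
    return raw


def load_mapping(handle, id_col, term_cols, prefix="", strip_spaces=False):
    """Build {id: [terms]} from a TSV mapping file.

    For each row, the first column in term_cols holding a value wins;
    later columns are only consulted when earlier ones are empty.
    """
    mapping = {}
    for row in csv.DictReader(handle, delimiter="\t"):
        terms = []
        for col in term_cols:
            value = (row.get(col) or "").strip()
            if value:
                # Assumption: several terms in one cell are separated by ';'.
                terms = [t.strip() for t in value.split(";") if t.strip()]
                break
        if strip_spaces:
            terms = ["".join(t.split()) for t in terms]
        # A key field may hold several ';'-separated codes.
        for key in (row.get(id_col) or "").split(";"):
            key = normalize_id(key, prefix)
            if not key:
                continue
            bucket = mapping.setdefault(key, [])
            for term in terms:
                if term not in bucket:
                    bucket.append(term)
    return mapping


def annotate(handle, id_col, mapping, prefix="", new_col="OntologyTerms"):
    """Return (fieldnames, rows) with an extra ';'-joined ontology terms column."""
    reader = csv.DictReader(handle, delimiter="\t")
    fields = list(reader.fieldnames) + [new_col]
    rows = []
    for row in reader:
        found = []
        for key in row[id_col].split(";"):
            for term in mapping.get(normalize_id(key, prefix), []):
                if term not in found:
                    found.append(term)
        row[new_col] = ";".join(found)
        rows.append(row)
    return fields, rows


# Toy data, invented for illustration only.
mapping_tsv = (
    "ID\tGO\tClass\n"
    "lipidmaps:LMFA00000007\t\tFatty Acyls\n"
    "P00001\tGO:0000001;GO:0000002\tunused\n"
)
input_tsv = (
    "Protein\tIntensity\n"
    "P00001\t10.5\n"
    "LMFA00000007;P00001\t3.2\n"
    "Q00009\t1.0\n"
)

mapping = load_mapping(io.StringIO(mapping_tsv), "ID", ["GO", "Class"],
                       prefix="lipidmaps:", strip_spaces=True)
fields, rows = annotate(io.StringIO(input_tsv), "Protein", mapping)

for row in rows:
    print("\t".join(row[f] for f in fields))
```

IDs without a match keep an empty value in the new column, so the output has the same number of rows as the input. The real tool may handle unmatched IDs differently.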
