annotate results2o.xml @ 0:5bc82cf5439b

Initial commit
author pieter.lukasse@wur.nl
date Fri, 02 May 2014 15:36:43 +0200
parents
children eb0b7889dd08
<tool name="Results2O" id="results2o1" version="0.0.1">
  <description>use ontology mapping to annotate results (e.g. annotate protein identifications with Gene Ontology [GO] terms)</description>
  <!--
  For remote debugging, start your listener on port 8000 and use the following as command interpreter:
    java -jar -Xdebug -Xrunjdwp:transport=dt_socket,address=D0100564.wurnet.nl:8000
  -->
  <command interpreter="java -jar ">
    Results2O.jar
      -inputFileName $inputFileName
      -inputIdColumnName "$inputIdColumnName"
      -inputIdPrefix "$inputIdPrefix"
      -quantifColumn "$quantifColumn"

      -ontologyMappingFileName $ontologyMappingFileName
      -mappingFileIdColName "$mappingFileIdColName"
      -mappingIdPrefix "$mappingIdPrefix"
      -mappingFileOntologyTermColName "$mappingFileOntologyTermColName"

      -outputFileName $outputFileName
      -outputObservationsFileName $outputObservationsFileName

  </command>

  <inputs>

    <param name="inputFileName" type="data" format="tabular,csv" label="Input file (TSV/CSV)" />
    <param name="inputIdColumnName" type="text" size="50" value="" label="ID column name" help="Name of the column containing the identification codes (in the given input file)"/>
    <param name="inputIdPrefix" type="text" size="50" value="" label="(Optional) Prefix in ID column"
           help="Fill in if the ID column values carry a fixed prefix (e.g. in some
           files the value reads 'lipidmaps:LMFA00000007' instead of just 'LMFA00000007'; in this
           example one would fill in 'lipidmaps:' as the prefix)"/>
    <param name="quantifColumn" type="text" size="50" value="" label="(Optional) Values column name" help="Name of the column containing the quantification values (in the given input file)"/>

    <!-- =================== ONTOLOGY part ============== -->
    <param name="ontologyMappingFileName" type="data" format="obo" label="ID to Ontology mapping file (TSV/CSV)" help="Simple file linking the coding scheme used for the identifications in the given input file to one or more ontology terms."/>
    <param name="mappingFileIdColName" type="text" size="50" value="" label="ID column name (in ontology mapping file)" help="Name of the column containing the identification codes (which link the input file records to the ontology records)"/>
    <param name="mappingIdPrefix" type="text" size="50" value="" label="(Optional) Prefix in mapping ID column"
           help="Fill in if the ID column values carry a fixed prefix (e.g. in some
           files the value reads 'lipidmaps:LMFA00000007' instead of just 'LMFA00000007'; in this
           example one would fill in 'lipidmaps:' as the prefix)"/>

    <param name="mappingFileOntologyTermColName" type="text" size="50" value="" label="Ontology term column name" help="Name of the column containing the ontology terms in the ontology mapping file (these terms will be transferred to the input file)"/>


  </inputs>
  <outputs>
    <!-- The annotated file takes its format (tabular or csv) from the input file. -->
    <data name="outputFileName" format="tabular" format_source="inputFileName" label="${tool.name} on ${on_string}: annotated file" />

    <data name="outputObservationsFileName" format="tabular" label="${tool.name} on ${on_string}: ontology observations file (TSV)" />
  </outputs>
  <tests>
    <!-- find out how to use -->
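    <!--
    A possible starting point, as a sketch only: a Galaxy functional test lists the
    tool inputs as param elements and compares each output with a reference file
    from the tool's test-data directory. Every file name and column name below is a
    hypothetical placeholder; none of them ship with this tool. The sketch is kept
    inside a comment so that it does not run until real test data exists.

    
    -->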
    <test>
    </test>
  </tests>
  <help>

.. class:: infomark

This tool annotates a quantification results file
with the ontology terms given in a mapping file. The mapping file links the items found in the results file
(e.g. protein identifications coded in a common protein coding format such as UniProt)
to their respective ontology terms (e.g. GO terms). It lets users exploit the cross-reference
information now available in different repositories (like UniProt and KEGG; see for example
http://www.uniprot.org/taxonomy/ or http://www.genome.jp/linkdb/ )
to map their results to other useful coding schemes, such as ontologies for functional annotation.

As an example for transcripts and proteins, users can check http://www.uniprot.org/taxonomy/ to
see whether their organism has been mapped to GO terms by UniProt. For example, the link
http://www.uniprot.org/uniprot/?query=taxonomy:2850 shows the UniProt entries and cross-references
for taxonomy 2850.
When the organism being studied is not available, other strategies
can be tried (Blast2GO, for example).


Despite the specific examples above, this tool is generic and can be used to map any
results file to an ontology according to a given mapping file. One example would be mapping metabolomics
identifications to the ChEBI ontology.


-----

**Output**

The tool reads the given input file and adds a new column to each line,
containing the ontology terms found for the ID in that line. The output file is therefore the
input file plus an extra ontology terms column (multiple terms are separated by ; ).
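
For illustration only: the column names, the value and the ontology terms below are
hypothetical placeholders (only the ID and its prefix come from the parameter help above),
and the layout assumes one mapping file row per ontology term. An input line such as::

    ID                        Intensity
    lipidmaps:LMFA00000007    1200

combined with the ID prefix 'lipidmaps:' and a mapping file containing::

    ID              Term
    LMFA00000007    TERM_A
    LMFA00000007    TERM_B

would then be expected to produce an annotated line along these lines::

    ID                        Intensity    Term
    lipidmaps:LMFA00000007    1200         TERM_A;TERM_B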

A second, summarized "ontology observations" file is also generated, which can be used to visualize the results
in an ontology viewer (e.g. see OntologyAndObservationsViewer).

  </help>
</tool>