Mercurial > repos > pieterlukasse > primo_multiomics
annotate results2o.xml @ 6:8fa07f40d2eb
added more parameters/options and made Ontology observations file optional;
author | pieter.lukasse@wur.nl |
---|---|
date | Fri, 01 Aug 2014 17:21:30 +0200 |
parents | eb0b7889dd08 |
children |
rev | line source |
---|---|
0 | 1 <tool name="Results2O" id="results2o1" version="0.0.1"> |
2 <description>use ontology mapping to annotate results (e.g. annotate protein identifications with Gene Ontology[GO] terms)</description> | |
3 <!-- | |
4 For remote debugging start you listener on port 8000 and use the following as command interpreter: | |
5 java -jar -Xdebug -Xrunjdwp:transport=dt_socket,address=D0100564.wurnet.nl:8000 | |
6 --> | |
5
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
7 <!-- similar to "join two datasets" tool http://galaxy.wur.nl/galaxy_production/root?tool_id=join1 |
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
8 but this one is probably having more powerful features like supporting multiple ';' codes in key fields |
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
9 and the feature in ontologyTermColName(s) supporting direct hierarchy like annotation --> |
0 | 10 <command interpreter="java -jar "> |
11 Results2O.jar | |
12 -inputFileName $inputFileName | |
13 -inputIdColumnName "$inputIdColumnName" | |
14 -inputIdPrefix "$inputIdPrefix" | |
15 -quantifColumn "$quantifColumn" | |
16 | |
17 -ontologyMappingFileName $ontologyMappingFileName | |
18 -mappingFileIdColName "$mappingFileIdColName" | |
19 -mappingIdPrefix "$mappingIdPrefix" | |
20 -mappingFileOntologyTermColName "$mappingFileOntologyTermColName" | |
5
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
21 -removeWhiteSpacesFromOterms $removeWhiteSpacesFromOterms |
0 | 22 |
23 -outputFileName $outputFileName | |
24 -outputObservationsFileName $outputObservationsFileName | |
25 | |
26 </command> | |
27 | |
28 <inputs> | |
29 | |
30 <param name="inputFileName" type="data" format="tabular,csv" label="Input file (TSV/CSV)" /> | |
31 <param name="inputIdColumnName" type="text" size="50" value="" label="ID column name" help="Name of the column containing the identification codes (in the given input file)"/> | |
32 <param name="inputIdPrefix" type="text" size="50" value="" label="(Optional) Prefix in ID column" | |
33 help="Fill in if any prefix is found in the ID column values (e.g. in some | |
34 files the value is preceded by a fixed value like for example 'lipidmaps:LMFA00000007' instead of just 'LMFA00000007' - in this | |
35 example one would fill in 'lipidmaps:' as prefix)"/> | |
36 <param name="quantifColumn" type="text" size="50" value="" label="(Optional) Values column name" help="Name of the column containing the quantification values (in the given input file)"/> | |
37 | |
38 <!-- =================== ONTOLOGY part ============== --> | |
39 <param name="ontologyMappingFileName" type="data" format="obo" label="ID to Ontology mapping file (TSV/CSV)" help="Simple file linking the coding scheme used for the identifications in the given input file to one or more ontology terms."/> | |
40 <param name="mappingFileIdColName" type="text" size="50" value="" label="ID column name (in ontology mapping file)" help="Name of the column containing the identification codes (which will in fact link the input file records to the ontology records)"/> | |
41 <param name="mappingIdPrefix" type="text" size="50" value="" label="(Optional) Prefix in mapping ID column" | |
42 help="Fill in if any prefix is found in the ID column values (e.g. in some | |
43 files the value is preceded by a fixed value like for example 'lipidmaps:LMFA00000007' instead of just 'LMFA00000007' - in this | |
44 example one would fill in 'lipidmaps:' as prefix)"/> | |
45 | |
5
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
46 <param name="mappingFileOntologyTermColName" type="text" size="50" value="" label="Ontology term column name(s)" |
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
47 help="Name(s) of the column(s) containing the ontology terms in the ontology mapping file (and which will be transfered to the input file). |
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
48 For using multiple columns, set the names separated by comma (,). If multiple columns are specified, the algorithm will look for an annotation in the first one, if none |
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
49 found it will try the second one, and so forth. "/> |
0 | 50 |
5
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
51 <param name="removeWhiteSpacesFromOterms" type="boolean" checked="false" |
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
52 label="Remove white spaces from ontology terms" |
eb0b7889dd08
added more parameters/options and made Ontology observations file optional;
pieter.lukasse@wur.nl
parents:
0
diff
changeset
|
53 help="This could be needed for some ontologies, like the current custom one for Lipidmaps."/> |
0 | 54 |
55 </inputs> | |
56 <outputs> | |
57 #if isinstance( $inputFileName.datatype, $__app__.datatypes_registry.get_datatype_by_extension('tabular').__class__): | |
58 <data name="outputFileName" format="tabular" label="${tool.name} on ${on_string}: annotated file " ></data> | |
59 #else: | |
60 <data name="outputFileName" format="csv" label="${tool.name} on ${on_string}: annotated file " ></data> | |
61 #end if | |
62 | |
63 <data name="outputObservationsFileName" format="tabular" label="${tool.name} on ${on_string}: ontology observations file (TSV)"></data> | |
64 </outputs> | |
65 <tests> | |
66 <!-- find out how to use --> | |
67 <test> | |
68 </test> | |
69 </tests> | |
70 <help> | |
71 | |
72 .. class:: infomark | |
73 | |
74 This tool is responsible for annotating quantifications result file | |
75 with the ontology terms given in a mapping file. This mapping file links the items found in the result file | |
76 (e.g. protein identifications coded in common protein coding formats such as UniProt ) | |
77 to their respective ontology terms (e.g. GO terms). It enables users to use the cross-reference | |
78 information now available in different repositories (like uniprot and KEGG - see for example | |
79 http://www.uniprot.org/taxonomy/ or http://www.genome.jp/linkdb/ ) | |
80 to map their results to other useful coding schemes such as ontologies for functional annotations. | |
81 | |
82 As an example for transcripts and proteins, users can check http://www.uniprot.org/taxonomy/ to | |
83 see if their organism has been mapped to GO terms by Uniprot. For example the link | |
84 http://www.uniprot.org/uniprot/?query=taxonomy:2850 will show the Uniprot repository and cross-references | |
85 for the taxonomy 2850. | |
86 When the organism being studied is not available, then other strategies | |
87 could be tried (like Blast2GO for example). | |
88 | |
89 | |
90 Despite the specific examples above, this class is generic and can be used to map any | |
91 results file to an Ontology according to a given mapping file. One example would be mapping metabolomics | |
92 identifications to the CheBI ontology. | |
93 | |
94 | |
95 ----- | |
96 | |
97 **Output** | |
98 | |
99 This method will read in the given input file and for each line it will add a new column | |
100 containing the Ontology terms found for the ID in that line. So the output file is the same as the | |
101 input file + extra Ontology terms column (separated by ; ). | |
102 | |
103 A second summarized "ontology observations" file is also generated which can be used for visualizing the results | |
104 in an ontology viewer (e.g. see OntologyAndObservationsViewer). | |
105 | |
106 </help> | |
107 </tool> |