annotate READme.md @ 3:08dda0f86758 draft

"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
author galaxyp
date Wed, 21 Oct 2020 16:22:53 +0000
parents
children b0de3731dca5
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
1 MT2MQ
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
2 ==========================================
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
3
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
4 Description
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
5 -----------
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
6
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
7 For multi-omics data analysis of microbiome data, the Galaxy-P team has developed a tool – MT2MQ – which takes in metatranscriptomics gene families
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
8 output from ASaiM workflow and converts it to GO/EC terms. This tool helps transform the metatranscriptomics output which can be then used as an input for
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
9 comparative statistical analysis via metaQuantome.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
10
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
11 Authors
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
12 -------
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
13
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
14 Authors and contributors:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
15
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
16 * Marie Crane
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
17 * Praveen Kumar
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
18 * Subina Mehta
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
19 * Dihn Duy An Nguyen
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
20 * Pratik Jagtap
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
21
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
22
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
23 # Instructions to run MT2MQ:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
24 --------------------------
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
25
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
26 The ASAIM workflow can be run following the training module on the [GTN](https://training.galaxyproject.org/training-material/topics/metagenomics/tutorials/metatranscriptomics/tutorial.html).
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
27 However, for training purposes we have provided inputs in the [test data](https://github.com/galaxyproteomics/tools-galaxyp/tree/master/tools/mt2mq/test-data).
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
28
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
29 ## Data upload
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
30
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
31 - Upload the files mentioned below to the Galaxy Europe instance.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
32 ```
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
33 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4A.tsv
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
34 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4B.tsv
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
35 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4C.tsv
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
36 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7A.tsv
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
37 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7B.tsv
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
38 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7C.tsv
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
39 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4T7_func.tsv
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
40
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
41 ```
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
42
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
43 ## Functional mode:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
44
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
45 1. Build a **Dataset list** for the six .tsv files( `T4A`,`T4B`,`T4C`,`T7A`,`T7B`,`T7C`).
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
46 - Click the **Operations on multiple datasets** check box at the top of the history panel.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
47 - Select the files mentioned above.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
48 - Click on ** For all selected** drop down menu and select **Build Dataset list**.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
49 - Once the collection is created, rename the dataset collection as `Input collection`.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
50
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
51 2. Download the map_go_uniref50.txt file from zenodo.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
52
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
53 3. Run the **Regroup a HUMAnN2 generated table by features**(Galaxy Version 0.11.1.0) tool is regrouping table features (abundances or coverage) given a table of feature values and a mapping of groups to component features. It produces a new table with group values in place of feature values.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
54 - [**Regroup a HUMAnN2 generated table by features**](https://toolshed.g2.bx.psu.edu/repository?repository_id=85391b8d5d7ad39d) with the following parameters:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
55
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
56 - *"Gene/pathway table"*: `Input collection`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
57 - *"How to combine grouped features?"*: `Sum`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
58 - In *"Use built-in grouping options?"*: `No`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
59 - *"Custom groups file"*: `map_go_uniref50.txt`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
60 - *"Is the groups file reversed?"*: `No`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
61 - *"Decimal places to round to after applying function"*: `3`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
62 - *"Include an 'UNGROUPED' group to capture features that did not belong to other groups?"*: `Yes`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
63 - *"Carry through protected features, such as 'UNMAPPED'?"*: `Yes`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
64
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
65 Once this tool is run, rename the dataset collection as `Regrouped collection` .
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
66
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
67 4. Run the **Rename features of a HUMAnN2 generated table** (Galaxy Version 0.11.1.0)tool to change the Uniref-50 values to GO term .
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
68 - [**Rename features of a HUMAnN2 generated table**](https://toolshed.g2.bx.psu.edu/repository?repository_id=c68108109505c2f5) with the following parameters:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
69
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
70 - *"Gene/pathway table"*: `Regrouped collection`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
71 - *"Type of renaming"*: `Standard renaming`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
72 - *"Table features that can be renamed?"*: `Gene Ontology (GO)`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
73 - *"Remove non-alphanumeric characters from names?"*: `No`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
74
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
75 Once this tool is run, rename the dataset collection as `Renamed collection`.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
76
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
77
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
78 5. Run the **Join HUMAnN2 generated tables** (Galaxy Version 0.11.1.1) tool to merge all the files into one.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
79 - [**Join HUMAnN2 generated tables**](https://toolshed.g2.bx.psu.edu/repository?repository_id=9b27f096128b26ff) with the following parameters:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
80
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
81 - *"Gene/pathway table"*: `Renamed collection`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
82
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
83 Once this tool is run, rename the dataset collection as `Joined Data`.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
84
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
85 6. Run the **Renormalize a HUMAnN2 generated table** (Galaxy Version 0.11.1.0) tool to normalize the data.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
86 - [**Renormalize a HUMAnN2 generated table**](https://toolshed.g2.bx.psu.edu/repository?repository_id=05a56fcdeac2a25c) with the following parameters:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
87
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
88 - *"Gene/pathway table"*: `Joined Data`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
89 - *"Normalization scheme"*: `Copies per million`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
90 - *"Normalization level"*: `Normalization of all levels by community total`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
91 - *"Include the special features UNMAPPED, UNINTEGRATED, and UNGROUPED?"*: `Yes`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
92 - *"Update '-RPK' in sample names to appropriate suffix?"*: `No`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
93
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
94 Once this tool is run, rename the dataset collection as `Renormalized data`.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
95
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
96
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
97 7. Now that the data is ready, we can run **MT2MQ Tool to prepare metatranscriptomic outputs from ASaiM for Metaquantome** (Galaxy Version 1.1.0)on this data.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
98 - [**MT2MQ Tool to prepare metatranscriptomic outputs from ASaiM for Metaquantome**](https://toolshed.g2.bx.psu.edu/repository?repository_id=cab5d81c5f0a2f94) with the
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
99 following parameters:
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
100 - *"Mode"*: `Function`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
101 - *"GO namespace"*: `Molecular Function` or `Biological Process` or ` Cellular Component`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
102 - *"File from HUMAnN2 after regrouping, renaming, joining, and renormalizing"*: `Renormalized data`
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
103
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
104 **Note** : The MT2MQ tools can be run will all three GO name space.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
105
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
106 There are two tabular outputs from this tool.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
107
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
108 - A f_int.tabular output which mimics the Intensity input file for metaQuantome.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
109 - A func.tabular output which mimics the Functional input file for metaQuantome.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
110
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
111 The resulting output files can be used as input for metaQuatome's functional mode.
08dda0f86758 "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
112 To run metaQuantome Function mode. Follow the [GTN](https://github.com/subinamehta/training-material/tree/metaquantome-2-3/topics/proteomics/tutorials/metaquantome-function).