Mercurial > repos > galaxyp > mt2mq
annotate READme.md @ 4:b0de3731dca5 draft
"planemo upload commit 987f052820b4b382b1ba65f32893fc0b6c8bccf9"
author | galaxyp |
---|---|
date | Wed, 21 Oct 2020 16:26:01 +0000 |
parents | 08dda0f86758 |
children | c3a418f7cf7f |
rev | line source |
---|---|
3
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
1 MT2MQ |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
2 ========================================== |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
3 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
4 Description |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
5 ----------- |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
6 |
4
b0de3731dca5
"planemo upload commit 987f052820b4b382b1ba65f32893fc0b6c8bccf9"
galaxyp
parents:
3
diff
changeset
|
7 In order to enable multi-omics data analysis of microbiome data, the Galaxy-P team has developed a tool – MT2MQ – which processes metatranscriptomics gene families output from ASaiM workflow and converts it to Gene Ontology (GO) or EC terms. The processed metatranscriptomics output can be subsequently used as an input for comparative statistical analysis via [metaQuantome](https://www.mcponline.org/content/18/8_suppl_1/S82) software suite. |
3
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
8 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
9 Authors |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
10 ------- |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
11 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
12 Authors and contributors: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
13 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
14 * Marie Crane |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
15 * Praveen Kumar |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
16 * Subina Mehta |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
17 * Dihn Duy An Nguyen |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
18 * Pratik Jagtap |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
19 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
20 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
21 # Instructions to run MT2MQ: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
22 -------------------------- |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
23 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
24 The ASAIM workflow can be run following the training module on the [GTN](https://training.galaxyproject.org/training-material/topics/metagenomics/tutorials/metatranscriptomics/tutorial.html). |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
25 However, for training purposes we have provided inputs in the [test data](https://github.com/galaxyproteomics/tools-galaxyp/tree/master/tools/mt2mq/test-data). |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
26 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
27 ## Data upload |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
28 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
29 - Upload the files mentioned below to the Galaxy Europe instance. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
30 ``` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
31 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4A.tsv |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
32 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4B.tsv |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
33 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4C.tsv |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
34 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7A.tsv |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
35 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7B.tsv |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
36 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7C.tsv |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
37 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4T7_func.tsv |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
38 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
39 ``` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
40 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
41 ## Functional mode: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
42 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
43 1. Build a **Dataset list** for the six .tsv files( `T4A`,`T4B`,`T4C`,`T7A`,`T7B`,`T7C`). |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
44 - Click the **Operations on multiple datasets** check box at the top of the history panel. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
45 - Select the files mentioned above. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
46 - Click on ** For all selected** drop down menu and select **Build Dataset list**. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
47 - Once the collection is created, rename the dataset collection as `Input collection`. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
48 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
49 2. Download the map_go_uniref50.txt file from zenodo. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
50 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
51 3. Run the **Regroup a HUMAnN2 generated table by features**(Galaxy Version 0.11.1.0) tool is regrouping table features (abundances or coverage) given a table of feature values and a mapping of groups to component features. It produces a new table with group values in place of feature values. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
52 - [**Regroup a HUMAnN2 generated table by features**](https://toolshed.g2.bx.psu.edu/repository?repository_id=85391b8d5d7ad39d) with the following parameters: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
53 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
54 - *"Gene/pathway table"*: `Input collection` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
55 - *"How to combine grouped features?"*: `Sum` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
56 - In *"Use built-in grouping options?"*: `No` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
57 - *"Custom groups file"*: `map_go_uniref50.txt` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
58 - *"Is the groups file reversed?"*: `No` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
59 - *"Decimal places to round to after applying function"*: `3` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
60 - *"Include an 'UNGROUPED' group to capture features that did not belong to other groups?"*: `Yes` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
61 - *"Carry through protected features, such as 'UNMAPPED'?"*: `Yes` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
62 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
63 Once this tool is run, rename the dataset collection as `Regrouped collection` . |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
64 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
65 4. Run the **Rename features of a HUMAnN2 generated table** (Galaxy Version 0.11.1.0)tool to change the Uniref-50 values to GO term . |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
66 - [**Rename features of a HUMAnN2 generated table**](https://toolshed.g2.bx.psu.edu/repository?repository_id=c68108109505c2f5) with the following parameters: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
67 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
68 - *"Gene/pathway table"*: `Regrouped collection` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
69 - *"Type of renaming"*: `Standard renaming` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
70 - *"Table features that can be renamed?"*: `Gene Ontology (GO)` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
71 - *"Remove non-alphanumeric characters from names?"*: `No` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
72 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
73 Once this tool is run, rename the dataset collection as `Renamed collection`. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
74 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
75 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
76 5. Run the **Join HUMAnN2 generated tables** (Galaxy Version 0.11.1.1) tool to merge all the files into one. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
77 - [**Join HUMAnN2 generated tables**](https://toolshed.g2.bx.psu.edu/repository?repository_id=9b27f096128b26ff) with the following parameters: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
78 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
79 - *"Gene/pathway table"*: `Renamed collection` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
80 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
81 Once this tool is run, rename the dataset collection as `Joined Data`. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
82 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
83 6. Run the **Renormalize a HUMAnN2 generated table** (Galaxy Version 0.11.1.0) tool to normalize the data. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
84 - [**Renormalize a HUMAnN2 generated table**](https://toolshed.g2.bx.psu.edu/repository?repository_id=05a56fcdeac2a25c) with the following parameters: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
85 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
86 - *"Gene/pathway table"*: `Joined Data` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
87 - *"Normalization scheme"*: `Copies per million` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
88 - *"Normalization level"*: `Normalization of all levels by community total` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
89 - *"Include the special features UNMAPPED, UNINTEGRATED, and UNGROUPED?"*: `Yes` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
90 - *"Update '-RPK' in sample names to appropriate suffix?"*: `No` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
91 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
92 Once this tool is run, rename the dataset collection as `Renormalized data`. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
93 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
94 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
95 7. Now that the data is ready, we can run **MT2MQ Tool to prepare metatranscriptomic outputs from ASaiM for Metaquantome** (Galaxy Version 1.1.0)on this data. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
96 - [**MT2MQ Tool to prepare metatranscriptomic outputs from ASaiM for Metaquantome**](https://toolshed.g2.bx.psu.edu/repository?repository_id=cab5d81c5f0a2f94) with the |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
97 following parameters: |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
98 - *"Mode"*: `Function` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
99 - *"GO namespace"*: `Molecular Function` or `Biological Process` or ` Cellular Component` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
100 - *"File from HUMAnN2 after regrouping, renaming, joining, and renormalizing"*: `Renormalized data` |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
101 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
102 **Note** : The MT2MQ tools can be run will all three GO name space. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
103 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
104 There are two tabular outputs from this tool. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
105 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
106 - A f_int.tabular output which mimics the Intensity input file for metaQuantome. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
107 - A func.tabular output which mimics the Functional input file for metaQuantome. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
108 |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
109 The resulting output files can be used as input for metaQuatome's functional mode. |
08dda0f86758
"planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff
changeset
|
110 To run metaQuantome Function mode. Follow the [GTN](https://github.com/subinamehta/training-material/tree/metaquantome-2-3/topics/proteomics/tutorials/metaquantome-function). |