comparison limma_voom.xml @ 6:39fa12a6d885 draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/limma_voom commit 60cae222b10f43f830936c19298bd723ac47e43d
author iuc
date Tue, 08 May 2018 18:12:40 -0400
parents d8a55b5f0de0
children e6a4ff41af6b
comparison
equal deleted inserted replaced
5:d8a55b5f0de0 6:39fa12a6d885
1 <tool id="limma_voom" name="limma" version="3.34.9.1"> 1 <tool id="limma_voom" name="limma" version="3.34.9.2">
2 <description> 2 <description>
3 Perform differential expression with limma-voom or limma-trend 3 Perform differential expression with limma-voom or limma-trend
4 </description> 4 </description>
5 5
6 <requirements> 6 <requirements>
80 #end if 80 #end if
81 81
82 -l '$adv.lfc' 82 -l '$adv.lfc'
83 -p '$adv.pVal' 83 -p '$adv.pVal'
84 -d '$adv.pAdjust' 84 -d '$adv.pAdjust'
85 -v '$adv.volgenes'
86 #if $adv.treat:
87 -T
88 #end if
85 89
86 #if $deMethod.de_select == 'voom': 90 #if $deMethod.de_select == 'voom':
87 #if $deMethod.weightOption: 91 #if $deMethod.weightOption:
88 -w 92 -w
89 #end if 93 #end if
183 </conditional> 187 </conditional>
184 188
185 <!-- Gene Annotations --> 189 <!-- Gene Annotations -->
186 <conditional name="anno"> 190 <conditional name="anno">
187 <param name="annoOpt" type="select" label="Use Gene Annotations?" 191 <param name="annoOpt" type="select" label="Use Gene Annotations?"
188 help="If you provide an annotation file, annotations will be added to the table(s) of differential expression results to provide descriptions for each gene. See Help section below."> 192 help="If you provide an annotation file, annotations will be added to the table(s) of differential expression results to provide descriptions for each gene, and used to label the top genes in the Volcano plot. See Help section below.">
189 <option value="no">No</option> 193 <option value="no">No</option>
190 <option value="yes">Yes</option> 194 <option value="yes">Yes</option>
191 </param> 195 </param>
192 <when value="yes"> 196 <when value="yes">
193 <param name="geneanno" type="data" format="tabular" label="Gene Annotations"/> 197 <param name="geneanno" type="data" format="tabular" label="Gene Annotations"/>
268 <option value="BH" selected="true">Benjamini and Hochberg (1995)</option> 272 <option value="BH" selected="true">Benjamini and Hochberg (1995)</option>
269 <option value="BY">Benjamini and Yekutieli (2001)</option> 273 <option value="BY">Benjamini and Yekutieli (2001)</option>
270 <option value="holm">Holm (1979)</option> 274 <option value="holm">Holm (1979)</option>
271 <option value="none">None</option> 275 <option value="none">None</option>
272 </param> 276 </param>
277 <param name="treat" type="boolean" truevalue="1" falsevalue="0" checked="False"
278 label="Test significance relative to a fold-change threshold (TREAT)"
279 help="If you want to apply a cut-off on a fold change the TREAT function can be used, see Help section below. Default: No"/>
280 <param name="volgenes" type="integer" value="10" min="0"
281 label="Number of genes to highlight in Volcano plot"
282 help="The top DE genes will be highlighted in the Volcano plot for each contrast. Default: 10."/>
273 <param name="normalisationOption" type="select" label="Normalisation Method" help="Default: TMM"> 283 <param name="normalisationOption" type="select" label="Normalisation Method" help="Default: TMM">
274 <option value="TMM" selected="true">TMM</option> 284 <option value="TMM" selected="true">TMM</option>
275 <option value="RLE">RLE</option> 285 <option value="RLE">RLE</option>
276 <option value="upperquartile">Upperquartile</option> 286 <option value="upperquartile">Upperquartile</option>
277 <option value="none">None (Don't normalise)</option> 287 <option value="none">None (Don't normalise)</option>
612 ========== ======= ======= ======= ======== ======== ======== 622 ========== ======= ======= ======= ======== ======== ========
613 623
614 **Gene Annotations:** 624 **Gene Annotations:**
615 Optional input for gene annotations, this can contain more 625 Optional input for gene annotations, this can contain more
616 information about the genes than just an ID number. The annotations will 626 information about the genes than just an ID number. The annotations will
617 be available in the differential expression results table and the optional normalised counts table. The file must contain a header row and have the gene IDs in the first column. The number of rows should match that of the counts files, add NA for any gene IDs with no annotation. The Galaxy tool **annotateMyIDs** can be used to obtain annotations for human, mouse, fly and zebrafish. 627 be available in the differential expression results table and the optional normalised counts table. The file must contain a header row and have the gene IDs in the first column. The second column will be used to label the genes in the Volcano plot instead of the default Gene IDs. The number of rows should match that of the counts files, add NA for any gene IDs with no annotation. The Galaxy tool **annotateMyIDs** can be used to obtain annotations for human, mouse, fly and zebrafish.
618 628
619 Example: 629 Example:
620 630
621 ========== ========== =================================================== 631 ========== ========== ===================================================
622 **GeneID** **Symbol** **GeneName** 632 **GeneID** **Symbol** **GeneName**
710 720
711 * **P-Value Adjustment Method:** 721 * **P-Value Adjustment Method:**
712 Change the multiple testing control method, the options are BH(1995) and 722 Change the multiple testing control method, the options are BH(1995) and
713 BY(2001) which are both false discovery rate controls. There is also 723 BY(2001) which are both false discovery rate controls. There is also
714 Holm(1979) which is a method for family-wise error rate control. 724 Holm(1979) which is a method for family-wise error rate control.
725
726 **Testing relative to a threshold (TREAT):**
727 If there are a lot of differentially expressed genes, a fold change threshold can be applied in addition to the P-value threshold to select genes that are more likely to be biologically significant. However, ranking by P-value and discarding genes with small logFCs can increase the false discovery rate. Using the limma TREAT function performs this analysis correctly (`McCarthy and Smyth, 2009`_).
715 728
716 **Normalisation Method:** 729 **Normalisation Method:**
717 The most obvious technical factor that affects the read counts, other than gene expression 730 The most obvious technical factor that affects the read counts, other than gene expression
718 levels, is the sequencing depth of each RNA sample. edgeR adjusts any differential expression 731 levels, is the sequencing depth of each RNA sample. edgeR adjusts any differential expression
719 analysis for varying sequencing depths as represented by differing library sizes. This is 732 analysis for varying sequencing depths as represented by differing library sizes. This is
727 proportion of the total library size, causing the remaining genes to be under-sampled in that 740 proportion of the total library size, causing the remaining genes to be under-sampled in that
728 sample. Unless this RNA composition effect is adjusted for, the remaining genes may falsely 741 sample. Unless this RNA composition effect is adjusted for, the remaining genes may falsely
729 appear to be down-regulated in that sample . The edgeR `calcNormFactors` function normalizes for RNA composition by finding a set of scaling factors for the library sizes that minimize the log-fold changes between the samples for most genes. The default method for computing these scale factors uses a trimmed mean of M values (TMM) between each pair of samples. We call the product of the original library size and the scaling factor the *effective library size*. The effective library size replaces the original library size in all downsteam analyses. TMM is the recommended method for most RNA-Seq data where the majority (more than half) of the genes are believed not differentially expressed between any pair of the samples. You can change the normalisation method under **Advanced Options** above. For more information, see the `calcNormFactors` section in the `edgeR User's Guide`_. 742 appear to be down-regulated in that sample . The edgeR `calcNormFactors` function normalizes for RNA composition by finding a set of scaling factors for the library sizes that minimize the log-fold changes between the samples for most genes. The default method for computing these scale factors uses a trimmed mean of M values (TMM) between each pair of samples. We call the product of the original library size and the scaling factor the *effective library size*. The effective library size replaces the original library size in all downsteam analyses. TMM is the recommended method for most RNA-Seq data where the majority (more than half) of the genes are believed not differentially expressed between any pair of the samples. You can change the normalisation method under **Advanced Options** above. For more information, see the `calcNormFactors` section in the `edgeR User's Guide`_.
730 743
731 **Robust Settings** 744 **Robust Settings**
732 Option to use robust settings with eBayes, used by both liamm-voom and limma-trend. Using robust settings is usually recommended to protect against outlier genes, for more information see the `limma User's Guide`_. This is turned on by default. 745 Option to use robust settings with eBayes or TREAT, used by both limma-voom and limma-trend. Using robust settings is usually recommended to protect against outlier genes, for more information see the `limma User's Guide`_ and `Phipson et al. 2016`_. This is turned on by default.
733 746
734 **Prior Count:** 747 **Prior Count:**
735 If the limma-trend method is used, a count (`prior.count`) is added to all counts to avoid taking a log of zero, and damp down the variances of logarithms of low counts. A default of 3 is used, as recommended in the `limma User's Guide`_. 748 If the limma-trend method is used, a count (`prior.count`) is added to all counts to avoid taking a log of zero, and damp down the variances of logarithms of low counts. A default of 3 is used, as recommended in the `limma User's Guide`_.
736 749
737 **Apply Sample Weights:** 750 **Apply Sample Weights:**
810 .. _limma: http://www.bioconductor.org/packages/release/bioc/html/limma.html 823 .. _limma: http://www.bioconductor.org/packages/release/bioc/html/limma.html
811 .. _limma approach: https://www.ncbi.nlm.nih.gov/pubmed/25605792 824 .. _limma approach: https://www.ncbi.nlm.nih.gov/pubmed/25605792
812 .. _limma User's Guide: http://bioconductor.org/packages/release/bioc/vignettes/limma/inst/doc/usersguide.pdf 825 .. _limma User's Guide: http://bioconductor.org/packages/release/bioc/vignettes/limma/inst/doc/usersguide.pdf
813 .. _edgeR: http://www.bioconductor.org/packages/release/bioc/html/edgeR.html 826 .. _edgeR: http://www.bioconductor.org/packages/release/bioc/html/edgeR.html
814 .. _edgeR User's Guide: https://bioconductor.org/packages/release/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf 827 .. _edgeR User's Guide: https://bioconductor.org/packages/release/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf
828 .. _McCarthy and Smyth, 2009: https://www.ncbi.nlm.nih.gov/pubmed/19176553
829 .. _Phipson et al. 2016: https://www.ncbi.nlm.nih.gov/pubmed/28367255
815 ]]></help> 830 ]]></help>
816 <citations> 831 <citations>
817 <citation type="doi">10.1186/gb-2014-15-2-r29</citation> 832 <citation type="doi">10.1186/gb-2014-15-2-r29</citation>
818 <citation type="doi">10.1093/nar/gkv412</citation> 833 <citation type="doi">10.1093/nar/gkv412</citation>
819 </citations> 834 </citations>