comparison w4mcorcov.xml @ 4:8bba31f628da draft

planemo upload for repository https://github.com/HegemanLab/w4mcorcov_galaxy_wrapper/tree/master commit 8f2dc8b66666340275cd8967e09c504720528462
author eschen42
date Sun, 04 Mar 2018 14:51:42 -0500
parents 5aaab36bc523
children 50f60f94c034
comparison
equal deleted inserted replaced
3:5aaab36bc523 4:8bba31f628da
1 <tool id="w4mcorcov" name="OPLS-DA_Contrasts" version="0.98.6"> 1 <tool id="w4mcorcov" name="OPLS-DA_Contrasts" version="0.98.7">
2 2
3 <description>OPLS-DA Contrasts of Univariate Results</description> 3 <description>OPLS-DA Contrasts of Univariate Results</description>
4 4
5 <requirements> 5 <requirements>
6 <requirement type="package">r-batch</requirement> 6 <requirement type="package">r-batch</requirement>
324 324
325 325
326 Motivation 326 Motivation
327 ---------- 327 ----------
328 328
329 OPLS-DA\ :superscript:`®` and the SIMCA\ :superscript:`®` S-PLOT\ :superscript:`®` (Wiklund *et al.*, 2008) may be employed to draw attention to metabolomic features that are potential biomarkers, i.e. features that are potentially useful to discriminate to which class a sample should be assigned (e.g. Sun *et al.*, 2016). Workflow4Metabolomics (W4M, Giacomoni *et al.*, 2014, Guitton *et al.*, 2017) provides a suite of tools for preprocessing and statistical analysis of LC-MS, GC-MS, and NMR metabolomics data; however, it does not (as of release 3.0) include a tool for making the equivalent of an S-PLOT. 329 OPLS-DA and the SIMCA S-PLOT (Wiklund *et al.*, 2008) may be employed to draw attention to metabolomic features that are potential biomarkers, i.e. features that are potentially useful to discriminate to which class a sample should be assigned (e.g. Sun *et al.*, 2016). Workflow4Metabolomics (W4M, Giacomoni *et al.*, 2014, Guitton *et al.*, 2017) provides a suite of tools for preprocessing and statistical analysis of LC-MS, GC-MS, and NMR metabolomics data; however, it does not (as of release 3.0) include a tool for making the equivalent of an S-PLOT.
330 330
331 The S-PLOT is computed from mean-centered, pareto-scaled data. This plot presents the correlation of the first score vector from an OPLS-DA model with the sample-variables used to produce that model versus the covariance of the scores with the sample-variables. For OPLS-DA, the first score vector represents the variation among the sample-variables that is related to the predictor (i.e., the contrasting factor). 331 The S-PLOT is computed from mean-centered, pareto-scaled data. This plot presents the correlation of the first score vector from an OPLS-DA model with the sample-variables used to produce that model versus the covariance of the scores with the sample-variables. For OPLS-DA, the first score vector represents the variation among the sample-variables that is related to the predictor (i.e., the contrasting factor).
332 332
333 The primary aims of this tool are: 333 The primary aims of this tool are:
334 334
340 Description 340 Description
341 ----------- 341 -----------
342 342
343 The purpose of the 'PLS-DA Contrasts' tool is to visualize GC-MS or LC-MS features that are possible biomarkers. 343 The purpose of the 'PLS-DA Contrasts' tool is to visualize GC-MS or LC-MS features that are possible biomarkers.
344 344
345 The W4M 'Univariate' tool (Thévenot *et al.*, 2015) adds the results of family-wise corrected pairwise significance-tests as columns of the **variableMetadata** dataset. 345 The W4M 'Univariate' tool (Th]]>&#233;<![CDATA[venot *et al.*, 2015) adds the results of family-wise corrected pairwise significance-tests as columns of the **variableMetadata** dataset.
346 For instance, suppose that you ran Kruskal-Wallis testing for a column named 'cluster' in sampleMetadata that has values 'k1' and 'k2' and at least one other value. 346 For instance, suppose that you ran Kruskal-Wallis testing for a column named 'cluster' in sampleMetadata that has values 'k1' and 'k2' and at least one other value.
347 347
348 - A column of variableMetadata would be labelled 'cluster_kruskal_sig' and would have values '1' and '0'; when the samples are grouped by 'cluster', '1' means that there is strong evidence against the hypothesis that there is no difference among the intensities for the feature across all sample-groups. 348 - A column of variableMetadata would be labelled 'cluster_kruskal_sig' and would have values '1' and '0'; when the samples are grouped by 'cluster', '1' means that there is strong evidence against the hypothesis that there is no difference among the intensities for the feature across all sample-groups.
349 - A column of variableMetadata would be labelled 'cluster_kruskal_k1.k2_sig' and would have values '1' and '0', where '1' means that there is significant evidence against the hypothesis that samples from sampleMetadata whose 'cluster' column contains 'k1' or 'k2' have the same intensity for that feature. 349 - A column of variableMetadata would be labelled 'cluster_kruskal_k1.k2_sig' and would have values '1' and '0', where '1' means that there is significant evidence against the hypothesis that samples from sampleMetadata whose 'cluster' column contains 'k1' or 'k2' have the same intensity for that feature.
350 350
351 The 'PLS-DA Contrasts' tool produces graphics and data for OPLS-DA contrasts of feature-intensities between significantly different pairs of factor-levels. For each factor-level, the tool performs a contrast with all other factor-levels combined and then separately with each other factor-level. 351 The 'PLS-DA Contrasts' tool produces graphics and data for OPLS-DA contrasts of feature-intensities between significantly different pairs of factor-levels. For each factor-level, the tool performs a contrast with all other factor-levels combined and then separately with each other factor-level.
352 352
353 **Along the left-to-right axis, the plots show the supervised projection of the variation explained by the predictor** (i.e., the factor specified when invoking the tool); **the top-to-bottom axis displays the variation that is orthogonal to the predictor level** (i.e., independent of it). 353 **Along the left-to-right axis, the plots show the supervised projection of the variation explained by the predictor** (i.e., the factor specified when invoking the tool); **the top-to-bottom axis displays the variation that is orthogonal to the predictor level** (i.e., independent of it).
354 354
355 Although this tool can be used in a purely exploratory manner by supplying the variableMetadata file without the columns added by the W4M 'Univariate' tool, **the preferred workflow is to use univariate testing to exclude features that are not significantly different and use OPLS-DA to visualize the differences identified in univariate testing** (Thévenot *et al.*, 2015); an appropriate exception would be to visualize contrasts of a specific list of metabolites. 355 Although this tool can be used in a purely exploratory manner by supplying the variableMetadata file without the columns added by the W4M 'Univariate' tool, **the preferred workflow is to use univariate testing to exclude features that are not significantly different and use OPLS-DA to visualize the differences identified in univariate testing** (Th]]>&#233;<![CDATA[venot *et al.*, 2015); an appropriate exception would be to visualize contrasts of a specific list of metabolites.
356 356
357 It must be stressed that there may be no *single* definitive computational approach to select features that are reliable biomarkers, especially from a small number of samples or experiments. A few possible choices are: 357 It must be stressed that there may be no *single* definitive computational approach to select features that are reliable biomarkers, especially from a small number of samples or experiments. A few possible choices are:
358 358
359 - picking features with maximum loadings along the projection parallel to the predictor (loadp), 359 - picking features with maximum loadings along the projection parallel to the predictor (loadp),
360 - examining extreme values on S-PLOTs (for which covariance is linearly related to loadp), 360 - examining extreme values on S-PLOTs (for which covariance is linearly related to loadp),
448 448
449 [OUT] Contrast-detail output PDF 449 [OUT] Contrast-detail output PDF
450 | Several plots for each two-projection OPLS-DA analysis: 450 | Several plots for each two-projection OPLS-DA analysis:
451 451
452 - (top-left) **correlation-versus-covariance plot** of OPLS-DA results (a work-alike for the S-PLOT, computed using formula in Supplement to Wiklund, *op. cit.*); point-color becomes saturated as the "variable importance in projection to the predictive components" (VIP\ :subscript:`4,p` from Galindo-Prieto *et al.* 2014) ranges from 0.83 and 1.21 (Mehmood *et al.* 2012) 452 - (top-left) **correlation-versus-covariance plot** of OPLS-DA results (a work-alike for the S-PLOT, computed using formula in Supplement to Wiklund, *op. cit.*); point-color becomes saturated as the "variable importance in projection to the predictive components" (VIP\ :subscript:`4,p` from Galindo-Prieto *et al.* 2014) ranges from 0.83 and 1.21 (Mehmood *et al.* 2012)
453 - (bottom-left) **model-overview plot** for the two projections; grey bars are the correlation coefficient for the fitted data; black bars indicate performance in cross-validation tests (Thévenot, 2017) 453 - (bottom-left) **model-overview plot** for the two projections; grey bars are the correlation coefficient for the fitted data; black bars indicate performance in cross-validation tests (Th]]>&#233;<![CDATA[venot, 2017)
454 - (top-right) OPLS-DA **scores-plot** for the two projections (Thévenot *et al.*, 2015) 454 - (top-right) OPLS-DA **scores-plot** for the two projections (Th]]>&#233;<![CDATA[venot *et al.*, 2015)
455 - (bottom-right) OPLS-DA **loadings-plot** for the two projections (*ibid.*) 455 - (bottom-right) OPLS-DA **loadings-plot** for the two projections (*ibid.*)
456 456
457 [OUT] Contrast Correlation-Covarinace data TABULAR 457 [OUT] Contrast Correlation-Covarinace data TABULAR
458 | A tab-separated values file of metadata for each feature for each contrast in which it was included. 458 | A tab-separated values file of metadata for each feature for each contrast in which it was included.
459 | Thus, a given feature may appear many times, but *the combination of featureID, factorLevel1, and factorLevel2 will be unique.* 459 | Thus, a given feature may appear many times, but *the combination of featureID, factorLevel1, and factorLevel2 will be unique.*
533 Working examples 533 Working examples
534 ---------------- 534 ----------------
535 535
536 **Input files** 536 **Input files**
537 537
538 +-------------------+-------------------------------------------------------------------------------------------------------------------+ 538 +-------------------------------------------------------------------------------------------------------------------+
539 | Input File | Download from URL | 539 | Download from URL |
540 +===================+===================================================================================================================+ 540 +===================================================================================================================+
541 | Data matrix | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/input_dataMatrix.tsv | 541 | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/input_dataMatrix.tsv |
542 +-------------------+-------------------------------------------------------------------------------------------------------------------+ 542 +-------------------------------------------------------------------------------------------------------------------+
543 | Sample metadata | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/input_sampleMetadata.tsv | 543 | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/input_sampleMetadata.tsv |
544 +-------------------+-------------------------------------------------------------------------------------------------------------------+ 544 +-------------------------------------------------------------------------------------------------------------------+
545 | Variable metadata | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/input_variableMetadata.tsv | 545 | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/input_variableMetadata.tsv |
546 +-------------------+-------------------------------------------------------------------------------------------------------------------+ 546 +-------------------------------------------------------------------------------------------------------------------+
547 547
548 **Example 1:** Include in the analysis only features identified as pair-wise significant in the Univariate test. 548 **Example 1:** Include in the analysis only features identified as pair-wise significant in the Univariate test.
549 549
550 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 550 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
551 | Input Parameter or Result | Value | 551 | Input Parameter or Result | Value |
560 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 560 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
561 | Level-name matching | use regular expressions for matching level-names | 561 | Level-name matching | use regular expressions for matching level-names |
562 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 562 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
563 | Number of features having extreme loadings | ALL | 563 | Number of features having extreme loadings | ALL |
564 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 564 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
565 | Label feat. having extreme orth. loadings | Yes |
566 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
565 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov.tsv | 567 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov.tsv |
566 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 568 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
567 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience.tsv | 569 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience.tsv |
568 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 570 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
569 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail.pdf | 571 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail.pdf |
584 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 586 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
585 | Level-name matching | use wild cards for matching level-names | 587 | Level-name matching | use wild cards for matching level-names |
586 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 588 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
587 | Number of features having extreme loadings | 5 | 589 | Number of features having extreme loadings | 5 |
588 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 590 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
591 | Label feat. having extreme orth. loadings | No |
592 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
589 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov_all.tsv | 593 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov_all.tsv |
590 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 594 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
591 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience_all.tsv | 595 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience_all.tsv |
592 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 596 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
593 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail_all.pdf | 597 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail_all.pdf |
608 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 612 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
609 | Level-name matching | use regular expressions for matching level-names | 613 | Level-name matching | use regular expressions for matching level-names |
610 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 614 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
611 | Number of features having extreme loadings | 0 | 615 | Number of features having extreme loadings | 0 |
612 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 616 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
617 | Label feat. having extreme orth. loadings | Yes |
618 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
613 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov_global.tsv | 619 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov_global.tsv |
614 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 620 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
615 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience_global.tsv | 621 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience_global.tsv |
616 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 622 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
617 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail_global.pdf | 623 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail_global.pdf |
632 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 638 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
633 | Level-name matching | use regular expressions for matching level-names | 639 | Level-name matching | use regular expressions for matching level-names |
634 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 640 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
635 | Number of features having extreme loadings | 3 | 641 | Number of features having extreme loadings | 3 |
636 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 642 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
643 | Label feat. having extreme orth. loadings | Yes |
644 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
637 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov_lohi.tsv | 645 | Output primary table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_corcov_lohi.tsv |
638 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 646 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
639 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience_lohi.tsv | 647 | Output salience table | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_salience_lohi.tsv |
640 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+ 648 +--------------------------------------------+----------------------------------------------------------------------------------------------------------------------------------------+
641 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail_lohi.pdf | 649 | Output figures PDF | https://raw.githubusercontent.com/HegemanLab/w4mcorcov_galaxy_wrapper/master/test-data/expected_contrast_detail_lohi.pdf |
643 651
644 652
645 Trademarks 653 Trademarks
646 ---------- 654 ----------
647 655
648 OPLS-DA\ :superscript:`®`, SIMCA\ :superscript:`®`, and S-PLOT\ :superscript:`®` are registered trademarks of the Umetrics company. http://umetrics.com/about-us/trademarks 656 OPLS-DA, SIMCA, and S-PLOT are registered trademarks of the Umetrics company. http://umetrics.com/about-us/trademarks
649 657
650 658
651 Release notes 659 Release notes
652 ------------- 660 -------------
661
662 0.98.7
663
664 - bug fix: handle case of a treatment level with only one sample.
653 665
654 0.98.6 666 0.98.6
655 667
656 - bug fix: set 'crossvalI' param (of R function 'ropls::opls') to the number of samples when the there are fewer than seven samples. 668 - bug fix: set 'crossvalI' param (of R function 'ropls::opls') to the number of samples when the there are fewer than seven samples.
657 669
687 <citation type="doi">10.1016/j.chemolab.2008.08.004</citation> 699 <citation type="doi">10.1016/j.chemolab.2008.08.004</citation>
688 <!-- Rinuardo 2016 --> 700 <!-- Rinuardo 2016 -->
689 <citation type="doi">10.3389/fmolb.2016.00026</citation> 701 <citation type="doi">10.3389/fmolb.2016.00026</citation>
690 <!-- Sun_2016 Urinary Biomarkers for adolescent idiopathic scoliosis --> 702 <!-- Sun_2016 Urinary Biomarkers for adolescent idiopathic scoliosis -->
691 <citation type="doi">10.1038/srep22274</citation> 703 <citation type="doi">10.1038/srep22274</citation>
692 <!-- Th_venot_2015 Urinary metabolome statistics --> 704 <!-- Thevenot_2015 Urinary metabolome statistics -->
693 <citation type="doi">10.1021/acs.jproteome.5b00354</citation> 705 <citation type="doi">10.1021/acs.jproteome.5b00354</citation>
694 <!-- ropls package --> 706 <!-- ropls package -->
695 <citation type="bibtex"><![CDATA[ 707 <citation type="bibtex"><![CDATA[
696 @incollection{Thevenot_ropls_2017, 708 @incollection{Thevenot_ropls_2017,
697 author = {Th{\'{e}}venot, Etienne A.}, 709 author = {Th{\'{e}}venot, Etienne A.},