Authors Colin A. Smith csmith@scripps.edu, Ralf Tautenhahn rtautenh@gmail.com, Steffen Neumann sneumann@ipb-halle.de, Paul Benton hpaul.benton08@imperial.ac.uk and Christopher Conley cjconley@ucdavis.edu
Galaxy integration ABiMS TEAM - SU/CNRS - Station biologique de Roscoff and Yann Guitton - LABERCA Part of Workflow4Metabolomics.org [W4M]
Contact support@workflow4metabolomics.org for any questions or concerns about the Galaxy implementation of this tool.
This tool is used for preprocessing data from multiple LC/MS files (NetCDF, mzXML and mzData formats) using the xcms R package. It extracts ions from each sample independently, and using a statistical model, peaks are filtered and integrated. A tutorial on how to perform xcms preprocessing is available as GTN (Galaxy Training Network).
Upstream tools
Name | Output file | Format |
---|---|---|
MSnbase.readMSData | *.raw.RData | rdata.msnbase.raw |
Downstream tools
Name | Output file | Format |
---|---|---|
xcms.findChromPeaks Merger (single) | *.raw.xset.RData | rdata.xcms.findchrompeaks |
xcms.groupChromPeaks (zip) | *.raw.xset.RData | rdata.xcms.findchrompeaks |
Example of a metabolomic workflow
MatchedFilter
The matchedFilter algorithm identifies peaks in the chromatographic time domain as described in [Smith 2006]. The intensity values are binned by cutting The LC/MS data into slices (bins) of a mass unit (‘binSize’ m/z) wide. Within each bin the maximal intensity is selected. The chromatographic peak detection is then performed in each bin by extending it based on the ‘steps’ parameter to generate slices comprising bins ‘current_bin - steps +1’ to ‘current_bin + steps - 1’. Each of these slices is then filtered with matched filtration using a second-derative Gaussian as the model peak shape. After filtration peaks are detected using a signal-to-ratio cut-off. For more details and illustrations see [Smith 2006].See the MatchedFilter_manual
CentWave
The centWave algorithm perform peak density and wavelet based chromatographic peak detection for high resolution LC/MS data in centroid mode [Tautenhahn 2008].Due to the fact that peak centroids are used, a binning step is not necessary.The method is capable of detecting close-by-peaks and also overlapping peaks. Some efforts are made to detect the exact peak boundaries to get precise peak integrals.See the CentWave_manual
CentWaveWithPredIsoROIs
This method performs a two-step centWave-based chromatographic peak detection: in a first centWave run peaks are identified for which then the location of their potential isotopes in the mz-retention time is predicted. A second centWave run is then performed on these regions of interest (ROIs). The final list of chromatographic peaks comprises all non-overlapping peaks from both centWave runs.See the CentWaveWithPredIsoROIs_manual
MSW
Wavelet based, used for direct infusion data. Continuous wavelet transform (CWT) can be used to locate chromatographic peaks on different scales.See the MSW_manual
For details and explanations concerning all the parameters and workflow of xcms package, see its manual and this example
xset.RData: rdata.xcms.findchrompeaks format
(single) RData files that are necessary in the second step of the workflow "xcms.groupChromPeaks" - must be merged first using "xcms.findChromPeaks Merger"(zip) RData file that is necessary in the second step of the workflow "xcms.groupChromPeaks".
Version 3.12.0+galaxy* - 03/03/2020
Version 3.6.1+galaxy1 - 22/04/2020
Version 3.6.1+galaxy* - 03/09/2019
Version 3.4.4.1 - 30/04/2019
Version 3.4.4.0 - 08/02/2019
Version 3.0.0.0 - 08/03/2018
Version 2.1.1 - 29/11/2017
Version 2.1.0 - 22/02/2017
Version 2.0.11 - 22/12/2016
Version 2.0.10 - 22/12/2016
Version 2.0.9 - 06/07/2016
Version 2.0.8 - 06/04/2016
Version 2.0.7 - 10/02/2016
Version 2.0.2 - 18/01/2016
Version 2.0.2 - 09/10/2015
Version 2.0.2 - 02/06/2015