Perform statistical analysis of preprocessed MaxQuant output data collected as described in [Cheng, 2018].
- Extracts sample-group IDs from sample names.
- Imputes missing values.
- Performs ANOVA analysis for each phosphopeptide.
- Performs Kinase-Substrate Enrichment Analysis (KSEA) using the method described by Casado et al. (2013); see "Algorithms" section below.
Phosphopeptides annotated with SwissProt and phosphosite metadata (in tabular format). This is the output from the "MaxQuant Phopsphopeptide Preprocessing" (mqppep_preproc) tool.
- First column label 'Phosphopeptide'.
- Sample-intensities must begin in first column matching 'Intensity-column pattern' and must have column labels to match argument 'Sample-name extraction pattern'.
0.2 0.1 0.05
First column of Filtered phosphopeptide intensities having intensity values (integer or PERL-compatible regular expression matching column label). Default:
^Intensity[^_]
Impute missing values by:
- group-median - use median for each sample-group;
- mean - use mean across all samples; or
- median - use median across all samples;
- random - use randomly generated values where:
- Mean percentile for random values specifies the percentile among non-missing values to be used as mean of random values, and
- Percentile SD for random values specifies the factor to be multiplied by the standard deviation among the non-missing values (across all samples) to determine the standard deviation of random values.
PERL-compatible regular expression extracting the sample-name from the the name of a column of intensities (from Filtered phosphopeptide intensities) for one sample.
- For example, "\.\d+[A-Z]$" applied to "Intensity.splunge.10A" would produce ".10A".
- Note that this is case sensitive by default.
PERL-compatible regular expression extracting the sample-grouping from the sample-name (that was in turn extracted with Sample-name extraction pattern from a column of intensites from Filtered phosphopeptide intensities).
- For example, "\d+$" applied to ".10A" would produce "10".
- Note that this is case sensitive by default.
[input file].[imputation method]-imputed_report
Summary report for normalization, imputation, and ANOVA, in PDF format.
[input file].[imputation method]-imputed_intensities
Phosphopeptide MS intensities where missing values have been imputed by the chosen method, in tabular format.
[input file].[imputation method]-imputed_QN_LT_intensities
Phosphopeptide MS intensities where missing values have been imputed by the chosen method, quantile-normalized (QN), and log10-transformed (LT), in tabular format.
The KSEA algorithm used here is as in the KSEAapp package as reported in [Wiredja 2017]. The code is adapted from "Danica D. Wiredja (2017). KSEAapp: Kinase-Substrate Enrichment Analysis. R package version 0.99.0." to work with output from the "MaxQuant Phosphopeptide Preprocessing" Galaxy tool and the multiple kinase-substrate databases that the latter tool searches.
Note that the PERL-compatible regular expressions accepted by this tool are documented at http://rdrr.io/r/base/regex.html