Mercurial > repos > recetox > mfassignr_isofiltr
comparison help.xml @ 3:e7ef7718505d draft
planemo upload for repository https://github.com/RECETOX/galaxytools/tree/master/tools/mfassignr commit c6e502d8af84750003e4ba001c61817acedd1896
author | recetox |
---|---|
date | Fri, 13 Sep 2024 10:08:55 +0000 |
parents | 079ae9aa6ed5 |
children |
comparison
equal
deleted
inserted
replaced
2:e4ddc3bc7e15 | 3:e7ef7718505d |
---|---|
30 (1) Run KMDNoise() to determine the noise level for the data. | 30 (1) Run KMDNoise() to determine the noise level for the data. |
31 (2) Check effectiveness of S/N threshold using SNplot(). | 31 (2) Check effectiveness of S/N threshold using SNplot(). |
32 (3) Use IsoFiltR() to identify potential 13C and 34S isotope masses. | 32 (3) Use IsoFiltR() to identify potential 13C and 34S isotope masses. |
33 (4) Using the S/N threshold, and the two data frames output from IsoFiltR(), run MFAssignCHO() to assign MF with C, H, and O to assess the mass accuracy. | 33 (4) Using the S/N threshold, and the two data frames output from IsoFiltR(), run MFAssignCHO() to assign MF with C, H, and O to assess the mass accuracy. |
34 (5) Use RecalList() to generate a list of the potential recalibrant series. | 34 (5) Use RecalList() to generate a list of the potential recalibrant series. |
35 (6) After choosing recalibrant series, use Recal() to recalibrate the mass lists. | 35 (6) Choose the most suitable recalibrant series using FindRecalSeries(). |
36 (7) Assign MF to the recalibrated mass list using MFAssign(). | 36 (7) After choosing recalibrant series, use Recal() to recalibrate the mass lists. |
37 (8) Check the output plots from MFAssign() to evaluate the quality of the assignments. | 37 (8) Assign MF to the recalibrated mass list using MFAssign(). |
38 (9) Check the output plots from MFAssign() to evaluate the quality of the assignments. | |
38 | 39 |
39 For detailed documentation on the individual steps please see the individual tool wrappers. | 40 For detailed documentation on the individual steps please see the individual tool wrappers. |
40 </token> | 41 </token> |
41 | 42 |
42 <token name="@KMDNOISE_HELP@"> | 43 <token name="@KMDNOISE_HELP@"> |
47 | 48 |
48 KMDnoise is a Kendrick Mass Defect (KMD) approach for the noise estimation. It selects a subset of the data using the linear equation y=0.1132x + b, where y stands for the KMD value, x for the measured ion mass and b is the y-intercept. The default y-intercepts of 0.05 and 0.2 in KMDNoise are used to isolate the largest analyte free region of noise in most mass spectra. The intensity of the peaks within this “slice” are then averaged and that value is defined as the noise level for the mass spectrum. This value is then multiplied with a user-defined signal-to-noise ratio (typically 3-10) to remove low intensity m/z values. | 49 KMDnoise is a Kendrick Mass Defect (KMD) approach for the noise estimation. It selects a subset of the data using the linear equation y=0.1132x + b, where y stands for the KMD value, x for the measured ion mass and b is the y-intercept. The default y-intercepts of 0.05 and 0.2 in KMDNoise are used to isolate the largest analyte free region of noise in most mass spectra. The intensity of the peaks within this “slice” are then averaged and that value is defined as the noise level for the mass spectrum. This value is then multiplied with a user-defined signal-to-noise ratio (typically 3-10) to remove low intensity m/z values. |
49 | 50 |
50 Output: | 51 Output: |
51 | 52 |
52 - noise estimate - (this noise level can then be multiplied by the user chosen value (3, 6, 10) in order to set the signal to noise cut for formula assignment.) | 53 - noise estimate - this noise level can then be multiplied by the user chosen value (3, 6, 10) in order to set the signal to noise cut for formula assignment. |
53 - KMD plot - bounds of the noise estimation area are highlighted in red | 54 - KMD plot - bounds of the noise estimation area are highlighted in red. |
54 </token> | 55 </token> |
55 | 56 |
56 <token name="@HISTNOISE_HELP@"> | 57 <token name="@HISTNOISE_HELP@"> |
57 MFAssignR - HistNoise | 58 MFAssignR - HistNoise |
58 ============================= | 59 ============================= |
62 HistNoise function creates a histogram using natural log of the intensity, which can be then used to determine the noise level for the data analyze, and also the estimated noise level. The noise level can be then multiplied by whatever value in order to reach the value to be used to cut the data. | 63 HistNoise function creates a histogram using natural log of the intensity, which can be then used to determine the noise level for the data analyze, and also the estimated noise level. The noise level can be then multiplied by whatever value in order to reach the value to be used to cut the data. |
63 | 64 |
64 Output: | 65 Output: |
65 | 66 |
66 - noise estimate - this noise level can then be multiplied by the user chosen value in order to set the signal to noise cut for formula assignment | 67 - noise estimate - this noise level can then be multiplied by the user chosen value in order to set the signal to noise cut for formula assignment |
67 - Histogram - shows where the cut is being applied123 | 68 - Histogram - shows where the cut is being applied |
68 | 69 |
69 </token> | 70 </token> |
70 | 71 |
71 <token name="@SNPLOT_HELP@"> | 72 <token name="@SNPLOT_HELP@"> |
72 MFAssignR - SNplot | 73 MFAssignR - SNplot |
116 | 117 |
117 <token name="@RECALLIST_HELP@"> | 118 <token name="@RECALLIST_HELP@"> |
118 MFAssignR - RecalList | 119 MFAssignR - RecalList |
119 ============================= | 120 ============================= |
120 | 121 |
121 This tool is the fifth step of the MFAssignR workflow (MFAssignCHO -> RecalList -> Recal) | 122 This tool is the fifth step of the MFAssignR workflow (MFAssignCHO -> RecalList -> FindRecalSeries) |
122 | 123 |
123 RecalList() function identifies the homologous series that could be used for recalibration. On the input, there is the output from MFAssign() or MFAssignCHO() functions. It returns a dataframe that contains the CH2 homologous series that contain more than 3 members. | 124 RecalList() function identifies the homologous series that could be used for recalibration. On the input, there is the output from MFAssign() or MFAssignCHO() functions. It returns a dataframe that contains the CH2 homologous series that contain more than 3 members. |
124 | 125 |
125 Output: | 126 Output: |
126 | 127 |
127 - Dataframe that contains the CH2 homologous series that contain more than 3 members. | 128 - Dataframe that contains the CH2 homologous series that contain more than 3 members. |
128 </token> | 129 </token> |
129 | 130 |
131 <token name="@FINDRECALSERIES_HELP@"> | |
132 MFAssignR - FindRecalSeries | |
133 ============================= | |
134 | |
135 This tool is the sixth step of the MFAssignR workflow (RecalList -> FindRecalSeries -> Recal) | |
136 | |
137 This function takes on input the CH2 homologous recalibration series, which are provided by the RecalList function and tries to find the most suitable series combination for recalibration based on the following criteria: | |
138 | |
139 (1) Series should cover the full mass spectral range, | |
140 (2) Series should be optimally long and combined have a “Tall Peak” at least every 100 m/z, | |
141 (3) Abundance score: the higher, the better, | |
142 (4) Peak score: the closer to 0, the better, | |
143 (5) Peak Distance: the closer to 1, the better, | |
144 (6) Series Score: the closer to this value, the better. | |
145 | |
146 Combinations of 5 series are assembled, scores are computed for other metrics (in case of Peak proximity and Peak | |
147 distance, an inverted score is computed) and these are summed. Finally, either a series of the size of combination or top 10 unique series having the highest score are outputted. | |
148 | |
149 Output: | |
150 | |
151 - Dataframe of n or 10 most suitable recalibrant series. | |
152 </token> | |
153 | |
130 <token name="@RECAL_HELP@"> | 154 <token name="@RECAL_HELP@"> |
131 MFAssignR - Recal | 155 MFAssignR - Recal |
132 ============================= | 156 ============================= |
133 | 157 |
134 This tool is the sixth step of the MFAssignR workflow (RecalList -> Recal -> MFAssign) | 158 This tool is the seventh step of the MFAssignR workflow (FindRecalSeries -> Recal -> MFAssign) |
135 | 159 |
136 Recal() function recalibrates the 'Mono' and 'Iso' outputs from the IsoFiltR() function and prepares a dataframe containing chose recalibrants. Also it outputs a plot for the qualitative assessment of recalibrants. The input to the function is output from MFAssign() or MFAssignCHO(). | 160 Recal() function recalibrates the 'Mono' and 'Iso' outputs from the IsoFiltR() function and prepares a dataframe containing chose recalibrants. Also it outputs a plot for the qualitative assessment of recalibrants. The input to the function is output from MFAssign() or MFAssignCHO(). |
137 | 161 |
138 It is important for recalibrant masses to cover the entire mass range of interest, and they should be among the most abundant peaks in their region of the spectrum - by default we take first 10 recalibrant series. We recommend to sort the Recalibration Series table based on the Series Score (largest to smallest). In case that error "Gap in recalibrant coverage, try adding more recalibrant series" would occur, we recommend to provide more diverse series. | 162 It is important for recalibrant masses to cover the entire mass range of interest, and they should be among the most abundant peaks in their region of the spectrum - by default we take first 10 recalibrant series. We recommend to sort the Recalibration Series table based on the Series Score (largest to smallest). In case that error "Gap in recalibrant coverage, try adding more recalibrant series" would occur, we recommend to provide more diverse series. |
139 | 163 |