Mercurial > repos > recetox > rmassbank
diff macros.xml @ 0:0b28816c1c2c draft
planemo upload for repository https://github.com/RECETOX/galaxytools/tree/master/tools/rmassbank commit 02414aa4c20f249c2069e5e3d587e3a8cda923a8
author | recetox |
---|---|
date | Thu, 18 May 2023 13:01:04 +0000 |
parents | |
children |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/macros.xml Thu May 18 13:01:04 2023 +0000 @@ -0,0 +1,83 @@ +<macros> + <token name="@TOOL_VERSION@">3.0.0</token> + + <xml name="creator"> + <creator> + <person + givenName="KarolĂna" + familyName="Trachtová" + url="https://github.com/trachtok" /> + <organization + url="https://www.recetox.muni.cz/" + email="GalaxyToolsDevelopmentandDeployment@space.muni.cz" + name="RECETOX MUNI" /> + </creator> + </xml> + + <token name="@HELP@"> + <![CDATA[ + RMassBank workflow - part I and II + This is implementation of RMassBank workflow (Bioconductor R package) based on publication by Stravs et al. 2013. + + Original RMassBank package is used to process tandem MS files and build MassBank records. Functions include + automated extraction of tandem MS spectra, formula assignment to tandem MS fragments, recalibration of tandem + MS spectra with assigned fragments, spectrum cleanup, automated retrieval of compound information from Internet + databases, and export to MassBank records. + + This script is a full pipeline of RmassBank that loads settings file, compounds list, infolist and input .mzML + files (one or multiple) and goes through all 8 steps of data-preprocessing referenced in RMassBank's + Bioconductor manual (Mass spectrometry workflow, function msmsWorkflow()) and all steps of MassBank record + generation (MassBank record workflow, function mbWorkflow()). To make it easier for user, based on template + of settings required by RMassBank package the corresponding settings file is created as well + (see section Settings file). + + User's inputs: + - *compounds list* **(must have suffix .csv)** - separator must be ',', required columns: ID, Name_deu, Name_Eng, Name, SMILES, RT, CAS + - *mode* - either **pH** (positive H) specifies [M+H]+ OR **pNa** specifies [M+Na]+ OR **pM** specifies [M]+ OR **mFA** specify [M-H]- and [M+FA]- + - *input mzML files* - files for RMassBank package must have specific naming, each filename should be in format: xxxxxxxx_1234_xxx.mzXML where the xxx denotes anything and 1234 is compound ID (must be stated in ID column of compound csv file). + - *infolist* **(must have suffix .csv)** - separator must be ',', columns (required columns are **bold**): '' (column name empty, values are numbers from 1 to n=number of chemicals), OK, id, dbcas, dbname_d, dbname_e, dataused, COMMENT.CONFIDENCE, COMMENT.EAWAG_UCHEM_ID, **CH$NAME1**, CH$NAME2, CH$NAME3, CH$COMPOUND_CLASS, **CH$FORMULA**, **CH$EXACT_MASS**, **CH$SMILES**, **CH$IUPAC**, CH$LINK.CAS, CH$LINK.CHEBI, CH$LINK.HMDB, CH$LINK.KEGG, CH$LINK.LIPIDMAPS, CH$LINK.PUBCHEM, **CH$LINK.INCHIKEY**, CH$LINK.CHEMSPIDER + - *settings parameters* - either filled in Galaxy or the whole settings file can be uploaded and used to run the RMassBank workflow + + Galaxy outputs: + - CSV file with failed peaks (empty if there were no failed peaks) + - PDF plot showing performance of the recalibration + - Full R environment (.RData) that should be used as an input for RMassBank workflow - part II + - R environment with re-analyzed spectra (_RA.RData, see RMassBank manual for more information) + - R environment with refiltered spectra (_RF.RData, see RMassBank manual for more information) + - *zipped folder* with records ready to be uploaded to MassBank database + - *settings file* used to run the RMassBank + + For more information about the data-preprocessing and record generation of the RMassBank pipeline, please refer + to `Bioconductor manual <https://www.bioconductor.org/packages/release/bioc/html/RMassBank.html>`_ for RMassBank. + + Settings file + In the original RMassBank R package, user has to specify multiple parameters via the settings file. To make + life easier for users of this tool, most of the parameters can be specified in the Galaxy intrface for this + tool and are automatically filled in the settings template that can is then used for RMassBank workflow. + If the user wants to supply his/her own settings files, that is also possible. Path to OpenBabel will be + automatically filled no matter what is the original specification in the supplied settings file. + + For all parameters, please refer to the manual for RMassBank at `Bioconductor website + <https://www.bioconductor.org/packages/release/bioc/html/RMassBank.html>`_ . + + Parameters that can only be changed manually are: + - *deprofile:* leave empty if input data are already in centroid mode (usuall case), otherwise use one of the values from deprofile.spline, deprofile.fwhm or deprofile.localMax to convert the input $ + - *babeldir:* do not change this if the settings file is to be further used in Galaxy + - *annotations:* several fields containg information about author, used instrument etc. + - *spectraList:* list of data-dependent scans in their order (relative to the parent scan), for annotation of the MassBank records + - *electronicNoise:* list of known electronic noise peaks + - *recalibrator:* custom recalibration function + - *titleFormat:* defines the title format + + **All parameters are more thoroughly described in the created settings file, which can be edited in any simple + text editor.** + + **NOTE:** In case of manual editing of the settings file, do not indent with TAB characters! Use only spaces. + + Authors of RMassBank: Michael Stravs, Emma Schymanski, Steffen Neumann, Erik Mueller, with contributions + from Tobias Schulze. + + RMassBank maintainer: at Eawag (massbank@eawag.ch) + ]]> + </token> +</macros>