Mercurial > repos > metexplore > met4j_namematcher
annotate NameMatcher.xml @ 2:eecf4a934d32 draft default tip
planemo upload for repository https://forge.inrae.fr/metexplore/met4j-galaxy commit 71335d4a636672d264cc4f443c07856907571928
| author | metexplore |
|---|---|
| date | Fri, 21 Nov 2025 10:21:41 +0000 |
| parents | 54a9e177e17c |
| children |
| rev | line source |
|---|---|
|
0
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
1 <?xml version="1.0" encoding="UTF-8" standalone="no"?> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
2 <tool id="met4j_NameMatcher" name="NameMatcher" version="@TOOL_VERSION@"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
3 <description>This tool runs edit-distance based fuzzy matching to perform near-similar name matching between a metabolic model and a list of chemical names in a dataset. A harmonization processing is performed on chemical names with substitutions of common patterns among synonyms, in order to create aliases on which classical fuzzy matching can be run efficiently.</description> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
4 <macros> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
5 <import>macros.xml</import> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
6 </macros> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
7 <expand macro="bio_tools"/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
8 <expand macro="requirements"/> |
|
1
54a9e177e17c
planemo upload for repository https://forge.inrae.fr/metexplore/met4j-galaxy commit 6bb33ff0ff022029543e5e5ace7d16418715aa1c
metexplore
parents:
0
diff
changeset
|
9 <command detect_errors="exit_code"><![CDATA[met4j mapping.NameMatcher#if str($n): |
|
0
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
10 -nMatch "$n" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
11 #end if |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
12 #if str($nSkip): |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
13 -skip "$nSkip" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
14 #end if |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
15 #if str($col): |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
16 -col "$col" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
17 #end if |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
18 -i "$sbml" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
19 -compound "$input" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
20 #if str($comment): |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
21 -c "$comment" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
22 #end if |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
23 #if str($sep): |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
24 -sep "$sep" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
25 #end if |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
26 -o "$outputFile" |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
27 ]]></command> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
28 <inputs> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
29 <param argument="-nMatch" label="[1] Number of matchs to return per name" name="n" optional="true" type="text" value="1"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
30 <sanitizer invalid_char="_"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
31 <valid initial="string.printable"/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
32 </sanitizer> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
33 </param> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
34 <param argument="-skip" label="[0] Number of lines to skip at the beginning of the compound file" name="nSkip" optional="true" type="text" value="0"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
35 <sanitizer invalid_char="_"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
36 <valid initial="string.printable"/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
37 </sanitizer> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
38 </param> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
39 <param argument="-col" label="[1] column containing compounds' names" name="col" optional="true" type="text" value="1"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
40 <sanitizer invalid_char="_"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
41 <valid initial="string.printable"/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
42 </sanitizer> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
43 </param> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
44 <param argument="-i" format="sbml" label="Original sbml file" name="sbml" optional="false" type="data" value=""/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
45 <param argument="-compound" format="tsv" label="Compound file containing one column with compound names to search among the SBML entries" name="input" optional="false" type="data" value=""/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
46 <param argument="-c" label="[#] Comment String in the compound file. The lines beginning by this string won't be read" name="comment" optional="true" type="text" value="#"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
47 <sanitizer invalid_char="_"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
48 <valid initial="string.printable"/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
49 </sanitizer> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
50 </param> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
51 <param argument="-sep" label="[\t] separator in the compound file to split the colmumns." name="sep" optional="true" type="text" value="	"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
52 <sanitizer invalid_char="_"> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
53 <valid initial="string.printable"/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
54 </sanitizer> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
55 </param> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
56 </inputs> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
57 <outputs> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
58 <data format="tsv" name="outputFile"/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
59 </outputs> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
60 <tests/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
61 <help><![CDATA[Metabolic models and Metabolomics Data often refer compounds only by using their common names, which vary greatly according to the source, thus impeding interoperability between models, databases and experimental data. This requires a tedious step of manual mapping. Fuzzy matching is a range of methods which can potentially helps fasten this process, by allowing the search for near-similar names. Fuzzy matching is primarily designed for common language search engines and is frequently based on edit distance, i.e. the number of edits to transform a character string into another, effectively managing typo, case and special character variations, and allowing auto-completion. However, edit-distance based search fall short when mapping chemical names: As an example, alpha-D-Glucose et Glucose would require more edits than between Fructose and Glucose. |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
62 This tool runs edit-distance based fuzzy matching to perform near-similar name matching between a metabolic model and a list of chemical names in a dataset. A harmonization processing is performed on chemical names with substitutions of common patterns among synonyms, in order to create aliases on which classical fuzzy matching can be run efficiently. |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
63 @ATTRIBUTION@]]></help> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
64 <citations/> |
|
41bfd3a32e12
planemo upload for repository https://forgemia.inra.fr/metexplore/met4j-galaxy commit e28ca123295d50b85ba872e5a4720fd72697ecc3
metexplore
parents:
diff
changeset
|
65 </tool> |
