0
|
1 <tool id="ctb_remDuplicates" name="Remove duplicated molecules" version="1.0">
|
|
2 <description></description>
|
|
3 <parallelism method="multi" split_inputs="infile" split_mode="to_size" split_size="10000" shared_inputs="" merge_outputs="outfile"></parallelism>
|
|
4 <requirements>
|
|
5 <requirement type="package" version="2.3.2">openbabel</requirement>
|
|
6 </requirements>
|
|
7 <command>
|
2
|
8 <![CDATA[
|
0
|
9 obabel
|
|
10 -i"${infile.ext}"
|
|
11 "${infile}"
|
|
12 -ocopy
|
|
13 --unique "${descriptor}"
|
|
14 -O "${outfile}"
|
|
15 -e
|
2
|
16 2>&1
|
|
17 ]]>
|
0
|
18 </command>
|
|
19 <inputs>
|
|
20 <param name="infile" type="data" format="sdf,mol,mol2,cml,inchi,smi" label="Molecular input file"/>
|
|
21 <param name="descriptor" type="select" label="Select descriptor for molecule comparison">
|
|
22 <option value="inchi" checked="True">InChI (default)</option>
|
|
23 <option value="cansmi">Canonical SMILES</option>
|
|
24 <option value="cansmiNS">Canonical SMILES (without stereo)</option>
|
|
25 </param>
|
|
26 </inputs>
|
|
27 <outputs>
|
|
28 <data format_source="infile" name="outfile"/>
|
|
29 </outputs>
|
|
30 <tests>
|
|
31 <test>
|
|
32 <param name="infile" ftype="smi" value="2_mol.smi"/>
|
|
33 <param name="descriptor" value="inchi" />
|
|
34 <output name="outfile" ftype="smi" file="ob_remDuplicates_on_2_mol.smi" />
|
|
35 </test>
|
|
36 </tests>
|
|
37 <help>
|
2
|
38 <![CDATA[
|
0
|
39
|
|
40 .. class:: infomark
|
|
41
|
|
42 **What this tool does**
|
|
43
|
|
44 Filters a library of compounds and removes duplicated molecules.
|
|
45
|
|
46 -----
|
|
47
|
|
48 .. class:: warningmark
|
|
49
|
|
50 **Hint**
|
|
51
|
|
52 Comparison based on Canonical SMILES without stereochemistry may be useful in cases where this information is not crucial for library preparation. Several VS tools will automatically generate stereoisomeric forms.
|
|
53
|
|
54 -----
|
|
55
|
|
56 .. class:: infomark
|
|
57
|
|
58 **Input**
|
|
59
|
|
60 | - `InChI`_
|
|
61 | - `SMILES Format`_
|
|
62
|
|
63 .. _InChI: http://www.iupac.org/home/publications/e-resources/inchi.html
|
|
64 .. _SMILES Format: http://en.wikipedia.org/wiki/Simplified_molecular_input_line_entry_specification
|
|
65
|
|
66 -----
|
|
67
|
|
68 .. class:: infomark
|
|
69
|
2
|
70 **Output**
|
0
|
71
|
|
72 Same as input format.
|
|
73
|
|
74 -----
|
|
75
|
|
76 .. class:: infomark
|
|
77
|
|
78 **Cite**
|
|
79
|
|
80 N M O'Boyle, M Banck, C A James, C Morley, T Vandermeersch, and G R Hutchison - `Open Babel: An open chemical toolbox.`_
|
|
81
|
|
82 .. _`Open Babel: An open chemical toolbox.`: http://www.jcheminf.com/content/3/1/33
|
|
83
|
|
84 `Open Babel`_
|
|
85
|
|
86 .. _`Open Babel`: http://openbabel.org/wiki/Main_Page
|
|
87
|
2
|
88 ]]>
|
0
|
89 </help>
|
|
90 </tool>
|