annotate README.md @ 3:1de657a3198d draft default tip

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit b55cfca7f53d2eaa3f4c7c4ce09436371b088000
author iuc
date Sat, 05 Oct 2024 16:31:40 +0000
parents 5efbd15675ca
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
1 # IMPORTANT INFORMATION FOR SYSTEM ADMINISTRATORS
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
2
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
3 The tool GECKO is disk-intensive. This means that the algorithm will write a lot to the hard disk drive, which, even though buffered, can affect the performance of other processes that are sharing the same filesystem by starving their access to the disk.
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
4
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
5 ## Recommended use in concurrent systems
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
6
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
7 If GECKO is being used concurrently, it is recommended that the input file size is limited in order to avoid overall deterioration of the performance of the system. See details below for an estimate.
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
8
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
9 ## Details
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
10
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
11 The most disk-consuming step is the writing of "seeds" or "hits". These depend on the length and the similarity of the input sequences.
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
12
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
13 - A regular bacterial comparison (1 to 10 MB input sequences) can write around 1 to 5 GB.
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
14 - A medium-sized comparison (around 20 or 30 MB input sequences) can write around 10 GB.
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
15 - A chromosome comparison (50 to 200 MB input sequences) between human and gorilla can write around 600 GB. Other "less" similar comparison can write around 100 GB or less.
09459f6ffe08 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit f0c70444d9781900f0af1638792818543c65acfc"
iuc
parents:
diff changeset
16
2
5efbd15675ca "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/gecko commit 52b2b430b2d0ac983f426da298d966074a24f5cd"
iuc
parents: 0
diff changeset
17