annotate split.py @ 3:e0734e88a104 draft default tip

"planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
author iuc
date Thu, 27 Jan 2022 15:19:23 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
1 import sys
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
2
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
3
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
4 fai = sys.argv[1]
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
5 chunk_size = int(sys.argv[2])
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
6 overlap = int(sys.argv[3]) # Base pairs
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
7 with open(fai, 'r') as infile:
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
8 for line in infile:
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
9 name = line.split('\t')[0]
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
10 stop = int(line.split('\t')[1])
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
11 start = 1
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
12 while start < stop:
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
13 start = max(1, start - overlap)
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
14 print('\t'.join([name, str(start),
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
15 str(min(start + chunk_size, stop))]))
e0734e88a104 "planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/vardict commit 80e3742f499e8efabad1fe7627201466d0bdd190"
iuc
parents:
diff changeset
16 start += chunk_size