Mercurial > repos > bgruening > jbrowse2
changeset 0:53c2be00bb6f draft default tip
planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/jbrowse2 commit 0a86c88a95b0d1cc49d84544136de6556b95320f
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/all_fasta.loc.sample Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,18 @@ +#This file lists the locations and dbkeys of all the fasta files +#under the "genome" directory (a directory that contains a directory +#for each build). The script extract_fasta.py will generate the file +#all_fasta.loc. This file has the format (white space characters are +#TAB characters): +# +#<unique_build_id> <dbkey> <display_name> <file_path> +# +#So, all_fasta.loc could look something like this: +# +#apiMel3 apiMel3 Honeybee (Apis mellifera): apiMel3 /path/to/genome/apiMel3/apiMel3.fa +#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /path/to/genome/hg19/hg19canon.fa +#hg19full hg19 Human (Homo sapiens): hg19 Full /path/to/genome/hg19/hg19full.fa +# +#Your all_fasta.loc file should contain an entry for each individual +#fasta file. So there will be multiple fasta files for each build, +#such as with hg19 above. +#
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/apache2_license.txt Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,178 @@ + + Apache License + Version 2.0, January 2004 + http://www.apache.org/licenses/ + + TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION + + 1. Definitions. + + "License" shall mean the terms and conditions for use, reproduction, + and distribution as defined by Sections 1 through 9 of this document. + + "Licensor" shall mean the copyright owner or entity authorized by + the copyright owner that is granting the License. + + "Legal Entity" shall mean the union of the acting entity and all + other entities that control, are controlled by, or are under common + control with that entity. For the purposes of this definition, + "control" means (i) the power, direct or indirect, to cause the + direction or management of such entity, whether by contract or + otherwise, or (ii) ownership of fifty percent (50%) or more of the + outstanding shares, or (iii) beneficial ownership of such entity. + + "You" (or "Your") shall mean an individual or Legal Entity + exercising permissions granted by this License. + + "Source" form shall mean the preferred form for making modifications, + including but not limited to software source code, documentation + source, and configuration files. + + "Object" form shall mean any form resulting from mechanical + transformation or translation of a Source form, including but + not limited to compiled object code, generated documentation, + and conversions to other media types. + + "Work" shall mean the work of authorship, whether in Source or + Object form, made available under the License, as indicated by a + copyright notice that is included in or attached to the work + (an example is provided in the Appendix below). + + "Derivative Works" shall mean any work, whether in Source or Object + form, that is based on (or derived from) the Work and for which the + editorial revisions, annotations, elaborations, or other modifications + represent, as a whole, an original work of authorship. For the purposes + of this License, Derivative Works shall not include works that remain + separable from, or merely link (or bind by name) to the interfaces of, + the Work and Derivative Works thereof. + + "Contribution" shall mean any work of authorship, including + the original version of the Work and any modifications or additions + to that Work or Derivative Works thereof, that is intentionally + submitted to Licensor for inclusion in the Work by the copyright owner + or by an individual or Legal Entity authorized to submit on behalf of + the copyright owner. For the purposes of this definition, "submitted" + means any form of electronic, verbal, or written communication sent + to the Licensor or its representatives, including but not limited to + communication on electronic mailing lists, source code control systems, + and issue tracking systems that are managed by, or on behalf of, the + Licensor for the purpose of discussing and improving the Work, but + excluding communication that is conspicuously marked or otherwise + designated in writing by the copyright owner as "Not a Contribution." + + "Contributor" shall mean Licensor and any individual or Legal Entity + on behalf of whom a Contribution has been received by Licensor and + subsequently incorporated within the Work. + + 2. Grant of Copyright License. Subject to the terms and conditions of + this License, each Contributor hereby grants to You a perpetual, + worldwide, non-exclusive, no-charge, royalty-free, irrevocable + copyright license to reproduce, prepare Derivative Works of, + publicly display, publicly perform, sublicense, and distribute the + Work and such Derivative Works in Source or Object form. + + 3. Grant of Patent License. Subject to the terms and conditions of + this License, each Contributor hereby grants to You a perpetual, + worldwide, non-exclusive, no-charge, royalty-free, irrevocable + (except as stated in this section) patent license to make, have made, + use, offer to sell, sell, import, and otherwise transfer the Work, + where such license applies only to those patent claims licensable + by such Contributor that are necessarily infringed by their + Contribution(s) alone or by combination of their Contribution(s) + with the Work to which such Contribution(s) was submitted. If You + institute patent litigation against any entity (including a + cross-claim or counterclaim in a lawsuit) alleging that the Work + or a Contribution incorporated within the Work constitutes direct + or contributory patent infringement, then any patent licenses + granted to You under this License for that Work shall terminate + as of the date such litigation is filed. + + 4. Redistribution. You may reproduce and distribute copies of the + Work or Derivative Works thereof in any medium, with or without + modifications, and in Source or Object form, provided that You + meet the following conditions: + + (a) You must give any other recipients of the Work or + Derivative Works a copy of this License; and + + (b) You must cause any modified files to carry prominent notices + stating that You changed the files; and + + (c) You must retain, in the Source form of any Derivative Works + that You distribute, all copyright, patent, trademark, and + attribution notices from the Source form of the Work, + excluding those notices that do not pertain to any part of + the Derivative Works; and + + (d) If the Work includes a "NOTICE" text file as part of its + distribution, then any Derivative Works that You distribute must + include a readable copy of the attribution notices contained + within such NOTICE file, excluding those notices that do not + pertain to any part of the Derivative Works, in at least one + of the following places: within a NOTICE text file distributed + as part of the Derivative Works; within the Source form or + documentation, if provided along with the Derivative Works; or, + within a display generated by the Derivative Works, if and + wherever such third-party notices normally appear. The contents + of the NOTICE file are for informational purposes only and + do not modify the License. You may add Your own attribution + notices within Derivative Works that You distribute, alongside + or as an addendum to the NOTICE text from the Work, provided + that such additional attribution notices cannot be construed + as modifying the License. + + You may add Your own copyright statement to Your modifications and + may provide additional or different license terms and conditions + for use, reproduction, or distribution of Your modifications, or + for any such Derivative Works as a whole, provided Your use, + reproduction, and distribution of the Work otherwise complies with + the conditions stated in this License. + + 5. Submission of Contributions. Unless You explicitly state otherwise, + any Contribution intentionally submitted for inclusion in the Work + by You to the Licensor shall be under the terms and conditions of + this License, without any additional terms or conditions. + Notwithstanding the above, nothing herein shall supersede or modify + the terms of any separate license agreement you may have executed + with Licensor regarding such Contributions. + + 6. Trademarks. This License does not grant permission to use the trade + names, trademarks, service marks, or product names of the Licensor, + except as required for reasonable and customary use in describing the + origin of the Work and reproducing the content of the NOTICE file. + + 7. Disclaimer of Warranty. Unless required by applicable law or + agreed to in writing, Licensor provides the Work (and each + Contributor provides its Contributions) on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or + implied, including, without limitation, any warranties or conditions + of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A + PARTICULAR PURPOSE. You are solely responsible for determining the + appropriateness of using or redistributing the Work and assume any + risks associated with Your exercise of permissions under this License. + + 8. Limitation of Liability. In no event and under no legal theory, + whether in tort (including negligence), contract, or otherwise, + unless required by applicable law (such as deliberate and grossly + negligent acts) or agreed to in writing, shall any Contributor be + liable to You for damages, including any direct, indirect, special, + incidental, or consequential damages of any character arising as a + result of this License or out of the use or inability to use the + Work (including but not limited to damages for loss of goodwill, + work stoppage, computer failure or malfunction, or any and all + other commercial damages or losses), even if such Contributor + has been advised of the possibility of such damages. + + 9. Accepting Warranty or Additional Liability. While redistributing + the Work or Derivative Works thereof, You may choose to offer, + and charge a fee for, acceptance of support, warranty, indemnity, + or other liability obligations and/or rights consistent with this + License. However, in accepting such obligations, You may act only + on Your own behalf and on Your sole responsibility, not on behalf + of any other Contributor, and only if You agree to indemnify, + defend, and hold each Contributor harmless for any liability + incurred by, or claims asserted against, such Contributor by reason + of your accepting any such warranty or additional liability. + + END OF TERMS AND CONDITIONS +
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/autogenJB2.py Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,228 @@ +import argparse +import logging +import os +import sys + +from jbrowse2 import JbrowseConnector as jbC + + +logging.basicConfig(level=logging.DEBUG) +log = logging.getLogger("jbrowse") + +if __name__ == "__main__": + parser = argparse.ArgumentParser(description="", epilog="") + parser.add_argument("--sessName", help="Session name", default="AutoJBrowse") + parser.add_argument( + "--trackmeta", + help="Repeatable 'filename,filext,filepath,[bai/crai path for filesystem bam/cram]' for JBrowse2 tracks", + default=[], + action="append", + ) + parser.add_argument( + "--referencemeta", + help="Repeatable 'filename, filext, filepath, ... ,' for JBrowse2 reference tracks - usually only one needed", + default=[], + action="append", + ) + parser.add_argument( + "--pafmeta", + help="Repeatable. Each is a 'pafname, filext, filepath, ... ,' for a JBrowse2 paf track", + default=[], + action="append", + ) + parser.add_argument( + "--pafreferencemeta", + help="Repeatable. Each is a 'pafname,refpath,refname' Every pafname must have one or more", + default=[], + action="append", + ) + parser.add_argument( + "--jbrowse2path", help="Path to JBrowse2 directory in biocontainer or Conda" + ) + parser.add_argument("--outdir", help="Output directory", required=True) + parser.add_argument("--version", "-V", action="version", version="%(prog)s 2.10.2") + args = parser.parse_args() + sessName = args.sessName + default_session_data = {} + # --trackmeta $jbrowseme[$key],$jbrowseme[$key].ext,'$key' + trackList = [x.strip().split(",") for x in args.trackmeta if x > ""] + refList = [x.strip().split(",") for x in args.referencemeta if x > ""] + if len(refList) > 0: + listgenomes = [f for f in refList if f[1] in ["fasta", "fasta.gz"]] + # assume no pafs here + if len(listgenomes) > 0: + genome_paths = [x[0] for x in listgenomes] + genome_names = [x[2] for x in listgenomes] + guseuri = [] + for x in genome_paths: + if x.startswith("http://") or x.startswith("https://"): + guseuri.append("yes") + else: + guseuri.append("no") + jc = jbC( + outdir=args.outdir, + jbrowse2path=args.jbrowse2path, + ) + genomes = ( + [ + { + "path": x, + "label": genome_names[i], + "useuri": guseuri[i], + "meta": { + "name": genome_names[i], + "dataset_dname": genome_names[i], + }, + } + for i, x in enumerate(genome_paths) + ], + ) + assref_name = jc.process_genomes(genomes[0]) + if not default_session_data.get(assref_name, None): + default_session_data[assref_name] = { + "tracks": [], + "style": {}, + "style_labels": {}, + "visibility": { + "default_on": [], + "default_off": [], + }, + } + listtracks = trackList + # foo.paf must have a foo_paf.fasta or fasta.gz to match + tnames = [x[2] for x in listtracks] + texts = [x[1] for x in listtracks] + for i, track in enumerate(listtracks): + track_conf = { + "trackfiles": [], + "category": "autogenerated", + "assemblyNames": assref_name, + } + tpath, trext, trackname = track[:3] + track_conf["dataset_id"] = trackname + useuri = "no" + if tpath.startswith("http://") or tpath.startswith("https://"): + useuri = "yes" + if trext == "paf": + refname = trackname + "_paf.fasta" + refdat = [x[2] for x in listtracks if x[2] == refname] + if not refdat: + jc.logging.warn( + "No reference file %s corresponding to paf file %s found. Not building - there must be a corresponding fasta for each paf" + % (refname, trackname) + ) + sys.exit(3) + else: + track_conf.update( + { + "conf": { + "options": { + "paf": { + "genome": refdat, + "genome_label": trackname, + } + } + } + } + ) + elif trext == "bam": + ipath = track[3] + if not os.path.exists(ipath): + ipath = os.path.realpath( + os.path.join(jc.outdir, trackname + ".bai") + ) + cmd = [ + "samtools", + "index", + "-b", + "-o", + ipath, + os.path.realpath(track[0]), + ] + sys.stdout.write("#### calling %s" % " ".join(cmd)) + jc.subprocess_check_call(cmd) + track_conf.update( + {"conf": {"options": {"bam": {"bam_index": " %s ~ %s," % (tpath, ipath)}}}} + ) + elif trext == "cram": + ipath = track[3] + if not os.path.exists(ipath): + ipath = os.path.realpath( + os.path.join("./", trackname + ".crai") + ) + cmd = [ + "samtools", + "index", + "-c", + "-o", + ipath, + os.path.realpath(track[0]), + ] + jc.subprocess_check_call(cmd) + track_conf.update( + {"conf": {"options": {"cram": {"cram_index": "%s ~ %s," % (tpath, ipath)}}}} + ) + track_conf["path"] = tpath + track_conf["format"] = trext + track_conf["name"] = trackname + track_conf["label"] = trackname + track_conf["trackfiles"].append((tpath, trext, useuri, trackname, {})) + keys = jc.process_annotations(track_conf) + + if keys: + for key in keys: + if trext in [ + "bigwig", + "gff", + "gff3", + "vcf", + "maf", + "bed", + "hic" + ]: + default_session_data[assref_name]["visibility"]["default_on"].append(key) + else: + default_session_data[assref_name]["visibility"]["default_off"].append( + key + ) + if trext in ["gff", "gff3", "bed", "vcf", "maf", "blastxml"]: + ttype = "LinearBasicDisplay" + if trext == "vcf": + ttype = "LinearVariantDisplay" + style_json = { + "type": ttype, + "trackShowLabels": False, + "trackShowDescriptions": False, + } + default_session_data[assref_name]["style"][key] = style_json + default_session_data[assref_name]["tracks"].append(key) + # general_data = { + # "analytics": root.find("metadata/general/analytics").text, + # "primary_color": root.find("metadata/general/primary_color").text, + # "secondary_color": root.find("metadata/general/secondary_color").text, + # "tertiary_color": root.find("metadata/general/tertiary_color").text, + # "quaternary_color": root.find("metadata/general/quaternary_color").text, + # "font_size": root.find("metadata/general/font_size").text, + # } + jc.add_general_configuration({}) + trackconf = jc.config_json.get("tracks", []) + for gnome in jc.genome_names: + trackconf += jc.tracksToAdd[gnome] + logging.debug( + "++++ adding trackconf=%s for gnome %s" % (trackconf, gnome) + ) + jc.config_json["tracks"] = trackconf + assconf = jc.config_json.get("assemblies", []) + assconf += jc.assemblies + jc.config_json["assemblies"] = assconf + logging.debug("+++assemblies=%s, gnames=%s" % (assconf, jc.genome_names)) + jc.write_config() + default_session_data.update({"session_name": sessName}) + track_conf.update(default_session_data) + jc.add_default_session(default_session_data) + # jc.add_defsess_to_index(default_session_data) + # jc.text_index() not sure what broke here. + else: + sys.stderr.write( + "Collection has no suitable trackfiles for autogenJB2 - nothing to process" + )
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/blastxml_to_gapped_gff3.py Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,302 @@ +#!/usr/bin/env python +import argparse +import copy +import logging +import re +import sys + +from BCBio import GFF + +logging.basicConfig(level=logging.INFO) +log = logging.getLogger(name="blastxml2gff3") + +__doc__ = """ +BlastXML files, when transformed to GFF3, do not normally show gaps in the +blast hits. This tool aims to fill that "gap". +""" + + +def blastxml2gff3(blastxml, min_gap=3, trim=False, trim_end=False, include_seq=False): + from Bio.Blast import NCBIXML + from Bio.Seq import Seq + from Bio.SeqRecord import SeqRecord + from Bio.SeqFeature import SeqFeature, SimpleLocation + + blast_records = NCBIXML.parse(blastxml) + for idx_record, record in enumerate(blast_records): + # http://www.sequenceontology.org/browser/release_2.4/term/SO:0000343 + match_type = { # Currently we can only handle BLASTN, BLASTP + "BLASTN": "nucleotide_match", + "BLASTP": "protein_match", + }.get(record.application, "match") + + recid = record.query + if " " in recid: + recid = recid[0: recid.index(" ")] + + rec = SeqRecord(Seq("ACTG"), id=recid) + for idx_hit, hit in enumerate(record.alignments): + for idx_hsp, hsp in enumerate(hit.hsps): + qualifiers = { + "ID": "b2g.%s.%s.%s" % (idx_record, idx_hit, idx_hsp), + "source": "blast", + "score": hsp.expect, + "accession": hit.accession, + "hit_id": hit.hit_id, + "length": hit.length, + "hit_titles": hit.title.split(" >"), + } + if include_seq: + qualifiers.update( + { + "blast_qseq": hsp.query, + "blast_sseq": hsp.sbjct, + "blast_mseq": hsp.match, + } + ) + + for prop in ( + "score", + "bits", + "identities", + "positives", + "gaps", + "align_length", + "strand", + "frame", + "query_start", + "query_end", + "sbjct_start", + "sbjct_end", + ): + qualifiers["blast_" + prop] = getattr(hsp, prop, None) + + desc = hit.title.split(" >")[0] + qualifiers["description"] = desc[desc.index(" "):] + + # This required a fair bit of sketching out/match to figure out + # the first time. + # + # the match_start location must account for queries and + # subjecst that start at locations other than 1 + parent_match_start = hsp.query_start - hsp.sbjct_start + # The end is the start + hit.length because the match itself + # may be longer than the parent feature, so we use the supplied + # subject/hit length to calculate the real ending of the target + # protein. + parent_match_end = hsp.query_start + hit.length + hsp.query.count("-") + + # If we trim the left end, we need to trim without losing information. + used_parent_match_start = parent_match_start + if trim: + if parent_match_start < 1: + used_parent_match_start = 0 + + if trim or trim_end: + if parent_match_end > hsp.query_end: + parent_match_end = hsp.query_end + 1 + + # The ``match`` feature will hold one or more ``match_part``s + top_feature = SeqFeature( + SimpleLocation(used_parent_match_start, parent_match_end, strand=0), + type=match_type, + qualifiers=qualifiers, + ) + + # Unlike the parent feature, ``match_part``s have sources. + part_qualifiers = { + "source": "blast", + } + top_feature.sub_features = [] + for idx_part, (start, end, cigar) in enumerate( + generate_parts( + hsp.query, hsp.match, hsp.sbjct, ignore_under=min_gap + ) + ): + part_qualifiers["Gap"] = cigar + part_qualifiers["ID"] = qualifiers["ID"] + (".%s" % idx_part) + + # Otherwise, we have to account for the subject start's location + match_part_start = parent_match_start + hsp.sbjct_start + start - 1 + + # We used to use hsp.align_length here, but that includes + # gaps in the parent sequence + # + # Furthermore align_length will give calculation errors in weird places + # So we just use (end-start) for simplicity + match_part_end = match_part_start + (end - start) + + top_feature.sub_features.append( + SeqFeature( + SimpleLocation(match_part_start, match_part_end, strand=1), + type="match_part", + qualifiers=copy.deepcopy(part_qualifiers), + ) + ) + + rec.features.append(top_feature) + rec.annotations = {} + yield rec + + +def __remove_query_gaps(query, match, subject): + """remove positions in all three based on gaps in query + + In order to simplify math and calculations...we remove all of the gaps + based on gap locations in the query sequence:: + + Q:ACTG-ACTGACTG + S:ACTGAAC---CTG + + will become:: + + Q:ACTGACTGACTG + S:ACTGAC---CTG + + which greatly simplifies the process of identifying the correct location + for a match_part + """ + prev = 0 + fq = "" + fm = "" + fs = "" + for position in re.finditer("-", query): + fq += query[prev: position.start()] + fm += match[prev: position.start()] + fs += subject[prev: position.start()] + prev = position.start() + 1 + fq += query[prev:] + fm += match[prev:] + fs += subject[prev:] + + return (fq, fm, fs) + + +def generate_parts(query, match, subject, ignore_under=3): + region_q = [] + region_m = [] + region_s = [] + + (query, match, subject) = __remove_query_gaps(query, match, subject) + + region_start = -1 + region_end = -1 + mismatch_count = 0 + for i, (q, m, s) in enumerate(zip(query, match, subject)): + + # If we have a match + if m != " " or m == "+": + if region_start == -1: + region_start = i + # It's a new region, we need to reset or it's pre-seeded with + # spaces + region_q = [] + region_m = [] + region_s = [] + region_end = i + mismatch_count = 0 + else: + mismatch_count += 1 + + region_q.append(q) + region_m.append(m) + region_s.append(s) + + if mismatch_count >= ignore_under and region_start != -1 and region_end != -1: + region_q = region_q[0:-ignore_under] + region_m = region_m[0:-ignore_under] + region_s = region_s[0:-ignore_under] + yield region_start, region_end + 1, cigar_from_string( + region_q, region_m, region_s, strict_m=True + ) + region_q = [] + region_m = [] + region_s = [] + + region_start = -1 + region_end = -1 + mismatch_count = 0 + + yield region_start, region_end + 1, cigar_from_string( + region_q, region_m, region_s, strict_m=True + ) + + +def _qms_to_matches(query, match, subject, strict_m=True): + matchline = [] + + for (q, m, s) in zip(query, match, subject): + ret = "" + + if m != " " or m == "+": + ret = "=" + elif m == " ": + if q == "-": + ret = "D" + elif s == "-": + ret = "I" + else: + ret = "X" + else: + log.warn("Bad data: \n\t%s\n\t%s\n\t%s\n" % (query, match, subject)) + + if strict_m: + if ret == "=" or ret == "X": + ret = "M" + + matchline.append(ret) + return matchline + + +def _matchline_to_cigar(matchline): + cigar_line = [] + last_char = matchline[0] + count = 0 + for char in matchline: + if char == last_char: + count += 1 + else: + cigar_line.append("%s%s" % (last_char, count)) + count = 1 + last_char = char + cigar_line.append("%s%s" % (last_char, count)) + return " ".join(cigar_line) + + +def cigar_from_string(query, match, subject, strict_m=True): + matchline = _qms_to_matches(query, match, subject, strict_m=strict_m) + if len(matchline) > 0: + return _matchline_to_cigar(matchline) + else: + return "" + + +if __name__ == "__main__": + parser = argparse.ArgumentParser( + description="Convert Blast XML to gapped GFF3", epilog="" + ) + parser.add_argument( + "blastxml", type=argparse.FileType("r"), help="Blast XML Output" + ) + parser.add_argument( + "--min_gap", + type=int, + help="Maximum gap size before generating a new match_part", + default=3, + ) + parser.add_argument( + "--trim", + action="store_true", + help="Trim blast hits to be only as long as the parent feature", + ) + parser.add_argument( + "--trim_end", + action="store_true", + help="Cut blast results off at end of gene", + ) + parser.add_argument("--include_seq", action="store_true", help="Include sequence") + args = parser.parse_args() + + for rec in blastxml2gff3(**vars(args)): + if len(rec.features): + GFF.write([rec], sys.stdout)
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/convertMAF.sh Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,8 @@ +#!/usr/bin/env bash +# https://github.com/cmdcolin/jbrowse-plugin-mafviewer/blob/master/bin/convert.sh +# maf2bed modified to work right as a python script by ross lazarus in desperation +# MAF file must contain the species name and chromosome name +# e.g. hg38.chr1 in the sequence identifiers with hg38 passed in as $2 +python $3/maf2bed.py $2 < $1 | sort -k1,1 -k2,2n > $4.sorted.bed +bgzip -c $4.sorted.bed > $4.sorted.bed.gz +tabix -p bed $4.sorted.bed.gz
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/gff3_rebase.py Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,194 @@ +#!/usr/bin/env python +import argparse +import copy +import logging +import sys + +from BCBio import GFF +from Bio.SeqFeature import FeatureLocation + +logging.basicConfig(level=logging.INFO) +log = logging.getLogger(__name__) + +__author__ = "Eric Rasche" +__version__ = "0.4.0" +__maintainer__ = "Eric Rasche" +__email__ = "esr@tamu.edu" + + +def feature_lambda(feature_list, test, test_kwargs, subfeatures=True): + """Recursively search through features, testing each with a test function, yielding matches. + + GFF3 is a hierachical data structure, so we need to be able to recursively + search through features. E.g. if you're looking for a feature with + ID='bob.42', you can't just do a simple list comprehension with a test + case. You don't know how deeply burried bob.42 will be in the feature tree. This is where feature_lambda steps in. + + :type feature_list: list + :param feature_list: an iterable of features + + :type test: function reference + :param test: a closure with the method signature (feature, **kwargs) where + the kwargs are those passed in the next argument. This + function should return True or False, True if the feature is + to be yielded as part of the main feature_lambda function, or + False if it is to be ignored. This function CAN mutate the + features passed to it (think "apply"). + + :type test_kwargs: dictionary + :param test_kwargs: kwargs to pass to your closure when it is called. + + :type subfeatures: boolean + :param subfeatures: when a feature is matched, should just that feature be + yielded to the caller, or should the entire sub_feature + tree for that feature be included? subfeatures=True is + useful in cases such as searching for a gene feature, + and wanting to know what RBS/Shine_Dalgarno_sequences + are in the sub_feature tree (which can be accomplished + with two feature_lambda calls). subfeatures=False is + useful in cases when you want to process (and possibly + return) the entire feature tree, such as applying a + qualifier to every single feature. + + :rtype: yielded list + :return: Yields a list of matching features. + """ + # Either the top level set of [features] or the subfeature attribute + for feature in feature_list: + if test(feature, **test_kwargs): + if not subfeatures: + feature_copy = copy.deepcopy(feature) + feature_copy.sub_features = [] + yield feature_copy + else: + yield feature + + if hasattr(feature, 'sub_features'): + for x in feature_lambda(feature.sub_features, test, test_kwargs, subfeatures=subfeatures): + yield x + + +def feature_test_qual_value(feature, **kwargs): + """Test qualifier values. + + For every feature, check that at least one value in + feature.quailfiers(kwargs['qualifier']) is in kwargs['attribute_list'] + """ + for attribute_value in feature.qualifiers.get(kwargs['qualifier'], []): + if attribute_value in kwargs['attribute_list']: + return True + return False + + +def __get_features(child, interpro=False): + child_features = {} + for rec in GFF.parse(child): + # Only top level + for feature in rec.features: + # Get the record id as parent_feature_id (since this is how it will be during remapping) + parent_feature_id = rec.id + # If it's an interpro specific gff3 file + if interpro: + # Then we ignore polypeptide features as they're useless + if feature.type == 'polypeptide': + continue + # If there's an underscore, we strip up to that underscore? + # I do not know the rationale for this, removing. + # if '_' in parent_feature_id: + # parent_feature_id = parent_feature_id[parent_feature_id.index('_') + 1:] + + try: + child_features[parent_feature_id].append(feature) + except KeyError: + child_features[parent_feature_id] = [feature] + # Keep a list of feature objects keyed by parent record id + return child_features + + +def __update_feature_location(feature, parent, protein2dna): + start = feature.location.start + end = feature.location.end + if protein2dna: + start *= 3 + end *= 3 + + if parent.location.strand >= 0: + ns = parent.location.start + start + ne = parent.location.start + end + st = +1 + else: + ns = parent.location.end - end + ne = parent.location.end - start + st = -1 + + # Don't let start/stops be less than zero. It's technically valid for them + # to be (at least in the model I'm working with) but it causes numerous + # issues. + # + # Instead, we'll replace with %3 to try and keep it in the same reading + # frame that it should be in. + if ns < 0: + ns %= 3 + if ne < 0: + ne %= 3 + + feature.location = FeatureLocation(ns, ne, strand=st) + + if hasattr(feature, 'sub_features'): + for subfeature in feature.sub_features: + __update_feature_location(subfeature, parent, protein2dna) + + +def rebase(parent, child, interpro=False, protein2dna=False, map_by='ID'): + # get all of the features we will be re-mapping in a dictionary, keyed by parent feature ID + child_features = __get_features(child, interpro=interpro) + + for rec in GFF.parse(parent): + replacement_features = [] + for feature in feature_lambda( + rec.features, + # Filter features in the parent genome by those that are + # "interesting", i.e. have results in child_features array. + # Probably an unnecessary optimisation. + feature_test_qual_value, + { + 'qualifier': map_by, + 'attribute_list': child_features.keys(), + }, + subfeatures=False): + + # Features which will be re-mapped + to_remap = child_features[feature.id] + # TODO: update starts + fixed_features = [] + for x in to_remap: + # Then update the location of the actual feature + __update_feature_location(x, feature, protein2dna) + + if interpro: + for y in ('status', 'Target'): + try: + del x.qualifiers[y] + except Exception: + pass + + fixed_features.append(x) + replacement_features.extend(fixed_features) + # We do this so we don't include the original set of features that we + # were rebasing against in our result. + rec.features = replacement_features + rec.annotations = {} + GFF.write([rec], sys.stdout) + + +if __name__ == '__main__': + parser = argparse.ArgumentParser(description='rebase gff3 features against parent locations', epilog="") + parser.add_argument('parent', type=argparse.FileType('r'), help='Parent GFF3 annotations') + parser.add_argument('child', type=argparse.FileType('r'), help='Child GFF3 annotations to rebase against parent') + parser.add_argument('--interpro', action='store_true', + help='Interpro specific modifications') + parser.add_argument('--protein2dna', action='store_true', + help='Map protein translated results to original DNA data') + parser.add_argument('--map_by', help='Map by key', default='ID') + args = parser.parse_args() + rebase(**vars(args))
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/jb2_webserver.py Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,187 @@ +#!/usr/bin/env python3 +# spec: simplest python web server with range support and multithreading that takes root path, +# port and bind address as command line arguments; by default uses the current dir as webroot, +# port 8000 and bind address of 0.0.0.0 +# borrowed from https://github.com/danvk/RangeHTTPServer +# and reborrowed from https://gist.github.com/glowinthedark/b99900abe935e4ab4857314d647a9068 +# +# The Apache 2.0 license copy in this repository is distributed with this code in accordance with that licence. +# https://www.apache.org/licenses/LICENSE-2.0.txt +# This part is not MIT licenced like the other components. + +# APPENDIX: How to apply the Apache License to your work. + +# To apply the Apache License to your work, attach the following +# boilerplate notice, with the fields enclosed by brackets "[]" +# replaced with your own identifying information. (Don't include +# the brackets!) The text should be enclosed in the appropriate +# comment syntax for the file format. We also recommend that a +# file or class name and description of purpose be included on the +# same "printed page" as the copyright notice for easier +# identification within third-party archives. + +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at + +# http://www.apache.org/licenses/LICENSE-2.0 + +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. + + +import argparse +import functools +import os +import re +import socketserver +import webbrowser +from http.server import SimpleHTTPRequestHandler + + +DEFAULT_PORT = 8081 + + +def copy_byte_range(infile, outfile, start=None, stop=None, bufsize=16 * 1024): + """Like shutil.copyfileobj, but only copy a range of the streams. + + Both start and stop are inclusive. + """ + if start is not None: + infile.seek(start) + while 1: + to_read = min(bufsize, stop + 1 - infile.tell() if stop else bufsize) + buf = infile.read(to_read) + if not buf: + break + outfile.write(buf) + + +BYTE_RANGE_RE = re.compile(r"bytes=(\d+)-(\d+)?$") + + +def parse_byte_range(byte_range): + """Returns the two numbers in 'bytes=123-456' or throws ValueError. + + The last number or both numbers may be None. + """ + if byte_range.strip() == "": + return None, None + + m = BYTE_RANGE_RE.match(byte_range) + if not m: + raise ValueError("Invalid byte range %s" % byte_range) + + first, last = [x and int(x) for x in m.groups()] + if last and last < first: + raise ValueError("Invalid byte range %s" % byte_range) + return first, last + + +class RangeRequestHandler(SimpleHTTPRequestHandler): + """Adds support for HTTP 'Range' requests to SimpleHTTPRequestHandler + + The approach is to: + - Override send_head to look for 'Range' and respond appropriately. + - Override copyfile to only transmit a range when requested. + """ + + def handle(self): + try: + SimpleHTTPRequestHandler.handle(self) + except Exception: + # ignored, thrown whenever the client aborts streaming (broken pipe) + pass + + def send_head(self): + if "Range" not in self.headers: + self.range = None + return SimpleHTTPRequestHandler.send_head(self) + try: + self.range = parse_byte_range(self.headers["Range"]) + except ValueError: + self.send_error(400, "Invalid byte range") + return None + first, last = self.range + + # Mirroring SimpleHTTPServer.py here + path = self.translate_path(self.path) + f = None + ctype = self.guess_type(path) + try: + f = open(path, "rb") + except IOError: + self.send_error(404, "File not found") + return None + + fs = os.fstat(f.fileno()) + file_len = fs[6] + if first >= file_len: + self.send_error(416, "Requested Range Not Satisfiable") + return None + + self.send_response(206) + self.send_header("Content-type", ctype) + + if last is None or last >= file_len: + last = file_len - 1 + response_length = last - first + 1 + + self.send_header("Content-Range", "bytes %s-%s/%s" % (first, last, file_len)) + self.send_header("Content-Length", str(response_length)) + self.send_header("Last-Modified", self.date_time_string(fs.st_mtime)) + self.end_headers() + return f + + def end_headers(self): + self.send_header("Accept-Ranges", "bytes") + return SimpleHTTPRequestHandler.end_headers(self) + + def copyfile(self, source, outputfile): + if not self.range: + return SimpleHTTPRequestHandler.copyfile(self, source, outputfile) + + # SimpleHTTPRequestHandler uses shutil.copyfileobj, which doesn't let + # you stop the copying before the end of the file. + start, stop = self.range # set in send_head() + copy_byte_range(source, outputfile, start, stop) + + +class ThreadedTCPServer(socketserver.ThreadingMixIn, socketserver.TCPServer): + allow_reuse_address = True + + +if __name__ == "__main__": + parser = argparse.ArgumentParser( + description="Tiny Python Web Server supporting range requests, for local viewing of unzipped Galaxy JBrowse2 configurations" + ) + parser.add_argument( + "--root", + default=os.getcwd(), + help="Root path to serve files from (default: current working directory)", + ) + parser.add_argument( + "--port", + type=int, + default=DEFAULT_PORT, + help=f"Port to listen on (default: {DEFAULT_PORT})", + ) + parser.add_argument( + "--bind", + default="127.0.0.1", + help="IP address to bind to (default: 127.0.0.1 - use 0.0.0.0 to allow access on your network)", + ) + args = parser.parse_args() + + handler = functools.partial(RangeRequestHandler, directory=args.root) + + webbrowser.open(f"http://{args.bind}:{args.port}") + + with ThreadedTCPServer((args.bind, args.port), handler) as httpd: + print( + f"Serving HTTP on {args.bind} port {args.port} (http://{args.bind}:{args.port}/)" + ) + httpd.serve_forever()
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/jbrowse2.py Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,1753 @@ +#!/usr/bin/env python + +import argparse +import binascii +import datetime +# import hashlib +import json +import logging +import os +import re +import shutil +import ssl +import struct +import subprocess +import tempfile +import urllib.request +import xml.etree.ElementTree as ET +from collections import defaultdict + +logging.basicConfig(level=logging.DEBUG) +log = logging.getLogger("jbrowse") + +JB2VER = "v2.11.0" +# version pinned if cloning - but not used until now +logCommands = True +# useful for seeing what's being written but not for production setups +TODAY = datetime.datetime.now().strftime("%Y-%m-%d") +SELF_LOCATION = os.path.dirname(os.path.realpath(__file__)) +GALAXY_INFRASTRUCTURE_URL = None +mapped_chars = { + ">": "__gt__", + "<": "__lt__", + "'": "__sq__", + '"': "__dq__", + "[": "__ob__", + "]": "__cb__", + "{": "__oc__", + "}": "__cc__", + "@": "__at__", + "#": "__pd__", + "": "__cn__", +} + + +INDEX_TEMPLATE = """<!doctype html> +<html lang="en" style="height:100%"> +<head> +<meta charset="utf-8"/> +<link rel="shortcut icon" href="./favicon.ico"/> +<meta name="viewport" content="width=device-width,initial-scale=1"/> +<meta name="theme-color" content="#000000"/> +<meta name="description" content="A fast and flexible genome browser"/> +<link rel="manifest" href="./manifest.json"/> +<title>JBrowse</title> +</script> +</head> +<body style="overscroll-behavior:none; height:100%; margin: 0;"> +<iframe + id="jbframe" + title="JBrowse2" + frameborder="0" + width="100%" + height="100%" + src='index_noview.html?config=config.json__SESSION_SPEC__'> +</iframe> +</body> +</html> +""" + + +class ColorScaling(object): + + COLOR_FUNCTION_TEMPLATE = """ + function(feature, variableName, glyphObject, track) {{ + var score = {score}; + {opacity} + return 'rgba({red}, {green}, {blue}, ' + opacity + ')'; + }} + """ + + COLOR_FUNCTION_TEMPLATE_QUAL = r""" + function(feature, variableName, glyphObject, track) {{ + var search_up = function self(sf, attr){{ + if(sf.get(attr) !== undefined){{ + return sf.get(attr); + }} + if(sf.parent() === undefined) {{ + return; + }}else{{ + return self(sf.parent(), attr); + }} + }}; + + var search_down = function self(sf, attr){{ + if(sf.get(attr) !== undefined){{ + return sf.get(attr); + }} + if(sf.children() === undefined) {{ + return; + }}else{{ + var kids = sf.children(); + for(var child_idx in kids){{ + var x = self(kids[child_idx], attr); + if(x !== undefined){{ + return x; + }} + }} + return; + }} + }}; + + var color = ({user_spec_color} || search_up(feature, 'color') || search_down(feature, 'color') || {auto_gen_color}); + var score = (search_up(feature, 'score') || search_down(feature, 'score')); + {opacity} + if(score === undefined){{ opacity = 1; }} + var result = /^#?([a-f\d]{{2}})([a-f\d]{{2}})([a-f\d]{{2}})$/i.exec(color); + var red = parseInt(result[1], 16); + var green = parseInt(result[2], 16); + var blue = parseInt(result[3], 16); + if(isNaN(opacity) || opacity < 0){{ opacity = 0; }} + return 'rgba(' + red + ',' + green + ',' + blue + ',' + opacity + ')'; + }} + """ + + OPACITY_MATH = { + "linear": """ + var opacity = (score - ({min})) / (({max}) - ({min})); + """, + "logarithmic": """ + var opacity = Math.log10(score - ({min})) / Math.log10(({max}) - ({min})); + """, + "blast": """ + var opacity = 0; + if(score == 0.0) {{ + opacity = 1; + }} else {{ + opacity = (20 - Math.log10(score)) / 180; + }} + """, + } + + BREWER_COLOUR_IDX = 0 + BREWER_COLOUR_SCHEMES = [ + (166, 206, 227), + (31, 120, 180), + (178, 223, 138), + (51, 160, 44), + (251, 154, 153), + (227, 26, 28), + (253, 191, 111), + (255, 127, 0), + (202, 178, 214), + (106, 61, 154), + (255, 255, 153), + (177, 89, 40), + (228, 26, 28), + (55, 126, 184), + (77, 175, 74), + (152, 78, 163), + (255, 127, 0), + ] + + BREWER_DIVERGING_PALLETES = { + "BrBg": ("#543005", "#003c30"), + "PiYg": ("#8e0152", "#276419"), + "PRGn": ("#40004b", "#00441b"), + "PuOr": ("#7f3b08", "#2d004b"), + "RdBu": ("#67001f", "#053061"), + "RdGy": ("#67001f", "#1a1a1a"), + "RdYlBu": ("#a50026", "#313695"), + "RdYlGn": ("#a50026", "#006837"), + "Spectral": ("#9e0142", "#5e4fa2"), + } + + def __init__(self): + self.brewer_colour_idx = 0 + + def rgb_from_hex(self, hexstr): + # http://stackoverflow.com/questions/4296249/how-do-i-convert-a-hex-triplet-to-an-rgb-tuple-and-back + return struct.unpack("BBB", binascii.unhexlify(hexstr)) + + def min_max_gff(self, gff_file): + min_val = None + max_val = None + with open(gff_file, "r") as handle: + for line in handle: + try: + value = float(line.split("\t")[5]) + min_val = min(value, (min_val or value)) + max_val = max(value, (max_val or value)) + + if value < min_val: + min_val = value + + if value > max_val: + max_val = value + except Exception: + pass + return min_val, max_val + + def hex_from_rgb(self, r, g, b): + return "#%02x%02x%02x" % (r, g, b) + + def _get_colours(self): + r, g, b = self.BREWER_COLOUR_SCHEMES[ + self.brewer_colour_idx % len(self.BREWER_COLOUR_SCHEMES) + ] + self.brewer_colour_idx += 1 + return r, g, b + + def parse_menus(self, track): + trackConfig = {"menuTemplate": [{}, {}, {}, {}]} + + if "menu" in track["menus"]: + menu_list = [track["menus"]["menu"]] + if isinstance(track["menus"]["menu"], list): + menu_list = track["menus"]["menu"] + + for m in menu_list: + tpl = { + "action": m["action"], + "label": m.get("label", "{name}"), + "iconClass": m.get("iconClass", "dijitIconBookmark"), + } + if "url" in m: + tpl["url"] = m["url"] + if "content" in m: + tpl["content"] = m["content"] + if "title" in m: + tpl["title"] = m["title"] + + trackConfig["menuTemplate"].append(tpl) + + return trackConfig + + def parse_colours(self, track, trackFormat, gff3=None): + # Wiggle tracks have a bicolor pallete + trackConfig = {"style": {}} + if trackFormat == "wiggle": + + trackConfig["style"]["pos_color"] = track["wiggle"]["color_pos"] + trackConfig["style"]["neg_color"] = track["wiggle"]["color_neg"] + + if trackConfig["style"]["pos_color"] == "__auto__": + trackConfig["style"]["neg_color"] = self.hex_from_rgb( + *self._get_colours() + ) + trackConfig["style"]["pos_color"] = self.hex_from_rgb( + *self._get_colours() + ) + + # Wiggle tracks can change colour at a specified place + bc_pivot = track["wiggle"]["bicolor_pivot"] + if bc_pivot not in ("mean", "zero"): + # The values are either one of those two strings + # or a number + bc_pivot = float(bc_pivot) + trackConfig["bicolor_pivot"] = bc_pivot + elif "scaling" in track: + if track["scaling"]["method"] == "ignore": + if track["scaling"]["scheme"]["color"] != "__auto__": + trackConfig["style"]["color"] = track["scaling"]["scheme"]["color"] + else: + trackConfig["style"]["color"] = self.hex_from_rgb( + *self._get_colours() + ) + else: + # Scored method + algo = track["scaling"]["algo"] + # linear, logarithmic, blast + scales = track["scaling"]["scales"] + # type __auto__, manual (min, max) + scheme = track["scaling"]["scheme"] + # scheme -> (type (opacity), color) + # ================================== + # GENE CALLS OR BLAST + # ================================== + if trackFormat == "blast": + red, green, blue = self._get_colours() + color_function = self.COLOR_FUNCTION_TEMPLATE.format( + **{ + "score": "feature._parent.get('score')", + "opacity": self.OPACITY_MATH["blast"], + "red": red, + "green": green, + "blue": blue, + } + ) + trackConfig["style"]["color"] = color_function.replace("\n", "") + elif trackFormat == "gene_calls": + # Default values, based on GFF3 spec + min_val = 0 + max_val = 1000 + # Get min/max and build a scoring function since JBrowse doesn't + if scales["type"] == "automatic" or scales["type"] == "__auto__": + min_val, max_val = self.min_max_gff(gff3) + else: + min_val = scales.get("min", 0) + max_val = scales.get("max", 1000) + + if scheme["color"] == "__auto__": + user_color = "undefined" + auto_color = "'%s'" % self.hex_from_rgb(*self._get_colours()) + elif scheme["color"].startswith("#"): + user_color = "'%s'" % self.hex_from_rgb( + *self.rgb_from_hex(scheme["color"][1:]) + ) + auto_color = "undefined" + else: + user_color = "undefined" + auto_color = "'%s'" % self.hex_from_rgb(*self._get_colours()) + + color_function = self.COLOR_FUNCTION_TEMPLATE_QUAL.format( + **{ + "opacity": self.OPACITY_MATH[algo].format( + **{"max": max_val, "min": min_val} + ), + "user_spec_color": user_color, + "auto_gen_color": auto_color, + } + ) + + trackConfig["style"]["color"] = color_function.replace("\n", "") + return trackConfig + + +def etree_to_dict(t): + if t is None: + return {} + + d = {t.tag: {} if t.attrib else None} + children = list(t) + if children: + dd = defaultdict(list) + for dc in map(etree_to_dict, children): + for k, v in dc.items(): + dd[k].append(v) + d = {t.tag: {k: v[0] if len(v) == 1 else v for k, v in dd.items()}} + if t.attrib: + d[t.tag].update(("@" + k, v) for k, v in t.attrib.items()) + if t.text: + text = t.text.strip() + if children or t.attrib: + if text: + d[t.tag]["#text"] = text + else: + d[t.tag] = text + return d + + +INSTALLED_TO = os.path.dirname(os.path.realpath(__file__)) + + +def metadata_from_node(node): + metadata = {} + try: + if len(node.findall("dataset")) != 1: + # exit early + return metadata + except Exception: + return {} + + for (key, value) in node.findall("dataset")[0].attrib.items(): + metadata["dataset_%s" % key] = value + + if node.findall("history"): + for (key, value) in node.findall("history")[0].attrib.items(): + metadata["history_%s" % key] = value + + if node.findall("metadata"): + for (key, value) in node.findall("metadata")[0].attrib.items(): + metadata["metadata_%s" % key] = value + # Additional Mappings applied: + metadata[ + "dataset_edam_format" + ] = '<a target="_blank" href="http://edamontology.org/{0}">{1}</a>'.format( + metadata["dataset_edam_format"], metadata["dataset_file_ext"] + ) + metadata["history_user_email"] = '<a href="mailto:{0}">{0}</a>'.format( + metadata["history_user_email"] + ) + metadata["hist_name"] = metadata["history_display_name"] + metadata[ + "history_display_name" + ] = '<a target="_blank" href="{galaxy}/history/view/{encoded_hist_id}">{hist_name}</a>'.format( + galaxy=GALAXY_INFRASTRUCTURE_URL, + encoded_hist_id=metadata.get("history_id", "not available"), + hist_name=metadata.get("history_display_name", "not available"), + ) + if node.findall("tool"): + for (key, value) in node.findall("tool")[0].attrib.items(): + metadata["tool_%s" % key] = value + metadata[ + "tool_tool" + ] = '<a target="_blank" href="{galaxy}/datasets/{encoded_id}/show_params">{tool_id}{tool_version}</a>'.format( + galaxy=GALAXY_INFRASTRUCTURE_URL, + encoded_id=metadata.get("dataset_id", ""), + tool_id=metadata.get("tool_tool_id", ""), + tool_version=metadata.get("tool_tool_version", ""), + ) + return metadata + + +class JbrowseConnector(object): + def __init__(self, outdir, jbrowse2path): + self.trackCounter = 0 # to avoid name clashes + self.assemblies = [] # these require more than a few line diff. + self.assmeta = {} + self.ass_first_contigs = ( + [] + ) # for default session - these are read as first line of the assembly .fai + self.giURL = GALAXY_INFRASTRUCTURE_URL + self.outdir = outdir + self.jbrowse2path = jbrowse2path + os.makedirs(self.outdir, exist_ok=True) + self.genome_names = [] + self.trackIdlist = [] + self.tracksToAdd = {} + self.config_json = {} + self.config_json_file = os.path.join(outdir, "config.json") + self.clone_jbrowse() + + def get_cwd(self, cwd): + if cwd: + return self.outdir + else: + return subprocess.check_output(["pwd"]).decode("utf-8").strip() + + def subprocess_check_call(self, command, output=None, cwd=True): + if output: + if logCommands: + log.debug( + "cd %s && %s > %s", self.get_cwd(cwd), " ".join(command), output + ) + subprocess.check_call(command, cwd=self.get_cwd(cwd), stdout=output) + else: + if logCommands: + log.debug("cd %s && %s", self.get_cwd(cwd), " ".join(command)) + subprocess.check_call(command, cwd=self.get_cwd(cwd)) + + def subprocess_popen(self, command, cwd=True): + if logCommands: + log.debug(command) + p = subprocess.Popen( + command, + cwd=self.get_cwd(cwd), + shell=True, + stdin=subprocess.PIPE, + stdout=subprocess.PIPE, + stderr=subprocess.PIPE, + ) + output, err = p.communicate() + retcode = p.returncode + if retcode != 0: + log.error(command) + log.error(output) + log.error(err) + raise RuntimeError(f"Command ( {command} ) failed with exit code {retcode}") + + def subprocess_check_output(self, command): + if logCommands: + log.debug(" ".join(command)) + return subprocess.check_output(command, cwd=self.outdir) + + def symlink_or_copy(self, src, dest): + if "GALAXY_JBROWSE_SYMLINKS" in os.environ and bool( + os.environ["GALAXY_JBROWSE_SYMLINKS"] + ): + cmd = ["ln", "-s", src, dest] + else: + cmd = ["cp", src, dest] + + return self.subprocess_check_call(cmd) + + def _prepare_track_style(self, trackDict): + + style_data = { + "type": "LinearBasicDisplay", + "displayId": "%s-LinearBasicDisplay" % trackDict["trackId"], + } + + if trackDict.get("displays", None): # use first if multiple like bed + style_data["type"] = trackDict["displays"][0]["type"] + style_data["displayId"] = trackDict["displays"][0]["displayId"] + return style_data + + def getNrow(self, url): + useuri = url.startswith("https://") or url.startswith("http://") + if not useuri: + fl = open(url, "r").readlines() + nrow = len(fl) + else: + try: + scontext = ssl.SSLContext(ssl.PROTOCOL_TLS_CLIENT) + scontext.check_hostname = False + scontext.verify_mode = ssl.VerifyMode.CERT_NONE + with urllib.request.urlopen(url, context=scontext) as f: + fl = f.readlines() + nrow = len(fl) + except Exception: + nrow = 0 + logging.debug("getNrow %s returning %d" % (url, nrow)) + return nrow + + def process_genomes(self, genomes): + assembly = [] + assmeta = [] + useuri = False + primaryGenome = None + for i, genome_node in enumerate(genomes): + this_genome = {} + if genome_node["useuri"] == "yes": + useuri = True + genome_name = genome_node["label"].strip() + if len(genome_name) == 0: + genome_name = os.path.splitext(os.path.basename(genome_node["path"]))[0] + if len(genome_name.split()) > 1: + genome_name = genome_name.split()[0] + # spaces and cruft break scripts when substituted + if not primaryGenome: + primaryGenome = genome_name + if genome_name not in self.genome_names: + self.genome_names.append(genome_name) + fapath = genome_node["path"] + if not useuri: + fapath = os.path.realpath(fapath) + assem, first_contig = self.make_assembly(fapath, genome_name, useuri) + assembly.append(assem) + self.ass_first_contigs.append(first_contig) + if genome_name == primaryGenome: # first one + this_genome["genome_name"] = genome_name # first one for all tracks + this_genome["genome_sequence_adapter"] = assem["sequence"][ + "adapter" + ] + this_genome["genome_firstcontig"] = first_contig + assmeta.append(this_genome) + self.assemblies += assembly + self.assmeta[primaryGenome] = assmeta + self.tracksToAdd[primaryGenome] = [] + return primaryGenome + + def make_assembly(self, fapath, gname, useuri): + if useuri: + faname = fapath + scontext = ssl.SSLContext(ssl.PROTOCOL_TLS_CLIENT) + scontext.check_hostname = False + scontext.verify_mode = ssl.VerifyMode.CERT_NONE + with urllib.request.urlopen(url=faname + ".fai", context=scontext) as f: + fl = f.readline() + contig = fl.decode("utf8").strip() + # Merlin 172788 8 60 61 + else: + faname = gname + ".fa.gz" + fadest = os.path.realpath(os.path.join(self.outdir, faname)) + cmd = "bgzip -i -c %s -I %s.gzi > %s && samtools faidx %s" % ( + fapath, + fadest, + fadest, + fadest, + ) + self.subprocess_popen(cmd) + contig = open(fadest + ".fai", "r").readline().strip() + adapter = { + "type": "BgzipFastaAdapter", + "fastaLocation": { + "uri": faname, + }, + "faiLocation": { + "uri": faname + ".fai", + }, + "gziLocation": { + "uri": faname + ".gzi", + }, + } + first_contig = contig.split()[:2] + first_contig.insert(0, gname) + trackDict = { + "name": gname, + "sequence": { + "type": "ReferenceSequenceTrack", + "trackId": gname, + "adapter": adapter, + }, + "displays": [ + { + "type": "LinearReferenceSequenceDisplay", + "displayId": "%s-LinearReferenceSequenceDisplay" % gname, + }, + { + "type": "LinearGCContentDisplay", + "displayId": "%s-LinearGCContentDisplay" % gname, + }, + ], + } + return (trackDict, first_contig) + + def add_default_view(self): + cmd = [ + "jbrowse", + "set-default-session", + "-s", + self.config_json_file, + "-t", + ",".join(self.trackIdlist), + "-n", + "JBrowse2 in Galaxy", + "--target", + self.config_json_file, + "-v", + " LinearGenomeView", + ] + self.subprocess_check_call(cmd) + + def write_config(self): + with open(self.config_json_file, "w") as fp: + json.dump(self.config_json, fp, indent=2) + + def text_index(self): + # Index tracks + args = [ + "jbrowse", + "text-index", + "--target", + self.outdir, + "--assemblies", + self.genome_name, + ] + + tracks = ",".join(self.trackIdlist) + if tracks: + args += ["--tracks", tracks] + + self.subprocess_check_call(args) + + def add_hic(self, data, trackData): + """ + HiC adapter. + https://github.com/aidenlab/hic-format/blob/master/HiCFormatV9.md + for testing locally, these work: + HiC data is from https://s3.amazonaws.com/igv.broadinstitute.org/data/hic/intra_nofrag_30.hic + using hg19 reference track as a + 'BgzipFastaAdapter' + fastaLocation: + uri: 'https://s3.amazonaws.com/jbrowse.org/genomes/GRCh38/fasta/GRCh38.fa.gz', + faiLocation: + uri: 'https://s3.amazonaws.com/jbrowse.org/genomes/GRCh38/fasta/GRCh38.fa.gz.fai', + gziLocation: + uri: 'https://s3.amazonaws.com/jbrowse.org/genomes/GRCh38/fasta/GRCh38.fa.gz.gzi', + Cool will not be likely to be a good fit - see discussion at https://github.com/GMOD/jbrowse-components/issues/2438 + + """ + tId = trackData["label"] + wasCool = trackData["wasCool"] + # can be served - if public. + # dsId = trackData["metadata"]["dataset_id"] + # url = "%s/api/datasets/%s/display?to_ext=hic " % (self.giURL, dsId) + useuri = trackData["useuri"].lower() == "yes" + logging.debug("wasCool=%s, data=%s, tId=%s" % (wasCool, data, tId)) + if useuri: + uri = data + else: + uri = tId + ".hic" + if not wasCool: + dest = os.path.join(self.outdir, uri) + if not os.path.exists(dest): + cmd = ["cp", data, dest] + self.subprocess_check_call(cmd) + else: + logging.error("not wasCool but %s exists" % dest) + categ = trackData["category"] + trackDict = { + "type": "HicTrack", + "trackId": tId, + "name": trackData["name"], + "assemblyNames": [trackData["assemblyNames"]], + "category": [ + categ, + ], + "adapter": {"type": "HicAdapter", "hicLocation": {"uri": uri}}, + } + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def add_maf(self, data, trackData): + """ + from https://github.com/cmdcolin/maf2bed + Note: Both formats start with a MAF as input, and note that your MAF file should contain the species name and chromosome name + e.g. hg38.chr1 in the sequence identifiers. + need the reference id - eg hg18, for maf2bed.pl as the first parameter + """ + tId = trackData["label"] + mafPlugin = { + "plugins": [ + { + "name": "MafViewer", + "url": "https://unpkg.com/jbrowse-plugin-mafviewer/dist/jbrowse-plugin-mafviewer.umd.production.min.js", + } + ] + } + categ = trackData["category"] + fname = f"{tId}" + dest = os.path.join(self.outdir, fname) + gname = trackData["assemblyNames"] + + cmd = [ + "bash", + os.path.join(INSTALLED_TO, "convertMAF.sh"), + data, + gname, + INSTALLED_TO, + dest, + ] + self.subprocess_check_call(cmd) + mafs = open(data, "r").readlines() + mafss = [x for x in mafs if (x.startswith("s\t") or x.startswith("s "))] + samp = [x.split()[1] for x in mafss if len(x.split()) > 0] + sampu = list(dict.fromkeys(samp)) + samples = [x.split(".")[0] for x in sampu] + samples.sort() + if logCommands: + logging.debug( + "$$$$ cmd=%s, mafss=%s samp=%s samples=%s" + % (" ".join(cmd), mafss, samp, samples) + ) + trackDict = { + "type": "MafTrack", + "trackId": tId, + "name": trackData["name"], + "category": [ + categ, + ], + "adapter": { + "type": "MafTabixAdapter", + "samples": samples, + "bedGzLocation": { + "uri": fname + ".sorted.bed.gz", + }, + "index": { + "location": { + "uri": fname + ".sorted.bed.gz.tbi", + }, + }, + }, + "assemblyNames": [trackData["assemblyNames"]], + "displays": [ + { + "type": "LinearBasicDisplay", + "displayId": "%s-LinearBasicDisplay" % tId, + }, + {"type": "LinearArcDisplay", "displayId": "%s-LinearArcDisplay" % tId}, + ], + } + style_json = self._prepare_track_style(trackDict) + trackDict["style"] = style_json + self.tracksToAdd[gname].append(trackDict) + self.trackIdlist.append(tId) + if self.config_json.get("plugins", None): + self.config_json["plugins"].append(mafPlugin[0]) + else: + self.config_json.update(mafPlugin) + + def _blastxml_to_gff3(self, xml, min_gap=10): + gff3_unrebased = tempfile.NamedTemporaryFile(delete=False) + cmd = [ + "python", + os.path.join(INSTALLED_TO, "blastxml_to_gapped_gff3.py"), + "--trim", + "--trim_end", + "--include_seq", + "--min_gap", + str(min_gap), + xml, + ] + subprocess.check_call(cmd, cwd=self.outdir, stdout=gff3_unrebased) + gff3_unrebased.close() + logging.debug("### blastxml to gff3 cmd = %s" % " ".join(cmd)) + return gff3_unrebased.name + + def add_blastxml(self, data, trackData, blastOpts, **kwargs): + gff3 = self._blastxml_to_gff3(data, min_gap=blastOpts["min_gap"]) + if "parent" in blastOpts and blastOpts["parent"] != "None": + gff3_rebased = tempfile.NamedTemporaryFile(delete=False) + cmd = ["python", os.path.join(INSTALLED_TO, "gff3_rebase.py")] + if blastOpts.get("protein", "false") == "true": + cmd.append("--protein2dna") + cmd.extend([os.path.realpath(blastOpts["parent"]), gff3]) + subprocess.check_call(cmd, cwd=self.outdir, stdout=gff3_rebased) + logging.debug("### gff3rebase cmd = %s" % " ".join(cmd)) + gff3_rebased.close() + # Replace original gff3 file + shutil.copy(gff3_rebased.name, gff3) + os.unlink(gff3_rebased.name) + self.add_gff(gff3, trackData, **kwargs) + + def add_bigwig(self, data, trackData): + tId = trackData["label"] + useuri = trackData["useuri"].lower() == "yes" + if useuri: + url = data + else: + url = tId + # slashes in names cause path trouble + dest = os.path.join(self.outdir, url) + cmd = ["cp", data, dest] + self.subprocess_check_call(cmd) + bwloc = {"uri": url} + categ = trackData["category"] + trackDict = { + "type": "QuantitativeTrack", + "trackId": tId, + "name": trackData["name"], + "category": [ + categ, + ], + "assemblyNames": [trackData["assemblyNames"]], + "adapter": { + "type": "BigWigAdapter", + "bigWigLocation": bwloc, + }, + "displays": [ + { + "type": "LinearWiggleDisplay", + "displayId": "%s-LinearWiggleDisplay" % tId, + } + ], + } + style_json = self._prepare_track_style(trackDict) + trackDict["style"] = style_json + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def add_bam(self, data, trackData, bam_indexes=None, **kwargs): + tId = trackData["label"] + realFName = trackData["path"] + useuri = trackData["useuri"].lower() == "yes" + categ = trackData["category"] + if useuri: + url = data + else: + fname = tId + dest = "%s/%s" % (self.outdir, fname) + self.subprocess_check_call(["cp", data, dest]) + url = fname + bindex = fname + ".bai" + bi = bam_indexes.split(",") + bam_index = [ + x.split(" ~ ")[1].strip() + for x in bi + if " ~ " in x and x.split(" ~ ")[0].strip() == realFName + ] + logging.debug( + "===realFName=%s got %s as bam_indexes %s as bi, %s for bam_index" + % (realFName, bam_indexes, bi, bam_index) + ) + if len(bam_index) > 0 and os.path.exists(os.path.realpath(bam_index[0])): + self.subprocess_check_call(["cp", bam_index[0], bindex]) + else: + cmd = ["samtools", "index", "-b", "-o", bindex, data] + self.subprocess_check_call(cmd) + trackDict = { + "type": "AlignmentsTrack", + "trackId": tId, + "name": trackData["name"], + "category": [ + categ, + ], + "assemblyNames": [trackData["assemblyNames"]], + "adapter": { + "type": "BamAdapter", + "bamLocation": {"uri": url}, + "index": { + "location": { + "uri": bindex, + } + }, + }, + "displays": [ + { + "type": "LinearAlignmentsDisplay", + "displayId": "%s-LinearAlignmentsDisplay" % tId, + }, + ], + } + style_json = self._prepare_track_style(trackDict) + trackDict["style"] = style_json + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def add_cram(self, data, trackData, cram_indexes=None, **kwargs): + tId = trackData["label"] + realFName = trackData["path"] + categ = trackData["category"] + useuri = trackData["useuri"].lower() == "yes" + gsa = self.assmeta.get(trackData["assemblyNames"], None) + if gsa: + genseqad = gsa[0]["genome_sequence_adapter"] + else: + genseqad = "Not found" + logging.warning("No adapter found for cram %s in gsa=%s" % (tId, gsa)) + if useuri: + url = data + else: + fname = tId + dest = os.path.join(self.outdir, fname) + url = fname + self.subprocess_check_call(["cp", data, dest]) + ci = cram_indexes.split(",") + cram_index = [ + x.split(" ~ ")[1].strip() + for x in ci + if " ~ " in x and x.split(" ~ ")[0].strip() == realFName + ] + logging.debug( + "===realFName=%s got %s as cram_indexes %s as ci, %s for cram_index" + % (realFName, cram_indexes, ci, cram_index) + ) + if len(cram_index) > 0 and os.path.exists(cram_index[0]): + if not os.path.exists(dest + ".crai"): + # most probably made by galaxy and stored in galaxy dirs, need to copy it to dest + self.subprocess_check_call( + ["cp", os.path.realpath(cram_index[0]), dest + ".crai"] + ) + else: + cpath = os.path.realpath(dest) + ".crai" + cmd = ["samtools", "index", "-c", "-o", cpath, os.path.realpath(dest)] + self.subprocess_check_call(cmd) + trackDict = { + "type": "AlignmentsTrack", + "trackId": tId, + "name": trackData["name"], + "category": [ + categ, + ], + "assemblyNames": [trackData["assemblyNames"]], + "adapter": { + "type": "CramAdapter", + "cramLocation": {"uri": url}, + "craiLocation": { + "uri": url + ".crai", + }, + "sequenceAdapter": genseqad, + }, + "displays": [ + { + "type": "LinearAlignmentsDisplay", + "displayId": "%s-LinearAlignmentsDisplay" % tId, + }, + ], + } + style_json = self._prepare_track_style(trackDict) + trackDict["style"] = style_json + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def add_vcf(self, data, trackData): + tId = trackData["label"] + categ = trackData["category"] + useuri = trackData["useuri"].lower() == "yes" + if useuri: + url = data + else: + url = tId + dest = os.path.join(self.outdir, url) + cmd = "bgzip -c %s > %s" % (data, dest) + self.subprocess_popen(cmd) + cmd = ["tabix", "-f", "-p", "vcf", dest] + self.subprocess_check_call(cmd) + trackDict = { + "type": "VariantTrack", + "trackId": tId, + "name": trackData["name"], + "assemblyNames": [trackData["assemblyNames"]], + "category": [ + categ, + ], + "adapter": { + "type": "VcfTabixAdapter", + "vcfGzLocation": {"uri": url}, + "index": { + "location": { + "uri": url + ".tbi", + } + }, + }, + "displays": [ + { + "type": "LinearVariantDisplay", + "displayId": "%s-LinearVariantDisplay" % tId, + }, + { + "type": "ChordVariantDisplay", + "displayId": "%s-ChordVariantDisplay" % tId, + }, + { + "type": "LinearPairedArcDisplay", + "displayId": "%s-LinearPairedArcDisplay" % tId, + }, + ], + } + style_json = self._prepare_track_style(trackDict) + trackDict["style"] = style_json + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def _sort_gff(self, data, dest): + # Only index if not already done + if not os.path.exists(dest): + cmd = "jbrowse sort-gff '%s' | bgzip -c > '%s'" % ( + data, + dest, + ) + self.subprocess_popen(cmd) + self.subprocess_check_call(["tabix", "-f", "-p", "gff", dest]) + + def _sort_bed(self, data, dest): + # Only index if not already done + if not os.path.exists(dest): + cmd = "sort -k1,1 -k2,2n '%s' | bgzip -c > '%s'" % (data, dest) + self.subprocess_popen(cmd) + cmd = ["tabix", "-f", "-p", "bed", dest] + self.subprocess_check_call(cmd) + + def add_gff(self, data, trackData): + tId = trackData["label"] + useuri = trackData["useuri"].lower() == "yes" + if useuri: + url = trackData["path"] + else: + url = tId + ".gz" + dest = os.path.join(self.outdir, url) + self._sort_gff(data, dest) + categ = trackData["category"] + trackDict = { + "type": "FeatureTrack", + "trackId": tId, + "name": trackData["name"], + "assemblyNames": [trackData["assemblyNames"]], + "category": [ + categ, + ], + "adapter": { + "type": "Gff3TabixAdapter", + "gffGzLocation": { + "uri": url, + }, + "index": { + "location": { + "uri": url + ".tbi", + } + }, + }, + "displays": [ + { + "type": "LinearBasicDisplay", + "displayId": "%s-LinearBasicDisplay" % tId, + }, + { + "type": "LinearArcDisplay", + "displayId": "%s-LinearArcDisplay" % tId, + }, + ], + } + style_json = self._prepare_track_style(trackDict) + trackDict["style"] = style_json + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def add_bed(self, data, ext, trackData): + tId = trackData["label"] + categ = trackData["category"] + useuri = trackData["useuri"].lower() == "yes" + if useuri: + url = data + else: + url = tId + ".gz" + dest = os.path.join(self.outdir, url) + self._sort_bed(data, dest) + trackDict = { + "type": "FeatureTrack", + "trackId": tId, + "name": trackData["name"], + "assemblyNames": [trackData["assemblyNames"]], + "adapter": { + "category": [ + categ, + ], + "type": "BedTabixAdapter", + "bedGzLocation": { + "uri": url, + }, + "index": { + "location": { + "uri": url + ".tbi", + } + }, + }, + "displays": [ + { + "type": "LinearBasicDisplay", + "displayId": "%s-LinearBasicDisplay" % tId, + }, + { + "type": "LinearPileupDisplay", + "displayId": "%s-LinearPileupDisplay" % tId, + }, + { + "type": "LinearArcDisplay", + "displayId": "%s-LinearArcDisplay" % tId, + }, + ], + } + style_json = self._prepare_track_style(trackDict) + trackDict["style"] = style_json + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def add_paf(self, data, trackData, pafOpts, **kwargs): + tname = trackData["name"] + tId = trackData["label"] + url = tId + useuri = data.startswith("http://") or data.startswith("https://") + if not useuri: + dest = os.path.join(self.outdir, url) + self.symlink_or_copy(os.path.realpath(data), dest) + nrow = self.getNrow(dest) + else: + url = data + nrow = self.getNrow(url) + categ = trackData["category"] + pg = pafOpts["genome"].split(",") + pgc = [x.strip() for x in pg if x.strip() > ""] + gnomes = [x.split(" ~ ") for x in pgc] + logging.debug("pg=%s, gnomes=%s" % (pg, gnomes)) + passnames = [trackData["assemblyNames"]] # always first + for i, (gpath, gname) in enumerate(gnomes): + # may have been forgotten by user for uri + if len(gname) == 0: + gn = os.path.basename(gpath) + gname = os.path.splitext(gn)[0] + # trouble from spacey names in command lines avoidance + if len(gname.split()) > 1: + gname = gname.split()[0] + if gname not in passnames: + passnames.append(gname) + useuri = pafOpts["useuri"] == "true" + if gname not in self.genome_names: + # ignore if already there - eg for duplicates among pafs. + asstrack, first_contig = self.make_assembly(gpath, gname, useuri) + self.genome_names.append(gname) + self.tracksToAdd[gname] = [] + self.assemblies.append(asstrack) + self.ass_first_contigs.append(first_contig) + trackDict = { + "type": "SyntenyTrack", + "trackId": tId, + "assemblyNames": passnames, + "category": [ + categ, + ], + "name": tname, + "adapter": { + "type": "PAFAdapter", + "pafLocation": {"uri": url}, + "assemblyNames": passnames, + }, + "displays": [ + { + "type": "LGVSyntenyDisplay", + "displayId": "%s-LGVSyntenyDisplay" % tId, + }, + { + "type": "DotplotDisplay", + "displayId": "%s-DotplotDisplay" % tId, + }, + { + "type": "LinearComparativeDisplay", + "displayId": "%s-LinearComparativeDisplay" % tId, + }, + { + "type": "LinearBasicDisplay", + "displayId": "%s-LinearSyntenyDisplay" % tId, + }, + ], + } + if nrow > 10000: + style_json = { + "type": "LGVSyntenyDisplay", + "displayId": "%s-LGVSyntenyDisplay" % tId, + } + else: + style_json = { + "type": "LinearBasicDisplay", + "displayId": "%s-LinearBasicDisplay" % tId, + } + trackDict["style"] = style_json + self.tracksToAdd[trackData["assemblyNames"]].append(trackDict) + self.trackIdlist.append(tId) + + def process_annotations(self, track): + category = track["category"].replace("__pd__date__pd__", TODAY) + for trackIndex, ( + dataset_path, + dataset_ext, + useuri, + track_human_label, + extra_metadata, + ) in enumerate(track["trackfiles"]): + if not dataset_path.strip().startswith("http"): + # Unsanitize labels (element_identifiers are always sanitized by Galaxy) + for key, value in mapped_chars.items(): + track_human_label = track_human_label.replace(value, key) + track_human_label = track_human_label.replace(" ", "_") + outputTrackConfig = { + "category": category, + "style": {}, + } + + # hashData = [ + # str(dataset_path), + # track_human_label, + # track["category"], + # ] + # hashData = "|".join(hashData).encode("utf-8") + # hash_string = hashlib.md5(hashData).hexdigest() + + outputTrackConfig["assemblyNames"] = track["assemblyNames"] + outputTrackConfig["key"] = track_human_label + outputTrackConfig["useuri"] = useuri + outputTrackConfig["path"] = dataset_path + outputTrackConfig["ext"] = dataset_ext + outputTrackConfig["trackset"] = track.get("trackset", {}) + outputTrackConfig["label"] = track["label"] + # outputTrackConfig["label"] = "%s_%i_%s_%s" % ( + # dataset_ext, + # trackIndex, + # track_human_label, + # hash_string, + # ) + + outputTrackConfig["metadata"] = extra_metadata + outputTrackConfig["name"] = track_human_label + if track["label"] in self.trackIdlist: + logging.error( + "### not adding %s already in %s" + % (track["label"], self.trackIdlist) + ) + yield None + if dataset_ext in ("gff", "gff3"): + self.add_gff( + dataset_path, + outputTrackConfig, + ) + elif dataset_ext in ("hic", "juicebox_hic"): + outputTrackConfig["wasCool"] = False + self.add_hic( + dataset_path, + outputTrackConfig, + ) + elif dataset_ext in ("cool", "mcool", "scool"): + hic_url = outputTrackConfig["label"] + hic_path = os.path.join(self.outdir, hic_url) + ".hic" + outputTrackConfig["wasCool"] = True + self.subprocess_check_call( + [ + "hictk", + "convert", + "-f", + "--output-fmt", + "hic", + dataset_path, + hic_path, + ] + ) + self.add_hic( + hic_path, + outputTrackConfig, + ) + elif dataset_ext in ("bed",): + self.add_bed( + dataset_path, + dataset_ext, + outputTrackConfig, + ) + elif dataset_ext in ("maf",): + self.add_maf( + dataset_path, + outputTrackConfig, + ) + elif dataset_ext == "bigwig": + self.add_bigwig( + dataset_path, + outputTrackConfig, + ) + elif dataset_ext == "bam": + real_indexes = track["conf"]["options"]["bam"]["bam_index"] + self.add_bam( + dataset_path, + outputTrackConfig, + bam_indexes=real_indexes, + ) + elif dataset_ext == "cram": + real_indexes = track["conf"]["options"]["cram"]["cram_index"] + self.add_cram( + dataset_path, + outputTrackConfig, + cram_indexes=real_indexes, + ) + elif dataset_ext == "blastxml": + self.add_blastxml( + dataset_path, + outputTrackConfig, + track["conf"]["options"]["blast"], + ) + elif dataset_ext == "vcf": + self.add_vcf(dataset_path, outputTrackConfig) + elif dataset_ext == "paf": + self.add_paf( + dataset_path, + outputTrackConfig, + track["conf"]["options"]["paf"], + ) + else: + logging.warning("Do not know how to handle %s", dataset_ext) + # Return non-human label for use in other fields + yield outputTrackConfig["label"] + + def add_default_session(self, default_data): + """ + default session settings are hard and fragile. + .add_default_view() and other configuration code adapted from + https://github.com/abretaud/tools-iuc/blob/jbrowse2/tools/jbrowse2/jbrowse2.py + """ + # TODO using the default session for now, but check out session specs in the future https://github.com/GMOD/jbrowse-components/issues/2708 + track_types = {} + with open(self.config_json_file, "r") as config_file: + config_json = json.load(config_file) + if self.config_json: + config_json.update(self.config_json) + if "defaultSession" in config_json: + session_json = config_json["defaultSession"] + session_views = [] + else: + session_json = {} + session_views = [] + for gnome in self.assmeta.keys(): # assemblies have their own tracks + tracks_data = [] + for track_conf in self.tracksToAdd[gnome]: + tId = track_conf["trackId"] + if tId in default_data[gnome]["visibility"]["default_on"]: + track_types[tId] = track_conf["type"] + style_data = default_data[gnome]["style"].get(tId, None) + if not style_data: + logging.debug( + "### No style data for %s in available default data %s" + % (tId, default_data) + ) + style_data = {"type": "LinearBasicDisplay"} + if "displays" in track_conf: + disp = track_conf["displays"][0]["type"] + style_data["type"] = disp + if track_conf.get("style_labels", None): + # TODO fix this: it should probably go in a renderer block (SvgFeatureRenderer) but still does not work + # TODO move this to per track displays? + style_data["labels"] = track_conf["style_labels"] + tracks_data.append( + { + "type": track_types[tId], + "configuration": tId, + "displays": [style_data], + } + ) + view_json = { + "type": "LinearGenomeView", + "offsetPx": 0, + "minimized": False, + "tracks": tracks_data, + } + first = [x for x in self.ass_first_contigs if x[0] == gnome] + if len(first) > 0: + [gnome, refName, end] = first[0] + start = 0 + end = int(end) + drdict = { + "refName": refName, + "start": start, + "end": end, + "reversed": False, + "assemblyName": gnome, + } + else: + ddl = default_data.get("defaultLocation", None) + if ddl: + loc_match = re.search(r"^([^:]+):([\d,]*)\.*([\d,]*)$", ddl) + # allow commas like 100,000 but ignore as integer + if loc_match: + refName = loc_match.group(1) + drdict["refName"] = refName + if loc_match.group(2) > "": + drdict["start"] = int(loc_match.group(2).replace(",", "")) + if loc_match.group(3) > "": + drdict["end"] = int(loc_match.group(3).replace(",", "")) + else: + logging.info( + "@@@ regexp could not match contig:start..end in the supplied location %s - please fix" + % ddl + ) + if drdict.get("refName", None): + # TODO displayedRegions is not just zooming to the region, it hides the rest of the chromosome + view_json["displayedRegions"] = [ + drdict, + ] + logging.info("@@@ defaultlocation %s for default session" % drdict) + else: + logging.info( + "@@@ no track location for default session - please add one!" + ) + session_views.append(view_json) + session_name = default_data.get("session_name", "New session") + for key, value in mapped_chars.items(): + session_name = session_name.replace(value, key) + session_json["name"] = session_name + + if "views" not in session_json: + session_json["views"] = session_views + else: + session_json["views"] += session_views + + pp = json.dumps(session_views, indent=2) + config_json["defaultSession"] = session_json + self.config_json.update(config_json) + logging.debug("defaultSession=%s" % (pp)) + with open(self.config_json_file, "w") as config_file: + json.dump(self.config_json, config_file, indent=2) + + def add_defsess_to_index(self, data): + """ + ---------------------------------------------------------- + Add some default session settings: set some assemblies/tracks on/off + + This allows to select a default view: + - jb type (Linear, Circular, etc) + - default location on an assembly + - default tracks + - ... + + Different methods to do that were tested/discussed: + - using a defaultSession item in config.json: this proved to be difficult: + forced to write a full session block, including hard-coded/hard-to-guess items, + no good way to let Jbrowse2 display a scaffold without knowing its size + - using JBrowse2 as an embedded React component in a tool-generated html file: + it works but it requires generating js code to actually do what we want = chosing default view, assembly, tracks, ... + - writing a session-spec inside the config.json file: this is not yet supported as of 2.10.2 (see PR 4148 below) + a session-spec is a kind of simplified defaultSession where you don't need to specify every aspect of the session + - passing a session-spec through URL params by embedding the JBrowse2 index.html inside an iframe + we selected this option + + Xrefs to understand the choices: + https://github.com/GMOD/jbrowse-components/issues/2708 + https://github.com/GMOD/jbrowse-components/discussions/3568 + https://github.com/GMOD/jbrowse-components/pull/4148 + """ + new_index = "Nothing written" + session_spec = {"views": []} + logging.debug("def ass_first=%s\ndata=%s" % (self.ass_first_contigs, data)) + for first_contig in self.ass_first_contigs: + logging.debug("first contig=%s" % self.ass_first_contigs) + [gnome, refName, end] = first_contig + start = 0 + aview = { + "assembly": gnome, + "loc": "{}:{}..{}".format(refName, start, end), + "type": "LinearGenomeView", + "tracks": data[gnome]["tracks"], + } + session_spec["views"].append(aview) + sess = json.dumps(session_spec, sort_keys=True, indent=2) + new_index = INDEX_TEMPLATE.replace( + "__SESSION_SPEC__", "&session=spec-{}".format(sess) + ) + + os.rename( + os.path.join(self.outdir, "index.html"), + os.path.join(self.outdir, "index_noview.html"), + ) + + with open(os.path.join(self.outdir, "index.html"), "w") as nind: + nind.write(new_index) + logging.debug( + "#### add_defsession gnome=%s refname=%s\nsession_spec=%s\nnew_index=%s" + % (gnome, refName, sess, new_index) + ) + + def add_general_configuration(self, data): + """ + Add some general configuration to the config.json file + """ + + config_path = self.config_json_file + if os.path.exists(config_path): + with open(config_path, "r") as config_file: + config_json = json.load(config_file) + else: + config_json = {} + if self.config_json: + config_json.update(self.config_json) + config_data = {} + + config_data["disableAnalytics"] = data.get("analytics", "false") == "true" + + config_data["theme"] = { + "palette": { + "primary": {"main": data.get("primary_color", "#0D233F")}, + "secondary": {"main": data.get("secondary_color", "#721E63")}, + "tertiary": {"main": data.get("tertiary_color", "#135560")}, + "quaternary": {"main": data.get("quaternary_color", "#FFB11D")}, + }, + "typography": {"fontSize": int(data.get("font_size", 10))}, + } + if not config_json.get("configuration", None): + config_json["configuration"] = {} + config_json["configuration"].update(config_data) + self.config_json.update(config_json) + with open(config_path, "w") as config_file: + json.dump(self.config_json, config_file, indent=2) + + def clone_jbrowse(self, realclone=False): + """ + Clone a JBrowse directory into a destination directory. + + `realclone=true` will use the `jbrowse create` command. + To allow running on internet-less compute and for reproducibility + use frozen code with `realclone=false + + """ + dest = self.outdir + if realclone: + self.subprocess_check_call( + ["jbrowse", "create", dest, "-f", "--tag", f"{JB2VER}"] + ) + else: + shutil.copytree(self.jbrowse2path, dest, dirs_exist_ok=True) + for fn in [ + "asset-manifest.json", + "favicon.ico", + "robots.txt", + "umd_plugin.js", + "version.txt", + "test_data", + ]: + try: + path = os.path.join(dest, fn) + if os.path.isdir(path): + shutil.rmtree(path) + else: + os.remove(path) + except OSError as e: + log.error("Error: %s - %s." % (e.filename, e.strerror)) + shutil.copyfile(os.path.join(INSTALLED_TO, "jb2_webserver.py"), os.path.join(dest, "jb2_webserver.py")) + + +def parse_style_conf(item): + if item.text.lower() in ["false", "true", "yes", "no"]: + return item.text.lower in ("yes", "true") + else: + return item.text + + +if __name__ == "__main__": + parser = argparse.ArgumentParser(description="", epilog="") + parser.add_argument("--xml", help="Track Configuration") + parser.add_argument( + "--jbrowse2path", help="Path to JBrowse2 directory in BioContainer or Conda" + ) + parser.add_argument("--outdir", help="Output directory", default="out") + parser.add_argument("--version", "-V", action="version", version=JB2VER) + args = parser.parse_args() + tree = ET.parse(args.xml) + root = tree.getroot() + + # This should be done ASAP + GALAXY_INFRASTRUCTURE_URL = root.find("metadata/galaxyUrl").text + # Sometimes this comes as `localhost` without a protocol + if not GALAXY_INFRASTRUCTURE_URL.startswith("http"): + # so we'll prepend `http://` and hope for the best. Requests *should* + # be GET and not POST so it should redirect OK + GALAXY_INFRASTRUCTURE_URL = "http://" + GALAXY_INFRASTRUCTURE_URL + + jc = JbrowseConnector(outdir=args.outdir, jbrowse2path=args.jbrowse2path) + + default_session_data = {} + trackI = 0 + for ass in root.findall("assembly"): + genomes = [ + { + "path": x.attrib["path"], + "label": x.attrib["label"].split(" ")[0].replace(",", ""), + "useuri": x.attrib["useuri"], + "meta": metadata_from_node(x.find("metadata")), + } + for x in ass.findall("metadata/genomes/genome") + ] + primaryGenome = jc.process_genomes(genomes) + if not default_session_data.get(primaryGenome, None): + default_session_data[primaryGenome] = { + "tracks": [], + "style": {}, + "style_labels": {}, + "visibility": { + "default_on": [], + "default_off": [], + }, + } + for track in ass.find("tracks"): + track_conf = {} + track_conf["trackfiles"] = [] + track_conf["assemblyNames"] = primaryGenome + is_multi_bigwig = False + try: + if track.find("options/wiggle/multibigwig") and ( + track.find("options/wiggle/multibigwig").text == "True" + ): + is_multi_bigwig = True + multi_bigwig_paths = [] + except KeyError: + pass + + trackfiles = track.findall("files/trackFile") + if trackfiles: + for x in trackfiles: + track_conf["label"] = "%s_%d" % ( + x.attrib["label"].replace(" ", "_").replace(",", ""), + trackI, + ) + trackI += 1 + track_conf["useuri"] = x.attrib["useuri"] + if is_multi_bigwig: + multi_bigwig_paths.append( + ( + track_conf["label"], + track_conf["useuri"], + os.path.realpath(x.attrib["path"]), + ) + ) + else: + if trackfiles: + metadata = metadata_from_node(x.find("metadata")) + track_conf["dataset_id"] = metadata.get( + "dataset_id", "None" + ) + if x.attrib["useuri"].lower() == "yes": + tfa = ( + x.attrib["path"], + x.attrib["ext"], + x.attrib["useuri"], + track_conf["label"], + metadata, + ) + else: + tfa = ( + os.path.realpath(x.attrib["path"]), + x.attrib["ext"], + x.attrib["useuri"], + track_conf["label"], + metadata, + ) + track_conf["trackfiles"].append(tfa) + + if is_multi_bigwig: + metadata = metadata_from_node(x.find("metadata")) + + track_conf["trackfiles"].append( + ( + multi_bigwig_paths, # Passing an array of paths to represent as one track + "bigwig_multiple", + "MultiBigWig", # Giving an hardcoded name for now + {}, # No metadata for multiple bigwig + ) + ) + + track_conf["category"] = track.attrib["cat"] + track_conf["format"] = track.attrib["format"] + track_conf["conf"] = etree_to_dict(track.find("options")) + keys = jc.process_annotations(track_conf) + if keys: + for key in keys: + vis = track.attrib.get("visibility", "default_off") + if not vis: + vis = "default_off" + default_session_data[primaryGenome]["visibility"][vis].append(key) + trakdat = jc.tracksToAdd[primaryGenome] + stile = {} + for trak in trakdat: + if trak["trackId"] == key: + stile = trak.get("style", {}) + if track.find("options/style"): + supdate = { + item.tag: parse_style_conf(item) + for item in track.find("options/style") + } + stile.update(supdate) + default_session_data[primaryGenome]["style"][key] = stile + if track.find("options/style_labels"): + default_session_data[primaryGenome]["style_labels"][key] = { + item.tag: parse_style_conf(item) + for item in track.find("options/style_labels") + } + default_session_data[primaryGenome]["tracks"].append(key) + default_session_data["defaultLocation"] = root.find( + "metadata/general/defaultLocation" + ).text + default_session_data["session_name"] = root.find( + "metadata/general/session_name" + ).text + logging.debug("default_session=%s" % (json.dumps(default_session_data, indent=2))) + jc.zipOut = root.find("metadata/general/zipOut").text == "true" + general_data = { + "analytics": root.find("metadata/general/analytics").text, + "primary_color": root.find("metadata/general/primary_color").text, + "secondary_color": root.find("metadata/general/secondary_color").text, + "tertiary_color": root.find("metadata/general/tertiary_color").text, + "quaternary_color": root.find("metadata/general/quaternary_color").text, + "font_size": root.find("metadata/general/font_size").text, + } + jc.add_general_configuration(general_data) + trackconf = jc.config_json.get("tracks", []) + for gnome in jc.genome_names: + gtracks = jc.tracksToAdd[gnome] + if len(gtracks) > 0: + logging.debug( + "for genome %s adding gtracks %s" + % (gnome, json.dumps(gtracks, indent=2)) + ) + trackconf += gtracks + jc.config_json["tracks"] = trackconf + assconf = jc.config_json.get("assemblies", []) + assconf += jc.assemblies + jc.config_json["assemblies"] = assconf + logging.debug( + "assmeta=%s, first_contigs=%s, assemblies=%s, gnames=%s, trackidlist=%s, tracks=%s" + % ( + jc.assmeta, + jc.ass_first_contigs, + json.dumps(assconf, indent=2), + jc.genome_names, + jc.trackIdlist, + json.dumps(trackconf, indent=2), + ) + ) + jc.write_config() + jc.add_default_session(default_session_data) + # note that this can be left in the config.json but has NO EFFECT if add_defsess_to_index is called. + # jc.add_defsess_to_index(default_session_data) + # jc.text_index() not sure what broke here.
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/jbrowse2.xml Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,1239 @@ +<tool id="jbrowse2" name="JBrowse2" version="@TOOL_VERSION@+galaxy@VERSION_SUFFIX@" profile="22.05"> + <description>genome browser</description> + <macros> + <import>macros.xml</import> + </macros> + <expand macro="edamInc"/> + <xrefs> + <xref type="bio.tools">jbrowse2</xref> + </xrefs> + <expand macro="requirements"/> + <required_files> + <include path="autogenJB2.py"/> + <include path="blastxml_to_gapped_gff3.py"/> + <include path="convertMAF.sh"/> + <include path="gff3_rebase.py"/> + <include path="jb2_webserver.py"/> + <include path="jbrowse2.py"/> + <include path="maf2bed.py"/> + </required_files> + <version_command>python '${__tool_directory__}/jbrowse2.py' --version</version_command> + <command detect_errors="aggressive"><![CDATA[ +mkdir -p '$output.files_path' && +cp '$trackxml' '$output.files_path/galaxy.xml' && +export JBROWSE2_PATH=\$(dirname \$(which jbrowse))/../opt/jbrowse2 && +#if $jbgen.ucol.formcoll=="collect": + python '$__tool_directory__/autogenJB2.py' + #for $key in $autoCollection.keys(): + #if $autoCollection[$key].is_collection: + #set subCol=$autoCollection[$key] + #set pafs=[($subCol[x],$subcol[x].ext,x) for x in $subCol.keys() if $subCol[x].ext == 'paf'] + #if len($pafs) > 0: + --pafmeta '$pafs[0]' + #set refs = [($pafs[0][2],$subCol[x],x) for x in $subCol.keys() if $subCol[x].ext == 'fasta'] + #for $ref in $refs: + --pafreferencemeta '$ref' + #end for + #end if + #else if $autoCollection[$key].ext == 'fasta': + --referencemeta '$autoCollection[$key],$autoCollection[$key].ext,$key' + #else if $autoCollection[$key].ext in ['bed', 'bigwig', 'cool', 'gff', 'gff3', 'hic', 'maf', 'mcool', 'scool', 'vcf'] + --trackmeta '$autoCollection[$key],$autoCollection[$key].ext,$key' + #else if $autoCollection[$key].ext in ['bam',] + --trackmeta '$autoCollection[$key],$autoCollection[$key].ext,$key,$autoCollection[$key].metadata.bam_index' + #else if $autoCollection[$key].ext in ['cram',] + --trackmeta '$autoCollection[$key],$autoCollection[$key].ext,$key,$autoCollection[$key].metadata.cram_index' + #end if + #end for + --outdir '$output.files_path' + --jbrowse2path \${JBROWSE2_PATH} + --sessName "Autogen JBrowse" && + #if $jbgen.zipOut == "true": + cd '$output.files_path' && + zip -r - . > '$output' + #else + cp '$output.files_path/index.html' '$output' + #end if +#else: + python '$__tool_directory__/jbrowse2.py' + --jbrowse2path \${JBROWSE2_PATH} + --outdir '$output.files_path' + --xml '$trackxml' && + #if $jbgen.zipOut == "true": + cd '$output.files_path' && + zip -r - . > '$output' + #else + cp '$output.files_path/index.html' '$output' + #end if +#end if + ]]></command> + <configfiles> + <configfile name="trackxml"><![CDATA[<?xml version="1.0"?> +<root> + <metadata> + <general> + <defaultLocation>${jbgen.defaultLocation}</defaultLocation> + <zipOut>${jbgen.zipOut}</zipOut> + <analytics>${jbgen.enableAnalytics}</analytics> + <primary_color>${jbgen.primary_color}</primary_color> + <secondary_color>${jbgen.secondary_color}</secondary_color> + <tertiary_color>${jbgen.tertiary_color}</tertiary_color> + <quaternary_color>${jbgen.quaternary_color}</quaternary_color> + <font_size>${jbgen.font_size}</font_size> + <session_name>${jbgen.session_name}</session_name> + </general> + <galaxyUrl>${__app__.config.galaxy_infrastructure_url}</galaxyUrl> + </metadata> +#if $jbgen.ucol.formcoll=="form": + #for $assembly in $assemblies: + <assembly> + <metadata> + <genomes> + #if str($assembly.reference_genome.genome_type_select) == "uri": + <genome path="${assembly.reference_genome.uri}" label="${assembly.reference_genome.refname}" useuri="yes"> + <metadata> + <dataset + dname = "${assembly.reference_genome.refname}" /> + </metadata> + </genome> + #else if str($assembly.reference_genome.genome_type_select) == "indexed": + <genome path="${assembly.reference_genome.genome.fields.path}" label="${assembly.reference_genome.genome.fields.dbkey}" useuri="no"> + <metadata> + <dataset + dname = "${assembly.reference_genome.genome.fields.name}" /> + </metadata> + </genome> + #else + <genome path="$assembly.reference_genome.genome" label="${assembly.reference_genome.genome.name}" useuri="no"> + <metadata> + + <dataset id="${__app__.security.encode_id($assembly.reference_genome.genome.id)}" + hid="${assembly.reference_genome.genome.hid}" + size="${assembly.reference_genome.genome.get_size(nice_size=True)}" + edam_format="${assembly.reference_genome.genome.datatype.edam_format}" + file_ext="${assembly.reference_genome.genome.ext}" + dname = "${assembly.reference_genome.genome.name}" + /> + <history + #if $assembly.reference_genome.genome.history.user: + user_email="${assembly.reference_genome.genome.history.user.email}" + user_id="${assembly.reference_genome.genome.history.user_id}" + display_name="${assembly.reference_genome.genome.history.get_display_name()}"/> + #else + user_email="anonymous" + user_id="-1" + display_name="Unnamed History" + #end if + id="${__app__.security.encode_id($assembly.reference_genome.genome.history_id)}" + /> + <metadata + #for (key, value) in $assembly.reference_genome.genome.get_metadata().items(): + #if "_types" not in $key: + #if isinstance($value, list): + #set value_str = "[%s]" % ','.join([str(val) for val in value]) + ${key}="$value_str" + #else + ${key}="${value}" + #end if + #end if + #end for + /> + <tool + tool_id="${assembly.reference_genome.genome.creating_job.tool_id}" + tool_version="${assembly.reference_genome.genome.creating_job.tool_version}" + /> + </metadata> + </genome> + #end if + </genomes> + </metadata> + <tracks> + #for $tg in $assembly.track_groups: + #for $track in $tg.data_tracks: + #set input_exists = "False" + #if $track.data_format.useuri.insource == "uri": + #if $track.data_format.useuri.annouri: + #set input_exists = "True" + <track cat="${tg.category}" format="${track.data_format.data_format_select}" visibility="${track.data_format.track_visibility}"> + <files> + <trackFile path="${track.data_format.useuri.annouri}" ext="${track.data_format.data_format_select}" + label="${track.data_format.useuri.annoname}" useuri="yes"> + <metadata> + <dataset id = "${track.data_format.useuri.annouri}" /> + </metadata> + </trackFile> + </files> + #end if + #else if $track.data_format.useuri.insource == "history": + #if $track.data_format.useuri.annotation: + #set input_exists = "True" + <track cat="${tg.category}" format="${track.data_format.data_format_select}" visibility="${track.data_format.track_visibility}"> + <files> + #for $dataset in $track.data_format.useuri.annotation: + <trackFile path="${dataset}" ext="${dataset.ext}" label="${dataset.name}" useuri="no"> + <metadata> + <dataset id="${__app__.security.encode_id($dataset.id)}" hid="${dataset.hid}" + size="${dataset.get_size(nice_size=True)}" + edam_format="${dataset.datatype.edam_format}" + file_ext="${dataset.ext}" /> + <history id="${__app__.security.encode_id($dataset.history_id)}" + #if $dataset.history.user: + user_email="${dataset.history.user.email}" + user_id="${dataset.history.user_id}" + display_name="${dataset.history.get_display_name()}"/> + #else + user_email="anonymous" + user_id="-1" + display_name="Unnamed History"/> + #end if + <metadata + #for (key, value) in $dataset.get_metadata().items(): + #if "_types" not in $key and $value is not None and len(str($value)) < 5000: + #if isinstance($value, list): + #set value_str = "[%s]" % ','.join([str(val) for val in value]) + ${key}="$value_str" + #else + ${key}="${value}" + #end if + #end if + #end for + /> + <tool + tool_id="${dataset.creating_job.tool_id}" + tool_version="${dataset.creating_job.tool_version}" + /> + </metadata> + </trackFile> + #end for + </files> + #end if + #end if + #if $input_exists == "True": + <options> + <style> + #if str($track.data_format.data_format_select) in ["gff", "bed", "maf", "blastxml"]: + <type>${track.data_format.jbstyle.display}</type> + #if str($track.data_format.jbstyle.display) in ["LinearBasicDisplay", "LinearVariantDisplay"]: + <trackShowLabels>${track.data_format.jbstyle.show_labels}</trackShowLabels> + <trackShowDescriptions>${track.data_format.jbstyle.show_descriptions}</trackShowDescriptions> + #end if + #else if str($track.data_format.data_format_select) == "vcf": + <type>${track.data_format.jbstyle.track_style.display}</type> + #if str($track.data_format.jbstyle.track_style.display) in ["LinearBasicDisplay", "LinearVariantDisplay"]: + <trackShowLabels>${track.data_format.jbstyle.track_style.show_labels}</trackShowLabels> + <trackShowDescriptions>${track.data_format.jbstyle.track_style.show_descriptions}</trackShowDescriptions> + #end if + + #end if + #if str($track.data_format.data_format_select) in ["bam", "cram"]: + <type>LinearAlignmentsDisplay</type> + #end if + #if str($track.data_format.data_format_select) in ["hic", "cool", "mcool", "scool"]: + <type>LinearHicDisplay</type> + #end if + </style> + #if str($track.data_format.data_format_select) == "bam": + <bam> + <bam_index> + #for $dataset in $track.data_format.useuri.annotation: + ${dataset} ~ ${dataset.metadata.bam_index}, + #end for + </bam_index> + </bam> + #else if str($track.data_format.data_format_select) == "cram": + <cram> + <cram_index> + #for $dataset in $track.data_format.useuri.annotation: + ${dataset} ~ ${dataset.metadata.cram_index}, + #end for + </cram_index> + </cram> + #else if str($track.data_format.data_format_select) == "blastxml": + <blast> + #if str($track.data_format.blast_parent) != "": + <parent>${track.data_format.blast_parent}</parent> + #end if + <protein>${track.data_format.is_protein}</protein> + <min_gap>${track.data_format.min_gap}</min_gap> + </blast> + #else if str($track.data_format.data_format_select) == "gff": + <gff> + #if $track.data_format.match_part.match_part_select == "true": + <match>${track.data_format.match_part.name}</match> + #end if + </gff> + #else if str($track.data_format.data_format_select) == "paf": + <paf> + #if str($track.data_format.pafuseuri.insource) == "history": + <genome> + #for $anno in $track.data_format.pafuseuri.annotation: + ${anno} ~ ${anno.name}, + #end for + </genome> + <useuri> + false + </useuri> + #else: + <genome> + #for $refgenome in $track.data_format.pafuseuri.refuri: + ${refgenome.annotation} ~ ${refgenome.annoname}, + #end for + </genome> + <useuri> + true + </useuri> + #end if + </paf> + #else if str($track.data_format.data_format_select) == "hic": + <hic> + </hic> + #else if str($track.data_format.data_format_select) == "cool": + <cool> + </cool> + #else if str($track.data_format.data_format_select) == "bed": + <bed> + </bed> + #else if str($track.data_format.data_format_select) == "sparql": + <label>${track.data_format.label}</label> + <sparql> + <url>${track.data_format.url}</url> + <query>${track.data_format.query}</query> + <query_refnames>${track.data_format.query_refnames}</query_refnames> + </sparql> + #end if + </options> + </track> + #end if + #end for + #end for + </tracks> + </assembly> + #end for +#end if +</root> +]]></configfile> + </configfiles> + <inputs> + <repeat name="assemblies" min="1" title="Genome reference to provide display coordinates for this set of tracks" help="JBrowse2 can show a set of tracks for each of multiple genome reference fasta files"> + <conditional name="reference_genome"> + <param name="genome_type_select" type="select" label="Reference genome source" help="Select a built in, history or remote tabix URI for the reference track"> + <option selected="True" value="indexed">Use a Galaxy server built-in genome</option> + <option value="history">Use a genome fasta file from the current history</option> + <option value="uri">Provide a URI (e.g. "https://..." for a genome reference as + tabix bgzip with predictable index file URI</option> + </param> + <when value="indexed"> + <param name="genome" type="select" optional="true" label="Select a built in reference genome or custom genome" help="If not listed, add a custom genome, use a history genome or a remote URI tabix genome"> + <options from_data_table="all_fasta"> + <filter column="2" type="sort_by"/> + <validator message="No genomes are available for the selected input dataset" type="no_options"/> + </options> + </param> + </when> + <when value="history"> + <param name="genome" type="data" format="fasta" optional="true" label="Select the reference genome"/> + </when> + <when value="uri"> + <param name="uri" type="text" label="URI pointing to tabix compressed fasta"/> + <param name="refname" type="text" label="Reference dbkey for tracks with this genome, such as hg38 or SacCer4"/> + </when> + </conditional> + <repeat name="track_groups" title="Track Group"> + <param name="category" type="text" value="Default" optional="False" label="Track Category" help="Organise your tracks into Categories for a nicer end-user experience. You can use #date# and it will be replaced with the current date in 'yyyy-mm-dd' format, which is very useful for repeatedly updating a JBrowse instance when member databases / underlying tool versions are updated."/> + <repeat name="data_tracks" title="Annotation Track"> + <conditional name="data_format" label="Track Data Selection Options"> + <param name="data_format_select" type="select" label="Track Type"> + <option value="bam">BAM track. Recommend convert to faster BED unless mapping annotation needed</option> + <option value="bed">BED track</option> + <option value="bigwig">BigWig track</option> + <option value="blastxml">Blast XML track (as GFF3)</option> + <option value="cram">CRAM track, we recomment to convert to BED like for BAM</option> + <option value="gff">GFF/GFF3 track</option> + <option value="cool">HiC as cool/mcool/scool format files</option> + <option value="hic">HiC as juicebox_hic format file. Tabular hic_matrix will NOT work.</option> + <option value="maf">Multiple alignment format. Reference name must match the MAF name exactly to work correctly</option> + <option value="paf">PAF - approximate mapping positions between two set of sequences</option> + <option value="vcf">VCF SNP track</option> + </param> + <when value="blastxml"> + <expand macro="input_conditional" label="BlastXML Track Data" format="blastxml"/> + <param name="blast_parent" type="data" format="gff3" optional="true" label="Features used in Blast Search" help="in GFF3. This is used so we know where to map features. E.g. where results of which CDS Protein32 match up to. The query IDs in your blast results should MATCH some feature IDs in your GFF3 file. This is an optional field and is most useful if using JBrowse to display protein blast results on a DNA genome. blastn results don't need this, blastp results on a protein sequence don't need this."/> + <param name="min_gap" type="integer" min="2" value="10" label="Minimum Gap Size" help="before a new match_part feature is created"/> + <param name="is_protein" type="boolean" truevalue="true" falsevalue="false" label="Is this a protein blast search?"/> + <expand macro="track_styling_vgp"/> + <expand macro="track_visibility"/> + </when> + <when value="vcf"> + <expand macro="input_conditional" label="SNP Track Data" format="vcf,vcf_bgzip"/> + <expand macro="track_styling_vcf"/> + <expand macro="track_visibility"/> + </when> + <when value="gff"> + <expand macro="input_conditional" label="GFF/GFF3 Track Data" format="gff,gff3"/> + <conditional name="match_part" label="match/match_part data"> + <param name="match_part_select" type="select" label="This is match/match_part data" help="Match part data selection "> + <option selected="True" value="false">Not match/match part data</option> + <option value="true">Match/match part data</option> + </param> + <when value="true"> + <param name="name" type="text" value="match" optional="True" label="Match Part Feature Type" help="Match_parts have options for the parent feature type, such as cDNA_match, match, translated_nucleotide_match, etc. Please select the appropriate one here. You can leave empty to try autodetection (only works with CanvasFeatures track type)."/> + </when> + <when value="false"/> + </conditional> + <expand macro="track_styling_vgp"/> + <expand macro="track_visibility"/> + </when> + <when value="bam"> + <expand macro="input_conditional" label="BAM Track Data" format="bam"/> + <expand macro="track_visibility"/> + </when> + <when value="bed"> + <expand macro="input_conditional" label="BED Track Data" format="bed"/> + <expand macro="track_styling_vgp"/> + <expand macro="track_visibility"/> + </when> + <when value="cram"> + <expand macro="input_conditional" label="CRAM Track Data" format="cram"/> + <expand macro="track_visibility"/> + </when> + <when value="maf"> + <expand macro="input_conditional" label="MAF Track Data" format="maf"/> + <expand macro="track_styling_vgp"/> + <expand macro="track_visibility"/> + </when> + <when value="bigwig"> + <expand macro="input_conditional" label="BigWig Track Data" format="bigwig"/> + <expand macro="track_visibility"/> + </when> + <when value="paf"> + <expand macro="input_conditional" label="PAF format synteny mapping data" format="paf" help="PAF made with mashmap or minimap2 mapping the real reference as a query on the comparison genomes as references"/> + <expand macro="pafref_conditional" label="Comparison genome sequence(s)" format="fasta" help="Paf from these as the reference(s), using the common reference as the reads to map"/> + <expand macro="track_visibility"/> + </when> + <when value="hic"> + <expand macro="input_conditional" label="Binary Juicebox HiC data" format="hic,juicebox_hic"/> + <expand macro="track_visibility"/> + </when> + <when value="cool"> + <expand macro="input_conditional" label="HiC data in cool/mcool/scool format" format="cool,mcool,scool"/> + <expand macro="track_visibility"/> + </when> + </conditional> + </repeat> + </repeat> + </repeat> + <expand macro="general_options"/> + </inputs> + <outputs> + <data format="html" name="output" label="JBrowse2: ${jbgen.session_name}"> + <change_format> + <when input="zipOut" value="true" format="zip"/> + </change_format> + </data> + </outputs> + <tests> + <test> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/Merlin"/> + <param name="genome.ext" value="fasta"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Assembly properties"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/merlin.gff3"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="bigwig"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/merlin.bw"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="bed"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/test-6.bed"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="vcf"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/merlin.vcf"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="cram"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/merlin.cram"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="bam"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/merlin-sample.bam"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="maf"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/merlinlastz.maf"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="blastxml"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/merlin.blastxml"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="bam"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" value=""/> + </conditional> + </conditional> + </repeat> + </repeat> + </repeat> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/dm3.fa"/> + <param name="genome.ext" value="fasta"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Assembly properties"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="cool"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/dm3test.cool"/> + </conditional> + </conditional> + </repeat> + </repeat> + </repeat> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/Ppersica1.fa"/> + <param name="genome.ext" value="fasta"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Assembly properties"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="paf"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/peach-grape-map.paf"/> + </conditional> + <conditional name="pafuseuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/Vvinifera1.fa"/> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="paf"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/arabgrapepeach.paf"/> + </conditional> + <conditional name="pafuseuri"> + <param name="insource" value="uri"/> + <repeat name="refuri"> + <param name="annotation" value="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/arabidopsis.fa.gz"/> + <param name="annoname" value="arabidopsis.URI"/> + </repeat> + <repeat name="refuri"> + <param name="annotation" value="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/Vvinifera1.fa.gz"/> + <param name="annoname" value="Vvinifera1.URI"/> + </repeat> + </conditional> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="paf"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/arabgrapepeach.paf"/> + </conditional> + <conditional name="pafuseuri"> + <param name="insource" value="history"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/arabidopsis.fa"/> + <param name="annotation" location="https://usegalaxy.eu/static/share/ebp/jbrowse2/test-data/Vvinifera1.fa"/> + </conditional> + </conditional> + </repeat> + </repeat> + </repeat> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="uri"/> + <param name="uri" value="https://s3.amazonaws.com/jbrowse.org/genomes/GRCh38/fasta/GRCh38.fa.gz"/> + <param name="refname" value="hg38"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Assembly properties"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="hic"/> + <conditional name="useuri"> + <param name="insource" value="uri"/> + <param name="annouri" value="https://s3.amazonaws.com/igv.broadinstitute.org/data/hic/intra_nofrag_30.hic"/> + <param name="annoname" value="humanhic"/> + </conditional> + </conditional> + </repeat> + </repeat> + </repeat> + <section name="jbgen"> + <param name="zipOut" value="true"/> + <param name="session_name" value="New test session"/> + </section> + <!-- use the following line to generate a ZIP file to inspect it --> + <!--output name="output" file="jbrowse2_result02.zip" ftype="zip" compare="sim_size"--> + <output name="output" ftype="zip"> + <assert_contents> + <has_archive_member path=".*" n="637" delta="10"/> + <has_archive_member path="index.html"/> + <has_archive_member path="static/media/roboto-latin-400-normal.3f2b9a42f643e62a49b7.woff"/> + <has_archive_member path="manifest.json"> + <has_text text="favicon.ico"/> + <has_text text="background_color"/> + </has_archive_member> + <has_archive_member path="galaxy.xml"> + <is_valid_xml/> + <has_n_elements_with_path path="./metadata/general/session_name" n="1"/> + <element_text_is path="./metadata/general/session_name" text="New test session"/> + </has_archive_member> + <has_archive_member path="merlin.gff3_0.gz"/> + <has_archive_member path="merlin.gff3_0.gz.tbi"/> + <has_archive_member path="merlin.bw_1"/> + <has_archive_member path="test-6.bed_2.gz"/> + <has_archive_member path="merlin.vcf_3"/> + <has_archive_member path="merlin.vcf_3.tbi"/> + <has_archive_member path="merlin.cram_4"/> + <has_archive_member path="merlin.cram_4.crai"/> + <has_archive_member path="merlin-sample.bam_5"/> + <has_archive_member path="merlin-sample.bam_5.bai"/> + <has_archive_member path="merlinlastz.maf_6.sorted.bed.gz.tbi"/> + <has_archive_member path="merlin.blastxml_7.gz"/> + <has_archive_member path="dm3test.cool_9.hic"/> + <has_archive_member path="peach-grape-map.paf_10"/> + <has_archive_member path="arabgrapepeach.paf_11"/> + <has_archive_member path="Merlin.fa.gz.fai"/> + <has_archive_member path="dm3.fa.fa.gz.fai"/> + <has_archive_member path="Ppersica1.fa.fa.gz"/> + <has_archive_member path="Ppersica1.fa.fa.gz.fai"/> + <has_archive_member path="Ppersica1.fa.fa.gz.gzi"/> + <has_archive_member path="Vvinifera1.fa.fa.gz.fai"/> + <has_archive_member path="Vvinifera1.fa.fa.gz"/> + <has_archive_member path="Vvinifera1.fa.fa.gz.gzi"/> + <has_archive_member path="config.json"> + <has_json_property_with_text property="name" text="Merlin"/> + </has_archive_member> + </assert_contents> + </output> + </test> + <test> + <param name="autoCollection"> + <collection type="list" name="merlintest"> + <element name="merlin.fa" value="merlin.fa"/> + <element name="merlin.bw" value="bw/merlin.bw"/> + <element name="test-6.bed" value="bed/test-6.bed"/> + </collection> + </param> + <section name="jbgen"> + <param name="zipOut" value="true"/> + <param name="session_name" value="Collection test"/> + <param name="ucol|formcoll" value="collect"/> + </section> + <!-- use the following line to generate a ZIP file to inspect it --> + <!--output name="output" file="jbrowse2_result01.zip" ftype="zip" compare="sim_size"--> + <output name="output" ftype="zip"> + <assert_contents> + <has_archive_member path=".*" n="611" delta="10"/> + <has_archive_member path="index.html"/> + <has_archive_member path="static/media/roboto-latin-400-normal.3f2b9a42f643e62a49b7.woff"/> + <has_archive_member path="manifest.json"> + <has_text text="favicon.ico"/> + <has_text text="background_color"/> + </has_archive_member> + <has_archive_member path="galaxy.xml"> + <is_valid_xml/> + <has_n_elements_with_path path="./metadata/general/session_name" n="1"/> + <element_text_is path="./metadata/general/session_name" text="Collection test"/> + </has_archive_member> + <has_archive_member path="jb2_webserver.py"/> + <has_archive_member path="merlin.fa.fa.gz.gzi"/> + <has_archive_member path="merlin.fa.fa.gz"/> + <has_archive_member path="merlin.fa.fa.gz.fai"/> + <has_archive_member path="merlin.bw"/> + <has_archive_member path="index.html"/> + <has_archive_member path="test-6.bed.gz"/> + <has_archive_member path="test-6.bed.gz.tbi"/> + </assert_contents> + </output> + </test> + <test> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" value="merlin.fa"/> + <param name="genome.ext" value="fasta"/> + <param name="genome.name" value="Merlin"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Default"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="bigwig"/> + <conditional name="useuri"> + <param name="annotation" value="bw/merlin.bw"/> + <param name="insource" value="history"/> + </conditional> + </conditional> + </repeat> + </repeat> + </repeat> + <section name="jbgen"> + <param name="zipOut" value="true"/> + </section> + <output name="output"> + <assert_contents> + <has_archive_member path="galaxy.xml"> + <has_text text="genome path="/> + <has_text text="label="/> + <has_text text="format="/> + <has_text text="metadata"/> + <has_text text="useuri="/> + <has_text text="trackFile path="/> + <has_text text="ext="bigwig""/> + <has_text text="format="bigwig""/> + </has_archive_member> + </assert_contents> + </output> + </test> + <test> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" value="merlin.fa"/> + <param name="genome.ext" value="fasta"/> + <param name="genome.name" value="Merlin"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Default"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="vcf"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" value="vcf/merlin.vcf"/> + </conditional> + </conditional> + </repeat> + </repeat> + </repeat> + <section name="jbgen"> + <param name="zipOut" value="true"/> + </section> + <output name="output"> + <assert_contents> + <has_archive_member path="galaxy.xml"> + <has_text text="genome path="/> + <has_text text="label="/> + <has_text text="format="/> + <has_text text="metadata"/> + <has_text text="useuri="/> + <has_text text="trackFile path="/> + <has_text text="ext="vcf" label="merlin.vcf""/> + </has_archive_member> + </assert_contents> + </output> + </test> + <test> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" value="merlin.fa"/> + <param name="genome.ext" value="fasta"/> + <param name="genome.name" value="Merlin"/> + </conditional> + </repeat> + <section name="jbgen"> + <param name="zipOut" value="true"/> + </section> + <output name="output"> + <assert_contents> + <has_archive_member path="galaxy.xml"> + <has_text text="genome path="/> + <has_text text="label="/> + <has_text text="useuri="/> + <has_text text="metadata"/> + </has_archive_member> + </assert_contents> + </output> + </test> + <test> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" value="merlin.fa"/> + <param name="genome.ext" value="fasta"/> + <param name="genome.name" value="Merlin"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Default"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="vcf"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" value="vcf/merlin.vcf"/> + </conditional> + </conditional> + </repeat> + </repeat> + </repeat> + <section name="jbgen"> + <param name="zipOut" value="true"/> + </section> + <output name="output"> + <assert_contents> + <has_archive_member path="galaxy.xml"> + <has_text text="genome path="/> + <has_text text="label="/> + <has_text text="format="/> + <has_text text="metadata"/> + <has_text text="useuri="/> + <has_text text="trackFile path="/> + <has_text text="ext="vcf" label="merlin.vcf""/> + </has_archive_member> + </assert_contents> + </output> + </test> + <test> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" value="merlin.fa"/> + <param name="genome.ext" value="fasta"/> + <param name="genome.name" value="Merlin"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Default"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" value="gff3/A.gff"/> + </conditional> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="false"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="none"/> + </conditional> + <conditional name="color"> + <param name="color_select" value="automatic"/> + </conditional> + </section> + </repeat> + </repeat> + <repeat name="track_groups"> + <param name="category" value="Ignore Scale"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" value="gff3/1.gff"/> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="false"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="none"/> + <conditional name="color"> + <param name="color_select" value="manual"/> + <param name="style_color" value="#ff00ff"/> + </conditional> + </conditional> + </section> + </conditional> + </repeat> + </repeat> + <repeat name="track_groups"> + <param name="category" value="Scaled Colour"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="insource" value="history"/> + <param name="annotation" value="gff3/C.gff"/> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="false"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="score"/> + <param name="score_scaling" value="linear"/> + <conditional name="score_scales"> + <param name="scale_select" value="automatic"/> + </conditional> + <conditional name="color_scheme"> + <param name="score_scheme" value="opacity"/> + <conditional name="color"> + <param name="color_select" value="automatic"/> + </conditional> + </conditional> + </conditional> + </section> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="annotation" value="gff3/B.gff"/> + <param name="insource" value="history"/> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="false"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="score"/> + <param name="score_scaling" value="linear"/> + <conditional name="score_scales"> + <param name="scale_select" value="automatic"/> + </conditional> + <conditional name="color_scheme"> + <param name="score_scheme" value="opacity"/> + <conditional name="color"> + <param name="color_select" value="manual"/> + <param name="style_color" value="#0000ff"/> + </conditional> + </conditional> + </conditional> + </section> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="annotation" value="gff3/A.gff"/> + <param name="insource" value="history"/> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="false"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="score"/> + <param name="score_scaling" value="linear"/> + <conditional name="score_scales"> + <param name="scale_select" value="manual"/> + <param name="minimum" value="0"/> + <param name="maximum" value="1000"/> + </conditional> + <conditional name="color_scheme"> + <param name="score_scheme" value="opacity"/> + <conditional name="color"> + <param name="color_select" value="automatic"/> + </conditional> + </conditional> + </conditional> + </section> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="annotation" value="gff3/1.gff"/> + <param name="insource" value="history"/> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="false"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="score"/> + <param name="score_scaling" value="linear"/> + <conditional name="score_scales"> + <param name="scale_select" value="manual"/> + <param name="minimum" value="0"/> + <param name="maximum" value="1000"/> + </conditional> + <conditional name="color_scheme"> + <param name="score_scheme" value="opacity"/> + <conditional name="color"> + <param name="color_select" value="manual"/> + <param name="style_color" value="#ff0000"/> + </conditional> + </conditional> + </conditional> + </section> + </conditional> + </repeat> + </repeat> + <repeat name="track_groups"> + <param name="category" value="Realistic"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="annotation" value="gff3/interpro.gff"/> + <param name="insource" value="history"/> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="false"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="none"/> + </conditional> + <conditional name="color"> + <param name="color_select" value="automatic"/> + </conditional> + </section> + </conditional> + </repeat> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="gff"/> + <conditional name="useuri"> + <param name="annotation" value="gff3/2.gff"/> + <param name="insource" value="history"/> + </conditional> + <conditional name="match_part"> + <param name="match_part_select" value="true"/> + <param name="name" value="cDNA_match"/> + </conditional> + <section name="jbcolor_scale"> + <conditional name="color_score"> + <param name="color_score_select" value="none"/> + </conditional> + <conditional name="color"> + <param name="color_select" value="automatic"/> + </conditional> + </section> + </conditional> + </repeat> + </repeat> + </repeat> + <section name="jbgen"> + <param name="zipOut" value="true"/> + </section> + <output name="output"> + <assert_contents> + <has_archive_member path="A.gff"/> + <has_archive_member path="B.gff"/> + <has_archive_member path="C.gff"/> + <has_archive_member path="interpro.gff"/> + <has_archive_member path="1.gff"/> + <has_archive_member path="2.gff"/> + <has_archive_member path="galaxy.xml"> + <is_valid_xml/> + <has_n_elements_with_path path="./metadata/general/session_name" n="1"/> + <element_text_is path="./metadata/general/session_name" text="New JBrowse2 session"/> + </has_archive_member> + </assert_contents> + </output> + </test> + <test> + <repeat name="assemblies"> + <conditional name="reference_genome"> + <param name="genome_type_select" value="history"/> + <param name="genome" value="merlin.fa"/> + </conditional> + <repeat name="track_groups"> + <param name="category" value="Auto Coloured"/> + <repeat name="data_tracks"> + <conditional name="data_format"> + <param name="data_format_select" value="cram"/> + <param name="annotation" value="cram/merlin-sample.cram"/> + </conditional> + </repeat> + </repeat> + </repeat> + <section name="jbgen"> + <param name="zipOut" value="true"/> + </section> + <output name="output" ftype="zip"> + <assert_contents> + <!-- roughly 590 files in an archive --> + <has_archive_member path=".*" n="608" delta="30"/> + <has_archive_member path="index.html"/> + <has_archive_member path="static/media/roboto-latin-400-normal.3f2b9a42f643e62a49b7.woff"/> + <has_archive_member path="manifest.json"> + <has_text text="favicon.ico"/> + <has_text text="background_color"/> + </has_archive_member> + <has_archive_member path="merlin.fa.fa.gz.fai"/> + <has_archive_member path="galaxy.xml"> + <is_valid_xml/> + <has_n_elements_with_path path="./metadata/general/session_name" n="1"/> + <element_text_is path="./metadata/general/session_name" text="New JBrowse2 session"/> + </has_archive_member> + </assert_contents> + </output> + </test> + </tests> + <help><![CDATA[ + +JBrowse2-in-Galaxy +================== + +JBrowse2-in-Galaxy offers a highly configurable, workflow-compatible Genome viewer. + +Compared to JBrowse1-in-Galaxy, there is no support for alternative codons for unusual genomes, +and detailed track styling is not yet implemented. Please contact us if you are missing features. + +Use and local viewing +===================== + + +A JBrowse2 history item can be opened by viewing it (the "eye" icon). + +The same browser data and setup can also be downloaded as a compressed zip archive by clicking the download ("floppy disk") icon in the history. +This can be shared and viewed without Galaxy. + +A replacement application to serve the browser is required without Galaxy. A local python web server can be started using a script included in each archive, +assuming that Python3 is already working on your desktop - if not you will have to install it first. Unzip the archive (*unzip [filename].zip*) and change +directory to the first level in that zip archive. It contains a file named *jb2_webserver.py* + +With python3 installed, + +*python3 jb2_webserver.py* + +will serve the unarchived JBrowse2 configuration from the same directory as the python script automatically. If a new browser window does not open, +but the script appears to be running, try pointing your web browser to the default of *localhost:8080* + +Overview +-------- + +JBrowse is a fast, embeddable genome browser built completely with +JavaScript and HTML5. + +The JBrowse-in-Galaxy (JiG) tool was written to help build complex +JBrowse installations straight from Galaxy. It allows you to build up a JBrowse instance without worrying +about how to run the command line tools to format your data, and which +options need to be supplied and where. + +Options +------- + +**Reference or Assembly** + +Choose either a built-in or select one from your history. + +Track coordinates and contig names *must* match this reference precisely +or they will not display. + +**Track Groups** represent a set of tracks in a single category. + +Annotation Tracks +----------------- + +MAF +~~~ + +For history references, the name of the reference genome dataset in your history is very important for MAF tracks, +because it becomes the "dbkey" for that reference. So, it must be exactly the same as the genome name (dbkey) used +when making the MAF, typically in the Lastz tool. + +If you used "hg38" as the reference in Lastz, that is exactly what the reference fasta should be named for any MAF track. +Change it using the "pencil" icon on the reference data in your history. Any other name, such as "hg38.fasta" +will cause the MAF track to show no data since there are no matches with that reference dbkey. + + +GFF3/BED +~~~~~~~~ + +Standard feature tracks. They usually highlight genes, mRNAs and other features of interest along a genomic region. + +When these contain tens of millions of features, such as repeat regions from a VGP assembly, displaying one at a time leads +to extremely slow loading times when a large region is in view, unless the "LinearPileupDisplay" display option is +selected for that track in the styling options section. The default is LinearBasicDisplay, which shows all details and works +well for relatively sparse bed files. A better option is to make a bigwig track using a set of windows based on the +lengths of each assembly or reference contig. + +BAM Pileups +~~~~~~~~~~~ + +We support BAM files and can automatically generate SNP tracks based on +that bam data. + + +BlastXML +~~~~~~~~ + +JiG now supports both blastn and blastp datasets. JiG internally uses a +blastXML to gapped GFF3 tool to convert your blastxml datasets into a +format amenable to visualization in JBrowse. This tool is also +available separately from the IUC on the toolshed. + +**Minimum Gap Size** reflects how long a gap must be before it becomes a +real gap in the processed gff3 file. In the picture above, various sizes +of gaps can be seen. If the minimum gap size was set much higher, say +100nt, many of the smaller gaps would disappear, and the features on +both sides would be merged into one, longer feature. This setting is +inversely proportional to runtime and output file size. *Do not set this +to a low value for large datasets*. By setting this number lower, you +will have extremely large outputs and extremely long runtimes. The +default was configured based off of the author's experience, but the +author only works on small viruses. It is *strongly* recommended that +you filter your blast results before display, e.g. picking out the top +10 hits or so. + +**Protein blast search** option merely informs underlying tools that +they should adjust feature locations by 3x. + + +]]></help> + <expand macro="citations"/> + <expand macro="creators"/> +</tool>
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/macros.xml Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,481 @@ +<macros> + <token name="@TOOL_VERSION@">2.11.1</token> + <xml name="edamInc"> + <edam_topics> + <edam_topic>topic_3307</edam_topic> + <edam_topic>topic_0092</edam_topic> + </edam_topics> + <edam_operations> + <edam_operation>operation_0573</edam_operation> + <edam_operation>operation_0564</edam_operation> + </edam_operations> + </xml> + <xml name="requirements"> + <requirements> + <requirement type="package" version="@TOOL_VERSION@">jbrowse2</requirement> + <requirement type="package" version="1.82">biopython</requirement> + <requirement type="package" version="0.7.1">bcbio-gff</requirement> + <requirement type="package" version="1.19">samtools</requirement> + <requirement type="package" version="6.0.1">pyyaml</requirement> + <requirement type="package" version="1.11">tabix</requirement> + <requirement type="package" version="4.6.0">findutils</requirement> + <requirement type="package" version="0.0.12-0">hictk</requirement> + <requirement type="package" version="3.0">zip</requirement> + <yield/> + </requirements> + </xml> + <token name="@DATA_DIR@">\$GALAXY_JBROWSE_SHARED_DIR</token> + <token name="@VERSION_SUFFIX@">0</token> + <xml name="creators"> + <creator> + <person givenName="Helena" familyName="Rasche" url="https://github.com/hexylena"/> + <person givenName="Anthony" familyName="Bretaudeau" url="https://github.com/abretaud"/> + <person givenName="Ross" familyName="Lazarus" url="https://github.com/fubar2/"/> + </creator> + </xml> + <xml name="genome_selector" token_help="" token_label="Fasta sequences" token_optional="False"> + <conditional name="reference_genome"> + <param name="genome_type_select" type="select" label="Reference genome to display" help="Built-in references"> + <option selected="True" value="indexed">Use a built-in genome</option> + <option value="history">Use a genome from history</option> + </param> + <when value="indexed"> + <param name="genomes" type="select" optional="@OPTIONAL@" label="@LABEL@" help="@HELP@"> + <options from_data_table="all_fasta"> + <filter column="2" type="sort_by"/> + <validator message="No genomes are available for the selected input dataset" type="no_options"/> + </options> + </param> + </when> + <when value="history"> + <param name="genomes" type="data" format="fasta" optional="@OPTIONAL@" label="@LABEL@" help="@HELP@" multiple="True"/> + </when> + </conditional> + </xml> + <xml name="auto_manual_tk" token_cond_label="Color" token_cond_name="color" token_select_label="Color Specification" token_select_name="color_select" token_automatic_label="Automatically selected" token_manual_label="Manual Color Selection"> + <conditional name="@COND_NAME@" label="@COND_LABEL@"> + <param name="@SELECT_NAME@" type="select" label="@SELECT_LABEL@"> + <option value="automatic" selected="true">@AUTOMATIC_LABEL@</option> + <option value="manual">@MANUAL_LABEL@</option> + </param> + <when value="automatic"> + </when> + <when value="manual"> + <yield/> + </when> + </conditional> + </xml> + <xml name="jb_color" token_label="JBrowse style.color" token_name="style_color" token_value="goldenrod" token_help="Basic color of features. Most glyphs interpret this as the fill color of the rectangle they draw. Color syntax is the same as that used for CSS"> + <param name="@NAME@" type="color" value="@VALUE@" label="@LABEL@" help="@HELP@"> + <sanitizer> + <valid initial="string.letters,string.digits"> + <add value="#"/> + </valid> + </sanitizer> + </param> + </xml> + <xml name="auto_color" token_cond_label="Color" token_cond_name="color" token_select_label="Color Selection" token_select_name="color_select" token_automatic_label="Automatically selected" token_manual_label="Manual Color Selection"> + <expand macro="auto_manual_tk" cond_label="@COND_LABEL@" cond_name="@COND_NAME@" select_label="@SELECT_LABEL@" select_name="@SELECT_NAME@" automatic_label="@AUTOMATIC_LABEL@" manual_label="@MANUAL_LABEL@"> + <expand macro="jb_color"/> + <yield/> + </expand> + </xml> + <xml name="brewer_scheme"> + <param name="brewer_scheme" type="select" label="Brewer color Scheme"> + <option value="BrBg">BrBg: Brown - Blue Green</option> + <option value="PiYg">PiYg: Pink - Yellow Green</option> + <option value="PRGn">PRGn: Purple Red - Green</option> + <option value="PuOr">PuOr: Purple - Orange</option> + <option value="RdBu" selected="true">RdBu: Red - Blue</option> + <option value="RdGy">RdGy: Red - Gray</option> + <option value="RdYlBu">RdYlBu: Red - Yellow - Blue</option> + <option value="RdYlGn">RdYlBu: Red - Yellow - Green</option> + <option value="Spectral">Spectral</option> + </param> + </xml> + <xml name="general_options"> + <section name="jbgen" title="General JBrowse Options [Advanced]" expanded="false"> + <conditional name="ucol"> + <param name="formcoll" type="select" label="Convert a collection of track files rather than filling in the tool form" help="A collection of bam/vcf and other track types will be converted into a JBrowse2 automatically"> + <option value="form" selected="true">Fill in the tool form to specify tracks for the output JBrowse2</option> + <option value="collect">Convert a collection of suitable track files</option> + </param> + <when value="collect"> + <param name="autoCollection" type="data_collection" label="Collection of bed, bam and other track files"/> + </when> + <when value="form"/> + </conditional> + <param name="zipOut" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Create a zip archive for downloading rather than viewing" help="Default is to make an interactive browser appear when the 'eye' icon is activated"/> + <param name="defaultLocation" type="text" value="" label="Subset to display to new users" help="Initial subset to be shown for users who have never visited the browser before. Example: 'ctgA:1234..5678"/> + <param name="session_name" type="text" value="New JBrowse2 session" label="Session name" help="Displayed at the top of the window"/> + <param name="enableAnalytics" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Enable analytics" help="Will send usage data to Google Analytics, see https://github.com/GMOD/jbrowse-components/issues/1166"/> + <param name="primary_color" type="color" value="#0D233F" label="Primary color"> + <sanitizer> + <valid initial="string.ascii_letters,string.digits"> + <add value="#"/> + </valid> + </sanitizer> + </param> + <param name="secondary_color" type="color" value="#721E63" label="Secondary color"> + <sanitizer> + <valid initial="string.ascii_letters,string.digits"> + <add value="#"/> + </valid> + </sanitizer> + </param> + <param name="tertiary_color" type="color" value="#135560" label="Tertiary color"> + <sanitizer> + <valid initial="string.ascii_letters,string.digits"> + <add value="#"/> + </valid> + </sanitizer> + </param> + <param name="quaternary_color" type="color" value="#FFB11D" label="Quaternary color"> + <sanitizer> + <valid initial="string.ascii_letters,string.digits"> + <add value="#"/> + </valid> + </sanitizer> + </param> + <param name="font_size" type="integer" value="10" label="Font size"/> + </section> + </xml> + <xml name="color_selection_minmax"> + <section name="jbcolor" title="JBrowse Color Options [Advanced]" expanded="false"> + <!-- Abuse auto/manual for bicolor pivot. Means we'll have to handle the + auto case as well, but may be safe to just say "brewer colors? Pff, + red/blue" --> + <expand macro="auto_manual_tk" token_cond_label="Color" token_cond_name="color" token_select_label="Color Selection" token_select_name="color_select" token_automatic_label="Automatically selected" token_manual_label="Manual Color Selection"> + <expand macro="jb_color" label="JBrowse style.pos_color" name="style_pos_color" value="blue" help="CSS color, default 'blue'. When drawing bicolor plots, the fill color to use for values that are above the pivot point."/> + <expand macro="jb_color" label="JBrowse style.neg_color" name="style_neg_color" value="red" help=" CSS color, default 'red'. When drawing bicolor plots, the fill color to use for values that are below the pivot point."/> + </expand> + <conditional name="bicolor_pivot" label="Bicolor Pivot"> + <param name="bicolor_pivot_select" type="select" label="Bicolor Pivot"> + <option value="zero" selected="true">Zero</option> + <option value="mean">Mean</option> + <option value="custom">Custom Value</option> + </param> + <when value="zero"/> + <when value="mean"/> + <when value="custom"> + <param name="pivot_point" type="float" value="0.0" label="JBrowse style.bicolor_pivot" help="Where to change from pos_color to neg_color when drawing bicolor plots."/> + </when> + </conditional> + </section> + </xml> + <xml name="color_selection" token_scaling_lin_select="true" token_scaling_log_select="false"> + <section name="jbcolor_scale" title="JBrowse Feature Score Scaling & Coloring Options [Advanced]" expanded="false"> + <conditional name="color_score" label="JBrowse style.color & Score relationship"> + <param name="color_score_select" type="select" label="Color Score Algorithm" help="How to color the features. If it is based on score, then features with a score attribute anywhere in their hierachy will have their color affected by the score. If you choose to ignore the score, then you'll be able to select a single solid color for every feature in the track"> + <option value="score">Based on score</option> + <option value="none" selected="true">Ignore score</option> + </param> + <when value="none"> + <!-- When no scaling is done, no scores available, then just let the + user choose a base color for the track --> + <expand macro="auto_color"/> + </when> + <when value="score"> + <!-- Scaling --> + <param name="score_scaling" type="select" label="JBrowse style.color function's score scaling" help="How should the colors be distributed across the values? For blast results which distributes scores on the scale of approximately [1e-500, 10], it makes sense to request a logarithmic scaling of the color values. Logarithmic is indeed the default for blast. However other analysis methods may produce scores on ranges such as [0, 100] where a linear scale would be more appropriate for color distribution."> + <option value="linear" selected="@SCALING_LIN_SELECT@">Linear scaling</option> + <option value="logarithmic" selected="false">Logarithmic scaling</option> + <option value="blast" selected="@SCALING_LOG_SELECT@">Blast scaling</option> + </param> + <!-- Scaling Bounds --> + <conditional name="score_scales" label="Minimum/Maximum values for track scores"> + <param name="scale_select" type="select" label="How should minimum and maximum values be determined for the scores of the features"> + <option value="automatic" selected="true">Automatically determined</option> + <option value="manual">Manually specify minimum and maximum expected scores for the feature track</option> + </param> + <when value="automatic"/> + <when value="manual"> + <param name="minimum" type="integer" value="0" label="Minimum expected score"/> + <param name="maximum" type="integer" value="100" label="Maximum expected score"/> + </when> + </conditional> + <!-- Scale color --> + <conditional name="color_scheme" label="Color Scheme for scored features"> + <param name="score_scheme" type="select" label="JBrowse style.color function's color scheme for scored values"> + <option value="opacity">Opacity (high scores = 1.0 opacity)</option> + <!--<option value="brewer">Brewer Color Schemes</option>--> + </param> + <when value="opacity"> + <!-- Single color selection mode --> + <expand macro="auto_color"/> + </when> + <!--<when value="brewer">--> + <!--[> Brewer continuum selection <]--> + <!--<expand macro="brewer_scheme" />--> + <!--</when>--> + </conditional> + </when> + </conditional> + </section> + </xml> + <xml name="track_visibility"> + <param name="track_visibility" type="select" label="Track Visibility"> + <option value="default_off">Off when browser opens</option> + <option value="default_on" selected="true">On when browser opens</option> + </param> + </xml> + <xml name="track_styling_linear"> + <param name="show_labels" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Show labels"/> + <param name="show_descriptions" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Show descriptions"/> + <param name="display_mode" type="select" label="Display mode"> + <option value="normal" selected="true">normal</option> + <option value="compact">compact</option> + <option value="reducedRepresentation">reducedRepresentation</option> + <option value="collapse">collapse</option> + </param> + <param name="max_height" type="integer" value="600" label="Max height" help="Maximum height that the track is permitted to reach in pixels."/> + </xml> + <xml name="track_styling_vgp"> + <section name="jbstyle" title="JBrowse Styling Options [Advanced]" expanded="false"> + <param name="show_labels" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Show labels"/> + <param name="show_descriptions" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Show descriptions"/> + <param name="display" type="select" label="Display style" help="How the track will be displayed by default"> + <option value="LinearBasicDisplay" selected="true">LinearBasicDisplay</option> + <option value="LinearPileupDisplay">LinearPileupDisplay - good for dense tracks at scale</option> + <option value="LinearArcDisplay">LinearArcDisplay</option> + </param> + </section> + </xml> + <xml name="track_styling_feature"> + <section name="jbstyle" title="JBrowse Styling Options [Advanced]" expanded="false"> + <conditional name="track_style"> + <param name="display" type="select" label="Display style" help="How the track will be displayed by default"> + <option value="LinearBasicDisplay" selected="true">LinearBasicDisplay</option> + <option value="LinearPileupDisplay">LinearPileupDisplay - good for dense tracks at scale</option> + <option value="LinearArcDisplay">LinearArcDisplay</option> + </param> + <when value="LinearPileupDisplay"> + <expand macro="track_styling_linear"/> + </when> + <when value="LinearBasicDisplay"> + <expand macro="track_styling_linear"/> + <!-- examples: https://github.com/GMOD/jbrowse-components/discussions/2729 + and https://github.com/GMOD/jbrowse-components/blob/main/plugins/svg/src/SvgFeatureRenderer/configSchema.ts#L41 --> + <param name="label" type="text" value="jexl:get(feature,'name') || get(feature,'id')" label="Features label" help="See https://jbrowse.org/jb2/docs/config_guide/#configuration-callbacks for syntax"> + <sanitizer> + <valid initial="default"> + <add value="|"/> + <add value="{"/> + <add value="}"/> + <add value="!"/> + <add value="?"/> + <add value="+"/> + <add value="="/> + <add value="'"/> + </valid> + </sanitizer> + </param> + <param name="description" type="text" value="jexl:get(feature,'note') || get(feature,'description')" label="Features description" help="See https://jbrowse.org/jb2/docs/config_guide/#configuration-callbacks for syntax"> + <sanitizer> + <valid initial="default"> + <add value="|"/> + <add value="{"/> + <add value="}"/> + <add value="!"/> + <add value="?"/> + <add value="+"/> + <add value="="/> + <add value="'"/> + </valid> + </sanitizer> + </param> + </when> + <when value="LinearArcDisplay"/> + </conditional> + </section> + </xml> + <xml name="track_styling_xam"> + <section name="jbstyle" title="JBrowse Styling Options [Advanced]" expanded="false"> + <conditional name="track_style"> + <param name="display" type="select" label="Display style" help="How the track will be displayed by default"> + <option value="LinearAlignmentsDisplay" selected="true">LinearAlignmentsDisplay</option> + <option value="LinearPileupDisplay">LinearPileupDisplay</option> + <option value="LinearSNPCoverageDisplay">LinearSNPCoverageDisplay</option> + <option value="LinearReadArcsDisplay">LinearReadArcsDisplay</option> + <option value="LinearReadCloudDisplay">LinearReadCloudDisplay</option> + </param> + <!-- TODO check if possible to handle more options than just the display style--> + <when value="LinearAlignmentsDisplay"/> + <when value="LinearPileupDisplay"/> + <when value="LinearSNPCoverageDisplay"> + <param name="displayCrossHatches" type="boolean" truevalue="true" falsevalue="false" checked="true" label="Draw cross hatches"/> + <param name="minScore" type="integer" value="" optional="true" label="Min score"/> + <param name="maxScore" type="integer" value="" optional="true" label="Max score"/> + </when> + <when value="LinearReadArcsDisplay"/> + <when value="LinearReadCloudDisplay"/> + </conditional> + </section> + </xml> + <xml name="track_styling_vcf"> + <section name="jbstyle" title="JBrowse Styling Options [Advanced]" expanded="false"> + <conditional name="track_style"> + <param name="display" type="select" label="Display style" help="How the track will be displayed by default"> + <option value="LinearVariantDisplay" selected="true">LinearVariantDisplay</option> + <option value="LinearPairedArcDisplay">LinearPairedArcDisplay</option> + </param> + <when value="LinearVariantDisplay"> + <expand macro="track_styling_linear"/> + </when> + <when value="LinearPairedArcDisplay"/> + </conditional> + </section> + </xml> + <xml name="track_styling_bigwig"> + <section name="jbstyle" title="JBrowse Styling Options [Advanced]" expanded="false"> + <conditional name="track_style"> + <param name="display" type="select" label="Display style" help="How the track will be displayed by default"> + <option value="LinearWiggleDisplay" selected="true">LinearWiggleDisplay</option> + </param> + <when value="LinearWiggleDisplay"> + <param name="autoscale" type="select" label="Autoscale type"> + <option value="local" selected="true">Local</option> + <option value="global">Global</option> + <option value="globalsd">Global ± 3σ</option> + <option value="localsd">Local ± 3σ</option> + </param> + <param name="resolution" type="integer" value="1" label="Resolution"/> + <param name="summaryScoreMode" type="select" label="Autoscale type"> + <option value="max">Max</option> + <option value="min">Min</option> + <option value="avg">Avg</option> + <option value="whiskers" selected="true">Whiskers (combines all three)</option> + </param> + <param name="filled" type="boolean" truevalue="true" falsevalue="false" checked="true" label="Fill in histogram"/> + <param name="scaleType" type="select" label="Autoscale type"> + <option value="linear" selected="true">Linear</option> + <option value="log">Log</option> + </param> + <param name="displayCrossHatches" type="boolean" truevalue="true" falsevalue="false" checked="true" label="Draw cross hatches"/> + <param name="minScore" type="integer" value="" optional="true" label="Min score"/> + <param name="maxScore" type="integer" value="" optional="true" label="Max score"/> + </when> + </conditional> + </section> + </xml> + <xml name="track_menu"> + <section name="jbmenu" title="JBrowse Contextual Menu options [Advanced]" expanded="false"> + <repeat name="track_menu" title="Track Menu"> + <param name="menu_action" type="select" label="Menu action" help="Action performed when user clicks on the menu"> + <option value="iframeDialog" selected="true">iframeDialog: causes the given url to be opened in a popup dialog box within JBrowse, in an iframe element.</option> + <option value="newWindow">newWindow: causes the given url to be opened in a new browser window.</option> + <option value="navigateTo">navigateTo: opens the given url in the same browser window, navigating the user away from JBrowse.</option> + </param> + <param name="menu_label" type="text" label="Menu label" help="Will be displayed in the contextual menu on each feature ({name}, {id}, {type}, {start}, {end}, {strand} variables will be interpreted)"> + <expand macro="menu_sanitize"/> + </param> + <param name="menu_title" type="text" label="Menu title" help="Will be displayed in the popup title bar if displayed ({id}, {type}, {start}, {end}, {strand} variables will be interpreted)"> + <expand macro="menu_sanitize"/> + </param> + <param name="menu_url" type="text" label="Menu url" help="Destination URL ({name}, {id}, {type}, {start}, {end}, {strand} variables will be interpreted)"> + <expand macro="menu_sanitize"/> + </param> + <param name="menu_icon" type="select" label="Menu icon" help="Icon to display next to menu label"> + <option value="dijitIconBookmark" selected="true">Bookmark</option> + <option value="dijitIconSave">Save</option> + <option value="dijitIconPrint">Print</option> + <option value="dijitIconCut">Cut</option> + <option value="dijitIconCopy">Copy</option> + <option value="dijitIconClear">Clear</option> + <option value="dijitIconDelete">Delete</option> + <option value="dijitIconUndo">Undo</option> + <option value="dijitIconEdit">Edit</option> + <option value="dijitIconNewTask">New Task</option> + <option value="dijitIconEditTask">Edit Task</option> + <option value="dijitIconEditProperty">Edit Property</option> + <option value="dijitIconTask">Task</option> + <option value="dijitIconFilter">Filter</option> + <option value="dijitIconConfigure">Configure</option> + <option value="dijitIconSearch">Search</option> + <option value="dijitIconApplication">Application</option> + <option value="dijitIconChart">Chart</option> + <option value="dijitIconConnector">Connector</option> + <option value="dijitIconDatabase">Database</option> + <option value="dijitIconDocuments">Documents</option> + <option value="dijitIconMail">Mail</option> + <option value="dijitLeaf">Leaf</option> + <option value="dijitIconFile">File</option> + <option value="dijitIconFunction">Function</option> + <option value="dijitIconKey">Key</option> + <option value="dijitIconPackage">Package</option> + <option value="dijitIconSample">Sample</option> + <option value="dijitIconTable">Table</option> + <option value="dijitIconUsers">Users</option> + <option value="dijitIconFolderClosed">Folder Closed</option> + <option value="dijitIconFolderOpen">Folder Open</option> + <option value="dijitIconError">Error</option> + </param> + </repeat> + </section> + </xml> + <xml name="menu_sanitize"> + <sanitizer> + <valid> + <add value="{"/> + <add value="}"/> + <add value="!"/> + <add value="?"/> + <add value="&"/> + <add value="+"/> + <add value="="/> + <add value="'"/> + <add value="""/> + </valid> + </sanitizer> + </xml> + <xml name="input_conditional" token_label="Track Data" token_format="data"> + <conditional name="useuri"> + <param name="insource" type="select" label="Define track data as a history file or an internet URI" help="A public URI implies that all the associated tabix files are also in place. They are created for history files"> + <option value="history" selected="true">Track data from a history file</option> + <option value="uri">Tabix data URI - index files must be available at corresponding URI</option> + </param> + <when value="history"> + <param name="annotation" type="data" format="@FORMAT@" optional="true" label="@LABEL@" multiple="True"/> + </when> + <when value="uri"> + <param name="annouri" type="text" label="@LABEL@"/> + <param name="annoname" type="text" label="Short name for track display"> + <sanitizer invalid_char="_"> + <valid initial="string.printable"> + <remove value="'"/> + </valid> + </sanitizer> + </param> + </when> + </conditional> + </xml> + <xml name="pafref_conditional" token_label="Track Data" token_format="data"> + <conditional name="pafuseuri"> + <param name="insource" type="select" label="PAF reference data from a history file or an internet URI?" help="Multiple references can be used for mashmap PAF. A URI source requires all the associated tabix index files to be in place."> + <option value="history" selected="true">PAF reference comparison genome from a history file</option> + <option value="uri">PAF reference comparison genome tabix .gz URI with index files at the corresponding URIs</option> + </param> + <when value="history"> + <param name="annotation" type="data" format="@FORMAT@" label="@LABEL@" help="Add all PAF reference comparison genomes used to make the PAF" multiple="true"/> + </when> + <when value="uri"> + <repeat name="refuri" title="PAF reference comparison genome URI" min="1"> + <param name="annotation" type="text" label="@LABEL@"/> + <param name="annoname" type="text" label="Short name for this reference" help="Short names take less track space"> + <sanitizer invalid_char="_"> + <valid initial="string.printable"> + <remove value="'"/> + </valid> + </sanitizer> + </param> + </repeat> + </when> + </conditional> + </xml> + <xml name="citations"> + <citations> + <citation type="doi">10.1186/s13059-016-0924-1</citation> + <citation type="doi">10.1101/gr.094607.109</citation> + </citations> + </xml> +</macros>
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/maf2bed.py Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,44 @@ +#!/usr/bin/env python +# painfully converted from b0rken perl from +# https://unpkg.com/browse/jbrowse-plugin-mafviewer@1.0.6/dist/ +# license is Apache2_license.txt included here + +import sys + +id = 0 +buffer = '' +start = 0 +end = 0 +score = 0 +chrom = '' + +db = "%s." % sys.argv[1] +# Read input from stdin +for line in sys.stdin: + line = line.strip() + if not line: + continue + + line = line.split() + if line[0] == 's' and line[1].startswith(db): + chrom = line[1] + chrom = chrom.replace(db, '') + start = int(line[2]) + end = int(line[2]) + int(line[3]) + line = line[1:] + line = ':'.join(line) + temp = line + buffer = temp if buffer == '' else f"{buffer},{temp}" + elif line[0] == 'a': + score = int(line[1].split('=')[1]) + if id > 0: + sys.stdout.write('\t'.join([chrom, '%d' % start, '%d' % end, f"{sys.argv[1]}_{id}", '%d' % score, buffer]) + '\n') + id += 1 + buffer = '' + elif line[0] == 's': + line = line[1:] + line = ':'.join(line) + temp = line + buffer = temp if buffer == '' else f"{buffer},{temp}" + +sys.stdout.write('\t'.join([chrom, '%d' % start, '%d' % end, f"{sys.argv[1]}_{id}", '%d' % score, buffer]) + '\n')
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/readme.rst Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,116 @@ +JBrowse2 in Galaxy +================== + + JBrowse2 is a fast, embeddable genome browser built completely with + JavaScript and HTML5 + +Added April 2024: +Multiple independent assembly/track sets in the same browser now working +Tests are now far more convincing using the zip outputs to check for files +Automated collection generation is really cool - TODO: need visibility for each track +maf and blastxml are working for test files at least - not yet properly tested. +cool/mcool are converted to hic +vcf, cram, bam, bed, gff, bigwig, hic, cool, blastxml, maf, paf tracks. + + +Added Mar 26: +Testing history added with tests of the 8 simple formats +hic and paf need small tests but work with big ones... + +Added Mar 8: + - URI as data source for tracks + - Autogenerate from a collection now builtin to advanced options - ignores form + - potentially thousands of tracks without filling in any form... + - Cram now working + - Multigenome pafs from mashmap working properly.. + - TODO reuse code for references for paf references to allow URI and builtin genomes + +Added Mar 3: + - optional tracks at last for JB2 + - dotted noodles from optional inputs now ignored without harm + - autogenJB2. + - Takes a collection of bam/vcf etc and turns them into tracks in a normal JB2 history item. + - Uses the JbrowseConnector, but the XML tool drives it from the contents of the collection. + - Collection can be built from any source or with optional noodles in a WF + - Produces a JB2 with all available tracks. + - Argparse command line with repeats for tracks and references + - so works outside Galaxy to process directories or s3 buckets. + + +Added Feb 3: cool/mcool -> hic + +.. image:: dm4_in_jb2.png + +Other tracks + +.. image:: jb2_samplerMay5.png + +JBrowse2 is a fast, embeddable genome browser built completely with +JavaScript and HTML5 + +Makes an ideal fit with Galaxy, especially for use as a +workflow summary. E.g. annotate a genome, then visualise all of the +associated datasets as an interactive HTML page. This tool MUST be whitelisted +(or ``sanitize_all_html=False`` in galaxy.ini) to function correctly. +The built-in Galaxy gunicorn server does not support byte range requests, so this tool must be proxied by nginx +or another web server, correctly configured to support range requests. A tiny web server is bundled +with each JBrowse2 archive - see below. + +Installation +============ + +This wrapper is normally installed by a server administrator from the Galaxy Tool Shed IUC JBrowse2 repository. + +Local display +============= + +Each JBrowse2 history item can be downloaded ("floppy disk" icon) to your local disk. There, it can be unzipped into a new directory. +That directory includes a python script, *jb2_webserver.py* that will run a local web server able to serve byte range requests, +giving the same view as seen when viewed from the Galaxy history. + +From the newly unzipped directory where that file can be found, and with Python3 installed and working, + +`python3 jb2_webserver.py` + +will open the preconfigured browser using the default web browser application. + +That webserver code is separately under the Apache 2 license copied in this repository. See below for this code's MIT license. + +History +======= + +- 2.10.0+galaxy2 + + - UPDATED existing JBrowse1.16.11 code to JBrowse 2.10.0 + - was working well enough for VGP when previous PR discovered + - too late to backport all the fixes + - working default session and some other ideas copied instead. + - seems to work well with defaults. + - need to document and implement track settings by running the browser locally. + - works well enough to be useful in workflows such as TreeValGal. + - JB2 seems to set defaults wisely. + - not yet ideal for users who need fine grained track control. + - synteny (paf + reference) now working + - rehomed at https://github.com/usegalaxy-eu/temporary-tools/tree/master/jbrowse2 while IUC reviews are slowly sorted out. + + +Wrapper License (MIT/BSD Style) +=============================== + +Permission to use, copy, modify, and distribute this software and its +documentation with or without modifications and for any purpose and +without fee is hereby granted, provided that any copyright notices +appear in all copies and that both those copyright notices and this +permission notice appear in supporting documentation, and that the names +of the contributors or copyright holders not be used in advertising or +publicity pertaining to distribution of the software without specific +prior permission. + +THE CONTRIBUTORS AND COPYRIGHT HOLDERS OF THIS SOFTWARE DISCLAIM ALL +WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED +WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL THE +CONTRIBUTORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY SPECIAL, INDIRECT OR +CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF +USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR +OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR +PERFORMANCE OF THIS SOFTWARE.
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/all_fasta.loc Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,19 @@ +#This file lists the locations and dbkeys of all the fasta files +#under the "genome" directory (a directory that contains a directory +#for each build). The script extract_fasta.py will generate the file +#all_fasta.loc. This file has the format (white space characters are +#TAB characters): +# +#<unique_build_id> <dbkey> <display_name> <file_path> +# +#So, all_fasta.loc could look something like this: +# +#apiMel3 apiMel3 Honeybee (Apis mellifera): apiMel3 /path/to/genome/apiMel3/apiMel3.fa +#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /path/to/genome/hg19/hg19canon.fa +#hg19full hg19 Human (Homo sapiens): hg19 Full /path/to/genome/hg19/hg19full.fa +# +#Your all_fasta.loc file should contain an entry for each individual +#fasta file. So there will be multiple fasta files for each build, +#such as with hg19 above. +# +merlin merlin Merlin ${__HERE__}/merlin.fa
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/bed/test-3.bed Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,3 @@ +Merlin 49 1452 +Merlin 1457 2557 +Merlin 2557 3630
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/bed/test-6.bed Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,3 @@ +Merlin 49 1452 chromosomal_replication_initiator_protein_DnaA 0 + +Merlin 1457 2557 DNA_polymerase_III_subunit_beta 0 + +Merlin 2557 3630 DNA_replication_and_repair_protein_RecF 0 +
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/blastxml/blast-gene1.xml Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,126 @@ +<?xml version="1.0"?> +<!DOCTYPE BlastOutput PUBLIC "-//NCBI//NCBI BlastOutput/EN" "http://www.ncbi.nlm.nih.gov/dtd/NCBI_BlastOutput.dtd"> +<BlastOutput> + <BlastOutput_program>blastp</BlastOutput_program> + <BlastOutput_version>BLASTP 2.2.28+</BlastOutput_version> + <BlastOutput_reference>Stephen F. Altschul, Thomas L. Madden, Alejandro A. Sch&auml;ffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.</BlastOutput_reference> + <BlastOutput_db>/usr/local/syncdb/community/nr/nr</BlastOutput_db> + <BlastOutput_query-ID>Query_1</BlastOutput_query-ID> + <BlastOutput_query-def>Merlin_1</BlastOutput_query-def> + <BlastOutput_query-len>229</BlastOutput_query-len> + <BlastOutput_param> + <Parameters> + <Parameters_matrix>BLOSUM62</Parameters_matrix> + <Parameters_expect>0.001</Parameters_expect> + <Parameters_gap-open>11</Parameters_gap-open> + <Parameters_gap-extend>1</Parameters_gap-extend> + <Parameters_filter>F</Parameters_filter> + </Parameters> + </BlastOutput_param> +<BlastOutput_iterations> +<Iteration> + <Iteration_iter-num>1</Iteration_iter-num> + <Iteration_query-ID>Query_1</Iteration_query-ID> + <Iteration_query-def>Merlin_1</Iteration_query-def> + <Iteration_query-len>229</Iteration_query-len> +<Iteration_hits> +<Hit> + <Hit_num>1</Hit_num> + <Hit_id>gi|422934611|ref|YP_007004572.1|</Hit_id> + <Hit_def>hypothetical protein [Enterobacteria phage ime09] >gi|339791394|gb|AEK12451.1| hypothetical protein [Enterobacteria phage ime09]</Hit_def> + <Hit_accession>YP_007004572</Hit_accession> + <Hit_len>685</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>197.593</Hsp_bit-score> + <Hsp_score>501</Hsp_score> + <Hsp_evalue>3.74548e-55</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>229</Hsp_query-to> + <Hsp_hit-from>474</Hsp_hit-from> + <Hsp_hit-to>684</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>106</Hsp_identity> + <Hsp_positive>154</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>230</Hsp_align-len> + <Hsp_qseq>LDKGTLLYRGQKLDLPTFEHNAENKLFYFRNYVSTSLKPLIFGEFGRMFMALDDDTTIYTAETPDDYNRFANPEDIIDIGATQKDSFDDNNNDGTSINIGKQVNLGFVISGAENVRVIVPGSLTEYPEEAEVILPRGTLLKINKITTQVDKRS--NKFMVEGSIVPPSEQIDESVEIYDGDLFMETGEVVKLSGFMQFVNESAYDEEQNQMAAEILSGFLDIDDMPRKFR</Hsp_qseq> + <Hsp_hseq>LPPGTTLYRGQEVTFKTLRHNIENKMFYFKNFVSTSLKPNIFGEHGKNYMALDDSGAVFSGEGEGS----VDAEDLMHMGSHSAYANED-----------AETSVGMVIKGAERIKVIVPGHLSGFPSEAEVILPRGILLKINKVSTYMMKETAYNKYLIEGTIVPPSEQLEESV--YDGDHLMETGEVRPMAGFNQFLVEES--KEEENEVSQILASLVNINGMSKKFK</Hsp_hseq> + <Hsp_midline>L GT LYRGQ++ T HN ENK+FYF+N+VSTSLKP IFGE G+ +MALDD +++ E + ED++ +G+ + +D + ++G VI GAE ++VIVPG L+ +P EAEVILPRG LLKINK++T + K + NK+++EG+IVPPSEQ++ESV YDGD METGEV ++GF QF+ E + +E+ ++IL+ ++I+ M +KF+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>2</Hit_num> + <Hit_id>gi|330858714|ref|YP_004415089.1|</Hit_id> + <Hit_def>hypothetical protein Shfl2p198 [Shigella phage Shfl2] >gi|327397648|gb|AEA73150.1| hypothetical protein Shfl2p198 [Shigella phage Shfl2]</Hit_def> + <Hit_accession>YP_004415089</Hit_accession> + <Hit_len>685</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>197.593</Hsp_bit-score> + <Hsp_score>501</Hsp_score> + <Hsp_evalue>4.31042e-55</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>229</Hsp_query-to> + <Hsp_hit-from>474</Hsp_hit-from> + <Hsp_hit-to>684</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>106</Hsp_identity> + <Hsp_positive>154</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>230</Hsp_align-len> + <Hsp_qseq>LDKGTLLYRGQKLDLPTFEHNAENKLFYFRNYVSTSLKPLIFGEFGRMFMALDDDTTIYTAETPDDYNRFANPEDIIDIGATQKDSFDDNNNDGTSINIGKQVNLGFVISGAENVRVIVPGSLTEYPEEAEVILPRGTLLKINKITTQVDKRS--NKFMVEGSIVPPSEQIDESVEIYDGDLFMETGEVVKLSGFMQFVNESAYDEEQNQMAAEILSGFLDIDDMPRKFR</Hsp_qseq> + <Hsp_hseq>LPPGTTLYRGQEVTFKTLRHNIENKMFYFKNFVSTSLKPNIFGEHGKNYMALDDSGAVFSGEGEGS----VDAEDLMHMGSHSAYANED-----------AETSVGMVIKGAERIKVIVPGHLSGFPSEAEVILPRGILLKINKVSTYMMKETAYNKYLIEGTIVPPSEQLEESV--YDGDHLMETGEVRPMAGFNQFLVEES--KEEENEVSQILASLVNINGMSKKFK</Hsp_hseq> + <Hsp_midline>L GT LYRGQ++ T HN ENK+FYF+N+VSTSLKP IFGE G+ +MALDD +++ E + ED++ +G+ + +D + ++G VI GAE ++VIVPG L+ +P EAEVILPRG LLKINK++T + K + NK+++EG+IVPPSEQ++ESV YDGD METGEV ++GF QF+ E + +E+ ++IL+ ++I+ M +KF+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>3</Hit_num> + <Hit_id>gi|228861509|ref|YP_002854530.1|</Hit_id> + <Hit_def>alt.-2 hypothetical protein [Enterobacteria phage RB14] >gi|227438525|gb|ACP30838.1| alt.-2 hypothetical protein [Enterobacteria phage RB14]</Hit_def> + <Hit_accession>YP_002854530</Hit_accession> + <Hit_len>685</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>197.593</Hsp_bit-score> + <Hsp_score>501</Hsp_score> + <Hsp_evalue>4.35388e-55</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>229</Hsp_query-to> + <Hsp_hit-from>474</Hsp_hit-from> + <Hsp_hit-to>684</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>108</Hsp_identity> + <Hsp_positive>152</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>230</Hsp_align-len> + <Hsp_qseq>LDKGTLLYRGQKLDLPTFEHNAENKLFYFRNYVSTSLKPLIFGEFGRMFMALDDDTTIYTAETPDDYNRFANPEDIIDIGATQKDSFDDNNNDGTSINIGKQVNLGFVISGAENVRVIVPGSLTEYPEEAEVILPRGTLLKINKITTQVDKRS--NKFMVEGSIVPPSEQIDESVEIYDGDLFMETGEVVKLSGFMQFVNESAYDEEQNQMAAEILSGFLDIDDMPRKFR</Hsp_qseq> + <Hsp_hseq>LPPGTTLYRGQEVTFKTLRHNIENKMFYFKNFVSTSLKPNIFGEHGKNYMALDDSGAVFSGEGEGS----VDAEDLMHMGS-----------HSTYANEDAETSVGMVIKGAERVKVIVPGHLSGFPSEAEVILPRGILLKINKVSTYFMKETAYNKYLIEGTIVPPSEQLEESV--YDGDHLMETGEVRPMAGFNQFLVEES--KEEENEVSQILASLVNINGMSKKFK</Hsp_hseq> + <Hsp_midline>L GT LYRGQ++ T HN ENK+FYF+N+VSTSLKP IFGE G+ +MALDD +++ E + ED++ +G+ T N + ++G VI GAE V+VIVPG L+ +P EAEVILPRG LLKINK++T K + NK+++EG+IVPPSEQ++ESV YDGD METGEV ++GF QF+ E + +E+ ++IL+ ++I+ M +KF+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +</Iteration_hits> + <Iteration_stat> + <Statistics> + <Statistics_db-num>48094830</Statistics_db-num> + <Statistics_db-len>17186091396</Statistics_db-len> + <Statistics_hsp-len>143</Statistics_hsp-len> + <Statistics_eff-space>886533640716</Statistics_eff-space> + <Statistics_kappa>0.041</Statistics_kappa> + <Statistics_lambda>0.267</Statistics_lambda> + <Statistics_entropy>0.14</Statistics_entropy> + </Statistics> + </Iteration_stat> +</Iteration> +</BlastOutput_iterations> +</BlastOutput> +
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/blastxml/blast.xml Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,2862 @@ +<?xml version="1.0"?> +<!DOCTYPE BlastOutput PUBLIC "-//NCBI//NCBI BlastOutput/EN" "http://www.ncbi.nlm.nih.gov/dtd/NCBI_BlastOutput.dtd"> +<BlastOutput> + <BlastOutput_program>blastp</BlastOutput_program> + <BlastOutput_version>BLASTP 2.2.28+</BlastOutput_version> + <BlastOutput_reference>Stephen F. Altschul, Thomas L. Madden, Alejandro A. Sch&auml;ffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.</BlastOutput_reference> + <BlastOutput_db>/usr/local/syncdb/community/nr/nr</BlastOutput_db> + <BlastOutput_query-ID>Query_1</BlastOutput_query-ID> + <BlastOutput_query-def>Merlin_1</BlastOutput_query-def> + <BlastOutput_query-len>229</BlastOutput_query-len> + <BlastOutput_param> + <Parameters> + <Parameters_matrix>BLOSUM62</Parameters_matrix> + <Parameters_expect>0.001</Parameters_expect> + <Parameters_gap-open>11</Parameters_gap-open> + <Parameters_gap-extend>1</Parameters_gap-extend> + <Parameters_filter>F</Parameters_filter> + </Parameters> + </BlastOutput_param> +<BlastOutput_iterations> +<Iteration> + <Iteration_iter-num>1</Iteration_iter-num> + <Iteration_query-ID>Query_1</Iteration_query-ID> + <Iteration_query-def>Merlin_1</Iteration_query-def> + <Iteration_query-len>229</Iteration_query-len> +<Iteration_hits> +<Hit> + <Hit_num>1</Hit_num> + <Hit_id>gi|422934611|ref|YP_007004572.1|</Hit_id> + <Hit_def>hypothetical protein [Enterobacteria phage ime09] >gi|339791394|gb|AEK12451.1| hypothetical protein [Enterobacteria phage ime09]</Hit_def> + <Hit_accession>YP_007004572</Hit_accession> + <Hit_len>685</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>197.593</Hsp_bit-score> + <Hsp_score>501</Hsp_score> + <Hsp_evalue>3.74548e-55</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>229</Hsp_query-to> + <Hsp_hit-from>474</Hsp_hit-from> + <Hsp_hit-to>684</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>106</Hsp_identity> + <Hsp_positive>154</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>230</Hsp_align-len> + <Hsp_qseq>LDKGTLLYRGQKLDLPTFEHNAENKLFYFRNYVSTSLKPLIFGEFGRMFMALDDDTTIYTAETPDDYNRFANPEDIIDIGATQKDSFDDNNNDGTSINIGKQVNLGFVISGAENVRVIVPGSLTEYPEEAEVILPRGTLLKINKITTQVDKRS--NKFMVEGSIVPPSEQIDESVEIYDGDLFMETGEVVKLSGFMQFVNESAYDEEQNQMAAEILSGFLDIDDMPRKFR</Hsp_qseq> + <Hsp_hseq>LPPGTTLYRGQEVTFKTLRHNIENKMFYFKNFVSTSLKPNIFGEHGKNYMALDDSGAVFSGEGEGS----VDAEDLMHMGSHSAYANED-----------AETSVGMVIKGAERIKVIVPGHLSGFPSEAEVILPRGILLKINKVSTYMMKETAYNKYLIEGTIVPPSEQLEESV--YDGDHLMETGEVRPMAGFNQFLVEES--KEEENEVSQILASLVNINGMSKKFK</Hsp_hseq> + <Hsp_midline>L GT LYRGQ++ T HN ENK+FYF+N+VSTSLKP IFGE G+ +MALDD +++ E + ED++ +G+ + +D + ++G VI GAE ++VIVPG L+ +P EAEVILPRG LLKINK++T + K + NK+++EG+IVPPSEQ++ESV YDGD METGEV ++GF QF+ E + +E+ ++IL+ ++I+ M +KF+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>2</Hit_num> + <Hit_id>gi|330858714|ref|YP_004415089.1|</Hit_id> + <Hit_def>hypothetical protein Shfl2p198 [Shigella phage Shfl2] >gi|327397648|gb|AEA73150.1| hypothetical protein Shfl2p198 [Shigella phage Shfl2]</Hit_def> + <Hit_accession>YP_004415089</Hit_accession> + <Hit_len>685</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>197.593</Hsp_bit-score> + <Hsp_score>501</Hsp_score> + <Hsp_evalue>4.31042e-55</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>229</Hsp_query-to> + <Hsp_hit-from>474</Hsp_hit-from> + <Hsp_hit-to>684</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>106</Hsp_identity> + <Hsp_positive>154</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>230</Hsp_align-len> + <Hsp_qseq>LDKGTLLYRGQKLDLPTFEHNAENKLFYFRNYVSTSLKPLIFGEFGRMFMALDDDTTIYTAETPDDYNRFANPEDIIDIGATQKDSFDDNNNDGTSINIGKQVNLGFVISGAENVRVIVPGSLTEYPEEAEVILPRGTLLKINKITTQVDKRS--NKFMVEGSIVPPSEQIDESVEIYDGDLFMETGEVVKLSGFMQFVNESAYDEEQNQMAAEILSGFLDIDDMPRKFR</Hsp_qseq> + <Hsp_hseq>LPPGTTLYRGQEVTFKTLRHNIENKMFYFKNFVSTSLKPNIFGEHGKNYMALDDSGAVFSGEGEGS----VDAEDLMHMGSHSAYANED-----------AETSVGMVIKGAERIKVIVPGHLSGFPSEAEVILPRGILLKINKVSTYMMKETAYNKYLIEGTIVPPSEQLEESV--YDGDHLMETGEVRPMAGFNQFLVEES--KEEENEVSQILASLVNINGMSKKFK</Hsp_hseq> + <Hsp_midline>L GT LYRGQ++ T HN ENK+FYF+N+VSTSLKP IFGE G+ +MALDD +++ E + ED++ +G+ + +D + ++G VI GAE ++VIVPG L+ +P EAEVILPRG LLKINK++T + K + NK+++EG+IVPPSEQ++ESV YDGD METGEV ++GF QF+ E + +E+ ++IL+ ++I+ M +KF+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>3</Hit_num> + <Hit_id>gi|228861509|ref|YP_002854530.1|</Hit_id> + <Hit_def>alt.-2 hypothetical protein [Enterobacteria phage RB14] >gi|227438525|gb|ACP30838.1| alt.-2 hypothetical protein [Enterobacteria phage RB14]</Hit_def> + <Hit_accession>YP_002854530</Hit_accession> + <Hit_len>685</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>197.593</Hsp_bit-score> + <Hsp_score>501</Hsp_score> + <Hsp_evalue>4.35388e-55</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>229</Hsp_query-to> + <Hsp_hit-from>474</Hsp_hit-from> + <Hsp_hit-to>684</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>108</Hsp_identity> + <Hsp_positive>152</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>230</Hsp_align-len> + <Hsp_qseq>LDKGTLLYRGQKLDLPTFEHNAENKLFYFRNYVSTSLKPLIFGEFGRMFMALDDDTTIYTAETPDDYNRFANPEDIIDIGATQKDSFDDNNNDGTSINIGKQVNLGFVISGAENVRVIVPGSLTEYPEEAEVILPRGTLLKINKITTQVDKRS--NKFMVEGSIVPPSEQIDESVEIYDGDLFMETGEVVKLSGFMQFVNESAYDEEQNQMAAEILSGFLDIDDMPRKFR</Hsp_qseq> + <Hsp_hseq>LPPGTTLYRGQEVTFKTLRHNIENKMFYFKNFVSTSLKPNIFGEHGKNYMALDDSGAVFSGEGEGS----VDAEDLMHMGS-----------HSTYANEDAETSVGMVIKGAERVKVIVPGHLSGFPSEAEVILPRGILLKINKVSTYFMKETAYNKYLIEGTIVPPSEQLEESV--YDGDHLMETGEVRPMAGFNQFLVEES--KEEENEVSQILASLVNINGMSKKFK</Hsp_hseq> + <Hsp_midline>L GT LYRGQ++ T HN ENK+FYF+N+VSTSLKP IFGE G+ +MALDD +++ E + ED++ +G+ T N + ++G VI GAE V+VIVPG L+ +P EAEVILPRG LLKINK++T K + NK+++EG+IVPPSEQ++ESV YDGD METGEV ++GF QF+ E + +E+ ++IL+ ++I+ M +KF+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +</Iteration_hits> + <Iteration_stat> + <Statistics> + <Statistics_db-num>48094830</Statistics_db-num> + <Statistics_db-len>17186091396</Statistics_db-len> + <Statistics_hsp-len>143</Statistics_hsp-len> + <Statistics_eff-space>886533640716</Statistics_eff-space> + <Statistics_kappa>0.041</Statistics_kappa> + <Statistics_lambda>0.267</Statistics_lambda> + <Statistics_entropy>0.14</Statistics_entropy> + </Statistics> + </Iteration_stat> +</Iteration> +<Iteration> + <Iteration_iter-num>2</Iteration_iter-num> + <Iteration_query-ID>Query_2</Iteration_query-ID> + <Iteration_query-def>Merlin_2</Iteration_query-def> + <Iteration_query-len>95</Iteration_query-len> +<Iteration_hits> +<Hit> + <Hit_num>1</Hit_num> + <Hit_id>gi|308814559|ref|YP_003934833.1|</Hit_id> + <Hit_def>hypothetical protein SP18_gp210 [Shigella phage SP18] >gi|308206151|gb|ADO19550.1| hypothetical protein SP18gp210 [Shigella phage SP18]</Hit_def> + <Hit_accession>YP_003934833</Hit_accession> + <Hit_len>107</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>79.337</Hsp_bit-score> + <Hsp_score>194</Hsp_score> + <Hsp_evalue>9.23754e-17</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>95</Hsp_query-to> + <Hsp_hit-from>12</Hsp_hit-from> + <Hsp_hit-to>107</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>42</Hsp_identity> + <Hsp_positive>56</Hsp_positive> + <Hsp_gaps>1</Hsp_gaps> + <Hsp_align-len>96</Hsp_align-len> + <Hsp_qseq>MKSIFRINGVEIVVEDVVPMSYEFNEVVFKELKKILG-DKKLQSTPIGRFGMKENVDTYIESVVTGQLEGEFSVAVQTVENDEVILTLPAFVIFRK</Hsp_qseq> + <Hsp_hseq>MKSSFRFNGQELVVENVIPASEEFDSAVGNELRRVFGEDKKFDLRPVENFVNSEQTENIFNGVVTGQLESEAPIAITVFAKKEVVMTAAGFISFRK</Hsp_hseq> + <Hsp_midline>MKS FR NG E+VVE+V+P S EF+ V EL+++ G DKK P+ F E + VVTGQLE E +A+ EV++T F+ FRK</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>2</Hit_num> + <Hit_id>gi|456351278|ref|YP_007501230.1|</Hit_id> + <Hit_def>hypothetical protein [Salmonella phage S16] >gi|448913695|gb|AGE48199.1| hypothetical protein [Salmonella phage S16]</Hit_def> + <Hit_accession>YP_007501230</Hit_accession> + <Hit_len>106</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>77.7962</Hsp_bit-score> + <Hsp_score>190</Hsp_score> + <Hsp_evalue>2.9568e-16</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>94</Hsp_query-to> + <Hsp_hit-from>11</Hsp_hit-from> + <Hsp_hit-to>106</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>42</Hsp_identity> + <Hsp_positive>57</Hsp_positive> + <Hsp_gaps>2</Hsp_gaps> + <Hsp_align-len>96</Hsp_align-len> + <Hsp_qseq>MKSIFRINGVEIVVEDVVPMSYEFNEVVFKELKKILG-DKKLQSTPIGRFGMKE-NVDTYIESVVTGQLEGEFSVAVQTVENDEVILTLPAFVIFR</Hsp_qseq> + <Hsp_hseq>MKSILRIASTEIVIENAKPDSREFNEAAYELLQELYGTDKNFQLHPLPRFGVKEGQADNYISGVLSGNLVGEVPCAISIIAEDNQISNVVGFVVFR</Hsp_hseq> + <Hsp_midline>MKSI RI EIV+E+ P S EFNE ++ L+++ G DK Q P+ RFG+KE D YI V++G L GE A+ + D I + FV+FR</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>3</Hit_num> + <Hit_id>gi|408387127|gb|AFU64136.1|</Hit_id> + <Hit_def>hypothetical protein [Salmonella phage STML-198]</Hit_def> + <Hit_accession>AFU64136</Hit_accession> + <Hit_len>96</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>77.0258</Hsp_bit-score> + <Hsp_score>188</Hsp_score> + <Hsp_evalue>5.19436e-16</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>94</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>96</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>42</Hsp_identity> + <Hsp_positive>57</Hsp_positive> + <Hsp_gaps>2</Hsp_gaps> + <Hsp_align-len>96</Hsp_align-len> + <Hsp_qseq>MKSIFRINGVEIVVEDVVPMSYEFNEVVFKELKKILG-DKKLQSTPIGRFGMKE-NVDTYIESVVTGQLEGEFSVAVQTVENDEVILTLPAFVIFR</Hsp_qseq> + <Hsp_hseq>MKSILRIASTETVIENVKPDSREFNEAAYELLQELYGTDKNFQLHPLPRFGVKEGQADNYISGVLSGNLVGEVPCAISIIAEDNQISNVVGFVVFR</Hsp_hseq> + <Hsp_midline>MKSI RI E V+E+V P S EFNE ++ L+++ G DK Q P+ RFG+KE D YI V++G L GE A+ + D I + FV+FR</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>4</Hit_num> + <Hit_id>gi|314121774|ref|YP_004063893.1|</Hit_id> + <Hit_def>Alt.-3 conserved hypothetical protein [Enterobacteria phage vB_EcoM-VR7] >gi|313151531|gb|ADR32587.1| Alt.-3 conserved hypothetical protein [Enterobacteria phage vB_EcoM-VR7]</Hit_def> + <Hit_accession>YP_004063893</Hit_accession> + <Hit_len>96</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>76.6406</Hsp_bit-score> + <Hsp_score>187</Hsp_score> + <Hsp_evalue>7.7684e-16</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>95</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>96</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>40</Hsp_identity> + <Hsp_positive>56</Hsp_positive> + <Hsp_gaps>1</Hsp_gaps> + <Hsp_align-len>96</Hsp_align-len> + <Hsp_qseq>MKSIFRINGVEIVVEDVVPMSYEFNEVVFKELKKILG-DKKLQSTPIGRFGMKENVDTYIESVVTGQLEGEFSVAVQTVENDEVILTLPAFVIFRK</Hsp_qseq> + <Hsp_hseq>MKSSFRFNGQELVVENVIPASEEFDSAVGNELRRVFGEDKKFDLRPVENFVNSEQTENIFNGIVTGQLESEAPIAITVFVKKEAVMTVAGFISFRK</Hsp_hseq> + <Hsp_midline>MKS FR NG E+VVE+V+P S EF+ V EL+++ G DKK P+ F E + +VTGQLE E +A+ E ++T+ F+ FRK</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>5</Hit_num> + <Hit_id>gi|161622625|ref|YP_001595321.1|</Hit_id> + <Hit_def>Alt.-3 conserved hypothetical protein [Enterobacteria phage JS98] >gi|52139951|gb|AAU29321.1| Alt.-3 conserved hypothetical protein [Enterobacteria phage JS98]</Hit_def> + <Hit_accession>YP_001595321</Hit_accession> + <Hit_len>96</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>75.485</Hsp_bit-score> + <Hsp_score>184</Hsp_score> + <Hsp_evalue>2.41009e-15</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>95</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>96</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>39</Hsp_identity> + <Hsp_positive>55</Hsp_positive> + <Hsp_gaps>1</Hsp_gaps> + <Hsp_align-len>96</Hsp_align-len> + <Hsp_qseq>MKSIFRINGVEIVVEDVVPMSYEFNEVVFKELKKILG-DKKLQSTPIGRFGMKENVDTYIESVVTGQLEGEFSVAVQTVENDEVILTLPAFVIFRK</Hsp_qseq> + <Hsp_hseq>MKSAFRFNGQELVVENVIPASEEFDSAVGNELRRVFGEDKQFDLRPIENFSQPEQTENIFNGVVTGQLESEAPISITVFVKKQPLMTAAGFISFRK</Hsp_hseq> + <Hsp_midline>MKS FR NG E+VVE+V+P S EF+ V EL+++ G DK+ PI F E + VVTGQLE E +++ + ++T F+ FRK</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +</Iteration_hits> + <Iteration_stat> + <Statistics> + <Statistics_db-num>48094830</Statistics_db-num> + <Statistics_db-len>17186091396</Statistics_db-len> + <Statistics_hsp-len>65</Statistics_hsp-len> + <Statistics_eff-space>421797823380</Statistics_eff-space> + <Statistics_kappa>0.041</Statistics_kappa> + <Statistics_lambda>0.267</Statistics_lambda> + <Statistics_entropy>0.14</Statistics_entropy> + </Statistics> + </Iteration_stat> +</Iteration> +<Iteration> + <Iteration_iter-num>3</Iteration_iter-num> + <Iteration_query-ID>Query_3</Iteration_query-ID> + <Iteration_query-def>Merlin_3</Iteration_query-def> + <Iteration_query-len>314</Iteration_query-len> +<Iteration_hits> +<Hit> + <Hit_num>1</Hit_num> + <Hit_id>gi|456351277|ref|YP_007501229.1|</Hit_id> + <Hit_def>baseplate subunit [Salmonella phage S16] >gi|347466342|gb|AEO97128.1| baseplate subunit [Salmonella phage S16] >gi|408387126|gb|AFU64135.1| tail assembly [Salmonella phage STML-198]</Hit_def> + <Hit_accession>YP_007501229</Hit_accession> + <Hit_len>305</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>573.548</Hsp_bit-score> + <Hsp_score>1477</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>302</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>302</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>266</Hsp_identity> + <Hsp_positive>289</Hsp_positive> + <Hsp_gaps>0</Hsp_gaps> + <Hsp_align-len>302</Hsp_align-len> + <Hsp_qseq>MLTLDEFKNQAGNIDFQRTNMFSCVFATTPSAKSQQLLDQFGGMLFNNLPLNNDWLGLTQGEFTSGLTSIITAGTQQLVRKSGVSKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYPQSGLMIYSVKIPENRLSHEMDFNHNSPNIRITGRELDPLTISFRMDPEASNYRAMQDWVNSVQDPVTGLRALPTDVEADIQVNLHARNGLPHTVIMFTGCVPVACGAPELTYEGDNQIAVFDVTFAYRVMQTGAVGRQAALDWIEDRAVNSITGINSEMSLNGSLSRLSRLGGAAGGLSHVIN</Hsp_qseq> + <Hsp_hseq>MYTLDEFKNQAANIDFQRTNMFSCVFATTPSAKSQQLLDQFGGMLFNNLPLNNDWLGLTQGEFTQGLTNIITSGTRDLTRKSGVSKYLIGAMSNRVVQSLLGEFEVGTYLIDFFNMVYPQSGLMIYSVKIPENRLSHEMDFMHNSPNIKITGRDLEPLTVSFRMDPEASNYRAMQDWVNAVQDPVTGLRALPTDVEADIQVNLHARNGIPHTVIMFTGCIPISCGAPELTYEGDNQIAVFDVTFAYRVMQAGAVGRQAAIDWLEDKTVDSIDKINPDLSLNGSLSRLSRLGGAGGGISNIVN</Hsp_hseq> + <Hsp_midline>M TLDEFKNQA NIDFQRTNMFSCVFATTPSAKSQQLLDQFGGMLFNNLPLNNDWLGLTQGEFT GLT+IIT+GT+ L RKSGVSKYLIGAMSNRVVQSLLGEFEVGTYL+DFFNM YPQSGLMIYSVKIPENRLSHEMDF HNSPNI+ITGR+L+PLT+SFRMDPEASNYRAMQDWVN+VQDPVTGLRALPTDVEADIQVNLHARNG+PHTVIMFTGC+P++CGAPELTYEGDNQIAVFDVTFAYRVMQ GAVGRQAA+DW+ED+ V+SI IN ++SLNGSLSRLSRLGGA GG+S+++N</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>2</Hit_num> + <Hit_id>gi|311993189|ref|YP_004010055.1|</Hit_id> + <Hit_def>gp54 base plate tail tube initiator [Enterobacteria phage CC31] >gi|284178027|gb|ADB81693.1| gp54 base plate tail tube initiator [Enterobacteria phage CC31]</Hit_def> + <Hit_accession>YP_004010055</Hit_accession> + <Hit_len>320</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>539.265</Hsp_bit-score> + <Hsp_score>1388</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>314</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>320</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>258</Hsp_identity> + <Hsp_positive>286</Hsp_positive> + <Hsp_gaps>6</Hsp_gaps> + <Hsp_align-len>320</Hsp_align-len> + <Hsp_qseq>MLTLDEFKNQAGNIDFQRTNMFSCVFATTPSAKSQQLLDQFGGMLFNNLPLNNDWLGLTQGEFTSGLTSIITAGTQQLVRKSGVSKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYPQSGLMIYSVKIPENRLSHEMDFNHNSPNIRITGRELDPLTISFRMDPEASNYRAMQDWVNSVQDPVTGLRALPTDVEADIQVNLHARNGLPHTVIMFTGCVPVACGAPELTYEGDNQIAVFDVTFAYRVMQTGAVGRQAALDWIEDRAVNSITGINSEMSLNGSLSRLSRLGGAAGGLSHVINST------RNSTSKILGL</Hsp_qseq> + <Hsp_hseq>MLNLDEFNNQVMNVDFQRTNMFSCVFATSPSAKSQLLLDQFGGMLYNNLPVSGDWLGLSQGEFTQGLTSIITAGTQELVRKSGVSKYLIGAMTNRVVQSLLGEFEVGTYLLDFFNMAFPTSGLMIYSAKIPDNRLSHETDWLHNSPNIRITGRELEPLTLSFRMDSEASNWRAMQDWVNSVQDPVTGLRALPVDVEADIQVNLHARNGLPHTVCMFTGCVPVSCGSPEFTWDGDNQIAVFDVQFAYRVMQVGAVGRQAAADWVEDRLVHAIGNISDDMGLDSSLSRLSRLGGAAGGITQMGNAIGRKTGMWNSTSKILGL</Hsp_hseq> + <Hsp_midline>ML LDEF NQ N+DFQRTNMFSCVFAT+PSAKSQ LLDQFGGML+NNLP++ DWLGL+QGEFT GLTSIITAGTQ+LVRKSGVSKYLIGAM+NRVVQSLLGEFEVGTYLLDFFNMA+P SGLMIYS KIP+NRLSHE D+ HNSPNIRITGREL+PLT+SFRMD EASN+RAMQDWVNSVQDPVTGLRALP DVEADIQVNLHARNGLPHTV MFTGCVPV+CG+PE T++GDNQIAVFDV FAYRVMQ GAVGRQAA DW+EDR V++I I+ +M L+ SLSRLSRLGGAAGG++ + N+ NSTSKILGL</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>3</Hit_num> + <Hit_id>gi|589889940|ref|YP_009005476.1|</Hit_id> + <Hit_def>baseplate subunit [Enterobacter phage PG7] >gi|583927853|gb|AHI61115.1| baseplate subunit [Enterobacter phage PG7]</Hit_def> + <Hit_accession>YP_009005476</Hit_accession> + <Hit_len>320</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>535.798</Hsp_bit-score> + <Hsp_score>1379</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>314</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>320</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>257</Hsp_identity> + <Hsp_positive>285</Hsp_positive> + <Hsp_gaps>6</Hsp_gaps> + <Hsp_align-len>320</Hsp_align-len> + <Hsp_qseq>MLTLDEFKNQAGNIDFQRTNMFSCVFATTPSAKSQQLLDQFGGMLFNNLPLNNDWLGLTQGEFTSGLTSIITAGTQQLVRKSGVSKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYPQSGLMIYSVKIPENRLSHEMDFNHNSPNIRITGRELDPLTISFRMDPEASNYRAMQDWVNSVQDPVTGLRALPTDVEADIQVNLHARNGLPHTVIMFTGCVPVACGAPELTYEGDNQIAVFDVTFAYRVMQTGAVGRQAALDWIEDRAVNSITGINSEMSLNGSLSRLSRLGGAAGGLSHVINST------RNSTSKILGL</Hsp_qseq> + <Hsp_hseq>MLNLDEFNNQVMNVDFQRTNMFSCVFATTPSAKSQLLLDQFGGMLYNNLPVSGDWLGLSQGEFTQGITSIITAGTQELVRKSGVSKYLIGAMTNRVVQSLLGEFEVGTYLLDFFNMAFPTSGLMIYSAKIPDNRLSHETDWLHNSPNIRITGRELEPLTLSFRMDSEASNWRAMQDWVNSVQDPVTGLRALPVDVEADIQVNLHARNGLPHTVCMFTGCVPVSCGSPEFTWDGDNQIAVFDVQFAYRVMQVGAVGRQAAADWVEDRLVHAIGNISDDMGLDPSLSRLSRLGGAGGGITQMGNAIGRKTGMWNSTSKILGL</Hsp_hseq> + <Hsp_midline>ML LDEF NQ N+DFQRTNMFSCVFATTPSAKSQ LLDQFGGML+NNLP++ DWLGL+QGEFT G+TSIITAGTQ+LVRKSGVSKYLIGAM+NRVVQSLLGEFEVGTYLLDFFNMA+P SGLMIYS KIP+NRLSHE D+ HNSPNIRITGREL+PLT+SFRMD EASN+RAMQDWVNSVQDPVTGLRALP DVEADIQVNLHARNGLPHTV MFTGCVPV+CG+PE T++GDNQIAVFDV FAYRVMQ GAVGRQAA DW+EDR V++I I+ +M L+ SLSRLSRLGGA GG++ + N+ NSTSKILGL</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>4</Hit_num> + <Hit_id>gi|314121773|ref|YP_004063892.1|</Hit_id> + <Hit_def>gp54 baseplate subunit [Enterobacteria phage vB_EcoM-VR7] >gi|313151530|gb|ADR32586.1| gp54 baseplate subunit [Enterobacteria phage vB_EcoM-VR7]</Hit_def> + <Hit_accession>YP_004063892</Hit_accession> + <Hit_len>319</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>479.174</Hsp_bit-score> + <Hsp_score>1232</Hsp_score> + <Hsp_evalue>6.96493e-167</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>313</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>313</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>218</Hsp_identity> + <Hsp_positive>264</Hsp_positive> + <Hsp_gaps>0</Hsp_gaps> + <Hsp_align-len>313</Hsp_align-len> + <Hsp_qseq>MLTLDEFKNQAGNIDFQRTNMFSCVFATTPSAKSQQLLDQFGGMLFNNLPLNNDWLGLTQGEFTSGLTSIITAGTQQLVRKSGVSKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYPQSGLMIYSVKIPENRLSHEMDFNHNSPNIRITGRELDPLTISFRMDPEASNYRAMQDWVNSVQDPVTGLRALPTDVEADIQVNLHARNGLPHTVIMFTGCVPVACGAPELTYEGDNQIAVFDVTFAYRVMQTGAVGRQAALDWIEDRAVNSITGINSEMSLNGSLSRLSRLGGAAGGLSHVINSTRNSTSKILG</Hsp_qseq> + <Hsp_hseq>MFTLQEFQTQAINIDLQRNNLFSVVFATAPSSKSQNLLDQFGGALFSNLPVNSDWFGLTQGDLTQGITTLVTAGTQKLIRKSGISKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYPTAGLLVHSVKLPDNTLNYEMDLNHNAPNIKITGREYSPLVLSFRMDSEAGNFRAFNDWVNSVQDPVTQLRALPEDVEADIQVNLHSRNGLPHTVVMLTGCVPVSVSAPELSYEGDNQIATFDVTFAYRVMSTGAVGRNAALEWLEDKVIKGVSGISSDNNLNAEVAKLSRLSGAQSGLTSLYNTFTGSGRAVSG</Hsp_hseq> + <Hsp_midline>M TL EF+ QA NID QR N+FS VFAT PS+KSQ LLDQFGG LF+NLP+N+DW GLTQG+ T G+T+++TAGTQ+L+RKSG+SKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYP +GL+++SVK+P+N L++EMD NHN+PNI+ITGRE PL +SFRMD EA N+RA DWVNSVQDPVT LRALP DVEADIQVNLH+RNGLPHTV+M TGCVPV+ APEL+YEGDNQIA FDVTFAYRVM TGAVGR AAL+W+ED+ + ++GI+S+ +LN +++LSRL GA GL+ + N+ S + G</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>5</Hit_num> + <Hit_id>gi|308814558|ref|YP_003934832.1|</Hit_id> + <Hit_def>baseplate tail tube initiator [Shigella phage SP18] >gi|308206150|gb|ADO19549.1| baseplate tail tube initiator [Shigella phage SP18]</Hit_def> + <Hit_accession>YP_003934832</Hit_accession> + <Hit_len>314</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>478.404</Hsp_bit-score> + <Hsp_score>1230</Hsp_score> + <Hsp_evalue>1.05147e-166</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>303</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>303</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>216</Hsp_identity> + <Hsp_positive>261</Hsp_positive> + <Hsp_gaps>0</Hsp_gaps> + <Hsp_align-len>303</Hsp_align-len> + <Hsp_qseq>MLTLDEFKNQAGNIDFQRTNMFSCVFATTPSAKSQQLLDQFGGMLFNNLPLNNDWLGLTQGEFTSGLTSIITAGTQQLVRKSGVSKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYPQSGLMIYSVKIPENRLSHEMDFNHNSPNIRITGRELDPLTISFRMDPEASNYRAMQDWVNSVQDPVTGLRALPTDVEADIQVNLHARNGLPHTVIMFTGCVPVACGAPELTYEGDNQIAVFDVTFAYRVMQTGAVGRQAALDWIEDRAVNSITGINSEMSLNGSLSRLSRLGGAAGGLSHVINS</Hsp_qseq> + <Hsp_hseq>MFTLQEFQTQAINIDLQRNNLFSVVFATAPSSKSQNLLDQFGGALFSNLPVNSDWFGLTQGDLTQGITTLVTAGTQKLIRKSGISKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYPTAGLLVHSVKLPDNTLNYEMDLNHNAPNIKITGREYSPLVLSFRMDSEAGNFRAFNDWVNSVQDPVTQLRALPEDVEADIQVNLHSRNGLPHTVVMLTGCVPVSVSAPELSYEGDNQIATFDVTFAYRVMSTGAVGRAAALEWLEDKVIKGVSGISSDNNLNAEVAKLSRLSGAQSGLTSLYNT</Hsp_hseq> + <Hsp_midline>M TL EF+ QA NID QR N+FS VFAT PS+KSQ LLDQFGG LF+NLP+N+DW GLTQG+ T G+T+++TAGTQ+L+RKSG+SKYLIGAMSNRVVQSLLGEFEVGTYLLDFFNMAYP +GL+++SVK+P+N L++EMD NHN+PNI+ITGRE PL +SFRMD EA N+RA DWVNSVQDPVT LRALP DVEADIQVNLH+RNGLPHTV+M TGCVPV+ APEL+YEGDNQIA FDVTFAYRVM TGAVGR AAL+W+ED+ + ++GI+S+ +LN +++LSRL GA GL+ + N+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +</Iteration_hits> + <Iteration_stat> + <Statistics> + <Statistics_db-num>48094830</Statistics_db-num> + <Statistics_db-len>17186091396</Statistics_db-len> + <Statistics_hsp-len>147</Statistics_hsp-len> + <Statistics_eff-space>1689397281462</Statistics_eff-space> + <Statistics_kappa>0.041</Statistics_kappa> + <Statistics_lambda>0.267</Statistics_lambda> + <Statistics_entropy>0.14</Statistics_entropy> + </Statistics> + </Iteration_stat> +</Iteration> +<Iteration> + <Iteration_iter-num>4</Iteration_iter-num> + <Iteration_query-ID>Query_4</Iteration_query-ID> + <Iteration_query-def>Merlin_4</Iteration_query-def> + <Iteration_query-len>351</Iteration_query-len> +<Iteration_hits> +<Hit> + <Hit_num>1</Hit_num> + <Hit_id>gi|456351276|ref|YP_007501228.1|</Hit_id> + <Hit_def>baseplate subunit [Salmonella phage S16] >gi|347466341|gb|AEO97127.1| baseplate subunit [Salmonella phage S16]</Hit_def> + <Hit_accession>YP_007501228</Hit_accession> + <Hit_len>350</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>590.882</Hsp_bit-score> + <Hsp_score>1522</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>5</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>350</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>291</Hsp_identity> + <Hsp_positive>319</Hsp_positive> + <Hsp_gaps>1</Hsp_gaps> + <Hsp_align-len>348</Hsp_align-len> + <Hsp_qseq>VRELDDKTDALIS-GVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>VKELKDTAKELWNKGEKISAGQSSQSSKIKSTVTVQYPSERSAGNDVTGNLRVHDLYKNGLLFTAYDMNSRTSGDMRNMRLGELRRTSQDIVKSVTGKNTKQVDKIPVANILLPRSKSDVDSTSHKFNDVADSLISRGGGTATGVLSNVASTAVFGALESVTQGLMADNNEQIYNTARSMYAGADNRTKVFTWDLTPRSVHDLIAIVEIYEYFNYYSYGETGNSTFAKEVKSTLDEWYKSTFLDTLTPTGAPQNDTVFEKITSFLSNVIVVSNPTVWYVRNFGNTSKFDGKTDIFGPCQIQSIRFDKTPNGVFNGLAVAPNLPSTFTLEITMREILTLNRSSIYSEGF</Hsp_hseq> + <Hsp_midline>V+EL D L + G K SAGQSSQS+KIKST+T QYPSERSAGND +G+LRVHDLYKNGLLFTAYDMNSRT+GDMR+MRLGE++RT+ +VKS+TG NT +VDKIPV NILLPRSKSDV+S SHKFNDV DSLISRGGGTATGVLSNVASTAVFG LES+TQGLMAD+NEQIYNTARSMY GADNRTKVFTWDLTPRSV DLIAI+EIYEYFNYYSYGETG ST+AKEVKS LDEWYKSTFLDTLTP A +NDTVFEKITSFLSNVIVVSNPTVW+VRNFG TSKFDG+ ++FGPCQIQSIRFDKTPNG FNGLA+APNLPSTFTLEITMREILTLNR+S+Y+EGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>2</Hit_num> + <Hit_id>gi|408387125|gb|AFU64134.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Salmonella phage STML-198]</Hit_def> + <Hit_accession>AFU64134</Hit_accession> + <Hit_len>350</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>590.497</Hsp_bit-score> + <Hsp_score>1521</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>5</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>350</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>291</Hsp_identity> + <Hsp_positive>319</Hsp_positive> + <Hsp_gaps>1</Hsp_gaps> + <Hsp_align-len>348</Hsp_align-len> + <Hsp_qseq>VRELDDKTDALIS-GVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>VKELKDTAKELWNKGEKISAGQSSQSSKIKSTVTVQYPSERSAGNDVTGNLRVHDLYKNGLLFTAYDMNSRTSGDMRNMRLGELRRTSQDIVKSVTGKNTKQVDKIPVANILLPRSKSDVDSTSHKFNDVADSLISRGGGTATGVLSNVASTAVFGALESVTQGLMADNNEQIYNTARSMYAGADNRTKVFTWDLTPRSVHDLIAIVEIYEYFNYYSYGETGNSTFAKEVKSTLDEWYKSTFLDTLTPTGAPQNDTVFEKITSFLSNVIVVSNPTVWYVRNFGNTSKFDGKTDIFGPCQIQSIRFDKTPNGIFNGLAVAPNLPSTFTLEITMREILTLNRSSIYSEGF</Hsp_hseq> + <Hsp_midline>V+EL D L + G K SAGQSSQS+KIKST+T QYPSERSAGND +G+LRVHDLYKNGLLFTAYDMNSRT+GDMR+MRLGE++RT+ +VKS+TG NT +VDKIPV NILLPRSKSDV+S SHKFNDV DSLISRGGGTATGVLSNVASTAVFG LES+TQGLMAD+NEQIYNTARSMY GADNRTKVFTWDLTPRSV DLIAI+EIYEYFNYYSYGETG ST+AKEVKS LDEWYKSTFLDTLTP A +NDTVFEKITSFLSNVIVVSNPTVW+VRNFG TSKFDG+ ++FGPCQIQSIRFDKTPNG FNGLA+APNLPSTFTLEITMREILTLNR+S+Y+EGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>3</Hit_num> + <Hit_id>gi|311993188|ref|YP_004010054.1|</Hit_id> + <Hit_def>gp48 base plate tail tube cap [Enterobacteria phage CC31] >gi|284178026|gb|ADB81692.1| gp48 base plate tail tube cap [Enterobacteria phage CC31]</Hit_def> + <Hit_accession>YP_004010054</Hit_accession> + <Hit_len>349</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>559.296</Hsp_bit-score> + <Hsp_score>1440</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>349</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>270</Hsp_identity> + <Hsp_positive>310</Hsp_positive> + <Hsp_gaps>2</Hsp_gaps> + <Hsp_align-len>351</Hsp_align-len> + <Hsp_qseq>MSIKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MAIRATEILDK--AFGSGEKTSAGQSSISSTRRSTVTAQYPAERSAGNDAAGDLRVHDLYKNGLLFTAYDMSSRTTPDLRSMRQSQLSKSASSILNSLGIKNNGQVDKSPIANILLPRSKSDVESISHKFNDVGDSLMTRGNNSATGVLSNVASTAVFGALDSITQGLMADNNEQIYNTARSMYAGADNRTKVFTWDLTPRSVADLVSIIQIYEYFNYFSYGETGNSTYAKELKGQLDEWYKTTLLSPLTPDGADLNNTMFENITSFLSNVIVVTNPTVWFIRNFGKTSKFDGRAEVFGPCQIQSIRFDKTPNGQFNGLAIAPNMPSTFTLEITFREILTLNRASLYAEGF</Hsp_hseq> + <Hsp_midline>M+I+ E+ DK A SG KTSAGQSS S+ +ST+TAQYP+ERSAGND +G LRVHDLYKNGLLFTAYDM+SRTT D+RSMR ++ ++A+S++ S+ N +VDK P+ NILLPRSKSDVES+SHKFNDVGDSL++RG +ATGVLSNVASTAVFG L+S+TQGLMAD+NEQIYNTARSMY GADNRTKVFTWDLTPRSV DL++II+IYEYFNY+SYGETG STYAKE+K QLDEWYK+T L LTPD A+ N+T+FE ITSFLSNVIVV+NPTVWF+RNFG TSKFDGRAEVFGPCQIQSIRFDKTPNG FNGLAIAPN+PSTFTLEIT REILTLNRAS+YAEGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>4</Hit_num> + <Hit_id>gi|589889939|ref|YP_009005475.1|</Hit_id> + <Hit_def>baseplate subunit [Enterobacter phage PG7] >gi|583927852|gb|AHI61114.1| baseplate subunit [Enterobacter phage PG7]</Hit_def> + <Hit_accession>YP_009005475</Hit_accession> + <Hit_len>349</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>536.954</Hsp_bit-score> + <Hsp_score>1382</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>349</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>260</Hsp_identity> + <Hsp_positive>305</Hsp_positive> + <Hsp_gaps>2</Hsp_gaps> + <Hsp_align-len>351</Hsp_align-len> + <Hsp_qseq>MSIKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MAIRATEILDKD--FGSGEKTSAGQSSISSTRRSTIVAQYPAQRAAGNDAAGDLRVHDLYKNGLLFTAYDMSSRTSPDLRNMRQSQLSKSASSILNSLGIKNNGQVDKSPIANILLPRSKSDVESTSHKFNDVGESLITRGNNSATGVLSNVASTAVFGALDSVTQGLMADNNEQIYNTARSMYAGADNRTKVFTWDLTPRSVADLVSIIQIYECFNYFSYGETGNSSYAKELKGQLDEWYKTTLLSPLTPDGADLNNTMFENITSFLSNVIVVTNPTVWFIRNFGKTSKFDGRTELFGPCQIQSIRFDKTPNGQFNGLAIAPNMPSTFTLEITFREILTLSRASLYAEGF</Hsp_hseq> + <Hsp_midline>M+I+ E+ DK SG KTSAGQSS S+ +STI AQYP++R+AGND +G LRVHDLYKNGLLFTAYDM+SRT+ D+R+MR ++ ++A+S++ S+ N +VDK P+ NILLPRSKSDVES SHKFNDVG+SLI+RG +ATGVLSNVASTAVFG L+S+TQGLMAD+NEQIYNTARSMY GADNRTKVFTWDLTPRSV DL++II+IYE FNY+SYGETG S+YAKE+K QLDEWYK+T L LTPD A+ N+T+FE ITSFLSNVIVV+NPTVWF+RNFG TSKFDGR E+FGPCQIQSIRFDKTPNG FNGLAIAPN+PSTFTLEIT REILTL+RAS+YAEGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>5</Hit_num> + <Hit_id>gi|414086559|ref|YP_006986748.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Enterobacteria phage vB_EcoM_ACG-C40] >gi|383396340|gb|AFH20156.1| baseplate tail tube cap [Enterobacteria phage vB_EcoM_ACG-C40]</Hit_def> + <Hit_accession>YP_006986748</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>494.197</Hsp_bit-score> + <Hsp_score>1271</Hsp_score> + <Hsp_evalue>1.69091e-171</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>15</Hsp_hit-from> + <Hsp_hit-to>364</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>236</Hsp_identity> + <Hsp_positive>287</Hsp_positive> + <Hsp_gaps>15</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMR------LGEMKRTANSVVKS---------ITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>SGETISAGQSTKSEVGTKTYTAQFPTGRASGNDTTGDFQVTDLYKNGLLFTAYNMSSRDSGSLRTMRSNYSSSSSSILRTARNTISNTVSKLSNGLISDNNSGTISKVPVANILLPRSKSDVDTSSHRFNDVQDSLITKGGGTATGVLSNMASTAVFGALESITQGIMADNNEQIYTTARSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAAEIKGYLDEWYRSTFIEPLTPEDAVKNKTLFEKMTSSLTNVLVVSNPTIWMVKNFGATSKFDGKTEIFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREIITLNRASLYAGTF</Hsp_hseq> + <Hsp_midline>SG SAGQS++S T TAQ+P+ R++GNDT+G +V DLYKNGLLFTAY+M+SR +G +R+MR + RTA + + + I+ N+ + K+PV NILLPRSKSDV++ SH+FNDV DSLI++GGGTATGVLSN+ASTAVFG LES+TQG+MAD+NEQIY TARSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA E+K LDEWY+STF++ LTP++A KN T+FEK+TS L+NV+VVSNPT+W V+NFG TSKFDG+ E+FGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREI+TLNRAS+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>6</Hit_num> + <Hit_id>gi|431809133|ref|YP_007236030.1|</Hit_id> + <Hit_def>phage baseplate tail tube cap (T4-like gp48) [Yersinia phage phiR1-RT] >gi|398313422|emb|CCI88771.1| phage baseplate tail tube cap (T4-like gp48) [Yersinia phage phiR1-RT]</Hit_def> + <Hit_accession>YP_007236030</Hit_accession> + <Hit_len>348</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>492.656</Hsp_bit-score> + <Hsp_score>1267</Hsp_score> + <Hsp_evalue>3.88245e-171</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>347</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>242</Hsp_identity> + <Hsp_positive>290</Hsp_positive> + <Hsp_gaps>6</Hsp_gaps> + <Hsp_align-len>352</Hsp_align-len> + <Hsp_qseq>MSIKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSI-TGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MSIRATEITEST-IKSAGISTSAGQVTQSTAIK-TIQAQFPAERASGNDSTLDLQITDLYKNGLLFTAYDFTSRTSPDLRQNR-ADIQIAAQKKPSSIFTGTKT--VQQTPVANILLPRSKSDVDNTSHKFNDVGESLVTRGGGNATGILSNMASTAVFGALESLTQGYMSDHGEQIYNTARSMYGGADNRQKVFTWDLTPRNVQDLVQIIKIYETFNYYSYGQTGSSSFAKGLKGDLDTWYKNTFLKNMTPDGANLDNTMFEQITSFLTNVIVVSNPTVWYVRNFGATSSFDGRADVFGPCQIASIRFDKSPNGHFNGLAIAPNLPSTFVLEITFREILTLNRNSLYAGGL</Hsp_hseq> + <Hsp_midline>MSI+ E+ + T +G+ TSAGQ +QS IK TI AQ+P+ER++GND++ L++ DLYKNGLLFTAYD SRT+ D+R R +++ A SI TGT T V + PV NILLPRSKSDV++ SHKFNDVG+SL++RGGG ATG+LSN+ASTAVFG LESLTQG M+DH EQIYNTARSMYGGADNR KVFTWDLTPR+VQDL+ II+IYE FNYYSYG+TG+S++AK +K LD WYK+TFL +TPD AN ++T+FE+ITSFL+NVIVVSNPTVW+VRNFG TS FDGRA+VFGPCQI SIRFDK+PNG+FNGLAIAPNLPSTF LEIT REILTLNR S+YA G </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>7</Hit_num> + <Hit_id>gi|228861125|ref|YP_002854148.1|</Hit_id> + <Hit_def>gp48 base plate [Enterobacteria phage RB51] >gi|422934973|ref|YP_007004933.1| baseplate tail tube cap [Escherichia phage wV7] >gi|227438799|gb|ACP31111.1| gp48 base plate [Enterobacteria phage RB51] >gi|291290411|dbj|BAI83206.1| baseplate tail tube cap [Enterobacteria phage AR1] >gi|343177527|gb|AEM00853.1| baseplate tail tube cap [Escherichia phage wV7]</Hit_def> + <Hit_accession>YP_002854148</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>491.5</Hsp_bit-score> + <Hsp_score>1264</Hsp_score> + <Hsp_evalue>1.72752e-170</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>15</Hsp_hit-from> + <Hsp_hit-to>364</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>235</Hsp_identity> + <Hsp_positive>286</Hsp_positive> + <Hsp_gaps>15</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMR------LGEMKRTANSVVKS---------ITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>SGETISAGQSTKSEVGTKTYTAQFPTGRASGNDTTGDFQVTDLYKNGLLFTAYNMSSRDSGSLRTMRSNYSSSSSSILRTARNTISNTVSKLSNGLISDNNSGTISKVPVANILLPRSKSDVDTSSHRFNDVQDSLITKGGGTATGVLSNMASTAVFGALESITQGIMADNNEQIYTTARSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAAEIKGYLDEWYRSTFIEPLTPEDAIKNKTLFEKMTSSLTNVLVVSNPTIWMVKNFGATSKFDGKTEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSSFTLEITMREIITLNRASLYTGTF</Hsp_hseq> + <Hsp_midline>SG SAGQS++S T TAQ+P+ R++GNDT+G +V DLYKNGLLFTAY+M+SR +G +R+MR + RTA + + + I+ N+ + K+PV NILLPRSKSDV++ SH+FNDV DSLI++GGGTATGVLSN+ASTAVFG LES+TQG+MAD+NEQIY TARSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA E+K LDEWY+STF++ LTP++A KN T+FEK+TS L+NV+VVSNPT+W V+NFG TSKFDG+ EVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPS+FTLEITMREI+TLNRAS+Y F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>8</Hit_num> + <Hit_id>gi|116326413|ref|YP_803133.1|</Hit_id> + <Hit_def>base plate [Enterobacteria phage RB32] >gi|228861506|ref|YP_002854527.1| gp48 base plate [Enterobacteria phage RB14] >gi|115344006|gb|ABI95015.1| base plate [Enterobacteria phage RB32] >gi|227438522|gb|ACP30835.1| gp48 base plate [Enterobacteria phage RB14] >gi|398313741|emb|CCI89088.1| phage baseplate tail tube cap (T4-like gp48) [Yersinia phage phiD1] >gi|525334459|gb|AGR46141.1| baseplate tail tube cap [Yersinia phage PST]</Hit_def> + <Hit_accession>YP_803133</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>488.419</Hsp_bit-score> + <Hsp_score>1256</Hsp_score> + <Hsp_evalue>3.32248e-169</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>15</Hsp_hit-from> + <Hsp_hit-to>364</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>237</Hsp_identity> + <Hsp_positive>286</Hsp_positive> + <Hsp_gaps>15</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGE------MKRTANSVVKS---------ITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>SGEKISAGQSTKSEVGTKTYTAQFPTGRASGNDTTGDFQVTDLYKNGLLFTAYNMSSRDSGSLRSMRSNYSSSSSSILRTARNTISSTVSKLSNGLISNNNSGTISKAPIANILLPRSKSDVDTSSHRFNDVQESLISRGGGTATGVLSNIASTAVFGALESITQGIMADNNEQIYTTARSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAAEIKGYLDDWYRSTLIEPLSPEDAAKNKTLFEKMTSSLTNVLVVSNPTVWMVKNFGATSKFDGKTEIFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREIITLNRASLYAGTF</Hsp_hseq> + <Hsp_midline>SG K SAGQS++S T TAQ+P+ R++GNDT+G +V DLYKNGLLFTAY+M+SR +G +RSMR + RTA + + S I+ N+ + K P+ NILLPRSKSDV++ SH+FNDV +SLISRGGGTATGVLSN+ASTAVFG LES+TQG+MAD+NEQIY TARSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA E+K LD+WY+ST ++ L+P++A KN T+FEK+TS L+NV+VVSNPTVW V+NFG TSKFDG+ E+FGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREI+TLNRAS+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>9</Hit_num> + <Hit_id>gi|639438843|ref|YP_009030800.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Escherichia phage e11/2] >gi|628971671|gb|AHY83393.1| baseplate tail tube cap [Escherichia phage e11/2]</Hit_def> + <Hit_accession>YP_009030800</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>486.878</Hsp_bit-score> + <Hsp_score>1252</Hsp_score> + <Hsp_evalue>1.3135e-168</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>15</Hsp_hit-from> + <Hsp_hit-to>364</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>236</Hsp_identity> + <Hsp_positive>286</Hsp_positive> + <Hsp_gaps>15</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGE------MKRTANSVVKS---------ITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>SGEKISAGQSTKSEVGTKTYTAQFPTGRASGNDTTGDFQVTDLYKNGLLFTAYNMSSRDSGSLRSMRSNYSSSSSSILRTARNTISNTVSKLSNGLISNNNSGTISKAPIANILLPRSKSDVDTSSHRFNDVQESLISRGGGTATGVLSNIASTAVFGALESITQGIMADNNEQIYTTARSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAAEIKGYLDDWYRSTLIEPLSPEDAAKNKTLFEKMTSSLTNVLVVSNPTVWMVKNFGATSKFDGKTEIFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREIITLNRASLYAGTF</Hsp_hseq> + <Hsp_midline>SG K SAGQS++S T TAQ+P+ R++GNDT+G +V DLYKNGLLFTAY+M+SR +G +RSMR + RTA + + + I+ N+ + K P+ NILLPRSKSDV++ SH+FNDV +SLISRGGGTATGVLSN+ASTAVFG LES+TQG+MAD+NEQIY TARSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA E+K LD+WY+ST ++ L+P++A KN T+FEK+TS L+NV+VVSNPTVW V+NFG TSKFDG+ E+FGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREI+TLNRAS+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>10</Hit_num> + <Hit_id>gi|330858711|ref|YP_004415086.1|</Hit_id> + <Hit_def>putative baseplate tail tube cap [Shigella phage Shfl2] >gi|422934608|ref|YP_007004569.1| phage baseplate protein [Enterobacteria phage ime09] >gi|327397645|gb|AEA73147.1| putative baseplate tail tube cap [Shigella phage Shfl2] >gi|339791391|gb|AEK12448.1| phage baseplate protein [Enterobacteria phage ime09]</Hit_def> + <Hit_accession>YP_004415086</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>486.493</Hsp_bit-score> + <Hsp_score>1251</Hsp_score> + <Hsp_evalue>1.49721e-168</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>15</Hsp_hit-from> + <Hsp_hit-to>364</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>236</Hsp_identity> + <Hsp_positive>284</Hsp_positive> + <Hsp_gaps>15</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKR------TANSVVKS---------ITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>SGEKISAGQSTKSEVATKTYTAQFPTGRASGNDTTGDFQVTDLYKNGLLFTAYNMSSRDSGSLRSMRSNYSSSSSSILSTARNTISSTVSKLSNGLISNNNSGTISKAPVANILLPRSKSDVDTSSHRFNDVQESLISRGGGTATGVLSNIASTAVFGALESITQGIMADNNEQIYTTARSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAAEIKGYLDDWYRSTLIEPLSPEDAAKNKTLFEKMTSSLTNVLVVSNPTIWMVKNFGATSKFDGKTEIFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREIITLNRASLYAGTF</Hsp_hseq> + <Hsp_midline>SG K SAGQS++S T TAQ+P+ R++GNDT+G +V DLYKNGLLFTAY+M+SR +G +RSMR TA + + S I+ N+ + K PV NILLPRSKSDV++ SH+FNDV +SLISRGGGTATGVLSN+ASTAVFG LES+TQG+MAD+NEQIY TARSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA E+K LD+WY+ST ++ L+P++A KN T+FEK+TS L+NV+VVSNPT+W V+NFG TSKFDG+ E+FGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREI+TLNRAS+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>11</Hit_num> + <Hit_id>gi|397134210|gb|AFO10717.1|</Hit_id> + <Hit_def>baseplate protein [Escherichia phage ECML-134]</Hit_def> + <Hit_accession>AFO10717</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>485.337</Hsp_bit-score> + <Hsp_score>1248</Hsp_score> + <Hsp_evalue>4.36088e-168</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>15</Hsp_hit-from> + <Hsp_hit-to>364</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>236</Hsp_identity> + <Hsp_positive>285</Hsp_positive> + <Hsp_gaps>15</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGE------MKRTANSVVKS---------ITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>SGEKISAGQSTKSEVGTKTYTAQFPTGRASGNDTTGDFQVTDLYKNGLLFTAYNMSSRDSGSLRSMRSNYSSSSSSILRTARNTISSTVSKLSNGLISNNNSGTISKSPIANTLLPRSKSDVDTSSHRFNDVQESLISRGGGTATGVLSNIASTAVFGALESITQGIMADNNEQIYTTARSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAAEIKGYLDDWYRSTLIEPLSPEDAAKNKTLFEKMTSSLTNVLVVSNPTVWMVKNFGATSKFDGKTEIFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREIITLNRASLYAGTF</Hsp_hseq> + <Hsp_midline>SG K SAGQS++S T TAQ+P+ R++GNDT+G +V DLYKNGLLFTAY+M+SR +G +RSMR + RTA + + S I+ N+ + K P+ N LLPRSKSDV++ SH+FNDV +SLISRGGGTATGVLSN+ASTAVFG LES+TQG+MAD+NEQIY TARSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA E+K LD+WY+ST ++ L+P++A KN T+FEK+TS L+NV+VVSNPTVW V+NFG TSKFDG+ E+FGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREI+TLNRAS+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>12</Hit_num> + <Hit_id>gi|9632645|ref|NP_049806.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Enterobacteria phage T4] >gi|138041|sp|P13339.3|VG48_BPT4 RecName: Full=Tail-tube assembly protein Gp48 [Enterobacteria phage T4] >gi|5354269|gb|AAD42476.1|AF158101_63 gp48 baseplate tail tube cap [Enterobacteria phage T4] >gi|215947|gb|AAA32539.1| tail-tube assembly protein [Enterobacteria phage T4] >gi|299780554|gb|ADJ39916.1| baseplate subunit [Enterobacteria phage T4T] >gi|628971799|gb|AHY83520.1| baseplate subunit [Enterobacteria phage T4] >gi|628972001|gb|AHY83721.1| baseplate subunit [Enterobacteria phage T4] >gi|628972192|gb|AHY83911.1| baseplate subunit [Enterobacteria phage T4]</Hit_def> + <Hit_accession>NP_049806</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>484.567</Hsp_bit-score> + <Hsp_score>1246</Hsp_score> + <Hsp_evalue>8.86163e-168</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>15</Hsp_hit-from> + <Hsp_hit-to>364</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>236</Hsp_identity> + <Hsp_positive>285</Hsp_positive> + <Hsp_gaps>15</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGE------MKRTANSVVKS---------ITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>SGEKISAGQSTKSEVGTKTYTAQFPTGRASGNDTTEDFQVTDLYKNGLLFTAYNMSSRDSGSLRSMRSNYSSSSSSILRTARNTISSTVSKLSNGLISNNNSGTISKSPIANILLPRSKSDVDTSSHRFNDVQESLISRGGGTATGVLSNIASTAVFGALESITQGIMADNNEQIYTTARSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAAEIKGYLDDWYRSTLIEPLSPEDAAKNKTLFEKMTSSLTNVLVVSNPTVWMVKNFGATSKFDGKTEIFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREIITLNRASLYAGTF</Hsp_hseq> + <Hsp_midline>SG K SAGQS++S T TAQ+P+ R++GNDT+ +V DLYKNGLLFTAY+M+SR +G +RSMR + RTA + + S I+ N+ + K P+ NILLPRSKSDV++ SH+FNDV +SLISRGGGTATGVLSN+ASTAVFG LES+TQG+MAD+NEQIY TARSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA E+K LD+WY+ST ++ L+P++A KN T+FEK+TS L+NV+VVSNPTVW V+NFG TSKFDG+ E+FGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREI+TLNRAS+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>13</Hit_num> + <Hit_id>gi|642905805|ref|YP_009037574.1|</Hit_id> + <Hit_def>baseplate subunit [Escherichia phage vB_EcoM_JS09] >gi|642903959|gb|AIA79979.1| baseplate subunit [Escherichia phage vB_EcoM_JS09]</Hit_def> + <Hit_accession>YP_009037574</Hit_accession> + <Hit_len>369</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>484.952</Hsp_bit-score> + <Hsp_score>1247</Hsp_score> + <Hsp_evalue>9.36795e-168</Hsp_evalue> + <Hsp_query-from>19</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>20</Hsp_hit-from> + <Hsp_hit-to>369</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>227</Hsp_identity> + <Hsp_positive>285</Hsp_positive> + <Hsp_gaps>17</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>VKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSIT-----------------GTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>VSESAGQSTKTETTTKTYVAQFPTGRAAGNDSTGDFQVTDLYKNGLLFTAYNMSARDSGSLRNLRPAYAGTSSNGIISDLTDNVKDAVTKFSNGLLPAGANKSTINKTPVANILLPRSKSDVDTTSHRFNDVGDSLITKGGGTATGVLSNIASTAVFGALDSITQGLMADNNEQIYTTSRSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAQEIKSYLDEWYRSTFIEPMTPDDAVKNKTLFEKITASLTNVLVVSNPTIWMVKNFGYTSKFDGLTDVFGPCQIQSVRFDKTPNGQFNGLAVAPNLPSTFTLEITMREIITLNRSSLYAGTF</Hsp_hseq> + <Hsp_midline>V SAGQS+++ T AQ+P+ R+AGND++G +V DLYKNGLLFTAY+M++R +G +R++R ++N ++ +T G N + ++K PV NILLPRSKSDV++ SH+FNDVGDSLI++GGGTATGVLSN+ASTAVFG L+S+TQGLMAD+NEQIY T+RSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA+E+KS LDEWY+STF++ +TPD+A KN T+FEKIT+ L+NV+VVSNPT+W V+NFG TSKFDG +VFGPCQIQS+RFDKTPNG FNGLA+APNLPSTFTLEITMREI+TLNR+S+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>14</Hit_num> + <Hit_id>gi|32453688|ref|NP_861897.1|</Hit_id> + <Hit_def>baseplate subunit [Enterobacteria phage RB69] >gi|32350507|gb|AAP76106.1| gp48 baseplate tail tube cap [Enterobacteria phage RB69] >gi|604671902|gb|AHV82896.1| baseplate tail tube cap [Escherichia phage vB_EcoM_PhAPEC2]</Hit_def> + <Hit_accession>NP_861897</Hit_accession> + <Hit_len>369</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>484.567</Hsp_bit-score> + <Hsp_score>1246</Hsp_score> + <Hsp_evalue>1.0678e-167</Hsp_evalue> + <Hsp_query-from>19</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>20</Hsp_hit-from> + <Hsp_hit-to>369</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>226</Hsp_identity> + <Hsp_positive>285</Hsp_positive> + <Hsp_gaps>17</Hsp_gaps> + <Hsp_align-len>350</Hsp_align-len> + <Hsp_qseq>VKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSIT-----------------GTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>VSESAGQSTKTETTTKTYVAQFPTGRAAGNDSTGDFQVTDLYKNGLLFTAYNMSARDSGSLRNLRPAYAGTSSNGIISDLTDNVKDAVTKFSNGLLPAGANKSTINKTPVANILLPRSKSDVDTTSHRFNDIGDSLITKGGGTATGVLSNIASTAVFGALDSITQGLMADNNEQIYTTSRSMYGGAENRTKVFTWDLTPRSTEDLMAIINIYQYFNYFSYGETGKSQYAQEIKSYLDEWYRSTFIEPMTPDDAVKNKTLFEKITASLTNVLVVSNPTIWMVKNFGHTSKFDGLTDVFGPCQIQSVRFDKTPNGQFNGLAVAPNLPSTFTLEITMREIITLNRSSLYAGTF</Hsp_hseq> + <Hsp_midline>V SAGQS+++ T AQ+P+ R+AGND++G +V DLYKNGLLFTAY+M++R +G +R++R ++N ++ +T G N + ++K PV NILLPRSKSDV++ SH+FND+GDSLI++GGGTATGVLSN+ASTAVFG L+S+TQGLMAD+NEQIY T+RSMYGGA+NRTKVFTWDLTPRS +DL+AII IY+YFNY+SYGETG S YA+E+KS LDEWY+STF++ +TPD+A KN T+FEKIT+ L+NV+VVSNPT+W V+NFG TSKFDG +VFGPCQIQS+RFDKTPNG FNGLA+APNLPSTFTLEITMREI+TLNR+S+YA F</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>15</Hit_num> + <Hit_id>gi|314121772|ref|YP_004063891.1|</Hit_id> + <Hit_def>gp48 baseplate subunit [Enterobacteria phage vB_EcoM-VR7] >gi|313151529|gb|ADR32585.1| gp48 baseplate subunit [Enterobacteria phage vB_EcoM-VR7]</Hit_def> + <Hit_accession>YP_004063891</Hit_accession> + <Hit_len>368</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>461.84</Hsp_bit-score> + <Hsp_score>1187</Hsp_score> + <Hsp_evalue>1.08287e-158</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>368</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>228</Hsp_identity> + <Hsp_positive>285</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>369</Hsp_align-len> + <Hsp_qseq>IKVRELD---DKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMR---LGEMKRTANSV----VKSIT----------GTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MKVKELDFDFDIAGLFNGGSKTSAGQS-KAAQTQATIVAQYPAERASGNDSSDDMRVNDLYKNGLLFTAYNFSSRTSPELRSDRSSQLTSLKKVSNGASFNPVKSLTSFAKSKLTGSGSTGKSFDSNAVANILLPRSKSDVESVSHRFNDVGESLITKGGGSATGILSNIASTAVFGALESVTNGVMADHGEQIYTTARSMYAGPDNRTKVYTWEMTPRSAQDLIQIVKIYEIFNYYSYGETGKSSFASELKDKIDTWYKSTFPSKRKAIDNFDGKLLGEEITSFLTNVLVVSNPTIWYIRNFGDTSSYDGRGELFGPCQIQSIRFDKSPDGHFGGLAIAPNLPSTFVLEITFREIITLNRGSLYAEGF</Hsp_hseq> + <Hsp_midline>+KV+ELD D G KTSAGQS ++A+ ++TI AQYP+ER++GND+S +RV+DLYKNGLLFTAY+ +SRT+ ++RS R L +K+ +N VKS+T G+ D V NILLPRSKSDVESVSH+FNDVG+SLI++GGG+ATG+LSN+ASTAVFG LES+T G+MADH EQIY TARSMY G DNRTKV+TW++TPRS QDLI I++IYE FNYYSYGETG S++A E+K ++D WYKSTF + + E+ITSFL+NV+VVSNPT+W++RNFG TS +DGR E+FGPCQIQSIRFDK+P+G+F GLAIAPNLPSTF LEIT REI+TLNR S+YAEGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>16</Hit_num> + <Hit_id>gi|308814557|ref|YP_003934831.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Shigella phage SP18] >gi|308206149|gb|ADO19548.1| baseplate tail tube cap [Shigella phage SP18]</Hit_def> + <Hit_accession>YP_003934831</Hit_accession> + <Hit_len>362</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>460.299</Hsp_bit-score> + <Hsp_score>1183</Hsp_score> + <Hsp_evalue>3.47109e-158</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>362</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>228</Hsp_identity> + <Hsp_positive>285</Hsp_positive> + <Hsp_gaps>21</Hsp_gaps> + <Hsp_align-len>366</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMR---LGEMKRTANSV----VKSIT----------GTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MKVKELD-IAGLFNGGSKTSAGQS-KAAQTQATIVAQYPAERASGNDSSDDMRVNDLYKNGLLFTAYNFSSRTSPELRSDRSSQLTSLKKVSNGASFNPVKSLTSFAKSKLTGAGSTGKSFDSNAVANILLPRSKSDVESVSHRFNDVGESLITKGGGSATGILSNIASTAVFGALESVTNGVMADHGEQIYTTARSMYAGPDNRTKVYTWEMTPRSAQDLIQIVKIYEIFNYYSYGETGKSSFASELKEKIDTWYKSTFKKEAIDNFDGK--LLGEEITSFLTNVLVVSNPTIWYIRNFGDTSSYDGRGELFGPCQIQSIRFDKSPDGHFGGLAIAPNLPSTFVLEITFREIITLNRGSLYAEGF</Hsp_hseq> + <Hsp_midline>+KV+ELD G KTSAGQS ++A+ ++TI AQYP+ER++GND+S +RV+DLYKNGLLFTAY+ +SRT+ ++RS R L +K+ +N VKS+T G+ D V NILLPRSKSDVESVSH+FNDVG+SLI++GGG+ATG+LSN+ASTAVFG LES+T G+MADH EQIY TARSMY G DNRTKV+TW++TPRS QDLI I++IYE FNYYSYGETG S++A E+K ++D WYKSTF + K + E+ITSFL+NV+VVSNPT+W++RNFG TS +DGR E+FGPCQIQSIRFDK+P+G+F GLAIAPNLPSTF LEIT REI+TLNR S+YAEGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>17</Hit_num> + <Hit_id>gi|422934215|ref|YP_007004251.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Enterobacteria phage Bp7] >gi|345450724|gb|AEN93927.1| baseplate tail tube cap [Enterobacteria phage Bp7]</Hit_def> + <Hit_accession>YP_007004251</Hit_accession> + <Hit_len>362</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>458.759</Hsp_bit-score> + <Hsp_score>1179</Hsp_score> + <Hsp_evalue>1.18966e-157</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>362</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>233</Hsp_identity> + <Hsp_positive>284</Hsp_positive> + <Hsp_gaps>23</Hsp_gaps> + <Hsp_align-len>367</Hsp_align-len> + <Hsp_qseq>IKVRELD-DKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEM---------KRTANS----VVKSITGTNTN--KVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKND--TVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MKVKELDFDVASLFKGGSKTSAGQSKTPA-IKTTVTAQYPAERASGNDTSTDMVLNDLYKNGLLFTAYNFSSRVSPDLRNDRSSQMTKKFSSAASKLTGNSGTYSAVKNLFGGNTKGVKFDTQALANILLPRSKSDVDSVSHKFNDVGESLITKGGGTATGILSNVASTAVFGALESVTNGVMADSGEQIYTTARSMYAGPDNRTKVFTWEMTPRNAQDLIQIIKIYEIFNYYSYGETGNSAFAGELKEKIDTWYRSTF----KKEAIDKFDGKLLGESITSFLSNVIVVSNPTIWYIRNFGDSSSYDGREDIFGPCQIQSIRFDKTPDGHFNGLAIAPNLPSTFSLEVTFREIITLNRGSLYTEGF</Hsp_hseq> + <Hsp_midline>+KV+ELD D G KTSAGQS A IK+T+TAQYP+ER++GNDTS + ++DLYKNGLLFTAY+ +SR + D+R+ R +M K T NS VK++ G NT K D + NILLPRSKSDV+SVSHKFNDVG+SLI++GGGTATG+LSNVASTAVFG LES+T G+MAD EQIY TARSMY G DNRTKVFTW++TPR+ QDLI II+IYE FNYYSYGETG S +A E+K ++D WY+STF + +K D + E ITSFLSNVIVVSNPT+W++RNFG +S +DGR ++FGPCQIQSIRFDKTP+G+FNGLAIAPNLPSTF+LE+T REI+TLNR S+Y EGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>18</Hit_num> + <Hit_id>gi|299779141|ref|YP_003734335.1|</Hit_id> + <Hit_def>48 gene product [Enterobacteria phage IME08] >gi|298105870|gb|ADI55514.1| gp48 baseplate tail tube cap [Enterobacteria phage IME08]</Hit_def> + <Hit_accession>YP_003734335</Hit_accession> + <Hit_len>363</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>451.825</Hsp_bit-score> + <Hsp_score>1161</Hsp_score> + <Hsp_evalue>7.00414e-155</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>2</Hsp_hit-from> + <Hsp_hit-to>363</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>228</Hsp_identity> + <Hsp_positive>283</Hsp_positive> + <Hsp_gaps>23</Hsp_gaps> + <Hsp_align-len>367</Hsp_align-len> + <Hsp_qseq>IKVRELD-DKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEM---------KRTAN----SVVKSITGTNTN--KVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKND--TVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MKVKELDFDVASLFKGGSKTSAGQS-KAKPIQTTVTAQYPAERASGNDTSTDMVLSDLYKNGLLFTAYNFSSRVSPDLRNDRSSQMTKKFSKATGKLTGNTGGFSAVKNLFSNNSKGVKFDNQALANILLPRSKSDVDSVTHKFNDVGESLITKGGGTATGILSNVASTAVFGALESVTNGVMADSGEQIYTTARSMYAGPDNRTKVFTWEMTPRNAQDLIQIIKIYEIFNYYSYGETGNSAFAGELKEKIDTWYRSTF----KKEAIDKFDGKLLGESITSFLSNVIVVSNPTIWYIRNFGDSSSYDGREDIFGPCQIQSIRFDKTPDGHFNGLAIAPNLPSTFSLEVTFREIITLNRGSLYTEGF</Hsp_hseq> + <Hsp_midline>+KV+ELD D G KTSAGQS ++ I++T+TAQYP+ER++GNDTS + + DLYKNGLLFTAY+ +SR + D+R+ R +M K T N S VK++ N+ K D + NILLPRSKSDV+SV+HKFNDVG+SLI++GGGTATG+LSNVASTAVFG LES+T G+MAD EQIY TARSMY G DNRTKVFTW++TPR+ QDLI II+IYE FNYYSYGETG S +A E+K ++D WY+STF + +K D + E ITSFLSNVIVVSNPT+W++RNFG +S +DGR ++FGPCQIQSIRFDKTP+G+FNGLAIAPNLPSTF+LE+T REI+TLNR S+Y EGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>19</Hit_num> + <Hit_id>gi|161622626|ref|YP_001595319.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Enterobacteria phage JS98] >gi|238695346|ref|YP_002922539.1| gp48 baseplate tail tube cap [Enterobacteria phage JS10] >gi|52139949|gb|AAU29319.1| gp48 baseplate tail tube cap [Enterobacteria phage JS98] >gi|220029482|gb|ACL78416.1| gp48 baseplate tail tube cap [Enterobacteria phage JS10]</Hit_def> + <Hit_accession>YP_001595319</Hit_accession> + <Hit_len>362</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>450.669</Hsp_bit-score> + <Hsp_score>1158</Hsp_score> + <Hsp_evalue>1.82386e-154</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>351</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>362</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>226</Hsp_identity> + <Hsp_positive>282</Hsp_positive> + <Hsp_gaps>19</Hsp_gaps> + <Hsp_align-len>365</Hsp_align-len> + <Hsp_qseq>IKVRELD-DKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEM---------KRTAN----SVVKSITGTNTN--KVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEGF</Hsp_qseq> + <Hsp_hseq>MKVKEIDIDVASLFKGGSKTSAGQS-KAKPAQTTVTAQYPAERASGNDTSTDMVLNDLYKNGLLFTAYNFSSRVSPDLRNDRSSQMTKKFSKAAGKLTSNTGGFSAVKNLFSNNSKGVKFDSQALANILLPRSKSDVDSVTHKFNDVGESLITKGGGTATGILSNVASTAVFGALESVTNGVMADSGEQIYTTARSMYAGPDNRTKVFTWEMTPRNAQDLIQIIKIYEIFNYYSYGETGNSAFAGELKEKIDTWYRSTFKKEAIDNFDGK--LLGEGITSFLSNVIVVSNPTIWYIRNFGNTSSYDGREDIFGPCQIQSIRFDKTPDGHFNGLAIAPNLPSTFSLEVTFREIITLNRGSLYTEGF</Hsp_hseq> + <Hsp_midline>+KV+E+D D G KTSAGQS ++ ++T+TAQYP+ER++GNDTS + ++DLYKNGLLFTAY+ +SR + D+R+ R +M K T+N S VK++ N+ K D + NILLPRSKSDV+SV+HKFNDVG+SLI++GGGTATG+LSNVASTAVFG LES+T G+MAD EQIY TARSMY G DNRTKVFTW++TPR+ QDLI II+IYE FNYYSYGETG S +A E+K ++D WY+STF + K + E ITSFLSNVIVVSNPT+W++RNFG TS +DGR ++FGPCQIQSIRFDKTP+G+FNGLAIAPNLPSTF+LE+T REI+TLNR S+Y EGF</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>20</Hit_num> + <Hit_id>gi|311992692|ref|YP_004009560.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Acinetobacter phage Ac42] >gi|298684475|gb|ADI96436.1| gp48 baseplate tail tube cap [Acinetobacter phage Ac42]</Hit_def> + <Hit_accession>YP_004009560</Hit_accession> + <Hit_len>358</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>447.588</Hsp_bit-score> + <Hsp_score>1150</Hsp_score> + <Hsp_evalue>2.52876e-153</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>355</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>217</Hsp_identity> + <Hsp_positive>280</Hsp_positive> + <Hsp_gaps>14</Hsp_gaps> + <Hsp_align-len>358</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMR---------LGEMKRTA-NSVVKSITGTNTNK-VDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>MKVKEIT-IANIVQAGTDVSAGYTNKRSEPK-TMIAQYPSERSSGNDAS-DMQISDLYRNGLLFTAYDYKSRTTPDMRGMRKREQNKVKALYEQTRTQFNRITSGITSESPKKSVSQDPVANILMPRSKSDSENINHKFNDVGDSLITKGGGTMTGAISNMASTAVFGAIESMTQGLLSDKGEQIYTTARSMYAGPENRTKVYSWELTPRTIDDLVQIIRIYEIFNFYSYGMTGNSQYAKELKSQIDEWYKKTFINNLTPEGSDRSGTMMESVTAFLSNVIVVTNPTVWFVRNFGKTTKFDGRPDVFGPAQIQSIRFDKAPDGNFRGLSIAPNMPSTFVLEVTMREILTLSRGTLYGD</Hsp_hseq> + <Hsp_midline>+KV+E+ + + +G SAG +++ ++ K T+ AQYPSERS+GND S +++ DLY+NGLLFTAYD SRTT DMR MR L E RT N + IT + K V + PV NIL+PRSKSD E+++HKFNDVGDSLI++GGGT TG +SN+ASTAVFG +ES+TQGL++D EQIY TARSMY G +NRTKV++W+LTPR++ DL+ II IYE FN+YSYG TG S YAKE+KSQ+DEWYK TF++ LTP+ ++++ T+ E +T+FLSNVIVV+NPTVWFVRNFG T+KFDGR +VFGP QIQSIRFDK P+GNF GL+IAPN+PSTF LE+TMREILTL+R ++Y +</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>21</Hit_num> + <Hit_id>gi|326536336|ref|YP_004300777.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Acinetobacter phage 133] >gi|299483417|gb|ADJ19511.1| gp48 baseplate tail tube cap [Acinetobacter phage 133]</Hit_def> + <Hit_accession>YP_004300777</Hit_accession> + <Hit_len>356</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>438.343</Hsp_bit-score> + <Hsp_score>1126</Hsp_score> + <Hsp_evalue>1.19665e-149</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>13</Hsp_hit-from> + <Hsp_hit-to>354</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>210</Hsp_identity> + <Hsp_positive>264</Hsp_positive> + <Hsp_gaps>13</Hsp_gaps> + <Hsp_align-len>344</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRT-----------ANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>AGTEISAGYTKQDT-TQQTFSAQYPAERSAGNDATKTSN-GDLYRNGLLFTAYDYKARATPDMTRQRQGELDKARSLYTRISSGLADAGKRSSTQGQDKKIVKDPVANILLPRSKSDSDVVSHKFNDVQDSLITRGGGTATGILSNIASTAVFGTIESVTQGWMADKGEQIFNASRSMYNGAENRSKVYTWELTPRTLEDLVEIMKIYEIFNYYSYGMTGTSAYAKELKAYIDDWYKKTFLNNLTPEGSDKSGTAMESVTSFLSNVITVSNPTIWFVRNFGKSTKFDGRPDVFGPAQIQSIRFDKAPEGHFKGLAIAPNMPSTFVLEITMREVIALSRGSIYGE</Hsp_hseq> + <Hsp_midline>+G + SAG + Q + T +AQYP+ERSAGND + + DLY+NGLLFTAYD +R T DM R GE+ + A++ +S T K+ K PV NILLPRSKSD + VSHKFNDV DSLI+RGGGTATG+LSN+ASTAVFG +ES+TQG MAD EQI+N +RSMY GA+NR+KV+TW+LTPR+++DL+ I++IYE FNYYSYG TGTS YAKE+K+ +D+WYK TFL+ LTP+ ++K+ T E +TSFLSNVI VSNPT+WFVRNFG ++KFDGR +VFGP QIQSIRFDK P G+F GLAIAPN+PSTF LEITMRE++ L+R S+Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>22</Hit_num> + <Hit_id>gi|311992948|ref|YP_004009815.1|</Hit_id> + <Hit_def>gp48 baseplate [Acinetobacter phage Acj61] >gi|295815237|gb|ADG36163.1| gp48 baseplate [Acinetobacter phage Acj61]</Hit_def> + <Hit_accession>YP_004009815</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>416.001</Hsp_bit-score> + <Hsp_score>1068</Hsp_score> + <Hsp_evalue>9.51542e-141</Hsp_evalue> + <Hsp_query-from>5</Hsp_query-from> + <Hsp_query-to>348</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>360</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>203</Hsp_identity> + <Hsp_positive>264</Hsp_positive> + <Hsp_gaps>14</Hsp_gaps> + <Hsp_align-len>358</Hsp_align-len> + <Hsp_qseq>VRELDDKTDALIS--GVKTSAGQSSQSAKIKSTI-TAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSV---------VKSITGTNTNKVDKI--PVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYA</Hsp_qseq> + <Hsp_hseq>VKEIVDSETNLIERIGSFVAAGRSSKEEESKTKIFEAQYPDGRAAATDSVDDARIQDLYANGLLFTAVEYKGRTTPEMTDMRGQVMKNMVDAIDQAKGVFNQLRGKSGGNKKISSAIKNPVCQILLPRSKTDTDTISHKFNDVNESLITRGNGTATGILSNLASTAVFGAVESISQGVMADHGEQIYNTSRAMYGGAENRTKTYTWELTPRTEGDLVQIIRIYELFSFFSYGVTGNSAYAKEIKGQIDDWYKKTFINNLTPEGADRSGTMMESVTSFLSNVIVVSNPTVWFIQNFGTMTTYDKHADVFGPAQISNIRFDKAPDGNFSGLAIAPNMPSTFVLEITFREILTLNRGSLYG</Hsp_hseq> + <Hsp_midline>V+E+ D LI G +AG+SS+ + K+ I AQYP R+A D+ R+ DLY NGLLFTA + RTT +M MR MK +++ ++ +G N I PV ILLPRSK+D +++SHKFNDV +SLI+RG GTATG+LSN+ASTAVFG +ES++QG+MADH EQIYNT+R+MYGGA+NRTK +TW+LTPR+ DL+ II IYE F+++SYG TG S YAKE+K Q+D+WYK TF++ LTP+ A+++ T+ E +TSFLSNVIVVSNPTVWF++NFGT + +D A+VFGP QI +IRFDK P+GNF+GLAIAPN+PSTF LEIT REILTLNR S+Y </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>23</Hit_num> + <Hit_id>gi|311993474|ref|YP_004010339.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Acinetobacter phage Acj9] >gi|295917431|gb|ADG60102.1| gp48 baseplate tail tube cap [Acinetobacter phage Acj9]</Hit_def> + <Hit_accession>YP_004010339</Hit_accession> + <Hit_len>360</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>412.92</Hsp_bit-score> + <Hsp_score>1060</Hsp_score> + <Hsp_evalue>1.46922e-139</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>357</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>212</Hsp_identity> + <Hsp_positive>267</Hsp_positive> + <Hsp_gaps>22</Hsp_gaps> + <Hsp_align-len>363</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALIS--GVKTSAGQSSQSAKIKSTI-TAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMR-----LGEMKRTANSVVKSI---TGTNTNKVDKI--PVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEA---NKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>MQIEEITD----LVSKAGSDISAGQSMRSQESETKILTAQYPAERSASVANTADVGVGQSYSNGLLFTAFEYKSRTTNDLRSMRTKAQNAAKVLRSSKSVTKAIQAVTGGNPNDPNTIKNPVANILMPRSKTDTDVTGHKFNDVGESLISRGGGTATGILSNVASTAVFGTIESVTKGAMADHGEQIYNTSRSMYAGAENRVKTYTWELTPRTYDDLTQIVKIYEIFNYLSYGMTGKSAFAKGVKDEIDKWYRKTFINPL--NEATGSNVQSTTMESVTSFLSNVIVVSNPTVWTIQNFGTASKFDGLADVFGPAQISNIRFDKAPDGQFNGLAAAPNMPSSFVLEVTFREILTLNRATIYGE</Hsp_hseq> + <Hsp_midline>+++ E+ D L+S G SAGQS +S + ++ I TAQYP+ERSA + + V Y NGLLFTA++ SRTT D+RSMR ++ R++ SV K+I TG N N + I PV NIL+PRSK+D + HKFNDVG+SLISRGGGTATG+LSNVASTAVFG +ES+T+G MADH EQIYNT+RSMY GA+NR K +TW+LTPR+ DL I++IYE FNY SYG TG S +AK VK ++D+WY+ TF++ L +EA N T E +TSFLSNVIVVSNPTVW ++NFGT SKFDG A+VFGP QI +IRFDK P+G FNGLA APN+PS+F LE+T REILTLNRA++Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>24</Hit_num> + <Hit_id>gi|639438515|ref|YP_009030255.1|</Hit_id> + <Hit_def>baseplate subunit [Serratia phage PS2] >gi|625370588|gb|AHY25448.1| baseplate subunit [Serratia phage PS2]</Hit_def> + <Hit_accession>YP_009030255</Hit_accession> + <Hit_len>358</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>411.379</Hsp_bit-score> + <Hsp_score>1056</Hsp_score> + <Hsp_evalue>4.21058e-139</Hsp_evalue> + <Hsp_query-from>18</Hsp_query-from> + <Hsp_query-to>350</Hsp_query-to> + <Hsp_hit-from>20</Hsp_hit-from> + <Hsp_hit-to>357</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>201</Hsp_identity> + <Hsp_positive>252</Hsp_positive> + <Hsp_gaps>7</Hsp_gaps> + <Hsp_align-len>339</Hsp_align-len> + <Hsp_qseq>GVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTA----NSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANK--NDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAEG</Hsp_qseq> + <Hsp_hseq>GETIGAGSTGQKKLIQKTLQAQFPAERSAGTDGSSDLRVNDLYRNGLLFTAYDFDARTTQALRDFRKKNNTKTVLDQWNPIKFLTNYGSTFQLNQEAVANILMPRSQSDVDNISHKFNDVGESLTGRNGGDVGKTISNMASTAVFGALESVTQGIMADKGEQVYNSARSMYAGPDNRTKIFVWNLTPRTVYDLLEILKIYEIFAYYSYGRVGYSPWAKDLKSQIDAWYKET-LTKATFDQAKGEVKDTFFEGITDFLTNVITVSNPTIWTVKNFGRTSSFDGKTDIFGPCQIQSIRFDKSPNGHFNGLAIAPNLPSTFVLEITMREIMTLNRDVLFAEG</Hsp_hseq> + <Hsp_midline>G AG + Q I+ T+ AQ+P+ERSAG D S LRV+DLY+NGLLFTAYD ++RTT +R R +T N + +T ++++ V NIL+PRS+SDV+++SHKFNDVG+SL R GG +SN+ASTAVFG LES+TQG+MAD EQ+YN+ARSMY G DNRTK+F W+LTPR+V DL+ I++IYE F YYSYG G S +AK++KSQ+D WYK T L T D+A DT FE IT FL+NVI VSNPT+W V+NFG TS FDG+ ++FGPCQIQSIRFDK+PNG+FNGLAIAPNLPSTF LEITMREI+TLNR ++AEG</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>25</Hit_num> + <Hit_id>gi|33620542|ref|NP_891751.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Enterobacteria phage RB49] >gi|33348009|gb|AAQ15410.1| gp48 baseplate tail tube cap [Enterobacteria phage RB49]</Hit_def> + <Hit_accession>NP_891751</Hit_accession> + <Hit_len>352</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>408.683</Hsp_bit-score> + <Hsp_score>1049</Hsp_score> + <Hsp_evalue>4.9384e-138</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>348</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>349</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>200</Hsp_identity> + <Hsp_positive>260</Hsp_positive> + <Hsp_gaps>13</Hsp_gaps> + <Hsp_align-len>354</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSIT--------GTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYA</Hsp_qseq> + <Hsp_hseq>MKISVINDAVDSFKAGVKTSAGFTSKNKG--KTLTAQFPAERASGNDASG-YYINDLYNNGLLFTAYDYTSRTTGSLRDFR--KKKNVASGFGGSVNIAGFDLNLGGRNAAFDREAIANILLPRSQSDVDAASHKFNDVGESVISRGGGTLGGALSNMASTAVFGGIESITGGYLADHGEQIYNTARSMYAGADARTKNYVWHLTPRSIEDLRNILIIYETFLELSYGSSGISSTAKELKAEVDAWYKNTLLRKSTPEEAKRNDTLFEGITDFLSNVITVSNPTIWMISNFGKRTSFEGRSDAFGPAQISSVRLDKSPDGKFNGLAISPNLPSTFVLEVTFREILTLSRGTIFG</Hsp_hseq> + <Hsp_midline>+K+ ++D D+ +GVKTSAG +S++ T+TAQ+P+ER++GND SG ++DLY NGLLFTAYD SRTTG +R R + K A+ S+ G D+ + NILLPRS+SDV++ SHKFNDVG+S+ISRGGGT G LSN+ASTAVFGG+ES+T G +ADH EQIYNTARSMY GAD RTK + W LTPRS++DL I+ IYE F SYG +G S+ AKE+K+++D WYK+T L TP+EA +NDT+FE IT FLSNVI VSNPT+W + NFG + F+GR++ FGP QI S+R DK+P+G FNGLAI+PNLPSTF LE+T REILTL+R +++ </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>26</Hit_num> + <Hit_id>gi|238695065|ref|YP_002922259.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Enterobacteria phage JSE] >gi|220029201|gb|ACL78136.1| gp48 baseplate tail tube cap [Enterobacteria phage JSE]</Hit_def> + <Hit_accession>YP_002922259</Hit_accession> + <Hit_len>352</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>406.757</Hsp_bit-score> + <Hsp_score>1044</Hsp_score> + <Hsp_evalue>2.44502e-137</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>348</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>349</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>199</Hsp_identity> + <Hsp_positive>259</Hsp_positive> + <Hsp_gaps>13</Hsp_gaps> + <Hsp_align-len>354</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSIT--------GTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYA</Hsp_qseq> + <Hsp_hseq>MKISVINDAVDSFKAGVKTSAGFTSKNKG--KTLTAQFPAERASGNDASG-YYINDLYNNGLLFTAYDYTSRTTGSLRDFR--KKKNVASGFGGSVNIAGFDLNLGGRNAAFDREAIANILLPRSQSDVDAASHKFNDVGESVISRGGGTLGGALSNMASTAVFGGIESITGGYLADHGEQIYNTARSMYAGADARTKNYVWHLTPRSIEDLRNILIIYETFLELSYGSSGISSTAKELKAEVDAWYKNTLLSKSTPAEAKRNDTLFEGITDFLSNVITVSNPTIWMISNFGKRTSFEGRSDAFGPAQISSVRLDKSPDGKFNGLAISPNLPSTFVLEVSFREILTLSRGTIFG</Hsp_hseq> + <Hsp_midline>+K+ ++D D+ +GVKTSAG +S++ T+TAQ+P+ER++GND SG ++DLY NGLLFTAYD SRTTG +R R + K A+ S+ G D+ + NILLPRS+SDV++ SHKFNDVG+S+ISRGGGT G LSN+ASTAVFGG+ES+T G +ADH EQIYNTARSMY GAD RTK + W LTPRS++DL I+ IYE F SYG +G S+ AKE+K+++D WYK+T L TP EA +NDT+FE IT FLSNVI VSNPT+W + NFG + F+GR++ FGP QI S+R DK+P+G FNGLAI+PNLPSTF LE++ REILTL+R +++ </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>27</Hit_num> + <Hit_id>gi|157311484|ref|YP_001469527.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Enterobacteria phage Phi1] >gi|149380688|gb|ABR24693.1| gp48 baseplate tail tube cap [Enterobacteria phage Phi1]</Hit_def> + <Hit_accession>YP_001469527</Hit_accession> + <Hit_len>352</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>405.601</Hsp_bit-score> + <Hsp_score>1041</Hsp_score> + <Hsp_evalue>6.50999e-137</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>348</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>349</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>198</Hsp_identity> + <Hsp_positive>259</Hsp_positive> + <Hsp_gaps>13</Hsp_gaps> + <Hsp_align-len>354</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSIT--------GTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYA</Hsp_qseq> + <Hsp_hseq>MKISVINDAVDSFKAGVKTSAGFTSKNKG--KTLTAQFPAERASGNDASG-YYINDLYNNGLLFTAYDYTSRTTGSLRDFR--KKKNVASGFGGSVNIAGFDLNLGGRNAAFDREAIANILLPRSQSDVDAASHKFNDVGESVISRGGGTLGGALSNMASTAVFGGIESITGGYLADHGEQIYNTARSMYAGADARTKNYVWHLTPRSIEDLRNILIIYETFLELSYGSSGISSTAKELKAEVDAWYKNTLLRKSTPEEAKRNDTLFEGITDFLSNAITVSNPTIWMISNFGKRTSFEGRSDAFGPAQISSVRLDKSPDGKFNGLAISPNLPSTFVLEVSFREILTLSRGTIFG</Hsp_hseq> + <Hsp_midline>+K+ ++D D+ +GVKTSAG +S++ T+TAQ+P+ER++GND SG ++DLY NGLLFTAYD SRTTG +R R + K A+ S+ G D+ + NILLPRS+SDV++ SHKFNDVG+S+ISRGGGT G LSN+ASTAVFGG+ES+T G +ADH EQIYNTARSMY GAD RTK + W LTPRS++DL I+ IYE F SYG +G S+ AKE+K+++D WYK+T L TP+EA +NDT+FE IT FLSN I VSNPT+W + NFG + F+GR++ FGP QI S+R DK+P+G FNGLAI+PNLPSTF LE++ REILTL+R +++ </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>28</Hit_num> + <Hit_id>gi|401824981|gb|AFQ22671.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Stenotrophomonas phage IME13]</Hit_def> + <Hit_accession>AFQ22671</Hit_accession> + <Hit_len>342</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>368.237</Hsp_bit-score> + <Hsp_score>944</Hsp_score> + <Hsp_evalue>2.03823e-122</Hsp_evalue> + <Hsp_query-from>8</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>7</Hsp_hit-from> + <Hsp_hit-to>341</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>181</Hsp_identity> + <Hsp_positive>241</Hsp_positive> + <Hsp_gaps>13</Hsp_gaps> + <Hsp_align-len>345</Hsp_align-len> + <Hsp_qseq>LDDKTDALISGV---KTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>LDGGVQDVVGGILKGENPATGSSPRRPISKIAIAQFPAERNSANDSAQDFNVNDLYKNGLILSAFNYAGRQTGDLRSFRSGQ-----NNI-----GDYRKGVVKEAIANILMPRGQTDVDTISHKFNDVQQSLVERGDSSVTGALSSMASHALFGGLESITQGAFADRGEQVYITSRAMYAGADNRTKTYTWQLTPRNVYDLMQILIIYEMLSYYSYGAVEKSKTASQIKSTLDKAYKETFINPLTPEATHGQTTMFERITSFLSNVNVVSNPIIWTIRNFGETSSFDMRSDVFGPAQIQSIRFDKSPDGHFGGLAIAPNLPSSFVLEVTFREILALNRSDLYDE</Hsp_hseq> + <Hsp_midline>LD ++ G+ + A SS I AQ+P+ER++ ND++ V+DLYKNGL+ +A++ R TGD+RS R G+ N++ G V K + NIL+PR ++DV+++SHKFNDV SL+ RG + TG LS++AS A+FGGLES+TQG AD EQ+Y T+R+MY GADNRTK +TW LTPR+V DL+ I+ IYE +YYSYG S A ++KS LD+ YK TF++ LTP+ + T+FE+ITSFLSNV VVSNP +W +RNFG TS FD R++VFGP QIQSIRFDK+P+G+F GLAIAPNLPS+F LE+T REIL LNR+ +Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>29</Hit_num> + <Hit_id>gi|472438117|ref|YP_007677897.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Aeromonas phage Aes012] >gi|395653255|gb|AFN69810.1| baseplate tail tube cap [Aeromonas phage Aes012]</Hit_def> + <Hit_accession>YP_007677897</Hit_accession> + <Hit_len>342</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>364.385</Hsp_bit-score> + <Hsp_score>934</Hsp_score> + <Hsp_evalue>7.92274e-121</Hsp_evalue> + <Hsp_query-from>8</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>7</Hsp_hit-from> + <Hsp_hit-to>341</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>178</Hsp_identity> + <Hsp_positive>240</Hsp_positive> + <Hsp_gaps>13</Hsp_gaps> + <Hsp_align-len>345</Hsp_align-len> + <Hsp_qseq>LDDKTDALISGV---KTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>LDGGVQDVVGGILKGENPATGSSPRRPISKIAIAQFPAERNSANDSAQDFNVNDLYKNGLILSAFNYAGRQTGDLRSFRSGQ-----NNI-----GDYRKGVVKEAIANILMPRGQTDVDTISHKFNDVQQSLVERGDSSVTGALSSMASHALYGGLESITQGAFADRGEQVYIASRAMYAGADNRTKTYTWQLTPRNVYDLMQILIIYEMLSYYSYGAVEKSKTASQIKSTLDKAYKETFINPLTPEATHGQTTMFERITSFLSNVNVVSNPIIWTIRNFGETSSFDMRSDMFGPAQIQSIRFDKSPDGHFGGLAIAPNLPSSFVLEVTFREILALNRSDLYDE</Hsp_hseq> + <Hsp_midline>LD ++ G+ + A SS I AQ+P+ER++ ND++ V+DLYKNGL+ +A++ R TGD+RS R G+ N++ G V K + NIL+PR ++DV+++SHKFNDV SL+ RG + TG LS++AS A++GGLES+TQG AD EQ+Y +R+MY GADNRTK +TW LTPR+V DL+ I+ IYE +YYSYG S A ++KS LD+ YK TF++ LTP+ + T+FE+ITSFLSNV VVSNP +W +RNFG TS FD R+++FGP QIQSIRFDK+P+G+F GLAIAPNLPS+F LE+T REIL LNR+ +Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>30</Hit_num> + <Hit_id>gi|310722276|ref|YP_003969100.1|</Hit_id> + <Hit_def>unnamed protein product [Aeromonas phage phiAS4] >gi|306021119|gb|ADM79654.1| baseplate protein [Aeromonas phage phiAS4]</Hit_def> + <Hit_accession>YP_003969100</Hit_accession> + <Hit_len>342</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>363.999</Hsp_bit-score> + <Hsp_score>933</Hsp_score> + <Hsp_evalue>1.00609e-120</Hsp_evalue> + <Hsp_query-from>8</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>11</Hsp_hit-from> + <Hsp_hit-to>341</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>177</Hsp_identity> + <Hsp_positive>239</Hsp_positive> + <Hsp_gaps>11</Hsp_gaps> + <Hsp_align-len>342</Hsp_align-len> + <Hsp_qseq>LDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>VQDVVGGILKGENPATG-SSPRRPISKIAIAQFPAERNSANDSAQDFNVNDLYKNGLILSAFNYAGRQTGDLRSFRSGQ-----NNI-----GDYRKGVVKEAIANILMPRGQTDVDTISHKFNDVQQSLVERGDSSVTGALSSMASHALYGGLESITQGAFADRGEQVYIASRAMYAGAENRTKTYTWQLTPRNVYDLMQILIIYEMLSYYSYGAVEKSKTASQIKSTLDKAYKETFINPLTPEATHGQTTMFERITSFLSNVNVVSNPIIWTIRNFGETSSFDMRSDVFGPAQIQSIRFDKSPDGHFGGLAIAPNLPSSFVLEVTFREILALNRSDLYDE</Hsp_hseq> + <Hsp_midline>+ D ++ G + G SS I AQ+P+ER++ ND++ V+DLYKNGL+ +A++ R TGD+RS R G+ N++ G V K + NIL+PR ++DV+++SHKFNDV SL+ RG + TG LS++AS A++GGLES+TQG AD EQ+Y +R+MY GA+NRTK +TW LTPR+V DL+ I+ IYE +YYSYG S A ++KS LD+ YK TF++ LTP+ + T+FE+ITSFLSNV VVSNP +W +RNFG TS FD R++VFGP QIQSIRFDK+P+G+F GLAIAPNLPS+F LE+T REIL LNR+ +Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>31</Hit_num> + <Hit_id>gi|109290161|ref|YP_656410.1|</Hit_id> + <Hit_def>gp48 base plate protein [Aeromonas phage 25] >gi|423262259|ref|YP_007010858.1| baseplate tail tube cap [Aeromonas phage Aes508] >gi|104345834|gb|ABF72734.1| gp48 base plate protein [Aeromonas phage 25] >gi|402762137|gb|AFQ97251.1| baseplate tail tube cap [Aeromonas phage Aes508]</Hit_def> + <Hit_accession>YP_656410</Hit_accession> + <Hit_len>342</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>362.459</Hsp_bit-score> + <Hsp_score>929</Hsp_score> + <Hsp_evalue>3.78445e-120</Hsp_evalue> + <Hsp_query-from>8</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>11</Hsp_hit-from> + <Hsp_hit-to>341</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>176</Hsp_identity> + <Hsp_positive>238</Hsp_positive> + <Hsp_gaps>11</Hsp_gaps> + <Hsp_align-len>342</Hsp_align-len> + <Hsp_qseq>LDDKTDALISGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>IQDVVGGILKGENPATG-SSPRRPISKIAIAQFPAERNSANDSAQDFNVNDLYKNGLILSAFNYAGRQTGDLRSFRSGQ-----NNI-----GDYRKGVVKEAIANILMPRGQTDVDTISHKFNDVQQSLVERGDSSVTGALSSMASHALYGGLESITQGAFADRGEQVYIASRAMYAGAENRTKTYTWQLTPRNVYDLMQILTIYEMLSYYSYGAVEKSKTASQIKSTLDNAYKETFINPLTPEATHGQTTMFERITSFLSNVNVVSNPIIWTIRNFGETSSFDMRSDMFGPAQIQSIRFDKSPDGHFGGLAIAPNLPSSFVLEVTFREILALNRSDLYDE</Hsp_hseq> + <Hsp_midline>+ D ++ G + G SS I AQ+P+ER++ ND++ V+DLYKNGL+ +A++ R TGD+RS R G+ N++ G V K + NIL+PR ++DV+++SHKFNDV SL+ RG + TG LS++AS A++GGLES+TQG AD EQ+Y +R+MY GA+NRTK +TW LTPR+V DL+ I+ IYE +YYSYG S A ++KS LD YK TF++ LTP+ + T+FE+ITSFLSNV VVSNP +W +RNFG TS FD R+++FGP QIQSIRFDK+P+G+F GLAIAPNLPS+F LE+T REIL LNR+ +Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>32</Hit_num> + <Hit_id>gi|37651665|ref|NP_932539.1|</Hit_id> + <Hit_def>baseplate subunit [Aeromonas phage 44RR2.8t] >gi|66391986|ref|YP_238911.1| baseplate tail tube cap [Aeromonas phage 31] >gi|34732965|gb|AAQ81502.1| baseplate tail tube cap [Aeromonas phage 44RR2.8t] >gi|62114823|gb|AAX63671.1| gp48 [Aeromonas phage 31]</Hit_def> + <Hit_accession>NP_932539</Hit_accession> + <Hit_len>342</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>362.073</Hsp_bit-score> + <Hsp_score>928</Hsp_score> + <Hsp_evalue>5.01898e-120</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>341</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>174</Hsp_identity> + <Hsp_positive>245</Hsp_positive> + <Hsp_gaps>14</Hsp_gaps> + <Hsp_align-len>351</Hsp_align-len> + <Hsp_qseq>IKVREL-DDKTDALISGV---KTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>MKVTELIDGGVQDVVKGILKGENPAGGSTPRQPLSKITIAQFPAERNAANDSTQDFNVNDLYKNGLLLSAFNYSGRQTGDLRSFRTDQ-----NNI-----GDYRKGVVKEAIANILMPKGQTDIDTINHKFNDVQQSLVERGNGSITGALSSMASHAVYGGLESITQGAFADRGEQVYIASRAMYAGAENRTKTYTWQLTPRNVYDLVEIIKIYEMLSYYSYGSVEKSNTANDIRKSVDAAYKETIINPLTPEATHGQTTMFERITSFLSNVNVVSNPIIWTIRNFGQSSSFDSRSDIFGPAQIQSIRFDKSPDGHFGGLAVAPNLPSSFVLEVTFREILALNRSDLYSE</Hsp_hseq> + <Hsp_midline>+KV EL D ++ G+ + AG S+ + AQ+P+ER+A ND++ V+DLYKNGLL +A++ + R TGD+RS R + N++ G V K + NIL+P+ ++D+++++HKFNDV SL+ RG G+ TG LS++AS AV+GGLES+TQG AD EQ+Y +R+MY GA+NRTK +TW LTPR+V DL+ II+IYE +YYSYG S A +++ +D YK T ++ LTP+ + T+FE+ITSFLSNV VVSNP +W +RNFG +S FD R+++FGP QIQSIRFDK+P+G+F GLA+APNLPS+F LE+T REIL LNR+ +Y+E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>33</Hit_num> + <Hit_id>gi|582955110|gb|AHI44678.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Acinetobacter phage ZZ1]</Hit_def> + <Hit_accession>AHI44678</Hit_accession> + <Hit_len>216</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>302.753</Hsp_bit-score> + <Hsp_score>774</Hsp_score> + <Hsp_evalue>1.69313e-98</Hsp_evalue> + <Hsp_query-from>138</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>213</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>139</Hsp_identity> + <Hsp_positive>171</Hsp_positive> + <Hsp_gaps>1</Hsp_gaps> + <Hsp_align-len>213</Hsp_align-len> + <Hsp_qseq>ISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEA-NKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>MTRGNGSPTGILSNMASTAVFGAIESATQGAMADHGEQIYNTSRSMYAGAENRTKTYSWDLTPRTPEDLSQILKIYEIFNYLSYGMTGNSAFAKSIKDEIDNWYKKTFIKPINDATGTTTQSTVMESVTSFLSNVIVVSNPTVWFIQNFGTQSKYDGLADIFGPAQISNIRFEKTSDGNFNGLAIAPNMPSTFVLEVTFREILTLNRASLYGE</Hsp_hseq> + <Hsp_midline>++RG G+ TG+LSN+ASTAVFG +ES TQG MADH EQIYNT+RSMY GA+NRTK ++WDLTPR+ +DL I++IYE FNY SYG TG S +AK +K ++D WYK TF+ + TV E +TSFLSNVIVVSNPTVWF++NFGT SK+DG A++FGP QI +IRF+KT +GNFNGLAIAPN+PSTF LE+T REILTLNRAS+Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>34</Hit_num> + <Hit_id>gi|392973134|ref|YP_006489092.1|</Hit_id> + <Hit_def>putative split baseplate tail tube cap [Acinetobacter phage ZZ1]</Hit_def> + <Hit_accession>YP_006489092</Hit_accession> + <Hit_len>202</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>284.263</Hsp_bit-score> + <Hsp_score>726</Hsp_score> + <Hsp_evalue>1.55814e-91</Hsp_evalue> + <Hsp_query-from>152</Hsp_query-from> + <Hsp_query-to>349</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>199</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>131</Hsp_identity> + <Hsp_positive>159</Hsp_positive> + <Hsp_gaps>1</Hsp_gaps> + <Hsp_align-len>199</Hsp_align-len> + <Hsp_qseq>VASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANK-NDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVYAE</Hsp_qseq> + <Hsp_hseq>MASTAVFGAIESATQGAMADHGEQIYNTSRSMYAGAENRTKTYSWDLTPRTPEDLSQILKIYEIFNYLSYGMTGNSAFAKSIKDEIDNWYKKTFIKPINDATGTTTQSTVMESVTSFLSNVIVVSNPTVWFIQNFGTQSKYDGLADIFGPAQISNIRFEKTSDGNFNGLAIAPNMPSTFVLEVTFREILTLNRASLYGE</Hsp_hseq> + <Hsp_midline>+ASTAVFG +ES TQG MADH EQIYNT+RSMY GA+NRTK ++WDLTPR+ +DL I++IYE FNY SYG TG S +AK +K ++D WYK TF+ + TV E +TSFLSNVIVVSNPTVWF++NFGT SK+DG A++FGP QI +IRF+KT +GNFNGLAIAPN+PSTF LE+T REILTLNRAS+Y E</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>35</Hit_num> + <Hit_id>gi|294661512|ref|YP_003579965.1|</Hit_id> + <Hit_def>gp48 baseplate subunit [Klebsiella phage KP15] >gi|448260646|ref|YP_007348740.1| baseplate tail tube cap [Klebsiella phage KP27] >gi|292660673|gb|ADE34921.1| gp48 baseplate subunit [Klebsiella phage KP15] >gi|370343455|gb|AEX26584.1| baseplate tail tube cap [Klebsiella phage KP27]</Hit_def> + <Hit_accession>YP_003579965</Hit_accession> + <Hit_len>357</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>170.244</Hsp_bit-score> + <Hsp_score>430</Hsp_score> + <Hsp_evalue>1.23976e-45</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>347</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>353</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>111</Hsp_identity> + <Hsp_positive>191</Hsp_positive> + <Hsp_gaps>32</Hsp_gaps> + <Hsp_align-len>365</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALIS----GVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGD-MRSMRLGEMKRTANSVVKSITGT-------NTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYK---STFLDTLTPDE-----ANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVY</Hsp_qseq> + <Hsp_hseq>MKFSIIDDSINTLKNIKNRGIPSGGAAITESVLKQTIVTAEFPAQRAAGIDNA--YNASSLYNNGLLFTAYDFNGVGSKDNYRSLR--QAAQNPKQILSSATGNVKYKQVLNSSIGTMEPVCQILLPRSLNDNEVNSHRYQDANDSFLTKG-------LSRVVSNMVWGAVESISGGIMADRREALDVGTKAAFQGSDKRTKMYYNTFVIESRNDLLELIKIYYLFTVLGYGTTSGGT-AKEVAALVKQYYGVLGAKTANAISPSSNPVTASDFDNSLGNDVVDFISNVEVIKSPPVWFIRDFQSGDSLRLPHSTFGPAGITSVRFGRSIDNIVNTLRESPNTPISLEVEIQFMELIDMRQDSIF</Hsp_hseq> + <Hsp_midline>+K +DD + L + G+ + ++S ++ +TA++P++R+AG D + LY NGLLFTAYD N + D RS+R + + ++ S TG N++ PV ILLPRS +D E SH++ D DS +++G LS V S V+G +ES++ G+MAD E + ++ + G+D RTK++ S DL+ +I+IY F YG T T AKEV + + ++Y + + ++P ++ ++++ + F+SNV V+ +P VWF+R+F + FGP I S+RF ++ + N L +PN P + +EI E++ + + S++</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>36</Hit_num> + <Hit_id>gi|66391556|ref|YP_239081.1|</Hit_id> + <Hit_def>gp48 baseplate [Enterobacteria phage RB43] >gi|62288644|gb|AAX78627.1| gp48 baseplate [Enterobacteria phage RB43] >gi|406718846|emb|CCL97571.1| protein of unknown function [Enterobacteria phage RB43] >gi|415434114|emb|CCK73954.1| protein of unknown function [Enterobacteria phage RB43]</Hit_def> + <Hit_accession>YP_239081</Hit_accession> + <Hit_len>361</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>168.703</Hsp_bit-score> + <Hsp_score>426</Hsp_score> + <Hsp_evalue>6.23176e-45</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>347</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>357</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>111</Hsp_identity> + <Hsp_positive>191</Hsp_positive> + <Hsp_gaps>36</Hsp_gaps> + <Hsp_align-len>369</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALI----SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYD----MNSRTTGDMRSMRLGEMKRTANSVVKSITGT-------NTNKVDKI-PVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKST---FLDTLTPDE-----ANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVY</Hsp_qseq> + <Hsp_hseq>MKIKVLQDTVQSFAEIKNAGIPSGGATTTKNALSQPIVTAEFPSQRAAGIDNA--YNASSLYNNGLLFTAYDFTGGLAPGSKDNYRSLR--QAAQNAKQILSANTGNVRYKQVLNTRTMGTLNPICQILLPRSLNDNEVNSHRYQDATDSIVAKG-------LSRAVSNVIWGAVESVSGGILADRREAIDIGTKAAFQGSDKRTKMYYNTFVIESRYDLLELIKIYYLFTVLGYGTTSGGTPA-ELAGLVKTAYNNTASKVANVFAPSSNQTTASDFNDSIGDQIVDFVSNVEVIKSPPVWFIRDFQTGDSLRFPHSTFGPAGITSVRFGRTMDNIVNTLRESPNTPISVEIEIQFMELIDMRQDSIF</Hsp_hseq> + <Hsp_midline>+K++ L D + +G+ + ++++A + +TA++PS+R+AG D + LY NGLLFTAYD + + + RS+R + + A ++ + TG NT + + P+ ILLPRS +D E SH++ D DS++++G LS S ++G +ES++ G++AD E I ++ + G+D RTK++ S DL+ +I+IY F YG T T A E+ + Y +T + P ++ ND++ ++I F+SNV V+ +P VWF+R+F T FGP I S+RF +T + N L +PN P + +EI E++ + + S++</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>37</Hit_num> + <Hit_id>gi|509141759|ref|YP_008060624.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Escherichia phage Lw1] >gi|479258586|gb|AGJ71509.1| baseplate tail tube cap [Escherichia phage Lw1]</Hit_def> + <Hit_accession>YP_008060624</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>156.377</Hsp_bit-score> + <Hsp_score>394</Hsp_score> + <Hsp_evalue>2.35983e-40</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>347</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>360</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>106</Hsp_identity> + <Hsp_positive>187</Hsp_positive> + <Hsp_gaps>39</Hsp_gaps> + <Hsp_align-len>372</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALI----SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSR----TTGDMRSMRLGEMKRTANSVVKSITGT-------NTNKVDKI-PVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGET--GTSTYAKEVKSQLDEWYKSTFLDTL---------TPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVY</Hsp_qseq> + <Hsp_hseq>MKIKVLQDTVQSFAKIKNAGIPSGGATTTKNALTQPIVTAEFPSQRAAGIDNA--YNASSLYNNGLLFTAYEFTGGFAPGSKDNYRSLR--QAAQNAQQILSANTGNVRYKQVLNTRTMGTLNPICQILLPRSLNDNEVNSHRYQDATDSIVAKGP-------SRVVSNVIWGVIESASGGILADRREAVDVGTKAAFQGSDKRTKMYYNTFVIESRYDLLELIKIYYLFTVLGYGTTSGGTAAEIAELAKQTINKASTTGAKLINNAAAGNGPTPTVSN-GSIISDQMVDFVTNIEVIKSPPVWFIRDFQTGDSLRFPHSTFGPAGITSVRFGRTMDNIVNTLRESPNTPISVEIEIQFMELIDMRQDSIF</Hsp_hseq> + <Hsp_midline>+K++ L D + +G+ + ++++A + +TA++PS+R+AG D + LY NGLLFTAY+ + + RS+R + + A ++ + TG NT + + P+ ILLPRS +D E SH++ D DS++++G S V S ++G +ES + G++AD E + ++ + G+D RTK++ S DL+ +I+IY F YG T GT+ E+ Q +T + TP +N + +++ F++N+ V+ +P VWF+R+F T FGP I S+RF +T + N L +PN P + +EI E++ + + S++</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>38</Hit_num> + <Hit_id>gi|304373651|ref|YP_003858396.1|</Hit_id> + <Hit_def>gp48 baseplate tail tube cap [Enterobacteria phage RB16] >gi|299829607|gb|ADJ55400.1| gp48 baseplate tail tube cap [Enterobacteria phage RB16]</Hit_def> + <Hit_accession>YP_003858396</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>155.221</Hsp_bit-score> + <Hsp_score>391</Hsp_score> + <Hsp_evalue>6.71724e-40</Hsp_evalue> + <Hsp_query-from>3</Hsp_query-from> + <Hsp_query-to>347</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>360</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>106</Hsp_identity> + <Hsp_positive>186</Hsp_positive> + <Hsp_gaps>39</Hsp_gaps> + <Hsp_align-len>372</Hsp_align-len> + <Hsp_qseq>IKVRELDDKTDALI----SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSR----TTGDMRSMRLGEMKRTANSVVKSITGT-------NTNKVDKI-PVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGET--GTSTYAKEVKSQLDEWYKSTFLDTL---------TPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVY</Hsp_qseq> + <Hsp_hseq>MKIKVLQDTVQSFAEIKNAGIPSGGATTTKNALTQPIVTAEFPSQRAAGIDNA--YNASSLYNNGLLFTAYEFTGGFAPGSKDNYRSLR--QAAQNAQQILSANTGNVRYKQVLNTRTMGTLNPICQILLPRSLNDNEVNSHRYQDATDSIVAKGP-------SRVVSNVIWGVIESASGGILADRREAVDVGTKAAFQGSDKRTKMYYNTFVIESRYDLLELIKIYYLFTVLGYGTTSGGTAAEIAELAKQTINKSSTTGAKLINNAVAGNGPTPTVSN-GSIISDQMVDFVINIEVIKSPPVWFIRDFQTGDSLRFPHSTFGPAGITSVRFGRTMDNIVNTLRESPNTPISVEIEIQFMELIDMRQDSIF</Hsp_hseq> + <Hsp_midline>+K++ L D + +G+ + ++++A + +TA++PS+R+AG D + LY NGLLFTAY+ + + RS+R + + A ++ + TG NT + + P+ ILLPRS +D E SH++ D DS++++G S V S ++G +ES + G++AD E + ++ + G+D RTK++ S DL+ +I+IY F YG T GT+ E+ Q +T + TP +N + +++ F+ N+ V+ +P VWF+R+F T FGP I S+RF +T + N L +PN P + +EI E++ + + S++</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>39</Hit_num> + <Hit_id>gi|414086183|ref|YP_006986373.1|</Hit_id> + <Hit_def>baseplate tail tube cap [Cronobacter phage vB_CsaM_GAP161] >gi|378566508|gb|AFC22204.1| baseplate tail tube cap [Cronobacter phage vB_CsaM_GAP161]</Hit_def> + <Hit_accession>YP_006986373</Hit_accession> + <Hit_len>364</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>153.68</Hsp_bit-score> + <Hsp_score>387</Hsp_score> + <Hsp_evalue>2.64906e-39</Hsp_evalue> + <Hsp_query-from>17</Hsp_query-from> + <Hsp_query-to>347</Hsp_query-to> + <Hsp_hit-from>19</Hsp_hit-from> + <Hsp_hit-to>360</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>102</Hsp_identity> + <Hsp_positive>178</Hsp_positive> + <Hsp_gaps>43</Hsp_gaps> + <Hsp_align-len>358</Hsp_align-len> + <Hsp_qseq>SGVKTSAGQSSQSAKIKSTITAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYD----MNSRTTGDMRSMRLGEMKRTANSVVKSITGT-------NTNKVDKI-PVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLT--------------PDEANKNDTVF-EKITSFLSNVIVVSNPTVWFVRNFGTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASVY</Hsp_qseq> + <Hsp_hseq>AGIPSGGAATTKNALTQPIVTAEFPSQRAAGIDNA--YNASSLYNNGLLFTAYEFTGGLAPGSKDNYRSLR--QAAQNAQQILSANTGNVRYKQVLNSRTIGTLNPICQILLPRSLNDNEVNSHRYQDATDSIVAKGP-------SRVVSNVIWGAIESASGGILADRREAVDVGTKAAFQGSDKRTKMYYNTFVIESRYDLLELIKIYYLFTVLGYGTTSGGTAA-----EIAELAKQTINKSSTAGAKLINNAIAGNGPTPTVSNGSIISDQAVDFVTNIEVIKSPPVWFIRDFQTGDSLRFPHSTFGPAGITSVRFGRTMDNIVNTLRESPNTPIAVEIEIQFMELIDMRQDSIF</Hsp_hseq> + <Hsp_midline>+G+ + ++++A + +TA++PS+R+AG D + LY NGLLFTAY+ + + + RS+R + + A ++ + TG N+ + + P+ ILLPRS +D E SH++ D DS++++G S V S ++G +ES + G++AD E + ++ + G+D RTK++ S DL+ +I+IY F YG T T A ++ E K T + T P N ++ ++ F++N+ V+ +P VWF+R+F T FGP I S+RF +T + N L +PN P +EI E++ + + S++</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>40</Hit_num> + <Hit_id>gi|392973135|ref|YP_006489093.1|</Hit_id> + <Hit_def>putative split baseplate tail tube cap [Acinetobacter phage ZZ1]</Hit_def> + <Hit_accession>YP_006489093</Hit_accession> + <Hit_len>143</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>107.071</Hsp_bit-score> + <Hsp_score>266</Hsp_score> + <Hsp_evalue>1.55074e-24</Hsp_evalue> + <Hsp_query-from>22</Hsp_query-from> + <Hsp_query-to>136</Hsp_query-to> + <Hsp_hit-from>19</Hsp_hit-from> + <Hsp_hit-to>143</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>59</Hsp_identity> + <Hsp_positive>80</Hsp_positive> + <Hsp_gaps>10</Hsp_gaps> + <Hsp_align-len>125</Hsp_align-len> + <Hsp_qseq>SAGQSSQSAKIKSTI-TAQYPSERSAGNDTSGSLRVHDLYKNGLLFTAYDMNSRTTGDMRSMRLGEMK-----RTANSVVKSITG----TNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDS</Hsp_qseq> + <Hsp_hseq>SAGQSQKSKETKTKIMTAQFPAERAASVDTTNAAEVGQNYQNGLLFTAYEYTSRTTPDLRSMRQRVQKSYKVLESTQKILSAVAGVSGQTEGRSTSKAPVANILMPRSKTDSDNTSHKFNDVGES</Hsp_hseq> + <Hsp_midline>SAGQS +S + K+ I TAQ+P+ER+A DT+ + V Y+NGLLFTAY+ SRTT D+RSMR K + ++ ++ G T K PV NIL+PRSK+D ++ SHKFNDVG+S</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>41</Hit_num> + <Hit_id>gi|646519388|ref|WP_025548737.1|</Hit_id> + <Hit_def>hypothetical protein [Vibrio parahaemolyticus] >gi|655769907|gb|KEE53216.1| hypothetical protein EM88_01435 [Vibrio parahaemolyticus] >gi|655811799|gb|KEE89780.1| hypothetical protein EM91_01710 [Vibrio parahaemolyticus]</Hit_def> + <Hit_accession>WP_025548737</Hit_accession> + <Hit_len>356</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>60.8474</Hsp_bit-score> + <Hsp_score>146</Hsp_score> + <Hsp_evalue>3.83249e-07</Hsp_evalue> + <Hsp_query-from>87</Hsp_query-from> + <Hsp_query-to>346</Hsp_query-to> + <Hsp_hit-from>109</Hsp_hit-from> + <Hsp_hit-to>342</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>65</Hsp_identity> + <Hsp_positive>105</Hsp_positive> + <Hsp_gaps>44</Hsp_gaps> + <Hsp_align-len>269</Hsp_align-len> + <Hsp_qseq>MKRTANSVVKSITGTNTNKVDKIPVVNILLPRSKSDVESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNT-ARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNF--------GTTSKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRASV</Hsp_qseq> + <Hsp_hseq>MPLLQDSLVHDIGGS----VDDITSVALAAGLDVADLEGDLSKLSSGVKSLVQNAKDITVGTVSQQAG-------QGSRQSTLASGNKVIQNNPGTDSWQGTQLREQTLIWQFNPKSLPELKAVASIIKTFKLLSLGSIGNSS------------------NELT--QANNNDRLNNPYGHIAS---CIKTPPLWFLEEVSDYYTGQDGAGARYTDRL-VFGPAAIASIKVNRTPDQYWKTFKGTAGDPASLDLEITFIELLPLDKETV</Hsp_hseq> + <Hsp_midline>M +S+V I G+ VD I V + +D+E K + SL+ G +S A + Q +A N+ I N + G R + W P+S+ +L A+ I + F S G G S+ + LT +AN ND + S + P +WF+ G +++ R VFGP I SI+ ++TP+ + P++ LEIT E+L L++ +V</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>42</Hit_num> + <Hit_id>gi|589286464|ref|YP_009006262.1|</Hit_id> + <Hit_def>tail-tube assembly protein [Vibrio phage VH7D] >gi|432142395|gb|AGB06975.1| tail-tube assembly protein [Vibrio phage VH7D]</Hit_def> + <Hit_accession>YP_009006262</Hit_accession> + <Hit_len>378</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>58.5362</Hsp_bit-score> + <Hsp_score>140</Hsp_score> + <Hsp_evalue>2.65852e-06</Hsp_evalue> + <Hsp_query-from>60</Hsp_query-from> + <Hsp_query-to>344</Hsp_query-to> + <Hsp_hit-from>61</Hsp_hit-from> + <Hsp_hit-to>339</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>73</Hsp_identity> + <Hsp_positive>122</Hsp_positive> + <Hsp_gaps>50</Hsp_gaps> + <Hsp_align-len>307</Hsp_align-len> + <Hsp_qseq>YKNGLLFTAYDMNSRTTGDMRSMR----------------LGEMKRTANSVVKSITGTNTNKVDKIP--VVNILLPRSKSDV--ESVSHKFNDVGDSLISRGGGTATGVLSNVASTAVFGGLESLTQGLMADHNEQIYNTARSMYGGADNRTKVFTWDLTPRSVQDLIAIIEIYEYFNYYSYGETGTSTYAKEVKSQLDEWYKSTFLDTLTPDEANKNDTVFEKITSFLSNVIVVSNPTVWFVRNFGTT--SKFDGRAEVFGPCQIQSIRFDKTPNGNFNGLAIAPNLPSTFTLEITMREILTLNRA</Hsp_qseq> + <Hsp_hseq>HPNFFIFRAYDLAHTTKQHYTDMRSSFTAAQTENEQSGEVPSELKATLALYAPNIVEEVSHEYDKTPTSVLNDFLASAASAAGSDTVSEGVDRGKRAVATAAGATLAQIKRSFIQSNAAGQLEK-NSSVVTD------NVTVTAYKGTAQRTQTMVYQFHPKSLDELKVVAEIIKTF----YG------LSLPVKGQID----SQLLDTGTANLGSGFAAGFAKYATLLKT------PPVWMIEEVSDTDATRYTPRF-IFGPAGITSVKLNRTPDQYWRTFRGTAGDPAGIELEITFSELIPLDRA</Hsp_hseq> + <Hsp_midline>+ N +F AYD+ T MR E+K T +I +++ DK P V+N L + S ++VS + ++ + G T + + + G LE ++ D N + Y G RT+ + P+S+ +L + EI + F YG + VK Q+D S LDT T + + F K + L P VW + T +++ R +FGP I S++ ++TP+ + P+ LEIT E++ L+RA</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +</Iteration_hits> + <Iteration_stat> + <Statistics> + <Statistics_db-num>48094830</Statistics_db-num> + <Statistics_db-len>17186091396</Statistics_db-len> + <Statistics_hsp-len>148</Statistics_hsp-len> + <Statistics_eff-space>2043815480868</Statistics_eff-space> + <Statistics_kappa>0.041</Statistics_kappa> + <Statistics_lambda>0.267</Statistics_lambda> + <Statistics_entropy>0.14</Statistics_entropy> + </Statistics> + </Iteration_stat> +</Iteration> +<Iteration> + <Iteration_iter-num>5</Iteration_iter-num> + <Iteration_query-ID>Query_5</Iteration_query-ID> + <Iteration_query-def>Merlin_5</Iteration_query-def> + <Iteration_query-len>576</Iteration_query-len> +<Iteration_hits> +<Hit> + <Hit_num>1</Hit_num> + <Hit_id>gi|456351275|ref|YP_007501227.1|</Hit_id> + <Hit_def>baseplate hub [Salmonella phage S16] >gi|347466340|gb|AEO97126.1| baseplate hub [Salmonella phage S16] >gi|408387124|gb|AFU64133.1| baseplate hub [Salmonella phage STML-198]</Hit_def> + <Hit_accession>YP_007501227</Hit_accession> + <Hit_len>577</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>675.626</Hsp_bit-score> + <Hsp_score>1742</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>577</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>345</Hsp_identity> + <Hsp_positive>442</Hsp_positive> + <Hsp_gaps>3</Hsp_gaps> + <Hsp_align-len>578</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPE--NNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKTENMTSFRRRKVIADSKGERDAAAAASNQVESLDSIGYKLDSVQSATELTSEVIEQKSNDIISAVNDTTAGVELTAEFAENTSKTVRELTDVTSAISDKISKLTDMLEQKIQAVQQKFVDSSKVTDDTLKVIGDSIPEPVESNLPAIPEKIFDKPEENNS-PDADFFPTLPSKAEEVDNKKDSDKKILDTENLLKDLVGTTKTGFKATVSITDKISNMLFKYTVSALAESAKLAGTIFAIVLGIDLLRAHFKYWSDKFSSNFDEFSQSAGEWGSLLQSVLGSLQEIKKFWENNDWSGLAVAIVKGLADVLYNLSELMSLGISKISAAILSALGFDNAALSIKGAALEGFQARTGNELNEEDQDTLARYQTRRIQEGPDAFDKFSEYKTRAFDFITGRDNKNTTTTEQEREAEVKKLKSLPEEELNEINKKSNNARAALVRFEKYMGDVDPENATNIESLDKAYNNVKSLVNDSELNKAPAIKKELEVRLQKAEARYQKIKTESKPEPAAPSASEDVQKVQNIEKAEQAKKSDANQSSSSSVVNAQVNNVNNSRTIQTINPVTATPAPGVFKATGVN</Hsp_hseq> + <Hsp_midline>MK+ENM++ RRRKVIADSKGERDAA+ AS+QV+SL+ IG KLD VQSA EL +EVIE+K N++I +V++ G EL AE +E T+++++ LT V S ISDK+SKL MLE K+QAV+QK +S L VI D +P+P E P +PE+I ++NN+ PD DFFP +P + E +NKKD K ++L DL+ TTK GFKAT+SITDKIS+MLFKYTV+ALAE+AK+A +FA+VLGIDLLR HFKYW+DKF SNFDEFS AGEWG LLQS+ G L +IKKFWE DWSGLAVAIVKGLADV+YNLSE+MSLGISKISA+IL ALGF+NAA +I+G+ALEGFQ RTGN L+E+DQ LA+YQ++RI+EGP DK E+KTRAFD++ GR+NK +T +R+ E + LK++ E+ E K N ARAA+ R EKY+GDVDPEN TN++SL+KAYN+ K ++DS ++ PA KKEL+ R Q+ E++YQK+K ++ P+PAAP+ SED Q+VQNI+KAE AK+ + +V N QVNNVNNS+TI + VTATPAPGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>2</Hit_num> + <Hit_id>gi|589889938|ref|YP_009005474.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Enterobacter phage PG7] >gi|583927851|gb|AHI61113.1| baseplate hub subunit tail length determinator [Enterobacter phage PG7]</Hit_def> + <Hit_accession>YP_009005474</Hit_accession> + <Hit_len>586</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>549.28</Hsp_bit-score> + <Hsp_score>1414</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>586</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>297</Hsp_identity> + <Hsp_positive>414</Hsp_positive> + <Hsp_gaps>20</Hsp_gaps> + <Hsp_align-len>591</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAE-------GTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPE---RILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDD--KKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANT-QVNNV-NNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKTENMKTMRR-KVIEEGRSERDAAKAASTQAESLSVLSSQLDDLQTQAELTSEVIEDKGNQVIDALNRVDQSIIDTTAGAELTAEASERTTEAVKQQTEVSNKISDKLSKLTELLNERLSAITPNLPQISV-PDTSLSVVEDAVPV--DIVTPGLPELLQELIPDPVNNTNNPNDAFFPTVPENPESDSKKGADEERKKKDSDTLSNLLKATKSGFKASMSITDRIAGMLFKYTVTAVIEAAKTAALLFSIVLGIDVIMKHFKYWSDKFTSDFDKFSAEAGEWGSTLSSIFGTLENIQKFWEAGDWSGLTVAIVKGVTEIIYNLSELISLGMSKVAAAILSIIPGLGDAALSVEGAALEGFQERTGNSLSKEDQDTLAKYQSSKIEKGENFFDKVSQGKTWIVNKITGDANISDFVTDEERESQNEKLRQMKPEEREQVLKKGNEARAAIVRFEKYMEQINPDDKRSVESADKAYANLQTQLNDTDLNNSPVTKKELSARMNIVTAKYDKLK-GKEPQPAPSSQSEDVKKVESIEKNKAAKEASLGTSAGAAAANLFNTNNVINNSRTINTVSPVTSTNAPGVFGATGVN</Hsp_hseq> + <Hsp_midline>MK+ENM TMRR KVI + + ERDAA AS Q +SL ++ +LDD+Q+ EL +EVIE+KGN +ID+++ V + G EL AEASERTTE++K T V++ ISDKLSKL +L ++ A+ + + T LSV+ED +P + +PGLPE ++P +N N P++ FFP VP+ PE++ K ++ KK +D L +LLK TK GFKA++SITD+I+ MLFKYTVTA+ EAAK AA+LF++VLGID++ HFKYW+DKF S+FD+FSAEAGEWG L SIFG L +I+KFWEAGDWSGL VAIVKG+ ++IYNLSE++SLG+SK++A+IL + G +AA ++ G+ALEGFQERTGNSLS++DQ LAKYQS +IE+G DK + KT + + G N D +R+ + + L+ M PE+RE+ L K NEARAA+ R EKY+ ++P++ +++S +KAY + + ++D+ +++ P TKKEL R V +KY KLK P+PA + SED ++V++I+K + AKE S ++ AN NNV NNS+TI+ V VT+T APGVFGATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>3</Hit_num> + <Hit_id>gi|311993187|ref|YP_004010053.1|</Hit_id> + <Hit_def>gp29 base plate hub subunit, tail length determinator [Enterobacteria phage CC31] >gi|284178025|gb|ADB81691.1| gp29 base plate hub subunit, tail length determinator [Enterobacteria phage CC31]</Hit_def> + <Hit_accession>YP_004010053</Hit_accession> + <Hit_len>586</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>546.584</Hsp_bit-score> + <Hsp_score>1407</Hsp_score> + <Hsp_evalue>0</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>586</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>296</Hsp_identity> + <Hsp_positive>412</Hsp_positive> + <Hsp_gaps>22</Hsp_gaps> + <Hsp_align-len>592</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAE-------GTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPE---RILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDD--KKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQK---AENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKTENMKTMRR-KVIEEGRSERDAAKAASTQAESLSVLSSQLDDLQTQAELTSEVIEDKGNQVIDALNRVDQSIIDTTAGAELTAEASERTTEAVKQQTEVSNKISDKLSKLTELLNERLSAITPNLPQISV-PDTSLSVVEDAVPV--DIVTPGLPELLQELIPDPVNNTNNPNDAFFPTVPENPESDSKKGADEERKKKDSDTLSNLLKATKSGFKASMSITDRIAGMLFKYTVTAVIEAAKTAALLFSIVLGIDVIMKHFKYWSDKFTSDFDKFSAEAGEWGSTLSSIFGTLENIQKFWEAGDWSGLTVAIVKGVTEIIYNLSELISLGMSKVAAAILSLIPGLGDAALSVEGAALEGFQERTGNSLSKEDQDTLAKYQSSKIEKGENFFDKVSQGKTWIVNKITGDANISDFVTDEERTAQNEKLRQMKPEEREQVLKKGNEARAAIVRFEKYMEQINPDDKRSVQSADKAYANLQTQLNDTDLNNSPITKKELNARMNIVTAKYDKLK-GKEPQPAPSSQSEDVKKVESIEKNKAAEKASLGTGAGAAAANLFNTN-NVINNSRTINTVSPVTSTNAPGVFGATGVN</Hsp_hseq> + <Hsp_midline>MK+ENM TMRR KVI + + ERDAA AS Q +SL ++ +LDD+Q+ EL +EVIE+KGN +ID+++ V + G EL AEASERTTE++K T V++ ISDKLSKL +L ++ A+ + + T LSV+ED +P + +PGLPE ++P +N N P++ FFP VP+ PE++ K ++ KK +D L +LLK TK GFKA++SITD+I+ MLFKYTVTA+ EAAK AA+LF++VLGID++ HFKYW+DKF S+FD+FSAEAGEWG L SIFG L +I+KFWEAGDWSGL VAIVKG+ ++IYNLSE++SLG+SK++A+IL + G +AA ++ G+ALEGFQERTGNSLS++DQ LAKYQS +IE+G DK + KT + + G N D +R + + L+ M PE+RE+ L K NEARAA+ R EKY+ ++P++ ++QS +KAY + + ++D+ +++ P TKKEL+ R V +KY KLK P+PA + SED ++V++I+K AE A + N+ NT N +NNS+TI+ V VT+T APGVFGATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>4</Hit_num> + <Hit_id>gi|422934607|ref|YP_007004568.1|</Hit_id> + <Hit_def>phage baseplate hub [Enterobacteria phage ime09] >gi|339791390|gb|AEK12447.1| phage baseplate hub [Enterobacteria phage ime09]</Hit_def> + <Hit_accession>YP_007004568</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>447.588</Hsp_bit-score> + <Hsp_score>1150</Hsp_score> + <Hsp_evalue>1.35305e-146</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>267</Hsp_identity> + <Hsp_positive>374</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKSTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKDAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMTAMILAVVVGIDLLMVHFKYWSDKFSQAWDLFSTDFTKFSSETGTWGPLLQSIFSSIDEIKKFWEAGDWGGLTVAIVEGLGKVLYNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNDDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKKD + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + +IKKFWEAGDW GL VAIV+GL V+YNL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ + G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>5</Hit_num> + <Hit_id>gi|228861124|ref|YP_002854147.1|</Hit_id> + <Hit_def>gp29 base plate hub [Enterobacteria phage RB51] >gi|227438798|gb|ACP31110.1| gp29 base plate hub [Enterobacteria phage RB51] >gi|291290410|dbj|BAI83205.1| baseplate hub subunit/tail length determinator [Enterobacteria phage AR1]</Hit_def> + <Hit_accession>YP_002854147</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>442.965</Hsp_bit-score> + <Hsp_score>1138</Hsp_score> + <Hsp_evalue>9.14277e-145</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>264</Hsp_identity> + <Hsp_positive>378</Hsp_positive> + <Hsp_gaps>49</Hsp_gaps> + <Hsp_align-len>606</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLP----DPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIVGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTG----TSLAVVENAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPIEPKQESPEEKQKRDAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMVALIMAVVIGIDLLMVHFKYWSDKFSKAWDLFSTDFTKFSSETGTWGPLLQSIFDSIDEIKKFWEAGDWGGLTVAIVEGLGSVLYNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNDDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSIEAAHEDLKKRMNDPDLNNSPAVKKELASRFAKIDATYQELKK-NQPEAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFGATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N ++ ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G T L+V+E+ +P + D ES G +LP + NN PD DFFP P P EP E+ ++ QK+D + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM A++ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + +IKKFWEAGDW GL VAIV+GL V+YNL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ + G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E A+ KK ++D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVFGATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>6</Hit_num> + <Hit_id>gi|422934972|ref|YP_007004932.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Escherichia phage wV7] >gi|343177526|gb|AEM00852.1| baseplate hub subunit tail length determinator [Escherichia phage wV7]</Hit_def> + <Hit_accession>YP_007004932</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>442.58</Hsp_bit-score> + <Hsp_score>1137</Hsp_score> + <Hsp_evalue>1.58375e-144</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>263</Hsp_identity> + <Hsp_positive>378</Hsp_positive> + <Hsp_gaps>49</Hsp_gaps> + <Hsp_align-len>606</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLP----DPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIVGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTG----TSLAVVENAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPIEPKQESPEEKQKRDAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMVALIMAVVIGIDLLMVHFKYWSDKFSKAWDLFSTDFTKFSSETGTWGPLLQSIFDSIDEIKKFWEAGDWGGLTVAIIEGLGSVLYNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNDDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSIEAAHEDLKKRMNDPDLNNSPAVKKELASRFAKIDATYQELKK-NQPEAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFGATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N ++ ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G T L+V+E+ +P + D ES G +LP + NN PD DFFP P P EP E+ ++ QK+D + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM A++ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + +IKKFWEAGDW GL VAI++GL V+YNL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ + G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E A+ KK ++D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVFGATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>7</Hit_num> + <Hit_id>gi|604671901|gb|AHV82895.1|</Hit_id> + <Hit_def>baseplate hub subunit, tail length determinator [Escherichia phage vB_EcoM_PhAPEC2]</Hit_def> + <Hit_accession>AHV82895</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>441.425</Hsp_bit-score> + <Hsp_score>1134</Hsp_score> + <Hsp_evalue>3.83095e-144</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>269</Hsp_identity> + <Hsp_positive>375</Hsp_positive> + <Hsp_gaps>30</Hsp_gaps> + <Hsp_align-len>598</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSV-------DNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLP-DPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKD-DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMS-------NFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEF----KTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKE-DNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQT-VTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKPEEMKSMRRNKVIADNKPQKVAATAATDSLEALNNISSKLDDVQAASELTSQSVEDKGNGIIESIGDLKNSTDNTAEGTELIAEVIEKQTEVTKSINEVSSAISSKLDRLATLLEQKLQ-TSTAIQNTGG---TSLEVIENAIPVKVVENETSDELFKALPTPEKIDNKPDEDFFPVPVQESANSTSDSKGGISFKLSDKIAMLTKTVQTGFNKSISISDRIAGMLFKYTITAAIEAAKMAALILGIVIGIDLLIVHFKYWTDKFTSAWDLFDENFTKFSDEAKEWGKFLSDIFTSIDSIKQLWEAGDWGGLTVAIVKGVGTALMNLGELIQLGMAKLSASILRAIGFGDTADEIEGRALEGFQETTGNKLKKEDQEKVAKYQMKRDDGELGTVSKGLDMLQRGKTFVTNWVRGNDNKEEFSTSDERAAESAKLKELPEEERKEAYIKANETRAALVRFEDYIDKIDMTNPENAKNVEKSYADLSKLIKDPELNKTPVVKKELDARFEKLNNKMAEAKKAQTTVKPESSSKSPEAKQVQSIEKGRAS--ESKQQQPVAAISNT--NNVVKKNTVVQNMTPVTSTTAPGIFHATGVN</Hsp_hseq> + <Hsp_midline>MK E M +MRR KVIAD+K ++ AA+ A+D +++L I KLDDVQ+A+EL ++ +E+KGN +I+S+ DN AEGTEL AE E+ TE K++ V+S IS KL +LA++LE K+Q +Q +G T L VIE+ +P E E+ + LP + +N PDEDFFP QE N+ D K K +D + L KT + GF +ISI+D+I+ MLFKYT+TA EAAKMAA++ +V+GIDLL +HFKYWTDKF S NF +FS EA EWG L IF + IK+ WEAGDW GL VAIVKG+ + NL E++ LG++K+SASIL A+GF + A I G ALEGFQE TGN L ++DQ+ +AKYQ KR + G + K + KT +WV G +NK + + + +R E+ LK + E+R+E IK NE RAA+ R E YI +D NP N +++EK+Y K I D ++ P KKELD RF+++ +K + K+ T KP + + S + ++VQ+I+K + +SK+ ++NT NNV T+ Q T VT+T APG+F ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>8</Hit_num> + <Hit_id>gi|32453687|ref|NP_861896.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit, tail length determinator [Enterobacteria phage RB69] >gi|32350506|gb|AAP76105.1| gp29 baseplate hub subunit, tail length determinator [Enterobacteria phage RB69]</Hit_def> + <Hit_accession>NP_861896</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>441.425</Hsp_bit-score> + <Hsp_score>1134</Hsp_score> + <Hsp_evalue>4.26665e-144</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>270</Hsp_identity> + <Hsp_positive>376</Hsp_positive> + <Hsp_gaps>34</Hsp_gaps> + <Hsp_align-len>600</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSV-------DNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLP-DPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKD-DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMS-------NFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEF----KTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKE-DNTPKPAAPATSEDNQRVQNIQK--AENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQT-VTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKPEEMKSMRRNKVIADNKPQKVAATAATDSLEALNNISSKLDDVQAASELTSQSVEDKGNGIIESIGDLKNSTDNTAEGTELIAEVIEKQTEVTKSINEVSSAISSKLDRLATLLEQKLQ-TSTAIQNTGG---TSLEVIENAIPVKVVENETSDELFKAFPTPEKIDNKPDEDFFPTPVQESANSTSDSKGGISFKLSDKIAMLTKTVQTGFNKSISISDRIAGMLFKYTITAAIEAAKMAALILGIVIGIDLLIVHFKYWTDKFTSAWDLFDENFTKFSDEAKEWGKFLSDIFTSIDSIKQLWEAGDWGGLTVAIVKGVGTALMNLGELIQLGMAKLSASILRAIGFGDTADEIEGRALEGFQETTGNKLKKEDQEKVAKYQMKRDDGELGTVSKGLDMLQRGKTFVTNWVRGNDNKEEFSTSDERAAESAKLKELPEEERKEAYIKANETRAALVRFEDYIDKIDMTNPENAKNVEKSYADLSKLIKDPELNKTPVVKKELDARFEKLNNKMAEAKKAQTTVKPESSSKSPEAKQVQSIEKGRASESKQQQPVAT----ISNT--NNVVKKNTVVQNMTPVTSTTAPGIFHATGVN</Hsp_hseq> + <Hsp_midline>MK E M +MRR KVIAD+K ++ AA+ A+D +++L I KLDDVQ+A+EL ++ +E+KGN +I+S+ DN AEGTEL AE E+ TE K++ V+S IS KL +LA++LE K+Q +Q +G T L VIE+ +P E E+ + P + +N PDEDFFP QE N+ D K K +D + L KT + GF +ISI+D+I+ MLFKYT+TA EAAKMAA++ +V+GIDLL +HFKYWTDKF S NF +FS EA EWG L IF + IK+ WEAGDW GL VAIVKG+ + NL E++ LG++K+SASIL A+GF + A I G ALEGFQE TGN L ++DQ+ +AKYQ KR + G + K + KT +WV G +NK + + + +R E+ LK + E+R+E IK NE RAA+ R E YI +D NP N +++EK+Y K I D ++ P KKELD RF+++ +K + K+ T KP + + S + ++VQ+I+K A +K+Q +T ++NT NNV T+ Q T VT+T APG+F ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>9</Hit_num> + <Hit_id>gi|642905806|ref|YP_009037575.1|</Hit_id> + <Hit_def>baseplate hub subunit, tail length determinator [Escherichia phage vB_EcoM_JS09] >gi|642903960|gb|AIA79980.1| baseplate hub subunit, tail length determinator [Escherichia phage vB_EcoM_JS09]</Hit_def> + <Hit_accession>YP_009037575</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>441.039</Hsp_bit-score> + <Hsp_score>1133</Hsp_score> + <Hsp_evalue>6.28771e-144</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>267</Hsp_identity> + <Hsp_positive>375</Hsp_positive> + <Hsp_gaps>30</Hsp_gaps> + <Hsp_align-len>598</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSV-------DNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLP-DPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKD-DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMS-------NFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEF----KTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKE-DNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQT-VTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKPEEMKSMRRNKVIADNKPQKVAATAATDSLEALNDISSKLDDVQAASELTSQSVEDKGNGIIESIGDLKNSTDNTAEGTELIAEVIEKQTEVTKSINEVSSAISSKLDRLATLLEQKLQ-TSTAIQNTGG---TSLEVIENAIPVKVVENETSDELFKAFPTPEKIDNKPDEDFFPAPVQESANSTSDSKGGISFKLSDKIAMLTKTVQTGFNKSISISDRIAGMLFKYTITAAIEAAKLAALILGIVIGIDLLIVHFKYWTDKFTSAWDLFDENFTKFSDEAKEWGKFLSDIFTSIDSIKQLWEAGDWGGLTVAIVKGVGTALMNLGELIQLGMAKLSASILRAIGFGDTADEIEGRALEGFQETTGNTLKKEDQEKVAKYQMKRDDGELGTVSKGLDMLQRGKTFVTNWVRGNDNKEEFSTSDERAAESAKLKELPEEERKEAYIKANETRAALVRFEDYIDKIDMTNPENAKNVEKSYADLSKLIKDPELNKTPVVKKELDARFEKLNNKMAEAKKAQTTVKPESSSKSPEAKQVQSIEKGRAS--ESKQQQPVAAISNT--NNVVKKNTVVQNMTPVTSTTAPGIFHATGVN</Hsp_hseq> + <Hsp_midline>MK E M +MRR KVIAD+K ++ AA+ A+D +++L I KLDDVQ+A+EL ++ +E+KGN +I+S+ DN AEGTEL AE E+ TE K++ V+S IS KL +LA++LE K+Q +Q +G T L VIE+ +P E E+ + P + +N PDEDFFP QE N+ D K K +D + L KT + GF +ISI+D+I+ MLFKYT+TA EAAK+AA++ +V+GIDLL +HFKYWTDKF S NF +FS EA EWG L IF + IK+ WEAGDW GL VAIVKG+ + NL E++ LG++K+SASIL A+GF + A I G ALEGFQE TGN+L ++DQ+ +AKYQ KR + G + K + KT +WV G +NK + + + +R E+ LK + E+R+E IK NE RAA+ R E YI +D NP N +++EK+Y K I D ++ P KKELD RF+++ +K + K+ T KP + + S + ++VQ+I+K + +SK+ ++NT NNV T+ Q T VT+T APG+F ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>10</Hit_num> + <Hit_id>gi|228861505|ref|YP_002854526.1|</Hit_id> + <Hit_def>gp29 base plate hub [Enterobacteria phage RB14] >gi|227438521|gb|ACP30834.1| gp29 base plate hub [Enterobacteria phage RB14]</Hit_def> + <Hit_accession>YP_002854526</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>438.343</Hsp_bit-score> + <Hsp_score>1126</Hsp_score> + <Hsp_evalue>7.24825e-143</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>263</Hsp_identity> + <Hsp_positive>371</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMTAMILAVVVGIDLLMVHFKYWSDKFSKAWDLFNTDFTKFSSETGTWGPLLQSIFDSIDKIKQLWEAGDWGGLTAAIVEGLGSVLFNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNGDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFGATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + IK+ WEAGDW GL AIV+GL V++NL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVFGATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>11</Hit_num> + <Hit_id>gi|414086558|ref|YP_006986747.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Enterobacteria phage vB_EcoM_ACG-C40] >gi|383396339|gb|AFH20155.1| baseplate hub subunit tail length determinator [Enterobacteria phage vB_EcoM_ACG-C40]</Hit_def> + <Hit_accession>YP_006986747</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>437.958</Hsp_bit-score> + <Hsp_score>1125</Hsp_score> + <Hsp_evalue>8.89384e-143</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>263</Hsp_identity> + <Hsp_positive>372</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPVQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLAVVESAIPVKVVEDDTAEFVG---PLLPAPEAVNNDPDADFFPAPQPVEPKRESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMVALIMAVVIGIDLLMVHFKYWSDKFSKAWDLFSTDFKTFSSETGTWGPLLQSIFESIDEIKKFWEAGDWGGLTVAIVEGLGKVLYNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNGDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSIEAAHEDLKKRMNDPDLNNSPAVKKELASRFAKIDATYQELKK-NQPEAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFGATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G S + S I K+ + D E G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM A++ A+V+GIDLL +HFKYW+DKF ++F FS+E G WG LLQSIF + +IKKFWEAGDW GL VAIV+GL V+YNL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E A+ KK ++D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVFGATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>12</Hit_num> + <Hit_id>gi|9632606|ref|NP_049805.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit, tail length determinator [Enterobacteria phage T4] >gi|137988|sp|P13337.1|VG29_BPT4 RecName: Full=Tail-tube assembly protein Gp29; AltName: Full=Folylpolyglutamate synthase; AltName: Full=Tail length regulator; AltName: Full=Tetrahydrofolylpolyglutamate synthase [Enterobacteria phage T4] >gi|5354230|gb|AAD42437.1|AF158101_24 gp29 baseplate hub subunit, tail length determinator [Enterobacteria phage T4] >gi|215946|gb|AAA32538.1| tail-tube assembly protein [Enterobacteria phage T4]</Hit_def> + <Hit_accession>NP_049805</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>437.573</Hsp_bit-score> + <Hsp_score>1124</Hsp_score> + <Hsp_evalue>1.07961e-142</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>264</Hsp_identity> + <Hsp_positive>372</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVESAVSDTSAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMTAMILAVVVGIDLLMIHFKYWSDKFSKAWDLFSTDFTKFSSETGTWGPLLQSIFDSIDKIKQLWEAGDWGGLTVAIVEGLGKVLFNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNGDLGPIAEGLDKISNWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV + SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM AM+ A+V+GIDLL IHFKYW+DKF ++F +FS+E G WG LLQSIF + IK+ WEAGDW GL VAIV+GL V++NL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>13</Hit_num> + <Hit_id>gi|525334458|gb|AGR46140.1|</Hit_id> + <Hit_def>baseplate hub subunit [Yersinia phage PST]</Hit_def> + <Hit_accession>AGR46140</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>437.187</Hsp_bit-score> + <Hsp_score>1123</Hsp_score> + <Hsp_evalue>1.95194e-142</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>267</Hsp_identity> + <Hsp_positive>373</Hsp_positive> + <Hsp_gaps>47</Hsp_gaps> + <Hsp_align-len>605</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNV-------AEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVEGAVSDTTAGSELIAETVEIGNNINKE---IGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKDAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMTAMILAVVVGIDLLMVHFKYWSDKFSKAWDLFNTDFTKFSSETGTWGPLLQSIFDSIDKIKQLWEAGDWGGLTVAIIEGLGKVLFNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNGDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV G+EL AE E K + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKKD + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + IK+ WEAGDW GL VAI++GL V++NL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>14</Hit_num> + <Hit_id>gi|299780553|gb|ADJ39915.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Enterobacteria phage T4T]</Hit_def> + <Hit_accession>ADJ39915</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>437.187</Hsp_bit-score> + <Hsp_score>1123</Hsp_score> + <Hsp_evalue>2.03785e-142</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>264</Hsp_identity> + <Hsp_positive>371</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMTAMILAVVVGIDLLMIHFKYWSDKFSKAWDLFSTDFTKFSSETGTWGPLLQSIFDSIDKIKQLWEAGDWGGLTVAIVEGLGKVLFNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNGDLGPIAEGLDKISNWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM AM+ A+V+GIDLL IHFKYW+DKF ++F +FS+E G WG LLQSIF + IK+ WEAGDW GL VAIV+GL V++NL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>15</Hit_num> + <Hit_id>gi|330858710|ref|YP_004415085.1|</Hit_id> + <Hit_def>putative baseplate hub subunit and tail length determinator [Shigella phage Shfl2] >gi|327397644|gb|AEA73146.1| putative baseplate hub subunit and tail length determinator [Shigella phage Shfl2]</Hit_def> + <Hit_accession>YP_004415085</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>424.091</Hsp_bit-score> + <Hsp_score>1089</Hsp_score> + <Hsp_evalue>1.93327e-137</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>261</Hsp_identity> + <Hsp_positive>368</Hsp_positive> + <Hsp_gaps>33</Hsp_gaps> + <Hsp_align-len>598</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISITDKISSMLFKYTVSAAIEAAKMTAMILAVVVGIDLLMVHFKYWSDKFSKAWDLFNTDFTKFSSETGTWGPLLQSIFDSIDKIKQLWEAGDWGGLTAAIVEGLGSVLFNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNTTGASLNKEDQEKVANYQDKRMNGDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK--NKAQQAPVQQASPSINNTNNVVKKNTVV-HNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISITDKISSMLFKYTV+A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + IK+ WEAGDW GL AIV+GL V++NL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K N +Q+ ++ NT N+ H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>16</Hit_num> + <Hit_id>gi|397134209|gb|AFO10716.1|</Hit_id> + <Hit_def>baseplate hub protein [Escherichia phage ECML-134]</Hit_def> + <Hit_accession>AFO10716</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>421.009</Hsp_bit-score> + <Hsp_score>1081</Hsp_score> + <Hsp_evalue>3.75934e-136</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>263</Hsp_identity> + <Hsp_positive>373</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIVGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISISDKISSMLFKYTISAAIEAAKMTAMILAVVVGIDLLMVHFKYWSDKFSKAWDLFNTDFTKFSSETGTWGPLLQSIFSSIDEIKKFWEAGDWGGLTVAIVEGLGKVLYNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLSKEDQEKVANYQDKRMNGDLGPIAEGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEQYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N ++ ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISI+DKISSMLFKYT++A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + +IKKFWEAGDW GL VAIV+GL V+YNL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SLS++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R E+Y D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>17</Hit_num> + <Hit_id>gi|116326412|ref|YP_803132.1|</Hit_id> + <Hit_def>base plate hub [Enterobacteria phage RB32] >gi|115344005|gb|ABI95014.1| base plate hub [Enterobacteria phage RB32]</Hit_def> + <Hit_accession>YP_803132</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>407.527</Hsp_bit-score> + <Hsp_score>1046</Hsp_score> + <Hsp_evalue>5.49342e-131</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>261</Hsp_identity> + <Hsp_positive>372</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIIGAIDNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPAPQPVEPKQESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISISDKISSMLFKYTISAAIEAAKMTAMILAVVVGIDLLMVHFKYWSDKFSKAWDLFNTDFTKFSSETGTWGPLLQSIFDSIDEIKKFWEAGDWGGLTVAIVEGLGKVLYNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNTTNASLSKEDQEKVANYQYKRMNGDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAEEEEKLKQLSPEEAKIALMKANEARAAMNRFDQYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N +I ++DNV SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISI+DKISSMLFKYT++A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + +IKKFWEAGDW GL VAIV+GL V+YNL E++ LG++K+SA+IL + G ++ A + G ALE FQ T SLS++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R +E + LK ++PE+ + L+K NEARAA+ R ++Y D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>18</Hit_num> + <Hit_id>gi|639438842|ref|YP_009030799.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Escherichia phage e11/2] >gi|628971670|gb|AHY83392.1| baseplate hub subunit tail length determinator [Escherichia phage e11/2]</Hit_def> + <Hit_accession>YP_009030799</Hit_accession> + <Hit_len>590</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>399.823</Hsp_bit-score> + <Hsp_score>1026</Hsp_score> + <Hsp_evalue>4.84152e-128</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>590</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>255</Hsp_identity> + <Hsp_positive>369</Hsp_positive> + <Hsp_gaps>41</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGV----ASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFP-PVPQEP--ENNKKDQKKD--DKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDAQAASELIAQTVEEKSNEIVGAIGNVESAVSDTTAGSELIAETVEIGNNINKEIGESLGSKLDKLTSLLEQKIQTA--GIQQTGTSLATVESAIPVKVVEDDTAESVG---PLLPAPEAVNNDPDADFFPTPQPVEPKQESPEEKQKKEAFNLKLSQALDKLTKTVDFGFKKSISISDKISSMLFKYTISAAIEAAKMTAMILAVVVGIDLLMVHFKYWSDKFSKAWDLFNTDFTKFSSETGTWGPLLQSIFSSIDKIQQFWEKGDWGGLTAAIIEGLGSVLFNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNTTGASLNKEDQEKVANYQYKRMNGDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTSDEERAEEEEKLKQLSPEEAKIALMKANEARAAMNRFEKYADSADMSKDSTVKSVESAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N ++ ++ NV SE E+++ + ++ KL KL S+LE K+Q +Q++G S +T S I K+ + D ES G +LP + NN PD DFFP P P EP E+ ++ QKK+ + K + L L KT GFK +ISI+DKISSMLFKYT++A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + I++FWE GDW GL AI++GL V++NL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R +E + LK ++PE+ + L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>19</Hit_num> + <Hit_id>gi|398313740|emb|CCI89087.1|</Hit_id> + <Hit_def>phage baseplate hub [Yersinia phage phiD1]</Hit_def> + <Hit_accession>CCI89087</Hit_accession> + <Hit_len>369</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>308.531</Hsp_bit-score> + <Hsp_score>789</Hsp_score> + <Hsp_evalue>1.22596e-95</Hsp_evalue> + <Hsp_query-from>218</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>369</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>169</Hsp_identity> + <Hsp_positive>239</Hsp_positive> + <Hsp_gaps>26</Hsp_gaps> + <Hsp_align-len>377</Hsp_align-len> + <Hsp_qseq>MLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFM-------SNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGII----DKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTP--KPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MLFKYTISAAIEAAKMTAMILAVVIGIDLLMVHFKYWSDKFSKAWDLFNTDFTKFSSETGTWGPLLQSIFDSIDKIKQLWEAGDWGGLTAAIVEGLGSVLFNLGELIQLGMAKLSAAILRVIPGMKDTADEVEGRALENFQNSTGASLNKEDQEKVANYQDKRMNGDLGPIAKGLDKIANWKTRASNWIRGVDNKEALTTDEERAAEEEKLKQLSPEERKNALMKANEARAAMIRFEKYADSADMSKDSTVKSVEAAYEDLKKRMDDPDLNNSPAVKKELAARFSKIDATYQELKK-NQPNAKPETSAKSPEAKQVQVIEK-------NKAQQAPVQQASPSINNTNNVIKKNTVVHNMTPVTSTTAPGVFDATGVN</Hsp_hseq> + <Hsp_midline>MLFKYT++A EAAKM AM+ A+V+GIDLL +HFKYW+DKF ++F +FS+E G WG LLQSIF + IK+ WEAGDW GL AIV+GL V++NL E++ LG++K+SA+IL + G ++ A + G ALE FQ TG SL+++DQ+ +A YQ KR+ G I DK +KTRA +W+ G +NK T +R E + LK ++PE+R+ L+K NEARAA+ R EKY D + ++S+E AY KK + D +++ PA KKEL RF ++++ YQ+LK+ N P KP A S + ++VQ I+K +K + A+ +NN NN + +H + VT+T APGVF ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>20</Hit_num> + <Hit_id>gi|431809132|ref|YP_007236029.1|</Hit_id> + <Hit_def>phage baseplate hub [Yersinia phage phiR1-RT] >gi|398313421|emb|CCI88770.1| phage baseplate hub [Yersinia phage phiR1-RT]</Hit_def> + <Hit_accession>YP_007236029</Hit_accession> + <Hit_len>582</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>298.516</Hsp_bit-score> + <Hsp_score>763</Hsp_score> + <Hsp_evalue>2.81533e-89</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>582</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>217</Hsp_identity> + <Hsp_positive>334</Hsp_positive> + <Hsp_gaps>46</Hsp_gaps> + <Hsp_align-len>602</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLID-------SVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDE---PE---SPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPT--DMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRI--EEGPGIIDKAGEFKTRAFDWVL--GRENKIDSTQASDRDQ--ETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKY--QKLKEDNTPKPAAPATSEDNQRVQNIQK-AENAKEQSKKSTGDMNVANT--QVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKQPSQQNSFRRKVIEDSKPERDAASAANSQSTSLDSIDSKLSDVQAASELTSEVVEAKTDQLIDTIGQLKGSVQDVQAASELAVDAIGDSNSYLKSIDTVSQAINAKLAQLTSMLEAKFG--DQLAPLNAPNPVSG------ALPEPVPVVLPEDFIGPMLP--TVPDTDPNEEVLPEPPRREPEPKSEEDKKSSSEGDEKNTISEKLDLLIRTTQSGFKTAVGYSDKISNMLFKFTLTAIAQAAKTAAMILGIILAIDVIKANFTFWAEKFSTNFTEFAERAKEWGPLIESVVGMVRNISDAWNSDDPLGIIKAIAFGLSDITKQLADLLGLAVAKLTAGILRALGFNDKADALEGSYLKGYQDRTGSVMSEGHQKLIAKADNQKIKDEHDTTAYDQFKGMDQRGYDQAYKNGSMSK-DTYEALSKGEAKASDPLQGLSEEERLNVIIKRNEAQAAINRTKDYSTKIDPNNEREVNSLNKALADIKSRLDDPEISKIPESKSDLTRQFNELNNKTSANKLK---------PAPIAENQEVQTTKRVAELQKQNDTQSVNKGPTQNTVVQANTTNTSRTMYNMPPTTNIPAPGMRAALGTN</Hsp_hseq> + <Hsp_midline>MK + RRKVI DSK ERDAAS A+ Q SL+ I KL DVQ+A+EL +EV+E K + LID SV +V +ELA +A + +K++ V+ I+ KL++L SMLE+K +Q + + +G LP+P PE P LP +P D N + E + E +KK + D+K T + L L++TT+ GFK + +DKIS+MLFK+T+TA+A+AAK AAM+ ++L ID+++ +F +W +KF +NF EF+ A EWG L++S+ GM+ +I W + D G+ AI GL+D+ L++++ L ++K++A IL ALGF + A + GS L+G+Q+RTG+ +SE QK +AK +++I E D+ R +D G +K D+ +A + + + L+ ++ E+R +IK+NEA+AA+ R + Y +DP N + SL KA K + D IS P +K +L ++F + +K KLK PA +NQ VQ ++ AE K+ +S NT Q N N S+T++ + T PAPG+ A G N</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>21</Hit_num> + <Hit_id>gi|422934216|ref|YP_007004252.1|</Hit_id> + <Hit_def>baseplate hub subunit [Enterobacteria phage Bp7] >gi|345450725|gb|AEN93928.1| baseplate hub subunit [Enterobacteria phage Bp7]</Hit_def> + <Hit_accession>YP_007004252</Hit_accession> + <Hit_len>578</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>269.24</Hsp_bit-score> + <Hsp_score>687</Hsp_score> + <Hsp_evalue>3.573e-78</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>578</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>204</Hsp_identity> + <Hsp_positive>331</Hsp_positive> + <Hsp_gaps>54</Hsp_gaps> + <Hsp_align-len>604</Hsp_align-len> + <Hsp_qseq>MKSE-NMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVD-------NVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLP---ERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKP-TDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEG--------------PGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKK-ELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKNESNQNSFRRNKLIEEMAPQRRAEALAQTQNDELGNISDVLSDSQAASELLSEVVETKSNQIISSVDRVDKSVQDVVAGTELTAEAISEQTQQSKALS---DALNEKINKLSNMLEAKFSGI--SIPPEGSS----LKVIEDSIPEEPKAETPKVPAVVEDILPPED---NKPDAEFMP----EPPKNSDEGKEGDKTSLSDKIEALTKITEKGFKASIGVADRISGMLFKYTITAAAEAAKLAAGLVLLIFGIDAIRVYFQYFMDQFESGWKEFNDKFKEWGPLLEGLMTWAKNAEAMFSEGNWLGLAEAIIRGMVNLTKNMAQLLMLGISKLISAILSKIPGMGELAENVEASALMSYQQNTGATLDVEDQTKVAKYHDRRSAEALETAEKMNKKYKDKPELINQAEKYGN------LTKE-QADQLRAGGIDTSFRDLPE---EERLEYFKKRDKAQADIIRLTQTADNIMKPDSKDIENAKAFKADIEKQLADPIMAKGGAPKDLNIQQLLDKMNKSLEKFNEAEKPKPASVAESPENTQVKKVDEQMRAKENAKYSQQAPTQINQQTNIKKTSKTSYNLPPQSSTPAPGMRQATKVN</Hsp_hseq> + <Hsp_midline>MK+E N ++ RR K+I + +R A + A Q D L I L D Q+A+EL++EV+E K N +I SVD +V GTEL AEA T+ K L+ +++K++KL++MLE+K + + G+S L VIED +P+ + E+P +P E ILPP D N PD +F P EP N + K+ DK +D + L K T+ GFKA+I + D+IS MLFKYT+TA AEAAK+AA L L+ GID +R++F+Y+ D+F S + EF+ + EWG LL+ + + + + G+W GLA AI++G+ ++ N+++++ LGISK+ ++IL + G A + SAL +Q+ TG +L +DQ +AKY +R E P +I++A ++ L +E + D +A D ++L E+R E K+++A+A + RL + ++ + ++++ + +K ++D ++ A K + + ++ +K E PKPA+ A S +N +V+ + + AKE +K S N Q N SKT + + ++TPAPG+ AT VN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>22</Hit_num> + <Hit_id>gi|314121771|ref|YP_004063890.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit tail length determinator [Enterobacteria phage vB_EcoM-VR7] >gi|313151528|gb|ADR32584.1| gp29 baseplate hub subunit tail length determinator [Enterobacteria phage vB_EcoM-VR7]</Hit_def> + <Hit_accession>YP_004063890</Hit_accession> + <Hit_len>581</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>269.24</Hsp_bit-score> + <Hsp_score>687</Hsp_score> + <Hsp_evalue>3.63307e-78</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>581</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>186</Hsp_identity> + <Hsp_positive>328</Hsp_positive> + <Hsp_gaps>58</Hsp_gaps> + <Hsp_align-len>606</Hsp_align-len> + <Hsp_qseq>KSENMSTMRR----RKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVD-------NVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEG--------------PGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKK-ELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KNSEQTSFRRGGPNKKLIEELAPQRRAEALSAEQNDELSNLNTTLTNTQAATELVSEAIEDKGNQIIENIQTNNGVLQDISAGVELTAEATEKTQQGIKNLTDI---LSDKLDKLSAMISGKIGVT------SPVAGSESLKPVEDALPEPEENKPTASVPALIPPEEQK---PDADFIPE-PEQPKTDAEGKETNTWSLGDKLDTLSKITEKGFKASISVADRISGMLFKYTITAAAEAAKLIGGLLLLVFGIDAIRVYFQYFMKQFEKGWAEFNDKFKEWGPLLEGLMTWAKNAEAMFSERNWLGLAEAIIRGMVNLTKNMAQLLMLGISKLISAILSKIPGMGDLADNVEASALMSYQQNTGATLDDEDQTKIAKYHDKRSAEAMKTAEKMNKKYKDKPELINQAEKYGN------LTKE-QADQLRAGGIDTSFRDLPE---EERLDYFKKRDKTQADIIRLTQTADNLMKPDATDKKNAEASYKAIQEQLADPVMAKGGAPKDLNMHALLEKLDKSLEKFKDEPKVKPPDVKASPDAQQAAKVDEGMKAKENKYKDAP----ANAQINTVNNIQKTSRTQYNMPPQSSTPAPGMRQATRIN</Hsp_hseq> + <Hsp_midline>K+ ++ RR +K+I + +R A + +++Q D L + L + Q+A ELV+E IE+KGN +I+++ +++ G EL AEA+E+T + IK LT + +SDKL KL++M+ K+ S + S L +ED LP+P+E + ++PP + PD DF P P++P+ + + ++ + D L L K T+ GFKA+IS+ D+IS MLFKYT+TA AEAAK+ L LV GID +R++F+Y+ +F + EF+ + EWG LL+ + + + + +W GLA AI++G+ ++ N+++++ LGISK+ ++IL + G + A + SAL +Q+ TG +L ++DQ +AKY KR E P +I++A ++ L +E + D +A D ++L E+R + K+++ +A + RL + ++ + T+ ++ E +Y + ++ ++D ++ A K + ++++ +K K++ KP S D Q+ + + AKE K AN Q+N VNN S+T + + ++TPAPG+ AT +N</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>23</Hit_num> + <Hit_id>gi|299779140|ref|YP_003734334.1|</Hit_id> + <Hit_def>29 gene product [Enterobacteria phage IME08] >gi|298105869|gb|ADI55513.1| gp29 baseplate hub subunit [Enterobacteria phage IME08]</Hit_def> + <Hit_accession>YP_003734334</Hit_accession> + <Hit_len>578</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>266.929</Hsp_bit-score> + <Hsp_score>681</Hsp_score> + <Hsp_evalue>2.99001e-77</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>578</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>203</Hsp_identity> + <Hsp_positive>335</Hsp_positive> + <Hsp_gaps>56</Hsp_gaps> + <Hsp_align-len>605</Hsp_align-len> + <Hsp_qseq>MKSE-NMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVD-------NVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLP---ERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKP-TDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEG--------------PGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNS-AKKSISDSAISDQPATKK-ELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKNESNQNSFRRNKLIEEMAPQRRAEALAQTQNDELGNISDVLSDSQAASELLSEVVETKSNQIISSVDRVDKSVQDVVAGTELTAEAISEQTQQSKALS---DALNEKINKLSNMLEAKFSGI--SIPPEGSS----LKVIEDSIPEEPKAETPKVPAVVEDILPPED---NKPDAEFVP----EPPKNSDEGKEGAKSPLSEKIEALTKITEKGFKASVGVADRISGMLFKYTITAAAEAAKLAAGLVLLIFGIDAIRVYFQYFMDQFEAGWKEFNDKFKEWGPLLEGLMTWAKNAEAMFSEGNWLGLAEAIIRGMVNLTKNMAQLLMVGISKLISAILSKIPGMGELAENVEASALMSYQQNTGATLDDEDQTKVAKYHDRRSAEALETAEKMNKKYKNKPELINQAEKYGN------LTKE-QADQLRAGGIDTSFRDLPE---EERLEYFKKRDKAQADIIRLTQTADNIMKPDSKDIENA-KAYKADIEKQLADPIMAKGGAPKDLNIQQLLDKMNKSLEKFNEAEKPKPASVAESPENTQVKKVDEQMRAKENAKYSQQAPTQINQQTNIKKTSKTSYNLPPQSSTPAPGMRQATKVN</Hsp_hseq> + <Hsp_midline>MK+E N ++ RR K+I + +R A + A Q D L I L D Q+A+EL++EV+E K N +I SVD +V GTEL AEA T+ K L+ +++K++KL++MLE+K + + G+S L VIED +P+ + E+P +P E ILPP D N PD +F P EP N + K+ K P ++ + L K T+ GFKA++ + D+IS MLFKYT+TA AEAAK+AA L L+ GID +R++F+Y+ D+F + + EF+ + EWG LL+ + + + + G+W GLA AI++G+ ++ N+++++ +GISK+ ++IL + G A + SAL +Q+ TG +L ++DQ +AKY +R E P +I++A ++ L +E + D +A D ++L E+R E K+++A+A + RL + ++ + ++++ KAY + +K ++D ++ A K + + ++ +K E PKPA+ A S +N +V+ + + AKE +K S N Q N SKT + + ++TPAPG+ AT VN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>24</Hit_num> + <Hit_id>gi|308814556|ref|YP_003934830.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Shigella phage SP18] >gi|308206148|gb|ADO19547.1| baseplate hub subunit tail length determinator [Shigella phage SP18]</Hit_def> + <Hit_accession>YP_003934830</Hit_accession> + <Hit_len>581</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>265.388</Hsp_bit-score> + <Hsp_score>677</Hsp_score> + <Hsp_evalue>1.10381e-76</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>581</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>188</Hsp_identity> + <Hsp_positive>331</Hsp_positive> + <Hsp_gaps>60</Hsp_gaps> + <Hsp_align-len>607</Hsp_align-len> + <Hsp_qseq>KSENMSTMRR----RKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVD-------NVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEG--------------PGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKR--FQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNN----SKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KNSEQTSFRRGGPNKKLIEELAPQRRAEALSAEQNDELSNLNTTLTNTQAATELVSEAIEDKGNQIIENIQTNNGVLQDISAGVELTAEATEKTQQGIKNLTDI---LSDKLDKLSAMISGKLGVT------SPVAGSESLKPVEDALPEPEENKPTASVPTLIPPEEQK---PDADFIPE-PEQPKTDAEGKETNTWSLGDKLDTLSKITEKGFKASISVADRISGMLFKYTITAAAEAAKLIGGLLLLVFGIDAIRVYFQYFMKQFEKGWAEFNDKFKEWGPLLEGLMTWAKNAQAMFSEKNWLGLAEAIIRGMVNLTKNMAQLLMLGISKLISAILSKIPGMGDLADNVEASALMSYQQNTGATLDDEDQTKIAKYHDKRSAEAMEATEKMNKKYKDKPELINQAEKYGN------LTKE-QADQLRAGGIDTSFRDLPE---EERLDYFKKRDKAQADIIRLTQTADNLMKPDATDKKNAMEMRANIEKQLADPSMAKGGAP-KDLNMRALLEKLDKSLEKFKDEPKVKPPDVKTSPDAQQAAKVDEGMKAKENKYKDAP----AQAQINTVNNIQKTSRTQYNMPPQSSTPAPGMRQATRIN</Hsp_hseq> + <Hsp_midline>K+ ++ RR +K+I + +R A + +++Q D L + L + Q+A ELV+E IE+KGN +I+++ +++ G EL AEA+E+T + IK LT + +SDKL KL++M+ K+ S + S L +ED LP+P+E + ++PP + PD DF P P++P+ + + ++ + D L L K T+ GFKA+IS+ D+IS MLFKYT+TA AEAAK+ L LV GID +R++F+Y+ +F + EF+ + EWG LL+ + + + + +W GLA AI++G+ ++ N+++++ LGISK+ ++IL + G + A + SAL +Q+ TG +L ++DQ +AKY KR E P +I++A ++ L +E + D +A D ++L E+R + K+++A+A + RL + ++ + T+ ++ + + +K ++D +++ A K+L+ R ++++ +K K++ KP TS D Q+ + + AKE K A Q+N VNN S+T + + ++TPAPG+ AT +N</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>25</Hit_num> + <Hit_id>gi|238695345|ref|YP_002922538.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit, tail length determinator [Enterobacteria phage JS10] >gi|220029481|gb|ACL78415.1| gp29 baseplate hub subunit, tail length determinator [Enterobacteria phage JS10]</Hit_def> + <Hit_accession>YP_002922538</Hit_accession> + <Hit_len>578</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>262.692</Hsp_bit-score> + <Hsp_score>670</Hsp_score> + <Hsp_evalue>1.03696e-75</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>578</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>198</Hsp_identity> + <Hsp_positive>334</Hsp_positive> + <Hsp_gaps>52</Hsp_gaps> + <Hsp_align-len>603</Hsp_align-len> + <Hsp_qseq>MKSE-NMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVD-------NVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLP---ERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEG--------------PGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKK-ELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKNESNQNSFRRNKLIEEMAPQRRAEALAQTQNDELGNITEALSETQAASELLSEVVETKSNQIINSIDRVDKSVQDVVAGTELTAEAISEQTQQSKALS---DALNEKINKLSNMLEAKFSGI--SIPPEGSS----LKVIEDSIPEEPKAETPKVPAVIEDILPPED---NKPDAEF---VPEPPKNSDEGKEGDKSSLSDKIEALTKITEKGFKASIGVADRISGMLFKYTITAAAEAAKLAAGLALLIFGIDAIRVYFQYFMDQFNEGWKKFNDKFKEWGPLLEGLMTWAKNAEAMFSERNWLGLAEAIIRGMVNLTKNMAQLLMLGISKLISAILSKIPGMGDLAENVEASALMSYQQNTGATLDDEDQTKVAKYHDRRSAEALETAEKMNKKYKGKPELINQAEKYGN------LTKE-QADQLRAGGIDTSFRNLPE---EERLEYFKKRDKAQADIIRLTQTADNIMKPDSKDIENAKAFKADIEKQLADPIMAKGGAPKDLNIQQLLDKMNKSLEKFNEAEKPKPASVAESPENTQVKKVDEQMRAKENAKYSQQAPTQINQQTNIKKTSKTSYNLPPQSSTPAPGMRQATKVN</Hsp_hseq> + <Hsp_midline>MK+E N ++ RR K+I + +R A + A Q D L I L + Q+A+EL++EV+E K N +I+S+D +V GTEL AEA T+ K L+ +++K++KL++MLE+K + + G+S L VIED +P+ + E+P +P E ILPP D N PD +F VP+ P+N+ + ++ D +D + L K T+ GFKA+I + D+IS MLFKYT+TA AEAAK+AA L L+ GID +R++F+Y+ D+F + +F+ + EWG LL+ + + + + +W GLA AI++G+ ++ N+++++ LGISK+ ++IL + G + A + SAL +Q+ TG +L ++DQ +AKY +R E P +I++A ++ L +E + D +A D +NL E+R E K+++A+A + RL + ++ + ++++ + +K ++D ++ A K + + ++ +K E PKPA+ A S +N +V+ + + AKE +K S N Q N SKT + + ++TPAPG+ AT VN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>26</Hit_num> + <Hit_id>gi|161622623|ref|YP_001595318.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit tail length determinator [Enterobacteria phage JS98] >gi|52139948|gb|AAU29318.1| gp29 baseplate hub subunit tail length determinator [Enterobacteria phage JS98]</Hit_def> + <Hit_accession>YP_001595318</Hit_accession> + <Hit_len>578</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>259.225</Hsp_bit-score> + <Hsp_score>661</Hsp_score> + <Hsp_evalue>1.72858e-74</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>578</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>196</Hsp_identity> + <Hsp_positive>334</Hsp_positive> + <Hsp_gaps>52</Hsp_gaps> + <Hsp_align-len>603</Hsp_align-len> + <Hsp_qseq>MKSE-NMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVD-------NVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLP---ERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEG--------------PGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKK-ELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MKNESNQNSFRRNKLIEEMAPQRRAEALAQTQNDELGNITEALSETQAASELLSEVVETKSNQIINSIDRVDKSVQDVVAGTELTAEAISEQTQQSKALS---DALNEKINKLSNMLEAKFSGI--SIPPEGSS----LKVIEDSIPEEPKAETPKVPAVIEDILPPED---NKPDAEF---VPEPPKNSDEGKEGDKSSLSDKIEALTKITEKGFKASIGVADRISGMLFKYTITAAAEAAKLAAGLALLIFGIDAIRVYFQYFMDQFNEGWKKFNDKFKEWGPVLEGLMTWAKNAEAMFSERNWLGLAEAIIRGMVNLTKNMAQLLMLGISKLISAILSKIPGMGDLAENVEASALMSYQQNTGATLDDEDQTKVAKYHDRRSAEALETAEKMNKKYKGKPELINQAEKYGN------LTKE-QADQLRAGGIDTSFRDLPE---EERLEYFKKRDKAQADIIRLTQTADNIMKPDSKDIENAKAFKADIEKQLADPIMAKGGAPKDLNIQQLLDKMNKSLEKFNEAEKPKPASVAESPENTQVKKVDEQMRAKENAKYSQQAPTQINQQTNIKKTSKTSYNLPPQSSTPAPGMRQATKVN</Hsp_hseq> + <Hsp_midline>MK+E N ++ RR K+I + +R A + A Q D L I L + Q+A+EL++EV+E K N +I+S+D +V GTEL AEA T+ K L+ +++K++KL++MLE+K + + G+S L VIED +P+ + E+P +P E ILPP D N PD +F VP+ P+N+ + ++ D +D + L K T+ GFKA+I + D+IS MLFKYT+TA AEAAK+AA L L+ GID +R++F+Y+ D+F + +F+ + EWG +L+ + + + + +W GLA AI++G+ ++ N+++++ LGISK+ ++IL + G + A + SAL +Q+ TG +L ++DQ +AKY +R E P +I++A ++ L +E + D +A D ++L E+R E K+++A+A + RL + ++ + ++++ + +K ++D ++ A K + + ++ +K E PKPA+ A S +N +V+ + + AKE +K S N Q N SKT + + ++TPAPG+ AT VN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>27</Hit_num> + <Hit_id>gi|311992691|ref|YP_004009559.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit [Acinetobacter phage Ac42] >gi|298684474|gb|ADI96435.1| gp29 baseplate hub subunit [Acinetobacter phage Ac42]</Hit_def> + <Hit_accession>YP_004009559</Hit_accession> + <Hit_len>569</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>227.639</Hsp_bit-score> + <Hsp_score>579</Hsp_score> + <Hsp_evalue>7.65187e-63</Hsp_evalue> + <Hsp_query-from>1</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>1</Hsp_hit-from> + <Hsp_hit-to>569</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>183</Hsp_identity> + <Hsp_positive>306</Hsp_positive> + <Hsp_gaps>91</Hsp_gaps> + <Hsp_align-len>618</Hsp_align-len> + <Hsp_qseq>MKSENMSTMRRRKVIADSKGERDAASTASDQVDSLEL---------IGLKLDDVQSANELVAEVIE------EKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDE--PESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGDLLKTT-KGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGE------------WGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGI----SKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGP-----GIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPAT-SEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNN--VNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>MAQQSLKSEVRDRVLAKSASLRDARKQIIDKANSQTLKPQESPQEAVQTPIDDLSPVSSTMSQALQQSSTSNEIGRASLDELHNISESSKL---------------------INQRLQKLSTLLESKFVNAETKPVELNERA---VDVIKDYVEKPEQKVPEPNPIP-KLLPGIEYTSSLGD-------TKDDQSKTVDQKE---KREDANGTGVKSILKTGFGKTVSVIDRISGFLFKYTLSAAIASAKIVGGLFALILGFDLLRIHFKYWGEKLMEKFDQISDWFGENISAPFNALLERWTPVFESIMDSVGFVKRAWENGDWG----ALISGIGSAIDTATTSLLVGIQSALAKLGAAILDKLGFKDAADNLEGAAIQNKQNHTDAVLSDKEKIALAEYQKKNIEKGEAPSRGGITSFLPDSWRKNLDLITEQ----DYNQIKAEEKDMGRLKSMSSDDQTKVLIKNNEAKDALDRYAEAGRKLDVNNEQDKARLNKLYNEASTRVKDKDLSNTPEVQKHLEGRLERIKNSINAKKVKVEPAPSNESKDATTASRIQAIDSKKNS------SAGNGNASNTNVQNNIVKSNRQINIQAPVTSSNAPGIFKATSAN</Hsp_hseq> + <Hsp_midline>M +++ + R +V+A S RDA D+ +S L + +DD+ + +++ ++ E G +D + N++E ++L I+ +L KL+++LESK E K E A + VI+D + P++ PE +P ++LP ++ ++L D ++ ++ DQK+ K D G +K+ K GF T+S+ D+IS LFKYT++A +AK+ LFAL+LG DLLRIHFKYW +K M FD+ S GE W + +SI +G +K+ WE GDW A++ G+ I + + +GI +K+ A+ILD LGF++AA + G+A++ Q T LS+ ++ ALA+YQ K IE+G GI + + D + + D Q +++ LK+M+ + + + LIK NEA+ A+ R + +D N + L K YN A + D +S+ P +K L+ R +R+++ K P P+ + + R+Q I +N+ S G+ N +NT V N V +++ I+ VT++ APG+F AT N</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>28</Hit_num> + <Hit_id>gi|639438514|ref|YP_009030254.1|</Hit_id> + <Hit_def>baseplate hub subunit, tail length determinator [Serratia phage PS2] >gi|625370587|gb|AHY25447.1| baseplate hub subunit, tail length determinator [Serratia phage PS2]</Hit_def> + <Hit_accession>YP_009030254</Hit_accession> + <Hit_len>572</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>211.46</Hsp_bit-score> + <Hsp_score>537</Hsp_score> + <Hsp_evalue>6.69261e-57</Hsp_evalue> + <Hsp_query-from>42</Hsp_query-from> + <Hsp_query-to>570</Hsp_query-to> + <Hsp_hit-from>35</Hsp_hit-from> + <Hsp_hit-to>566</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>158</Hsp_identity> + <Hsp_positive>276</Hsp_positive> + <Hsp_gaps>33</Hsp_gaps> + <Hsp_align-len>547</Hsp_align-len> + <Hsp_qseq>LDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDP-DEPESPGLPERILP-PL-DDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDAL-GFENAATTIRGSALEGFQERTGNSLSEDD-QKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGREN------KIDSTQAS--DRDQETQNLKAMAPEK----REETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKY-QKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVF</Hsp_qseq> + <Hsp_hseq>LDDIVEANELIADRVEDNTNRSVAAQEDSTAATELVAENTEHGNKHLSNIADTARQISSKLSEFADRLNSKIEASVQSGLPAIGNQATAIQAIEEQINTPLNEEVLADAIEKLLPMPVKSETDVFPEPEKPKEPEQNPQEDKREEERKDKEKSQASEKILSAVKGGFKSTYGLLNNIAGSLFKYTITAAANMLKWAGIMFAIVFAIDLIRVHFKYWQKVFEKSLDELNEQVGAWGPILTDIFNTAQEMRDYWAKGQYGDLVTSLVQGIGRTLLDLGHMIMFGIGKAIASMLDAIPGMSETAKKVEGRAIRTYSEQTGYVPDEEERQKVIAVEKYDQGQQYKDLKDEANKYTEDQFVKKTGNRGFLNDGISLNETQARQIHKDIRSGKLKDSDIEKEIGIQADLAMRMNTIENRVQRTSG--------SPSTNAELMDNLSKLAKDIGNADI--QSYMKEPLQERVQKMESALAERTKPKVTPKPAAE--SAEATQVKEVEATIKPKETASTNAG---TTLNNINNVRNSRTVVQVQPRSSIPSGGIM</Hsp_hseq> + <Hsp_midline>LDD+ ANEL+A+ +E+ N + + ++ TEL AE +E + + + A IS KLS+ A L SK++A Q + + +T + IE+++ P +E E++LP P+ + + P+ + Q P+ +K+++++ DK+ + +L KGGFK+T + + I+ LFKYT+TA A K A ++FA+V IDL+R+HFKYW F + DE + + G WG +L IF +++ +W G + L ++V+G+ + +L ++ GI K AS+LDA+ G A + G A+ + E+TG E++ QK +A + + ++ + D+A ++ F G ++ TQA +D + LK EK + + ++ N VQR +P+ L + K I ++ I Q K+ L +R Q++ES ++ K TPKPAA S + +V+ ++ KE + + G +NNV NS+T+ QVQ ++ P+ G+ </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>29</Hit_num> + <Hit_id>gi|238695064|ref|YP_002922258.1|</Hit_id> + <Hit_def>tail length regulator [Enterobacteria phage JSE] >gi|220029200|gb|ACL78135.1| tail length regulator [Enterobacteria phage JSE]</Hit_def> + <Hit_accession>YP_002922258</Hit_accession> + <Hit_len>577</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>204.527</Hsp_bit-score> + <Hsp_score>519</Hsp_score> + <Hsp_evalue>2.33408e-54</Hsp_evalue> + <Hsp_query-from>22</Hsp_query-from> + <Hsp_query-to>570</Hsp_query-to> + <Hsp_hit-from>13</Hsp_hit-from> + <Hsp_hit-to>572</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>167</Hsp_identity> + <Hsp_positive>299</Hsp_positive> + <Hsp_gaps>83</Hsp_gaps> + <Hsp_align-len>596</Hsp_align-len> + <Hsp_qseq>RDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDN-------VAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQE-PE-NNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIY----NLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEF-----KTRAFDWVLGR--ENKIDSTQASD--RDQETQNLKAMAPEKREETLIK------QNEARAAVQRL---------------EKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDN----QRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVF</Hsp_qseq> + <Hsp_hseq>KEAEENPIDKLNKLDKLN-SIDNLQAATELVAETVEQKSNEVVGAVEDNTAANELTAENTQSTAGNTQKTYEELQKLNNFSSQMNEKLRGFGVMMERRFGVV--------SKMASGIGAIEEALKKPEQPQTMPSPQPVLPTVPEQ---PNNDNYQGLPKKKPDADDRKKKNATDKRNADSMENLLKVVRGGFKETIGISNKVLGMLFKITLTAMAEAAKWGAILMGIVFVIDTLMVHFRYWSDLFETKFNEFMDKAGGWAGPISDILTTVRQVRDYWSKGEYGELIKSLVMGIGDAFYKTFIQLDRIITTGIAKILRMI---PGMGDYADKLEYGALKSAVAQ-GYTPNERELELMDKVESEHEE------DKYGERTGWTGKARDIGEAIGESIKDKVNEGLVSLGWRDQ-----KDVDAEKRQEELKRGEYKSVSAEQRSASRKLRIKSEGAINNINEVMENLSGDYDKE---RMGELKKDIDVYREQVQDPTLVE--SDRSQLERLIEKFDEMYADKTKGVVPTKSVPATETETAKQAERTEQMQKQAAIQQQTTNQTS--NVNNTQI--VTNNRTIKQGAPTTRIDAPGTI</Hsp_hseq> + <Hsp_midline>++A D+++ L+ + +D++Q+A ELVAE +E+K N ++ +V++ AE T+ A +++T E ++ L +S +++KL M+E + V + ++G+ IE+ L P++P++ P+ +LP + + P+ D + +P++ P+ +++K + DK+ D + +LLK +GGFK TI I++K+ MLFK T+TA+AEAAK A+L +V ID L +HF+YW+D F + F+EF +AG W G + I + ++ +W G++ L ++V G+ D Y L I++ GI+KI I G + A + AL+ + G + +E + + + K +S+ E DK GE K R +G ++K++ S RDQ K + EKR+E L + E R+A ++L E GD D E M L+K + ++ + D + + + + +L++ ++ + Y + P + PAT + +R + +QK ++Q+ T NV NTQ+ V N++TI Q T APG </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>30</Hit_num> + <Hit_id>gi|157311483|ref|YP_001469526.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit tail length determinator [Enterobacteria phage Phi1] >gi|149380687|gb|ABR24692.1| gp29 baseplate hub subunit tail length determinator [Enterobacteria phage Phi1]</Hit_def> + <Hit_accession>YP_001469526</Hit_accession> + <Hit_len>577</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>200.675</Hsp_bit-score> + <Hsp_score>509</Hsp_score> + <Hsp_evalue>5.33273e-53</Hsp_evalue> + <Hsp_query-from>42</Hsp_query-from> + <Hsp_query-to>570</Hsp_query-to> + <Hsp_hit-from>32</Hsp_hit-from> + <Hsp_hit-to>572</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>163</Hsp_identity> + <Hsp_positive>286</Hsp_positive> + <Hsp_gaps>82</Hsp_gaps> + <Hsp_align-len>576</Hsp_align-len> + <Hsp_qseq>LDDVQSANELVAEVIEEKGNNLIDSVDNVAEGTELAAEASE-------RTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQE-PE-NNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIY----NLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEF-----KTRAFDWVLGR--ENKIDSTQASD--RDQETQNLKAMAPEKREETLIK------QNEARAAVQRL---------------EKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDN----QRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVF</Hsp_qseq> + <Hsp_hseq>IDNLQAATELVAETVEQKSNEVVGAVEDNTAANELTAENTQSTAGNTKKTYEELQKLNNFSSQMNEKLRGFGVMMERRFGVV--------SKMASGIGAIEEALKKPEQPQTMPSPQPVLPTVPEQ---PNNDNYQGLPKKKPDVDDRKKKNAADKRNADSMENLLKVVRGGFKETIGISNKVLGMLFKITLTAMAEAAKWGAILMGIVFVIDTLMVHFRYWSDLFETKFNEFMDKAGGWAGPISDILTTVRQVRDYWSKGEYKELIKSLVMGIGDAFYKTFIQLDRIITTGIAKILRMI---PGMGDYADKLEYGALKSAVAQ-GYTPNERELELMDKVESEHEE------DKYGERTGWTGKARDIGEAIGDSIKDKVNEGLVSLGWRDQ-----KDVDAEKRQEELKRGEYKSVSAEQRSASRKLRIKSEGAINNINEVMENLSGDYDKE---RMGELKKDIDVYREQVQDPTLVE--SDRSQLERLIEKFDEMYADKTNGVVPTNPVPATETETAKQAERTEQMQKQAAIQQQTTNQTS--NVNNTQI--VTNNRTVKQGAPTTRIDAPGTI</Hsp_hseq> + <Hsp_midline>+D++Q+A ELVAE +E+K N ++ +V++ EL AE ++ +T E ++ L +S +++KL M+E + V + ++G+ IE+ L P++P++ P+ +LP + + P+ D + +P++ P+ +++K + DK+ D + +LLK +GGFK TI I++K+ MLFK T+TA+AEAAK A+L +V ID L +HF+YW+D F + F+EF +AG W G + I + ++ +W G++ L ++V G+ D Y L I++ GI+KI I G + A + AL+ + G + +E + + + K +S+ E DK GE K R +G ++K++ S RDQ K + EKR+E L + E R+A ++L E GD D E M L+K + ++ + D + + + + +L++ ++ + Y P PAT + +R + +QK ++Q+ T NV NTQ+ V N++T+ Q T APG </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>31</Hit_num> + <Hit_id>gi|33620639|ref|NP_891750.1|</Hit_id> + <Hit_def>tail length regulator [Enterobacteria phage RB49] >gi|33438535|gb|AAL15120.2| tail length regulator [Enterobacteria phage RB49]</Hit_def> + <Hit_accession>NP_891750</Hit_accession> + <Hit_len>577</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>200.675</Hsp_bit-score> + <Hsp_score>509</Hsp_score> + <Hsp_evalue>5.38583e-53</Hsp_evalue> + <Hsp_query-from>42</Hsp_query-from> + <Hsp_query-to>570</Hsp_query-to> + <Hsp_hit-from>32</Hsp_hit-from> + <Hsp_hit-to>572</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>164</Hsp_identity> + <Hsp_positive>284</Hsp_positive> + <Hsp_gaps>82</Hsp_gaps> + <Hsp_align-len>576</Hsp_align-len> + <Hsp_qseq>LDDVQSANELVAEVIEEKGNNLIDSVDN-------VAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQE-PE-NNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIY----NLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEF-----KTRAFDWVLGR--ENKIDSTQASD--RDQETQNLKAMAPEKREETLIK------QNEARAAVQRL---------------EKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDN----QRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVF</Hsp_qseq> + <Hsp_hseq>IDNLQAATELVAETVEQKSNEVVGAVEDNTAANELTAENTQSTAGNTQKTYEELQKLNNFSSQMNEKLRGFGVMMERRFGVV--------SKMASGIGAIEEALKKPEQPQTMPSPQPFLPTVPEQ---PNNDNYQGLPKKKPDVDDRKKKNATDKRNADSMENLLKVVRGGFKETIGISNKVLGMLFKITLTAMAEAAKWGAILMGIVFVIDTLMVHFRYWSDLFETKFKEFMDKAGGWAGPISDILTTVRQVRDYWSKGEYKELIKSLVMGIGDAFYKTFIQLDRIITTGIAKILRMI---PGMGDYADNLEYGALKSAVAK-GYKPNERELELMDKVESEHEE------DKYGERTGWTGKARDIGEAIGESIKDKFNEGLVSLGWRDQ-----KDVDAEKRQEELKRGEYKSVSAEQRSASRKLKIKSEGAINNINEVMENLSGDYDKE---RMEELKKDIDVYREQVQDPTLVE--SDRSQLERLIEKFDEMYADKTNGVVPTNPVPATETETAKQAERTEQMQKQAAIQQQTTNQTS--NVNNTQI--VTNNRTIKQGAPTTRIDAPGTI</Hsp_hseq> + <Hsp_midline>+D++Q+A ELVAE +E+K N ++ +V++ AE T+ A +++T E ++ L +S +++KL M+E + V + ++G+ IE+ L P++P++ P+ LP + + P+ D + +P++ P+ +++K + DK+ D + +LLK +GGFK TI I++K+ MLFK T+TA+AEAAK A+L +V ID L +HF+YW+D F + F EF +AG W G + I + ++ +W G++ L ++V G+ D Y L I++ GI+KI I G + A + AL+ + G +E + + + K +S+ E DK GE K R +G ++K + S RDQ K + EKR+E L + E R+A ++L E GD D E M+ L+K + ++ + D + + + + +L++ ++ + Y P PAT + +R + +QK ++Q+ T NV NTQ+ V N++TI Q T APG </Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>32</Hit_num> + <Hit_id>gi|392973136|ref|YP_006489094.1|</Hit_id> + <Hit_def>baseplate hub subunit [Acinetobacter phage ZZ1] >gi|390058277|gb|AFL47731.1| baseplate hub subunit, tail length determinator [Acinetobacter phage ZZ1]</Hit_def> + <Hit_accession>YP_006489094</Hit_accession> + <Hit_len>585</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>195.667</Hsp_bit-score> + <Hsp_score>496</Hsp_score> + <Hsp_evalue>4.41683e-51</Hsp_evalue> + <Hsp_query-from>112</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>105</Hsp_hit-from> + <Hsp_hit-to>585</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>156</Hsp_identity> + <Hsp_positive>246</Hsp_positive> + <Hsp_gaps>32</Hsp_gaps> + <Hsp_align-len>489</Hsp_align-len> + <Hsp_qseq>KVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEP--ENNKKDQKKDDKKPTDMLGDLLKTTKG----GFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEW-----------GGLLQSIFGMLGD---IKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDK---AGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQT-VTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>KLAALSERLKEKYEAANDATVDLPVKAEEPTTSES--LSSRISPEDTNNNVIPSVVADDPKPSKDLLESTNEVKGAPSLGPAAMIVSGLQTLTGAVKTGFAKSKSVSDKIAGMLFKYTVTQAVNAAKIALAVFGIILALDLLKMAWNAWGEKIMAKFEEWTQTFSKWWDNFKEWSTYFSDMKYAFEGMQGDLMGIRNAWESGDWPALASAIGTAFVDGIKTLSGIMDRVITKLIATILNKLGFKDTAKSIEAEGLQRYQNMTNNKLDPENQQKLAEEQLKR-EKKDGLTSTQRGVTSFLPDSWREKLGFITKNEHSQIEAEKKDQKARQSLSKDDQVKVVAASNEAREAVARLENIAVNADPNNKGQMATLDKYRKEAQNYINNPALSKSPNVKAELQNQLDRLTPK-QSVK--NTVTPETSTASKDVQTAKNIQIAE--AQKAKTNAVQNNNTANVQNNIVKSSRQYNVQAPITGTAAPGIFKATGVN</Hsp_hseq> + <Hsp_midline>K+ A+ ++++E +A+ + K +P ES L RI P +NN +P P P + E+ + + P M+ L+T G GF + S++DKI+ MLFKYTVT AAK+A +F ++L +DLL++ + W +K M+ F+E++ +W + + GM GD I+ WE+GDW LA AI D I LS IM I+K+ A+IL+ LGF++ A +I L+ +Q T N L ++Q+ LA+ Q KR E+ G+ F ++ LG K + +Q ++ + ++++ + + + + NEAR AV RLE + DP N M +L+K A+ I++ A+S P K EL + R+ K Q +K NT P S+D Q +NIQ AE +++K + N NN+ S + VQ +T T APG+F ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>33</Hit_num> + <Hit_id>gi|326536335|ref|YP_004300776.1|</Hit_id> + <Hit_def>gp29 baseplate hub [Acinetobacter phage 133] >gi|299483416|gb|ADJ19510.1| gp29 baseplate hub [Acinetobacter phage 133]</Hit_def> + <Hit_accession>YP_004300776</Hit_accession> + <Hit_len>582</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>182.185</Hsp_bit-score> + <Hsp_score>461</Hsp_score> + <Hsp_evalue>1.85312e-46</Hsp_evalue> + <Hsp_query-from>75</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>84</Hsp_hit-from> + <Hsp_hit-to>582</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>164</Hsp_identity> + <Hsp_positive>246</Hsp_positive> + <Hsp_gaps>65</Hsp_gaps> + <Hsp_align-len>533</Hsp_align-len> + <Hsp_qseq>ELAAEASERTTESIKTLTGVASTISDK---LSKLASMLESKV-------QAVEQKVQESGASASTGLSVIED---KLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDE-------FSAEAGEWGGLLQSIFGMLGD----IKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEG-----PGIIDKAGEFKTRAFDWVLG--RENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>ELQQEAVEANTH----LEQIEKSTTDSNATLSKLSSQLESKFSGQVQSPQVVEHKTTEE---------IIKDFAEKSKSKTESTEPAILPAVLPEATKKPNLGGAT----TPKE-----QKAKSDSTKASHPAMKVFNVVKSGFKSVKSVGDKIAGFLFKGALTAAIEAAKMAGIIFLIIAAIDLVRIHFKYWTEKFSAKFDAVKEIIMGYFDRFGNWMESIMPMFSGLFDAIDYIRNVFAKGDWSALAGAIGNVMKEAFNSLGAMIQNGIAKLASILLRKFGFNDTADSIEAIGLENKQNMTNTPLTPEEQKKVAKQQQKMLDKDYTPTQTGIT----AFLPDKFRKAIGALSDGEYDQIQAEKKNM--SQLKGLNKEDQTNTIGAMNEARAALNRYENKVERLDPNDPNQAAKIDNAYKEAKTAISDPDLKNVPDVKIELENQLGKLQAKTGRAAPKPAPAANSPEAAQANSIA---RKTNEVKAPVAQAANNTNVNTTM---VKNNKSVHVQAPVTSTNAPGVFHGTGVN</Hsp_hseq> + <Hsp_midline>EL EA E T L + + +D LSKL+S LESK Q VE K E +I+D K E P + +LP NL P+E + K D K + + K GFK+ S+ DKI+ LFK +TA EAAKMA ++F ++ IDL+RIHFKYWT+KF + FD + G W + +F L D I+ + GDWS LA AI + + +L ++ GI+K+++ +L GF + A +I LE Q T L+ ++QK +AK Q K +++ GI F F +G + + D QA ++ LK + E + T+ NEARAA+ R E + +DP +P ++ AY AK +ISD + + P K EL+ + ++++K + P +P ++ N +K K ++ + NV T V N+K++H VT+T APGVF TGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>34</Hit_num> + <Hit_id>gi|311993473|ref|YP_004010338.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit [Acinetobacter phage Acj9] >gi|295917430|gb|ADG60101.1| gp29 baseplate hub subunit [Acinetobacter phage Acj9]</Hit_def> + <Hit_accession>YP_004010338</Hit_accession> + <Hit_len>572</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>172.17</Hsp_bit-score> + <Hsp_score>435</Hsp_score> + <Hsp_evalue>5.19477e-43</Hsp_evalue> + <Hsp_query-from>86</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>72</Hsp_hit-from> + <Hsp_hit-to>572</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>159</Hsp_identity> + <Hsp_positive>249</Hsp_positive> + <Hsp_gaps>58</Hsp_gaps> + <Hsp_align-len>525</Hsp_align-len> + <Hsp_qseq>ESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENN--KKDQKKDDKK-----PTDMLGDLLKTTKG-------GFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSA-EAGEWGGL---------LQSIF-GMLGD---IKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFD-----WVLGRENKIDSTQASDRDQ-ETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>EETKYLSNTADEISAKLSVLSERLKVKYDAASPDAPPVVRDNSTA-EVLADRL-DAQSEEQPKKQAWMPQPM-------------PVEKKPSDDLLSKSEDKGSKEGVKGAPNESTIPMIAAVKGVGSVVKAGFNKSIGIVDKISNLLFKMSVKQIADAALMGAAIFGIILSIDLLKAAWAAWGEKIMAKVEEWTTIFKGWWEGFKGWASSFSDLTTAFEGMRGDFMGIRNAWESGDWPSLAKALGTTIKDGLMTLSGILDRLFTKVLSTILDKVGLGKAAKAVEAEGLQRYQGKTNNKLSDENQKKLAEEQIRR-EKKDGLTPTQRGLTSFLPDKMRKGWAL-TDNEYNQIQAEKKDKAATKNL---SHDDQVKVTAATNEAREAVARFKNIADNYDPNKKDQAAQFDKYKKEAQAYISKPELAKSPAVKAELEAQVAAI-SKGKGGKASVAPEKS--ANSQDSGTVKNIKVAEAQRAANKNASPAGNTV-IQTNVAKTNKNVHVQAPVTSTTAPGVYGATKVN</Hsp_hseq> + <Hsp_midline>E K L+ A IS KLS L+ L+ K A ST V+ D+L D E P + P+ PV ++P ++ K + K K+ P + ++ KG GF +I I DKIS++LFK +V +A+AA M A +F ++L IDLL+ + W +K M+ +E++ G W G L + F GM GD I+ WE+GDW LA A+ + D + LS I+ +K+ ++ILD +G AA + L+ +Q +T N LS+++QK LA+ Q +R E+ G+ + D W L +N+ + QA +D+ T+NL + + + + NEAR AV R + + DP +K A+ IS ++ PA K EL+ + + SK + K P+ + A S+D+ V+NI+ AE + +K ++ N Q N +K +H VT+T APGV+GAT VN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>35</Hit_num> + <Hit_id>gi|310722277|ref|YP_003969101.1|</Hit_id> + <Hit_def>unnamed protein product [Aeromonas phage phiAS4] >gi|306021120|gb|ADM79655.1| baseplate hub [Aeromonas phage phiAS4]</Hit_def> + <Hit_accession>YP_003969101</Hit_accession> + <Hit_len>565</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>150.984</Hsp_bit-score> + <Hsp_score>380</Hsp_score> + <Hsp_evalue>5.93083e-36</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>569</Hsp_query-to> + <Hsp_hit-from>36</Hsp_hit-from> + <Hsp_hit-to>560</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>143</Hsp_identity> + <Hsp_positive>271</Hsp_positive> + <Hsp_gaps>69</Hsp_gaps> + <Hsp_align-len>560</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEVIEEKGN-------NLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQK--VQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKK----DQKKDDKKPTDMLGD-LLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETL-IKQNEARAAVQRLE------KYIGDVDPENPTNMQSLEKAY-------NSAKKSISDSAISDQPA--TKKELDKRFQRVESKYQKLKEDNTPKPAA---PATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNV-NNSKTIHQVQTVTATPAPGV</Hsp_qseq> + <Hsp_hseq>DLLASSELIAETVEQ-GNSELRKIVNNTSETENIAAATELSAEATEISNQHLKEISDTSKKTFSKLSEFAEKLKNNFLADVEKNPITATNTSDQTAKKIVEEEEQTPKNNPVLGYLKTISE---------DIKFLKNDKPKEEEKEDKVKPDKEENVERAIDRIGDRIVSSVDNGFKKTISVADSISSMLFKYTLTAVLNFAKMAALVLSLIMTFDVLSRHFTHWTKMFEENYAEFKNQLGSLATPFENVHGVITDLMNYFKSDEYTKMFVRLAEGAFDQMKYMVNMMMVGLAKLGATILRALGADEKADSLEASAISVAASEVGYTPSKEEEEVIGRVRKREAEDANN---------TEA-NWFEKQWRKVNG-----EDEETPDEK----EKREKRMEIAKNTTAEQFGRYDVLSGKINHVGVTAKKNETSPELLNKHRELLDDRSNEVEQSYQEGKLTKESYEQLRVEIDK-----QTKFLAEHEKTLVVPTAAIKPAPEPEVSTVKSIDKEEKRVESKKQEAASQNNYHTKANIVKNQNQTIVQAPR-TSSPGPGI</Hsp_hseq> + <Hsp_midline>D+ +++EL+AE +E+ GN N +N+A TEL+AEA+E + + +K ++ + KLS+ A L++ A +K + + S T ++E++ P G + I D F + E + D++++ ++ D +GD ++ + GFK TIS+ D ISSMLFKYT+TA+ AKMAA++ +L++ D+L HF +WT F N+ EF + G +++ G++ D+ ++++ +++ + V + +G D + + +M +G++K+ A+IL ALG + A ++ SA+ G + S+++++ + + + + E+ T A +W + K++ D+ET + K EKRE+ + I +N R + ++G +N T+ + L K N ++S + ++ + + E+DK ++K+ E P A PA + V++I K E E K+ N +T+ N V N ++TI Q T++P PG+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>36</Hit_num> + <Hit_id>gi|472438116|ref|YP_007677896.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Aeromonas phage Aes012] >gi|395653254|gb|AFN69809.1| baseplate hub subunit tail length determinator [Aeromonas phage Aes012]</Hit_def> + <Hit_accession>YP_007677896</Hit_accession> + <Hit_len>565</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>150.599</Hsp_bit-score> + <Hsp_score>379</Hsp_score> + <Hsp_evalue>8.25687e-36</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>569</Hsp_query-to> + <Hsp_hit-from>36</Hsp_hit-from> + <Hsp_hit-to>560</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>143</Hsp_identity> + <Hsp_positive>271</Hsp_positive> + <Hsp_gaps>69</Hsp_gaps> + <Hsp_align-len>560</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEVIEEKGN-------NLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQK--VQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKK----DQKKDDKKPTDMLGD-LLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETL-IKQNEARAAVQRLE------KYIGDVDPENPTNMQSLEKAY-------NSAKKSISDSAISDQPA--TKKELDKRFQRVESKYQKLKEDNTPKPAA---PATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNV-NNSKTIHQVQTVTATPAPGV</Hsp_qseq> + <Hsp_hseq>DLLASSELIAETVEQ-GNSELRKIVNNTSETENIAAATELSAEATEISNQHLKEISDTSKKTFSKLSEFAEKLKNNFLADVEKNPITATNTSDQTAKKIVEEEEQTPKDNPVLGYLKTISE---------DIKFLKNDKPKEEEKEDKVKPDEEENVERAIDRIGDRIVSSVDNGFKKTISVADSISSMLFKYTLTAVLNFAKMAALVLSLIMTFDVLSRHFTHWTKMFEENYAEFKNQLGSLATPFENVHGVITDLMNYFKSDEYTKMFVRLAKGAFDQMKYMVNMMMVGLAKLGATILRALGADEKADSLEASAISVAASEVGYTPSKEEEEVIGRVRKREAEDANN---------TEA-NWFEKQWRKVNG-----EDEETPDEK----EKREKRMEIAKNTTAEQFGRYDVLRGKINHVGVTAKKNETSPELLNKHRELLDDRSNEVEQSYQEGKLTKESYEQLRVEIDK-----QTKFLAEHEKTLVVPTAAIKPAPEPEVSTVKSIDKEEKRVESKKQEAASQTNYHTKANIVKNQNQTIVQAPR-TSSPGPGI</Hsp_hseq> + <Hsp_midline>D+ +++EL+AE +E+ GN N +N+A TEL+AEA+E + + +K ++ + KLS+ A L++ A +K + + S T ++E++ P + G + I D F + E + D++++ ++ D +GD ++ + GFK TIS+ D ISSMLFKYT+TA+ AKMAA++ +L++ D+L HF +WT F N+ EF + G +++ G++ D+ ++++ +++ + V + KG D + + +M +G++K+ A+IL ALG + A ++ SA+ G + S+++++ + + + + E+ T A +W + K++ D+ET + K EKRE+ + I +N R + ++G +N T+ + L K N ++S + ++ + + E+DK ++K+ E P A PA + V++I K E E K+ +T+ N V N ++TI Q T++P PG+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>37</Hit_num> + <Hit_id>gi|311992947|ref|YP_004009814.1|</Hit_id> + <Hit_def>gp29 baseplate hub subunit [Acinetobacter phage Acj61] >gi|295815236|gb|ADG36162.1| gp29 baseplate hub subunit [Acinetobacter phage Acj61]</Hit_def> + <Hit_accession>YP_004009814</Hit_accession> + <Hit_len>597</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>149.443</Hsp_bit-score> + <Hsp_score>376</Hsp_score> + <Hsp_evalue>2.04985e-35</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>576</Hsp_query-to> + <Hsp_hit-from>46</Hsp_hit-from> + <Hsp_hit-to>597</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>174</Hsp_identity> + <Hsp_positive>287</Hsp_positive> + <Hsp_gaps>61</Hsp_gaps> + <Hsp_align-len>573</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEV---IEEKGNNLIDSVDNVAEG-----TELAAEASERTTESI------KTLTGVASTISDKLSKLASML-ESKVQA-VEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPT------DMLGDLLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSA-------EAGEWGGL---LQSIF-GM---LGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKR-IEEGPGIIDKA-GEFKTRAFDWVLG--RENKIDSTQASDRDQETQNLKAMAPEKREETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPATKKELDKRFQRVESKYQKLKEDNTPKPAAPATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNVNNSKTIHQVQTVTATPAPGVFGATGVN</Hsp_qseq> + <Hsp_hseq>DMKAANDALDDIRDQVSDKADDPIDTLDASKQSLASIDNKMSQQISDNLASSIVQRRYEGTMIGETQNISAKLSLLLGKLTEMHVDAQVEAAQKDNIKSEPTTSEVIGDLIKKEQPEQKPEIAEKILPTEEK----------PSTKLLDENAGKSGKELVGKANPIVMGLDKVGGLLKT---GFKSSIGVMDKISGMLFKFTATQAINAAKVAAAIFAIILAIDLIKIYWSVWGEKIMAKLSEWAEIFKGWWDTFTDWGSQFSDFKTAFEGMGANLMEIKNAWVSGDFPALAKALGNAIIDMGKTISGIIGRTLASLFGPLLRKLGFGETADNLEAAGLRHYQNMTDNRLSPENQRKLAENQVKQEAKDGKTATERGMTDFLPNTWRNKLGFISDNELSQINAEKKDQSARS--NLSQEQKVDSVAATNEAREAIARYKKFADAANPDNAGDMAKVDKYKKEAAQYLSNKALDLTPSIKSELQTQYNAIKVKSKKDDV----KPETSAASKDTQTVNSIKTAEAAK--ANQQTQQTNVANVQNNVVKNSKTVHVQAPTTSTRAPGVHKATGVN</Hsp_hseq> + <Hsp_midline>D+++AN+ + ++ + +K ++ ID++D + +++ + S+ SI T+ G IS KLS L L E V A VE +++ S T VI D + + P + E+ILP + P EN K K+ K D +G LLKT GFK++I + DKIS MLFK+T T AAK+AA +FA++L IDL++I++ W +K M+ E++ +WG ++ F GM L +IK W +GD+ LA A+ + D+ +S I+ ++ + +L LGF A + + L +Q T N LS ++Q+ LA+ Q K+ ++G ++ +F + LG +N++ A +DQ ++ ++ E++ +++ NEAR A+ R +K+ +P+N +M ++K A + +S+ A+ P+ K EL ++ ++ K +K KP A S+D Q V +I+ AE AK + + T NVAN Q N V NSKT+H T+T APGV ATGVN</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>38</Hit_num> + <Hit_id>gi|401824980|gb|AFQ22670.1|</Hit_id> + <Hit_def>baseplate hub [Stenotrophomonas phage IME13]</Hit_def> + <Hit_accession>AFQ22670</Hit_accession> + <Hit_len>565</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>147.902</Hsp_bit-score> + <Hsp_score>372</Hsp_score> + <Hsp_evalue>5.89358e-35</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>569</Hsp_query-to> + <Hsp_hit-from>36</Hsp_hit-from> + <Hsp_hit-to>560</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>142</Hsp_identity> + <Hsp_positive>270</Hsp_positive> + <Hsp_gaps>69</Hsp_gaps> + <Hsp_align-len>560</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEVIEEKGN-------NLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQK--VQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKK----DQKKDDKKPTDMLGD-LLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETL-IKQNEARAAVQRLE------KYIGDVDPENPTNMQSLEKAY-------NSAKKSISDSAISDQPA--TKKELDKRFQRVESKYQKLKEDNTPKPAA---PATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNV-NNSKTIHQVQTVTATPAPGV</Hsp_qseq> + <Hsp_hseq>DLLAASELISETVEQ-GNSELRKIVNNTSETENIAAATEISAEATEISNQHLKEISDTSKKTFSKLSEFAEKLKNNFLADVEKNPITTTNTSDQTAKKIVEEEEQTPKNNPVLGYLKTISE---------DIKFLKNDKPKEEEKEDKVKPDEEENVERAIDRIGDRIVSSVDNGFKKTISIADSISSMLFKYTLTAVLNFAKMAALVLSLIMTFDVLSRHFTHWTKMFEENYAEFKNQLGSLATPFENVHGVITDLMNYFKSDEYTKMFVRLAEGAFDQMKYMVNMMMVGLAKLGATILRALGADEKADSLEASAISVAASEVGYTPSKEEEEVIGRVRKREAEDANN---------TEA-NWFEKQWRKVNG-----EDEETPDEK----EKREKRMEIAKNTTAEQFGRYDVLSGKINHVGVTAKKNETSPELLNKHRELLDDRSNEVEQSYQEGKLTKESYEQLRVEIDK-----QTKFLAEHEKTLVVPTAAIKPAPEPEVSTVKSIDKEEKRVESKKQEAASQTNYHTKANIVKNQNQTIVQAPR-TSSPGPGI</Hsp_hseq> + <Hsp_midline>D+ +A+EL++E +E+ GN N +N+A TE++AEA+E + + +K ++ + KLS+ A L++ A +K + + S T ++E++ P G + I D F + E + D++++ ++ D +GD ++ + GFK TISI D ISSMLFKYT+TA+ AKMAA++ +L++ D+L HF +WT F N+ EF + G +++ G++ D+ ++++ +++ + V + +G D + + +M +G++K+ A+IL ALG + A ++ SA+ G + S+++++ + + + + E+ T A +W + K++ D+ET + K EKRE+ + I +N R + ++G +N T+ + L K N ++S + ++ + + E+DK ++K+ E P A PA + V++I K E E K+ +T+ N V N ++TI Q T++P PG+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>39</Hit_num> + <Hit_id>gi|109290160|ref|YP_656409.1|</Hit_id> + <Hit_def>gp29 base plate hub [Aeromonas phage 25] >gi|104345833|gb|ABF72733.1| gp29 base plate hub [Aeromonas phage 25]</Hit_def> + <Hit_accession>YP_656409</Hit_accession> + <Hit_len>565</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>145.976</Hsp_bit-score> + <Hsp_score>367</Hsp_score> + <Hsp_evalue>2.35249e-34</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>569</Hsp_query-to> + <Hsp_hit-from>36</Hsp_hit-from> + <Hsp_hit-to>560</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>142</Hsp_identity> + <Hsp_positive>269</Hsp_positive> + <Hsp_gaps>69</Hsp_gaps> + <Hsp_align-len>560</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEVIEEKGN-------NLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQK--VQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKK----DQKKDDKKPTDMLGD-LLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETL-IKQNEARAAVQRLE------KYIGDVDPENPTNMQSLEKAY-------NSAKKSISDSAISDQPA--TKKELDKRFQRVESKYQKLKEDNTPKPAA---PATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNV-NNSKTIHQVQTVTATPAPGV</Hsp_qseq> + <Hsp_hseq>DLLAASELISETVEQ-GNSELRKIVNNTSETENIAAATELSAEATEISNQHLKEISDTSKKTFSKLSEFAEKLKNNFLADVEKNPITATNTSDQTAKKIVEEEEQTPKNNPVLGYLKTISE---------DIKFLKNDKPKEEEKEDKVKPDKEENVESAIDRIGDRIVSSVDNGFKKTINIADSISSMLFKYTLTAVLNFAKMAALVLSLIMTFDVLSRHFTHWTKMFEENYAEFKNQLGSLATPFENVHGVITDLMNYFKSDEYTKMFVRLAEGAFDQMKYMVNMMMVGLAKLGATILRALGADEKADSLEASAISVAASEVGYTPSKEEEEVIGRVRKREAEDANN---------TEA-NWFEKQWRKVNG-----EDEETPDEK----EKREKRMEIAKNTTAEQFGRYDVLSGKINHVGVTAKKNETSPELLNKHRELLDDRSNEVEQSYQEGKLTKESYEQLRVEIDK-----QTKFLAEHEKTLVVPTAAIKPAPEPEVSTVKSIDKEEKRVESKKQEAASQTNYHTKANIVKNQNQTIVQAPR-TSSPGPGI</Hsp_hseq> + <Hsp_midline>D+ +A+EL++E +E+ GN N +N+A TEL+AEA+E + + +K ++ + KLS+ A L++ A +K + + S T ++E++ P G + I D F + E + D++++ + D +GD ++ + GFK TI+I D ISSMLFKYT+TA+ AKMAA++ +L++ D+L HF +WT F N+ EF + G +++ G++ D+ ++++ +++ + V + +G D + + +M +G++K+ A+IL ALG + A ++ SA+ G + S+++++ + + + + E+ T A +W + K++ D+ET + K EKRE+ + I +N R + ++G +N T+ + L K N ++S + ++ + + E+DK ++K+ E P A PA + V++I K E E K+ +T+ N V N ++TI Q T++P PG+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>40</Hit_num> + <Hit_id>gi|423262258|ref|YP_007010857.1|</Hit_id> + <Hit_def>baseplate hub subunit tail length determinator [Aeromonas phage Aes508] >gi|402762136|gb|AFQ97250.1| baseplate hub subunit tail length determinator [Aeromonas phage Aes508]</Hit_def> + <Hit_accession>YP_007010857</Hit_accession> + <Hit_len>565</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>145.591</Hsp_bit-score> + <Hsp_score>366</Hsp_score> + <Hsp_evalue>3.57946e-34</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>569</Hsp_query-to> + <Hsp_hit-from>36</Hsp_hit-from> + <Hsp_hit-to>560</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>142</Hsp_identity> + <Hsp_positive>269</Hsp_positive> + <Hsp_gaps>69</Hsp_gaps> + <Hsp_align-len>560</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEVIEEKGN-------NLIDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQK--VQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKK----DQKKDDKKPTDMLGD-LLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKREETL-IKQNEARAAVQRLE------KYIGDVDPENPTNMQSLEKAY-------NSAKKSISDSAISDQPA--TKKELDKRFQRVESKYQKLKEDNTPKPAA---PATSEDNQRVQNIQKAENAKEQSKKSTGDMNVANTQVNNV-NNSKTIHQVQTVTATPAPGV</Hsp_qseq> + <Hsp_hseq>DLLASSELIAETVEQ-GNSELRKIVNNTSETENIAAATELSAEATEISNQHLKEISDTSKKTFSKLSEFAEKLKNNFLADVEKNPITTTNTSDQTAKKISEEEEQTPKNNPVLGYLKTISE---------DIKFLKNDKPKEEEKEDKVKPDKEENVERAIDRIGDRIVSSVDNGFKKTISVADSISSMLFKYTLTAVLNFAKMAALVLSLIMTFDVLSRHFTHWTKMFEENYAEFKNQLGSLATPFENVHGVITDLMNYFKSDEYTKMFVRLAEGAFDQMKYMVNMMMVGLAKLGATILRALGADEKADSLEASAISVAASEVGYTPSKEEEEVIGRVRKREAEDANN---------TEA-NWFEKQWRKVNG-----EDEETPDEK----EKREKRMEIAKNTTAEQFGRYDVLSGKINHVGVTAKKNETSPELLNKHRELLDDRSNEVEQSYQEGKLTKESYEQLRVEIDK-----QTKFLAEHEKTLVVPTAAIKPAPEPEVSTVKSIDKEEKRVESKKQEAASQTNYHTKANIVKNQNQTIVQAPR-TSSPGPGI</Hsp_hseq> + <Hsp_midline>D+ +++EL+AE +E+ GN N +N+A TEL+AEA+E + + +K ++ + KLS+ A L++ A +K + + S T + E++ P G + I D F + E + D++++ ++ D +GD ++ + GFK TIS+ D ISSMLFKYT+TA+ AKMAA++ +L++ D+L HF +WT F N+ EF + G +++ G++ D+ ++++ +++ + V + +G D + + +M +G++K+ A+IL ALG + A ++ SA+ G + S+++++ + + + + E+ T A +W + K++ D+ET + K EKRE+ + I +N R + ++G +N T+ + L K N ++S + ++ + + E+DK ++K+ E P A PA + V++I K E E K+ +T+ N V N ++TI Q T++P PG+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>41</Hit_num> + <Hit_id>gi|66391985|ref|YP_238910.1|</Hit_id> + <Hit_def>baseplate hub subunit [Aeromonas phage 31] >gi|62114822|gb|AAX63670.1| gp29 [Aeromonas phage 31]</Hit_def> + <Hit_accession>YP_238910</Hit_accession> + <Hit_len>566</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>144.05</Hsp_bit-score> + <Hsp_score>362</Hsp_score> + <Hsp_evalue>1.01075e-33</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>569</Hsp_query-to> + <Hsp_hit-from>36</Hsp_hit-from> + <Hsp_hit-to>562</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>150</Hsp_identity> + <Hsp_positive>269</Hsp_positive> + <Hsp_gaps>53</Hsp_gaps> + <Hsp_align-len>553</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEVIEEKGNNL------IDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQA--VEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGD-LLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKR---EETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPA---------TKKELDKRFQRVESKYQKLKEDNTPKPAA---PATSEDNQRVQNIQKAENAKE--QSKKSTGDMNVANTQVNNV-NNSKTIHQVQTVTATPAPGV</Hsp_qseq> + <Hsp_hseq>DSLAAQELIAETVEQGNNELRQIKANTASLHDTAAATELSAESTEMSNTILREISETGKQTFSKLSEFAERLKGSFSADDVEQAPIRTASSSDQAIQIINEENPEPENPLVG-----YLRTISEDIKFLRENKNEPSDPKDPDVVPDDKDDLKTMIDRIGDQIVKSVDSGFKRTVNIADSISSTLFKYTITAALNFAKMAALVLSLIIAFDVLSRHFSHWTQMFQEQYAEFKETLGSFGTPFENLTGIVTDLVNYFKSDEYLKMFVRLAEGAADQMIYIVNMMMVGLAKLGAAILRALGADDKADTLEASAISVATKTVGYTPSEEEEATIGRVRKRQAQE---------EAEQSEASWWEKKKREWDG-----KPIETDEEKAVRERKKSIAENTTAEQFGKHDALSQKIQHVGVTAEKNETSNELLGKHRELLEKRASDVEQAKQSGEITTESYKQLKVEIEKQREFLDAHEQKL-----LKPKASIKPAPEPEIGVVGSIAKEEKRVEASQTAKQEAASNY-NTNANIVKNNNQTLVQAPR-TSSPGPGI</Hsp_hseq> + <Hsp_midline>D +A EL+AE +E+ N L S+ + A TEL+AE++E + ++ ++ KLS+ A L+ A VEQ + +S+ + +I ++ P+P+ P L + ++ E+ P + + D K D K D +GD ++K+ GFK T++I D ISS LFKYT+TA AKMAA++ +L++ D+L HF +WT F + EF G +G +++ G++ D+ ++++ ++ + V + +G AD + + +M +G++K+ A+IL ALG ++ A T+ SA+ + G + SE+++ + + + ++ +E E + W ++ + D + ET KA+ K+ E T +Q A+ + +++G +N T+ + L K +K SD + Q K E++K+ + +++ QKL KP A PA + V +I K E E Q+ K N NT N V NN++T+ Q T++P PG+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>42</Hit_num> + <Hit_id>gi|37651664|ref|NP_932538.1|</Hit_id> + <Hit_def>baseplate hub subunit [Aeromonas phage 44RR2.8t] >gi|34732964|gb|AAQ81501.1| baseplate hub subunit [Aeromonas phage 44RR2.8t]</Hit_def> + <Hit_accession>NP_932538</Hit_accession> + <Hit_len>566</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>144.05</Hsp_bit-score> + <Hsp_score>362</Hsp_score> + <Hsp_evalue>1.1527e-33</Hsp_evalue> + <Hsp_query-from>44</Hsp_query-from> + <Hsp_query-to>569</Hsp_query-to> + <Hsp_hit-from>36</Hsp_hit-from> + <Hsp_hit-to>562</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>150</Hsp_identity> + <Hsp_positive>268</Hsp_positive> + <Hsp_gaps>53</Hsp_gaps> + <Hsp_align-len>553</Hsp_align-len> + <Hsp_qseq>DVQSANELVAEVIEEKGNNL------IDSVDNVAEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQA--VEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKKPTDMLGD-LLKTTKGGFKATISITDKISSMLFKYTVTALAEAAKMAAMLFALVLGIDLLRIHFKYWTDKFMSNFDEFSAEAGEWGGLLQSIFGMLGDIKKFWEAGDWSGLAVAIVKGLADVIYNLSEIMSLGISKISASILDALGFENAATTIRGSALEGFQERTGNSLSEDDQKALAKYQSKRIEEGPGIIDKAGEFKTRAFDWVLGRENKIDSTQASDRDQETQNLKAMAPEKR---EETLIKQNEARAAVQRLEKYIGDVDPENPTNMQSLEKAYNSAKKSISDSAISDQPA---------TKKELDKRFQRVESKYQKLKEDNTPKPAA---PATSEDNQRVQNIQKAENAKE--QSKKSTGDMNVANTQVNNV-NNSKTIHQVQTVTATPAPGV</Hsp_qseq> + <Hsp_hseq>DSLAAQELIAETVEQGNNELRQIKANTASLHDTAAATELGAESTEMSNTILREISETGKQTFSKLSEFAERLKGSFSADDVEQTPIRAASSSDQAIQIINEENPEPENPLVG-----YLRTISEDIKFLRENKNEPSDPKDPDVVPDDKDDLKTMIDRIGDQIVKSVDSGFKRTVNIADSISSTLFKYTITAALNFAKMAALVLSLIIAFDVLSRHFSHWTQMFQEQYAEFKETLGSFGTPFENLTGIVTDLVNYFKSDEYLKMFVRLAEGAADQMIYIVNMMMVGLAKLGAAILRALGADDKADTLEASAISVATKTVGYTPSEEEEATIGRVRKRQAQE---------EAEQSEASWWEKKKREWDG-----KPIETDEEKAVRERKKSIAENTTAEQFGKHDALSQKIQHVGVTAEKNETSNELLGKHRELLEKRASDVEQAKQSGEITTESYKQLKVEIEKQREFLDAHEQKL-----LKPKASIKPAPEPEIGVVGSIAKEEKRVEASQTAKQEAASNY-NTNANIVKNNNQTLVQAPR-TSSPGPGI</Hsp_hseq> + <Hsp_midline>D +A EL+AE +E+ N L S+ + A TEL AE++E + ++ ++ KLS+ A L+ A VEQ + +S+ + +I ++ P+P+ P L + ++ E+ P + + D K D K D +GD ++K+ GFK T++I D ISS LFKYT+TA AKMAA++ +L++ D+L HF +WT F + EF G +G +++ G++ D+ ++++ ++ + V + +G AD + + +M +G++K+ A+IL ALG ++ A T+ SA+ + G + SE+++ + + + ++ +E E + W ++ + D + ET KA+ K+ E T +Q A+ + +++G +N T+ + L K +K SD + Q K E++K+ + +++ QKL KP A PA + V +I K E E Q+ K N NT N V NN++T+ Q T++P PG+</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +<Hit> + <Hit_num>43</Hit_num> + <Hit_id>gi|398313739|emb|CCI89086.1|</Hit_id> + <Hit_def>phage baseplate hub [Yersinia phage phiD1]</Hit_def> + <Hit_accession>CCI89086</Hit_accession> + <Hit_len>191</Hit_len> + <Hit_hsps> + <Hsp> + <Hsp_num>1</Hsp_num> + <Hsp_bit-score>79.7221</Hsp_bit-score> + <Hsp_score>195</Hsp_score> + <Hsp_evalue>1.49556e-13</Hsp_evalue> + <Hsp_query-from>2</Hsp_query-from> + <Hsp_query-to>189</Hsp_query-to> + <Hsp_hit-from>3</Hsp_hit-from> + <Hsp_hit-to>187</Hsp_hit-to> + <Hsp_query-frame>0</Hsp_query-frame> + <Hsp_hit-frame>0</Hsp_hit-frame> + <Hsp_identity>69</Hsp_identity> + <Hsp_positive>102</Hsp_positive> + <Hsp_gaps>17</Hsp_gaps> + <Hsp_align-len>195</Hsp_align-len> + <Hsp_qseq>KSENMSTMRRRKVIADSKGERDAASTASDQVDSLELIGLKLDDVQSANELVAEVIEEKGNNLIDSVDNV-------AEGTELAAEASERTTESIKTLTGVASTISDKLSKLASMLESKVQAVEQKVQESGASASTGLSVIEDKLPDPDEPESPGLPERILPPLDDNNNLPDEDFFPPVPQEPENNKKDQKKDDKK</Hsp_qseq> + <Hsp_hseq>KPQEMQTMRR-KVISDNKPTQEAAKSASNTLSGLNDISTKLDDTQAASELIAQTVEEKSNEIVGAIGNVESAVSDTTAGSELIAETVEIGNNINKE---IGESLGSKLDKLTSLLEQKIQTA--GIQQTGTXLATVESAIPVKVVEDDTDRXXVLXYRXLKQLIMILTLI---FSLPLSQLSQ-SKNHQKKNRKK</Hsp_hseq> + <Hsp_midline>K + M TMRR KVI+D+K ++AA +AS+ + L I KLDD Q+A+EL+A+ +EEK N ++ ++ NV G+EL AE E K + ++ KL KL S+LE K+Q +Q++G +T S I K+ + D L R L L L F P+ Q + +K QKK+ KK</Hsp_midline> + </Hsp> + </Hit_hsps> +</Hit> +</Iteration_hits> + <Iteration_stat> + <Statistics> + <Statistics_db-num>48094830</Statistics_db-num> + <Statistics_db-len>17186091396</Statistics_db-len> + <Statistics_hsp-len>153</Statistics_hsp-len> + <Statistics_eff-space>4157067357738</Statistics_eff-space> + <Statistics_kappa>0.041</Statistics_kappa> + <Statistics_lambda>0.267</Statistics_lambda> + <Statistics_entropy>0.14</Statistics_entropy> + </Statistics> + </Iteration_stat> +</Iteration> +</BlastOutput_iterations> +</BlastOutput> +
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/blastxml/test.xml Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,138 @@ +<?xml version="1.0"?> +<root> + <metadata> + <genomes> + <genome>test-data/merlin.fa</genome> + </genomes> + <general> + <defaultLocation></defaultLocation> + <trackPadding>40</trackPadding> + <shareLink>true</shareLink> + <aboutDescription></aboutDescription> + <show_tracklist>true</show_tracklist> + <show_nav>true</show_nav> + <show_overview>false</show_overview> + <show_menu>true</show_menu> + <hideGenomeOptions>false</hideGenomeOptions> + </general> + </metadata> + <tracks> + <track cat="Blah" format="blast"> + <files> + <trackFile path="test-data/blastxml/blast.xml" ext="blastxml" label="BlastP"/> + </files> + + <options> + <style> + <className>feature</className> + <description></description> + <label>Description</label> + <height>400px</height> + </style> + <blast> + <parent>test-data/blastxml/merlin.gff</parent> + <protein>true</protein> + <min_gap>10</min_gap> + </blast> + <scaling> + <method>score</method> + <algo>blast</algo> + <scales> + <type>__auto__</type> + </scales> + <scheme> + <type>opacity</type> + <color>__auto__</color> + </scheme> + </scaling> + </options> + </track> + <track cat="Blah" format="blast"> + <files> + <trackFile path="test-data/blastxml/blast.xml" ext="blastxml" label="BlastP, min_gap=3"/> + </files> + + <options> + <style> + <className>feature</className> + <description></description> + <label>Description</label> + <height>400px</height> + </style> + <blast> + <parent>test-data/blastxml/merlin.gff</parent> + <protein>true</protein> + <min_gap>3</min_gap> + </blast> + <scaling> + <method>score</method> + <algo>blast</algo> + <scales> + <type>__auto__</type> + </scales> + <scheme> + <type>opacity</type> + <color>__auto__</color> + </scheme> + </scaling> + </options> + </track> + + <track cat="Blah" format="blast"> + <files> + <trackFile path="test-data/blastxml/blast-gene1.xml" ext="blastxml" label="Non protein match/matchpart"/> + </files> + + <options> + <style> + <className>feature</className> + <description></description> + <label>Description</label> + <height>400px</height> + </style> + <blast> + <parent>test-data/blastxml/merlin.gff</parent> + <min_gap>3</min_gap> + </blast> + <scaling> + <method>ignore</method> + <algo>linear</algo> + <scheme> + <type>opacity</type> + <color>__auto__</color> + </scheme> + </scaling> + </options> + </track> + + <track cat="Blah" format="blast"> + <files> + <trackFile path="test-data/blastxml/blastn-gene1.xml" ext="blastxml" label="Non protein match/matchpart, no parent"/> + </files> + + <options> + <style> + <className>feature</className> + <description></description> + <label>Description</label> + <height>400px</height> + </style> + <blast> + <min_gap>40</min_gap> + </blast> + <scaling> + <method>score</method> + <algo>blast</algo> + <scales> + <type>__auto__</type> + </scales> + <scheme> + <type>opacity</type> + <color>__auto__</color> + </scheme> + </scaling> + </options> + </track> + + </tracks> +</root>
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/bw/test.xml Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,149 @@ +<?xml version="1.0"?> +<root> + <metadata> + <genomes> + <genome>test-data/merlin.fa</genome> + </genomes> + <general> + <defaultLocation></defaultLocation> + <trackPadding>40</trackPadding> + <shareLink>true</shareLink> + <aboutDescription></aboutDescription> + <show_tracklist>true</show_tracklist> + <show_nav>true</show_nav> + <show_overview>false</show_overview> + <show_menu>true</show_menu> + <hideGenomeOptions>false</hideGenomeOptions> + </general> + </metadata> + <tracks> + <track cat="Scaling" format="wiggle"> + <files> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Auto Global"/> + </files> + + <options> + <wiggle> + <type>JBrowse/View/Track/Wiggle/XYPlot</type> + <variance_band>true</variance_band> + <autoscale>global</autoscale> + <color_pos>__auto__</color_pos> + <color_neg>__auto__</color_neg> + <bicolor_pivot>zero</bicolor_pivot> + <scale>linear</scale> + </wiggle> + </options> + </track> + <track cat="Scaling" format="wiggle"> + <files> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Auto Local"/> + </files> + + <options> + <wiggle> + <type>JBrowse/View/Track/Wiggle/XYPlot</type> + <variance_band>true</variance_band> + <autoscale>local</autoscale> + <color_pos>__auto__</color_pos> + <color_neg>__auto__</color_neg> + <bicolor_pivot>zero</bicolor_pivot> + <scale>linear</scale> + </wiggle> + </options> + </track> + <track cat="Scaling" format="wiggle"> + <files> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Manual"/> + </files> + + <options> + <wiggle> + <type>JBrowse/View/Track/Wiggle/XYPlot</type> + <variance_band>false</variance_band> + <autoscale>fixed</autoscale> + <min>20</min> + <max>60</max> + <color_pos>__auto__</color_pos> + <color_neg>__auto__</color_neg> + <bicolor_pivot>zero</bicolor_pivot> + <scale>linear</scale> + </wiggle> + </options> + </track> + + + <track cat="Colouring" format="wiggle"> + <files> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Auto A"/> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Auto B"/> + </files> + + <options> + <wiggle> + <type>JBrowse/View/Track/Wiggle/XYPlot</type> + <variance_band>false</variance_band> + <autoscale>local</autoscale> + + <color_pos>__auto__</color_pos> + <color_neg>__auto__</color_neg> + <bicolor_pivot>mean</bicolor_pivot> + <scale>linear</scale> + </wiggle> + </options> + </track> + <track cat="Colouring" format="wiggle"> + <files> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Manual"/> + </files> + + <options> + <wiggle> + <type>JBrowse/View/Track/Wiggle/XYPlot</type> + <variance_band>false</variance_band> + <autoscale>local</autoscale> + + <color_pos>#0000ff</color_pos> + <color_neg>#ff0000</color_neg> + <bicolor_pivot>mean</bicolor_pivot> + <scale>linear</scale> + </wiggle> + </options> + </track> + <track cat="Colouring" format="wiggle"> + <files> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Manual - Inverted"/> + </files> + + <options> + <wiggle> + <type>JBrowse/View/Track/Wiggle/XYPlot</type> + <variance_band>false</variance_band> + <autoscale>local</autoscale> + + <color_pos>#ff0000</color_pos> + <color_neg>#0000ff</color_neg> + <bicolor_pivot>mean</bicolor_pivot> + <scale>log</scale> + </wiggle> + </options> + </track> + <track cat="Colouring" format="wiggle"> + <files> + <trackFile path="test-data/bw/data.bw" ext="bigwig" label="Nonstandard Pivot"/> + </files> + + <options> + <wiggle> + <type>JBrowse/View/Track/Wiggle/XYPlot</type> + <variance_band>false</variance_band> + <autoscale>local</autoscale> + + <color_pos>#0000ff</color_pos> + <color_neg>#ff0000</color_neg> + <bicolor_pivot>100</bicolor_pivot> + <scale>linear</scale> + </wiggle> + </options> + </track> + </tracks> +</root>
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/fakemerlins.fa Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,12 @@ +>Merlin1 +ATGTAAGCTCAGGAGCTCCACACGCAACAGGAACACAACCTGTGAACATTATCACAGTAT +>Merlin2 +ATGAATTTATCAGTCCAATACTTAAAATGAATACGAAGTAAATCTATGCCTAATACTAAT +>Merlin3 +TAATCAACGTGTGATGCTTCAAGCCAAGCTTAGGAATAGAAATGGTTTTGCCATTGACTT +>Merlin4 +AAAAGCTTATTGCTTAAGCCTACAGTTAAACTCGCTATTCCAGTTAAATGCGATAAATGT +>Merlin5 +TTTTCTTTGCTAATTTAACACCAAGAGCTGCAATCCATTGGTTTCTTCGTTTATATCCTG +>Merlin6 +ATACTGCATCCTTTTGATACCAATGCGGTTCAATTTGAGTGTTACCAGAGTATATCTTGA
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/1.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,46 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_1;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_1_mRNA;Parent=Merlin_1 +Merlin GeneMark.hmm CDS 14 20 1000 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA +Merlin GeneMark.hmm CDS 24 30 500 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA + +Merlin GeneMark.hmm gene 14 30 . + . ID=Merlin_2;seqid=Merlin;color=#00ff00 +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_2_mRNA;seqid=Merlin;color=#00ff00;Parent=Merlin_2 +Merlin GeneMark.hmm CDS 14 20 500 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 750 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#00ff00 + +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3A_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 18 1000 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 20 30 800 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3B_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 22 400 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 1000 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff + + + +Merlin exonerate gene 1740 2300 . + . Name=Apple3;Note=Gene with two splicing models;ID=1 +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-a;Note=mRNA A with both CDSs and UTRs;ID=1A;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1A +Merlin exonerate CDS 1900 2080 . + 0 Parent=1A +Merlin exonerate CDS 2100 2120 . + 2 Parent=1A +Merlin exonerate UTR 2120 2300 . + . Parent=1A +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-b;Note=mRNA B with both CDSs and UTRs;ID=1B;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1B +Merlin exonerate CDS 1800 1880 . + 0 Parent=1B +Merlin exonerate CDS 1900 1950 . + 1 Parent=1B +Merlin exonerate CDS 2100 2120 . + 2 Parent=1B +Merlin exonerate UTR 2120 2300 . + . Parent=1B + + + +# { +# "baseUrl": "http://localhost:8000/out/data/" +# "compress": 0, +# "label": "Transcript", +# "storeClass": "JBrowse/Store/SeqFeature/NCList", +# "trackType": "JBrowse/View/Track/CanvasFeatures", +# "type": "JBrowse/View/Track/CanvasFeatures", +# "urlTemplate": "tracks/42ff9cb16c0509f0abb4a76ce14077bc_0/{refseq}/trackData.json", +# }
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/2.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,6 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 + +Merlin . cDNA_match 1200 9000 . . . ID=cDNA00001 +Merlin . match_part 1200 3200 2.2e-30 + . ID=match00002;Parent=cDNA00001;Target=Merlin 5 506;Gap=M301 D1499 M201 +Merlin . match_part 7000 9000 7.4e-32 - . ID=match00003;Parent=cDNA00001;Target=Merlin 1 502;Gap=M101 D1499 M401
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/A.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,46 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_1;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_1_mRNA;Parent=Merlin_1 +Merlin GeneMark.hmm CDS 14 20 1000 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA +Merlin GeneMark.hmm CDS 24 30 500 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA + +Merlin GeneMark.hmm gene 14 30 . + . ID=Merlin_2;seqid=Merlin;color=#00ff00 +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_2_mRNA;seqid=Merlin;color=#00ff00;Parent=Merlin_2 +Merlin GeneMark.hmm CDS 14 20 500 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 750 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#00ff00 + +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3A_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 18 1000 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 20 30 800 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3B_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 22 400 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 1000 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff + + + +Merlin exonerate gene 1740 2300 . + . Name=Apple3;Note=Gene with two splicing models;ID=1 +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-a;Note=mRNA A with both CDSs and UTRs;ID=1A;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1A +Merlin exonerate CDS 1900 2080 . + 0 Parent=1A +Merlin exonerate CDS 2100 2120 . + 2 Parent=1A +Merlin exonerate UTR 2120 2300 . + . Parent=1A +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-b;Note=mRNA B with both CDSs and UTRs;ID=1B;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1B +Merlin exonerate CDS 1800 1880 . + 0 Parent=1B +Merlin exonerate CDS 1900 1950 . + 1 Parent=1B +Merlin exonerate CDS 2100 2120 . + 2 Parent=1B +Merlin exonerate UTR 2120 2300 . + . Parent=1B + + + +# { +# "baseUrl": "http://localhost:8000/out/data/" +# "compress": 0, +# "label": "Transcript", +# "storeClass": "JBrowse/Store/SeqFeature/NCList", +# "trackType": "JBrowse/View/Track/CanvasFeatures", +# "type": "JBrowse/View/Track/CanvasFeatures", +# "urlTemplate": "tracks/42ff9cb16c0509f0abb4a76ce14077bc_0/{refseq}/trackData.json", +# }
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/B.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,46 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_1;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_1_mRNA;Parent=Merlin_1 +Merlin GeneMark.hmm CDS 14 20 1000 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA +Merlin GeneMark.hmm CDS 24 30 500 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA + +Merlin GeneMark.hmm gene 14 30 . + . ID=Merlin_2;seqid=Merlin;color=#00ff00 +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_2_mRNA;seqid=Merlin;color=#00ff00;Parent=Merlin_2 +Merlin GeneMark.hmm CDS 14 20 500 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 750 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#00ff00 + +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3A_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 18 1000 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 20 30 800 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3B_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 22 400 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 1000 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff + + + +Merlin exonerate gene 1740 2300 . + . Name=Apple3;Note=Gene with two splicing models;ID=1 +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-a;Note=mRNA A with both CDSs and UTRs;ID=1A;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1A +Merlin exonerate CDS 1900 2080 . + 0 Parent=1A +Merlin exonerate CDS 2100 2120 . + 2 Parent=1A +Merlin exonerate UTR 2120 2300 . + . Parent=1A +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-b;Note=mRNA B with both CDSs and UTRs;ID=1B;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1B +Merlin exonerate CDS 1800 1880 . + 0 Parent=1B +Merlin exonerate CDS 1900 1950 . + 1 Parent=1B +Merlin exonerate CDS 2100 2120 . + 2 Parent=1B +Merlin exonerate UTR 2120 2300 . + . Parent=1B + + + +# { +# "baseUrl": "http://localhost:8000/out/data/" +# "compress": 0, +# "label": "Transcript", +# "storeClass": "JBrowse/Store/SeqFeature/NCList", +# "trackType": "JBrowse/View/Track/CanvasFeatures", +# "type": "JBrowse/View/Track/CanvasFeatures", +# "urlTemplate": "tracks/42ff9cb16c0509f0abb4a76ce14077bc_0/{refseq}/trackData.json", +# }
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/C.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,46 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_1;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_1_mRNA;Parent=Merlin_1 +Merlin GeneMark.hmm CDS 14 20 1000 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA +Merlin GeneMark.hmm CDS 24 30 500 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA + +Merlin GeneMark.hmm gene 14 30 . + . ID=Merlin_2;seqid=Merlin;color=#00ff00 +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_2_mRNA;seqid=Merlin;color=#00ff00;Parent=Merlin_2 +Merlin GeneMark.hmm CDS 14 20 500 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 750 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#00ff00 + +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3A_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 18 1000 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 20 30 800 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3B_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 22 400 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 1000 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff + + + +Merlin exonerate gene 1740 2300 . + . Name=Apple3;Note=Gene with two splicing models;ID=1 +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-a;Note=mRNA A with both CDSs and UTRs;ID=1A;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1A +Merlin exonerate CDS 1900 2080 . + 0 Parent=1A +Merlin exonerate CDS 2100 2120 . + 2 Parent=1A +Merlin exonerate UTR 2120 2300 . + . Parent=1A +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-b;Note=mRNA B with both CDSs and UTRs;ID=1B;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1B +Merlin exonerate CDS 1800 1880 . + 0 Parent=1B +Merlin exonerate CDS 1900 1950 . + 1 Parent=1B +Merlin exonerate CDS 2100 2120 . + 2 Parent=1B +Merlin exonerate UTR 2120 2300 . + . Parent=1B + + + +# { +# "baseUrl": "http://localhost:8000/out/data/" +# "compress": 0, +# "label": "Transcript", +# "storeClass": "JBrowse/Store/SeqFeature/NCList", +# "trackType": "JBrowse/View/Track/CanvasFeatures", +# "type": "JBrowse/View/Track/CanvasFeatures", +# "urlTemplate": "tracks/42ff9cb16c0509f0abb4a76ce14077bc_0/{refseq}/trackData.json", +# }
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/D.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,46 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_1;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_1_mRNA;Parent=Merlin_1 +Merlin GeneMark.hmm CDS 14 20 1000 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA +Merlin GeneMark.hmm CDS 24 30 500 + 0 ID=Merlin_1_CDS ;Parent=Merlin_1_mRNA + +Merlin GeneMark.hmm gene 14 30 . + . ID=Merlin_2;seqid=Merlin;color=#00ff00 +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_2_mRNA;seqid=Merlin;color=#00ff00;Parent=Merlin_2 +Merlin GeneMark.hmm CDS 14 20 500 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 750 + 0 ID=Merlin_2_CDS ;Parent=Merlin_2_mRNA;color=#00ff00 + +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3A_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 18 1000 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 20 30 800 + 0 ID=Merlin_3A_CDS ;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3B_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS 14 22 400 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS 24 30 1000 + 0 ID=Merlin_3B_CDS ;Parent=Merlin_3B_mRNA;color=#0000ff + + + +Merlin exonerate gene 1740 2300 . + . Name=Apple3;Note=Gene with two splicing models;ID=1 +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-a;Note=mRNA A with both CDSs and UTRs;ID=1A;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1A +Merlin exonerate CDS 1900 2080 . + 0 Parent=1A +Merlin exonerate CDS 2100 2120 . + 2 Parent=1A +Merlin exonerate UTR 2120 2300 . + . Parent=1A +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-b;Note=mRNA B with both CDSs and UTRs;ID=1B;Parent=1; +Merlin exonerate UTR 1740 1799 . + . Parent=1B +Merlin exonerate CDS 1800 1880 . + 0 Parent=1B +Merlin exonerate CDS 1900 1950 . + 1 Parent=1B +Merlin exonerate CDS 2100 2120 . + 2 Parent=1B +Merlin exonerate UTR 2120 2300 . + . Parent=1B + + + +# { +# "baseUrl": "http://localhost:8000/out/data/" +# "compress": 0, +# "label": "Transcript", +# "storeClass": "JBrowse/Store/SeqFeature/NCList", +# "trackType": "JBrowse/View/Track/CanvasFeatures", +# "type": "JBrowse/View/Track/CanvasFeatures", +# "urlTemplate": "tracks/42ff9cb16c0509f0abb4a76ce14077bc_0/{refseq}/trackData.json", +# }
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/interpro.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,558 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 +Merlin annotation remark 1 172788 . . . gff-version=3;sequence-region=%28%27Merlin%27%2C 0%2C 172788%29 +Merlin feature polypeptide 1 229 . + . ID=Merlin_1;md5=4d58b2b569c2fe52e2945e3f6e380c48 +Merlin Gene3D protein_match 2 50 2.9E-21 + . ID=match%2477_2_50;Name=G3DSA:3.90.176.10;Target=Merlin_1 2 50;date=23-02-2015;status=T +Merlin Gene3D protein_match 106 165 2.9E-21 + . ID=match%2477_106_165;Name=G3DSA:3.90.176.10;Target=Merlin_1 106 165;date=23-02-2015;status=T +Merlin Pfam protein_match 7 162 1.9E-12 + . Dbxref=InterPro:IPR003540;ID=match%2478_7_162;Name=PF03496;Ontology_term=GO:0005576%22%2C%22GO:0009405;Target=Merlin_1 7 162;date=23-02-2015;signature_desc=ADP-ribosyltransferase exoenzyme;status=T +Merlin SUPERFAMILY protein_match 2 48 . + . ID=match%2479_2_48;Name=SSF56399;Target=Merlin_1 2 48;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 104 164 . + . ID=match%2479_104_164;Name=SSF56399;Target=Merlin_1 104 164;date=23-02-2015;status=T +Merlin feature polypeptide 1698 2011 . - . ID=Merlin_3;md5=6b220b99a5d2dd40f55bb664a8dbdfb3 +Merlin Pfam protein_match 1912 2011 1.5E-8 - . Dbxref=InterPro:IPR010667;ID=match%24113_149_248;Name=PF06841;Ontology_term=GO:0005198;Target=Merlin_3 149 248;date=23-02-2015;signature_desc=T4-like virus tail tube protein gp19;status=T +Merlin feature polypeptide 2716 3066 . - . ID=Merlin_4;md5=bdb226d471fe35e28ce6a9ed4649a1f8 +Merlin Pfam protein_match 2725 3066 1.6E-150 - . Dbxref=InterPro:IPR024389;ID=match%24361_4_345;Name=PF11091;Target=Merlin_4 4 345;date=23-02-2015;signature_desc=Tail-tube assembly protein;status=T +Merlin feature polypeptide 5144 5317 . - . ID=Merlin_6;md5=c61e0e2dba259054b9c93fd931056fdd +Merlin Pfam protein_match 5166 5317 1.5E-59 - . Dbxref=InterPro:IPR024342;ID=match%24360_21_172;Name=PF11110;Target=Merlin_6 21 172;date=23-02-2015;signature_desc=Baseplate hub distal subunit;status=T +Merlin feature polypeptide 6052 6431 . - . ID=Merlin_7;md5=b51a60ffef9f07b672e0d12d26d27bbc +Merlin SUPERFAMILY protein_match 6256 6431 . - . ID=match%24227_199_374;Name=SSF69279;Target=Merlin_7 199 374;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 6238 6431 . - . ID=match%24228_5_198;Name=SSF69279;Target=Merlin_7 5 198;date=23-02-2015;status=T +Merlin Pfam protein_match 6237 6431 4.3E-96 - . Dbxref=InterPro:IPR015181;ID=match%24229_4_198;Name=PF09097;Target=Merlin_7 4 198;date=23-02-2015;signature_desc=Baseplate structural protein,domain 1;status=T +Merlin Gene3D protein_match 6320 6431 1.4E-54 - . ID=match%24230_3_114;Name=G3DSA:2.40.10.10;Target=Merlin_7 3 114;date=23-02-2015;status=T +Merlin Pfam protein_match 6263 6431 3.9E-83 - . Dbxref=InterPro:IPR015180;ID=match%24231_201_369;Name=PF09096;Target=Merlin_7 201 369;date=23-02-2015;signature_desc=Baseplate structural protein,domain 2;status=T +Merlin feature polypeptide 6931 7180 . - . ID=Merlin_8;md5=78306f53371e5e47b051cad8a16b86e5 +Merlin Pfam protein_match 6985 7180 9.8E-56 - . Dbxref=InterPro:IPR024364;ID=match%24348_5_200;Name=PF12322;Target=Merlin_8 5 200;date=23-02-2015;signature_desc=T4 bacteriophage base plate protein;status=T +Merlin feature polypeptide 7227 7435 . + . ID=Merlin_9;md5=5ced4f78a57bd34e165ccf7b43ed3ef1 +Merlin Pfam protein_match 7231 7433 6.3E-63 + . Dbxref=InterPro:IPR024364;ID=match%24164_5_207;Name=PF12322;Target=Merlin_9 5 207;date=23-02-2015;signature_desc=T4 bacteriophage base plate protein;status=T +Merlin feature polypeptide 7856 7970 . + . ID=Merlin_10;md5=b0c491c633f373b9340ede7359636469 +Merlin Pfam protein_match 7863 7956 1.6E-17 + . Dbxref=InterPro:IPR007048;ID=match%2416_8_101;Name=PF04965;Target=Merlin_10 8 101;date=23-02-2015;signature_desc=Gene 25-like lysozyme;status=T +Merlin SUPERFAMILY protein_match 7862 7957 . + . ID=match%2417_7_102;Name=SSF160719;Target=Merlin_10 7 102;date=23-02-2015;status=T +Merlin Gene3D protein_match 7857 7959 9.3E-27 + . Dbxref=InterPro:IPR015801;ID=match%2418_2_104;Name=G3DSA:3.10.450.40;Ontology_term=GO:0005507%22%2C%22GO:0009308%22%2C%22GO:0048038;Target=Merlin_10 2 104;date=23-02-2015;status=T +Merlin feature polypeptide 8339 8475 . + . ID=Merlin_11;md5=7125953ccce81b5059482c7b3922d29d +Merlin Pfam protein_match 8347 8475 2.2E-49 + . Dbxref=InterPro:IPR021289;ID=match%24334_9_137;Name=PF11056;Target=Merlin_11 9 137;date=23-02-2015;signature_desc=Recombination,repair and ssDNA binding protein UvsY;status=T +Merlin feature polypeptide 8786 8839 . + . ID=Merlin_12;md5=bcd73a62fca23ea0a1174d9b0e57d679 +Merlin Pfam protein_match 8788 8839 2.1E-24 + . Dbxref=InterPro:IPR024362;ID=match%24364_3_54;Name=PF10886;Target=Merlin_12 3 54;date=23-02-2015;signature_desc=Protein of unknown function %28DUF2685%29;status=T +Merlin feature polypeptide 9167 9241 . - . ID=Merlin_13;md5=a54985fe0f4378a1bf4e8dee4703f4c0 +Merlin Pfam protein_match 9188 9241 1.2E-27 - . Dbxref=InterPro:IPR020975;ID=match%24300_19_72;Name=PF11637;Target=Merlin_13 19 72;date=23-02-2015;signature_desc=ATP-dependant DNA helicase UvsW;status=T +Merlin feature polypeptide 10249 10747 . - . ID=Merlin_14;md5=8e7c294d59d5955f5678e0d98ec0d4df +Merlin SMART protein_match 10561 10747 1.6E-14 - . Dbxref=InterPro:IPR014001;ID=match%24169_110_296;Name=SM00487;Target=Merlin_14 110 296;date=23-02-2015;signature_desc=DEAD-like helicases superfamily;status=T +Merlin Pfam protein_match 10605 10747 6.2E-9 - . Dbxref=InterPro:IPR006935;ID=match%24170_112_254;Name=PF04851;Ontology_term=GO:0003677%22%2C%22GO:0005524%22%2C%22GO:0016787;Target=Merlin_14 112 254;date=23-02-2015;signature_desc=Type III restriction enzyme,res subunit;status=T +Merlin Gene3D protein_match 10658 10747 7.5E-10 - . Dbxref=InterPro:IPR027417;ID=match%24171_357_446;Name=G3DSA:3.40.50.300;Target=Merlin_14 357 446;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 10593 10747 . - . Dbxref=InterPro:IPR014001;ID=match%24172_122_276;Name=PS51192;Target=Merlin_14 122 276;date=23-02-2015;signature_desc=Superfamilies 1 and 2 helicase ATP-binding type-1 domain profile.;status=T +Merlin ProSiteProfiles protein_match 10595 10747 . - . Dbxref=InterPro:IPR001650;ID=match%24173_331_483;Name=PS51194;Target=Merlin_14 331 483;date=23-02-2015;signature_desc=Superfamilies 1 and 2 helicase C-terminal domain profile.;status=T +Merlin Gene3D protein_match 10575 10747 8.0E-36 - . Dbxref=InterPro:IPR027417;ID=match%24174_103_275;Name=G3DSA:3.40.50.300;Target=Merlin_14 103 275;date=23-02-2015;status=T +Merlin Pfam protein_match 10678 10747 3.3E-6 - . Dbxref=InterPro:IPR001650;ID=match%24175_370_439;Name=PF00271;Target=Merlin_14 370 439;date=23-02-2015;signature_desc=Helicase conserved C-terminal domain;status=T +Merlin SUPERFAMILY protein_match 10698 10747 . - . Dbxref=InterPro:IPR027417;ID=match%24176_129_178;Name=SSF52540;Target=Merlin_14 129 178;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 10639 10747 . - . Dbxref=InterPro:IPR027417;ID=match%24176_366_474;Name=SSF52540;Target=Merlin_14 366 474;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 10473 10747 . - . Dbxref=InterPro:IPR027417;ID=match%24177_1_275;Name=SSF52540;Target=Merlin_14 1 275;date=23-02-2015;status=T +Merlin SMART protein_match 10664 10747 1.6E-5 - . Dbxref=InterPro:IPR001650;ID=match%24178_357_440;Name=SM00490;Target=Merlin_14 357 440;date=23-02-2015;signature_desc=helicase superfamily c-terminal domain;status=T +Merlin feature polypeptide 10799 11009 . + . ID=Merlin_15;md5=99330680c5d343d9693515d8855b17f2 +Merlin PIRSF protein_match 10799 11009 3.6E-83 + . Dbxref=InterPro:IPR016594;ID=match%24278_1_211;Name=PIRSF012159;Target=Merlin_15 1 211;date=23-02-2015;status=T +Merlin feature polypeptide 11468 11740 . + . ID=Merlin_16;md5=7a6101f7a0bb1fc1bebb4187e33dd9a7 +Merlin ProSiteProfiles protein_match 11544 11637 . + . Dbxref=InterPro:IPR007110;ID=match%24378_77_170;Name=PS50835;Ontology_term=GO:0005515;Target=Merlin_16 77 170;date=23-02-2015;signature_desc=Ig-like domain profile.;status=T +Merlin feature polypeptide 12364 12441 . + . ID=Merlin_17;md5=51287e0ea7c2e110589ed61f01177ebf +Merlin Pfam protein_match 12364 12427 3.7E-21 + . Dbxref=InterPro:IPR021404;ID=match%24349_1_64;Name=PF11242;Target=Merlin_17 1 64;date=23-02-2015;signature_desc=Protein of unknown function %28DUF2774%29;status=T +Merlin feature polypeptide 13339 13671 . + . ID=Merlin_20;md5=c55831dd21d84f2dc5e691281c13e17f +Merlin Gene3D protein_match 13490 13570 3.9E-35 + . ID=match%24326_152_232;Name=G3DSA:3.30.1490.70;Target=Merlin_20 152 232;date=23-02-2015;status=T +Merlin Gene3D protein_match 13346 13376 3.9E-35 + . ID=match%24326_8_38;Name=G3DSA:3.30.1490.70;Target=Merlin_20 8 38;date=23-02-2015;status=T +Merlin Gene3D protein_match 13377 13489 2.3E-38 + . ID=match%24327_39_151;Name=G3DSA:3.30.470.30;Target=Merlin_20 39 151;date=23-02-2015;status=T +Merlin Pfam protein_match 13368 13564 1.2E-50 + . Dbxref=InterPro:IPR021122;ID=match%24328_30_226;Name=PF09414;Target=Merlin_20 30 226;date=23-02-2015;signature_desc=RNA ligase;status=T +Merlin TIGRFAM protein_match 13340 13670 6.8E-105 + . Dbxref=InterPro:IPR012647;ID=match%24329_2_332;Name=TIGR02307;Ontology_term=GO:0003972%22%2C%22GO:0005524%22%2C%22GO:0016874;Target=Merlin_20 2 332;date=23-02-2015;signature_desc=RNA_lig_RNL2: RNA ligase,Rnl2 family;status=T +Merlin SUPERFAMILY protein_match 13339 13571 . + . ID=match%24330_1_233;Name=SSF56091;Target=Merlin_20 1 233;date=23-02-2015;status=T +Merlin feature polypeptide 15770 16197 . - . ID=Merlin_22;md5=d01d0f5a1c78f3ecd35c1050fbaca9f9 +Merlin Pfam protein_match 15985 16197 5.2E-13 - . Dbxref=InterPro:IPR010762;ID=match%24304_12_224;Name=PF07068;Target=Merlin_22 12 224;date=23-02-2015;signature_desc=Major capsid protein Gp23;status=T +Merlin feature polypeptide 17322 17836 . - . ID=Merlin_23;md5=75bfb18ed2707b309c2a9ce33a7f1b9c +Merlin Pfam protein_match 17343 17836 3.3E-283 - . Dbxref=InterPro:IPR010762;ID=match%24343_6_499;Name=PF07068;Target=Merlin_23 6 499;date=23-02-2015;signature_desc=Major capsid protein Gp23;status=T +Merlin feature polypeptide 19138 19351 . - . ID=Merlin_25;md5=267295efe7b1e708e23dbc20b7038290 +Merlin Pfam protein_match 19141 19351 3.6E-106 - . Dbxref=InterPro:IPR005082;ID=match%24335_3_213;Name=PF03420;Target=Merlin_25 3 213;date=23-02-2015;signature_desc=Prohead core protein protease;status=T +Merlin feature polypeptide 19636 19776 . - . ID=Merlin_26;md5=955dc2ce10a08eb3c66e9762917da515 +Merlin PIRSF protein_match 19637 19776 1.8E-82 - . Dbxref=InterPro:IPR016415;ID=match%24369_1_140;Name=PIRSF004377;Target=Merlin_26 1 140;date=23-02-2015;status=T +Merlin feature polypeptide 21031 21550 . - . ID=Merlin_28;md5=8d80cd7dbe0c21919b521c03a8e6d93e +Merlin Pfam protein_match 21051 21550 7.9E-227 - . Dbxref=InterPro:IPR010823;ID=match%24252_6_505;Name=PF07230;Target=Merlin_28 6 505;date=23-02-2015;signature_desc=Bacteriophage T4-like capsid assembly protein %28Gp20%29;status=T +Merlin feature polypeptide 21954 22116 . - . ID=Merlin_29;md5=ca77bdf62fcb7d10099120a70ea65664 +Merlin Pfam protein_match 21974 22116 2.5E-14 - . Dbxref=InterPro:IPR010667;ID=match%24371_19_161;Name=PF06841;Ontology_term=GO:0005198;Target=Merlin_29 19 161;date=23-02-2015;signature_desc=T4-like virus tail tube protein gp19;status=T +Merlin feature polypeptide 23559 24216 . - . ID=Merlin_30;md5=54c0ebcc81ab5bb693c092ec7455c4a8 +Merlin Pfam protein_match 23572 24216 2.9E-152 - . Dbxref=InterPro:IPR007067;ID=match%24219_1_645;Name=PF04984;Target=Merlin_30 1 645;date=23-02-2015;signature_desc=Phage tail sheath protein;status=T +Merlin feature polypeptide 25481 26094 . - . ID=Merlin_31;md5=55392e2827eef02df63fd882f2ced816 +Merlin SUPERFAMILY protein_match 25922 26094 . - . Dbxref=InterPro:IPR027417;ID=match%24217_137_309;Name=SSF52540;Target=Merlin_31 137 309;date=23-02-2015;status=T +Merlin Pfam protein_match 25708 26094 3.8E-77 - . Dbxref=InterPro:IPR004921;ID=match%24218_160_546;Name=PF03237;Target=Merlin_31 160 546;date=23-02-2015;signature_desc=Terminase-like family;status=T +Merlin feature polypeptide 26405 26569 . - . ID=Merlin_32;md5=c476d9fbef90079fdff5846a36df9115 +Merlin Pfam protein_match 26423 26569 1.0E-67 - . Dbxref=InterPro:IPR020342;ID=match%24243_1_147;Name=PF11053;Target=Merlin_32 1 147;date=23-02-2015;signature_desc=Terminase DNA packaging enzyme;status=T +Merlin feature polypeptide 27949 28204 . - . ID=Merlin_34;md5=fb9b89cda5243f9bca8663b2591aabf3 +Merlin Pfam protein_match 27973 28204 2.0E-96 - . Dbxref=InterPro:IPR021674;ID=match%24365_21_252;Name=PF11649;Target=Merlin_34 21 252;date=23-02-2015;signature_desc=Virus neck protein;status=T +Merlin feature polypeptide 30091 30553 . - . ID=Merlin_36;md5=3acd68b6f89b288da59e028fb8bcf461 +Merlin Gene3D protein_match 30449 30553 4.6E-29 - . Dbxref=InterPro:IPR012284%22%2C%22KEGG:00030%2B1.1.1.44%22%2C%22KEGG:00480%2B1.1.1.44%22%2C%22UniPathway:UPA00115;ID=match%24233_2_106;Name=G3DSA:1.20.5.320;Target=Merlin_36 2 106;date=23-02-2015;status=T +Merlin Pfam protein_match 30462 30553 8.5E-25 - . Dbxref=InterPro:IPR012473;ID=match%24234_369_460;Name=PF07921;Target=Merlin_36 369 460;date=23-02-2015;signature_desc=Fibritin C-terminal region;status=T +Merlin SUPERFAMILY protein_match 30449 30553 . - . ID=match%24235_2_106;Name=SSF58046;Target=Merlin_36 2 106;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 30463 30553 . - . ID=match%24236_371_461;Name=SSF58046;Target=Merlin_36 371 461;date=23-02-2015;status=T +Merlin Gene3D protein_match 30463 30553 4.5E-23 - . Dbxref=InterPro:IPR012284%22%2C%22KEGG:00030%2B1.1.1.44%22%2C%22KEGG:00480%2B1.1.1.44%22%2C%22UniPathway:UPA00115;ID=match%24237_371_461;Name=G3DSA:1.20.5.320;Target=Merlin_36 371 461;date=23-02-2015;status=T +Merlin PRINTS protein_match 30527 30553 . - . Dbxref=InterPro:IPR012473;ID=match%24238_369_395;Name=PR01880;Target=Merlin_36 369 395;date=23-02-2015;signature_desc=Fibritin signature;status=T +Merlin PRINTS protein_match 30532 30553 . - . Dbxref=InterPro:IPR012473;ID=match%24238_396_417;Name=PR01880;Target=Merlin_36 396 417;date=23-02-2015;signature_desc=Fibritin signature;status=T +Merlin feature polypeptide 31511 31982 . - . ID=Merlin_37;md5=d1a2df5071389c1bcd6fccd29a6b043c +Merlin Pfam protein_match 31939 31982 3.1E-11 - . Dbxref=InterPro:IPR011083;ID=match%24222_328_371;Name=PF07484;Target=Merlin_37 328 371;date=23-02-2015;signature_desc=Phage Tail Collar Domain;status=T +Merlin SUPERFAMILY protein_match 31821 31982 . - . ID=match%24223_310_471;Name=SSF88874;Target=Merlin_37 310 471;date=23-02-2015;status=T +Merlin Gene3D protein_match 31911 31982 1.2E-17 - . Dbxref=InterPro:IPR011083;ID=match%24224_306_377;Name=G3DSA:3.90.1340.10;Target=Merlin_37 306 377;date=23-02-2015;status=T +Merlin feature polypeptide 32417 32632 . - . ID=Merlin_38;md5=e77431a566cbfb86e7fdf48dfa58fafd +Merlin SUPERFAMILY protein_match 32428 32632 . - . Dbxref=InterPro:IPR014791;ID=match%2421_12_216;Name=SSF56558;Target=Merlin_38 12 216;date=23-02-2015;status=T +Merlin Gene3D protein_match 32595 32632 1.5E-17 - . ID=match%2422_179_216;Name=G3DSA:2.20.20.20;Target=Merlin_38 179 216;date=23-02-2015;status=T +Merlin Gene3D protein_match 32534 32632 7.6E-31 - . Dbxref=InterPro:IPR015976;ID=match%2423_80_178;Name=G3DSA:3.90.1160.10;Target=Merlin_38 80 178;date=23-02-2015;status=T +Merlin Gene3D protein_match 32572 32632 2.6E-25 - . Dbxref=InterPro:IPR015982;ID=match%2424_1_61;Name=G3DSA:1.10.286.30;Target=Merlin_38 1 61;date=23-02-2015;status=T +Merlin Pfam protein_match 32417 32632 1.7E-67 - . Dbxref=InterPro:IPR014791;ID=match%2425_1_216;Name=PF08677;Target=Merlin_38 1 216;date=23-02-2015;signature_desc=GP11 baseplate wedge protein;status=T +Merlin feature polypeptide 33837 34437 . - . ID=Merlin_39;md5=8f9953a81c739e4a1a5d80943d8ce8d5 +Merlin Pfam protein_match 34176 34437 5.8E-81 - . Dbxref=InterPro:IPR008987;ID=match%24225_2_263;Name=PF07880;Ontology_term=GO:0019058;Target=Merlin_39 2 263;date=23-02-2015;signature_desc=Bacteriophage T4 gp9/10-like protein;status=T +Merlin SUPERFAMILY protein_match 34206 34437 . - . Dbxref=InterPro:IPR008987;ID=match%24226_1_232;Name=SSF50017;Ontology_term=GO:0019058;Target=Merlin_39 1 232;date=23-02-2015;status=T +Merlin feature polypeptide 35013 35300 . - . ID=Merlin_40;md5=c997aa230728d19f4b70247099ceff3a +Merlin Gene3D protein_match 35268 35300 2.8E-21 - . ID=match%2427_17_49;Name=G3DSA:1.20.5.960;Target=Merlin_40 17 49;date=23-02-2015;status=T +Merlin Pfam protein_match 35045 35300 1.6E-78 - . Dbxref=InterPro:IPR008987;ID=match%2428_7_262;Name=PF07880;Ontology_term=GO:0019058;Target=Merlin_40 7 262;date=23-02-2015;signature_desc=Bacteriophage T4 gp9/10-like protein;status=T +Merlin Gene3D protein_match 35198 35300 5.7E-37 - . Dbxref=InterPro:IPR027411;ID=match%2429_67_169;Name=G3DSA:2.60.120.640;Target=Merlin_40 67 169;date=23-02-2015;status=T +Merlin Gene3D protein_match 35189 35300 2.5E-32 - . Dbxref=InterPro:IPR027412;ID=match%2430_172_283;Name=G3DSA:2.60.40.1680;Target=Merlin_40 172 283;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 35014 35300 . - . Dbxref=InterPro:IPR008987;ID=match%2431_1_287;Name=SSF50017;Ontology_term=GO:0019058;Target=Merlin_40 1 287;date=23-02-2015;status=T +Merlin feature polypeptide 36049 36385 . - . ID=Merlin_41;md5=b066782ec3d6e79967657aef7997933e +Merlin Pfam protein_match 36058 36385 1.8E-145 - . Dbxref=InterPro:IPR015298;ID=match%2480_10_337;Name=PF09215;Target=Merlin_41 10 337;date=23-02-2015;signature_desc=Bacteriophage T4,Gp8;status=T +Merlin Gene3D protein_match 36228 36385 1.5E-71 - . ID=match%2481_88_245;Name=G3DSA:2.170.290.10;Target=Merlin_41 88 245;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 36055 36385 . - . Dbxref=InterPro:IPR015298;ID=match%2482_7_337;Name=SSF89433;Target=Merlin_41 7 337;date=23-02-2015;status=T +Merlin Gene3D protein_match 36294 36385 6.2E-29 - . ID=match%2483_246_337;Name=G3DSA:2.60.340.10;Target=Merlin_41 246 337;date=23-02-2015;status=T +Merlin feature polypeptide 38447 39479 . - . ID=Merlin_42;md5=52620a3cdfad015acc904a2b25021bb6 +Merlin TMHMM protein_match 39457 39479 . - . ID=match%24359_883_905;Name=TMhelix;Target=Merlin_42 883 905;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 41613 41709 . - . ID=Merlin_44;md5=a6e57c0aea8f31d2af0f531a762ace4d +Merlin Pfam protein_match 41631 41709 1.5E-11 - . Dbxref=InterPro:IPR008727;ID=match%24352_11_89;Name=PF05488;Target=Merlin_44 11 89;date=23-02-2015;signature_desc=PAAR motif;status=T +Merlin feature polypeptide 43377 43951 . - . ID=Merlin_46;md5=b56ec180edf42efb12a9a9261aba0195 +Merlin SUPERFAMILY protein_match 43830 43951 . - . ID=match%24244_7_128;Name=SSF69255;Target=Merlin_46 7 128;date=23-02-2015;status=T +Merlin Pfam protein_match 43823 43951 1.3E-19 - . Dbxref=InterPro:IPR002196;ID=match%24245_197_325;Name=PF00959;Ontology_term=GO:0003796%22%2C%22GO:0009253%22%2C%22GO:0016998;Target=Merlin_46 197 325;date=23-02-2015;signature_desc=Phage lysozyme;status=T +Merlin Pfam protein_match 43929 43951 5.9E-5 - . Dbxref=InterPro:IPR010609;ID=match%24246_532_554;Name=PF06715;Target=Merlin_46 532 554;date=23-02-2015;signature_desc=Gp5 C-terminal repeat %283 copies%29;status=T +Merlin SUPERFAMILY protein_match 43739 43951 . - . Dbxref=InterPro:IPR023346;ID=match%24247_130_342;Name=SSF53955;Target=Merlin_46 130 342;date=23-02-2015;status=T +Merlin Pfam protein_match 43811 43951 6.2E-60 - . Dbxref=InterPro:IPR009590;ID=match%24248_33_173;Name=PF06714;Target=Merlin_46 33 173;date=23-02-2015;signature_desc=Gp5 N-terminal OB domain;status=T +Merlin Gene3D protein_match 43786 43951 5.3E-47 - . Dbxref=InterPro:IPR023347;ID=match%24249_175_340;Name=G3DSA:1.10.530.40;Ontology_term=GO:0003796;Target=Merlin_46 175 340;date=23-02-2015;status=T +Merlin PRINTS protein_match 43931 43951 . - . Dbxref=InterPro:IPR001165;ID=match%24250_250_270;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_46 250 270;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 43930 43951 . - . Dbxref=InterPro:IPR001165;ID=match%24250_316_337;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_46 316 337;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 43933 43951 . - . Dbxref=InterPro:IPR001165;ID=match%24250_197_215;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_46 197 215;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 43932 43951 . - . Dbxref=InterPro:IPR001165;ID=match%24250_273_292;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_46 273 292;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 43932 43951 . - . Dbxref=InterPro:IPR001165;ID=match%24250_177_196;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_46 177 196;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin SUPERFAMILY protein_match 43738 43951 . - . ID=match%24251_362_575;Name=SSF69349;Target=Merlin_46 362 575;date=23-02-2015;status=T +Merlin feature polypeptide 44336 44526 . - . ID=Merlin_47;md5=f66cc0e274aa4a2e3ad3524d11b4ca8d +Merlin Pfam protein_match 44338 44526 1.1E-80 - . Dbxref=InterPro:IPR022607;ID=match%24376_1_189;Name=PF11246;Target=Merlin_47 1 189;date=23-02-2015;signature_desc=Base plate wedge protein 53;status=T +Merlin feature polypeptide 44575 44723 . + . ID=Merlin_48;md5=db6e43ebc109e2ca8dd1621bf1045f60 +Merlin Pfam protein_match 44631 44713 5.5E-6 + . Dbxref=InterPro:IPR014833;ID=match%24263_57_139;Name=PF08722;Target=Merlin_48 57 139;date=23-02-2015;signature_desc=TnsA endonuclease N terminal;status=T +Merlin Gene3D protein_match 44595 44722 6.7E-8 + . Dbxref=InterPro:IPR011578;ID=match%24264_21_148;Name=G3DSA:3.40.91.30;Ontology_term=GO:0003677%22%2C%22GO:0004536;Target=Merlin_48 21 148;date=23-02-2015;status=T +Merlin feature polypeptide 45939 46133 . + . ID=Merlin_50;md5=c12e4de52a8c430f588d233b08b61c47 +Merlin Pfam protein_match 45951 46102 7.2E-13 + . Dbxref=InterPro:IPR010667;ID=match%24289_13_164;Name=PF06841;Ontology_term=GO:0005198;Target=Merlin_50 13 164;date=23-02-2015;signature_desc=T4-like virus tail tube protein gp19;status=T +Merlin feature polypeptide 46526 46767 . + . ID=Merlin_51;md5=44c32bcfbe87334f8b9979c78641ea0e +Merlin Gene3D protein_match 46558 46666 8.6E-35 + . Dbxref=InterPro:IPR023191%22%2C%22MetaCyc:PWY-7197;ID=match%24184_33_141;Name=G3DSA:1.10.238.70;Target=Merlin_51 33 141;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 46526 46766 . + . Dbxref=InterPro:IPR027417;ID=match%24185_1_241;Name=SSF52540;Target=Merlin_51 1 241;date=23-02-2015;status=T +Merlin Gene3D protein_match 46667 46766 1.1E-38 + . Dbxref=InterPro:IPR027417;ID=match%24186_142_241;Name=G3DSA:3.40.50.300;Target=Merlin_51 142 241;date=23-02-2015;status=T +Merlin feature polypeptide 48417 48476 . + . ID=Merlin_56;md5=dc34e36a55b68ab85f9f4025953722dd +Merlin TMHMM protein_match 48423 48445 . + . ID=match%2419_7_29;Name=TMhelix;Target=Merlin_56 7 29;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin TMHMM protein_match 48450 48467 . + . ID=match%2420_34_51;Name=TMhelix;Target=Merlin_56 34 51;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 48583 48643 . + . ID=Merlin_57;md5=92432814d3b042b81b0243bcc206f353 +Merlin TMHMM protein_match 48614 48636 . + . ID=match%24140_32_54;Name=TMhelix;Target=Merlin_57 32 54;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 53031 53177 . + . ID=Merlin_67;md5=11d053c2a3286db1c972e4f47c68b4d0 +Merlin SUPERFAMILY protein_match 53037 53176 . + . ID=match%24138_7_146;Name=SSF143990;Target=Merlin_67 7 146;date=23-02-2015;status=T +Merlin Pfam protein_match 53032 53175 2.7E-72 + . Dbxref=InterPro:IPR012596;ID=match%24139_2_145;Name=PF08010;Target=Merlin_67 2 145;date=23-02-2015;signature_desc=Bacteriophage protein GP30.3;status=T +Merlin feature polypeptide 53646 53837 . + . ID=Merlin_68;md5=53b087335d3f0f83ce9373e9e04d7ac9 +Merlin Pfam protein_match 53680 53836 4.7E-30 + . Dbxref=InterPro:IPR009576;ID=match%24366_35_191;Name=PF06693;Target=Merlin_68 35 191;date=23-02-2015;signature_desc=Protein of unknown function %28DUF1190%29;status=T +Merlin ProSiteProfiles protein_match 53646 53676 . + . ID=match%24367_1_31;Name=PS51257;Target=Merlin_68 1 31;date=23-02-2015;signature_desc=Prokaryotic membrane lipoprotein lipid attachment site profile.;status=T +Merlin feature polypeptide 55856 55995 . + . ID=Merlin_72;md5=beba87d69ada37b97ba8a268a912d352 +Merlin TMHMM protein_match 55881 55903 . + . ID=match%24211_26_48;Name=TMhelix;Target=Merlin_72 26 48;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin TMHMM protein_match 55916 55938 . + . ID=match%24212_61_83;Name=TMhelix;Target=Merlin_72 61 83;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 56275 56396 . + . ID=Merlin_73;md5=7108a9e88b0d3d1b26354b1ccaaed3c2 +Merlin TMHMM protein_match 56294 56316 . + . ID=match%24295_20_42;Name=TMhelix;Target=Merlin_73 20 42;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin TMHMM protein_match 56331 56348 . + . ID=match%24296_57_74;Name=TMhelix;Target=Merlin_73 57 74;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 58716 58761 . + . ID=Merlin_79;md5=40427c43e50fbf56572904272e3cac72 +Merlin ProSitePatterns protein_match 58725 58746 . + . Dbxref=InterPro:IPR007087;ID=match%2441_10_31;Name=PS00028;Ontology_term=GO:0046872;Target=Merlin_79 10 31;date=23-02-2015;signature_desc=Zinc finger C2H2 type domain signature.;status=T +Merlin SUPERFAMILY protein_match 58724 58748 . + . ID=match%2442_9_33;Name=SSF57667;Target=Merlin_79 9 33;date=23-02-2015;status=T +Merlin feature polypeptide 59560 59671 . + . ID=Merlin_81;md5=80a8316762d779be1dde1c22ea394bad +Merlin TMHMM protein_match 59621 59640 . + . ID=match%24220_62_81;Name=TMhelix;Target=Merlin_81 62 81;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin TMHMM protein_match 59589 59611 . + . ID=match%24221_30_52;Name=TMhelix;Target=Merlin_81 30 52;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 60378 60530 . + . ID=Merlin_84;md5=de55f2e874e08fa432878ad3f51f20a0 +Merlin SUPERFAMILY protein_match 60384 60473 . + . Dbxref=InterPro:IPR015797;ID=match%24130_7_96;Name=SSF55811;Ontology_term=GO:0016787;Target=Merlin_84 7 96;date=23-02-2015;status=T +Merlin PRINTS protein_match 60414 60428 . + . Dbxref=InterPro:IPR020476%22%2C%22KEGG:00230%2B3.6.1.-%22%2C%22KEGG:00790%2B3.6.1.-;ID=match%24131_37_51;Name=PR00502;Target=Merlin_84 37 51;date=23-02-2015;signature_desc=NUDIX hydrolase family signature;status=T +Merlin PRINTS protein_match 60428 60443 . + . Dbxref=InterPro:IPR020476%22%2C%22KEGG:00230%2B3.6.1.-%22%2C%22KEGG:00790%2B3.6.1.-;ID=match%24131_51_66;Name=PR00502;Target=Merlin_84 51 66;date=23-02-2015;signature_desc=NUDIX hydrolase family signature;status=T +Merlin Pfam protein_match 60384 60472 1.2E-13 + . Dbxref=InterPro:IPR000086;ID=match%24132_7_95;Name=PF00293;Ontology_term=GO:0016787;Target=Merlin_84 7 95;date=23-02-2015;signature_desc=NUDIX domain;status=T +Merlin ProSiteProfiles protein_match 60381 60529 . + . Dbxref=InterPro:IPR000086;ID=match%24133_4_152;Name=PS51462;Ontology_term=GO:0016787;Target=Merlin_84 4 152;date=23-02-2015;signature_desc=Nudix hydrolase domain profile.;status=T +Merlin Gene3D protein_match 60384 60509 7.2E-19 + . Dbxref=InterPro:IPR015797;ID=match%24134_7_132;Name=G3DSA:3.90.79.10;Ontology_term=GO:0016787;Target=Merlin_84 7 132;date=23-02-2015;status=T +Merlin feature polypeptide 60868 61033 . + . ID=Merlin_85;md5=67b88c9345f371fd1fc546f87f499d95 +Merlin SUPERFAMILY protein_match 60868 61030 . + . Dbxref=InterPro:IPR023346;ID=match%2460_1_163;Name=SSF53955;Target=Merlin_85 1 163;date=23-02-2015;status=T +Merlin Gene3D protein_match 60868 61032 2.3E-56 + . Dbxref=InterPro:IPR023347;ID=match%2461_1_165;Name=G3DSA:1.10.530.40;Ontology_term=GO:0003796;Target=Merlin_85 1 165;date=23-02-2015;status=T +Merlin PRINTS protein_match 60891 60909 . + . Dbxref=InterPro:IPR001165;ID=match%2462_24_42;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_85 24 42;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 61007 61028 . + . Dbxref=InterPro:IPR001165;ID=match%2462_140_161;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_85 140 161;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 60871 60890 . + . Dbxref=InterPro:IPR001165;ID=match%2462_4_23;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_85 4 23;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 60918 60937 . + . Dbxref=InterPro:IPR001165;ID=match%2462_51_70;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_85 51 70;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 60964 60983 . + . Dbxref=InterPro:IPR001165;ID=match%2462_97_116;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_85 97 116;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin PRINTS protein_match 60987 61006 . + . Dbxref=InterPro:IPR001165;ID=match%2462_120_139;Name=PR00684;Ontology_term=GO:0003796%22%2C%22GO:0016998;Target=Merlin_85 120 139;date=23-02-2015;signature_desc=Phage T4 lysozyme signature;status=T +Merlin Pfam protein_match 60891 61017 1.7E-16 + . Dbxref=InterPro:IPR002196;ID=match%2463_24_150;Name=PF00959;Ontology_term=GO:0003796%22%2C%22GO:0009253%22%2C%22GO:0016998;Target=Merlin_85 24 150;date=23-02-2015;signature_desc=Phage lysozyme;status=T +Merlin feature polypeptide 61759 61893 . + . ID=Merlin_87;md5=af0621387ec215f9e7e60b939738a863 +Merlin Gene3D protein_match 61759 61892 7.0E-47 + . Dbxref=InterPro:IPR024796;ID=match%24148_1_134;Name=G3DSA:1.10.440.10;Target=Merlin_87 1 134;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 61760 61892 . + . ID=match%24149_2_134;Name=SSF47077;Target=Merlin_87 2 134;date=23-02-2015;status=T +Merlin PIRSF protein_match 61759 61893 1.7E-72 + . Dbxref=InterPro:IPR021143;ID=match%24150_1_135;Name=PIRSF001000;Target=Merlin_87 1 135;date=23-02-2015;status=T +Merlin Pfam protein_match 61759 61890 8.0E-38 + . Dbxref=InterPro:IPR004260;ID=match%24151_1_132;Name=PF03013;Target=Merlin_87 1 132;date=23-02-2015;signature_desc=Pyrimidine dimer DNA glycosylase;status=T +Merlin feature polypeptide 62358 62533 . + . ID=Merlin_88;md5=343d5f99ebdb7c459b8ef33ea4c77b21 +Merlin Gene3D protein_match 62406 62495 1.7E-4 + . Dbxref=InterPro:IPR027417;ID=match%24350_49_138;Name=G3DSA:3.40.50.300;Target=Merlin_88 49 138;date=23-02-2015;status=T +Merlin feature polypeptide 63941 64094 . + . ID=Merlin_93;md5=d1e024dd6276c4c93812dbe4d25dfaa7 +Merlin Pfam protein_match 63951 64092 2.6E-34 + . Dbxref=InterPro:IPR019653;ID=match%24325_11_152;Name=PF10715;Target=Merlin_93 11 152;date=23-02-2015;signature_desc=Endoribonuclease RegB T4-bacteriophage encoded;status=T +Merlin feature polypeptide 64413 64594 . + . ID=Merlin_94;md5=17f369e25c56962dc6118725b507f2f3 +Merlin Gene3D protein_match 64475 64568 1.6E-7 + . ID=match%24152_63_156;Name=G3DSA:1.10.530.10;Target=Merlin_94 63 156;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 64474 64586 . + . Dbxref=InterPro:IPR023346;ID=match%24153_62_174;Name=SSF53955;Target=Merlin_94 62 174;date=23-02-2015;status=T +Merlin Pfam protein_match 64440 64591 4.5E-35 + . Dbxref=InterPro:IPR019653;ID=match%24154_28_179;Name=PF10715;Target=Merlin_94 28 179;date=23-02-2015;signature_desc=Endoribonuclease RegB T4-bacteriophage encoded;status=T +Merlin feature polypeptide 65675 65824 . + . ID=Merlin_97;md5=279df7c12a4627883bf88c18423e767f +Merlin SUPERFAMILY protein_match 65678 65822 . + . ID=match%24142_4_148;Name=SSF52949;Target=Merlin_97 4 148;date=23-02-2015;status=T +Merlin PANTHER protein_match 65678 65822 . + . ID=match%24143_4_148;Name=PTHR12521:SF0;Target=Merlin_97 4 148;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 65675 65824 . + . Dbxref=InterPro:IPR002589;ID=match%24144_1_150;Name=PS51154;Target=Merlin_97 1 150;date=23-02-2015;signature_desc=Macro domain profile.;status=T +Merlin Pfam protein_match 65695 65806 1.6E-8 + . Dbxref=InterPro:IPR002589;ID=match%24145_21_132;Name=PF01661;Target=Merlin_97 21 132;date=23-02-2015;signature_desc=Macro domain;status=T +Merlin SMART protein_match 65676 65807 5.9E-8 + . Dbxref=InterPro:IPR002589;ID=match%24146_2_133;Name=SM00506;Target=Merlin_97 2 133;date=23-02-2015;signature_desc=Appr-1%22-p processing enzyme;status=T +Merlin PANTHER protein_match 65678 65822 . + . ID=match%24147_4_148;Name=PTHR12521;Target=Merlin_97 4 148;date=23-02-2015;status=T +Merlin feature polypeptide 67266 67457 . + . ID=Merlin_104;md5=fec6b737f5627a7b46fcdfcea90813dd +Merlin SUPERFAMILY protein_match 67266 67406 . + . Dbxref=InterPro:IPR027417;ID=match%24265_1_141;Name=SSF52540;Target=Merlin_104 1 141;date=23-02-2015;status=T +Merlin Pfam protein_match 67267 67451 1.4E-53 + . Dbxref=InterPro:IPR001267%22%2C%22KEGG:00240%2B2.7.1.21%22%2C%22KEGG:00983%2B2.7.1.21%22%2C%22MetaCyc:PWY-7199;ID=match%24266_2_186;Name=PF00265;Ontology_term=GO:0004797%22%2C%22GO:0005524;Target=Merlin_104 2 186;date=23-02-2015;signature_desc=Thymidine kinase;status=T +Merlin SUPERFAMILY protein_match 67407 67454 . + . ID=match%24267_142_189;Name=SSF57716;Target=Merlin_104 142 189;date=23-02-2015;status=T +Merlin PIRSF protein_match 67266 67455 1.7E-80 + . Dbxref=InterPro:IPR001267%22%2C%22KEGG:00240%2B2.7.1.21%22%2C%22KEGG:00983%2B2.7.1.21%22%2C%22MetaCyc:PWY-7199;ID=match%24268_1_190;Name=PIRSF035805;Ontology_term=GO:0004797%22%2C%22GO:0005524;Target=Merlin_104 1 190;date=23-02-2015;status=T +Merlin Gene3D protein_match 67407 67453 2.7E-18 + . ID=match%24269_142_188;Name=G3DSA:3.30.60.20;Target=Merlin_104 142 188;date=23-02-2015;status=T +Merlin ProSitePatterns protein_match 67438 67451 . + . Dbxref=InterPro:IPR020633%22%2C%22KEGG:00240%2B2.7.1.21%22%2C%22KEGG:00983%2B2.7.1.21%22%2C%22MetaCyc:PWY-7199;ID=match%24270_173_186;Name=PS00603;Ontology_term=GO:0004797%22%2C%22GO:0005524;Target=Merlin_104 173 186;date=23-02-2015;signature_desc=Thymidine kinase cellular-type signature.;status=T +Merlin PANTHER protein_match 67266 67455 . + . Dbxref=InterPro:IPR001267%22%2C%22KEGG:00240%2B2.7.1.21%22%2C%22KEGG:00983%2B2.7.1.21%22%2C%22MetaCyc:PWY-7199;ID=match%24271_1_190;Name=PTHR11441;Ontology_term=GO:0004797%22%2C%22GO:0005524;Target=Merlin_104 1 190;date=23-02-2015;status=T +Merlin Hamap protein_match 67266 67454 . + . Dbxref=InterPro:IPR020634%22%2C%22KEGG:00240%2B2.7.1.21%22%2C%22KEGG:00983%2B2.7.1.21%22%2C%22MetaCyc:PWY-7199;ID=match%24272_1_189;Name=MF_00124;Ontology_term=GO:0004797%22%2C%22GO:0005524%22%2C%22GO:0006259;Target=Merlin_104 1 189;date=23-02-2015;signature_desc=Thymidine kinase %5Btdk%5D.;status=T +Merlin feature polypeptide 70262 70346 . + . ID=Merlin_114;md5=f4679b9aa19f2ade7f5b62621e588fa7 +Merlin SUPERFAMILY protein_match 70301 70332 . + . ID=match%24353_40_71;Name=SSF57716;Target=Merlin_114 40 71;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 70269 70338 . + . Dbxref=InterPro:IPR000962;ID=match%24354_8_77;Name=PS51128;Ontology_term=GO:0008270;Target=Merlin_114 8 77;date=23-02-2015;signature_desc=Prokaryotic dksA C4-type zinc finger profiles.;status=T +Merlin Pfam protein_match 70301 70329 3.2E-8 + . Dbxref=InterPro:IPR000962;ID=match%24355_40_68;Name=PF01258;Ontology_term=GO:0008270;Target=Merlin_114 40 68;date=23-02-2015;signature_desc=Prokaryotic dksA/traR C4-type zinc finger;status=T +Merlin feature polypeptide 71091 71249 . + . ID=Merlin_117;md5=868a76ac07a28f5a4276cea0f0115a99 +Merlin TMHMM protein_match 71161 71183 . + . ID=match%24293_71_93;Name=TMhelix;Target=Merlin_117 71 93;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin TMHMM protein_match 71124 71146 . + . ID=match%24294_34_56;Name=TMhelix;Target=Merlin_117 34 56;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 72115 72450 . + . ID=Merlin_119;md5=70a5fa60dbb5199b4e2ec2e2aef23cdb +Merlin Pfam protein_match 72116 72250 7.7E-14 + . Dbxref=InterPro:IPR018775;ID=match%24232_2_136;Name=PF10127;Target=Merlin_119 2 136;date=23-02-2015;signature_desc=Predicted nucleotidyltransferase;status=T +Merlin feature polypeptide 73122 73199 . + . ID=Merlin_120;md5=f9244d3cfff8ae5f1bf98b23034898e1 +Merlin TMHMM protein_match 73125 73147 . + . ID=match%24297_4_26;Name=TMhelix;Target=Merlin_120 4 26;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin TMHMM protein_match 73159 73181 . + . ID=match%24298_38_60;Name=TMhelix;Target=Merlin_120 38 60;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 73720 74044 . + . ID=Merlin_122;md5=aacb7a94599c95f7eda3ac8dd33a1abf +Merlin SUPERFAMILY protein_match 73808 74029 . + . Dbxref=InterPro:IPR027417;ID=match%24290_89_310;Name=SSF52540;Target=Merlin_122 89 310;date=23-02-2015;status=T +Merlin Gene3D protein_match 73818 73919 2.6E-6 + . Dbxref=InterPro:IPR027417;ID=match%24291_99_200;Name=G3DSA:3.40.50.300;Target=Merlin_122 99 200;date=23-02-2015;status=T +Merlin SMART protein_match 73820 73960 0.0011 + . Dbxref=InterPro:IPR003593;ID=match%24292_101_241;Name=SM00382;Target=Merlin_122 101 241;date=23-02-2015;signature_desc=ATPases associated with a variety of cellular activities;status=T +Merlin feature polypeptide 79110 79277 . + . ID=Merlin_133;md5=c460a763069c40e50b510edd824bacb0 +Merlin Gene3D protein_match 79173 79200 4.3E-4 + . Dbxref=InterPro:IPR024482;ID=match%24109_64_91;Name=G3DSA:3.90.1000.10;Target=Merlin_133 64 91;date=23-02-2015;status=T +Merlin feature polypeptide 80153 80239 . + . ID=Merlin_136;md5=f5a8f14b788987de09ccca12d0b80dee +Merlin SUPERFAMILY protein_match 80155 80238 . + . Dbxref=InterPro:IPR012336;ID=match%24207_3_86;Name=SSF52833;Target=Merlin_136 3 86;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 80153 80239 . + . Dbxref=InterPro:IPR002109;ID=match%24208_1_87;Name=PS51354;Ontology_term=GO:0009055%22%2C%22GO:0015035%22%2C%22GO:0045454;Target=Merlin_136 1 87;date=23-02-2015;signature_desc=Glutaredoxin domain profile.;status=T +Merlin Pfam protein_match 80155 80228 1.5E-8 + . Dbxref=InterPro:IPR002109;ID=match%24209_3_76;Name=PF00462;Ontology_term=GO:0009055%22%2C%22GO:0015035%22%2C%22GO:0045454;Target=Merlin_136 3 76;date=23-02-2015;signature_desc=Glutaredoxin;status=T +Merlin Gene3D protein_match 80155 80239 1.0E-21 + . Dbxref=InterPro:IPR012336;ID=match%24210_3_87;Name=G3DSA:3.40.30.10;Target=Merlin_136 3 87;date=23-02-2015;status=T +Merlin feature polypeptide 81510 81653 . + . ID=Merlin_142;md5=d0f8afe40748b3e1ac4e6d8ed89d3025 +Merlin Pfam protein_match 81513 81646 2.2E-24 + . Dbxref=InterPro:IPR019506;ID=match%24311_4_137;Name=PF10465;Target=Merlin_142 4 137;date=23-02-2015;signature_desc=PinA peptidase inhibitor;status=T +Merlin feature polypeptide 82144 82300 . + . ID=Merlin_144;md5=1a091004966af2bde680570516d57330 +Merlin Gene3D protein_match 82246 82300 5.7E-29 + . ID=match%24258_103_157;Name=G3DSA:1.10.720.10;Target=Merlin_144 103 157;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 82247 82300 . + . Dbxref=InterPro:IPR015208;ID=match%24259_104_157;Name=SSF68918;Target=Merlin_144 104 157;date=23-02-2015;status=T +Merlin Pfam protein_match 82151 82240 5.1E-22 + . Dbxref=InterPro:IPR004211;ID=match%24260_8_97;Name=PF02945;Target=Merlin_144 8 97;date=23-02-2015;signature_desc=Recombination endonuclease VII;status=T +Merlin SUPERFAMILY protein_match 82144 82245 . + . ID=match%24261_1_102;Name=SSF54060;Target=Merlin_144 1 102;date=23-02-2015;status=T +Merlin Pfam protein_match 82247 82300 1.2E-24 + . Dbxref=InterPro:IPR015208;ID=match%24262_104_157;Name=PF09124;Target=Merlin_144 104 157;date=23-02-2015;signature_desc=T4 recombination endonuclease VII,dimerisation;status=T +Merlin feature polypeptide 82614 83222 . + . ID=Merlin_145;md5=a5948e1a7431f4a84dce85c7bc214ebf +Merlin TIGRFAM protein_match 82633 83219 8.5E-169 + . Dbxref=InterPro:IPR012833%22%2C%22KEGG:00230%2B1.17.4.2%22%2C%22KEGG:00240%2B1.17.4.2%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222;ID=match%24281_20_606;Name=TIGR02487;Ontology_term=GO:0006260%22%2C%22GO:0008998%22%2C%22GO:0016491%22%2C%22GO:0055114;Target=Merlin_145 20 606;date=23-02-2015;signature_desc=NrdD: anaerobic ribonucleoside-triphosphate reductase;status=T +Merlin Gene3D protein_match 82642 83200 1.7E-175 + . ID=match%24282_29_587;Name=G3DSA:3.20.70.20;Target=Merlin_145 29 587;date=23-02-2015;status=T +Merlin PANTHER protein_match 82616 83222 . + . ID=match%24283_3_609;Name=PTHR21075;Target=Merlin_145 3 609;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 82639 83201 . + . ID=match%24284_26_588;Name=SSF51998;Target=Merlin_145 26 588;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 83097 83222 . + . Dbxref=InterPro:IPR001150;ID=match%24285_484_609;Name=PS51149;Ontology_term=GO:0003824%22%2C%22GO:0008152;Target=Merlin_145 484 609;date=23-02-2015;signature_desc=Glycine radical domain profile.;status=T +Merlin ProSitePatterns protein_match 83190 83198 . + . Dbxref=InterPro:IPR019777;ID=match%24286_577_585;Name=PS00850;Target=Merlin_145 577 585;date=23-02-2015;signature_desc=Glycine radical domain signature.;status=T +Merlin Pfam protein_match 82633 83219 2.0E-71 + . ID=match%24287_20_606;Name=PF13597;Target=Merlin_145 20 606;date=23-02-2015;signature_desc=Anaerobic ribonucleoside-triphosphate reductase;status=T +Merlin feature polypeptide 84511 84648 . + . ID=Merlin_146;md5=cbdd3d17904270a3be66142258d543d0 +Merlin Gene3D protein_match 84513 84609 3.4E-9 + . Dbxref=InterPro:IPR013785;ID=match%2490_3_99;Name=G3DSA:3.20.20.70;Ontology_term=GO:0003824;Target=Merlin_146 3 99;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 84513 84588 . + . ID=match%2491_3_78;Name=SSF102114;Target=Merlin_146 3 78;date=23-02-2015;status=T +Merlin Pfam protein_match 84512 84629 1.4E-38 + . ID=match%2492_2_119;Name=PF13353;Target=Merlin_146 2 119;date=23-02-2015;signature_desc=4Fe-4S single cluster domain;status=T +Merlin PIRSF protein_match 84511 84639 1.5E-51 + . Dbxref=InterPro:IPR012837%22%2C%22KEGG:00351%2B1.97.1.-%22%2C%22KEGG:00361%2B1.97.1.-%22%2C%22KEGG:00363%2B1.97.1.-%22%2C%22KEGG:00625%2B1.97.1.-;ID=match%2493_1_129;Name=PIRSF000368;Ontology_term=GO:0005737%22%2C%22GO:0043365%22%2C%22GO:0051539%22%2C%22GO:0055114;Target=Merlin_146 1 129;date=23-02-2015;status=T +Merlin PANTHER protein_match 84512 84629 . + . ID=match%2494_2_119;Name=PTHR30352:SF2;Target=Merlin_146 2 119;date=23-02-2015;status=T +Merlin TIGRFAM protein_match 84512 84632 1.4E-47 + . Dbxref=InterPro:IPR012837%22%2C%22KEGG:00351%2B1.97.1.-%22%2C%22KEGG:00361%2B1.97.1.-%22%2C%22KEGG:00363%2B1.97.1.-%22%2C%22KEGG:00625%2B1.97.1.-;ID=match%2495_2_122;Name=TIGR02491;Ontology_term=GO:0005737%22%2C%22GO:0043365%22%2C%22GO:0051539%22%2C%22GO:0055114;Target=Merlin_146 2 122;date=23-02-2015;signature_desc=NrdG: anaerobic ribonucleoside-triphosphate reductase activating protein;status=T +Merlin PANTHER protein_match 84512 84629 . + . ID=match%2496_2_119;Name=PTHR30352;Target=Merlin_146 2 119;date=23-02-2015;status=T +Merlin feature polypeptide 85015 85111 . + . ID=Merlin_147;md5=a1daa097d1e8d434f9ba415043be17bb +Merlin Pfam protein_match 85017 85093 3.0E-6 + . Dbxref=InterPro:IPR002109;ID=match%24119_3_79;Name=PF00462;Ontology_term=GO:0009055%22%2C%22GO:0015035%22%2C%22GO:0045454;Target=Merlin_147 3 79;date=23-02-2015;signature_desc=Glutaredoxin;status=T +Merlin Gene3D protein_match 85017 85095 2.1E-14 + . Dbxref=InterPro:IPR012336;ID=match%24120_3_81;Name=G3DSA:3.40.30.10;Target=Merlin_147 3 81;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 85015 85096 . + . Dbxref=InterPro:IPR012336;ID=match%24121_1_82;Name=SSF52833;Target=Merlin_147 1 82;date=23-02-2015;status=T +Merlin feature polypeptide 86228 86335 . + . ID=Merlin_151;md5=afe29fa46e22254d21131d8820eb3e82 +Merlin PIRSF protein_match 86228 86335 9.7E-48 + . Dbxref=InterPro:IPR016409;ID=match%24374_1_108;Name=PIRSF004270;Target=Merlin_151 1 108;date=23-02-2015;status=T +Merlin feature polypeptide 88662 88783 . + . ID=Merlin_159;md5=9c7f710896ff23c345ff26114d38329c +Merlin Pfam protein_match 88708 88777 1.9E-30 + . Dbxref=InterPro:IPR022558;ID=match%24362_47_116;Name=PF10849;Target=Merlin_159 47 116;date=23-02-2015;signature_desc=Protein of unknown function %28DUF2654%29;status=T +Merlin feature polypeptide 89825 89871 . + . ID=Merlin_163;md5=2005b9626edfb081cb7dac63a8da5d9d +Merlin TMHMM protein_match 89828 89850 . + . ID=match%24368_4_26;Name=TMhelix;Target=Merlin_163 4 26;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 89965 90182 . + . ID=Merlin_164;md5=4d986c19d325540052f2899795da1f10 +Merlin SUPERFAMILY protein_match 89968 90127 . + . Dbxref=InterPro:IPR029052;ID=match%24242_4_163;Name=SSF56300;Target=Merlin_164 4 163;date=23-02-2015;status=T +Merlin feature polypeptide 91187 91746 . + . ID=Merlin_166;md5=aa145f82a3e793616638e1d4ab43649a +Merlin SUPERFAMILY protein_match 91533 91742 . + . Dbxref=InterPro:IPR027417;ID=match%24202_347_556;Name=SSF52540;Target=Merlin_166 347 556;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 91191 91390 . + . Dbxref=InterPro:IPR027417;ID=match%24202_5_204;Name=SSF52540;Target=Merlin_166 5 204;date=23-02-2015;status=T +Merlin Gene3D protein_match 91441 91531 3.2E-9 + . ID=match%24203_255_345;Name=G3DSA:1.10.287.510;Target=Merlin_166 255 345;date=23-02-2015;status=T +Merlin Gene3D protein_match 91569 91581 2.7E-21 + . Dbxref=InterPro:IPR027417;ID=match%24204_383_395;Name=G3DSA:3.40.50.300;Target=Merlin_166 383 395;date=23-02-2015;status=T +Merlin Gene3D protein_match 91191 91346 2.7E-21 + . Dbxref=InterPro:IPR027417;ID=match%24204_5_160;Name=G3DSA:3.40.50.300;Target=Merlin_166 5 160;date=23-02-2015;status=T +Merlin Gene3D protein_match 91616 91740 2.7E-21 + . Dbxref=InterPro:IPR027417;ID=match%24204_430_554;Name=G3DSA:3.40.50.300;Target=Merlin_166 430 554;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 91438 91520 . + . ID=match%24205_252_334;Name=SSF75712;Target=Merlin_166 252 334;date=23-02-2015;status=T +Merlin Pfam protein_match 91194 91579 2.9E-12 + . ID=match%24206_8_393;Name=PF13476;Target=Merlin_166 8 393;date=23-02-2015;signature_desc=AAA domain;status=T +Merlin feature polypeptide 93066 93192 . + . ID=Merlin_168;md5=e95f4806ae6b63742588eb3ac1e8d66a +Merlin Pfam protein_match 93075 93185 2.1E-38 + . Dbxref=InterPro:IPR019725;ID=match%24358_10_120;Name=PF10789;Target=Merlin_168 10 120;date=23-02-2015;signature_desc=Phage RNA polymerase binding,RpbA;status=T +Merlin feature polypeptide 93468 93695 . + . ID=Merlin_169;md5=54bfd611a0d27ae361fbf937acc89f21 +Merlin Pfam protein_match 93580 93695 3.2E-43 + . Dbxref=InterPro:IPR015200;ID=match%24179_113_228;Name=PF09116;Target=Merlin_169 113 228;date=23-02-2015;signature_desc=gp45 sliding clamp,C terminal;status=T +Merlin Pfam protein_match 93468 93576 6.1E-37 + . Dbxref=InterPro:IPR004190;ID=match%24180_1_109;Name=PF02916;Ontology_term=GO:0006260;Target=Merlin_169 1 109;date=23-02-2015;signature_desc=DNA polymerase processivity factor;status=T +Merlin SUPERFAMILY protein_match 93579 93695 . + . ID=match%24181_112_228;Name=SSF55979;Target=Merlin_169 112 228;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 93468 93577 . + . ID=match%24182_1_110;Name=SSF55979;Target=Merlin_169 1 110;date=23-02-2015;status=T +Merlin Gene3D protein_match 93468 93695 6.7E-94 + . ID=match%24183_1_228;Name=G3DSA:3.70.10.10;Target=Merlin_169 1 228;date=23-02-2015;status=T +Merlin feature polypeptide 94208 94528 . + . ID=Merlin_170;md5=c060fa03490c1d499a42ab951ed85e0e +Merlin SMART protein_match 94248 94368 1.6E-10 + . Dbxref=InterPro:IPR003593;ID=match%24313_41_161;Name=SM00382;Target=Merlin_170 41 161;date=23-02-2015;signature_desc=ATPases associated with a variety of cellular activities;status=T +Merlin PANTHER protein_match 94220 94396 . + . ID=match%24314_13_189;Name=PTHR11669;Target=Merlin_170 13 189;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 94239 94414 . + . Dbxref=InterPro:IPR027417;ID=match%24315_32_207;Name=SSF52540;Target=Merlin_170 32 207;date=23-02-2015;status=T +Merlin Gene3D protein_match 94219 94366 1.9E-27 + . Dbxref=InterPro:IPR027417;ID=match%24316_12_159;Name=G3DSA:3.40.50.300;Target=Merlin_170 12 159;date=23-02-2015;status=T +Merlin Pfam protein_match 94254 94362 3.5E-14 + . Dbxref=InterPro:IPR003959;ID=match%24317_47_155;Name=PF00004;Ontology_term=GO:0005524;Target=Merlin_170 47 155;date=23-02-2015;signature_desc=ATPase family associated with various cellular activities %28AAA%29;status=T +Merlin feature polypeptide 95730 95846 . + . ID=Merlin_172;md5=e19a2fd43f5e602d60cc647b7e898700 +Merlin Gene3D protein_match 95730 95845 3.0E-54 + . Dbxref=InterPro:IPR002702;ID=match%24155_1_116;Name=G3DSA:3.30.70.650;Ontology_term=GO:0003723;Target=Merlin_172 1 116;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 95730 95846 . + . Dbxref=InterPro:IPR002702;ID=match%24156_1_117;Name=SSF55064;Ontology_term=GO:0003723;Target=Merlin_172 1 117;date=23-02-2015;status=T +Merlin Pfam protein_match 95730 95846 1.1E-57 + . Dbxref=InterPro:IPR002702;ID=match%24157_1_117;Name=PF01818;Ontology_term=GO:0003723;Target=Merlin_172 1 117;date=23-02-2015;signature_desc=Bacteriophage translational regulator;status=T +Merlin feature polypeptide 96425 97320 . + . ID=Merlin_174;md5=a01cb6247aa3e6f55bfb246a7750715b +Merlin Gene3D protein_match 96888 96913 8.9E-25 + . ID=match%24187_464_489;Name=G3DSA:1.10.287.690;Target=Merlin_174 464 489;date=23-02-2015;status=T +Merlin Gene3D protein_match 96960 96990 8.9E-25 + . ID=match%24187_536_566;Name=G3DSA:1.10.287.690;Target=Merlin_174 536 566;date=23-02-2015;status=T +Merlin Pfam protein_match 96488 96712 2.0E-30 + . Dbxref=InterPro:IPR006133%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24188_64_288;Name=PF03104;Ontology_term=GO:0003887;Target=Merlin_174 64 288;date=23-02-2015;signature_desc=DNA polymerase family B,exonuclease domain;status=T +Merlin SUPERFAMILY protein_match 96795 97319 . + . ID=match%24189_371_895;Name=SSF56672;Target=Merlin_174 371 895;date=23-02-2015;status=T +Merlin Gene3D protein_match 96803 96843 4.7E-4 + . Dbxref=InterPro:IPR023211%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24190_379_419;Name=G3DSA:3.90.1600.10;Target=Merlin_174 379 419;date=23-02-2015;status=T +Merlin PRINTS protein_match 96826 96839 . + . Dbxref=InterPro:IPR006172%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24191_402_415;Name=PR00106;Ontology_term=GO:0000166%22%2C%22GO:0003676%22%2C%22GO:0003887%22%2C%22GO:0006139;Target=Merlin_174 402 415;date=23-02-2015;signature_desc=DNA-directed DNA-polymerase family B signature;status=T +Merlin PRINTS protein_match 96974 96986 . + . Dbxref=InterPro:IPR006172%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24191_550_562;Name=PR00106;Ontology_term=GO:0000166%22%2C%22GO:0003676%22%2C%22GO:0003887%22%2C%22GO:0006139;Target=Merlin_174 550 562;date=23-02-2015;signature_desc=DNA-directed DNA-polymerase family B signature;status=T +Merlin PRINTS protein_match 97035 97043 . + . Dbxref=InterPro:IPR006172%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24191_611_619;Name=PR00106;Ontology_term=GO:0000166%22%2C%22GO:0003676%22%2C%22GO:0003887%22%2C%22GO:0006139;Target=Merlin_174 611 619;date=23-02-2015;signature_desc=DNA-directed DNA-polymerase family B signature;status=T +Merlin PANTHER protein_match 96610 97183 . + . ID=match%24192_186_759;Name=PTHR10322;Target=Merlin_174 186 759;date=23-02-2015;status=T +Merlin Pfam protein_match 96789 97233 7.8E-51 + . Dbxref=InterPro:IPR006134%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24193_365_809;Name=PF00136;Ontology_term=GO:0000166%22%2C%22GO:0003677%22%2C%22GO:0003887%22%2C%22GO:0006260;Target=Merlin_174 365 809;date=23-02-2015;signature_desc=DNA polymerase family B;status=T +Merlin SMART protein_match 96526 97049 8.9E-86 + . Dbxref=InterPro:IPR006172%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24194_102_625;Name=SM00486;Ontology_term=GO:0000166%22%2C%22GO:0003676%22%2C%22GO:0003887%22%2C%22GO:0006139;Target=Merlin_174 102 625;date=23-02-2015;signature_desc=DNA polymerase type-B family;status=T +Merlin SUPERFAMILY protein_match 96428 96794 . + . Dbxref=InterPro:IPR012337;ID=match%24195_4_370;Name=SSF53098;Ontology_term=GO:0003676;Target=Merlin_174 4 370;date=23-02-2015;status=T +Merlin ProSitePatterns protein_match 97037 97045 . + . Dbxref=InterPro:IPR017964%22%2C%22KEGG:00230%2B2.7.7.7%22%2C%22KEGG:00240%2B2.7.7.7;ID=match%24196_613_621;Name=PS00116;Ontology_term=GO:0000166%22%2C%22GO:0003676%22%2C%22GO:0003887%22%2C%22GO:0006139;Target=Merlin_174 613 621;date=23-02-2015;signature_desc=DNA polymerase family B signature.;status=T +Merlin Gene3D protein_match 96425 96528 3.8E-60 + . ID=match%24197_1_104;Name=G3DSA:3.30.342.10;Target=Merlin_174 1 104;date=23-02-2015;status=T +Merlin Gene3D protein_match 96759 96801 3.8E-60 + . ID=match%24197_335_377;Name=G3DSA:3.30.342.10;Target=Merlin_174 335 377;date=23-02-2015;status=T +Merlin Gene3D protein_match 96529 96755 2.5E-48 + . Dbxref=InterPro:IPR012337;ID=match%24198_105_331;Name=G3DSA:3.30.420.10;Ontology_term=GO:0003676;Target=Merlin_174 105 331;date=23-02-2015;status=T +Merlin feature polypeptide 99454 99599 . + . ID=Merlin_176;md5=0ddb9a020b5f8731c6451000db73376a +Merlin ProSiteProfiles protein_match 99454 99470 . + . ID=match%24370_1_17;Name=PS51257;Target=Merlin_176 1 17;date=23-02-2015;signature_desc=Prokaryotic membrane lipoprotein lipid attachment site profile.;status=T +Merlin feature polypeptide 99927 99996 . + . ID=Merlin_177;md5=686fb92fe3a04f9848279e47ee27032a +Merlin TMHMM protein_match 99930 99948 . + . ID=match%24158_4_22;Name=TMhelix;Target=Merlin_177 4 22;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin Pfam protein_match 99938 99980 5.0E-18 + . Dbxref=InterPro:IPR016410;ID=match%24159_12_54;Name=PF14373;Target=Merlin_177 12 54;date=23-02-2015;signature_desc=Superinfection immunity protein;status=T +Merlin TMHMM protein_match 99955 99977 . + . ID=match%24160_29_51;Name=TMhelix;Target=Merlin_177 29 51;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 100136 100381 . + . ID=Merlin_178;md5=cafa8b1acb8fbada80a1a9cd2b5edfce +Merlin Pfam protein_match 100155 100371 7.0E-22 + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2443_20_236;Name=PF00303;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_178 20 236;date=23-02-2015;signature_desc=Thymidylate synthase;status=T +Merlin PIRSF protein_match 100136 100381 1.8E-176 + . Dbxref=InterPro:IPR014619%22%2C%22KEGG:00240%2B2.1.2.8%22%2C%22KEGG:00670%2B2.1.2.8;ID=match%2444_1_246;Name=PIRSF036750;Ontology_term=GO:0047153;Target=Merlin_178 1 246;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 100143 100367 . + . Dbxref=InterPro:IPR023451%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2445_8_232;Name=SSF55831;Target=Merlin_178 8 232;date=23-02-2015;status=T +Merlin Gene3D protein_match 100148 100379 9.1E-46 + . Dbxref=InterPro:IPR023451%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2446_13_244;Name=G3DSA:3.30.572.10;Target=Merlin_178 13 244;date=23-02-2015;status=T +Merlin feature polypeptide 101700 102025 . + . ID=Merlin_180;md5=8b6aa68fa43e6a42ce1399cd364572a0 +Merlin Gene3D protein_match 101700 101848 8.6E-71 + . ID=match%24161_1_149;Name=G3DSA:3.40.50.2000;Target=Merlin_180 1 149;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 101700 102025 . + . ID=match%24162_1_326;Name=SSF53756;Target=Merlin_180 1 326;date=23-02-2015;status=T +Merlin Gene3D protein_match 101849 102006 1.1E-87 + . ID=match%24163_150_307;Name=G3DSA:3.40.50.2000;Target=Merlin_180 150 307;date=23-02-2015;status=T +Merlin feature polypeptide 102884 103278 . + . ID=Merlin_181;md5=9e33d74116cd5bea90dea67441bec75e +Merlin SUPERFAMILY protein_match 102913 103150 . + . Dbxref=InterPro:IPR027417;ID=match%2455_30_267;Name=SSF52540;Target=Merlin_181 30 267;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 103088 103150 . + . Dbxref=InterPro:IPR020587;ID=match%2456_205_267;Name=PS50163;Ontology_term=GO:0003677%22%2C%22GO:0005524%22%2C%22GO:0006259%22%2C%22GO:0008094;Target=Merlin_181 205 267;date=23-02-2015;signature_desc=RecA family profile 2.;status=T +Merlin ProSiteProfiles protein_match 102911 103079 . + . Dbxref=InterPro:IPR020588;ID=match%2457_28_196;Name=PS50162;Ontology_term=GO:0003677%22%2C%22GO:0005524%22%2C%22GO:0006259%22%2C%22GO:0008094;Target=Merlin_181 28 196;date=23-02-2015;signature_desc=RecA family profile 1.;status=T +Merlin Pfam protein_match 102911 103183 3.3E-14 + . Dbxref=InterPro:IPR013765;ID=match%2458_28_300;Name=PF00154;Ontology_term=GO:0003697%22%2C%22GO:0005524%22%2C%22GO:0006281%22%2C%22GO:0009432;Target=Merlin_181 28 300;date=23-02-2015;signature_desc=recA bacterial DNA recombination protein;status=T +Merlin Gene3D protein_match 102912 103149 7.1E-24 + . Dbxref=InterPro:IPR027417;ID=match%2459_29_266;Name=G3DSA:3.40.50.300;Target=Merlin_181 29 266;date=23-02-2015;status=T +Merlin feature polypeptide 104071 104186 . + . ID=Merlin_182;md5=0fdc17100c73d770256622356921c325 +Merlin Pfam protein_match 104115 104171 6.2E-24 + . Dbxref=InterPro:IPR021049;ID=match%24342_45_101;Name=PF11113;Target=Merlin_182 45 101;date=23-02-2015;signature_desc=Head assembly gene product;status=T +Merlin feature polypeptide 104499 104953 . + . ID=Merlin_183;md5=cc9f55ac9c0744d7adc1fe519239284b +Merlin ProSiteProfiles protein_match 104641 104911 . + . Dbxref=InterPro:IPR007694;ID=match%246_143_413;Name=PS51199;Ontology_term=GO:0003678%22%2C%22GO:0005524%22%2C%22GO:0006260;Target=Merlin_183 143 413;date=23-02-2015;signature_desc=Superfamily 4 helicase domain profile.;status=T +Merlin Gene3D protein_match 104641 104889 3.1E-28 + . Dbxref=InterPro:IPR027417;ID=match%247_143_391;Name=G3DSA:3.40.50.300;Target=Merlin_183 143 391;date=23-02-2015;status=T +Merlin Pfam protein_match 104656 104874 1.4E-15 + . Dbxref=InterPro:IPR007694;ID=match%248_158_376;Name=PF03796;Ontology_term=GO:0003678%22%2C%22GO:0005524%22%2C%22GO:0006260;Target=Merlin_183 158 376;date=23-02-2015;signature_desc=DnaB-like helicase C terminal domain;status=T +Merlin SUPERFAMILY protein_match 104759 104894 . + . Dbxref=InterPro:IPR027417;ID=match%249_261_396;Name=SSF52540;Target=Merlin_183 261 396;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 104647 104716 . + . Dbxref=InterPro:IPR027417;ID=match%249_149_218;Name=SSF52540;Target=Merlin_183 149 218;date=23-02-2015;status=T +Merlin feature polypeptide 105927 106019 . + . ID=Merlin_184;md5=ecf1cbcc01065a95b4231f44b64d6c11 +Merlin ProSiteProfiles protein_match 105927 105941 . + . ID=match%24288_1_15;Name=PS51257;Target=Merlin_184 1 15;date=23-02-2015;signature_desc=Prokaryotic membrane lipoprotein lipid attachment site profile.;status=T +Merlin feature polypeptide 106698 106851 . + . ID=Merlin_187;md5=3328d8262077d810b478714c77103968 +Merlin Pfam protein_match 106799 106842 1.3E-4 + . Dbxref=InterPro:IPR010762;ID=match%24377_102_145;Name=PF07068;Target=Merlin_187 102 145;date=23-02-2015;signature_desc=Major capsid protein Gp23;status=T +Merlin feature polypeptide 107199 107539 . + . ID=Merlin_188;md5=31c5c16a43fc25f2302a607909d5dfb7 +Merlin Pfam protein_match 107368 107414 4.3E-5 + . Dbxref=InterPro:IPR013264%22%2C%22KEGG:00520%2B2.7.7.-;ID=match%24301_170_216;Name=PF08275;Target=Merlin_188 170 216;date=23-02-2015;signature_desc=DNA primase catalytic core,N-terminal domain;status=T +Merlin Gene3D protein_match 107368 107404 3.5E-4 + . Dbxref=InterPro:IPR013264%22%2C%22KEGG:00520%2B2.7.7.-;ID=match%24302_170_206;Name=G3DSA:3.90.980.10;Target=Merlin_188 170 206;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 107327 107449 . + . ID=match%24303_129_251;Name=SSF56731;Target=Merlin_188 129 251;date=23-02-2015;status=T +Merlin feature polypeptide 108355 108419 . - . ID=Merlin_189;md5=a14aa86fdca4a9ba4ba4c70f4ab47bc5 +Merlin TMHMM protein_match 108397 108419 . - . ID=match%24312_21_43;Name=TMhelix;Target=Merlin_189 21 43;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 108745 108917 . + . ID=Merlin_191;md5=2a2c17bbbc7329a79d7f64ac4429f9bd +Merlin Gene3D protein_match 108815 108900 7.7E-4 + . Dbxref=InterPro:IPR023292;ID=match%24199_71_156;Name=G3DSA:1.10.3420.10;Target=Merlin_191 71 156;date=23-02-2015;status=T +Merlin Pfam protein_match 108890 108917 5.0E-8 + . Dbxref=InterPro:IPR014871;ID=match%24200_146_173;Name=PF08761;Target=Merlin_191 146 173;date=23-02-2015;signature_desc=dUTPase;status=T +Merlin Pfam protein_match 108777 108889 2.0E-21 + . Dbxref=InterPro:IPR014871;ID=match%24200_33_145;Name=PF08761;Target=Merlin_191 33 145;date=23-02-2015;signature_desc=dUTPase;status=T +Merlin SUPERFAMILY protein_match 108773 108917 . + . ID=match%24201_29_173;Name=SSF101386;Target=Merlin_191 29 173;date=23-02-2015;status=T +Merlin feature polypeptide 113786 114228 . + . ID=Merlin_205;md5=d761eea39913dd9e566ca00c7f39a61d +Merlin SUPERFAMILY protein_match 113793 114227 . + . Dbxref=InterPro:IPR027417;ID=match%24305_8_442;Name=SSF52540;Target=Merlin_205 8 442;date=23-02-2015;status=T +Merlin Gene3D protein_match 113791 113973 1.2E-26 + . Dbxref=InterPro:IPR027417;ID=match%24306_6_188;Name=G3DSA:3.40.50.300;Target=Merlin_205 6 188;date=23-02-2015;status=T +Merlin Gene3D protein_match 114170 114226 8.1E-6 + . Dbxref=InterPro:IPR027417;ID=match%24307_385_441;Name=G3DSA:3.40.50.300;Target=Merlin_205 385 441;date=23-02-2015;status=T +Merlin Gene3D protein_match 113974 114049 8.1E-6 + . Dbxref=InterPro:IPR027417;ID=match%24307_189_264;Name=G3DSA:3.40.50.300;Target=Merlin_205 189 264;date=23-02-2015;status=T +Merlin Pfam protein_match 113793 113980 1.2E-23 + . ID=match%24308_8_195;Name=PF13604;Target=Merlin_205 8 195;date=23-02-2015;signature_desc=AAA domain;status=T +Merlin SMART protein_match 113811 113959 1.5E-4 + . Dbxref=InterPro:IPR003593;ID=match%24309_26_174;Name=SM00382;Target=Merlin_205 26 174;date=23-02-2015;signature_desc=ATPases associated with a variety of cellular activities;status=T +Merlin feature polypeptide 115345 115574 . + . ID=Merlin_207;md5=3e69ed7195dc78989fce44b6f539f18f +Merlin Gene3D protein_match 115351 115393 6.3E-4 + . Dbxref=InterPro:IPR012337;ID=match%24213_7_49;Name=G3DSA:3.30.420.10;Ontology_term=GO:0003676;Target=Merlin_207 7 49;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 115350 115547 . + . Dbxref=InterPro:IPR012337;ID=match%24214_6_203;Name=SSF53098;Ontology_term=GO:0003676;Target=Merlin_207 6 203;date=23-02-2015;status=T +Merlin feature polypeptide 116039 116175 . + . ID=Merlin_208;md5=31c5fbdbf8eacf149749e9472c1f98ec +Merlin ProSitePatterns protein_match 116043 116060 . + . Dbxref=InterPro:IPR005825;ID=match%24333_5_22;Name=PS01108;Ontology_term=GO:0003735%22%2C%22GO:0005622%22%2C%22GO:0005840%22%2C%22GO:0006412;Target=Merlin_208 5 22;date=23-02-2015;signature_desc=Ribosomal protein L24 signature.;status=T +Merlin feature polypeptide 117176 117239 . + . ID=Merlin_211;md5=f22ab5ee011f7031e4ed0ed024e9b42e +Merlin TIGRFAM protein_match 117176 117221 2.0E-11 + . Dbxref=InterPro:IPR013429;ID=match%24336_1_46;Name=TIGR02605;Target=Merlin_211 1 46;date=23-02-2015;signature_desc=CxxC_CxxC_SSSS: putative regulatory protein,FmdB family;status=T +Merlin SMART protein_match 117176 117219 1.1E-6 + . Dbxref=InterPro:IPR013429;ID=match%24337_1_44;Name=SM00834;Target=Merlin_211 1 44;date=23-02-2015;signature_desc=Putative regulatory protein;status=T +Merlin Pfam protein_match 117176 117218 1.5E-10 + . Dbxref=InterPro:IPR013429;ID=match%24338_1_43;Name=PF09723;Target=Merlin_211 1 43;date=23-02-2015;signature_desc=Zinc ribbon domain;status=T +Merlin feature polypeptide 117935 118017 . + . ID=Merlin_214;md5=3f00b7b379e5ce5d44c70b9d1cd8a9ed +Merlin ProSiteProfiles protein_match 117935 117951 . + . ID=match%2426_1_17;Name=PS51257;Target=Merlin_214 1 17;date=23-02-2015;signature_desc=Prokaryotic membrane lipoprotein lipid attachment site profile.;status=T +Merlin feature polypeptide 118848 119460 . + . ID=Merlin_217;md5=afd95c8d27e9670604ae9e6d3e49e753 +Merlin ProSitePatterns protein_match 119268 119276 . + . Dbxref=InterPro:IPR018522;ID=match%2464_421_429;Name=PS00177;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 421 429;date=23-02-2015;signature_desc=DNA topoisomerase II signature.;status=T +Merlin SUPERFAMILY protein_match 119085 119216 . + . Dbxref=InterPro:IPR020568;ID=match%2465_238_369;Name=SSF54211;Target=Merlin_217 238 369;date=23-02-2015;status=T +Merlin PANTHER protein_match 118853 119460 . + . ID=match%2466_6_613;Name=PTHR10169;Target=Merlin_217 6 613;date=23-02-2015;status=T +Merlin Pfam protein_match 119106 119238 2.2E-27 + . Dbxref=InterPro:IPR013506;ID=match%2467_259_391;Name=PF00204;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 259 391;date=23-02-2015;signature_desc=DNA gyrase B;status=T +Merlin PRINTS protein_match 119407 119423 . + . Dbxref=InterPro:IPR001241;ID=match%2468_560_576;Name=PR00418;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 560 576;date=23-02-2015;signature_desc=DNA topoisomerase II family signature;status=T +Merlin PRINTS protein_match 119330 119346 . + . Dbxref=InterPro:IPR001241;ID=match%2468_483_499;Name=PR00418;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 483 499;date=23-02-2015;signature_desc=DNA topoisomerase II family signature;status=T +Merlin PRINTS protein_match 119266 119280 . + . Dbxref=InterPro:IPR001241;ID=match%2468_419_433;Name=PR00418;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 419 433;date=23-02-2015;signature_desc=DNA topoisomerase II family signature;status=T +Merlin PRINTS protein_match 118931 118944 . + . Dbxref=InterPro:IPR001241;ID=match%2468_84_97;Name=PR00418;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 84 97;date=23-02-2015;signature_desc=DNA topoisomerase II family signature;status=T +Merlin PRINTS protein_match 119120 119133 . + . Dbxref=InterPro:IPR001241;ID=match%2468_273_286;Name=PR00418;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 273 286;date=23-02-2015;signature_desc=DNA topoisomerase II family signature;status=T +Merlin PRINTS protein_match 118894 118909 . + . Dbxref=InterPro:IPR001241;ID=match%2468_47_62;Name=PR00418;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 47 62;date=23-02-2015;signature_desc=DNA topoisomerase II family signature;status=T +Merlin PRINTS protein_match 118977 118991 . + . Dbxref=InterPro:IPR001241;ID=match%2468_130_144;Name=PR00418;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 130 144;date=23-02-2015;signature_desc=DNA topoisomerase II family signature;status=T +Merlin Gene3D protein_match 119235 119402 8.7E-40 + . Dbxref=InterPro:IPR013759;ID=match%2469_388_555;Name=G3DSA:3.40.50.670;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 388 555;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 119257 119460 . + . Dbxref=InterPro:IPR013760;ID=match%2470_410_613;Name=SSF56719;Ontology_term=GO:0003918%22%2C%22GO:0005524;Target=Merlin_217 410 613;date=23-02-2015;status=T +Merlin Pfam protein_match 119265 119372 6.6E-6 + . Dbxref=InterPro:IPR006171;ID=match%2471_418_525;Name=PF01751;Target=Merlin_217 418 525;date=23-02-2015;signature_desc=Toprim domain;status=T +Merlin SMART protein_match 118894 119460 1.6E-41 + . Dbxref=InterPro:IPR001241;ID=match%2472_47_613;Name=SM00433;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_217 47 613;date=23-02-2015;signature_desc=TopoisomeraseII;status=T +Merlin Gene3D protein_match 119103 119222 7.5E-25 + . Dbxref=InterPro:IPR014721;ID=match%2473_256_375;Name=G3DSA:3.30.230.10;Target=Merlin_217 256 375;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 118850 119082 . + . Dbxref=InterPro:IPR003594;ID=match%2474_3_235;Name=SSF55874;Target=Merlin_217 3 235;date=23-02-2015;status=T +Merlin Pfam protein_match 118895 119043 3.4E-8 + . Dbxref=InterPro:IPR003594;ID=match%2475_48_196;Name=PF02518;Target=Merlin_217 48 196;date=23-02-2015;signature_desc=Histidine kinase-,DNA gyrase B-,and HSP90-like ATPase;status=T +Merlin Gene3D protein_match 118856 119095 1.4E-48 + . Dbxref=InterPro:IPR003594;ID=match%2476_9_248;Name=G3DSA:3.30.565.10;Target=Merlin_217 9 248;date=23-02-2015;status=T +Merlin feature polypeptide 121410 122134 . + . ID=Merlin_221;md5=8cbbd112d0bb3389b429a30f2b2e1325 +Merlin SUPERFAMILY protein_match 121448 121611 . + . Dbxref=InterPro:IPR003594;ID=match%24256_39_202;Name=SSF55874;Target=Merlin_221 39 202;date=23-02-2015;status=T +Merlin Gene3D protein_match 121436 121600 4.5E-12 + . Dbxref=InterPro:IPR003594;ID=match%24257_27_191;Name=G3DSA:3.30.565.10;Target=Merlin_221 27 191;date=23-02-2015;status=T +Merlin feature polypeptide 123597 123894 . + . ID=Merlin_222;md5=fc75366616aa6b1c8d89ad7e2cbf0acb +Merlin Gene3D protein_match 123603 123640 5.6E-4 + . Dbxref=InterPro:IPR009057;ID=match%24373_7_44;Name=G3DSA:1.10.10.60;Ontology_term=GO:0003677;Target=Merlin_222 7 44;date=23-02-2015;status=T +Merlin feature polypeptide 126095 126240 . + . ID=Merlin_228;md5=e2674fc0b0fb87de3f8bc7b0633664e4 +Merlin Pfam protein_match 126098 126240 1.2E-51 + . Dbxref=InterPro:IPR009514;ID=match%24375_4_146;Name=PF06591;Target=Merlin_228 4 146;date=23-02-2015;signature_desc=T4-like phage nuclear disruption protein %28Ndd%29;status=T +Merlin feature polypeptide 126984 127428 . + . ID=Merlin_230;md5=2249bccef173ac4c8331f7c07ada51aa +Merlin PANTHER protein_match 126990 127428 . + . ID=match%2448_7_445;Name=PTHR10169;Target=Merlin_230 7 445;date=23-02-2015;status=T +Merlin Gene3D protein_match 127187 127286 7.1E-7 + . Dbxref=InterPro:IPR024946;ID=match%2449_204_303;Name=G3DSA:3.30.1360.40;Target=Merlin_230 204 303;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 126988 127426 . + . Dbxref=InterPro:IPR013760;ID=match%2450_5_443;Name=SSF56719;Ontology_term=GO:0003918%22%2C%22GO:0005524;Target=Merlin_230 5 443;date=23-02-2015;status=T +Merlin Gene3D protein_match 127305 127427 1.2E-21 + . Dbxref=InterPro:IPR013757;ID=match%2451_322_444;Name=G3DSA:1.10.268.10;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_230 322 444;date=23-02-2015;status=T +Merlin Pfam protein_match 127010 127423 5.2E-97 + . Dbxref=InterPro:IPR002205;ID=match%2452_27_440;Name=PF00521;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_230 27 440;date=23-02-2015;signature_desc=DNA gyrase/topoisomerase IV,subunit A;status=T +Merlin SMART protein_match 126989 127421 1.3E-89 + . Dbxref=InterPro:IPR002205;ID=match%2453_6_438;Name=SM00434;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006265;Target=Merlin_230 6 438;date=23-02-2015;signature_desc=DNA Topoisomerase IV;status=T +Merlin Gene3D protein_match 127012 127175 5.2E-44 + . Dbxref=InterPro:IPR013758;ID=match%2454_29_192;Name=G3DSA:3.90.199.10;Ontology_term=GO:0003677%22%2C%22GO:0003918%22%2C%22GO:0005524%22%2C%22GO:0006259%22%2C%22GO:0006265;Target=Merlin_230 29 192;date=23-02-2015;status=T +Merlin feature polypeptide 128312 128357 . + . ID=Merlin_231;md5=8fb99c131c93689b3cce08a40c0ba992 +Merlin TMHMM protein_match 128337 128356 . + . ID=match%24331_26_45;Name=TMhelix;Target=Merlin_231 26 45;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin TMHMM protein_match 128316 128333 . + . ID=match%24332_5_22;Name=TMhelix;Target=Merlin_231 5 22;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin feature polypeptide 129580 129793 . + . ID=Merlin_235;md5=e691bfe31136d0b7c42429a040090fff +Merlin SUPERFAMILY protein_match 129684 129789 . + . Dbxref=InterPro:IPR015241;ID=match%2484_105_210;Name=SSF69652;Target=Merlin_235 105 210;date=23-02-2015;status=T +Merlin Pfam protein_match 129581 129675 8.6E-43 + . Dbxref=InterPro:IPR015198;ID=match%2485_2_96;Name=PF09114;Target=Merlin_235 2 96;date=23-02-2015;signature_desc=Transcription factor MotA,activation domain;status=T +Merlin Pfam protein_match 129686 129789 3.6E-30 + . Dbxref=InterPro:IPR015241;ID=match%2486_107_210;Name=PF09158;Target=Merlin_235 107 210;date=23-02-2015;signature_desc=Bacteriophage T4 MotA,C-terminal;status=T +Merlin Gene3D protein_match 129684 129792 1.9E-40 + . Dbxref=InterPro:IPR015241;ID=match%2487_105_213;Name=G3DSA:3.90.1150.20;Target=Merlin_235 105 213;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 129581 129676 . + . ID=match%2488_2_97;Name=SSF46785;Target=Merlin_235 2 97;date=23-02-2015;status=T +Merlin Gene3D protein_match 129581 129676 3.6E-43 + . Dbxref=InterPro:IPR011991;ID=match%2489_2_97;Name=G3DSA:1.10.10.10;Target=Merlin_235 2 97;date=23-02-2015;status=T +Merlin feature polypeptide 132198 132325 . + . ID=Merlin_242;md5=02904d6232dfbe3af5f3c72b0a040112 +Merlin Pfam protein_match 132286 132317 5.6E-5 + . Dbxref=InterPro:IPR010762;ID=match%2447_89_120;Name=PF07068;Target=Merlin_242 89 120;date=23-02-2015;signature_desc=Major capsid protein Gp23;status=T +Merlin feature polypeptide 132909 132998 . + . ID=Merlin_244;md5=5a288df81d0b0f5e2cd713a2dea64b36 +Merlin SUPERFAMILY protein_match 132911 132998 . + . Dbxref=InterPro:IPR015100;ID=match%24135_3_90;Name=SSF69070;Target=Merlin_244 3 90;date=23-02-2015;status=T +Merlin Gene3D protein_match 132909 132998 9.3E-35 + . Dbxref=InterPro:IPR015100;ID=match%24136_1_90;Name=G3DSA:1.10.1810.10;Target=Merlin_244 1 90;date=23-02-2015;status=T +Merlin Pfam protein_match 132911 132998 8.5E-31 + . Dbxref=InterPro:IPR015100;ID=match%24137_3_90;Name=PF09010;Target=Merlin_244 3 90;date=23-02-2015;signature_desc=Anti-Sigma Factor A;status=T +Merlin feature polypeptide 133618 133835 . - . ID=Merlin_245;md5=6d30b85ab93d050fbc70c933fd4cf677 +Merlin TMHMM protein_match 133816 133835 . - . ID=match%24215_29_48;Name=TMhelix;Target=Merlin_245 29 48;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin Pfam protein_match 133627 133835 2.9E-91 - . Dbxref=InterPro:IPR020982;ID=match%24216_10_218;Name=PF11031;Target=Merlin_245 10 218;date=23-02-2015;signature_desc=Bacteriophage T holin;status=T +Merlin feature polypeptide 134396 134663 . - . ID=Merlin_246;md5=510cb4bc2dec8f7bc18404c281edcd37 +Merlin Pfam protein_match 134396 134663 2.8E-88 - . Dbxref=InterPro:IPR007932;ID=match%24324_1_268;Name=PF05268;Target=Merlin_246 1 268;date=23-02-2015;signature_desc=Phage tail fibre adhesin Gp38;status=T +Merlin feature polypeptide 136278 137068 . - . ID=Merlin_247;md5=d90331409d9bd7731f40e5bf863de335 +Merlin Pfam protein_match 137011 137068 9.8E-9 - . ID=match%24351_681_738;Name=PF13884;Target=Merlin_247 681 738;date=23-02-2015;signature_desc=Chaperone of endosialidase;status=T +Merlin feature polypeptide 137516 137734 . - . ID=Merlin_248;md5=5b4b6bb25c73fffc70347d02439a9103 +Merlin Pfam protein_match 137517 137734 1.7E-63 - . Dbxref=InterPro:IPR005601;ID=match%24344_1_218;Name=PF03903;Target=Merlin_248 1 218;date=23-02-2015;signature_desc=Phage T4 tail fibre;status=T +Merlin feature polypeptide 142826 143133 . + . ID=Merlin_251;md5=2be9adbe3bfd1ea615410731bdfd3888 +Merlin Gene3D protein_match 142844 143023 1.3E-28 + . Dbxref=InterPro:IPR029060;ID=match%24318_19_198;Name=G3DSA:3.40.50.1010;Target=Merlin_251 19 198;date=23-02-2015;status=T +Merlin Pfam protein_match 143010 143133 7.8E-48 + . Dbxref=InterPro:IPR020045;ID=match%24319_185_308;Name=PF09293;Ontology_term=GO:0003677%22%2C%22GO:0003824;Target=Merlin_251 185 308;date=23-02-2015;signature_desc=T4 RNase H,C terminal;status=T +Merlin Gene3D protein_match 143024 143087 1.7E-29 + . ID=match%24320_199_262;Name=G3DSA:1.10.150.20;Target=Merlin_251 199 262;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 143010 143133 . + . Dbxref=InterPro:IPR020045;ID=match%24321_185_308;Name=SSF47807;Ontology_term=GO:0003677%22%2C%22GO:0003824;Target=Merlin_251 185 308;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 142840 143009 . + . Dbxref=InterPro:IPR029060;ID=match%24322_15_184;Name=SSF88723;Target=Merlin_251 15 184;date=23-02-2015;status=T +Merlin Pfam protein_match 142885 143003 5.3E-12 + . Dbxref=InterPro:IPR020046;ID=match%24323_60_178;Name=PF02739;Ontology_term=GO:0003677;Target=Merlin_251 60 178;date=23-02-2015;signature_desc=5%27-3%27 exonuclease,N-terminal resolvase-like domain;status=T +Merlin feature polypeptide 143742 143813 . + . ID=Merlin_252;md5=9783f62923fc9780c06b063b118511fb +Merlin Pfam protein_match 143746 143813 2.6E-28 + . Dbxref=InterPro:IPR020313;ID=match%24165_5_72;Name=PF11126;Target=Merlin_252 5 72;date=23-02-2015;signature_desc=Transcriptional regulator DsbA;status=T +Merlin feature polypeptide 144300 144409 . + . ID=Merlin_254;md5=4d49e8c780e1609e8457a347994d173d +Merlin SUPERFAMILY protein_match 144301 144409 . + . Dbxref=InterPro:IPR023197;ID=match%24345_2_110;Name=SSF48493;Target=Merlin_254 2 110;date=23-02-2015;status=T +Merlin Gene3D protein_match 144301 144409 5.5E-44 + . Dbxref=InterPro:IPR015086;ID=match%24346_2_110;Name=G3DSA:1.10.220.50;Target=Merlin_254 2 110;date=23-02-2015;status=T +Merlin Pfam protein_match 144305 144408 6.8E-40 + . Dbxref=InterPro:IPR015086;ID=match%24347_6_109;Name=PF08994;Target=Merlin_254 6 109;date=23-02-2015;signature_desc=T4 gene Gp59 loader of gp41 DNA helicase C-term;status=T +Merlin feature polypeptide 144963 145265 . + . ID=Merlin_255;md5=32e668b41f40418382e56f65081b552c +Merlin Pfam protein_match 144992 145081 1.0E-43 + . Dbxref=InterPro:IPR012339;ID=match%24239_30_119;Name=PF08804;Ontology_term=GO:0003697;Target=Merlin_255 30 119;date=23-02-2015;signature_desc=gp32 DNA binding protein like;status=T +Merlin SUPERFAMILY protein_match 144988 145204 . + . Dbxref=InterPro:IPR012340;ID=match%24240_26_242;Name=SSF50249;Target=Merlin_255 26 242;date=23-02-2015;status=T +Merlin Gene3D protein_match 144987 145216 2.8E-107 + . Dbxref=InterPro:IPR012339;ID=match%24241_25_254;Name=G3DSA:3.90.198.10;Ontology_term=GO:0003697;Target=Merlin_255 25 254;date=23-02-2015;status=T +Merlin feature polypeptide 147053 147173 . + . ID=Merlin_260;md5=164b48a67c5b739914650b8f4e21a811 +Merlin Pfam protein_match 147078 147171 3.5E-16 + . Dbxref=InterPro:IPR004885;ID=match%24141_26_119;Name=PF03197;Target=Merlin_260 26 119;date=23-02-2015;signature_desc=Bacteriophage FRD2 protein;status=T +Merlin feature polypeptide 148635 148831 . + . ID=Merlin_265;md5=87601ad83b4a0ca0ff7773aa68195b86 +Merlin SUPERFAMILY protein_match 148635 148794 . + . Dbxref=InterPro:IPR024072;ID=match%24273_1_160;Name=SSF53597;Target=Merlin_265 1 160;date=23-02-2015;status=T +Merlin Pfam protein_match 148655 148796 8.4E-19 + . Dbxref=InterPro:IPR001796%22%2C%22KEGG:00670%2B1.5.1.3%22%2C%22KEGG:00790%2B1.5.1.3%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-6614%22%2C%22UniPathway:UPA00077;ID=match%24274_21_162;Name=PF00186;Ontology_term=GO:0004146%22%2C%22GO:0006545%22%2C%22GO:0009165%22%2C%22GO:0055114;Target=Merlin_265 21 162;date=23-02-2015;signature_desc=Dihydrofolate reductase;status=T +Merlin Gene3D protein_match 148635 148816 6.1E-27 + . Dbxref=InterPro:IPR024072;ID=match%24275_1_182;Name=G3DSA:3.40.430.10;Target=Merlin_265 1 182;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 148635 148831 . + . Dbxref=InterPro:IPR001796%22%2C%22KEGG:00670%2B1.5.1.3%22%2C%22KEGG:00790%2B1.5.1.3%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-6614%22%2C%22UniPathway:UPA00077;ID=match%24276_1_197;Name=PS51330;Ontology_term=GO:0004146%22%2C%22GO:0006545%22%2C%22GO:0009165%22%2C%22GO:0055114;Target=Merlin_265 1 197;date=23-02-2015;signature_desc=Dihydrofolate reductase %28DHFR%29 domain profile.;status=T +Merlin ProSitePatterns protein_match 148655 148678 . + . Dbxref=InterPro:IPR017925%22%2C%22KEGG:00670%2B1.5.1.3%22%2C%22KEGG:00790%2B1.5.1.3%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-6614%22%2C%22UniPathway:UPA00077;ID=match%24277_21_44;Name=PS00075;Ontology_term=GO:0004146%22%2C%22GO:0055114;Target=Merlin_265 21 44;date=23-02-2015;signature_desc=Dihydrofolate reductase %28DHFR%29 domain signature.;status=T +Merlin feature polypeptide 149876 150161 . + . ID=Merlin_268;md5=343b568aed0711488c414db4698d4f27 +Merlin TIGRFAM protein_match 149877 150161 6.5E-91 + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2432_2_286;Name=TIGR03284;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 2 286;date=23-02-2015;signature_desc=thym_sym: thymidylate synthase;status=T +Merlin PANTHER protein_match 149876 150161 . + . ID=match%2433_1_286;Name=PTHR11549:SF9;Target=Merlin_268 1 286;date=23-02-2015;status=T +Merlin Hamap protein_match 149876 150161 . + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2434_1_286;Name=MF_00008;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 1 286;date=23-02-2015;signature_desc=Thymidylate synthase %5BthyA%5D.;status=T +Merlin ProSitePatterns protein_match 150011 150039 . + . Dbxref=InterPro:IPR020940%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2435_136_164;Name=PS00091;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 136 164;date=23-02-2015;signature_desc=Thymidylate synthase active site.;status=T +Merlin Pfam protein_match 149877 150161 2.1E-92 + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2436_2_286;Name=PF00303;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 2 286;date=23-02-2015;signature_desc=Thymidylate synthase;status=T +Merlin PANTHER protein_match 149876 150161 . + . ID=match%2437_1_286;Name=PTHR11549;Target=Merlin_268 1 286;date=23-02-2015;status=T +Merlin PRINTS protein_match 150026 150041 . + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2438_151_166;Name=PR00108;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 151 166;date=23-02-2015;signature_desc=Thymidylate synthase family signature;status=T +Merlin PRINTS protein_match 150000 150019 . + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2438_125_144;Name=PR00108;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 125 144;date=23-02-2015;signature_desc=Thymidylate synthase family signature;status=T +Merlin PRINTS protein_match 150082 150099 . + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2438_207_224;Name=PR00108;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 207 224;date=23-02-2015;signature_desc=Thymidylate synthase family signature;status=T +Merlin PRINTS protein_match 149917 149938 . + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2438_42_63;Name=PR00108;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 42 63;date=23-02-2015;signature_desc=Thymidylate synthase family signature;status=T +Merlin PRINTS protein_match 150044 150070 . + . Dbxref=InterPro:IPR000398%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2438_169_195;Name=PR00108;Ontology_term=GO:0004799%22%2C%22GO:0006231;Target=Merlin_268 169 195;date=23-02-2015;signature_desc=Thymidylate synthase family signature;status=T +Merlin Gene3D protein_match 149876 150161 2.3E-108 + . Dbxref=InterPro:IPR023451%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2439_1_286;Name=G3DSA:3.30.572.10;Target=Merlin_268 1 286;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 149876 150161 . + . Dbxref=InterPro:IPR023451%22%2C%22KEGG:00240%2B2.1.1.45%22%2C%22KEGG:00670%2B2.1.1.45%22%2C%22MetaCyc:PWY-3841%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7187%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7199%22%2C%22MetaCyc:PWY-7210%22%2C%22UniPathway:UPA00575;ID=match%2440_1_286;Name=SSF55831;Target=Merlin_268 1 286;date=23-02-2015;status=T +Merlin feature polypeptide 150921 151021 . + . ID=Merlin_270;md5=80f804f1d4d05678ff6d0fb20ac0a23a +Merlin Pfam protein_match 150924 151004 3.3E-21 + . Dbxref=InterPro:IPR025475;ID=match%24363_4_84;Name=PF14216;Target=Merlin_270 4 84;date=23-02-2015;signature_desc=Domain of unknown function %28DUF4326%29;status=T +Merlin feature polypeptide 151217 151967 . + . ID=Merlin_271;md5=555ccb974aff7222667adb8a51d5a65d +Merlin SUPERFAMILY protein_match 151223 151431 . + . Dbxref=InterPro:IPR008926%22%2C%22KEGG:00230%2B1.17.4.1%22%2C%22KEGG:00240%2B1.17.4.1%22%2C%22KEGG:00480%2B1.17.4.1%22%2C%22MetaCyc:PWY-6545%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7210%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222%22%2C%22MetaCyc:PWY-7226%22%2C%22MetaCyc:PWY-7227%22%2C%22UniPathway:UPA00326;ID=match%2497_7_215;Name=SSF48168;Target=Merlin_271 7 215;date=23-02-2015;status=T +Merlin PANTHER protein_match 151217 151944 . + . ID=match%2498_1_728;Name=PTHR11573:SF6;Target=Merlin_271 1 728;date=23-02-2015;status=T +Merlin ProSiteProfiles protein_match 151217 151306 . + . Dbxref=InterPro:IPR005144;ID=match%2499_1_90;Name=PS51161;Target=Merlin_271 1 90;date=23-02-2015;signature_desc=ATP-cone domain profile.;status=T +Merlin Pfam protein_match 151353 151429 2.2E-16 + . Dbxref=InterPro:IPR013509%22%2C%22KEGG:00230%2B1.17.4.1%22%2C%22KEGG:00240%2B1.17.4.1%22%2C%22KEGG:00480%2B1.17.4.1%22%2C%22MetaCyc:PWY-6545%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7210%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222%22%2C%22MetaCyc:PWY-7226%22%2C%22MetaCyc:PWY-7227%22%2C%22UniPathway:UPA00326;ID=match%24100_137_213;Name=PF00317;Ontology_term=GO:0004748%22%2C%22GO:0005524%22%2C%22GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 137 213;date=23-02-2015;signature_desc=Ribonucleotide reductase,all-alpha domain;status=T +Merlin Pfam protein_match 151217 151303 2.5E-10 + . Dbxref=InterPro:IPR005144;ID=match%24101_1_87;Name=PF03477;Target=Merlin_271 1 87;date=23-02-2015;signature_desc=ATP cone domain;status=T +Merlin Gene3D protein_match 151817 151862 1.4E-37 + . ID=match%24102_601_646;Name=G3DSA:3.20.70.20;Target=Merlin_271 601 646;date=23-02-2015;status=T +Merlin Gene3D protein_match 151609 151766 1.4E-37 + . ID=match%24102_393_550;Name=G3DSA:3.20.70.20;Target=Merlin_271 393 550;date=23-02-2015;status=T +Merlin Gene3D protein_match 151894 151919 1.4E-37 + . ID=match%24102_678_703;Name=G3DSA:3.20.70.20;Target=Merlin_271 678 703;date=23-02-2015;status=T +Merlin Gene3D protein_match 151343 151554 1.4E-37 + . ID=match%24102_127_338;Name=G3DSA:3.20.70.20;Target=Merlin_271 127 338;date=23-02-2015;status=T +Merlin TIGRFAM protein_match 151356 151944 5.5E-158 + . Dbxref=InterPro:IPR013346%22%2C%22KEGG:00230%2B1.17.4.1%22%2C%22KEGG:00240%2B1.17.4.1%22%2C%22KEGG:00480%2B1.17.4.1%22%2C%22MetaCyc:PWY-6545%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7210%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222%22%2C%22MetaCyc:PWY-7226%22%2C%22MetaCyc:PWY-7227%22%2C%22UniPathway:UPA00326;ID=match%24103_140_728;Name=TIGR02506;Ontology_term=GO:0055114;Target=Merlin_271 140 728;date=23-02-2015;signature_desc=NrdE_NrdA: ribonucleoside-diphosphate reductase,alpha subunit;status=T +Merlin ProSitePatterns protein_match 151806 151828 . + . Dbxref=InterPro:IPR000788;ID=match%24104_590_612;Name=PS00089;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 590 612;date=23-02-2015;signature_desc=Ribonucleotide reductase large subunit signature.;status=T +Merlin SUPERFAMILY protein_match 151432 151966 . + . ID=match%24105_216_750;Name=SSF51998;Target=Merlin_271 216 750;date=23-02-2015;status=T +Merlin PRINTS protein_match 151644 151655 . + . Dbxref=InterPro:IPR000788;ID=match%24106_428_439;Name=PR01183;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 428 439;date=23-02-2015;signature_desc=Ribonucleotide reductase large chain signature;status=T +Merlin PRINTS protein_match 151508 151527 . + . Dbxref=InterPro:IPR000788;ID=match%24106_292_311;Name=PR01183;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 292 311;date=23-02-2015;signature_desc=Ribonucleotide reductase large chain signature;status=T +Merlin PRINTS protein_match 151746 151769 . + . Dbxref=InterPro:IPR000788;ID=match%24106_530_553;Name=PR01183;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 530 553;date=23-02-2015;signature_desc=Ribonucleotide reductase large chain signature;status=T +Merlin PRINTS protein_match 151683 151706 . + . Dbxref=InterPro:IPR000788;ID=match%24106_467_490;Name=PR01183;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 467 490;date=23-02-2015;signature_desc=Ribonucleotide reductase large chain signature;status=T +Merlin PRINTS protein_match 151720 151742 . + . Dbxref=InterPro:IPR000788;ID=match%24106_504_526;Name=PR01183;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 504 526;date=23-02-2015;signature_desc=Ribonucleotide reductase large chain signature;status=T +Merlin PRINTS protein_match 151817 151844 . + . Dbxref=InterPro:IPR000788;ID=match%24106_601_628;Name=PR01183;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 601 628;date=23-02-2015;signature_desc=Ribonucleotide reductase large chain signature;status=T +Merlin PANTHER protein_match 151217 151944 . + . ID=match%24107_1_728;Name=PTHR11573;Target=Merlin_271 1 728;date=23-02-2015;status=T +Merlin Pfam protein_match 151432 151940 1.2E-122 + . Dbxref=InterPro:IPR000788;ID=match%24108_216_724;Name=PF02867;Ontology_term=GO:0006260%22%2C%22GO:0055114;Target=Merlin_271 216 724;date=23-02-2015;signature_desc=Ribonucleotide reductase,barrel domain;status=T +Merlin feature polypeptide 153579 153958 . + . ID=Merlin_272;md5=6644d7fb6031305c7deb695f447f2970 +Merlin ProSitePatterns protein_match 153692 153708 . + . Dbxref=InterPro:IPR000358%22%2C%22KEGG:00230%2B1.17.4.1%22%2C%22KEGG:00240%2B1.17.4.1%22%2C%22KEGG:00480%2B1.17.4.1%22%2C%22MetaCyc:PWY-6545%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7210%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222%22%2C%22MetaCyc:PWY-7226%22%2C%22MetaCyc:PWY-7227%22%2C%22Reactome:REACT_1698%22%2C%22UniPathway:UPA00326;ID=match%241_114_130;Name=PS00368;Ontology_term=GO:0009186%22%2C%22GO:0055114;Target=Merlin_272 114 130;date=23-02-2015;signature_desc=Ribonucleotide reductase small subunit signature.;status=T +Merlin SUPERFAMILY protein_match 153581 153921 . + . Dbxref=InterPro:IPR009078;ID=match%242_3_343;Name=SSF47240;Target=Merlin_272 3 343;date=23-02-2015;status=T +Merlin PANTHER protein_match 153589 153901 . + . Dbxref=InterPro:IPR000358%22%2C%22KEGG:00230%2B1.17.4.1%22%2C%22KEGG:00240%2B1.17.4.1%22%2C%22KEGG:00480%2B1.17.4.1%22%2C%22MetaCyc:PWY-6545%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7210%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222%22%2C%22MetaCyc:PWY-7226%22%2C%22MetaCyc:PWY-7227%22%2C%22Reactome:REACT_1698%22%2C%22UniPathway:UPA00326;ID=match%243_11_323;Name=PTHR23409;Ontology_term=GO:0009186%22%2C%22GO:0055114;Target=Merlin_272 11 323;date=23-02-2015;status=T +Merlin Pfam protein_match 153608 153731 3.0E-12 + . Dbxref=InterPro:IPR000358%22%2C%22KEGG:00230%2B1.17.4.1%22%2C%22KEGG:00240%2B1.17.4.1%22%2C%22KEGG:00480%2B1.17.4.1%22%2C%22MetaCyc:PWY-6545%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7210%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222%22%2C%22MetaCyc:PWY-7226%22%2C%22MetaCyc:PWY-7227%22%2C%22Reactome:REACT_1698%22%2C%22UniPathway:UPA00326;ID=match%244_30_153;Name=PF00268;Ontology_term=GO:0009186%22%2C%22GO:0055114;Target=Merlin_272 30 153;date=23-02-2015;signature_desc=Ribonucleotide reductase,small chain;status=T +Merlin Pfam protein_match 153779 153891 2.6E-9 + . Dbxref=InterPro:IPR000358%22%2C%22KEGG:00230%2B1.17.4.1%22%2C%22KEGG:00240%2B1.17.4.1%22%2C%22KEGG:00480%2B1.17.4.1%22%2C%22MetaCyc:PWY-6545%22%2C%22MetaCyc:PWY-7184%22%2C%22MetaCyc:PWY-7198%22%2C%22MetaCyc:PWY-7210%22%2C%22MetaCyc:PWY-7220%22%2C%22MetaCyc:PWY-7222%22%2C%22MetaCyc:PWY-7226%22%2C%22MetaCyc:PWY-7227%22%2C%22Reactome:REACT_1698%22%2C%22UniPathway:UPA00326;ID=match%244_201_313;Name=PF00268;Ontology_term=GO:0009186%22%2C%22GO:0055114;Target=Merlin_272 201 313;date=23-02-2015;signature_desc=Ribonucleotide reductase,small chain;status=T +Merlin Gene3D protein_match 153580 153919 2.2E-109 + . Dbxref=InterPro:IPR012348%22%2C%22Reactome:REACT_1698;ID=match%245_2_341;Name=G3DSA:1.10.620.20;Ontology_term=GO:0016491%22%2C%22GO:0055114;Target=Merlin_272 2 341;date=23-02-2015;status=T +Merlin feature polypeptide 155391 155766 . + . ID=Merlin_275;md5=ce3495549b48ab3bd0c2b0037e6df426 +Merlin Pfam protein_match 155443 155638 8.3E-40 + . Dbxref=InterPro:IPR019039;ID=match%24356_53_248;Name=PF09511;Target=Merlin_275 53 248;date=23-02-2015;signature_desc=RNA ligase;status=T +Merlin TIGRFAM protein_match 155392 155765 8.6E-152 + . Dbxref=InterPro:IPR012648;ID=match%24357_2_375;Name=TIGR02308;Target=Merlin_275 2 375;date=23-02-2015;signature_desc=RNA_lig_T4_1: RNA ligase,T4 RnlA family;status=T +Merlin feature polypeptide 157075 157192 . + . ID=Merlin_277;md5=b11599cb66104e684ea53adc8bc842fa +Merlin TMHMM protein_match 157078 157096 . + . ID=match%24279_4_22;Name=TMhelix;Target=Merlin_277 4 22;date=23-02-2015;signature_desc=Region of a membrane-bound protein predicted to be embedded in the membrane.;status=T +Merlin Pfam protein_match 157080 157147 5.6E-5 + . Dbxref=InterPro:IPR022538;ID=match%24280_6_73;Name=PF10828;Target=Merlin_277 6 73;date=23-02-2015;signature_desc=Protein of unknown function %28DUF2570%29;status=T +Merlin feature polypeptide 157428 157528 . + . ID=Merlin_278;md5=68f4d1929d8336b59397e3d49278570b +Merlin ProSiteProfiles protein_match 157428 157444 . + . ID=match%24299_1_17;Name=PS51257;Target=Merlin_278 1 17;date=23-02-2015;signature_desc=Prokaryotic membrane lipoprotein lipid attachment site profile.;status=T +Merlin feature polypeptide 158834 159131 . + . ID=Merlin_282;md5=ce335df5857b476c0b439049d2098663 +Merlin Gene3D protein_match 158990 159131 2.9E-42 + . Dbxref=InterPro:IPR023214;ID=match%24114_157_298;Name=G3DSA:3.40.50.1000;Target=Merlin_282 157 298;date=23-02-2015;status=T +Merlin Gene3D protein_match 158834 158976 2.4E-33 + . Dbxref=InterPro:IPR027417;ID=match%24115_1_143;Name=G3DSA:3.40.50.300;Target=Merlin_282 1 143;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 158834 158979 . + . Dbxref=InterPro:IPR027417;ID=match%24116_1_146;Name=SSF52540;Target=Merlin_282 1 146;date=23-02-2015;status=T +Merlin Pfam protein_match 158837 158975 2.6E-18 + . ID=match%24117_4_142;Name=PF13671;Target=Merlin_282 4 142;date=23-02-2015;signature_desc=AAA domain;status=T +Merlin SUPERFAMILY protein_match 158990 159130 . + . Dbxref=InterPro:IPR023214;ID=match%24118_157_297;Name=SSF56784;Target=Merlin_282 157 297;date=23-02-2015;status=T +Merlin feature polypeptide 160985 161171 . + . ID=Merlin_288;md5=d0b41cecb322093c58b42250882d2a49 +Merlin Pfam protein_match 160987 161127 6.8E-38 + . Dbxref=InterPro:IPR002125;ID=match%2410_3_143;Name=PF00383;Ontology_term=GO:0008270%22%2C%22GO:0016787;Target=Merlin_288 3 143;date=23-02-2015;signature_desc=Cytidine and deoxycytidylate deaminase zinc-binding region;status=T +Merlin PIRSF protein_match 160985 161160 4.6E-55 + . Dbxref=InterPro:IPR016473%22%2C%22KEGG:00240%2B3.5.4.12%22%2C%22MetaCyc:PWY-7210;ID=match%2411_1_176;Name=PIRSF006019;Ontology_term=GO:0004132%22%2C%22GO:0006220%22%2C%22GO:0008270;Target=Merlin_288 1 176;date=23-02-2015;status=T +Merlin PANTHER protein_match 160991 161147 . + . Dbxref=InterPro:IPR015517;ID=match%2412_7_163;Name=PTHR11086;Target=Merlin_288 7 163;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 160985 161165 . + . Dbxref=InterPro:IPR016193;ID=match%2413_1_181;Name=SSF53927;Ontology_term=GO:0003824;Target=Merlin_288 1 181;date=23-02-2015;status=T +Merlin Gene3D protein_match 160988 161153 4.6E-45 + . ID=match%2414_4_169;Name=G3DSA:3.40.140.10;Target=Merlin_288 4 169;date=23-02-2015;status=T +Merlin ProSitePatterns protein_match 161080 161115 . + . Dbxref=InterPro:IPR016192;ID=match%2415_96_131;Name=PS00903;Ontology_term=GO:0008270%22%2C%22GO:0016787;Target=Merlin_288 96 131;date=23-02-2015;signature_desc=Cytidine and deoxycytidylate deaminases zinc-binding region signature.;status=T +Merlin feature polypeptide 162448 162555 . + . ID=Merlin_292;md5=9c8f652cc909186bc50ffdb6a807f7e0 +Merlin Pfam protein_match 162453 162554 1.8E-10 + . Dbxref=InterPro:IPR020818;ID=match%24110_6_107;Name=PF00166;Ontology_term=GO:0005737%22%2C%22GO:0006457;Target=Merlin_292 6 107;date=23-02-2015;signature_desc=Chaperonin 10 Kd subunit;status=T +Merlin SUPERFAMILY protein_match 162450 162555 . + . Dbxref=InterPro:IPR011032;ID=match%24111_3_108;Name=SSF50129;Target=Merlin_292 3 108;date=23-02-2015;status=T +Merlin Gene3D protein_match 162448 162555 1.4E-43 + . Dbxref=InterPro:IPR020818;ID=match%24112_1_108;Name=G3DSA:2.30.33.40;Ontology_term=GO:0005737%22%2C%22GO:0006457;Target=Merlin_292 1 108;date=23-02-2015;status=T +Merlin feature polypeptide 163763 163884 . + . ID=Merlin_295;md5=bade0d4ddd9fad6cfcadaf07ebe0390f +Merlin Pfam protein_match 163764 163884 1.7E-47 + . Dbxref=InterPro:IPR009258;ID=match%24372_2_122;Name=PF06019;Target=Merlin_295 2 122;date=23-02-2015;signature_desc=Phage GP30.8 protein;status=T +Merlin feature polypeptide 164714 164831 . + . ID=Merlin_297;md5=7ab0074c6c9c4d1ae4120e8c8a78d0ce +Merlin Pfam protein_match 164714 164829 4.3E-44 + . Dbxref=InterPro:IPR009690;ID=match%24310_1_116;Name=PF06919;Target=Merlin_297 1 116;date=23-02-2015;signature_desc=Phage Gp30.7 protein;status=T +Merlin feature polypeptide 166351 166632 . + . ID=Merlin_302;md5=37dcea9b1ef261183a31069700925af4 +Merlin Pfam protein_match 166361 166518 6.6E-5 + . Dbxref=InterPro:IPR023214;ID=match%24166_11_168;Name=PF13419;Target=Merlin_302 11 168;date=23-02-2015;signature_desc=Haloacid dehalogenase-like hydrolase;status=T +Merlin SUPERFAMILY protein_match 166357 166561 . + . Dbxref=InterPro:IPR023214;ID=match%24167_7_211;Name=SSF56784;Target=Merlin_302 7 211;date=23-02-2015;status=T +Merlin Gene3D protein_match 166428 166561 5.5E-6 + . Dbxref=InterPro:IPR023214;ID=match%24168_78_211;Name=G3DSA:3.40.50.1000;Target=Merlin_302 78 211;date=23-02-2015;status=T +Merlin Gene3D protein_match 166359 166371 5.5E-6 + . Dbxref=InterPro:IPR023214;ID=match%24168_9_21;Name=G3DSA:3.40.50.1000;Target=Merlin_302 9 21;date=23-02-2015;status=T +Merlin feature polypeptide 167486 167970 . + . ID=Merlin_304;md5=283a418fea20ac001bffdcbf72299ca8 +Merlin SUPERFAMILY protein_match 167619 167850 . + . ID=match%24122_134_365;Name=SSF56091;Target=Merlin_304 134 365;date=23-02-2015;status=T +Merlin Gene3D protein_match 167834 167952 5.9E-8 + . Dbxref=InterPro:IPR012340;ID=match%24123_349_467;Name=G3DSA:2.40.50.140;Target=Merlin_304 349 467;date=23-02-2015;status=T +Merlin ProSitePatterns protein_match 167827 167850 . + . Dbxref=InterPro:IPR016059%22%2C%22Reactome:REACT_216;ID=match%24124_342_365;Name=PS00333;Ontology_term=GO:0003909%22%2C%22GO:0051103;Target=Merlin_304 342 365;date=23-02-2015;signature_desc=ATP-dependent DNA ligase signature 2.;status=T +Merlin Pfam protein_match 167620 167850 9.0E-28 + . Dbxref=InterPro:IPR012310%22%2C%22Reactome:REACT_216;ID=match%24125_135_365;Name=PF01068;Ontology_term=GO:0003910%22%2C%22GO:0005524%22%2C%22GO:0006281%22%2C%22GO:0006310;Target=Merlin_304 135 365;date=23-02-2015;signature_desc=ATP dependent DNA ligase domain;status=T +Merlin ProSitePatterns protein_match 167642 167650 . + . Dbxref=InterPro:IPR016059%22%2C%22Reactome:REACT_216;ID=match%24126_157_165;Name=PS00697;Ontology_term=GO:0003909%22%2C%22GO:0051103;Target=Merlin_304 157 165;date=23-02-2015;signature_desc=ATP-dependent DNA ligase AMP-binding site.;status=T +Merlin Gene3D protein_match 167803 167833 8.5E-8 + . ID=match%24127_318_348;Name=G3DSA:3.30.1490.70;Target=Merlin_304 318 348;date=23-02-2015;status=T +Merlin Gene3D protein_match 167621 167646 8.5E-8 + . ID=match%24127_136_161;Name=G3DSA:3.30.1490.70;Target=Merlin_304 136 161;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 167823 167930 . + . Dbxref=InterPro:IPR012340;ID=match%24128_338_445;Name=SSF50249;Target=Merlin_304 338 445;date=23-02-2015;status=T +Merlin Gene3D protein_match 167661 167794 1.2E-4 + . ID=match%24129_176_309;Name=G3DSA:3.30.470.30;Target=Merlin_304 176 309;date=23-02-2015;status=T +Merlin feature polypeptide 169174 169869 . + . ID=Merlin_306;md5=3f61e1cb18fb135a3dc061968bcd879c +Merlin SUPERFAMILY protein_match 169736 169781 . + . ID=match%24253_563_608;Name=SSF56399;Target=Merlin_306 563 608;date=23-02-2015;status=T +Merlin SUPERFAMILY protein_match 169590 169696 . + . ID=match%24253_417_523;Name=SSF56399;Target=Merlin_306 417 523;date=23-02-2015;status=T +Merlin Pfam protein_match 169589 169781 1.6E-22 + . Dbxref=InterPro:IPR003540;ID=match%24254_416_608;Name=PF03496;Ontology_term=GO:0005576%22%2C%22GO:0009405;Target=Merlin_306 416 608;date=23-02-2015;signature_desc=ADP-ribosyltransferase exoenzyme;status=T +Merlin Gene3D protein_match 169597 169698 2.1E-28 + . ID=match%24255_424_525;Name=G3DSA:3.90.176.10;Target=Merlin_306 424 525;date=23-02-2015;status=T +Merlin Gene3D protein_match 169738 169784 2.1E-28 + . ID=match%24255_565_611;Name=G3DSA:3.90.176.10;Target=Merlin_306 565 611;date=23-02-2015;status=T +Merlin feature polypeptide 171300 171794 . + . ID=Merlin_307;md5=0f4b8b0843334ccf18e5a4a7cbdf67b2 +Merlin SUPERFAMILY protein_match 171723 171792 . + . ID=match%24339_424_493;Name=SSF56399;Target=Merlin_307 424 493;date=23-02-2015;status=T +Merlin Gene3D protein_match 171722 171791 5.2E-11 + . ID=match%24340_423_492;Name=G3DSA:3.90.176.10;Target=Merlin_307 423 492;date=23-02-2015;status=T +Merlin Pfam protein_match 171723 171791 2.0E-9 + . Dbxref=InterPro:IPR003540;ID=match%24341_424_492;Name=PF03496;Ontology_term=GO:0005576%22%2C%22GO:0009405;Target=Merlin_307 424 492;date=23-02-2015;signature_desc=ADP-ribosyltransferase exoenzyme;status=T
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/merlin.gff Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,1230 @@ +##gff-version 3 +##sequence-region Merlin 1 172788 +Merlin GeneMark.hmm gene 2 691 -856.563659 + . ID=Merlin_1;seqid=Merlin +Merlin GeneMark.hmm mRNA 2 691 . + . ID=Merlin_1_mRNA;Parent=Merlin_1;seqid=Merlin;color=#00ff00 +Merlin GeneMark.hmm exon 2 691 . + . ID=Merlin_1_exon;Parent=Merlin_1_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 2 691 . + 0 ID=Merlin_1_CDS;Parent=Merlin_1_exon;seqid=Merlin +Merlin GeneMark.hmm gene 752 1039 -339.046618 + . ID=Merlin_2;seqid=Merlin +Merlin GeneMark.hmm mRNA 752 1039 . + . ID=Merlin_2_mRNA;Parent=Merlin_2;seqid=Merlin +Merlin GeneMark.hmm exon 752 1039 . + . ID=Merlin_2_exon;Parent=Merlin_2_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 752 1039 . + 0 ID=Merlin_2_CDS;Parent=Merlin_2_exon;seqid=Merlin +Merlin GeneMark.hmm gene 1067 2011 -1229.683915 - . ID=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm mRNA 1067 2011 . - . ID=Merlin_3_mRNA;Parent=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm exon 1067 2011 . - . ID=Merlin_3_exon;Parent=Merlin_3_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 1067 2011 . - 0 ID=Merlin_3_CDS;Parent=Merlin_3_exon;seqid=Merlin +Merlin GeneMark.hmm gene 2011 3066 -1335.034872 - . ID=Merlin_4;seqid=Merlin +Merlin GeneMark.hmm mRNA 2011 3066 . - . ID=Merlin_4_mRNA;Parent=Merlin_4;seqid=Merlin +Merlin GeneMark.hmm exon 2011 3066 . - . ID=Merlin_4_exon;Parent=Merlin_4_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 2011 3066 . - 0 ID=Merlin_4_CDS;Parent=Merlin_4_exon;seqid=Merlin +Merlin GeneMark.hmm gene 3066 4796 -2177.374893 - . ID=Merlin_5;seqid=Merlin +Merlin GeneMark.hmm mRNA 3066 4796 . - . ID=Merlin_5_mRNA;Parent=Merlin_5;seqid=Merlin +Merlin GeneMark.hmm exon 3066 4796 . - . ID=Merlin_5_exon;Parent=Merlin_5_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 3066 4796 . - 0 ID=Merlin_5_CDS;Parent=Merlin_5_exon;seqid=Merlin +Merlin GeneMark.hmm gene 4793 5317 -682.565030 - . ID=Merlin_6;seqid=Merlin +Merlin GeneMark.hmm mRNA 4793 5317 . - . ID=Merlin_6_mRNA;Parent=Merlin_6;seqid=Merlin +Merlin GeneMark.hmm exon 4793 5317 . - . ID=Merlin_6_exon;Parent=Merlin_6_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 4793 5317 . - 0 ID=Merlin_6_CDS;Parent=Merlin_6_exon;seqid=Merlin +Merlin GeneMark.hmm gene 5289 6431 -1457.525863 - . ID=Merlin_7;seqid=Merlin +Merlin GeneMark.hmm mRNA 5289 6431 . - . ID=Merlin_7_mRNA;Parent=Merlin_7;seqid=Merlin +Merlin GeneMark.hmm exon 5289 6431 . - . ID=Merlin_7_exon;Parent=Merlin_7_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 5289 6431 . - 0 ID=Merlin_7_CDS;Parent=Merlin_7_exon;seqid=Merlin +Merlin GeneMark.hmm gene 6428 7180 -968.015933 - . ID=Merlin_8;seqid=Merlin +Merlin GeneMark.hmm mRNA 6428 7180 . - . ID=Merlin_8_mRNA;Parent=Merlin_8;seqid=Merlin +Merlin GeneMark.hmm exon 6428 7180 . - . ID=Merlin_8_exon;Parent=Merlin_8_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 6428 7180 . - 0 ID=Merlin_8_CDS;Parent=Merlin_8_exon;seqid=Merlin +Merlin GeneMark.hmm gene 7228 7857 -809.330137 + . ID=Merlin_9;seqid=Merlin +Merlin GeneMark.hmm mRNA 7228 7857 . + . ID=Merlin_9_mRNA;Parent=Merlin_9;seqid=Merlin +Merlin GeneMark.hmm exon 7228 7857 . + . ID=Merlin_9_exon;Parent=Merlin_9_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 7228 7857 . + 0 ID=Merlin_9_CDS;Parent=Merlin_9_exon;seqid=Merlin +Merlin GeneMark.hmm gene 7857 8252 -515.006678 + . ID=Merlin_10;seqid=Merlin +Merlin GeneMark.hmm mRNA 7857 8252 . + . ID=Merlin_10_mRNA;Parent=Merlin_10;seqid=Merlin +Merlin GeneMark.hmm exon 7857 8252 . + . ID=Merlin_10_exon;Parent=Merlin_10_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 7857 8252 . + 0 ID=Merlin_10_CDS;Parent=Merlin_10_exon;seqid=Merlin +Merlin GeneMark.hmm gene 8340 8753 -522.529341 + . ID=Merlin_11;seqid=Merlin +Merlin GeneMark.hmm mRNA 8340 8753 . + . ID=Merlin_11_mRNA;Parent=Merlin_11;seqid=Merlin +Merlin GeneMark.hmm exon 8340 8753 . + . ID=Merlin_11_exon;Parent=Merlin_11_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 8340 8753 . + 0 ID=Merlin_11_CDS;Parent=Merlin_11_exon;seqid=Merlin +Merlin GeneMark.hmm gene 8787 8951 -212.019038 + . ID=Merlin_12;seqid=Merlin +Merlin GeneMark.hmm mRNA 8787 8951 . + . ID=Merlin_12_mRNA;Parent=Merlin_12;seqid=Merlin +Merlin GeneMark.hmm exon 8787 8951 . + . ID=Merlin_12_exon;Parent=Merlin_12_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 8787 8951 . + 0 ID=Merlin_12_CDS;Parent=Merlin_12_exon;seqid=Merlin +Merlin GeneMark.hmm gene 9014 9241 -274.669850 - . ID=Merlin_13;seqid=Merlin +Merlin GeneMark.hmm mRNA 9014 9241 . - . ID=Merlin_13_mRNA;Parent=Merlin_13;seqid=Merlin +Merlin GeneMark.hmm exon 9014 9241 . - . ID=Merlin_13_exon;Parent=Merlin_13_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 9014 9241 . - 0 ID=Merlin_13_CDS;Parent=Merlin_13_exon;seqid=Merlin +Merlin GeneMark.hmm gene 9248 10747 -1911.373457 - . ID=Merlin_14;seqid=Merlin +Merlin GeneMark.hmm mRNA 9248 10747 . - . ID=Merlin_14_mRNA;Parent=Merlin_14;seqid=Merlin +Merlin GeneMark.hmm exon 9248 10747 . - . ID=Merlin_14_exon;Parent=Merlin_14_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 9248 10747 . - 0 ID=Merlin_14_CDS;Parent=Merlin_14_exon;seqid=Merlin +Merlin GeneMark.hmm gene 10800 11435 -778.108633 + . ID=Merlin_15;seqid=Merlin +Merlin GeneMark.hmm mRNA 10800 11435 . + . ID=Merlin_15_mRNA;Parent=Merlin_15;seqid=Merlin +Merlin GeneMark.hmm exon 10800 11435 . + . ID=Merlin_15_exon;Parent=Merlin_15_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 10800 11435 . + 0 ID=Merlin_15_CDS;Parent=Merlin_15_exon;seqid=Merlin +Merlin GeneMark.hmm gene 11469 12290 -1045.093825 + . ID=Merlin_16;seqid=Merlin +Merlin GeneMark.hmm mRNA 11469 12290 . + . ID=Merlin_16_mRNA;Parent=Merlin_16;seqid=Merlin +Merlin GeneMark.hmm exon 11469 12290 . + . ID=Merlin_16_exon;Parent=Merlin_16_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 11469 12290 . + 0 ID=Merlin_16_CDS;Parent=Merlin_16_exon;seqid=Merlin +Merlin GeneMark.hmm gene 12365 12601 -286.579590 + . ID=Merlin_17;seqid=Merlin +Merlin GeneMark.hmm mRNA 12365 12601 . + . ID=Merlin_17_mRNA;Parent=Merlin_17;seqid=Merlin +Merlin GeneMark.hmm exon 12365 12601 . + . ID=Merlin_17_exon;Parent=Merlin_17_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 12365 12601 . + 0 ID=Merlin_17_CDS;Parent=Merlin_17_exon;seqid=Merlin +Merlin GeneMark.hmm gene 12598 12951 -440.013978 + . ID=Merlin_18;seqid=Merlin +Merlin GeneMark.hmm mRNA 12598 12951 . + . ID=Merlin_18_mRNA;Parent=Merlin_18;seqid=Merlin +Merlin GeneMark.hmm exon 12598 12951 . + . ID=Merlin_18_exon;Parent=Merlin_18_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 12598 12951 . + 0 ID=Merlin_18_CDS;Parent=Merlin_18_exon;seqid=Merlin +Merlin GeneMark.hmm gene 13067 13330 -321.884922 + . ID=Merlin_19;seqid=Merlin +Merlin GeneMark.hmm mRNA 13067 13330 . + . ID=Merlin_19_mRNA;Parent=Merlin_19;seqid=Merlin +Merlin GeneMark.hmm exon 13067 13330 . + . ID=Merlin_19_exon;Parent=Merlin_19_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 13067 13330 . + 0 ID=Merlin_19_CDS;Parent=Merlin_19_exon;seqid=Merlin +Merlin GeneMark.hmm gene 13340 14341 -1253.644245 + . ID=Merlin_20;seqid=Merlin +Merlin GeneMark.hmm mRNA 13340 14341 . + . ID=Merlin_20_mRNA;Parent=Merlin_20;seqid=Merlin +Merlin GeneMark.hmm exon 13340 14341 . + . ID=Merlin_20_exon;Parent=Merlin_20_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 13340 14341 . + 0 ID=Merlin_20_CDS;Parent=Merlin_20_exon;seqid=Merlin +Merlin GeneMark.hmm gene 14320 14883 -740.935174 + . ID=Merlin_21;seqid=Merlin +Merlin GeneMark.hmm mRNA 14320 14883 . + . ID=Merlin_21_mRNA;Parent=Merlin_21;seqid=Merlin +Merlin GeneMark.hmm exon 14320 14883 . + . ID=Merlin_21_exon;Parent=Merlin_21_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 14320 14883 . + 0 ID=Merlin_21_CDS;Parent=Merlin_21_exon;seqid=Merlin +Merlin GeneMark.hmm gene 14911 16197 -1617.100759 - . ID=Merlin_22;seqid=Merlin +Merlin GeneMark.hmm mRNA 14911 16197 . - . ID=Merlin_22_mRNA;Parent=Merlin_22;seqid=Merlin +Merlin GeneMark.hmm exon 14911 16197 . - . ID=Merlin_22_exon;Parent=Merlin_22_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 14911 16197 . - 0 ID=Merlin_22_CDS;Parent=Merlin_22_exon;seqid=Merlin +Merlin GeneMark.hmm gene 16289 17836 -1947.052483 - . ID=Merlin_23;seqid=Merlin +Merlin GeneMark.hmm mRNA 16289 17836 . - . ID=Merlin_23_mRNA;Parent=Merlin_23;seqid=Merlin +Merlin GeneMark.hmm exon 16289 17836 . - . ID=Merlin_23_exon;Parent=Merlin_23_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 16289 17836 . - 0 ID=Merlin_23_CDS;Parent=Merlin_23_exon;seqid=Merlin +Merlin GeneMark.hmm gene 17858 18673 -991.849469 - . ID=Merlin_24;seqid=Merlin +Merlin GeneMark.hmm mRNA 17858 18673 . - . ID=Merlin_24_mRNA;Parent=Merlin_24;seqid=Merlin +Merlin GeneMark.hmm exon 17858 18673 . - . ID=Merlin_24_exon;Parent=Merlin_24_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 17858 18673 . - 0 ID=Merlin_24_CDS;Parent=Merlin_24_exon;seqid=Merlin +Merlin GeneMark.hmm gene 18707 19351 -821.724123 - . ID=Merlin_25;seqid=Merlin +Merlin GeneMark.hmm mRNA 18707 19351 . - . ID=Merlin_25_mRNA;Parent=Merlin_25;seqid=Merlin +Merlin GeneMark.hmm exon 18707 19351 . - . ID=Merlin_25_exon;Parent=Merlin_25_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 18707 19351 . - 0 ID=Merlin_25_CDS;Parent=Merlin_25_exon;seqid=Merlin +Merlin GeneMark.hmm gene 19351 19776 -538.184958 - . ID=Merlin_26;seqid=Merlin +Merlin GeneMark.hmm mRNA 19351 19776 . - . ID=Merlin_26_mRNA;Parent=Merlin_26;seqid=Merlin +Merlin GeneMark.hmm exon 19351 19776 . - . ID=Merlin_26_exon;Parent=Merlin_26_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 19351 19776 . - 0 ID=Merlin_26_CDS;Parent=Merlin_26_exon;seqid=Merlin +Merlin GeneMark.hmm gene 19776 19988 -255.987740 - . ID=Merlin_27;seqid=Merlin +Merlin GeneMark.hmm mRNA 19776 19988 . - . ID=Merlin_27_mRNA;Parent=Merlin_27;seqid=Merlin +Merlin GeneMark.hmm exon 19776 19988 . - . ID=Merlin_27_exon;Parent=Merlin_27_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 19776 19988 . - 0 ID=Merlin_27_CDS;Parent=Merlin_27_exon;seqid=Merlin +Merlin GeneMark.hmm gene 19988 21550 -1974.103338 - . ID=Merlin_28;seqid=Merlin +Merlin GeneMark.hmm mRNA 19988 21550 . - . ID=Merlin_28_mRNA;Parent=Merlin_28;seqid=Merlin +Merlin GeneMark.hmm exon 19988 21550 . - . ID=Merlin_28_exon;Parent=Merlin_28_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 19988 21550 . - 0 ID=Merlin_28_CDS;Parent=Merlin_28_exon;seqid=Merlin +Merlin GeneMark.hmm gene 21625 22116 -616.669463 - . ID=Merlin_29;seqid=Merlin +Merlin GeneMark.hmm mRNA 21625 22116 . - . ID=Merlin_29_mRNA;Parent=Merlin_29;seqid=Merlin +Merlin GeneMark.hmm exon 21625 22116 . - . ID=Merlin_29_exon;Parent=Merlin_29_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 21625 22116 . - 0 ID=Merlin_29_CDS;Parent=Merlin_29_exon;seqid=Merlin +Merlin GeneMark.hmm gene 22240 24216 -2488.948058 - . ID=Merlin_30;seqid=Merlin +Merlin GeneMark.hmm mRNA 22240 24216 . - . ID=Merlin_30_mRNA;Parent=Merlin_30;seqid=Merlin +Merlin GeneMark.hmm exon 22240 24216 . - . ID=Merlin_30_exon;Parent=Merlin_30_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 22240 24216 . - 0 ID=Merlin_30_CDS;Parent=Merlin_30_exon;seqid=Merlin +Merlin GeneMark.hmm gene 24250 26094 -2334.323049 - . ID=Merlin_31;seqid=Merlin +Merlin GeneMark.hmm mRNA 24250 26094 . - . ID=Merlin_31_mRNA;Parent=Merlin_31;seqid=Merlin +Merlin GeneMark.hmm exon 24250 26094 . - . ID=Merlin_31_exon;Parent=Merlin_31_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 24250 26094 . - 0 ID=Merlin_31_CDS;Parent=Merlin_31_exon;seqid=Merlin +Merlin GeneMark.hmm gene 26072 26569 -622.542092 - . ID=Merlin_32;seqid=Merlin +Merlin GeneMark.hmm mRNA 26072 26569 . - . ID=Merlin_32_mRNA;Parent=Merlin_32;seqid=Merlin +Merlin GeneMark.hmm exon 26072 26569 . - . ID=Merlin_32_exon;Parent=Merlin_32_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 26072 26569 . - 0 ID=Merlin_32_CDS;Parent=Merlin_32_exon;seqid=Merlin +Merlin GeneMark.hmm gene 26572 27390 -1062.517306 - . ID=Merlin_33;seqid=Merlin +Merlin GeneMark.hmm mRNA 26572 27390 . - . ID=Merlin_33_mRNA;Parent=Merlin_33;seqid=Merlin +Merlin GeneMark.hmm exon 26572 27390 . - . ID=Merlin_33_exon;Parent=Merlin_33_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 26572 27390 . - 0 ID=Merlin_33_CDS;Parent=Merlin_33_exon;seqid=Merlin +Merlin GeneMark.hmm gene 27434 28204 -971.349898 - . ID=Merlin_34;seqid=Merlin +Merlin GeneMark.hmm mRNA 27434 28204 . - . ID=Merlin_34_mRNA;Parent=Merlin_34;seqid=Merlin +Merlin GeneMark.hmm exon 27434 28204 . - . ID=Merlin_34_exon;Parent=Merlin_34_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 27434 28204 . - 0 ID=Merlin_34_CDS;Parent=Merlin_34_exon;seqid=Merlin +Merlin GeneMark.hmm gene 28201 29130 -1172.195550 - . ID=Merlin_35;seqid=Merlin +Merlin GeneMark.hmm mRNA 28201 29130 . - . ID=Merlin_35_mRNA;Parent=Merlin_35;seqid=Merlin +Merlin GeneMark.hmm exon 28201 29130 . - . ID=Merlin_35_exon;Parent=Merlin_35_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 28201 29130 . - 0 ID=Merlin_35_CDS;Parent=Merlin_35_exon;seqid=Merlin +Merlin GeneMark.hmm gene 29162 30553 -1754.882559 - . ID=Merlin_36;seqid=Merlin +Merlin GeneMark.hmm mRNA 29162 30553 . - . ID=Merlin_36_mRNA;Parent=Merlin_36;seqid=Merlin +Merlin GeneMark.hmm exon 29162 30553 . - . ID=Merlin_36_exon;Parent=Merlin_36_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 29162 30553 . - 0 ID=Merlin_36_CDS;Parent=Merlin_36_exon;seqid=Merlin +Merlin GeneMark.hmm gene 30564 31982 -1840.409176 - . ID=Merlin_37;seqid=Merlin +Merlin GeneMark.hmm mRNA 30564 31982 . - . ID=Merlin_37_mRNA;Parent=Merlin_37;seqid=Merlin +Merlin GeneMark.hmm exon 30564 31982 . - . ID=Merlin_37_exon;Parent=Merlin_37_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 30564 31982 . - 0 ID=Merlin_37_CDS;Parent=Merlin_37_exon;seqid=Merlin +Merlin GeneMark.hmm gene 31982 32632 -810.715921 - . ID=Merlin_38;seqid=Merlin +Merlin GeneMark.hmm mRNA 31982 32632 . - . ID=Merlin_38_mRNA;Parent=Merlin_38;seqid=Merlin +Merlin GeneMark.hmm exon 31982 32632 . - . ID=Merlin_38_exon;Parent=Merlin_38_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 31982 32632 . - 0 ID=Merlin_38_CDS;Parent=Merlin_38_exon;seqid=Merlin +Merlin GeneMark.hmm gene 32632 34437 -2286.512966 - . ID=Merlin_39;seqid=Merlin +Merlin GeneMark.hmm mRNA 32632 34437 . - . ID=Merlin_39_mRNA;Parent=Merlin_39;seqid=Merlin +Merlin GeneMark.hmm exon 32632 34437 . - . ID=Merlin_39_exon;Parent=Merlin_39_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 32632 34437 . - 0 ID=Merlin_39_CDS;Parent=Merlin_39_exon;seqid=Merlin +Merlin GeneMark.hmm gene 34434 35300 -1103.339440 - . ID=Merlin_40;seqid=Merlin +Merlin GeneMark.hmm mRNA 34434 35300 . - . ID=Merlin_40_mRNA;Parent=Merlin_40;seqid=Merlin +Merlin GeneMark.hmm exon 34434 35300 . - . ID=Merlin_40_exon;Parent=Merlin_40_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 34434 35300 . - 0 ID=Merlin_40_CDS;Parent=Merlin_40_exon;seqid=Merlin +Merlin GeneMark.hmm gene 35372 36385 -1286.607331 - . ID=Merlin_41;seqid=Merlin +Merlin GeneMark.hmm mRNA 35372 36385 . - . ID=Merlin_41_mRNA;Parent=Merlin_41;seqid=Merlin +Merlin GeneMark.hmm exon 35372 36385 . - . ID=Merlin_41_exon;Parent=Merlin_41_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 35372 36385 . - 0 ID=Merlin_41_CDS;Parent=Merlin_41_exon;seqid=Merlin +Merlin GeneMark.hmm gene 36378 39479 -3926.862479 - . ID=Merlin_42;seqid=Merlin +Merlin GeneMark.hmm mRNA 36378 39479 . - . ID=Merlin_42_mRNA;Parent=Merlin_42;seqid=Merlin +Merlin GeneMark.hmm exon 36378 39479 . - . ID=Merlin_42_exon;Parent=Merlin_42_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 36378 39479 . - 0 ID=Merlin_42_CDS;Parent=Merlin_42_exon;seqid=Merlin +Merlin GeneMark.hmm gene 39476 41416 -2421.657174 - . ID=Merlin_43;seqid=Merlin +Merlin GeneMark.hmm mRNA 39476 41416 . - . ID=Merlin_43_mRNA;Parent=Merlin_43;seqid=Merlin +Merlin GeneMark.hmm exon 39476 41416 . - . ID=Merlin_43_exon;Parent=Merlin_43_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 39476 41416 . - 0 ID=Merlin_43_CDS;Parent=Merlin_43_exon;seqid=Merlin +Merlin GeneMark.hmm gene 41416 41709 -381.858612 - . ID=Merlin_44;seqid=Merlin +Merlin GeneMark.hmm mRNA 41416 41709 . - . ID=Merlin_44_mRNA;Parent=Merlin_44;seqid=Merlin +Merlin GeneMark.hmm exon 41416 41709 . - . ID=Merlin_44_exon;Parent=Merlin_44_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 41416 41709 . - 0 ID=Merlin_44_CDS;Parent=Merlin_44_exon;seqid=Merlin +Merlin GeneMark.hmm gene 41709 42224 -673.160274 - . ID=Merlin_45;seqid=Merlin +Merlin GeneMark.hmm mRNA 41709 42224 . - . ID=Merlin_45_mRNA;Parent=Merlin_45;seqid=Merlin +Merlin GeneMark.hmm exon 41709 42224 . - . ID=Merlin_45_exon;Parent=Merlin_45_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 41709 42224 . - 0 ID=Merlin_45_CDS;Parent=Merlin_45_exon;seqid=Merlin +Merlin GeneMark.hmm gene 42224 43951 -2203.710381 - . ID=Merlin_46;seqid=Merlin +Merlin GeneMark.hmm mRNA 42224 43951 . - . ID=Merlin_46_mRNA;Parent=Merlin_46;seqid=Merlin +Merlin GeneMark.hmm exon 42224 43951 . - . ID=Merlin_46_exon;Parent=Merlin_46_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 42224 43951 . - 0 ID=Merlin_46_CDS;Parent=Merlin_46_exon;seqid=Merlin +Merlin GeneMark.hmm gene 43951 44526 -730.479121 - . ID=Merlin_47;seqid=Merlin +Merlin GeneMark.hmm mRNA 43951 44526 . - . ID=Merlin_47_mRNA;Parent=Merlin_47;seqid=Merlin +Merlin GeneMark.hmm exon 43951 44526 . - . ID=Merlin_47_exon;Parent=Merlin_47_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 43951 44526 . - 0 ID=Merlin_47_CDS;Parent=Merlin_47_exon;seqid=Merlin +Merlin GeneMark.hmm gene 44576 45025 -562.019925 + . ID=Merlin_48;seqid=Merlin +Merlin GeneMark.hmm mRNA 44576 45025 . + . ID=Merlin_48_mRNA;Parent=Merlin_48;seqid=Merlin +Merlin GeneMark.hmm exon 44576 45025 . + . ID=Merlin_48_exon;Parent=Merlin_48_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 44576 45025 . + 0 ID=Merlin_48_CDS;Parent=Merlin_48_exon;seqid=Merlin +Merlin GeneMark.hmm gene 45025 45855 -1066.702009 + . ID=Merlin_49;seqid=Merlin +Merlin GeneMark.hmm mRNA 45025 45855 . + . ID=Merlin_49_mRNA;Parent=Merlin_49;seqid=Merlin +Merlin GeneMark.hmm exon 45025 45855 . + . ID=Merlin_49_exon;Parent=Merlin_49_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 45025 45855 . + 0 ID=Merlin_49_CDS;Parent=Merlin_49_exon;seqid=Merlin +Merlin GeneMark.hmm gene 45940 46527 -776.360306 + . ID=Merlin_50;seqid=Merlin +Merlin GeneMark.hmm mRNA 45940 46527 . + . ID=Merlin_50_mRNA;Parent=Merlin_50;seqid=Merlin +Merlin GeneMark.hmm exon 45940 46527 . + . ID=Merlin_50_exon;Parent=Merlin_50_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 45940 46527 . + 0 ID=Merlin_50_CDS;Parent=Merlin_50_exon;seqid=Merlin +Merlin GeneMark.hmm gene 46527 47255 -921.088284 + . ID=Merlin_51;seqid=Merlin +Merlin GeneMark.hmm mRNA 46527 47255 . + . ID=Merlin_51_mRNA;Parent=Merlin_51;seqid=Merlin +Merlin GeneMark.hmm exon 46527 47255 . + . ID=Merlin_51_exon;Parent=Merlin_51_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 46527 47255 . + 0 ID=Merlin_51_CDS;Parent=Merlin_51_exon;seqid=Merlin +Merlin GeneMark.hmm gene 47252 47485 -286.785634 + . ID=Merlin_52;seqid=Merlin +Merlin GeneMark.hmm mRNA 47252 47485 . + . ID=Merlin_52_mRNA;Parent=Merlin_52;seqid=Merlin +Merlin GeneMark.hmm exon 47252 47485 . + . ID=Merlin_52_exon;Parent=Merlin_52_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 47252 47485 . + 0 ID=Merlin_52_CDS;Parent=Merlin_52_exon;seqid=Merlin +Merlin GeneMark.hmm gene 47485 47940 -595.997014 + . ID=Merlin_53;seqid=Merlin +Merlin GeneMark.hmm mRNA 47485 47940 . + . ID=Merlin_53_mRNA;Parent=Merlin_53;seqid=Merlin +Merlin GeneMark.hmm exon 47485 47940 . + . ID=Merlin_53_exon;Parent=Merlin_53_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 47485 47940 . + 0 ID=Merlin_53_CDS;Parent=Merlin_53_exon;seqid=Merlin +Merlin GeneMark.hmm gene 47937 48143 -259.350499 + . ID=Merlin_54;seqid=Merlin +Merlin GeneMark.hmm mRNA 47937 48143 . + . ID=Merlin_54_mRNA;Parent=Merlin_54;seqid=Merlin +Merlin GeneMark.hmm exon 47937 48143 . + . ID=Merlin_54_exon;Parent=Merlin_54_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 47937 48143 . + 0 ID=Merlin_54_CDS;Parent=Merlin_54_exon;seqid=Merlin +Merlin GeneMark.hmm gene 48140 48358 -277.240023 + . ID=Merlin_55;seqid=Merlin +Merlin GeneMark.hmm mRNA 48140 48358 . + . ID=Merlin_55_mRNA;Parent=Merlin_55;seqid=Merlin +Merlin GeneMark.hmm exon 48140 48358 . + . ID=Merlin_55_exon;Parent=Merlin_55_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 48140 48358 . + 0 ID=Merlin_55_CDS;Parent=Merlin_55_exon;seqid=Merlin +Merlin GeneMark.hmm gene 48418 48600 -230.583168 + . ID=Merlin_56;seqid=Merlin +Merlin GeneMark.hmm mRNA 48418 48600 . + . ID=Merlin_56_mRNA;Parent=Merlin_56;seqid=Merlin +Merlin GeneMark.hmm exon 48418 48600 . + . ID=Merlin_56_exon;Parent=Merlin_56_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 48418 48600 . + 0 ID=Merlin_56_CDS;Parent=Merlin_56_exon;seqid=Merlin +Merlin GeneMark.hmm gene 48584 48769 -232.687067 + . ID=Merlin_57;seqid=Merlin +Merlin GeneMark.hmm mRNA 48584 48769 . + . ID=Merlin_57_mRNA;Parent=Merlin_57;seqid=Merlin +Merlin GeneMark.hmm exon 48584 48769 . + . ID=Merlin_57_exon;Parent=Merlin_57_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 48584 48769 . + 0 ID=Merlin_57_CDS;Parent=Merlin_57_exon;seqid=Merlin +Merlin GeneMark.hmm gene 48826 49053 -288.143395 + . ID=Merlin_58;seqid=Merlin +Merlin GeneMark.hmm mRNA 48826 49053 . + . ID=Merlin_58_mRNA;Parent=Merlin_58;seqid=Merlin +Merlin GeneMark.hmm exon 48826 49053 . + . ID=Merlin_58_exon;Parent=Merlin_58_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 48826 49053 . + 0 ID=Merlin_58_CDS;Parent=Merlin_58_exon;seqid=Merlin +Merlin GeneMark.hmm gene 49076 49432 -449.304895 + . ID=Merlin_59;seqid=Merlin +Merlin GeneMark.hmm mRNA 49076 49432 . + . ID=Merlin_59_mRNA;Parent=Merlin_59;seqid=Merlin +Merlin GeneMark.hmm exon 49076 49432 . + . ID=Merlin_59_exon;Parent=Merlin_59_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 49076 49432 . + 0 ID=Merlin_59_CDS;Parent=Merlin_59_exon;seqid=Merlin +Merlin GeneMark.hmm gene 49844 50110 -322.091381 + . ID=Merlin_60;seqid=Merlin +Merlin GeneMark.hmm mRNA 49844 50110 . + . ID=Merlin_60_mRNA;Parent=Merlin_60;seqid=Merlin +Merlin GeneMark.hmm exon 49844 50110 . + . ID=Merlin_60_exon;Parent=Merlin_60_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 49844 50110 . + 0 ID=Merlin_60_CDS;Parent=Merlin_60_exon;seqid=Merlin +Merlin GeneMark.hmm gene 50983 51234 -301.882768 + . ID=Merlin_61;seqid=Merlin +Merlin GeneMark.hmm mRNA 50983 51234 . + . ID=Merlin_61_mRNA;Parent=Merlin_61;seqid=Merlin +Merlin GeneMark.hmm exon 50983 51234 . + . ID=Merlin_61_exon;Parent=Merlin_61_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 50983 51234 . + 0 ID=Merlin_61_CDS;Parent=Merlin_61_exon;seqid=Merlin +Merlin GeneMark.hmm gene 51596 51838 -304.801536 + . ID=Merlin_62;seqid=Merlin +Merlin GeneMark.hmm mRNA 51596 51838 . + . ID=Merlin_62_mRNA;Parent=Merlin_62;seqid=Merlin +Merlin GeneMark.hmm exon 51596 51838 . + . ID=Merlin_62_exon;Parent=Merlin_62_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 51596 51838 . + 0 ID=Merlin_62_CDS;Parent=Merlin_62_exon;seqid=Merlin +Merlin GeneMark.hmm gene 51835 52182 -434.777109 + . ID=Merlin_63;seqid=Merlin +Merlin GeneMark.hmm mRNA 51835 52182 . + . ID=Merlin_63_mRNA;Parent=Merlin_63;seqid=Merlin +Merlin GeneMark.hmm exon 51835 52182 . + . ID=Merlin_63_exon;Parent=Merlin_63_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 51835 52182 . + 0 ID=Merlin_63_CDS;Parent=Merlin_63_exon;seqid=Merlin +Merlin GeneMark.hmm gene 52175 52684 -629.023983 + . ID=Merlin_64;seqid=Merlin +Merlin GeneMark.hmm mRNA 52175 52684 . + . ID=Merlin_64_mRNA;Parent=Merlin_64;seqid=Merlin +Merlin GeneMark.hmm exon 52175 52684 . + . ID=Merlin_64_exon;Parent=Merlin_64_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 52175 52684 . + 0 ID=Merlin_64_CDS;Parent=Merlin_64_exon;seqid=Merlin +Merlin GeneMark.hmm gene 52681 52827 -183.076828 + . ID=Merlin_65;seqid=Merlin +Merlin GeneMark.hmm mRNA 52681 52827 . + . ID=Merlin_65_mRNA;Parent=Merlin_65;seqid=Merlin +Merlin GeneMark.hmm exon 52681 52827 . + . ID=Merlin_65_exon;Parent=Merlin_65_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 52681 52827 . + 0 ID=Merlin_65_CDS;Parent=Merlin_65_exon;seqid=Merlin +Merlin GeneMark.hmm gene 52806 53030 -287.687980 + . ID=Merlin_66;seqid=Merlin +Merlin GeneMark.hmm mRNA 52806 53030 . + . ID=Merlin_66_mRNA;Parent=Merlin_66;seqid=Merlin +Merlin GeneMark.hmm exon 52806 53030 . + . ID=Merlin_66_exon;Parent=Merlin_66_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 52806 53030 . + 0 ID=Merlin_66_CDS;Parent=Merlin_66_exon;seqid=Merlin +Merlin GeneMark.hmm gene 53032 53475 -570.370348 + . ID=Merlin_67;seqid=Merlin +Merlin GeneMark.hmm mRNA 53032 53475 . + . ID=Merlin_67_mRNA;Parent=Merlin_67;seqid=Merlin +Merlin GeneMark.hmm exon 53032 53475 . + . ID=Merlin_67_exon;Parent=Merlin_67_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 53032 53475 . + 0 ID=Merlin_67_CDS;Parent=Merlin_67_exon;seqid=Merlin +Merlin GeneMark.hmm gene 53647 54225 -757.038069 + . ID=Merlin_68;seqid=Merlin +Merlin GeneMark.hmm mRNA 53647 54225 . + . ID=Merlin_68_mRNA;Parent=Merlin_68;seqid=Merlin +Merlin GeneMark.hmm exon 53647 54225 . + . ID=Merlin_68_exon;Parent=Merlin_68_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 53647 54225 . + 0 ID=Merlin_68_CDS;Parent=Merlin_68_exon;seqid=Merlin +Merlin GeneMark.hmm gene 54316 54516 -236.842212 + . ID=Merlin_69;seqid=Merlin +Merlin GeneMark.hmm mRNA 54316 54516 . + . ID=Merlin_69_mRNA;Parent=Merlin_69;seqid=Merlin +Merlin GeneMark.hmm exon 54316 54516 . + . ID=Merlin_69_exon;Parent=Merlin_69_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 54316 54516 . + 0 ID=Merlin_69_CDS;Parent=Merlin_69_exon;seqid=Merlin +Merlin GeneMark.hmm gene 54569 55168 -748.986136 + . ID=Merlin_70;seqid=Merlin +Merlin GeneMark.hmm mRNA 54569 55168 . + . ID=Merlin_70_mRNA;Parent=Merlin_70;seqid=Merlin +Merlin GeneMark.hmm exon 54569 55168 . + . ID=Merlin_70_exon;Parent=Merlin_70_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 54569 55168 . + 0 ID=Merlin_70_CDS;Parent=Merlin_70_exon;seqid=Merlin +Merlin GeneMark.hmm gene 55216 55860 -813.197162 + . ID=Merlin_71;seqid=Merlin +Merlin GeneMark.hmm mRNA 55216 55860 . + . ID=Merlin_71_mRNA;Parent=Merlin_71;seqid=Merlin +Merlin GeneMark.hmm exon 55216 55860 . + . ID=Merlin_71_exon;Parent=Merlin_71_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 55216 55860 . + 0 ID=Merlin_71_CDS;Parent=Merlin_71_exon;seqid=Merlin +Merlin GeneMark.hmm gene 55857 56279 -536.845669 + . ID=Merlin_72;seqid=Merlin +Merlin GeneMark.hmm mRNA 55857 56279 . + . ID=Merlin_72_mRNA;Parent=Merlin_72;seqid=Merlin +Merlin GeneMark.hmm exon 55857 56279 . + . ID=Merlin_72_exon;Parent=Merlin_72_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 55857 56279 . + 0 ID=Merlin_72_CDS;Parent=Merlin_72_exon;seqid=Merlin +Merlin GeneMark.hmm gene 56276 56644 -463.468418 + . ID=Merlin_73;seqid=Merlin +Merlin GeneMark.hmm mRNA 56276 56644 . + . ID=Merlin_73_mRNA;Parent=Merlin_73;seqid=Merlin +Merlin GeneMark.hmm exon 56276 56644 . + . ID=Merlin_73_exon;Parent=Merlin_73_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 56276 56644 . + 0 ID=Merlin_73_CDS;Parent=Merlin_73_exon;seqid=Merlin +Merlin GeneMark.hmm gene 56634 56894 -313.595651 + . ID=Merlin_74;seqid=Merlin +Merlin GeneMark.hmm mRNA 56634 56894 . + . ID=Merlin_74_mRNA;Parent=Merlin_74;seqid=Merlin +Merlin GeneMark.hmm exon 56634 56894 . + . ID=Merlin_74_exon;Parent=Merlin_74_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 56634 56894 . + 0 ID=Merlin_74_CDS;Parent=Merlin_74_exon;seqid=Merlin +Merlin GeneMark.hmm gene 56894 57172 -343.261028 + . ID=Merlin_75;seqid=Merlin +Merlin GeneMark.hmm mRNA 56894 57172 . + . ID=Merlin_75_mRNA;Parent=Merlin_75;seqid=Merlin +Merlin GeneMark.hmm exon 56894 57172 . + . ID=Merlin_75_exon;Parent=Merlin_75_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 56894 57172 . + 0 ID=Merlin_75_CDS;Parent=Merlin_75_exon;seqid=Merlin +Merlin GeneMark.hmm gene 57182 57403 -269.950515 + . ID=Merlin_76;seqid=Merlin +Merlin GeneMark.hmm mRNA 57182 57403 . + . ID=Merlin_76_mRNA;Parent=Merlin_76;seqid=Merlin +Merlin GeneMark.hmm exon 57182 57403 . + . ID=Merlin_76_exon;Parent=Merlin_76_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 57182 57403 . + 0 ID=Merlin_76_CDS;Parent=Merlin_76_exon;seqid=Merlin +Merlin GeneMark.hmm gene 57499 57786 -373.177871 + . ID=Merlin_77;seqid=Merlin +Merlin GeneMark.hmm mRNA 57499 57786 . + . ID=Merlin_77_mRNA;Parent=Merlin_77;seqid=Merlin +Merlin GeneMark.hmm exon 57499 57786 . + . ID=Merlin_77_exon;Parent=Merlin_77_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 57499 57786 . + 0 ID=Merlin_77_CDS;Parent=Merlin_77_exon;seqid=Merlin +Merlin GeneMark.hmm gene 57777 58724 -1215.940307 + . ID=Merlin_78;seqid=Merlin +Merlin GeneMark.hmm mRNA 57777 58724 . + . ID=Merlin_78_mRNA;Parent=Merlin_78;seqid=Merlin +Merlin GeneMark.hmm exon 57777 58724 . + . ID=Merlin_78_exon;Parent=Merlin_78_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 57777 58724 . + 0 ID=Merlin_78_CDS;Parent=Merlin_78_exon;seqid=Merlin +Merlin GeneMark.hmm gene 58717 58857 -173.930421 + . ID=Merlin_79;seqid=Merlin +Merlin GeneMark.hmm mRNA 58717 58857 . + . ID=Merlin_79_mRNA;Parent=Merlin_79;seqid=Merlin +Merlin GeneMark.hmm exon 58717 58857 . + . ID=Merlin_79_exon;Parent=Merlin_79_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 58717 58857 . + 0 ID=Merlin_79_CDS;Parent=Merlin_79_exon;seqid=Merlin +Merlin GeneMark.hmm gene 58872 59561 -880.645375 + . ID=Merlin_80;seqid=Merlin +Merlin GeneMark.hmm mRNA 58872 59561 . + . ID=Merlin_80_mRNA;Parent=Merlin_80;seqid=Merlin +Merlin GeneMark.hmm exon 58872 59561 . + . ID=Merlin_80_exon;Parent=Merlin_80_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 58872 59561 . + 0 ID=Merlin_80_CDS;Parent=Merlin_80_exon;seqid=Merlin +Merlin GeneMark.hmm gene 59561 59899 -428.109831 + . ID=Merlin_81;seqid=Merlin +Merlin GeneMark.hmm mRNA 59561 59899 . + . ID=Merlin_81_mRNA;Parent=Merlin_81;seqid=Merlin +Merlin GeneMark.hmm exon 59561 59899 . + . ID=Merlin_81_exon;Parent=Merlin_81_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 59561 59899 . + 0 ID=Merlin_81_CDS;Parent=Merlin_81_exon;seqid=Merlin +Merlin GeneMark.hmm gene 59896 60144 -306.923987 + . ID=Merlin_82;seqid=Merlin +Merlin GeneMark.hmm mRNA 59896 60144 . + . ID=Merlin_82_mRNA;Parent=Merlin_82;seqid=Merlin +Merlin GeneMark.hmm exon 59896 60144 . + . ID=Merlin_82_exon;Parent=Merlin_82_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 59896 60144 . + 0 ID=Merlin_82_CDS;Parent=Merlin_82_exon;seqid=Merlin +Merlin GeneMark.hmm gene 60144 60386 -304.982653 + . ID=Merlin_83;seqid=Merlin +Merlin GeneMark.hmm mRNA 60144 60386 . + . ID=Merlin_83_mRNA;Parent=Merlin_83;seqid=Merlin +Merlin GeneMark.hmm exon 60144 60386 . + . ID=Merlin_83_exon;Parent=Merlin_83_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 60144 60386 . + 0 ID=Merlin_83_CDS;Parent=Merlin_83_exon;seqid=Merlin +Merlin GeneMark.hmm gene 60379 60840 -594.547870 + . ID=Merlin_84;seqid=Merlin +Merlin GeneMark.hmm mRNA 60379 60840 . + . ID=Merlin_84_mRNA;Parent=Merlin_84;seqid=Merlin +Merlin GeneMark.hmm exon 60379 60840 . + . ID=Merlin_84_exon;Parent=Merlin_84_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 60379 60840 . + 0 ID=Merlin_84_CDS;Parent=Merlin_84_exon;seqid=Merlin +Merlin GeneMark.hmm gene 60869 61369 -617.611500 + . ID=Merlin_85;seqid=Merlin +Merlin GeneMark.hmm mRNA 60869 61369 . + . ID=Merlin_85_mRNA;Parent=Merlin_85;seqid=Merlin +Merlin GeneMark.hmm exon 60869 61369 . + . ID=Merlin_85_exon;Parent=Merlin_85_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 60869 61369 . + 0 ID=Merlin_85_CDS;Parent=Merlin_85_exon;seqid=Merlin +Merlin GeneMark.hmm gene 61356 61703 -422.353181 + . ID=Merlin_86;seqid=Merlin +Merlin GeneMark.hmm mRNA 61356 61703 . + . ID=Merlin_86_mRNA;Parent=Merlin_86;seqid=Merlin +Merlin GeneMark.hmm exon 61356 61703 . + . ID=Merlin_86_exon;Parent=Merlin_86_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 61356 61703 . + 0 ID=Merlin_86_CDS;Parent=Merlin_86_exon;seqid=Merlin +Merlin GeneMark.hmm gene 61760 62167 -519.180141 + . ID=Merlin_87;seqid=Merlin +Merlin GeneMark.hmm mRNA 61760 62167 . + . ID=Merlin_87_mRNA;Parent=Merlin_87;seqid=Merlin +Merlin GeneMark.hmm exon 61760 62167 . + . ID=Merlin_87_exon;Parent=Merlin_87_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 61760 62167 . + 0 ID=Merlin_87_CDS;Parent=Merlin_87_exon;seqid=Merlin +Merlin GeneMark.hmm gene 62359 62889 -691.422401 + . ID=Merlin_88;seqid=Merlin +Merlin GeneMark.hmm mRNA 62359 62889 . + . ID=Merlin_88_mRNA;Parent=Merlin_88;seqid=Merlin +Merlin GeneMark.hmm exon 62359 62889 . + . ID=Merlin_88_exon;Parent=Merlin_88_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 62359 62889 . + 0 ID=Merlin_88_CDS;Parent=Merlin_88_exon;seqid=Merlin +Merlin GeneMark.hmm gene 62886 63131 -315.050979 + . ID=Merlin_89;seqid=Merlin +Merlin GeneMark.hmm mRNA 62886 63131 . + . ID=Merlin_89_mRNA;Parent=Merlin_89;seqid=Merlin +Merlin GeneMark.hmm exon 62886 63131 . + . ID=Merlin_89_exon;Parent=Merlin_89_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 62886 63131 . + 0 ID=Merlin_89_CDS;Parent=Merlin_89_exon;seqid=Merlin +Merlin GeneMark.hmm gene 63124 63435 -400.565460 + . ID=Merlin_90;seqid=Merlin +Merlin GeneMark.hmm mRNA 63124 63435 . + . ID=Merlin_90_mRNA;Parent=Merlin_90;seqid=Merlin +Merlin GeneMark.hmm exon 63124 63435 . + . ID=Merlin_90_exon;Parent=Merlin_90_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 63124 63435 . + 0 ID=Merlin_90_CDS;Parent=Merlin_90_exon;seqid=Merlin +Merlin GeneMark.hmm gene 63432 63710 -335.031911 + . ID=Merlin_91;seqid=Merlin +Merlin GeneMark.hmm mRNA 63432 63710 . + . ID=Merlin_91_mRNA;Parent=Merlin_91;seqid=Merlin +Merlin GeneMark.hmm exon 63432 63710 . + . ID=Merlin_91_exon;Parent=Merlin_91_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 63432 63710 . + 0 ID=Merlin_91_CDS;Parent=Merlin_91_exon;seqid=Merlin +Merlin GeneMark.hmm gene 63710 63883 -203.175066 + . ID=Merlin_92;seqid=Merlin +Merlin GeneMark.hmm mRNA 63710 63883 . + . ID=Merlin_92_mRNA;Parent=Merlin_92;seqid=Merlin +Merlin GeneMark.hmm exon 63710 63883 . + . ID=Merlin_92_exon;Parent=Merlin_92_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 63710 63883 . + 0 ID=Merlin_92_CDS;Parent=Merlin_92_exon;seqid=Merlin +Merlin GeneMark.hmm gene 63942 64406 -597.655245 + . ID=Merlin_93;seqid=Merlin +Merlin GeneMark.hmm mRNA 63942 64406 . + . ID=Merlin_93_mRNA;Parent=Merlin_93;seqid=Merlin +Merlin GeneMark.hmm exon 63942 64406 . + . ID=Merlin_93_exon;Parent=Merlin_93_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 63942 64406 . + 0 ID=Merlin_93_CDS;Parent=Merlin_93_exon;seqid=Merlin +Merlin GeneMark.hmm gene 64414 64962 -713.810677 + . ID=Merlin_94;seqid=Merlin +Merlin GeneMark.hmm mRNA 64414 64962 . + . ID=Merlin_94_mRNA;Parent=Merlin_94;seqid=Merlin +Merlin GeneMark.hmm exon 64414 64962 . + . ID=Merlin_94_exon;Parent=Merlin_94_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 64414 64962 . + 0 ID=Merlin_94_CDS;Parent=Merlin_94_exon;seqid=Merlin +Merlin GeneMark.hmm gene 64962 65282 -412.685055 + . ID=Merlin_95;seqid=Merlin +Merlin GeneMark.hmm mRNA 64962 65282 . + . ID=Merlin_95_mRNA;Parent=Merlin_95;seqid=Merlin +Merlin GeneMark.hmm exon 64962 65282 . + . ID=Merlin_95_exon;Parent=Merlin_95_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 64962 65282 . + 0 ID=Merlin_95_CDS;Parent=Merlin_95_exon;seqid=Merlin +Merlin GeneMark.hmm gene 65303 65683 -496.639498 + . ID=Merlin_96;seqid=Merlin +Merlin GeneMark.hmm mRNA 65303 65683 . + . ID=Merlin_96_mRNA;Parent=Merlin_96;seqid=Merlin +Merlin GeneMark.hmm exon 65303 65683 . + . ID=Merlin_96_exon;Parent=Merlin_96_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 65303 65683 . + 0 ID=Merlin_96_CDS;Parent=Merlin_96_exon;seqid=Merlin +Merlin GeneMark.hmm gene 65676 66128 -573.822848 + . ID=Merlin_97;seqid=Merlin +Merlin GeneMark.hmm mRNA 65676 66128 . + . ID=Merlin_97_mRNA;Parent=Merlin_97;seqid=Merlin +Merlin GeneMark.hmm exon 65676 66128 . + . ID=Merlin_97_exon;Parent=Merlin_97_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 65676 66128 . + 0 ID=Merlin_97_CDS;Parent=Merlin_97_exon;seqid=Merlin +Merlin GeneMark.hmm gene 66128 66337 -267.423513 + . ID=Merlin_98;seqid=Merlin +Merlin GeneMark.hmm mRNA 66128 66337 . + . ID=Merlin_98_mRNA;Parent=Merlin_98;seqid=Merlin +Merlin GeneMark.hmm exon 66128 66337 . + . ID=Merlin_98_exon;Parent=Merlin_98_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 66128 66337 . + 0 ID=Merlin_98_CDS;Parent=Merlin_98_exon;seqid=Merlin +Merlin GeneMark.hmm gene 66328 66507 -214.194539 + . ID=Merlin_99;seqid=Merlin +Merlin GeneMark.hmm mRNA 66328 66507 . + . ID=Merlin_99_mRNA;Parent=Merlin_99;seqid=Merlin +Merlin GeneMark.hmm exon 66328 66507 . + . ID=Merlin_99_exon;Parent=Merlin_99_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 66328 66507 . + 0 ID=Merlin_99_CDS;Parent=Merlin_99_exon;seqid=Merlin +Merlin GeneMark.hmm gene 66504 66683 -217.450578 + . ID=Merlin_100;seqid=Merlin +Merlin GeneMark.hmm mRNA 66504 66683 . + . ID=Merlin_100_mRNA;Parent=Merlin_100;seqid=Merlin +Merlin GeneMark.hmm exon 66504 66683 . + . ID=Merlin_100_exon;Parent=Merlin_100_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 66504 66683 . + 0 ID=Merlin_100_CDS;Parent=Merlin_100_exon;seqid=Merlin +Merlin GeneMark.hmm gene 66680 66871 -235.908196 + . ID=Merlin_101;seqid=Merlin +Merlin GeneMark.hmm mRNA 66680 66871 . + . ID=Merlin_101_mRNA;Parent=Merlin_101;seqid=Merlin +Merlin GeneMark.hmm exon 66680 66871 . + . ID=Merlin_101_exon;Parent=Merlin_101_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 66680 66871 . + 0 ID=Merlin_101_CDS;Parent=Merlin_101_exon;seqid=Merlin +Merlin GeneMark.hmm gene 66873 67058 -233.275820 + . ID=Merlin_102;seqid=Merlin +Merlin GeneMark.hmm mRNA 66873 67058 . + . ID=Merlin_102_mRNA;Parent=Merlin_102;seqid=Merlin +Merlin GeneMark.hmm exon 66873 67058 . + . ID=Merlin_102_exon;Parent=Merlin_102_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 66873 67058 . + 0 ID=Merlin_102_CDS;Parent=Merlin_102_exon;seqid=Merlin +Merlin GeneMark.hmm gene 67058 67267 -264.096823 + . ID=Merlin_103;seqid=Merlin +Merlin GeneMark.hmm mRNA 67058 67267 . + . ID=Merlin_103_mRNA;Parent=Merlin_103;seqid=Merlin +Merlin GeneMark.hmm exon 67058 67267 . + . ID=Merlin_103_exon;Parent=Merlin_103_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 67058 67267 . + 0 ID=Merlin_103_CDS;Parent=Merlin_103_exon;seqid=Merlin +Merlin GeneMark.hmm gene 67267 67845 -752.300357 + . ID=Merlin_104;seqid=Merlin +Merlin GeneMark.hmm mRNA 67267 67845 . + . ID=Merlin_104_mRNA;Parent=Merlin_104;seqid=Merlin +Merlin GeneMark.hmm exon 67267 67845 . + . ID=Merlin_104_exon;Parent=Merlin_104_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 67267 67845 . + 0 ID=Merlin_104_CDS;Parent=Merlin_104_exon;seqid=Merlin +Merlin GeneMark.hmm gene 67970 68128 -196.227328 + . ID=Merlin_105;seqid=Merlin +Merlin GeneMark.hmm mRNA 67970 68128 . + . ID=Merlin_105_mRNA;Parent=Merlin_105;seqid=Merlin +Merlin GeneMark.hmm exon 67970 68128 . + . ID=Merlin_105_exon;Parent=Merlin_105_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 67970 68128 . + 0 ID=Merlin_105_CDS;Parent=Merlin_105_exon;seqid=Merlin +Merlin GeneMark.hmm gene 68125 68280 -186.665512 + . ID=Merlin_106;seqid=Merlin +Merlin GeneMark.hmm mRNA 68125 68280 . + . ID=Merlin_106_mRNA;Parent=Merlin_106;seqid=Merlin +Merlin GeneMark.hmm exon 68125 68280 . + . ID=Merlin_106_exon;Parent=Merlin_106_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 68125 68280 . + 0 ID=Merlin_106_CDS;Parent=Merlin_106_exon;seqid=Merlin +Merlin GeneMark.hmm gene 68345 68728 -480.408576 + . ID=Merlin_107;seqid=Merlin +Merlin GeneMark.hmm mRNA 68345 68728 . + . ID=Merlin_107_mRNA;Parent=Merlin_107;seqid=Merlin +Merlin GeneMark.hmm exon 68345 68728 . + . ID=Merlin_107_exon;Parent=Merlin_107_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 68345 68728 . + 0 ID=Merlin_107_CDS;Parent=Merlin_107_exon;seqid=Merlin +Merlin GeneMark.hmm gene 68787 68999 -267.936260 + . ID=Merlin_108;seqid=Merlin +Merlin GeneMark.hmm mRNA 68787 68999 . + . ID=Merlin_108_mRNA;Parent=Merlin_108;seqid=Merlin +Merlin GeneMark.hmm exon 68787 68999 . + . ID=Merlin_108_exon;Parent=Merlin_108_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 68787 68999 . + 0 ID=Merlin_108_CDS;Parent=Merlin_108_exon;seqid=Merlin +Merlin GeneMark.hmm gene 69008 69295 -369.655354 + . ID=Merlin_109;seqid=Merlin +Merlin GeneMark.hmm mRNA 69008 69295 . + . ID=Merlin_109_mRNA;Parent=Merlin_109;seqid=Merlin +Merlin GeneMark.hmm exon 69008 69295 . + . ID=Merlin_109_exon;Parent=Merlin_109_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 69008 69295 . + 0 ID=Merlin_109_CDS;Parent=Merlin_109_exon;seqid=Merlin +Merlin GeneMark.hmm gene 69285 69668 -486.207714 + . ID=Merlin_110;seqid=Merlin +Merlin GeneMark.hmm mRNA 69285 69668 . + . ID=Merlin_110_mRNA;Parent=Merlin_110;seqid=Merlin +Merlin GeneMark.hmm exon 69285 69668 . + . ID=Merlin_110_exon;Parent=Merlin_110_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 69285 69668 . + 0 ID=Merlin_110_CDS;Parent=Merlin_110_exon;seqid=Merlin +Merlin GeneMark.hmm gene 69767 69862 -119.090489 + . ID=Merlin_111;seqid=Merlin +Merlin GeneMark.hmm mRNA 69767 69862 . + . ID=Merlin_111_mRNA;Parent=Merlin_111;seqid=Merlin +Merlin GeneMark.hmm exon 69767 69862 . + . ID=Merlin_111_exon;Parent=Merlin_111_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 69767 69862 . + 0 ID=Merlin_111_CDS;Parent=Merlin_111_exon;seqid=Merlin +Merlin GeneMark.hmm gene 69859 70023 -200.738602 + . ID=Merlin_112;seqid=Merlin +Merlin GeneMark.hmm mRNA 69859 70023 . + . ID=Merlin_112_mRNA;Parent=Merlin_112;seqid=Merlin +Merlin GeneMark.hmm exon 69859 70023 . + . ID=Merlin_112_exon;Parent=Merlin_112_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 69859 70023 . + 0 ID=Merlin_112_CDS;Parent=Merlin_112_exon;seqid=Merlin +Merlin GeneMark.hmm gene 70030 70263 -281.446786 + . ID=Merlin_113;seqid=Merlin +Merlin GeneMark.hmm mRNA 70030 70263 . + . ID=Merlin_113_mRNA;Parent=Merlin_113;seqid=Merlin +Merlin GeneMark.hmm exon 70030 70263 . + . ID=Merlin_113_exon;Parent=Merlin_113_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 70030 70263 . + 0 ID=Merlin_113_CDS;Parent=Merlin_113_exon;seqid=Merlin +Merlin GeneMark.hmm gene 70263 70520 -332.653168 + . ID=Merlin_114;seqid=Merlin +Merlin GeneMark.hmm mRNA 70263 70520 . + . ID=Merlin_114_mRNA;Parent=Merlin_114;seqid=Merlin +Merlin GeneMark.hmm exon 70263 70520 . + . ID=Merlin_114_exon;Parent=Merlin_114_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 70263 70520 . + 0 ID=Merlin_114_CDS;Parent=Merlin_114_exon;seqid=Merlin +Merlin GeneMark.hmm gene 70517 70780 -336.190173 + . ID=Merlin_115;seqid=Merlin +Merlin GeneMark.hmm mRNA 70517 70780 . + . ID=Merlin_115_mRNA;Parent=Merlin_115;seqid=Merlin +Merlin GeneMark.hmm exon 70517 70780 . + . ID=Merlin_115_exon;Parent=Merlin_115_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 70517 70780 . + 0 ID=Merlin_115_CDS;Parent=Merlin_115_exon;seqid=Merlin +Merlin GeneMark.hmm gene 70866 71102 -289.943350 + . ID=Merlin_116;seqid=Merlin +Merlin GeneMark.hmm mRNA 70866 71102 . + . ID=Merlin_116_mRNA;Parent=Merlin_116;seqid=Merlin +Merlin GeneMark.hmm exon 70866 71102 . + . ID=Merlin_116_exon;Parent=Merlin_116_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 70866 71102 . + 0 ID=Merlin_116_CDS;Parent=Merlin_116_exon;seqid=Merlin +Merlin GeneMark.hmm gene 71092 71571 -594.658724 + . ID=Merlin_117;seqid=Merlin +Merlin GeneMark.hmm mRNA 71092 71571 . + . ID=Merlin_117_mRNA;Parent=Merlin_117;seqid=Merlin +Merlin GeneMark.hmm exon 71092 71571 . + . ID=Merlin_117_exon;Parent=Merlin_117_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 71092 71571 . + 0 ID=Merlin_117_CDS;Parent=Merlin_117_exon;seqid=Merlin +Merlin GeneMark.hmm gene 71574 72116 -686.096724 + . ID=Merlin_118;seqid=Merlin +Merlin GeneMark.hmm mRNA 71574 72116 . + . ID=Merlin_118_mRNA;Parent=Merlin_118;seqid=Merlin +Merlin GeneMark.hmm exon 71574 72116 . + . ID=Merlin_118_exon;Parent=Merlin_118_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 71574 72116 . + 0 ID=Merlin_118_CDS;Parent=Merlin_118_exon;seqid=Merlin +Merlin GeneMark.hmm gene 72116 73126 -1269.074513 + . ID=Merlin_119;seqid=Merlin +Merlin GeneMark.hmm mRNA 72116 73126 . + . ID=Merlin_119_mRNA;Parent=Merlin_119;seqid=Merlin +Merlin GeneMark.hmm exon 72116 73126 . + . ID=Merlin_119_exon;Parent=Merlin_119_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 72116 73126 . + 0 ID=Merlin_119_CDS;Parent=Merlin_119_exon;seqid=Merlin +Merlin GeneMark.hmm gene 73123 73359 -314.305354 + . ID=Merlin_120;seqid=Merlin +Merlin GeneMark.hmm mRNA 73123 73359 . + . ID=Merlin_120_mRNA;Parent=Merlin_120;seqid=Merlin +Merlin GeneMark.hmm exon 73123 73359 . + . ID=Merlin_120_exon;Parent=Merlin_120_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 73123 73359 . + 0 ID=Merlin_120_CDS;Parent=Merlin_120_exon;seqid=Merlin +Merlin GeneMark.hmm gene 73461 73631 -201.815396 + . ID=Merlin_121;seqid=Merlin +Merlin GeneMark.hmm mRNA 73461 73631 . + . ID=Merlin_121_mRNA;Parent=Merlin_121;seqid=Merlin +Merlin GeneMark.hmm exon 73461 73631 . + . ID=Merlin_121_exon;Parent=Merlin_121_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 73461 73631 . + 0 ID=Merlin_121_CDS;Parent=Merlin_121_exon;seqid=Merlin +Merlin GeneMark.hmm gene 73721 74698 -1210.601194 + . ID=Merlin_122;seqid=Merlin +Merlin GeneMark.hmm mRNA 73721 74698 . + . ID=Merlin_122_mRNA;Parent=Merlin_122;seqid=Merlin +Merlin GeneMark.hmm exon 73721 74698 . + . ID=Merlin_122_exon;Parent=Merlin_122_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 73721 74698 . + 0 ID=Merlin_122_CDS;Parent=Merlin_122_exon;seqid=Merlin +Merlin GeneMark.hmm gene 74744 74893 -185.633773 + . ID=Merlin_123;seqid=Merlin +Merlin GeneMark.hmm mRNA 74744 74893 . + . ID=Merlin_123_mRNA;Parent=Merlin_123;seqid=Merlin +Merlin GeneMark.hmm exon 74744 74893 . + . ID=Merlin_123_exon;Parent=Merlin_123_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 74744 74893 . + 0 ID=Merlin_123_CDS;Parent=Merlin_123_exon;seqid=Merlin +Merlin GeneMark.hmm gene 74890 75141 -315.506963 + . ID=Merlin_124;seqid=Merlin +Merlin GeneMark.hmm mRNA 74890 75141 . + . ID=Merlin_124_mRNA;Parent=Merlin_124;seqid=Merlin +Merlin GeneMark.hmm exon 74890 75141 . + . ID=Merlin_124_exon;Parent=Merlin_124_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 74890 75141 . + 0 ID=Merlin_124_CDS;Parent=Merlin_124_exon;seqid=Merlin +Merlin GeneMark.hmm gene 75141 75602 -594.209518 + . ID=Merlin_125;seqid=Merlin +Merlin GeneMark.hmm mRNA 75141 75602 . + . ID=Merlin_125_mRNA;Parent=Merlin_125;seqid=Merlin +Merlin GeneMark.hmm exon 75141 75602 . + . ID=Merlin_125_exon;Parent=Merlin_125_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 75141 75602 . + 0 ID=Merlin_125_CDS;Parent=Merlin_125_exon;seqid=Merlin +Merlin GeneMark.hmm gene 75602 75865 -344.721707 + . ID=Merlin_126;seqid=Merlin +Merlin GeneMark.hmm mRNA 75602 75865 . + . ID=Merlin_126_mRNA;Parent=Merlin_126;seqid=Merlin +Merlin GeneMark.hmm exon 75602 75865 . + . ID=Merlin_126_exon;Parent=Merlin_126_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 75602 75865 . + 0 ID=Merlin_126_CDS;Parent=Merlin_126_exon;seqid=Merlin +Merlin GeneMark.hmm gene 75856 76044 -230.523164 + . ID=Merlin_127;seqid=Merlin +Merlin GeneMark.hmm mRNA 75856 76044 . + . ID=Merlin_127_mRNA;Parent=Merlin_127;seqid=Merlin +Merlin GeneMark.hmm exon 75856 76044 . + . ID=Merlin_127_exon;Parent=Merlin_127_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 75856 76044 . + 0 ID=Merlin_127_CDS;Parent=Merlin_127_exon;seqid=Merlin +Merlin GeneMark.hmm gene 76041 76367 -416.228479 + . ID=Merlin_128;seqid=Merlin +Merlin GeneMark.hmm mRNA 76041 76367 . + . ID=Merlin_128_mRNA;Parent=Merlin_128;seqid=Merlin +Merlin GeneMark.hmm exon 76041 76367 . + . ID=Merlin_128_exon;Parent=Merlin_128_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 76041 76367 . + 0 ID=Merlin_128_CDS;Parent=Merlin_128_exon;seqid=Merlin +Merlin GeneMark.hmm gene 76546 77334 -987.711287 + . ID=Merlin_129;seqid=Merlin +Merlin GeneMark.hmm mRNA 76546 77334 . + . ID=Merlin_129_mRNA;Parent=Merlin_129;seqid=Merlin +Merlin GeneMark.hmm exon 76546 77334 . + . ID=Merlin_129_exon;Parent=Merlin_129_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 76546 77334 . + 0 ID=Merlin_129_CDS;Parent=Merlin_129_exon;seqid=Merlin +Merlin GeneMark.hmm gene 77420 78424 -1261.524373 + . ID=Merlin_130;seqid=Merlin +Merlin GeneMark.hmm mRNA 77420 78424 . + . ID=Merlin_130_mRNA;Parent=Merlin_130;seqid=Merlin +Merlin GeneMark.hmm exon 77420 78424 . + . ID=Merlin_130_exon;Parent=Merlin_130_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 77420 78424 . + 0 ID=Merlin_130_CDS;Parent=Merlin_130_exon;seqid=Merlin +Merlin GeneMark.hmm gene 78417 78707 -360.350742 + . ID=Merlin_131;seqid=Merlin +Merlin GeneMark.hmm mRNA 78417 78707 . + . ID=Merlin_131_mRNA;Parent=Merlin_131;seqid=Merlin +Merlin GeneMark.hmm exon 78417 78707 . + . ID=Merlin_131_exon;Parent=Merlin_131_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 78417 78707 . + 0 ID=Merlin_131_CDS;Parent=Merlin_131_exon;seqid=Merlin +Merlin GeneMark.hmm gene 78704 79111 -518.845840 + . ID=Merlin_132;seqid=Merlin +Merlin GeneMark.hmm mRNA 78704 79111 . + . ID=Merlin_132_mRNA;Parent=Merlin_132;seqid=Merlin +Merlin GeneMark.hmm exon 78704 79111 . + . ID=Merlin_132_exon;Parent=Merlin_132_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 78704 79111 . + 0 ID=Merlin_132_CDS;Parent=Merlin_132_exon;seqid=Merlin +Merlin GeneMark.hmm gene 79111 79617 -613.282382 + . ID=Merlin_133;seqid=Merlin +Merlin GeneMark.hmm mRNA 79111 79617 . + . ID=Merlin_133_mRNA;Parent=Merlin_133;seqid=Merlin +Merlin GeneMark.hmm exon 79111 79617 . + . ID=Merlin_133_exon;Parent=Merlin_133_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 79111 79617 . + 0 ID=Merlin_133_CDS;Parent=Merlin_133_exon;seqid=Merlin +Merlin GeneMark.hmm gene 79614 79919 -369.305081 + . ID=Merlin_134;seqid=Merlin +Merlin GeneMark.hmm mRNA 79614 79919 . + . ID=Merlin_134_mRNA;Parent=Merlin_134;seqid=Merlin +Merlin GeneMark.hmm exon 79614 79919 . + . ID=Merlin_134_exon;Parent=Merlin_134_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 79614 79919 . + 0 ID=Merlin_134_CDS;Parent=Merlin_134_exon;seqid=Merlin +Merlin GeneMark.hmm gene 79933 80160 -288.575732 + . ID=Merlin_135;seqid=Merlin +Merlin GeneMark.hmm mRNA 79933 80160 . + . ID=Merlin_135_mRNA;Parent=Merlin_135;seqid=Merlin +Merlin GeneMark.hmm exon 79933 80160 . + . ID=Merlin_135_exon;Parent=Merlin_135_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 79933 80160 . + 0 ID=Merlin_135_CDS;Parent=Merlin_135_exon;seqid=Merlin +Merlin GeneMark.hmm gene 80154 80417 -324.958009 + . ID=Merlin_136;seqid=Merlin +Merlin GeneMark.hmm mRNA 80154 80417 . + . ID=Merlin_136_mRNA;Parent=Merlin_136;seqid=Merlin +Merlin GeneMark.hmm exon 80154 80417 . + . ID=Merlin_136_exon;Parent=Merlin_136_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 80154 80417 . + 0 ID=Merlin_136_CDS;Parent=Merlin_136_exon;seqid=Merlin +Merlin GeneMark.hmm gene 80414 80623 -254.916892 + . ID=Merlin_137;seqid=Merlin +Merlin GeneMark.hmm mRNA 80414 80623 . + . ID=Merlin_137_mRNA;Parent=Merlin_137;seqid=Merlin +Merlin GeneMark.hmm exon 80414 80623 . + . ID=Merlin_137_exon;Parent=Merlin_137_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 80414 80623 . + 0 ID=Merlin_137_CDS;Parent=Merlin_137_exon;seqid=Merlin +Merlin GeneMark.hmm gene 80620 80949 -405.138197 + . ID=Merlin_138;seqid=Merlin +Merlin GeneMark.hmm mRNA 80620 80949 . + . ID=Merlin_138_mRNA;Parent=Merlin_138;seqid=Merlin +Merlin GeneMark.hmm exon 80620 80949 . + . ID=Merlin_138_exon;Parent=Merlin_138_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 80620 80949 . + 0 ID=Merlin_138_CDS;Parent=Merlin_138_exon;seqid=Merlin +Merlin GeneMark.hmm gene 80939 81091 -189.705268 + . ID=Merlin_139;seqid=Merlin +Merlin GeneMark.hmm mRNA 80939 81091 . + . ID=Merlin_139_mRNA;Parent=Merlin_139;seqid=Merlin +Merlin GeneMark.hmm exon 80939 81091 . + . ID=Merlin_139_exon;Parent=Merlin_139_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 80939 81091 . + 0 ID=Merlin_139_CDS;Parent=Merlin_139_exon;seqid=Merlin +Merlin GeneMark.hmm gene 81088 81396 -379.041172 + . ID=Merlin_140;seqid=Merlin +Merlin GeneMark.hmm mRNA 81088 81396 . + . ID=Merlin_140_mRNA;Parent=Merlin_140;seqid=Merlin +Merlin GeneMark.hmm exon 81088 81396 . + . ID=Merlin_140_exon;Parent=Merlin_140_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 81088 81396 . + 0 ID=Merlin_140_CDS;Parent=Merlin_140_exon;seqid=Merlin +Merlin GeneMark.hmm gene 81381 81527 -178.904000 + . ID=Merlin_141;seqid=Merlin +Merlin GeneMark.hmm mRNA 81381 81527 . + . ID=Merlin_141_mRNA;Parent=Merlin_141;seqid=Merlin +Merlin GeneMark.hmm exon 81381 81527 . + . ID=Merlin_141_exon;Parent=Merlin_141_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 81381 81527 . + 0 ID=Merlin_141_CDS;Parent=Merlin_141_exon;seqid=Merlin +Merlin GeneMark.hmm gene 81511 81945 -531.842575 + . ID=Merlin_142;seqid=Merlin +Merlin GeneMark.hmm mRNA 81511 81945 . + . ID=Merlin_142_mRNA;Parent=Merlin_142;seqid=Merlin +Merlin GeneMark.hmm exon 81511 81945 . + . ID=Merlin_142_exon;Parent=Merlin_142_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 81511 81945 . + 0 ID=Merlin_142_CDS;Parent=Merlin_142_exon;seqid=Merlin +Merlin GeneMark.hmm gene 81945 82109 -200.193240 + . ID=Merlin_143;seqid=Merlin +Merlin GeneMark.hmm mRNA 81945 82109 . + . ID=Merlin_143_mRNA;Parent=Merlin_143;seqid=Merlin +Merlin GeneMark.hmm exon 81945 82109 . + . ID=Merlin_143_exon;Parent=Merlin_143_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 81945 82109 . + 0 ID=Merlin_143_CDS;Parent=Merlin_143_exon;seqid=Merlin +Merlin GeneMark.hmm gene 82145 82618 -597.711728 + . ID=Merlin_144;seqid=Merlin +Merlin GeneMark.hmm mRNA 82145 82618 . + . ID=Merlin_144_mRNA;Parent=Merlin_144;seqid=Merlin +Merlin GeneMark.hmm exon 82145 82618 . + . ID=Merlin_144_exon;Parent=Merlin_144_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 82145 82618 . + 0 ID=Merlin_144_CDS;Parent=Merlin_144_exon;seqid=Merlin +Merlin GeneMark.hmm gene 82615 84444 -2332.730592 + . ID=Merlin_145;seqid=Merlin +Merlin GeneMark.hmm mRNA 82615 84444 . + . ID=Merlin_145_mRNA;Parent=Merlin_145;seqid=Merlin +Merlin GeneMark.hmm exon 82615 84444 . + . ID=Merlin_145_exon;Parent=Merlin_145_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 82615 84444 . + 0 ID=Merlin_145_CDS;Parent=Merlin_145_exon;seqid=Merlin +Merlin GeneMark.hmm gene 84512 84928 -529.993287 + . ID=Merlin_146;seqid=Merlin +Merlin GeneMark.hmm mRNA 84512 84928 . + . ID=Merlin_146_mRNA;Parent=Merlin_146;seqid=Merlin +Merlin GeneMark.hmm exon 84512 84928 . + . ID=Merlin_146_exon;Parent=Merlin_146_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 84512 84928 . + 0 ID=Merlin_146_CDS;Parent=Merlin_146_exon;seqid=Merlin +Merlin GeneMark.hmm gene 85016 85309 -372.795932 + . ID=Merlin_147;seqid=Merlin +Merlin GeneMark.hmm mRNA 85016 85309 . + . ID=Merlin_147_mRNA;Parent=Merlin_147;seqid=Merlin +Merlin GeneMark.hmm exon 85016 85309 . + . ID=Merlin_147_exon;Parent=Merlin_147_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 85016 85309 . + 0 ID=Merlin_147_CDS;Parent=Merlin_147_exon;seqid=Merlin +Merlin GeneMark.hmm gene 85459 85722 -330.097448 + . ID=Merlin_148;seqid=Merlin +Merlin GeneMark.hmm mRNA 85459 85722 . + . ID=Merlin_148_mRNA;Parent=Merlin_148;seqid=Merlin +Merlin GeneMark.hmm exon 85459 85722 . + . ID=Merlin_148_exon;Parent=Merlin_148_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 85459 85722 . + 0 ID=Merlin_148_CDS;Parent=Merlin_148_exon;seqid=Merlin +Merlin GeneMark.hmm gene 85722 85910 -230.155567 + . ID=Merlin_149;seqid=Merlin +Merlin GeneMark.hmm mRNA 85722 85910 . + . ID=Merlin_149_mRNA;Parent=Merlin_149;seqid=Merlin +Merlin GeneMark.hmm exon 85722 85910 . + . ID=Merlin_149_exon;Parent=Merlin_149_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 85722 85910 . + 0 ID=Merlin_149_CDS;Parent=Merlin_149_exon;seqid=Merlin +Merlin GeneMark.hmm gene 85903 86166 -332.190142 + . ID=Merlin_150;seqid=Merlin +Merlin GeneMark.hmm mRNA 85903 86166 . + . ID=Merlin_150_mRNA;Parent=Merlin_150;seqid=Merlin +Merlin GeneMark.hmm exon 85903 86166 . + . ID=Merlin_150_exon;Parent=Merlin_150_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 85903 86166 . + 0 ID=Merlin_150_CDS;Parent=Merlin_150_exon;seqid=Merlin +Merlin GeneMark.hmm gene 86229 86555 -399.176919 + . ID=Merlin_151;seqid=Merlin +Merlin GeneMark.hmm mRNA 86229 86555 . + . ID=Merlin_151_mRNA;Parent=Merlin_151;seqid=Merlin +Merlin GeneMark.hmm exon 86229 86555 . + . ID=Merlin_151_exon;Parent=Merlin_151_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 86229 86555 . + 0 ID=Merlin_151_CDS;Parent=Merlin_151_exon;seqid=Merlin +Merlin GeneMark.hmm gene 86552 86833 -365.746982 + . ID=Merlin_152;seqid=Merlin +Merlin GeneMark.hmm mRNA 86552 86833 . + . ID=Merlin_152_mRNA;Parent=Merlin_152;seqid=Merlin +Merlin GeneMark.hmm exon 86552 86833 . + . ID=Merlin_152_exon;Parent=Merlin_152_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 86552 86833 . + 0 ID=Merlin_152_CDS;Parent=Merlin_152_exon;seqid=Merlin +Merlin GeneMark.hmm gene 86826 87074 -314.427851 + . ID=Merlin_153;seqid=Merlin +Merlin GeneMark.hmm mRNA 86826 87074 . + . ID=Merlin_153_mRNA;Parent=Merlin_153;seqid=Merlin +Merlin GeneMark.hmm exon 86826 87074 . + . ID=Merlin_153_exon;Parent=Merlin_153_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 86826 87074 . + 0 ID=Merlin_153_CDS;Parent=Merlin_153_exon;seqid=Merlin +Merlin GeneMark.hmm gene 87067 87291 -270.187122 + . ID=Merlin_154;seqid=Merlin +Merlin GeneMark.hmm mRNA 87067 87291 . + . ID=Merlin_154_mRNA;Parent=Merlin_154;seqid=Merlin +Merlin GeneMark.hmm exon 87067 87291 . + . ID=Merlin_154_exon;Parent=Merlin_154_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 87067 87291 . + 0 ID=Merlin_154_CDS;Parent=Merlin_154_exon;seqid=Merlin +Merlin GeneMark.hmm gene 87288 87548 -320.850170 + . ID=Merlin_155;seqid=Merlin +Merlin GeneMark.hmm mRNA 87288 87548 . + . ID=Merlin_155_mRNA;Parent=Merlin_155;seqid=Merlin +Merlin GeneMark.hmm exon 87288 87548 . + . ID=Merlin_155_exon;Parent=Merlin_155_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 87288 87548 . + 0 ID=Merlin_155_CDS;Parent=Merlin_155_exon;seqid=Merlin +Merlin GeneMark.hmm gene 87545 87838 -368.941897 + . ID=Merlin_156;seqid=Merlin +Merlin GeneMark.hmm mRNA 87545 87838 . + . ID=Merlin_156_mRNA;Parent=Merlin_156;seqid=Merlin +Merlin GeneMark.hmm exon 87545 87838 . + . ID=Merlin_156_exon;Parent=Merlin_156_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 87545 87838 . + 0 ID=Merlin_156_CDS;Parent=Merlin_156_exon;seqid=Merlin +Merlin GeneMark.hmm gene 87906 88445 -686.934268 + . ID=Merlin_157;seqid=Merlin +Merlin GeneMark.hmm mRNA 87906 88445 . + . ID=Merlin_157_mRNA;Parent=Merlin_157;seqid=Merlin +Merlin GeneMark.hmm exon 87906 88445 . + . ID=Merlin_157_exon;Parent=Merlin_157_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 87906 88445 . + 0 ID=Merlin_157_CDS;Parent=Merlin_157_exon;seqid=Merlin +Merlin GeneMark.hmm gene 88429 88656 -293.300141 + . ID=Merlin_158;seqid=Merlin +Merlin GeneMark.hmm mRNA 88429 88656 . + . ID=Merlin_158_mRNA;Parent=Merlin_158;seqid=Merlin +Merlin GeneMark.hmm exon 88429 88656 . + . ID=Merlin_158_exon;Parent=Merlin_158_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 88429 88656 . + 0 ID=Merlin_158_CDS;Parent=Merlin_158_exon;seqid=Merlin +Merlin GeneMark.hmm gene 88663 89031 -446.339761 + . ID=Merlin_159;seqid=Merlin +Merlin GeneMark.hmm mRNA 88663 89031 . + . ID=Merlin_159_mRNA;Parent=Merlin_159;seqid=Merlin +Merlin GeneMark.hmm exon 88663 89031 . + . ID=Merlin_159_exon;Parent=Merlin_159_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 88663 89031 . + 0 ID=Merlin_159_CDS;Parent=Merlin_159_exon;seqid=Merlin +Merlin GeneMark.hmm gene 89012 89221 -255.579886 + . ID=Merlin_160;seqid=Merlin +Merlin GeneMark.hmm mRNA 89012 89221 . + . ID=Merlin_160_mRNA;Parent=Merlin_160;seqid=Merlin +Merlin GeneMark.hmm exon 89012 89221 . + . ID=Merlin_160_exon;Parent=Merlin_160_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 89012 89221 . + 0 ID=Merlin_160_CDS;Parent=Merlin_160_exon;seqid=Merlin +Merlin GeneMark.hmm gene 89206 89394 -231.007880 + . ID=Merlin_161;seqid=Merlin +Merlin GeneMark.hmm mRNA 89206 89394 . + . ID=Merlin_161_mRNA;Parent=Merlin_161;seqid=Merlin +Merlin GeneMark.hmm exon 89206 89394 . + . ID=Merlin_161_exon;Parent=Merlin_161_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 89206 89394 . + 0 ID=Merlin_161_CDS;Parent=Merlin_161_exon;seqid=Merlin +Merlin GeneMark.hmm gene 89426 89764 -419.076718 + . ID=Merlin_162;seqid=Merlin +Merlin GeneMark.hmm mRNA 89426 89764 . + . ID=Merlin_162_mRNA;Parent=Merlin_162;seqid=Merlin +Merlin GeneMark.hmm exon 89426 89764 . + . ID=Merlin_162_exon;Parent=Merlin_162_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 89426 89764 . + 0 ID=Merlin_162_CDS;Parent=Merlin_162_exon;seqid=Merlin +Merlin GeneMark.hmm gene 89826 89969 -185.055842 + . ID=Merlin_163;seqid=Merlin +Merlin GeneMark.hmm mRNA 89826 89969 . + . ID=Merlin_163_mRNA;Parent=Merlin_163;seqid=Merlin +Merlin GeneMark.hmm exon 89826 89969 . + . ID=Merlin_163_exon;Parent=Merlin_163_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 89826 89969 . + 0 ID=Merlin_163_CDS;Parent=Merlin_163_exon;seqid=Merlin +Merlin GeneMark.hmm gene 89966 90988 -1312.043599 + . ID=Merlin_164;seqid=Merlin +Merlin GeneMark.hmm mRNA 89966 90988 . + . ID=Merlin_164_mRNA;Parent=Merlin_164;seqid=Merlin +Merlin GeneMark.hmm exon 89966 90988 . + . ID=Merlin_164_exon;Parent=Merlin_164_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 89966 90988 . + 0 ID=Merlin_164_CDS;Parent=Merlin_164_exon;seqid=Merlin +Merlin GeneMark.hmm gene 90985 91191 -254.724476 + . ID=Merlin_165;seqid=Merlin +Merlin GeneMark.hmm mRNA 90985 91191 . + . ID=Merlin_165_mRNA;Parent=Merlin_165;seqid=Merlin +Merlin GeneMark.hmm exon 90985 91191 . + . ID=Merlin_165_exon;Parent=Merlin_165_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 90985 91191 . + 0 ID=Merlin_165_CDS;Parent=Merlin_165_exon;seqid=Merlin +Merlin GeneMark.hmm gene 91188 92870 -2159.860384 + . ID=Merlin_166;seqid=Merlin +Merlin GeneMark.hmm mRNA 91188 92870 . + . ID=Merlin_166_mRNA;Parent=Merlin_166;seqid=Merlin +Merlin GeneMark.hmm exon 91188 92870 . + . ID=Merlin_166_exon;Parent=Merlin_166_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 91188 92870 . + 0 ID=Merlin_166_CDS;Parent=Merlin_166_exon;seqid=Merlin +Merlin GeneMark.hmm gene 92867 93058 -240.822321 + . ID=Merlin_167;seqid=Merlin +Merlin GeneMark.hmm mRNA 92867 93058 . + . ID=Merlin_167_mRNA;Parent=Merlin_167;seqid=Merlin +Merlin GeneMark.hmm exon 92867 93058 . + . ID=Merlin_167_exon;Parent=Merlin_167_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 92867 93058 . + 0 ID=Merlin_167_CDS;Parent=Merlin_167_exon;seqid=Merlin +Merlin GeneMark.hmm gene 93067 93450 -466.762497 + . ID=Merlin_168;seqid=Merlin +Merlin GeneMark.hmm mRNA 93067 93450 . + . ID=Merlin_168_mRNA;Parent=Merlin_168;seqid=Merlin +Merlin GeneMark.hmm exon 93067 93450 . + . ID=Merlin_168_exon;Parent=Merlin_168_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 93067 93450 . + 0 ID=Merlin_168_CDS;Parent=Merlin_168_exon;seqid=Merlin +Merlin GeneMark.hmm gene 93469 94155 -853.161656 + . ID=Merlin_169;seqid=Merlin +Merlin GeneMark.hmm mRNA 93469 94155 . + . ID=Merlin_169_mRNA;Parent=Merlin_169;seqid=Merlin +Merlin GeneMark.hmm exon 93469 94155 . + . ID=Merlin_169_exon;Parent=Merlin_169_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 93469 94155 . + 0 ID=Merlin_169_CDS;Parent=Merlin_169_exon;seqid=Merlin +Merlin GeneMark.hmm gene 94209 95174 -1219.402057 + . ID=Merlin_170;seqid=Merlin +Merlin GeneMark.hmm mRNA 94209 95174 . + . ID=Merlin_170_mRNA;Parent=Merlin_170;seqid=Merlin +Merlin GeneMark.hmm exon 94209 95174 . + . ID=Merlin_170_exon;Parent=Merlin_170_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 94209 95174 . + 0 ID=Merlin_170_CDS;Parent=Merlin_170_exon;seqid=Merlin +Merlin GeneMark.hmm gene 95174 95737 -724.605488 + . ID=Merlin_171;seqid=Merlin +Merlin GeneMark.hmm mRNA 95174 95737 . + . ID=Merlin_171_mRNA;Parent=Merlin_171;seqid=Merlin +Merlin GeneMark.hmm exon 95174 95737 . + . ID=Merlin_171_exon;Parent=Merlin_171_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 95174 95737 . + 0 ID=Merlin_171_CDS;Parent=Merlin_171_exon;seqid=Merlin +Merlin GeneMark.hmm gene 95731 96108 -464.835446 + . ID=Merlin_172;seqid=Merlin +Merlin GeneMark.hmm mRNA 95731 96108 . + . ID=Merlin_172_mRNA;Parent=Merlin_172;seqid=Merlin +Merlin GeneMark.hmm exon 95731 96108 . + . ID=Merlin_172_exon;Parent=Merlin_172_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 95731 96108 . + 0 ID=Merlin_172_CDS;Parent=Merlin_172_exon;seqid=Merlin +Merlin GeneMark.hmm gene 96110 96331 -276.260456 + . ID=Merlin_173;seqid=Merlin +Merlin GeneMark.hmm mRNA 96110 96331 . + . ID=Merlin_173_mRNA;Parent=Merlin_173;seqid=Merlin +Merlin GeneMark.hmm exon 96110 96331 . + . ID=Merlin_173_exon;Parent=Merlin_173_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 96110 96331 . + 0 ID=Merlin_173_CDS;Parent=Merlin_173_exon;seqid=Merlin +Merlin GeneMark.hmm gene 96426 99116 -3385.938661 + . ID=Merlin_174;seqid=Merlin +Merlin GeneMark.hmm mRNA 96426 99116 . + . ID=Merlin_174_mRNA;Parent=Merlin_174;seqid=Merlin +Merlin GeneMark.hmm exon 96426 99116 . + . ID=Merlin_174_exon;Parent=Merlin_174_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 96426 99116 . + 0 ID=Merlin_174_CDS;Parent=Merlin_174_exon;seqid=Merlin +Merlin GeneMark.hmm gene 99179 99418 -294.745409 + . ID=Merlin_175;seqid=Merlin +Merlin GeneMark.hmm mRNA 99179 99418 . + . ID=Merlin_175_mRNA;Parent=Merlin_175;seqid=Merlin +Merlin GeneMark.hmm exon 99179 99418 . + . ID=Merlin_175_exon;Parent=Merlin_175_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 99179 99418 . + 0 ID=Merlin_175_CDS;Parent=Merlin_175_exon;seqid=Merlin +Merlin GeneMark.hmm gene 99455 99895 -551.164186 + . ID=Merlin_176;seqid=Merlin +Merlin GeneMark.hmm mRNA 99455 99895 . + . ID=Merlin_176_mRNA;Parent=Merlin_176;seqid=Merlin +Merlin GeneMark.hmm exon 99455 99895 . + . ID=Merlin_176_exon;Parent=Merlin_176_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 99455 99895 . + 0 ID=Merlin_176_CDS;Parent=Merlin_176_exon;seqid=Merlin +Merlin GeneMark.hmm gene 99928 100140 -262.065624 + . ID=Merlin_177;seqid=Merlin +Merlin GeneMark.hmm mRNA 99928 100140 . + . ID=Merlin_177_mRNA;Parent=Merlin_177;seqid=Merlin +Merlin GeneMark.hmm exon 99928 100140 . + . ID=Merlin_177_exon;Parent=Merlin_177_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 99928 100140 . + 0 ID=Merlin_177_CDS;Parent=Merlin_177_exon;seqid=Merlin +Merlin GeneMark.hmm gene 100137 100877 -927.530517 + . ID=Merlin_178;seqid=Merlin +Merlin GeneMark.hmm mRNA 100137 100877 . + . ID=Merlin_178_mRNA;Parent=Merlin_178;seqid=Merlin +Merlin GeneMark.hmm exon 100137 100877 . + . ID=Merlin_178_exon;Parent=Merlin_178_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 100137 100877 . + 0 ID=Merlin_178_CDS;Parent=Merlin_178_exon;seqid=Merlin +Merlin GeneMark.hmm gene 100868 101704 -1058.313313 + . ID=Merlin_179;seqid=Merlin +Merlin GeneMark.hmm mRNA 100868 101704 . + . ID=Merlin_179_mRNA;Parent=Merlin_179;seqid=Merlin +Merlin GeneMark.hmm exon 100868 101704 . + . ID=Merlin_179_exon;Parent=Merlin_179_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 100868 101704 . + 0 ID=Merlin_179_CDS;Parent=Merlin_179_exon;seqid=Merlin +Merlin GeneMark.hmm gene 101701 102777 -1345.602625 + . ID=Merlin_180;seqid=Merlin +Merlin GeneMark.hmm mRNA 101701 102777 . + . ID=Merlin_180_mRNA;Parent=Merlin_180;seqid=Merlin +Merlin GeneMark.hmm exon 101701 102777 . + . ID=Merlin_180_exon;Parent=Merlin_180_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 101701 102777 . + 0 ID=Merlin_180_CDS;Parent=Merlin_180_exon;seqid=Merlin +Merlin GeneMark.hmm gene 102885 104072 -1483.608352 + . ID=Merlin_181;seqid=Merlin +Merlin GeneMark.hmm mRNA 102885 104072 . + . ID=Merlin_181_mRNA;Parent=Merlin_181;seqid=Merlin +Merlin GeneMark.hmm exon 102885 104072 . + . ID=Merlin_181_exon;Parent=Merlin_181_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 102885 104072 . + 0 ID=Merlin_181_CDS;Parent=Merlin_181_exon;seqid=Merlin +Merlin GeneMark.hmm gene 104072 104422 -451.869493 + . ID=Merlin_182;seqid=Merlin +Merlin GeneMark.hmm mRNA 104072 104422 . + . ID=Merlin_182_mRNA;Parent=Merlin_182;seqid=Merlin +Merlin GeneMark.hmm exon 104072 104422 . + . ID=Merlin_182_exon;Parent=Merlin_182_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 104072 104422 . + 0 ID=Merlin_182_CDS;Parent=Merlin_182_exon;seqid=Merlin +Merlin GeneMark.hmm gene 104500 105867 -1730.587045 + . ID=Merlin_183;seqid=Merlin +Merlin GeneMark.hmm mRNA 104500 105867 . + . ID=Merlin_183_mRNA;Parent=Merlin_183;seqid=Merlin +Merlin GeneMark.hmm exon 104500 105867 . + . ID=Merlin_183_exon;Parent=Merlin_183_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 104500 105867 . + 0 ID=Merlin_183_CDS;Parent=Merlin_183_exon;seqid=Merlin +Merlin GeneMark.hmm gene 105928 106209 -352.988779 + . ID=Merlin_184;seqid=Merlin +Merlin GeneMark.hmm mRNA 105928 106209 . + . ID=Merlin_184_mRNA;Parent=Merlin_184;seqid=Merlin +Merlin GeneMark.hmm exon 105928 106209 . + . ID=Merlin_184_exon;Parent=Merlin_184_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 105928 106209 . + 0 ID=Merlin_184_CDS;Parent=Merlin_184_exon;seqid=Merlin +Merlin GeneMark.hmm gene 106209 106487 -351.122469 + . ID=Merlin_185;seqid=Merlin +Merlin GeneMark.hmm mRNA 106209 106487 . + . ID=Merlin_185_mRNA;Parent=Merlin_185;seqid=Merlin +Merlin GeneMark.hmm exon 106209 106487 . + . ID=Merlin_185_exon;Parent=Merlin_185_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 106209 106487 . + 0 ID=Merlin_185_CDS;Parent=Merlin_185_exon;seqid=Merlin +Merlin GeneMark.hmm gene 106487 106684 -246.970187 + . ID=Merlin_186;seqid=Merlin +Merlin GeneMark.hmm mRNA 106487 106684 . + . ID=Merlin_186_mRNA;Parent=Merlin_186;seqid=Merlin +Merlin GeneMark.hmm exon 106487 106684 . + . ID=Merlin_186_exon;Parent=Merlin_186_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 106487 106684 . + 0 ID=Merlin_186_CDS;Parent=Merlin_186_exon;seqid=Merlin +Merlin GeneMark.hmm gene 106699 107163 -615.053890 + . ID=Merlin_187;seqid=Merlin +Merlin GeneMark.hmm mRNA 106699 107163 . + . ID=Merlin_187_mRNA;Parent=Merlin_187;seqid=Merlin +Merlin GeneMark.hmm exon 106699 107163 . + . ID=Merlin_187_exon;Parent=Merlin_187_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 106699 107163 . + 0 ID=Merlin_187_CDS;Parent=Merlin_187_exon;seqid=Merlin +Merlin GeneMark.hmm gene 107200 108225 -1324.566436 + . ID=Merlin_188;seqid=Merlin +Merlin GeneMark.hmm mRNA 107200 108225 . + . ID=Merlin_188_mRNA;Parent=Merlin_188;seqid=Merlin +Merlin GeneMark.hmm exon 107200 108225 . + . ID=Merlin_188_exon;Parent=Merlin_188_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 107200 108225 . + 0 ID=Merlin_188_CDS;Parent=Merlin_188_exon;seqid=Merlin +Merlin GeneMark.hmm gene 108222 108419 -244.299886 - . ID=Merlin_189;seqid=Merlin +Merlin GeneMark.hmm mRNA 108222 108419 . - . ID=Merlin_189_mRNA;Parent=Merlin_189;seqid=Merlin +Merlin GeneMark.hmm exon 108222 108419 . - . ID=Merlin_189_exon;Parent=Merlin_189_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 108222 108419 . - 0 ID=Merlin_189_CDS;Parent=Merlin_189_exon;seqid=Merlin +Merlin GeneMark.hmm gene 108443 108727 -361.722638 + . ID=Merlin_190;seqid=Merlin +Merlin GeneMark.hmm mRNA 108443 108727 . + . ID=Merlin_190_mRNA;Parent=Merlin_190;seqid=Merlin +Merlin GeneMark.hmm exon 108443 108727 . + . ID=Merlin_190_exon;Parent=Merlin_190_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 108443 108727 . + 0 ID=Merlin_190_CDS;Parent=Merlin_190_exon;seqid=Merlin +Merlin GeneMark.hmm gene 108746 109267 -660.122856 + . ID=Merlin_191;seqid=Merlin +Merlin GeneMark.hmm mRNA 108746 109267 . + . ID=Merlin_191_mRNA;Parent=Merlin_191;seqid=Merlin +Merlin GeneMark.hmm exon 108746 109267 . + . ID=Merlin_191_exon;Parent=Merlin_191_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 108746 109267 . + 0 ID=Merlin_191_CDS;Parent=Merlin_191_exon;seqid=Merlin +Merlin GeneMark.hmm gene 109283 109450 -207.369336 + . ID=Merlin_192;seqid=Merlin +Merlin GeneMark.hmm mRNA 109283 109450 . + . ID=Merlin_192_mRNA;Parent=Merlin_192;seqid=Merlin +Merlin GeneMark.hmm exon 109283 109450 . + . ID=Merlin_192_exon;Parent=Merlin_192_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 109283 109450 . + 0 ID=Merlin_192_CDS;Parent=Merlin_192_exon;seqid=Merlin +Merlin GeneMark.hmm gene 109463 109684 -282.485263 + . ID=Merlin_193;seqid=Merlin +Merlin GeneMark.hmm mRNA 109463 109684 . + . ID=Merlin_193_mRNA;Parent=Merlin_193;seqid=Merlin +Merlin GeneMark.hmm exon 109463 109684 . + . ID=Merlin_193_exon;Parent=Merlin_193_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 109463 109684 . + 0 ID=Merlin_193_CDS;Parent=Merlin_193_exon;seqid=Merlin +Merlin GeneMark.hmm gene 109681 109833 -188.437796 + . ID=Merlin_194;seqid=Merlin +Merlin GeneMark.hmm mRNA 109681 109833 . + . ID=Merlin_194_mRNA;Parent=Merlin_194;seqid=Merlin +Merlin GeneMark.hmm exon 109681 109833 . + . ID=Merlin_194_exon;Parent=Merlin_194_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 109681 109833 . + 0 ID=Merlin_194_CDS;Parent=Merlin_194_exon;seqid=Merlin +Merlin GeneMark.hmm gene 109868 110107 -300.363740 + . ID=Merlin_195;seqid=Merlin +Merlin GeneMark.hmm mRNA 109868 110107 . + . ID=Merlin_195_mRNA;Parent=Merlin_195;seqid=Merlin +Merlin GeneMark.hmm exon 109868 110107 . + . ID=Merlin_195_exon;Parent=Merlin_195_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 109868 110107 . + 0 ID=Merlin_195_CDS;Parent=Merlin_195_exon;seqid=Merlin +Merlin GeneMark.hmm gene 110187 110387 -242.566720 + . ID=Merlin_196;seqid=Merlin +Merlin GeneMark.hmm mRNA 110187 110387 . + . ID=Merlin_196_mRNA;Parent=Merlin_196;seqid=Merlin +Merlin GeneMark.hmm exon 110187 110387 . + . ID=Merlin_196_exon;Parent=Merlin_196_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 110187 110387 . + 0 ID=Merlin_196_CDS;Parent=Merlin_196_exon;seqid=Merlin +Merlin GeneMark.hmm gene 110384 110623 -295.174485 + . ID=Merlin_197;seqid=Merlin +Merlin GeneMark.hmm mRNA 110384 110623 . + . ID=Merlin_197_mRNA;Parent=Merlin_197;seqid=Merlin +Merlin GeneMark.hmm exon 110384 110623 . + . ID=Merlin_197_exon;Parent=Merlin_197_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 110384 110623 . + 0 ID=Merlin_197_CDS;Parent=Merlin_197_exon;seqid=Merlin +Merlin GeneMark.hmm gene 110620 111051 -544.978023 + . ID=Merlin_198;seqid=Merlin +Merlin GeneMark.hmm mRNA 110620 111051 . + . ID=Merlin_198_mRNA;Parent=Merlin_198;seqid=Merlin +Merlin GeneMark.hmm exon 110620 111051 . + . ID=Merlin_198_exon;Parent=Merlin_198_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 110620 111051 . + 0 ID=Merlin_198_CDS;Parent=Merlin_198_exon;seqid=Merlin +Merlin GeneMark.hmm gene 111101 111238 -161.794612 + . ID=Merlin_199;seqid=Merlin +Merlin GeneMark.hmm mRNA 111101 111238 . + . ID=Merlin_199_mRNA;Parent=Merlin_199;seqid=Merlin +Merlin GeneMark.hmm exon 111101 111238 . + . ID=Merlin_199_exon;Parent=Merlin_199_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 111101 111238 . + 0 ID=Merlin_199_CDS;Parent=Merlin_199_exon;seqid=Merlin +Merlin GeneMark.hmm gene 111213 111737 -670.599096 + . ID=Merlin_200;seqid=Merlin +Merlin GeneMark.hmm mRNA 111213 111737 . + . ID=Merlin_200_mRNA;Parent=Merlin_200;seqid=Merlin +Merlin GeneMark.hmm exon 111213 111737 . + . ID=Merlin_200_exon;Parent=Merlin_200_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 111213 111737 . + 0 ID=Merlin_200_CDS;Parent=Merlin_200_exon;seqid=Merlin +Merlin GeneMark.hmm gene 111737 111913 -223.231704 + . ID=Merlin_201;seqid=Merlin +Merlin GeneMark.hmm mRNA 111737 111913 . + . ID=Merlin_201_mRNA;Parent=Merlin_201;seqid=Merlin +Merlin GeneMark.hmm exon 111737 111913 . + . ID=Merlin_201_exon;Parent=Merlin_201_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 111737 111913 . + 0 ID=Merlin_201_CDS;Parent=Merlin_201_exon;seqid=Merlin +Merlin GeneMark.hmm gene 111973 112590 -802.696887 + . ID=Merlin_202;seqid=Merlin +Merlin GeneMark.hmm mRNA 111973 112590 . + . ID=Merlin_202_mRNA;Parent=Merlin_202;seqid=Merlin +Merlin GeneMark.hmm exon 111973 112590 . + . ID=Merlin_202_exon;Parent=Merlin_202_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 111973 112590 . + 0 ID=Merlin_202_CDS;Parent=Merlin_202_exon;seqid=Merlin +Merlin GeneMark.hmm gene 112676 113461 -994.252012 + . ID=Merlin_203;seqid=Merlin +Merlin GeneMark.hmm mRNA 112676 113461 . + . ID=Merlin_203_mRNA;Parent=Merlin_203;seqid=Merlin +Merlin GeneMark.hmm exon 112676 113461 . + . ID=Merlin_203_exon;Parent=Merlin_203_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 112676 113461 . + 0 ID=Merlin_203_CDS;Parent=Merlin_203_exon;seqid=Merlin +Merlin GeneMark.hmm gene 113461 113778 -389.300206 + . ID=Merlin_204;seqid=Merlin +Merlin GeneMark.hmm mRNA 113461 113778 . + . ID=Merlin_204_mRNA;Parent=Merlin_204;seqid=Merlin +Merlin GeneMark.hmm exon 113461 113778 . + . ID=Merlin_204_exon;Parent=Merlin_204_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 113461 113778 . + 0 ID=Merlin_204_CDS;Parent=Merlin_204_exon;seqid=Merlin +Merlin GeneMark.hmm gene 113787 115118 -1697.881894 + . ID=Merlin_205;seqid=Merlin +Merlin GeneMark.hmm mRNA 113787 115118 . + . ID=Merlin_205_mRNA;Parent=Merlin_205;seqid=Merlin +Merlin GeneMark.hmm exon 113787 115118 . + . ID=Merlin_205_exon;Parent=Merlin_205_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 113787 115118 . + 0 ID=Merlin_205_CDS;Parent=Merlin_205_exon;seqid=Merlin +Merlin GeneMark.hmm gene 115125 115355 -279.940476 + . ID=Merlin_206;seqid=Merlin +Merlin GeneMark.hmm mRNA 115125 115355 . + . ID=Merlin_206_mRNA;Parent=Merlin_206;seqid=Merlin +Merlin GeneMark.hmm exon 115125 115355 . + . ID=Merlin_206_exon;Parent=Merlin_206_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 115125 115355 . + 0 ID=Merlin_206_CDS;Parent=Merlin_206_exon;seqid=Merlin +Merlin GeneMark.hmm gene 115346 116038 -870.417189 + . ID=Merlin_207;seqid=Merlin +Merlin GeneMark.hmm mRNA 115346 116038 . + . ID=Merlin_207_mRNA;Parent=Merlin_207;seqid=Merlin +Merlin GeneMark.hmm exon 115346 116038 . + . ID=Merlin_207_exon;Parent=Merlin_207_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 115346 116038 . + 0 ID=Merlin_207_CDS;Parent=Merlin_207_exon;seqid=Merlin +Merlin GeneMark.hmm gene 116040 116453 -527.653367 + . ID=Merlin_208;seqid=Merlin +Merlin GeneMark.hmm mRNA 116040 116453 . + . ID=Merlin_208_mRNA;Parent=Merlin_208;seqid=Merlin +Merlin GeneMark.hmm exon 116040 116453 . + . ID=Merlin_208_exon;Parent=Merlin_208_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 116040 116453 . + 0 ID=Merlin_208_CDS;Parent=Merlin_208_exon;seqid=Merlin +Merlin GeneMark.hmm gene 116520 116714 -243.312871 + . ID=Merlin_209;seqid=Merlin +Merlin GeneMark.hmm mRNA 116520 116714 . + . ID=Merlin_209_mRNA;Parent=Merlin_209;seqid=Merlin +Merlin GeneMark.hmm exon 116520 116714 . + . ID=Merlin_209_exon;Parent=Merlin_209_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 116520 116714 . + 0 ID=Merlin_209_CDS;Parent=Merlin_209_exon;seqid=Merlin +Merlin GeneMark.hmm gene 116714 117190 -587.212745 + . ID=Merlin_210;seqid=Merlin +Merlin GeneMark.hmm mRNA 116714 117190 . + . ID=Merlin_210_mRNA;Parent=Merlin_210;seqid=Merlin +Merlin GeneMark.hmm exon 116714 117190 . + . ID=Merlin_210_exon;Parent=Merlin_210_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 116714 117190 . + 0 ID=Merlin_210_CDS;Parent=Merlin_210_exon;seqid=Merlin +Merlin GeneMark.hmm gene 117177 117371 -246.741774 + . ID=Merlin_211;seqid=Merlin +Merlin GeneMark.hmm mRNA 117177 117371 . + . ID=Merlin_211_mRNA;Parent=Merlin_211;seqid=Merlin +Merlin GeneMark.hmm exon 117177 117371 . + . ID=Merlin_211_exon;Parent=Merlin_211_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 117177 117371 . + 0 ID=Merlin_211_CDS;Parent=Merlin_211_exon;seqid=Merlin +Merlin GeneMark.hmm gene 117368 117844 -587.223837 + . ID=Merlin_212;seqid=Merlin +Merlin GeneMark.hmm mRNA 117368 117844 . + . ID=Merlin_212_mRNA;Parent=Merlin_212;seqid=Merlin +Merlin GeneMark.hmm exon 117368 117844 . + . ID=Merlin_212_exon;Parent=Merlin_212_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 117368 117844 . + 0 ID=Merlin_212_CDS;Parent=Merlin_212_exon;seqid=Merlin +Merlin GeneMark.hmm gene 117841 117939 -117.153787 + . ID=Merlin_213;seqid=Merlin +Merlin GeneMark.hmm mRNA 117841 117939 . + . ID=Merlin_213_mRNA;Parent=Merlin_213;seqid=Merlin +Merlin GeneMark.hmm exon 117841 117939 . + . ID=Merlin_213_exon;Parent=Merlin_213_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 117841 117939 . + 0 ID=Merlin_213_CDS;Parent=Merlin_213_exon;seqid=Merlin +Merlin GeneMark.hmm gene 117936 118187 -314.341261 + . ID=Merlin_214;seqid=Merlin +Merlin GeneMark.hmm mRNA 117936 118187 . + . ID=Merlin_214_mRNA;Parent=Merlin_214;seqid=Merlin +Merlin GeneMark.hmm exon 117936 118187 . + . ID=Merlin_214_exon;Parent=Merlin_214_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 117936 118187 . + 0 ID=Merlin_214_CDS;Parent=Merlin_214_exon;seqid=Merlin +Merlin GeneMark.hmm gene 118184 118411 -293.015141 + . ID=Merlin_215;seqid=Merlin +Merlin GeneMark.hmm mRNA 118184 118411 . + . ID=Merlin_215_mRNA;Parent=Merlin_215;seqid=Merlin +Merlin GeneMark.hmm exon 118184 118411 . + . ID=Merlin_215_exon;Parent=Merlin_215_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 118184 118411 . + 0 ID=Merlin_215_CDS;Parent=Merlin_215_exon;seqid=Merlin +Merlin GeneMark.hmm gene 118435 118818 -477.204459 + . ID=Merlin_216;seqid=Merlin +Merlin GeneMark.hmm mRNA 118435 118818 . + . ID=Merlin_216_mRNA;Parent=Merlin_216;seqid=Merlin +Merlin GeneMark.hmm exon 118435 118818 . + . ID=Merlin_216_exon;Parent=Merlin_216_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 118435 118818 . + 0 ID=Merlin_216_CDS;Parent=Merlin_216_exon;seqid=Merlin +Merlin GeneMark.hmm gene 118849 120690 -2259.486004 + . ID=Merlin_217;seqid=Merlin +Merlin GeneMark.hmm mRNA 118849 120690 . + . ID=Merlin_217_mRNA;Parent=Merlin_217;seqid=Merlin +Merlin GeneMark.hmm exon 118849 120690 . + . ID=Merlin_217_exon;Parent=Merlin_217_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 118849 120690 . + 0 ID=Merlin_217_CDS;Parent=Merlin_217_exon;seqid=Merlin +Merlin GeneMark.hmm gene 120730 120885 -200.778885 + . ID=Merlin_218;seqid=Merlin +Merlin GeneMark.hmm mRNA 120730 120885 . + . ID=Merlin_218_mRNA;Parent=Merlin_218;seqid=Merlin +Merlin GeneMark.hmm exon 120730 120885 . + . ID=Merlin_218_exon;Parent=Merlin_218_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 120730 120885 . + 0 ID=Merlin_218_CDS;Parent=Merlin_218_exon;seqid=Merlin +Merlin GeneMark.hmm gene 120929 121213 -363.032822 + . ID=Merlin_219;seqid=Merlin +Merlin GeneMark.hmm mRNA 120929 121213 . + . ID=Merlin_219_mRNA;Parent=Merlin_219;seqid=Merlin +Merlin GeneMark.hmm exon 120929 121213 . + . ID=Merlin_219_exon;Parent=Merlin_219_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 120929 121213 . + 0 ID=Merlin_219_CDS;Parent=Merlin_219_exon;seqid=Merlin +Merlin GeneMark.hmm gene 121200 121400 -244.392369 + . ID=Merlin_220;seqid=Merlin +Merlin GeneMark.hmm mRNA 121200 121400 . + . ID=Merlin_220_mRNA;Parent=Merlin_220;seqid=Merlin +Merlin GeneMark.hmm exon 121200 121400 . + . ID=Merlin_220_exon;Parent=Merlin_220_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 121200 121400 . + 0 ID=Merlin_220_CDS;Parent=Merlin_220_exon;seqid=Merlin +Merlin GeneMark.hmm gene 121411 123588 -2750.112191 + . ID=Merlin_221;seqid=Merlin +Merlin GeneMark.hmm mRNA 121411 123588 . + . ID=Merlin_221_mRNA;Parent=Merlin_221;seqid=Merlin +Merlin GeneMark.hmm exon 121411 123588 . + . ID=Merlin_221_exon;Parent=Merlin_221_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 121411 123588 . + 0 ID=Merlin_221_CDS;Parent=Merlin_221_exon;seqid=Merlin +Merlin GeneMark.hmm gene 123598 124494 -1129.990261 + . ID=Merlin_222;seqid=Merlin +Merlin GeneMark.hmm mRNA 123598 124494 . + . ID=Merlin_222_mRNA;Parent=Merlin_222;seqid=Merlin +Merlin GeneMark.hmm exon 123598 124494 . + . ID=Merlin_222_exon;Parent=Merlin_222_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 123598 124494 . + 0 ID=Merlin_222_CDS;Parent=Merlin_222_exon;seqid=Merlin +Merlin GeneMark.hmm gene 124494 124691 -244.507612 + . ID=Merlin_223;seqid=Merlin +Merlin GeneMark.hmm mRNA 124494 124691 . + . ID=Merlin_223_mRNA;Parent=Merlin_223;seqid=Merlin +Merlin GeneMark.hmm exon 124494 124691 . + . ID=Merlin_223_exon;Parent=Merlin_223_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 124494 124691 . + 0 ID=Merlin_223_CDS;Parent=Merlin_223_exon;seqid=Merlin +Merlin GeneMark.hmm gene 124727 125047 -399.871946 + . ID=Merlin_224;seqid=Merlin +Merlin GeneMark.hmm mRNA 124727 125047 . + . ID=Merlin_224_mRNA;Parent=Merlin_224;seqid=Merlin +Merlin GeneMark.hmm exon 124727 125047 . + . ID=Merlin_224_exon;Parent=Merlin_224_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 124727 125047 . + 0 ID=Merlin_224_CDS;Parent=Merlin_224_exon;seqid=Merlin +Merlin GeneMark.hmm gene 125097 125537 -571.759726 + . ID=Merlin_225;seqid=Merlin +Merlin GeneMark.hmm mRNA 125097 125537 . + . ID=Merlin_225_mRNA;Parent=Merlin_225;seqid=Merlin +Merlin GeneMark.hmm exon 125097 125537 . + . ID=Merlin_225_exon;Parent=Merlin_225_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 125097 125537 . + 0 ID=Merlin_225_CDS;Parent=Merlin_225_exon;seqid=Merlin +Merlin GeneMark.hmm gene 125606 125851 -292.219635 + . ID=Merlin_226;seqid=Merlin +Merlin GeneMark.hmm mRNA 125606 125851 . + . ID=Merlin_226_mRNA;Parent=Merlin_226;seqid=Merlin +Merlin GeneMark.hmm exon 125606 125851 . + . ID=Merlin_226_exon;Parent=Merlin_226_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 125606 125851 . + 0 ID=Merlin_226_CDS;Parent=Merlin_226_exon;seqid=Merlin +Merlin GeneMark.hmm gene 125848 126039 -240.766275 + . ID=Merlin_227;seqid=Merlin +Merlin GeneMark.hmm mRNA 125848 126039 . + . ID=Merlin_227_mRNA;Parent=Merlin_227;seqid=Merlin +Merlin GeneMark.hmm exon 125848 126039 . + . ID=Merlin_227_exon;Parent=Merlin_227_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 125848 126039 . + 0 ID=Merlin_227_CDS;Parent=Merlin_227_exon;seqid=Merlin +Merlin GeneMark.hmm gene 126096 126536 -555.654560 + . ID=Merlin_228;seqid=Merlin +Merlin GeneMark.hmm mRNA 126096 126536 . + . ID=Merlin_228_mRNA;Parent=Merlin_228;seqid=Merlin +Merlin GeneMark.hmm exon 126096 126536 . + . ID=Merlin_228_exon;Parent=Merlin_228_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 126096 126536 . + 0 ID=Merlin_228_CDS;Parent=Merlin_228_exon;seqid=Merlin +Merlin GeneMark.hmm gene 126843 126980 -167.572589 + . ID=Merlin_229;seqid=Merlin +Merlin GeneMark.hmm mRNA 126843 126980 . + . ID=Merlin_229_mRNA;Parent=Merlin_229;seqid=Merlin +Merlin GeneMark.hmm exon 126843 126980 . + . ID=Merlin_229_exon;Parent=Merlin_229_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 126843 126980 . + 0 ID=Merlin_229_CDS;Parent=Merlin_229_exon;seqid=Merlin +Merlin GeneMark.hmm gene 126985 128322 -1655.641432 + . ID=Merlin_230;seqid=Merlin +Merlin GeneMark.hmm mRNA 126985 128322 . + . ID=Merlin_230_mRNA;Parent=Merlin_230;seqid=Merlin +Merlin GeneMark.hmm exon 126985 128322 . + . ID=Merlin_230_exon;Parent=Merlin_230_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 126985 128322 . + 0 ID=Merlin_230_CDS;Parent=Merlin_230_exon;seqid=Merlin +Merlin GeneMark.hmm gene 128313 128453 -176.429391 + . ID=Merlin_231;seqid=Merlin +Merlin GeneMark.hmm mRNA 128313 128453 . + . ID=Merlin_231_mRNA;Parent=Merlin_231;seqid=Merlin +Merlin GeneMark.hmm exon 128313 128453 . + . ID=Merlin_231_exon;Parent=Merlin_231_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 128313 128453 . + 0 ID=Merlin_231_CDS;Parent=Merlin_231_exon;seqid=Merlin +Merlin GeneMark.hmm gene 128634 128867 -280.339767 + . ID=Merlin_232;seqid=Merlin +Merlin GeneMark.hmm mRNA 128634 128867 . + . ID=Merlin_232_mRNA;Parent=Merlin_232;seqid=Merlin +Merlin GeneMark.hmm exon 128634 128867 . + . ID=Merlin_232_exon;Parent=Merlin_232_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 128634 128867 . + 0 ID=Merlin_232_CDS;Parent=Merlin_232_exon;seqid=Merlin +Merlin GeneMark.hmm gene 128931 129194 -323.191370 + . ID=Merlin_233;seqid=Merlin +Merlin GeneMark.hmm mRNA 128931 129194 . + . ID=Merlin_233_mRNA;Parent=Merlin_233;seqid=Merlin +Merlin GeneMark.hmm exon 128931 129194 . + . ID=Merlin_233_exon;Parent=Merlin_233_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 128931 129194 . + 0 ID=Merlin_233_CDS;Parent=Merlin_233_exon;seqid=Merlin +Merlin GeneMark.hmm gene 129202 129471 -345.520317 + . ID=Merlin_234;seqid=Merlin +Merlin GeneMark.hmm mRNA 129202 129471 . + . ID=Merlin_234_mRNA;Parent=Merlin_234;seqid=Merlin +Merlin GeneMark.hmm exon 129202 129471 . + . ID=Merlin_234_exon;Parent=Merlin_234_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 129202 129471 . + 0 ID=Merlin_234_CDS;Parent=Merlin_234_exon;seqid=Merlin +Merlin GeneMark.hmm gene 129581 130225 -789.527965 + . ID=Merlin_235;seqid=Merlin +Merlin GeneMark.hmm mRNA 129581 130225 . + . ID=Merlin_235_mRNA;Parent=Merlin_235;seqid=Merlin +Merlin GeneMark.hmm exon 129581 130225 . + . ID=Merlin_235_exon;Parent=Merlin_235_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 129581 130225 . + 0 ID=Merlin_235_CDS;Parent=Merlin_235_exon;seqid=Merlin +Merlin GeneMark.hmm gene 130236 130643 -513.741632 + . ID=Merlin_236;seqid=Merlin +Merlin GeneMark.hmm mRNA 130236 130643 . + . ID=Merlin_236_mRNA;Parent=Merlin_236;seqid=Merlin +Merlin GeneMark.hmm exon 130236 130643 . + . ID=Merlin_236_exon;Parent=Merlin_236_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 130236 130643 . + 0 ID=Merlin_236_CDS;Parent=Merlin_236_exon;seqid=Merlin +Merlin GeneMark.hmm gene 130640 131017 -476.781736 + . ID=Merlin_237;seqid=Merlin +Merlin GeneMark.hmm mRNA 130640 131017 . + . ID=Merlin_237_mRNA;Parent=Merlin_237;seqid=Merlin +Merlin GeneMark.hmm exon 130640 131017 . + . ID=Merlin_237_exon;Parent=Merlin_237_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 130640 131017 . + 0 ID=Merlin_237_CDS;Parent=Merlin_237_exon;seqid=Merlin +Merlin GeneMark.hmm gene 131017 131289 -326.061964 + . ID=Merlin_238;seqid=Merlin +Merlin GeneMark.hmm mRNA 131017 131289 . + . ID=Merlin_238_mRNA;Parent=Merlin_238;seqid=Merlin +Merlin GeneMark.hmm exon 131017 131289 . + . ID=Merlin_238_exon;Parent=Merlin_238_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 131017 131289 . + 0 ID=Merlin_238_CDS;Parent=Merlin_238_exon;seqid=Merlin +Merlin GeneMark.hmm gene 131289 131597 -389.454269 + . ID=Merlin_239;seqid=Merlin +Merlin GeneMark.hmm mRNA 131289 131597 . + . ID=Merlin_239_mRNA;Parent=Merlin_239;seqid=Merlin +Merlin GeneMark.hmm exon 131289 131597 . + . ID=Merlin_239_exon;Parent=Merlin_239_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 131289 131597 . + 0 ID=Merlin_239_CDS;Parent=Merlin_239_exon;seqid=Merlin +Merlin GeneMark.hmm gene 131569 131781 -264.904995 + . ID=Merlin_240;seqid=Merlin +Merlin GeneMark.hmm mRNA 131569 131781 . + . ID=Merlin_240_mRNA;Parent=Merlin_240;seqid=Merlin +Merlin GeneMark.hmm exon 131569 131781 . + . ID=Merlin_240_exon;Parent=Merlin_240_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 131569 131781 . + 0 ID=Merlin_240_CDS;Parent=Merlin_240_exon;seqid=Merlin +Merlin GeneMark.hmm gene 131778 132191 -541.018164 + . ID=Merlin_241;seqid=Merlin +Merlin GeneMark.hmm mRNA 131778 132191 . + . ID=Merlin_241_mRNA;Parent=Merlin_241;seqid=Merlin +Merlin GeneMark.hmm exon 131778 132191 . + . ID=Merlin_241_exon;Parent=Merlin_241_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 131778 132191 . + 0 ID=Merlin_241_CDS;Parent=Merlin_241_exon;seqid=Merlin +Merlin GeneMark.hmm gene 132199 132585 -491.258919 + . ID=Merlin_242;seqid=Merlin +Merlin GeneMark.hmm mRNA 132199 132585 . + . ID=Merlin_242_mRNA;Parent=Merlin_242;seqid=Merlin +Merlin GeneMark.hmm exon 132199 132585 . + . ID=Merlin_242_exon;Parent=Merlin_242_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 132199 132585 . + 0 ID=Merlin_242_CDS;Parent=Merlin_242_exon;seqid=Merlin +Merlin GeneMark.hmm gene 132575 132847 -349.509326 + . ID=Merlin_243;seqid=Merlin +Merlin GeneMark.hmm mRNA 132575 132847 . + . ID=Merlin_243_mRNA;Parent=Merlin_243;seqid=Merlin +Merlin GeneMark.hmm exon 132575 132847 . + . ID=Merlin_243_exon;Parent=Merlin_243_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 132575 132847 . + 0 ID=Merlin_243_CDS;Parent=Merlin_243_exon;seqid=Merlin +Merlin GeneMark.hmm gene 132910 133182 -334.452325 + . ID=Merlin_244;seqid=Merlin +Merlin GeneMark.hmm mRNA 132910 133182 . + . ID=Merlin_244_mRNA;Parent=Merlin_244;seqid=Merlin +Merlin GeneMark.hmm exon 132910 133182 . + . ID=Merlin_244_exon;Parent=Merlin_244_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 132910 133182 . + 0 ID=Merlin_244_CDS;Parent=Merlin_244_exon;seqid=Merlin +Merlin GeneMark.hmm gene 133179 133835 -859.997228 - . ID=Merlin_245;seqid=Merlin +Merlin GeneMark.hmm mRNA 133179 133835 . - . ID=Merlin_245_mRNA;Parent=Merlin_245;seqid=Merlin +Merlin GeneMark.hmm exon 133179 133835 . - . ID=Merlin_245_exon;Parent=Merlin_245_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 133179 133835 . - 0 ID=Merlin_245_CDS;Parent=Merlin_245_exon;seqid=Merlin +Merlin GeneMark.hmm gene 133857 134663 -1049.900868 - . ID=Merlin_246;seqid=Merlin +Merlin GeneMark.hmm mRNA 133857 134663 . - . ID=Merlin_246_mRNA;Parent=Merlin_246;seqid=Merlin +Merlin GeneMark.hmm exon 133857 134663 . - . ID=Merlin_246_exon;Parent=Merlin_246_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 133857 134663 . - 0 ID=Merlin_246_CDS;Parent=Merlin_246_exon;seqid=Merlin +Merlin GeneMark.hmm gene 134693 137068 -3033.417419 - . ID=Merlin_247;seqid=Merlin +Merlin GeneMark.hmm mRNA 134693 137068 . - . ID=Merlin_247_mRNA;Parent=Merlin_247;seqid=Merlin +Merlin GeneMark.hmm exon 134693 137068 . - . ID=Merlin_247_exon;Parent=Merlin_247_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 134693 137068 . - 0 ID=Merlin_247_CDS;Parent=Merlin_247_exon;seqid=Merlin +Merlin GeneMark.hmm gene 137075 137734 -856.122084 - . ID=Merlin_248;seqid=Merlin +Merlin GeneMark.hmm mRNA 137075 137734 . - . ID=Merlin_248_mRNA;Parent=Merlin_248;seqid=Merlin +Merlin GeneMark.hmm exon 137075 137734 . - . ID=Merlin_248_exon;Parent=Merlin_248_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 137075 137734 . - 0 ID=Merlin_248_CDS;Parent=Merlin_248_exon;seqid=Merlin +Merlin GeneMark.hmm gene 137787 138962 -1500.330086 - . ID=Merlin_249;seqid=Merlin +Merlin GeneMark.hmm mRNA 137787 138962 . - . ID=Merlin_249_mRNA;Parent=Merlin_249;seqid=Merlin +Merlin GeneMark.hmm exon 137787 138962 . - . ID=Merlin_249_exon;Parent=Merlin_249_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 137787 138962 . - 0 ID=Merlin_249_CDS;Parent=Merlin_249_exon;seqid=Merlin +Merlin GeneMark.hmm gene 138962 142759 -4791.853068 - . ID=Merlin_250;seqid=Merlin +Merlin GeneMark.hmm mRNA 138962 142759 . - . ID=Merlin_250_mRNA;Parent=Merlin_250;seqid=Merlin +Merlin GeneMark.hmm exon 138962 142759 . - . ID=Merlin_250_exon;Parent=Merlin_250_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 138962 142759 . - 0 ID=Merlin_250_CDS;Parent=Merlin_250_exon;seqid=Merlin +Merlin GeneMark.hmm gene 142827 143753 -1151.813807 + . ID=Merlin_251;seqid=Merlin +Merlin GeneMark.hmm mRNA 142827 143753 . + . ID=Merlin_251_mRNA;Parent=Merlin_251;seqid=Merlin +Merlin GeneMark.hmm exon 142827 143753 . + . ID=Merlin_251_exon;Parent=Merlin_251_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 142827 143753 . + 0 ID=Merlin_251_CDS;Parent=Merlin_251_exon;seqid=Merlin +Merlin GeneMark.hmm gene 143743 144030 -331.847936 + . ID=Merlin_252;seqid=Merlin +Merlin GeneMark.hmm mRNA 143743 144030 . + . ID=Merlin_252_mRNA;Parent=Merlin_252;seqid=Merlin +Merlin GeneMark.hmm exon 143743 144030 . + . ID=Merlin_252_exon;Parent=Merlin_252_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 143743 144030 . + 0 ID=Merlin_252_CDS;Parent=Merlin_252_exon;seqid=Merlin +Merlin GeneMark.hmm gene 144008 144304 -369.866491 + . ID=Merlin_253;seqid=Merlin +Merlin GeneMark.hmm mRNA 144008 144304 . + . ID=Merlin_253_mRNA;Parent=Merlin_253;seqid=Merlin +Merlin GeneMark.hmm exon 144008 144304 . + . ID=Merlin_253_exon;Parent=Merlin_253_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 144008 144304 . + 0 ID=Merlin_253_CDS;Parent=Merlin_253_exon;seqid=Merlin +Merlin GeneMark.hmm gene 144301 144954 -836.139828 + . ID=Merlin_254;seqid=Merlin +Merlin GeneMark.hmm mRNA 144301 144954 . + . ID=Merlin_254_mRNA;Parent=Merlin_254;seqid=Merlin +Merlin GeneMark.hmm exon 144301 144954 . + . ID=Merlin_254_exon;Parent=Merlin_254_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 144301 144954 . + 0 ID=Merlin_254_CDS;Parent=Merlin_254_exon;seqid=Merlin +Merlin GeneMark.hmm gene 144964 145875 -1124.370545 + . ID=Merlin_255;seqid=Merlin +Merlin GeneMark.hmm mRNA 144964 145875 . + . ID=Merlin_255_mRNA;Parent=Merlin_255;seqid=Merlin +Merlin GeneMark.hmm exon 144964 145875 . + . ID=Merlin_255_exon;Parent=Merlin_255_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 144964 145875 . + 0 ID=Merlin_255_CDS;Parent=Merlin_255_exon;seqid=Merlin +Merlin GeneMark.hmm gene 145979 146218 -290.192159 + . ID=Merlin_256;seqid=Merlin +Merlin GeneMark.hmm mRNA 145979 146218 . + . ID=Merlin_256_mRNA;Parent=Merlin_256;seqid=Merlin +Merlin GeneMark.hmm exon 145979 146218 . + . ID=Merlin_256_exon;Parent=Merlin_256_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 145979 146218 . + 0 ID=Merlin_256_CDS;Parent=Merlin_256_exon;seqid=Merlin +Merlin GeneMark.hmm gene 146253 146519 -322.908748 + . ID=Merlin_257;seqid=Merlin +Merlin GeneMark.hmm mRNA 146253 146519 . + . ID=Merlin_257_mRNA;Parent=Merlin_257;seqid=Merlin +Merlin GeneMark.hmm exon 146253 146519 . + . ID=Merlin_257_exon;Parent=Merlin_257_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 146253 146519 . + 0 ID=Merlin_257_CDS;Parent=Merlin_257_exon;seqid=Merlin +Merlin GeneMark.hmm gene 146520 146744 -274.376507 + . ID=Merlin_258;seqid=Merlin +Merlin GeneMark.hmm mRNA 146520 146744 . + . ID=Merlin_258_mRNA;Parent=Merlin_258;seqid=Merlin +Merlin GeneMark.hmm exon 146520 146744 . + . ID=Merlin_258_exon;Parent=Merlin_258_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 146520 146744 . + 0 ID=Merlin_258_CDS;Parent=Merlin_258_exon;seqid=Merlin +Merlin GeneMark.hmm gene 146825 147040 -255.288456 + . ID=Merlin_259;seqid=Merlin +Merlin GeneMark.hmm mRNA 146825 147040 . + . ID=Merlin_259_mRNA;Parent=Merlin_259;seqid=Merlin +Merlin GeneMark.hmm exon 146825 147040 . + . ID=Merlin_259_exon;Parent=Merlin_259_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 146825 147040 . + 0 ID=Merlin_259_CDS;Parent=Merlin_259_exon;seqid=Merlin +Merlin GeneMark.hmm gene 147054 147419 -449.354834 + . ID=Merlin_260;seqid=Merlin +Merlin GeneMark.hmm mRNA 147054 147419 . + . ID=Merlin_260_mRNA;Parent=Merlin_260;seqid=Merlin +Merlin GeneMark.hmm exon 147054 147419 . + . ID=Merlin_260_exon;Parent=Merlin_260_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 147054 147419 . + 0 ID=Merlin_260_CDS;Parent=Merlin_260_exon;seqid=Merlin +Merlin GeneMark.hmm gene 147477 147755 -346.840279 + . ID=Merlin_261;seqid=Merlin +Merlin GeneMark.hmm mRNA 147477 147755 . + . ID=Merlin_261_mRNA;Parent=Merlin_261;seqid=Merlin +Merlin GeneMark.hmm exon 147477 147755 . + . ID=Merlin_261_exon;Parent=Merlin_261_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 147477 147755 . + 0 ID=Merlin_261_CDS;Parent=Merlin_261_exon;seqid=Merlin +Merlin GeneMark.hmm gene 147755 148078 -405.900125 + . ID=Merlin_262;seqid=Merlin +Merlin GeneMark.hmm mRNA 147755 148078 . + . ID=Merlin_262_mRNA;Parent=Merlin_262;seqid=Merlin +Merlin GeneMark.hmm exon 147755 148078 . + . ID=Merlin_262_exon;Parent=Merlin_262_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 147755 148078 . + 0 ID=Merlin_262_CDS;Parent=Merlin_262_exon;seqid=Merlin +Merlin GeneMark.hmm gene 148078 148293 -271.597843 + . ID=Merlin_263;seqid=Merlin +Merlin GeneMark.hmm mRNA 148078 148293 . + . ID=Merlin_263_mRNA;Parent=Merlin_263;seqid=Merlin +Merlin GeneMark.hmm exon 148078 148293 . + . ID=Merlin_263_exon;Parent=Merlin_263_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 148078 148293 . + 0 ID=Merlin_263_CDS;Parent=Merlin_263_exon;seqid=Merlin +Merlin GeneMark.hmm gene 148385 148636 -312.527190 + . ID=Merlin_264;seqid=Merlin +Merlin GeneMark.hmm mRNA 148385 148636 . + . ID=Merlin_264_mRNA;Parent=Merlin_264;seqid=Merlin +Merlin GeneMark.hmm exon 148385 148636 . + . ID=Merlin_264_exon;Parent=Merlin_264_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 148385 148636 . + 0 ID=Merlin_264_CDS;Parent=Merlin_264_exon;seqid=Merlin +Merlin GeneMark.hmm gene 148636 149229 -751.963856 + . ID=Merlin_265;seqid=Merlin +Merlin GeneMark.hmm mRNA 148636 149229 . + . ID=Merlin_265_mRNA;Parent=Merlin_265;seqid=Merlin +Merlin GeneMark.hmm exon 148636 149229 . + . ID=Merlin_265_exon;Parent=Merlin_265_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 148636 149229 . + 0 ID=Merlin_265_CDS;Parent=Merlin_265_exon;seqid=Merlin +Merlin GeneMark.hmm gene 149226 149555 -411.956487 + . ID=Merlin_266;seqid=Merlin +Merlin GeneMark.hmm mRNA 149226 149555 . + . ID=Merlin_266_mRNA;Parent=Merlin_266;seqid=Merlin +Merlin GeneMark.hmm exon 149226 149555 . + . ID=Merlin_266_exon;Parent=Merlin_266_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 149226 149555 . + 0 ID=Merlin_266_CDS;Parent=Merlin_266_exon;seqid=Merlin +Merlin GeneMark.hmm gene 149533 149880 -436.887846 + . ID=Merlin_267;seqid=Merlin +Merlin GeneMark.hmm mRNA 149533 149880 . + . ID=Merlin_267_mRNA;Parent=Merlin_267;seqid=Merlin +Merlin GeneMark.hmm exon 149533 149880 . + . ID=Merlin_267_exon;Parent=Merlin_267_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 149533 149880 . + 0 ID=Merlin_267_CDS;Parent=Merlin_267_exon;seqid=Merlin +Merlin GeneMark.hmm gene 149877 150737 -1096.070881 + . ID=Merlin_268;seqid=Merlin +Merlin GeneMark.hmm mRNA 149877 150737 . + . ID=Merlin_268_mRNA;Parent=Merlin_268;seqid=Merlin +Merlin GeneMark.hmm exon 149877 150737 . + . ID=Merlin_268_exon;Parent=Merlin_268_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 149877 150737 . + 0 ID=Merlin_268_CDS;Parent=Merlin_268_exon;seqid=Merlin +Merlin GeneMark.hmm gene 150734 150925 -235.875923 + . ID=Merlin_269;seqid=Merlin +Merlin GeneMark.hmm mRNA 150734 150925 . + . ID=Merlin_269_mRNA;Parent=Merlin_269;seqid=Merlin +Merlin GeneMark.hmm exon 150734 150925 . + . ID=Merlin_269_exon;Parent=Merlin_269_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 150734 150925 . + 0 ID=Merlin_269_CDS;Parent=Merlin_269_exon;seqid=Merlin +Merlin GeneMark.hmm gene 150922 151227 -402.602546 + . ID=Merlin_270;seqid=Merlin +Merlin GeneMark.hmm mRNA 150922 151227 . + . ID=Merlin_270_mRNA;Parent=Merlin_270;seqid=Merlin +Merlin GeneMark.hmm exon 150922 151227 . + . ID=Merlin_270_exon;Parent=Merlin_270_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 150922 151227 . + 0 ID=Merlin_270_CDS;Parent=Merlin_270_exon;seqid=Merlin +Merlin GeneMark.hmm gene 151218 153473 -2890.442885 + . ID=Merlin_271;seqid=Merlin +Merlin GeneMark.hmm mRNA 151218 153473 . + . ID=Merlin_271_mRNA;Parent=Merlin_271;seqid=Merlin +Merlin GeneMark.hmm exon 151218 153473 . + . ID=Merlin_271_exon;Parent=Merlin_271_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 151218 153473 . + 0 ID=Merlin_271_CDS;Parent=Merlin_271_exon;seqid=Merlin +Merlin GeneMark.hmm gene 153580 154722 -1440.286123 + . ID=Merlin_272;seqid=Merlin +Merlin GeneMark.hmm mRNA 153580 154722 . + . ID=Merlin_272_mRNA;Parent=Merlin_272;seqid=Merlin +Merlin GeneMark.hmm exon 153580 154722 . + . ID=Merlin_272_exon;Parent=Merlin_272_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 153580 154722 . + 0 ID=Merlin_272_CDS;Parent=Merlin_272_exon;seqid=Merlin +Merlin GeneMark.hmm gene 154749 155165 -537.328485 + . ID=Merlin_273;seqid=Merlin +Merlin GeneMark.hmm mRNA 154749 155165 . + . ID=Merlin_273_mRNA;Parent=Merlin_273;seqid=Merlin +Merlin GeneMark.hmm exon 154749 155165 . + . ID=Merlin_273_exon;Parent=Merlin_273_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 154749 155165 . + 0 ID=Merlin_273_CDS;Parent=Merlin_273_exon;seqid=Merlin +Merlin GeneMark.hmm gene 155162 155392 -284.548380 + . ID=Merlin_274;seqid=Merlin +Merlin GeneMark.hmm mRNA 155162 155392 . + . ID=Merlin_274_mRNA;Parent=Merlin_274;seqid=Merlin +Merlin GeneMark.hmm exon 155162 155392 . + . ID=Merlin_274_exon;Parent=Merlin_274_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 155162 155392 . + 0 ID=Merlin_274_CDS;Parent=Merlin_274_exon;seqid=Merlin +Merlin GeneMark.hmm gene 155392 156522 -1423.600588 + . ID=Merlin_275;seqid=Merlin +Merlin GeneMark.hmm mRNA 155392 156522 . + . ID=Merlin_275_mRNA;Parent=Merlin_275;seqid=Merlin +Merlin GeneMark.hmm exon 155392 156522 . + . ID=Merlin_275_exon;Parent=Merlin_275_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 155392 156522 . + 0 ID=Merlin_275_CDS;Parent=Merlin_275_exon;seqid=Merlin +Merlin GeneMark.hmm gene 156585 157088 -632.566444 + . ID=Merlin_276;seqid=Merlin +Merlin GeneMark.hmm mRNA 156585 157088 . + . ID=Merlin_276_mRNA;Parent=Merlin_276;seqid=Merlin +Merlin GeneMark.hmm exon 156585 157088 . + . ID=Merlin_276_exon;Parent=Merlin_276_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 156585 157088 . + 0 ID=Merlin_276_CDS;Parent=Merlin_276_exon;seqid=Merlin +Merlin GeneMark.hmm gene 157076 157432 -439.709209 + . ID=Merlin_277;seqid=Merlin +Merlin GeneMark.hmm mRNA 157076 157432 . + . ID=Merlin_277_mRNA;Parent=Merlin_277;seqid=Merlin +Merlin GeneMark.hmm exon 157076 157432 . + . ID=Merlin_277_exon;Parent=Merlin_277_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 157076 157432 . + 0 ID=Merlin_277_CDS;Parent=Merlin_277_exon;seqid=Merlin +Merlin GeneMark.hmm gene 157429 157734 -403.460144 + . ID=Merlin_278;seqid=Merlin +Merlin GeneMark.hmm mRNA 157429 157734 . + . ID=Merlin_278_mRNA;Parent=Merlin_278;seqid=Merlin +Merlin GeneMark.hmm exon 157429 157734 . + . ID=Merlin_278_exon;Parent=Merlin_278_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 157429 157734 . + 0 ID=Merlin_278_CDS;Parent=Merlin_278_exon;seqid=Merlin +Merlin GeneMark.hmm gene 157836 158312 -603.091441 + . ID=Merlin_279;seqid=Merlin +Merlin GeneMark.hmm mRNA 157836 158312 . + . ID=Merlin_279_mRNA;Parent=Merlin_279;seqid=Merlin +Merlin GeneMark.hmm exon 157836 158312 . + . ID=Merlin_279_exon;Parent=Merlin_279_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 157836 158312 . + 0 ID=Merlin_279_CDS;Parent=Merlin_279_exon;seqid=Merlin +Merlin GeneMark.hmm gene 158309 158668 -447.203441 + . ID=Merlin_280;seqid=Merlin +Merlin GeneMark.hmm mRNA 158309 158668 . + . ID=Merlin_280_mRNA;Parent=Merlin_280;seqid=Merlin +Merlin GeneMark.hmm exon 158309 158668 . + . ID=Merlin_280_exon;Parent=Merlin_280_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 158309 158668 . + 0 ID=Merlin_280_CDS;Parent=Merlin_280_exon;seqid=Merlin +Merlin GeneMark.hmm gene 158665 158838 -212.409539 + . ID=Merlin_281;seqid=Merlin +Merlin GeneMark.hmm mRNA 158665 158838 . + . ID=Merlin_281_mRNA;Parent=Merlin_281;seqid=Merlin +Merlin GeneMark.hmm exon 158665 158838 . + . ID=Merlin_281_exon;Parent=Merlin_281_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 158665 158838 . + 0 ID=Merlin_281_CDS;Parent=Merlin_281_exon;seqid=Merlin +Merlin GeneMark.hmm gene 158835 159731 -1132.126395 + . ID=Merlin_282;seqid=Merlin +Merlin GeneMark.hmm mRNA 158835 159731 . + . ID=Merlin_282_mRNA;Parent=Merlin_282;seqid=Merlin +Merlin GeneMark.hmm exon 158835 159731 . + . ID=Merlin_282_exon;Parent=Merlin_282_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 158835 159731 . + 0 ID=Merlin_282_CDS;Parent=Merlin_282_exon;seqid=Merlin +Merlin GeneMark.hmm gene 159731 159922 -235.781764 + . ID=Merlin_283;seqid=Merlin +Merlin GeneMark.hmm mRNA 159731 159922 . + . ID=Merlin_283_mRNA;Parent=Merlin_283;seqid=Merlin +Merlin GeneMark.hmm exon 159731 159922 . + . ID=Merlin_283_exon;Parent=Merlin_283_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 159731 159922 . + 0 ID=Merlin_283_CDS;Parent=Merlin_283_exon;seqid=Merlin +Merlin GeneMark.hmm gene 159922 160137 -267.519915 + . ID=Merlin_284;seqid=Merlin +Merlin GeneMark.hmm mRNA 159922 160137 . + . ID=Merlin_284_mRNA;Parent=Merlin_284;seqid=Merlin +Merlin GeneMark.hmm exon 159922 160137 . + . ID=Merlin_284_exon;Parent=Merlin_284_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 159922 160137 . + 0 ID=Merlin_284_CDS;Parent=Merlin_284_exon;seqid=Merlin +Merlin GeneMark.hmm gene 160137 160436 -372.267833 + . ID=Merlin_285;seqid=Merlin +Merlin GeneMark.hmm mRNA 160137 160436 . + . ID=Merlin_285_mRNA;Parent=Merlin_285;seqid=Merlin +Merlin GeneMark.hmm exon 160137 160436 . + . ID=Merlin_285_exon;Parent=Merlin_285_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 160137 160436 . + 0 ID=Merlin_285_CDS;Parent=Merlin_285_exon;seqid=Merlin +Merlin GeneMark.hmm gene 160414 160641 -289.957825 + . ID=Merlin_286;seqid=Merlin +Merlin GeneMark.hmm mRNA 160414 160641 . + . ID=Merlin_286_mRNA;Parent=Merlin_286;seqid=Merlin +Merlin GeneMark.hmm exon 160414 160641 . + . ID=Merlin_286_exon;Parent=Merlin_286_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 160414 160641 . + 0 ID=Merlin_286_CDS;Parent=Merlin_286_exon;seqid=Merlin +Merlin GeneMark.hmm gene 160638 160985 -435.855402 + . ID=Merlin_287;seqid=Merlin +Merlin GeneMark.hmm mRNA 160638 160985 . + . ID=Merlin_287_mRNA;Parent=Merlin_287;seqid=Merlin +Merlin GeneMark.hmm exon 160638 160985 . + . ID=Merlin_287_exon;Parent=Merlin_287_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 160638 160985 . + 0 ID=Merlin_287_CDS;Parent=Merlin_287_exon;seqid=Merlin +Merlin GeneMark.hmm gene 160986 161549 -716.263909 + . ID=Merlin_288;seqid=Merlin +Merlin GeneMark.hmm mRNA 160986 161549 . + . ID=Merlin_288_mRNA;Parent=Merlin_288;seqid=Merlin +Merlin GeneMark.hmm exon 160986 161549 . + . ID=Merlin_288_exon;Parent=Merlin_288_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 160986 161549 . + 0 ID=Merlin_288_CDS;Parent=Merlin_288_exon;seqid=Merlin +Merlin GeneMark.hmm gene 161546 161848 -371.966910 + . ID=Merlin_289;seqid=Merlin +Merlin GeneMark.hmm mRNA 161546 161848 . + . ID=Merlin_289_mRNA;Parent=Merlin_289;seqid=Merlin +Merlin GeneMark.hmm exon 161546 161848 . + . ID=Merlin_289_exon;Parent=Merlin_289_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 161546 161848 . + 0 ID=Merlin_289_CDS;Parent=Merlin_289_exon;seqid=Merlin +Merlin GeneMark.hmm gene 161845 162081 -287.849916 + . ID=Merlin_290;seqid=Merlin +Merlin GeneMark.hmm mRNA 161845 162081 . + . ID=Merlin_290_mRNA;Parent=Merlin_290;seqid=Merlin +Merlin GeneMark.hmm exon 161845 162081 . + . ID=Merlin_290_exon;Parent=Merlin_290_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 161845 162081 . + 0 ID=Merlin_290_CDS;Parent=Merlin_290_exon;seqid=Merlin +Merlin GeneMark.hmm gene 162074 162391 -387.962641 + . ID=Merlin_291;seqid=Merlin +Merlin GeneMark.hmm mRNA 162074 162391 . + . ID=Merlin_291_mRNA;Parent=Merlin_291;seqid=Merlin +Merlin GeneMark.hmm exon 162074 162391 . + . ID=Merlin_291_exon;Parent=Merlin_291_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 162074 162391 . + 0 ID=Merlin_291_CDS;Parent=Merlin_291_exon;seqid=Merlin +Merlin GeneMark.hmm gene 162449 162775 -406.965469 + . ID=Merlin_292;seqid=Merlin +Merlin GeneMark.hmm mRNA 162449 162775 . + . ID=Merlin_292_mRNA;Parent=Merlin_292;seqid=Merlin +Merlin GeneMark.hmm exon 162449 162775 . + . ID=Merlin_292_exon;Parent=Merlin_292_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 162449 162775 . + 0 ID=Merlin_292_CDS;Parent=Merlin_292_exon;seqid=Merlin +Merlin GeneMark.hmm gene 162905 163159 -321.120824 + . ID=Merlin_293;seqid=Merlin +Merlin GeneMark.hmm mRNA 162905 163159 . + . ID=Merlin_293_mRNA;Parent=Merlin_293;seqid=Merlin +Merlin GeneMark.hmm exon 162905 163159 . + . ID=Merlin_293_exon;Parent=Merlin_293_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 162905 163159 . + 0 ID=Merlin_293_CDS;Parent=Merlin_293_exon;seqid=Merlin +Merlin GeneMark.hmm gene 163465 163644 -217.336356 + . ID=Merlin_294;seqid=Merlin +Merlin GeneMark.hmm mRNA 163465 163644 . + . ID=Merlin_294_mRNA;Parent=Merlin_294;seqid=Merlin +Merlin GeneMark.hmm exon 163465 163644 . + . ID=Merlin_294_exon;Parent=Merlin_294_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 163465 163644 . + 0 ID=Merlin_294_CDS;Parent=Merlin_294_exon;seqid=Merlin +Merlin GeneMark.hmm gene 163764 164132 -441.864606 + . ID=Merlin_295;seqid=Merlin +Merlin GeneMark.hmm mRNA 163764 164132 . + . ID=Merlin_295_mRNA;Parent=Merlin_295;seqid=Merlin +Merlin GeneMark.hmm exon 163764 164132 . + . ID=Merlin_295_exon;Parent=Merlin_295_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 163764 164132 . + 0 ID=Merlin_295_CDS;Parent=Merlin_295_exon;seqid=Merlin +Merlin GeneMark.hmm gene 164158 164646 -602.734029 + . ID=Merlin_296;seqid=Merlin +Merlin GeneMark.hmm mRNA 164158 164646 . + . ID=Merlin_296_mRNA;Parent=Merlin_296;seqid=Merlin +Merlin GeneMark.hmm exon 164158 164646 . + . ID=Merlin_296_exon;Parent=Merlin_296_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 164158 164646 . + 0 ID=Merlin_296_CDS;Parent=Merlin_296_exon;seqid=Merlin +Merlin GeneMark.hmm gene 164715 165071 -451.064481 + . ID=Merlin_297;seqid=Merlin +Merlin GeneMark.hmm mRNA 164715 165071 . + . ID=Merlin_297_mRNA;Parent=Merlin_297;seqid=Merlin +Merlin GeneMark.hmm exon 164715 165071 . + . ID=Merlin_297_exon;Parent=Merlin_297_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 164715 165071 . + 0 ID=Merlin_297_CDS;Parent=Merlin_297_exon;seqid=Merlin +Merlin GeneMark.hmm gene 165107 165601 -618.360781 + . ID=Merlin_298;seqid=Merlin +Merlin GeneMark.hmm mRNA 165107 165601 . + . ID=Merlin_298_mRNA;Parent=Merlin_298;seqid=Merlin +Merlin GeneMark.hmm exon 165107 165601 . + . ID=Merlin_298_exon;Parent=Merlin_298_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 165107 165601 . + 0 ID=Merlin_298_CDS;Parent=Merlin_298_exon;seqid=Merlin +Merlin GeneMark.hmm gene 165612 165773 -191.091430 + . ID=Merlin_299;seqid=Merlin +Merlin GeneMark.hmm mRNA 165612 165773 . + . ID=Merlin_299_mRNA;Parent=Merlin_299;seqid=Merlin +Merlin GeneMark.hmm exon 165612 165773 . + . ID=Merlin_299_exon;Parent=Merlin_299_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 165612 165773 . + 0 ID=Merlin_299_CDS;Parent=Merlin_299_exon;seqid=Merlin +Merlin GeneMark.hmm gene 165770 166000 -285.030914 + . ID=Merlin_300;seqid=Merlin +Merlin GeneMark.hmm mRNA 165770 166000 . + . ID=Merlin_300_mRNA;Parent=Merlin_300;seqid=Merlin +Merlin GeneMark.hmm exon 165770 166000 . + . ID=Merlin_300_exon;Parent=Merlin_300_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 165770 166000 . + 0 ID=Merlin_300_CDS;Parent=Merlin_300_exon;seqid=Merlin +Merlin GeneMark.hmm gene 165997 166191 -241.609251 + . ID=Merlin_301;seqid=Merlin +Merlin GeneMark.hmm mRNA 165997 166191 . + . ID=Merlin_301_mRNA;Parent=Merlin_301;seqid=Merlin +Merlin GeneMark.hmm exon 165997 166191 . + . ID=Merlin_301_exon;Parent=Merlin_301_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 165997 166191 . + 0 ID=Merlin_301_CDS;Parent=Merlin_301_exon;seqid=Merlin +Merlin GeneMark.hmm gene 166352 167200 -1091.167753 + . ID=Merlin_302;seqid=Merlin +Merlin GeneMark.hmm mRNA 166352 167200 . + . ID=Merlin_302_mRNA;Parent=Merlin_302;seqid=Merlin +Merlin GeneMark.hmm exon 166352 167200 . + . ID=Merlin_302_exon;Parent=Merlin_302_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 166352 167200 . + 0 ID=Merlin_302_CDS;Parent=Merlin_302_exon;seqid=Merlin +Merlin GeneMark.hmm gene 167197 167433 -294.645060 + . ID=Merlin_303;seqid=Merlin +Merlin GeneMark.hmm mRNA 167197 167433 . + . ID=Merlin_303_mRNA;Parent=Merlin_303;seqid=Merlin +Merlin GeneMark.hmm exon 167197 167433 . + . ID=Merlin_303_exon;Parent=Merlin_303_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 167197 167433 . + 0 ID=Merlin_303_CDS;Parent=Merlin_303_exon;seqid=Merlin +Merlin GeneMark.hmm gene 167487 168944 -1811.170385 + . ID=Merlin_304;seqid=Merlin +Merlin GeneMark.hmm mRNA 167487 168944 . + . ID=Merlin_304_mRNA;Parent=Merlin_304;seqid=Merlin +Merlin GeneMark.hmm exon 167487 168944 . + . ID=Merlin_304_exon;Parent=Merlin_304_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 167487 168944 . + 0 ID=Merlin_304_CDS;Parent=Merlin_304_exon;seqid=Merlin +Merlin GeneMark.hmm gene 168941 169120 -220.159549 + . ID=Merlin_305;seqid=Merlin +Merlin GeneMark.hmm mRNA 168941 169120 . + . ID=Merlin_305_mRNA;Parent=Merlin_305;seqid=Merlin +Merlin GeneMark.hmm exon 168941 169120 . + . ID=Merlin_305_exon;Parent=Merlin_305_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 168941 169120 . + 0 ID=Merlin_305_CDS;Parent=Merlin_305_exon;seqid=Merlin +Merlin GeneMark.hmm gene 169175 171265 -2617.092758 + . ID=Merlin_306;seqid=Merlin +Merlin GeneMark.hmm mRNA 169175 171265 . + . ID=Merlin_306_mRNA;Parent=Merlin_306;seqid=Merlin +Merlin GeneMark.hmm exon 169175 171265 . + . ID=Merlin_306_exon;Parent=Merlin_306_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 169175 171265 . + 0 ID=Merlin_306_CDS;Parent=Merlin_306_exon;seqid=Merlin +Merlin GeneMark.hmm gene 171301 172788 -1876.322043 + . ID=Merlin_307;seqid=Merlin +Merlin GeneMark.hmm mRNA 171301 172788 . + . ID=Merlin_307_mRNA;Parent=Merlin_307;seqid=Merlin +Merlin GeneMark.hmm exon 171301 172788 . + . ID=Merlin_307_exon;Parent=Merlin_307_mRNA;seqid=Merlin +Merlin GeneMark.hmm CDS 171301 172788 . + 0 ID=Merlin_307_CDS;Parent=Merlin_307_exon;seqid=Merlin
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/gff3/merlin.gff3 Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,28 @@ +##gff-version3 +##sequence-region Merlin 1 172788 +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_1;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_1_mRNA;Parent=Merlin_1 +Merlin GeneMark.hmm CDS14 20 1000 + 0 ID=Merlin_1_CDS;Parent=Merlin_1_mRNA +Merlin GeneMark.hmm CDS24 30 500+ 0 ID=Merlin_1_CDS;Parent=Merlin_1_mRNA +Merlin GeneMark.hmm gene 14 30 . + . ID=Merlin_2;seqid=Merlin;color=#00ff00 +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_2_mRNA;seqid=Merlin;color=#00ff00;Parent=Merlin_2 +Merlin GeneMark.hmm CDS14 20 500+ 0 ID=Merlin_2_CDS;Parent=Merlin_2_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS24 30 750+ 0 ID=Merlin_2_CDS;Parent=Merlin_2_mRNA;color=#00ff00 +Merlin GeneMark.hmm gene 10 30 . + . ID=Merlin_3;seqid=Merlin +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3A_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS14 18 1000 + 0 ID=Merlin_3A_CDS;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS20 30 800+ 0 ID=Merlin_3A_CDS;Parent=Merlin_3A_mRNA;color=#0000ff +Merlin GeneMark.hmm mRNA 14 30 . + . ID=Merlin_3B_mRNA;Parent=Merlin_3;color=#0000ff +Merlin GeneMark.hmm CDS14 22 400+ 0 ID=Merlin_3B_CDS;Parent=Merlin_3B_mRNA;color=#0000ff +Merlin GeneMark.hmm CDS24 30 1000 + 0 ID=Merlin_3B_CDS;Parent=Merlin_3B_mRNA;color=#0000ff +Merlin exonerate gene 1740 2300 . + . Name=Apple3;Note=Genewithtwosplicingmodels;ID=1 +Merlin exonerate mRNA 1740 2300 . + . Name=Apple3-a;Note=mRNAAwithbothCDSsandUTRs;ID=1A;Parent=1; +# { +# "baseUrl":"http://localhost:8000/out/data/" +# "compress":0, +# "label":"Transcript", +# "storeClass":"JBrowse/Store/SeqFeature/NCList", +# "trackType":"JBrowse/View/Track/CanvasFeatures", +# "type":"JBrowse/View/Track/CanvasFeatures", +# "urlTemplate":"tracks/42ff9cb16c0509f0abb4a76ce14077bc_0/{refseq}/trackData.json", +# }
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/maf/merlin.maf Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,38 @@ +##maf version=1 scoring=lastz.v1.04.22 +# lastz.v1.04.22 --strand=both --ambiguous=iupac --traceback=160M --format=maf --action:target=multiple +# +# hsp_threshold = 3000 +# gapped_threshold = 3000 +# x_drop = 910 +# y_drop = 9400 +# gap_open_penalty = 400 +# gap_extend_penalty = 30 +# A C G T +# A 91 -114 -31 -123 +# C -114 100 -125 -31 +# G -31 -125 100 -114 +# T -123 -31 -114 91 + +a score=5613 +s Merlin.Merlin 960 60 + 172788 ATTTTCTGTAGCAGTTCAAACTGTAGAAAATGATGAAGTTATTTTAACTTTACCAGCTTT +s merlin1.Merlin 960 60 + 60 ATTTTCTGTAGCAGTTCCAAACTGTAGAAAATGATGAAGTTATTTTAACTTACCAGCTTT + +a score=5667 +s Merlin.Merlin 1020 60 + 172788 CGTAATTTTCCGCAAATAAAACAATGGGGAGCTATGCTCCCCATTTTTACAATCCAAGTA +s merlin2.Merlin 0 60 + 60 CGTAATTTTCCGCAATAAAACAATGGGGAGCTATGCTCCCCATTTTTACAATCACAAGTA + +a score=5721 +s Merlin.Merlin 1080 60 + 172788 TTTTCGAAGTAGAGTTTCGGGTCGAATTAATGACGTGAGACAACCCTCCAGCAGCTCCTC +s merlin3.Merlin 100 160 + 60 TTTCGAAGTAGAGTTTCGGGTCGAATTAATGACGTGAGTACAACCCTCCAGCAGCTCCTC + +a score=5613 +s Merlin.Merlin 1140 60 + 172788 CAAGTCTAGATAATCTACTTAAACTTCCATTAAGAGACATTTCACTATTAATTCCAGTTA +s merlin4.Merlin 200 260 + 60 CAAGTCTAGGATAATCTACTTAAACTTCCATTAAAGACATTTCACTATTAATTCCAGTTA + +a score=5703 +s Merlin.Merlin 1200 60 + 172788 TAGAATTAACAGCTCTATCTTCAATCCAATCAAGAGCAGCTTGACGTCCAACAGCACCCG +s merlin5.Merlin 300 360 + 60 TAGAATTAACAGCTCTATTCAATCCAATCCCAAGAGCAGCTTGACGTCCAACAGCACCCG + +a score=5649 +s Merlin.Merlin 1260 60 + 172788 TTTGCATTACTCTGTAAGCAAATGTAACATCGAAAACCGCAATTTGGTTATCTCCTTCAT +s merlin6.Merlin 0 60 + 60 TTTGCATTACTCTGTAAGCAATGTAACATCGAAAACCGCAATTATGGTTATCTCCTTCAT
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/maf/merlinlastz.maf Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,37 @@ +##maf version=1 scoring=lastz.v1.04.22 +# lastz.v1.04.22 --strand=both --ambiguous=iupac --traceback=160M --format=maf --action:target=multiple +# +# hsp_threshold = 3000 +# gapped_threshold = 3000 +# x_drop = 910 +# y_drop = 9400 +# gap_open_penalty = 400 +# gap_extend_penalty = 30 +# A C G T +# A 91 -114 -31 -123 +# C -114 100 -125 -31 +# G -31 -125 100 -114 +# T -123 -31 -114 91 +a score=5703 +s Merlin.Merlin 1320 60 + 172788 ATGTAAGCTCAGGAGCTCCACACGCAACAGGAACACAACCTGTGAACATTATCACAGTAT +s Merlin1.Merlin 0 60 + 60 ATGTAAGCTCAGGAGCTCCACACGCAACAGGAACACAACCTGTGAACATTATCACAGTAT + +a score=5595 +s Merlin.Merlin 4020 60 + 172788 ATGAATTTATCAGTCCAATACTTAAAATGAATACGAAGTAAATCTATGCCTAATACTAAT +s Merlin2.Merlin 0 60 + 60 ATGAATTTATCAGTCCAATACTTAAAATGAATACGAAGTAAATCTATGCCTAATACTAAT + +a score=5667 +s Merlin.Merlin 5220 60 + 172788 TAATCAACGTGTGATGCTTCAAGCCAAGCTTAGGAATAGAAATGGTTTTGCCATTGACTT +s Merlin3.Merlin 0 60 + 60 TAATCAACGTGTGATGCTTCAAGCCAAGCTTAGGAATAGAAATGGTTTTGCCATTGACTT + +a score=5640 +s Merlin.Merlin 7740 60 + 172788 AAAAGCTTATTGCTTAAGCCTACAGTTAAACTCGCTATTCCAGTTAAATGCGATAAATGT +s Merlin4.Merlin 0 60 + 60 AAAAGCTTATTGCTTAAGCCTACAGTTAAACTCGCTATTCCAGTTAAATGCGATAAATGT + +a score=5649 +s Merlin.Merlin 9720 60 + 172788 TTTTCTTTGCTAATTTAACACCAAGAGCTGCAATCCATTGGTTTCTTCGTTTATATCCTG +s Merlin5.Merlin 0 60 + 60 TTTTCTTTGCTAATTTAACACCAAGAGCTGCAATCCATTGGTTTCTTCGTTTATATCCTG + +a score=5658 +s Merlin.Merlin 10380 60 + 172788 ATACTGCATCCTTTTGATACCAATGCGGTTCAATTTGAGTGTTACCAGAGTATATCTTGA +s Merlin6.Merlin 0 60 + 60 ATACTGCATCCTTTTGATACCAATGCGGTTCAATTTGAGTGTTACCAGAGTATATCTTGA
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/merlin.fa Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,2881 @@ +>Merlin +TCGTTTAGACAAAGGTACATTATTGTATCGTGGCCAAAAATTAGACCTTCCTACATTCGA +GCATAACGCAGAGAATAAGTTGTTCTATTTCAGAAACTACGTTTCAACTTCATTAAAGCC +TCTGATCTTTGGTGAATTTGGTCGTATGTTTATGGCACTAGATGACGATACTACAATTTA +TACTGCTGAGACGCCTGATGATTATAATCGTTTCGCAAACCCAGAAGATATAATTGATAT +TGGCGCTACTCAAAAAGACTCATTTGACGATAACAATAATGATGGAACATCTATTAATAT +CGGCAAACAAGTTAATTTAGGATTCGTTATTTCCGGTGCTGAAAATGTTCGAGTTATTGT +TCCAGGTTCTTTAACTGAATATCCAGAAGAAGCGGAAGTTATTCTGCCTCGTGGTACTCT +TTTGAAGATCAATAAAATCACTACTCAAGTAGATAAACGCTCGAATAAGTTCATGGTTGA +AGGTTCAATCGTTCCGCCTTCTGAGCAAATTGATGAATCTGTTGAGATTTATGACGGTGA +TCTGTTCATGGAAACAGGTGAAGTAGTAAAACTGTCCGGATTCATGCAGTTCGTCAACGA +ATCTGCATACGATGAAGAGCAAAACCAGATGGCTGCTGAGATTCTGTCTGGATTCTTGGA +CATTGATGACATGCCACGTAAGTTCCGCTAGCCGTTTACATCCACATGGAAGTGGATTAT +AATGGCTCTACGTTAACAAGAGGAAAACAACATGAAATCAATTTTTCGTATCAACGGTGT +AGAAATTGTAGTTGAAGATGTAGTTCCTATGTCTTATGAATTCAATGAAGTTGTTTTCAA +AGAGCTTAAGAAAATTTTAGGCGATAAGAAGCTTCAAAGTACTCCAATTGGACGTTTTGG +AATGAAAGAAAACGTTGATACTTATATTGAAAGTGTAGTGACAGGGCAGTTAGAAGGTGA +ATTTTCTGTAGCAGTTCAAACTGTAGAAAATGATGAAGTTATTTTAACTTTACCAGCTTT +CGTAATTTTCCGCAAATAAAACAATGGGGAGCTATGCTCCCCATTTTTACAATCCAAGTA +TTTTCGAAGTAGAGTTTCGGGTCGAATTAATGACGTGAGACAACCCTCCAGCAGCTCCTC +CAAGTCTAGATAATCTACTTAAACTTCCATTAAGAGACATTTCACTATTAATTCCAGTTA +TAGAATTAACAGCTCTATCTTCAATCCAATCAAGAGCAGCTTGACGTCCAACAGCACCCG +TTTGCATTACTCTGTAAGCAAATGTAACATCGAAAACCGCAATTTGGTTATCTCCTTCAT +ATGTAAGCTCAGGAGCTCCACACGCAACAGGAACACAACCTGTGAACATTATCACAGTAT +GAGGTAATCCATTTCGAGCATGAAGGTTAACCTGAATGTCAGCTTCGACGTCAGTTGGTA +ATGCTCGCAATCCAGTAACCGGGTCTTGAACGGAGTTCACCCAATCTTGCATTGCACGAT +AGTTACTTGCTTCGGGATCCATTCTGAATGATATAGTTAACGGATCGAGTTCACGTCCAG +TTATTCTAATATTCGGTGAGTTATGGTTGAAATCCATTTCATGAGACAATCTGTTCTCTG +GAATTTTGACCGAATAAATCATCAATCCAGATTGCGGATAAGCCATGTTAAAGAAGTCTA +ACAAATAAGTTCCGACTTCAAATTCACCTAATAAAGACTGAACAACACGATTGCTCATTG +CTCCAATAAGATATTTCGATACACCAGACTTTCTTACCAGCTGTTGAGTACCGGCAGTGA +TAATTGAGGTGAGTCCTGATGTGAACTCACCTTGTGTTAATCCAAGCCAGTCATTATTCA +ACGGAAGGTTATTAAAGAGCATACCGCCAAATTGATCGAGTAATTGTTGAGACTTTGCTG +ACGGAGTAGTTGCAAATACACAACTAAACATATTAGTACGCTGAAAGTCTATATTACCCG +CTTGGTTTTTAAATTCATCTAAAGTTAGCATCAGAATCCTTCCGCATATACTGAAGCTCG +GTTCAATGTCAAGATTTCACGCATAGTAATTTCTAATGTGAATGTACTTGGCAGGTTTGG +AGCTATAGCTAAACCGTTAAAGTTTCCATTTGGAGTTTTATCAAAACGGATACTCTGAAT +TTGACATGGACCGAATACTTCAGCACGTCCATCGAATTTACTTGTGGTTCCAAAGTTTCT +GACGAACCACACAGTAGGGTTACTTACAACAATAACATTACTTAAGAATGAAGTTATTTT +CTCAAAAACAGTGTCATTTTTATTAGCTTCATCTGGAGTTAATGTATCAAGGAAAGTTGA +TTTATACCATTCATCTAATTGAGACTTAACTTCTTTTGCATAAGTAGACGTTCCCGTTTC +GCCATAACTATAGTAGTTAAAGTATTCATAGATCTCGATAATAGCAATAAGATCTTGTAC +TGATCGAGGAGTTAAATCCCACGTGAATACCTTCGTACGGTTATCTGCGCCGCCATACAT +TGATCGAGCAGTGTTATAGATCTGCTCGTTATGGTCAGCCATTAATCCTTGAGTCAATGA +CTCTAATCCGCCAAAGACAGCAGTAGATGCAACGTTACTTAATACCCCTGTAGCAGTACC +GCCGCCACGAGAAATAAGTGAATCTCCAACGTCATTAAATTTATGAGAAACTGATTCAAC +ATCTGATTTCGAGCGTGGAAGTAAAATATTCACTACTGGAATTTTATCAACTTTATTAGT +ATTTGTTCCAGTGATTGATTTCACTACACTATTTGCAGTACGTTTCATTTCACCTAAACG +CATGCTACGCATATCACCGGTTGTACGAGAATTCATATCATACGCAGTGAACAACAACCC +GTTCTTATAAAGATCATGAACTCGTAAAGAACCAGATGTGTCATTACCAGCTGAACGTTC +AGACGGATATTGCGCAGTTATAGTGGATTTTATTTTTGCTGATTGTGAACTTTGACCAGC +GGAGGTTTTAACTCCGCTAATTAAAGCATCAGTCTTATCATCTAATTCTCTGACTTTAAT +GCTCATTAATTAACTCCTGTTGCCCCGAATACTCCAGGAGCTGGAGTAGCCGTGACTGTT +TGAACCTGGTGAATAGTCTTACTATTATTTACGTTATTAACCTGAGTGTTAGCAACATTC +ATATCACCGGTTGATTTTTTAGATTGCTCTTTAGCATTTTCAGCTTTTTGAATATTTTGA +ACTCGTTGATTATCTTCCGAAGTAGCTGGAGCCGCAGGCTTCGGAGTGTTATCTTCTTTG +AGCTTCTGATACTTGGATTCTACTCGTTGGAATCTTTTATCGAGTTCCTTTTTAGTAGCT +GGTTGATCACTTATAGCAGAATCACTAATAGACTTTTTGGCACTGTTATATGCCTTCTCT +AAAGATTGCATATTAGTTGGATTCTCTGGATCAACATCACCAATATATTTTTCTAAACGC +TGAACAGCTGCACGAGCTTCGTTTTGTTTGATCAGTGTTTCTTCGCGTTTTTCAGGAGCC +ATTGCTTTAAGATTCTGAGTCTCTTGATCACGGTCAGATGCTTGTGTAGAATCGATTTTA +TTCTCTCTTCCTAGTACCCAATCAAATGCACGAGTTTTAAATTCGCCAGCTTTATCAATA +ATTCCAGGACCTTCTTCAATACGCTTACTCTGATATTTAGCCAAAGCTTTTTGATCATCT +TCAGACAATGAATTACCAGTGCGTTCCTGGAATCCTTCTAGTGCTGAACCACGAATAGTA +GTTGCTGCATTTTCAAAGCCAAGTGCATCGAGTATAGAAGCAGATATCTTTGAAATTCCC +AAAGACATTATTTCGCTCAGGTTGTAAATCACATCAGCTAATCCTTTGACAATAGCTACT +GCTAATCCACTCCAGTCTCCAGCTTCCCAGAACTTTTTAATATCTCCTAACATTCCAAAA +ATTGATTGAAGCAGTCCACCCCATTCACCAGCTTCAGCACTGAATTCATCGAAGTTACTC +ATGAATTTATCAGTCCAATACTTAAAATGAATACGAAGTAAATCTATGCCTAATACTAAT +GCAAATAGCATAGCAGCCATTTTAGCAGCTTCAGCTAATGCTGTTACGGTGTATTTGAAA +AGCATAGACGATATTTTATCAGTGATTGATATCGTAGCTTTAAATCCGCCCTTCGTAGTC +TTCAGCAAGTCACCTAACATATCGGTAGGTTTCTTATCATCTTTCTTTTGATCTTTCTTG +TTGTTCTCGGGTTCCTGAGGGACAGGTGGAAAGAAATCCTCATCAGGTAAATTGTTATTA +TCATCTAATGGAGGAAGTATTCTCTCAGGTAATCCAGGGGACTCTGGCTCATCAGGATCT +GGAAGCTTATCTTCTATCACTGACAGCCCAGTTGAAGCTGAGGCACCAGATTCTTGTACT +TTTTGCTCCACAGCCTGAACCTTCGACTCGAGCATCGAAGCGAGTTTACTTAATTTGTCG +CTGATTGTTGACGCTACGCCAGTAAGAGTCTTGATAGACTCAGTAGTTCGTTCAGATGCT +TCAGCAGCTAATTCAGTACCTTCAGCGACGTTATCAACTGAATCAATTAAGTTATTGCCC +TTTTCTTCAATTACTTCAGCAACTAGTTCATTAGCGCTTTGTACATCATCAAGTTTAAGG +CCGATTAATTCTAAAGAGTCTACTTGATCAGATGCAGTCGAGGCTGCATCTCTTTCACCC +TTTGAATCAGCGATAACTTTACGACGTCTCATTGTGGACATGTTTTCGCTTTTCATTCAA +ATAATCCTAGTACTTTAGCTGTTCCCCTTATATTGTGTGGACCGGGAATAGCAATAAGAT +CTGTGATTTCTTCTGCCCACTTTAAAACAAACGCAGGCATCTTAAGGAAATTAGGAACCT +CAGTGGAGTCATTCACTGATATGAAACATTCCGTGAGCATCTTATCGATAGTAGTAAATG +TTTCATATCGCTCAGGAGAGCGGAACTTAAATGTCTTTCCTTGGAACTGGAATTCAAGGC +GTTGGCAAATATAAATGTCATTGATATTGTAAGTGTATCCATCTTTAACGACACTTGATT +TTAACTTGCCATTGAACTCTAGCAAATGAATAGACACTAAATCTGATTCAGCAGCGTTTA +GCCCAGGATGAATTGAGTCCAAAAGAATTCCCATGTTCTCATCCATTGCACGAACATCTT +TAATCAACGTGTGATGCTTCAAGCCAAGCTTAGGAATAGAAATGGTTTTGCCATTGACTT +GTATTTTCTTAAGTGGTAGTATCAGATTTAAGTTCATTTTTAACCTTTATTGGTTCTACT +GTTTCCAACTTACGAGAGTTAGTAAACATATAAAGTGTAGTTATGCTTTGGTTATTACTT +ACTTCATGAATTACTTCATCAATGTAAAAATCAGCTTTGAATTGTTTCTTTGGATCAAAG +AAATTAATCTTATCTCCAGGAGTCATTTCAAAGTCTCCAACCATTTTACAAGTTGCATAA +CCGTCGTATTGTGCCATAGTCTGAAGCCTGATAGCTTCTTCATATCCATTTCGATAAGTC +ATTTCAGAATATGCACCCGAGCGAGAAACGAAGATGCTATTCTGTCCATCACCTGTTACT +ATTCTAGGAAGGTTATTATCCAAGAATGAGTGTGAATAGATTGTAGCGTTAAAAATAGGA +TCGCGTGTATGGGCATTAGCCTTCGTTAACCACTGGAAGTCATAAGCGAGATTATATTCC +AGCTCACCGACAAATTGACCTATTGTGCGTGGTTCACCGACAATAACCTTGATTGATTCT +TGGTTAATCATTGCGTCATAATCCATCATGTTCAAGCCATAGATATCTTCCCATACAAAA +ACGAATTGGTCATTATCTACTGCTAATGCAACATCTCGAACATATTTCTTGTATTCAGTT +ATGTTACTCGTCCATGGAACTCGAGGAACATATGTGTTGATTGTGTTCATCGCAGGTGCT +ATAAGAGGTTTTGATTCATAAATTGCACCGATCATCTCTTTTATTGATTCACCAGCGTCA +TTAAAGAAACAGCGGCTAAATTTAAGATTCTCTATTTCATGAACCAGTCCTAAGTTAATA +GCAATGATATTATCACCCTTAGAATCTACTGAAACACTGAAGTGTTTGCAGCCATAAATT +CGGTTCAATGTTTTCTTAGAGTTCGCATTAGCTACTGAGATTTGTATAATTTGATTTCCA +TCCATTAACGTATGAAGGTTCTTTGAATCAAAGAATTGTAAAACACCTTCATTACGTCCA +AATAAACCATCTCTCATTGTTAGTGTAGTAATTGTAGCTGCTAACTCAATGTATCTGTTA +GATTCCCATGCATCGTAATCCTGATACAACTTAATGCTCAGGTTCGGGAATCCCGGAAGT +TGTTGTATCATTTAGTGTTATCCTTCTCTATCAGAGATAATGCAATAGATCTTTCAACTG +GGATCATCTGCATTATTGACTCTAAATTGTAATGATTTTTAACCAGCATATGATTTATTT +GATAGAAGCTAAATACTTCGTCTGGATTCAGCAAAAGATTAAAGATATGCAAAAAATCAT +CGTAAATTATTTTCTTAGTTTCACAACATTTCATTCGAAGCTCAAGATGAATTGGTGTCA +TCTGCTTAATAATTGATTCCAAAGCGTTCAAATCTATAGCATCTATAACTTGAATTTGAT +TCTCTTCTGATAACTCGTTCCAAGGTATTATTTTCCCTTGGTATTTAATTGATTTCACAC +ATTCAGCTACTTGTGTAGCTTTATCATCGTAAAACTTATCCGGGTAATTGAAGTAAATCG +TAATACCAGCAACTTCGACGGTAGGCTCAGTAAGATCTTTGGTATTCAAACTGAATAAAG +TTTGCTTATTCTTTTCGCAAACTGGACAAGTAAAAGCAATAGGAATTTTAGTTTTACCGA +TTGACCCAGTGAACACCCTTAAGAATATGTACGGTCTCCATGTTGACGGAAACTCTTCGA +AGTATTCTTCGAGCAACTCATTTAAGATAATGTTTTGCTCTTCTGGTGTTTTTTGGACTA +AGTCGTTTCGGACTAACAGAAAATCTCGATAATCTGCTACTGTGAACGGTTTGAAACGAT +GCACACCATCAGGCAAAACACAACGAATAATATTAGCCATTATGTCTCCTTTATCATATT +TATAAATATTTCAATAAAGGAGTTGATATGAAATATGAATACACTTTTGAGGCTCGTATT +GGTGATGAAGTAATCCAATGCCGAGCGTTTACACTAGAAGAATATCGCGATCTCATCAAA +GCTAAAGCTGATGGGACTATCAAAGAGTATGTTCCAGAGCTGATTAAAAATTGTACTAAT +GCTCGTGGATTAAATCGTCAAGAGTCAGAGCTATTGTTAGTTCACTTATGGGCTAACAGT +TTAGGTGAGGTTAGTCACCAGAATACATGGGTGTGTTCCTGCGGACACGAAATACCAATG +TCAATTAATCTTACATTCGCTCAGATCGATGAACCAGATGATCTCTGGTATTCCCTAGGT +GGTTTCCGTATTAAGCTAAGATATCCTAATCTTTTTGAAGATTCAAATATCTCTATGATG +ATCGCATCATGTATTGAATACATTCATGTGAACGGTGAAACCATTTCAGTCGATGAACTG +AATGACAAAGAGATTGATGACCTTTACTCTGCAATAACAGAAGATGACATCGCTCGCATT +AAAAGCTTATTGCTTAAGCCTACAGTTAAACTCGCTATTCCAGTTAAATGCGATAAATGT +GGGGAAAATCATATCCACGTAATCGAGGGGCTTAAAGAATTCTTTAAGTTAATTTAATGG +CAGATATAAAAAATCTTTATTCAGATATCGATCCCGAGTTCAAAATGGACTGGGATCATG +ATGTAGCGCGGGCGCGTGGCCTACGGGCCATTAAAAACTCTCTTTTGGGTATTATAACAA +CTCGCAAAGGAAGCCGTCCATTTGACCCTAACTTCGGGTGCTCATTAACAGATCAGTTGT +TTGAAAATATGACGCCATTGACCGCTGATACTGTAAAGCGTAATATCGAGTCTTCAGTTC +GTGCTTATGAACCACGCATTCGATATTTAGCAGTTAATGTAACTCCAGTGTATGATGATT +ATACGCTTATTGTAGAAGTGCAATTCAGTGTAATTGATAACCCAGATGATATAGAACAGA +TTAAGCTACAACTTGCTTCAAGCAATCGATGATCGCTTACACACGCCGTTGTTATAATGG +TTCTAGTTCCTTTCAATCAATAACAACCAGAGTTACTAAAGAACAAGAAAGAGGGTTAAA +TGAAACTTGAAGATCTACAAGAAGAGTTGAAAAACGATCTACAATTAGATTCAACTAAAT +TACAATATGAAGCTGCAAATAACCCAGTGCTCTATGGCAAATGGTTGAATAAGCACTCGA +GTATCCGCAAGGAGATGCTTCGCATCGAGGCTCAGAAGAAAACGTCTTTAAAGCGTAAGT +TGGACTACTACACAGGCCGTGGTGACGGTGATGAATTTAGCATGGACCGTTATGAAAAGT +CTGAAATGAAAACTGTTTTAAGCGCTGATCAAGAAGTACTTAAGTTAGATACATCTTTAC +AATATTGGGCTATTCTGTTGGAATTTTGTTCTGGTGCGATGGATGCTATTAAATCTCGTG +GTTTCGGGATTAAGCATGTAATTGAAATGAGACAATTTGAGGCAGGAAAGTAGTATAAAT +AAGATAGTAAACTAGAGGAGACAACCATGTCTGAGATCTGTACTGTATGTAAGCAACCGA +TTGATTCGGCATTGGTTGTTCATACTTCAAGTGGTCCGGTTCACCCGGGTCCATGCTATA +ATTATATCGTTGAATTGCCGGTATCTGAAAATACACAAGAGCATTTAAACGAAACCGAAC +TTTTGCTTTAGTCTAGTGTTGATAGCCAACTTGTTGGTTTTTGCCCCTTCCTTTCGGTTG +GGGCCTTTTTGTATTAGAAGTCTTCTTCCGACTCAGATTCATCTTCTGATTCAAATTCCA +AACGCTTACCGGCCAACGCATCGCGGATAGAAATATCATCGGTGTCCTTCAATTCTGTAT +CTTTAACACGCTTTTTATAATAAGCCTCGAGTTCTTCCAGACCATCAAGAGTCTGGCATT +GAGCGATCTTACTCATAAACGTATCAATAGATGCTTCATAAAGAAATTCTTTAAAATCCA +TAATTCCTCACAGATTGATTGTTTTCATGATGTAGTTAAATTTTTCGTCTGCATATCTTT +GAATGCGTTCTAAAGCGTGTTTTAAAGCATAGTTTAAATGCACGTACTTCTTTTTGGCAT +TCGCTGATTTAGGCTTAACACCCATGTCATCCACGATATCCCAAACTTGTGCCAATGATT +TAGAGTCATGCTTACGAAGAACACGACCAATACTCTGGAGCACTGTAACTTTTGATTTAA +CTGGATGAGCGAAAATAACATGGTGTAAGTTTTTGACTGAAATACCAGTAGAGAATACTC +CATAAGAAGCAACTACTACTAATCCGGTATCTTTTTCAGCCATAGCTTTAAGAGCGTTAC +GAGTGTCAGTGTCAATTTCACCGTTAATGAAATGAACATTTTCATGACCGAGCTCTTTAA +CCATCGCATAAAGCTCTTTACCGTGTTTAGCATTTTTAAACATCAAGAAGACGTTTTCAT +TTTTCTTTGCTAATTTAACACCAAGAGCTGCAATCCATTGGTTTCTTCGTTTATATCCTG +TGATAGCTGCAATTTCTTCTTGATAGGTTTTACCCTTCATTTTAACTGTAGCTGCATCTG +GATAACGAAGGAAAATACAGTTGATCTTTAATTCAGTTACTTGGCCATCTTCCATTAGCT +GAGCAGTTGAAACAGGCTTAAAGATTTCACCGAACATTCCAACATATTGCATGAGGTTGG +CTTTGCCATCTTTCAGGGAACCAGAAAGACCGAATTTAAACATGCAATTATTTAAACCGG +AAATGATGGTTGAAATACTTTTACCTGTCGCGAGGTGACATTCATCGTTCATCATTAAAC +CAAATTGGCTGAACCATTCTTTTGGTTGTTTAACTGCTGTCTGCCAAGTACTAACATAGA +TCATTGCGTTCGAATCACGAGCGGTACCGCCTCGAATTCCAAGACAATGTTTCTTACCAA +ATAAGCGATAATCACAGAAGTCATTGATCATCTGGTCTACTAGTGCAGTAGTCGGAACGA +TGATTAAAATTTTACCCTCATAGTTCTCAACATAATAACGAGCTAAAAGAGCCTGAATAA +GAGATTTACCAGCAGAAGTCGGAAGGTTCAGGATTCGTCTGCGGTTCACTAGCCCTTCAT +ATACTGCATCCTTTTGATACCAATGCGGTTCAATTTGAGTGTTACCAGAGTATATCTTGA +GCTTCCCTAACCAACTGTCAAAGTCCTTACGGGAGAGTTCTTCTTTATCAAATATTTTAG +GGTCAATCCATGCTTTGTAACCCATATTGTCGCAGAACTTTTTAATTTGTCCAACTAAAC +CAAAAGGCAGAAGACGGTTATAATCCAGAAGACGAATACGTCCATCCCATTGTCCATATT +TGAATTTGGGATTGAATCGATAACCATCTGCTTCAAAGCTAAAAAAGTCTCTTAGTTCAT +GGAACACGGATTCATCGCATTCGATATGAACATGACTAAAGTCGTAAAAGTTTACTCGGA +TATCCATTTTGATTCCTTAGTTATAAATACAATCATATTTATACACAAGAGGCGATACCA +TGCTAGACATGCAATATATTGAAGAAATCCGCGTACTTGATAAGAAAGAAGCTAAAGACA +AGCTCGACGAGTATGCATCTCAGTTTGGAATCAAACTGAAGAAAACCAAGTCATTTGAAA +ATATGCTGGCTGATCTTGAAACAGAATTTAAAGCTCTATCTGATGAACCATTACCTGAAG +ATAACGTAGGTATTTCAATTTCTGATCTTATTGACGATGAGAAAGAAGAACTTCCTGATT +TGGTTCAATTAGAAGATTCTCCTGAAGCTGTTATTACTGTTATAGAAGTAGATAACATTC +AGGAAGATGCCATCTGTATTCCAGAAACTGCAACCGTTGAAGAAATATCTAAAATAATTG +AAGATAGCGCAGTTGTAATTGAATCAGAAAAGTTTGAATTACCTGAAGGATTTTCTCCTC +ATTTTGAACTAATTGGAAAGGCTCCAGGATATTGTACTCTTCCTTGGTGGATTTATGAGT +GGATTTCTAAAAATCCAGATTGGAAAGAAAAGCCTACATCATTTGAGCATGCAAGTGCCC +ATCAAACTCTGCTTAGTTTGATTTATTATATTAATCGCGACGGATCAGTTATGGTACGCG +AAACCCGCAATTCATCATTCGTTAAAATTAAATAAGAGGGTTAATCCCTCTTTAACTTAG +GAAACTATATGGCTACATCAATTGCATTATCGCCAACAAACCCCACAATTAAAATTGGAG +ATTCACAACAATTCACCGCAACTTTAACAGGAGCTCCAGAAGGTTCCACTACTGAATATA +AATGGACCGTAGATAATATTCAGCAAAGCTCTACTAGTGCAACTATGAATTATGTTGCAT +CGACACCAGGTAATAAAGTTATCAAGGTAGAATCTACCACTAAAGTAGACTCTCAACCTG +ATGACGTTCAAAGCGCGACTACATCTTTGACAGTCAAAGATGTTATGACTTTGAATGTTA +CGATATCCGCTCAGTCACAAACTATAAAAGTTGGAGAAAGTTATACGGCAGCGTGTGGAG +TAACAGGACAACCATCTGGATCAACTATTGCATATAAATGGTCAACTGGAGAAACCACTG +AAACAGTTTCTAAAGTTGCAACGCAAGAAGGCAATATATCACTGACTTGTGAAGTTACCG +TAACTGCTACAGGGTTTGAAGATGCTGTTAAAACCTCAAATGTTCTTAGCATTACAGTTA +CAGCAGCGGATCCAGTTGTTCCGCCTGAATGTCCATTGATTTATGTTCATCCTCTTCCAT +GGAGAAGCTCAGCTTATATTTGGGCAGGTTGGTGGGTTATGGACGCTATCCAGCGTTTAA +CAATTGAAGGAAAAGATTGGAAAACTGCAACTAAAGAAGATACTCCATATTATTGTCATT +TAGCAGTTCTTGCTAAAATGATTAATGACTATCCAGAAGTGGATGTTCAAGAATCACGCA +ATGGAAGAATTGTTCATCGTACAGCTTTAGAGGCTGGTATTATTTATTAATAAAAAGGGC +TCCCTTGGGAGCCCTTTTTTGCTTTTATATTTTATGATAGAATCACCACACAAACGAGGT +GAACATGAAAACTGAAATTAAAGTGCACATGATGCACGAACGTGGTGAAAGCTTTAAAGA +TATTGCAAAAGCAATAGGTGGAATGTCTGCATATGATGCAGCTTTGATGTATACGAAGGT +TGAATCTCTAAGAGAGAAAGCGAAGAACAAAGAGAAAATTGTTTATCGTAAACGCTTATC +AAATGTTGGTGTTAAAATTCGTCATAAGAAACTCGTCAATAAAATGAAGGAATTAGTATG +ACAAACTTTTATGAACAGATAACAGAATCTCAACTATTTGTAACTGATATGCTAGATCAT +ATGATGTATGAATCTAAATTCAGTCCTGCTGCGCATGGTGTAAACAAATGGCTTCCAGTA +AACGAATTCATCAAACGTTTATCTCCATTTGATGCTAAATCGCAAAATTTAGCAGATAAA +AATGCGTGGGTAATTATTCGGCAAGTATTAGCTACCCGCTTTGCGGTAGAAATTGATCAT +ATTGATTCAGGAATTCCACTCATTATTGAAGTCGGCGATAAAAACCAATTTGAAATATAC +ATCACTACATGGGGATTAACGAAAGCTCGTGTAGTTCCAAGTGATTTGTAATATGCTTAA +AGTTTAAGCAGTGTATAATGGTTTCAAGGATTCTCTACCAGATAGATTACGTTCCAGAAG +GAAAGGATAGACCTTGACTATGTATTCTTATTTATTATGGTGAAATATGACTCGAGTAAC +AACTTTGGCCAGTCACTATGGCCAAATAGCTGATAATGTAACGTATAATCTTTTCAGTAA +AATAAAAGAAGAAATTGAAAAAGAATTTAATAAAAACGCTGAAGTTGGGTGTCGTAGCTT +CATTTGGTATCCAAGCCCAAAAGCTGGCATAATCAAAGAAGAAATTATTAAATGGCTGCA +AGATGAAGGTTGCGCAGTTGTTTGGAACTATGACCAGAAAGATGGTAATTGGGTTGAAAT +CGCTTACTAAGGAATAATTATGTTTGAGAAATATAGCACGCTAGAAAACCACTACAACAA +TAAATTCATTGAACGTATCCGTAGTGCTGGATTTGATTTGACAGAAACATGGGTAGCTCG +TGAAAAGATTCATGGTACTAACTTTTCTATTATCATCACCAAAGACACAGTAACGTGTGC +GAAGCGCACCGGACCTATTCTTGAAGCTGAAGACTTCTTTGGTTACGAGATTATTCTTAA +GAAGTATGATAAGTCTATTAAAGCTCTCCAGGACACAATGAAGAATATGACCACAGAATC +TTATCAGTTATTTGGTGAGTTCGCGGGTGGTGGTATTCAGAAAGGTGTTAACTATGGTGA +AAAAGACTTTTATGTCTTTGACTGCCTGGTCAAAACTCCAGGTGGAATTGTAGAATATTC +TGATGATTACATTCTAACAGCATTTTGTAACGTGTTCGGATTTAAAATGGCTCCGTTGCT +GGGTCGTGGTAAGTTTGACGACCTTATTCAGATGTCCAATATGCTCGATGTTGTTGTTAA +TGACTATAATAAGCTAGCGGAAGCTGATTTGGAAGCTGCTAACCTAAAAGTATGGCCGGT +TGTTGTATCAGAAGATAATATTGCAGAAGGTTATGTTCTGAAGCCTTGTTATCCGAAGTT +CTTTAATAATGGTGCTCGTGTAGCGATTAAGTGTAAGAACTCCAAGTTCAGTGAAAAATC +TAAATCTGATAAGCTGATTAAAGCGAAAGTGGAATTAACTGAAGCTGATAAGAAATGCTT +GTCTGCTTTCTCTGAGTATGTTACTATCAACCGTGTCAATAACGTTATTTCTAAGATTGG +CACAGTAACAACTAAAGATTTTGGTCGAGTACTTGGTCTGACGATGAAAGATATCTTGGA +AGAAGCAGCCCGTGAAGAAGTTGTATTGACTTCTGCTGATAATCCCGATATCGTCAAGAA +AGAACTAACTCGTATCCTTCAAGAAACTTTACGTCCAGCATGGATCGAATTAATAAGTTG +AGATTTGCATTAATAGGGTCCAGGGAAACTCCGAGACGAGTCCTGGATTTAATGAGTCTT +ATGGGGTTGGCCTTTTCGGAGGCTGGCCATTTTTCATATTCAGGTGGAGCACCTGCCGCA +GATGAAGCATGGTTAAGTAGATATGATAGAGCGAATTCTCTTAGGATTATTCCTTACAAC +GGCTTTAATGGGCTTGTATCTGGTACTGGGGTTGCCACCTGGGAATCTATGAGTAATGAA +GCTCGAATAAAAAGTTACATAAAAGCTAAGCAAGTATTCCCGGATTTAGACAATCAACGA +GACATCGTAAAGACTTTATTTTGCCGTAATGCTATGCAGGTTTTAGGTGAAGATTGTATG +TCACCCGTAGATAAAGTTTATTTTTGGGCAACAGTAAGAAATGGAGAAGAAGCCGGTGGA +ACTCGTATCGCTGTGCGAATAGCTAGAGCCCATGGAATAGAGTGTATTAACTTGAATGAT +AAAAGAGTGTTTGCTGATCTTCAAGAAGAATATGCACCAAAGTTTGACATCTTCTCTCTA +TAAACAACAAAAGGGCCTTTCGGCCCTTTATCATTCTTCAATGATGATTTTTGGTAACTT +AACACCAAGAAGAACAGACAAATCAGAACGTCCTGCCATTTTGTCCATATCTCCCCCATC +AATAACTCGAGCTTCTTTATCGTCTTTAGCTACAGTATATGGGTTTGCTGAAAGTGCATA +ACGTACCAGCAAGGCGATAGATGGTTGCAAACTTTCAGGGTCTGTTATGACCTTGAATGC +TCCTACATGCTCTGGGTCATCCAGATCGGCTCCTTCAGTGTATGGAGCGTAGAAAATAGA +ACCAATCAGCTCTTTCTCCCCTAACTTCTCGACCACCCCTACGATCACGTAATCCAATGG +ACTGTTAGTATCACAGTACAAAGGCAATCCATTAGCTAAGAATCCGTAAGCATTTTGTGA +CAAATACTGATCATCTTCAGGTTTATGCTTCAACCAACCACTTGCAGCCAGAACAGCTGC +AGCACGAGTTGACGCTACACAGAATGATGCGGTATAAGTTGACTCACGCTGAATATGAGA +AACCATTTCACAAACCATTCGATACAGAGAGCGACCGGCTTCTGGAGCTGAAGCATAACT +CAAATCGATAAATCCAGTATCTGTGATGCCTTCTACTTTATAACGTTTAGAAACAGTAAT +CAAAGACTGCAGAATATCTTTGTTGATTTCATCAGCCATTTCAGTAGCTAGCAGATCTTC +AATAAAGCTTGGTGCATCAAAGCCGTTGGCTTCTAAGTCTTGAGCAAGTTCAACTGTGAC +GCTAGTTTTTAATTTACGAGACTTAACATGAGTCTGCCATTTGTTGACCAAGAAACGTGC +TTCAGCGATTTCAGTATCTTCACCACCTTCAAACTTTTCAGTGCGAGCAGCATCAGAGAA +AATACGAACTGTTAAAAGAACTACAGCAATTTGAGTAGCCAGCTCCATATCAGTTTCGGT +AATAGAAGCAAATGGATTATCTACCAGAGACTTATAAACGATTTTATTCAACTGAAACAG +ATCGCCTTTCATCAGCGAGCCTTTATTAGCTGCTGTTACTTCGGGAATAGTTTTACGATC +GACAAATCCAGCTTCACCTGCGTATGTAGCACCAGTACGCCATGTGAATTCATTATCTTG +GTTTAGATATTTGATTCCATAAAATGCTGCCATTGGTTGAGTAGTACGTTGAGTAGCAAC +AATATCAGAATAGATTAATTTAGTGGTAGCGCGAGTCAAGGCAACGAGATTTGGGCGACC +AATTTGGTTGCTCGTTGAAATAGTTGATTCGCGCAGGAGTTCGTTGATTTTAGCCATCGC +GCTTTTTCCTTCGTTAGAATATATGAATATTTATTACGTCCAGAAACAACAAAAGGAGCC +CGAAGGCTCCTTATAGCATTAAATGCCTTTAACGTAAACGCGACGGAAGTAAGCGTTCTT +CGAGAGAGTTCAGCAGAGACGGCATACCAGATACGATACGGCCTTTAGGCTGTTGAGCCT +GAGAATCAGCGAACGGGTTAATACCAATACCGTAACGGGTTTTGAATCCCATGACCGGTT +GGAAGTTCTTCGGATCGGAACCACGCAGCGGGGTCAGCGCAACATAAGGAGCGTAGTAAA +TACCAGCATCCATTTCGTTGGCACCTTTGTAACCGATGGTGAAGTAATCTTGGCGAGCAT +ACTGGTCGATATAGACACGGTACTTACCACCCAGAACACCAGCGAAAACAGCTTTAGTGG +TGTCGGTTTCGAAACCACGACCCAGACCTTGAGCAGCCGGAGAAACGTTAACATCGACAG +CAGCCAGAACGTTAACAACGTTACGAGATGCGATCAGGAAGTTACCAGCACCACGACCGG +TCTGACGAGCGATTTCAGCAGCTTCTTTGTCGATCTGGAACAGCAGAGCTTTGAAGCTTT +CACCTGCCCAACGAGCACCGCGGATATCAATCGGATCCTGGAAGTCAAATACACCAGCTT +TAGCACCAGTAGTCAGAGTCATACCAGTTTTACCAACCTGTGCAGAGTAGTTGATCCAGT +CAACAACTTCACGGTTGATTTCCAGCATAATTTCTGTTGCCAGAATACCGGACAGCTCAG +CATCAGCATCCATACCGTGAACAGCACGAAGGTCTTGCGCCAGTTCGATAGAATAAGCAG +CTTTCAGCTGACGAGATTTAGCTTCGATAACTTGTTTATCGATACGGAATCCCATTTCAT +TCCAAGGGTTATCCTGGGAACCATTAAATTGTTCTTGCAGTTCTGCAACAGAGGTAGCCA +TACCTTCAGCAATTTCAGCCAGTTGACCAGCTTCGATCAGCTTAGTTACTTCAGCATCCA +GTTTACCAGCATCGGTAGCACCAGCATCTGGGGTTACAGCAACTACGGCCTGGAGGTGAG +CACGTCCGGTAGCAACGAAGTCGTGAACTACGATAGCACCTTCAGCAATAGCAACACCAG +CAGTCAGTTTGTCAAATTTCTCGGCAGCACCCTGACCAGAATACATTGCGTCCGGAGAGT +ACATCGGGTGGAATGCTTCTTTAGCACCAGCAGCCAGAGGATCGGTACCGTAAACTGCGC +GCAGAGCGAATACCTGACCAGTAGGAGTACTCATCGGCTGAACGCCACAGATATCAAACG +CGATCAGGTTAGGGATTGCACGACGTACCATACCCATAACAGCCGGTCCAATCTGAGTTA +CAGCACCGGAAGTTTGACCAGCAGCGATATTCTGAGCATCGTAACCATGGTCACCGCCGA +TTTCAGCTTCAGTCAAGAAATCACCGAATGCCTGAGCGATTTTTTCATCGCGATATTCTG +GAGAAACGGCGAAGTCTGCTTCCTGGTTTTCAAGAATTTTTGCAATCAGAGCCTTTTTAG +AAGCTCCAACAATTTCCGGCAGTTCTTCGTTTTCCAGCAGCGGCTGCCATTTTTCTACGA +GTTTGTTCTTTTTCATGTGTTGTATAACCTTGTTAAATTAAGAGAGACGTGTTGCACCAG +CAATGTAAGCGTTCATCATAGAGCTTGTCTGATTTGCTGGATTCGGTTTTTCTTCAACCG +CTTCAGTTACGAAATTAAGGCCGCTGGCCTCAGGGTCAACAGTATTTATACTTTCATTTA +CTGCCGGTTTTTCAGCAGAACCTTTAACCATTTCTACGATAGCATTCAGTTTTCCGCCAA +ATTCATCAGAATACGACATACCTTCAATCAGAGATTCGACTTTTTCTTTTTGAGATTCAG +TCAATTCGCGAACAGCTTCATTTACTGCAACTTCGCGCTGAACATAATTGATGTATTCGT +TCTGCTTTGTTACTTCTTCGAACAGTTTAGCCGTAGCTTCTTTCTGTTCTGCCAGTTCTT +CTTCCATTTCAGCAACAACATCTACTGCTTCTTCTGGAATAACAACGTTATGTTCTACGA +ACAGTTCTTTCATTCCACCAAGCATAGATTCGAACAGATCGGCTTTAATACCGCGATCTA +CTGCCAGCTGATTTTCAGCCATCCATTTAGTTGCCAGGTGATCGAAGAATTTAGCAGCAG +CTTCGGTGATTTTCTTTTCAGCTTTTTCTTCTGCTTCTTCTTCTTTTTCTTTTACCTTTT +CTTCCGCTTTTTCAGCGATAGCTTCGATACGTTTTTCTGCCAGCGTCGCGGCACCTTTTT +TAACTGCGGCTTCGAATACAGTGCTGAAGTTAGCTTTTACTTCCGGAGAAAGTTCAACTG +ATTCGAAAACGCTGTCCAGTTCTACCGCGATGTCCAGATTAGTGGATTCAGCGATTAGTT +CTTGTTTCAGCATTTTGATGTTCCTGTTGTTAAGTTACATTATTATTTATAATGCTTTTA +ATGACTCAGTAAGAGCCTTAAATGCTTCATCAGCACTTTTCTTGGCAACCGCTGGTGCCG +CTTCATGAGACTCACTAATTTGTTTAGGAGTAACCCATGCATCTGGAGCGGAAGGTCCCC +ATACTGCGTCAACACCGACAGTAAGTTTAAACCCTTCGTTCACAATCTTGTAGCCTTTAT +TAGTCTCAGTCAGAGAACCAAGTCCTCGAGATGAAACTCCTGGAATCCAACCAGCACGGA +TATTGGCAGCTAGTTTATCGCCTGGGCCATGATCGCCTTCGATAATGCGTGCTCGTCCGT +ATACGTCATTTCCTTTCCACCACATATCTTCAATGATAATGGCAGCTTGCATCGGGTCAA +CGTTTGCACGAGGTGGATGGTTTAATTCTCCTAATGCTTGCTTAGTAACAACTTGTTCTT +TTATATAGTTGGAAACCGCTTTTTCAAGAATACGCTTCGGATAGAGACGTTTATTTCGGT +TAACAACTTCAGCTTGCATAAAAATTCCTTCGATGTAAAGACCTGGAGCTAATCCAGAAT +CTGTTCCATCATGGGATTCAAGCATTGGAACTCCGTCGATACATTCGCCCGGTTGACCCC +AATGCTCAATAAGTAATTGGGGTTCATTCATCAGCTTAATCCTAGTGCTTCGCGACGTTT +CATCGCTTTCTTACGCTTACGAAGTCCACGAGCTTGGCCGCTTGGGTTAGCGCGTTTTGA +TTTGGTCGCTTTACGAGCGATTGAACGACGTTTAGCTTTAGACAATCCGGTAGTTTGAAA +TGCGTTACGAGCCCGAGTCTTACGATCTTTGGTGCGGGTAATTTCACCGCGACTTGAAAC +ATGCTTAACGATAAATTCGTCGAGCTGCATTTCTTCATTAATGGAACCAAGTGCAACCGC +GAGGTCGGTCTCAGTTTCCATCATGTTTTCTACAATTGTATTTATATCGTCTTTGCTTAA +TGCGCCGGATAATGCATCGTAACGACCCTGAGCTTCTGGTAAAAGTAATTCTACGCTTTC +AATTACTAATTCATGGTTATCAGGGATCAGAAACATTATTCATCATCCTCTTTTTCGTCT +TCGTCATCTTCATGATCTTCATCTTTTTCTTCTTCAGGCTCTTCGCCTTCGATCATGACT +GAACGTGCAATTTCGATTTTGCGCTTTTCTACTAAATCAACGATTCGCGGAGCCATGGCT +GCTTCAAACATTTTAGTTGCTTGAACGAGGTCGTTGGATAAGCAGGCGGAAATGAATTCA +TTTTCCATTAGAAATCCTCTTGATTTTCTGGGTCTTGGTAACGAGCCTCTTTAGACTCTA +ATTCAATTTGCTTAGCCTCTTTATCAATTTCATCATCGTTCATATGAAGAATATCTTTCA +TAGCTGTCTGATGAGAAATGTACTTACCAATAAATGGTTCGGCCATTTGAAGCATATTGA +TTCTGCGTTCCATGATTTCAGCATCTTTAAGTTCTGTAAAATATGAATCGCGGTGGAATG +AAATCTTAATATTATTTATTTCATCGTTCCACTCATCTTCTGTGATAACGCCTTTAAGAA +CGAGGTTCGTTTTAAGCGGATCTAAGAAAATTTCTTCAAATTTGTGCTGCAATCCACGGA +TGAATTTGGCAAAAGTTAATTCGTCGCGGGTAATTGAAGTACCGGCATCAAATTGAACTC +CACCTTGTTGATCATTTGGAATACGAGACAGAGGAACACGAAGTGCCATATAAAGAGCAT +TTCTGAACCAACGGACGTCATCCATATCACTCATACCCTGCATACCAGGAAGAGTATCAA +TTTCTGTTACTGCTTTACCGTCACGACGTTGTAACCAATAGTCTTCTGTCATTGACATAT +TATGTTGCTGGTTTTTAATCTTACCAGTCGTGGCATCATATACGACACGGTTTTTCATCG +TGTTCATGATATTCTGCATATGTTGTGCAGCCTTACGTGAAGGCATATTCCCTGTATCGA +TATAAAATACTCGACGATCAGGAGCACGGGTAATACGGTAGATAACTAAAGCATCTTCCA +TTAATTTTAGTTGGTTTGCAGGTTTAACAGCGCGGTGCAAATAGCCAACGATGTTTTGAC +CTGTACAATCCACTAATCCGGAATGAGCGTATACTACTGCTGCTTTAGGGATTTTAATTT +TAGTTCCAGCGTCATATATTCTGCCGTCACATTGATAAGATTCATGACCAGTATCGTAGA +TAAAATATTCTTTATATCCTTTTACGATCTTGACACCTTTTTCCATTTCTGTGACTATTT +CACGAACGTATTGAATATTGCGAGGATCTAAGCGACGAAGTTCTTTAATACCTTCTTTCG +GTTTATGCGGATTAATTATTTTGTGAAAGAAAATACGAGAATCGACATACCAACGCTGGA +AGTGGTCTGCTCCTTTACGTTGGAAATTTAATGTATTCAGCACTCCGTCGAATTCTTCTA +GCATTCGATCTTTGATTTTTTGGCTGAATTCGGTAGCATCTAAATCTAACGCTACTACAT +CATGGTCATCTTCATAAACAATTGAGTCCATCACGATTTCTTGAACCGCGTTATCGACTT +CATAGTTGTTCATCAGGTTACGATAAGTGTTGATTAATTCCCTGGTTGTTTTCATGCCAG +GTTCATGATTGCCCAACATTTTCTGGAAGAAGCCACTAGAAGCTTCATTCACAGACGATT +CAATTTCGTTAGCGCCGTCATCGAATTTAGGGGCGGTTATCGACTCTAAATCGTTATTGA +GTTGTTCTTTATAATTCCGTTCGTCTTCCTTTGCCCAAGGAGCGAACAGACTTAAAATGT +CGTAATTCATTAGAGTCTCCGTGAAAGGGTAATAAGCATATACTTATTTATATCTTAAAA +CTGGCGGATTTCTCCGCCAATTAATTACAACCACCAGTCCATTGCAAACGTAACTTCGAA +GGTTTCTACCTCGTTATTAGAATCCCAATCCATCTGAACTTCACCAACGTTAGTCGGCCA +CAATCCAGTGATTGTTACTTCTTTAGTTACAGTTTTGCCATCACGGTGATATTGACGCAC +AATTGCAGTCTTTTTATACTCTGCTGGAGTACCACCAGTAATTTCGTTGCCTTGGCCATG +ACAAATACCTTGCCAATCAACGATTTGCTGACGAGTTGTATGTGCGTCATCGTTATAAAT +GGTGACAGTCCAGTCATCAAACGTACGGTCGCCCGCAAGGTTAATTTTACGGTTCATATA +ACCGACTGGAACTTTTTCTACGATACCTGCTGGCATAGGAGCAGCTTTACATTTAAAACT +GAAATTTCTGCCGAGGTACGGAATTTCTACTTCAAACAAGTTAGGACGAGCTAAGTCGCC +TGACTCAAATGCACGAGTCATGTCATCAAGAAACATAATAACCTCTATGTGTTTATTTAT +ATGGCTCCAGATTCACCAGAGCCATCGAGACGAATTAAATTGTGTGATCTGTATATATTT +ATGGACCCTCGTTAGAAGGTCCCTAAGTTAATTCTATGATCACTGAGGTCCGATCAGCTC +GTCGAAGTCGGCGCCTGTTGCAGTTGCAACGAAGTTCAGAGTGATGTAGTTAATGCTTCT +TGCCGGCTTGATGTAGAACGAAGCAACAAATTCGTTACGATCGATAACTGCCGGGGTGTT +ATTCGTAGTATCACAAACAACACGGAAGTCATACATTCCACCCAATGCCTTAATACCTTG +GAGATATTGGCTAGTTTCCATACGGAAGCTTGAACGAGTGAAGTTATCGTTCATTTCAAA +CAGACGGTATTTAGAAGCATTACCGATGTTTGACTTCAACATGTTAAACAGACGACGAAC +GTTAACACGGTCAAATGGAGTAGGAACTTTAGTAGCAGTTTTATCGCCAAACAGAACGAA +TCCTTCACCACCGGTTCCAGTCACTGGGTTAATAGCTTCTTGGTACAGACGATCACGCTG +TGACTGACGCGCTTCGATTGCCAACTTAATTACGTTCAGAATCTGACCGCGATTATAACC +AGCTGGAGACATCCAAGGTTGGGCAATATTATCAGTACGAGCACACAGGCCTGCAATATC +AGCCGCTAATGGAACCCAACGGTTGACATCATTATATTTGTCGTATTGATATTTGTAGTT +ACCATCAATAAATGAATAAGTACTAGAAATATTCATATTGTTGTCGGTATAAGTACCGGT +CGCGGTACGCCAATCAACCAAGTTATCAATTGCACGAGTCAACGGAATATTGACAATAGT +TTCACGCGGAGGAGAAATAAGAACCAGACAGTCCTGACGCTCATCACCGATTGCAGAAAC +ATGTTTCTGAACTGTAGAAGCAAATTCTAAACCTTCACCAGCACATGCACCAGCAATAAA +CAGTTGAACCGCGATAGATTCACGATCAGCGAACAGATCCCATGCTTGCATTAAATCACC +AGCAGTAACGCTAGAGTTTGCAGACACTCCGCCATTTAATTTGACAATACCAGAGAAGCC +TTTAGGCCAACCAACTGATGTGCCAAAAATGTAGTTACTTGAACCTTTAGCAAAATAATC +ATCCATAAAGATGTTATTGCTATATACGTCTTTTTCACCAGGCTTAGTAGACAAAACTAC +AGATTCTACAATTGCTCCATCACGACGGACAATAATTGCATACTGATCATCGGTCTGAGG +ACCATAACCAAACACAGCCCGCGCGGTTGATGCACGAGTACCGCCTGCAGGATAAATCGG +AAGTTCAGCCGATGCTCCTTTTTCATACTGGGATTTAGATACGATTTCAATTTCCAGTTG +AGATCCAATTTCACCTGGATACAGAGCAACAACCCCTGGAAGTCCGTATTTTTTCAAATT +GGCTTGAAATTCAGTTGCAGTCATAGCTGCGTCAGCATTTTCGGGTTCAGTCAACAGAAT +ACCAGAATCTGTGATAATTTTACCTACAGATATTGCACCAGCTACGCCAGATGATGCTGA +AGTTACTTCAGCAGTCCAGTTTGGACCAAGATCAGGATATTGGTTAGTACTTTTCGCGTA +AGCAACTATTTTACCAGTAGGAATATAAATTGCTTTAATTTGCCCTTGAGTATCAACTTC +AGTAACTTTACCTGCGGTTTCTACAACAGTCTGATTGTATTTAACACGAATAGTATCTCC +AACTGCATAGTTACTACCAGCAGCTGAAATAGTAGTTTCGATGTTTTCTGCGATAGGAGA +AGAGTTTTTAGCTACATCTCTATTAACAGCTCGAACGATACGAAGATCGTTACCATATTG +CAAGAAGTTCATTGCTGACATAAAGTAATCAGCAGTTTCGCTATTCGGAGTACCAAACAA +ATCAACGAGTTCAACTTCGTTGGTGACCTGAATCATCTGATAAGCAGGACCCCATTGAAA +TTTACCAGCGATTGCAGCACGCCCGGTAGCATTATTAACAATAGTGCTTTGTACACTAGT +TTCTTTGAGCTCAACGCCCGGCGACAATAAAGCCATTTTTAAATCCTCTACTTGTATGCT +TTAATATATTTATACAAATGACACGCCACGATCAAATGACTCGGTGTTGTATTCAGCACC +ATTAACTGCATCCAGAATTACTACTGGAGCGTAATCATCGTTCATATCTTCCAATTCGCG +ACCAAATACTTCGGCTGCTAAACGCATATCGTCTTTTTCAGCATATTCCACGAATTTGGA +TTGGGTCGAAAGCCAAGAGAAAATAACGAGTGACATAACCAAGTCATCATGATACCCTTC +TTCTGCCGCCCAAGATACACCTTTTTCACTAAACGTTCTAAATTCTTGAATAGTGGCTTT +ATGGTGTAGAACTAATTTGTCTTTTTCAATTAAATCTTTTAAAGCTGAACAACCAACTGC +TTTAGTTCTCTTCGTCTGTTTCATACCAAGATCAACAAACGAGTCGCAAATTACACCCTC +ATATTCGAGGTCCATGTAAAGAGATTTAGCAACTGAAACTCCAGTAGAGTTCAACTCAAT +ATAAATTGGAGCTTCGTTATATTCTATCAAGTATCTCATTACGATATCAGGAAGAATAAG +GTGAGATGTTGTGTTATTATGAAACACAGCAACTTGTTCCCAGGTTGAATTCGTGATATC +GATTATATGCATTGCATGATAATCTTGACCACGACCTTCAGAACAGTCCAGCGCCGCTAT +ATATTTATGCCCTTCTTCTGGTTCTTTGAATTTGGTAAATCCATTTGAATCTGGAGTAAC +TTCAATCCAGTCCATATTCGCCAATTTCATACCGGAAATTAATGTTCCTGATGTTCCATG +GAATTCTGCACAGTGTTCTTGCTTAAATTGCTCAAGAGATGAAGCACTAATAGTTTGAGA +CGACCATTGCCATCCATCATCAAACATATCATTATCATCATACAGACGTTCTTTTACTGA +GTTCCAAATAGCTGTATAAGGAGCAAAGCCCGATTTACCTGTTATTGCTGCATCCCAAAT +ATCATAGAAGTGGTTCAATCCATTTGGCGTAGTAGTAATAATAATTTTAGAACGACGCCC +AGATGAAATAACTGGCTGAATAGCAAGCCAAGCATCAATAAAGTTTGGAATAAATGCACA +TTCGTCAATATAGATCATTGCGAATGAGTTACCACGAACAGCATCAGGAGATGACGCATA +AGCACCGATAGAAGAACCATTATCAAGCTCAATAGAGCCTTTGTTCCATTCTACAATACC +AGGTTGTAAAAAGTCTGGAAGTAATTCAATAGCCTGCTTAGTACGATCTAATACTTCAGC +TGACATACTTCCTTTATGCGCAAGAATACCGATTGCTTTATCTTTGTTGAAACAAGCAAA +ATGCGCTAGAAATATTGCTACTACAGTTGTTTTACCAAGCTGACGAGACAGGTTACATAC +TGTCATACGCTTAGAGTGCATGATTTTAAGCATATCGCGCTGATAATCACGAAGTTGGAC +CTTGATAGTCCCGTAGTCAATGTGTGTGATAGCGCAATACTTCTCTGCGAAATAGACTAT +GTCATCACGACATTTCTTCCATTCAGCAATCATTTCAGCTGTATAGTTAACTTTAACGTT +TGCTCGTTTAAGGTTCGGAAGACCCATATAACGAGATCGTTTTACGTTTTTATTTTTAAA +CGTTTGAAATAATTCTGGATTCTCGCCTTGTAATTTAATTTTAACTATTTTATTAATGCG +AAGATAATCATCAAATTTCTCTGGATACCACTTATCATCCCATTGAGATTTGATCCAGTT +TATTCCTTGATCTACCTTTCGTTCTAATTCGCCTGGTGGGCGAATTATAATTTTATTTGT +GGTATTTAGCGGATGTGTGTCCGCTAATACGTTATACGGCTGAGTCTGTTCCATTTACAA +TTTTCTCCTGACGCTCTTGGGATTCATATGAATCTCCCAATTCATCCATTAAATCAGTAG +GAGAACTCATAAAAATGGTAGCGTTTTCAATATTATTAGTTTGATTACCACCTTGTGTAC +CGACGTTCTCAGCAGTAATTTCTTTCATTTCTTTATGAAGTTTAAGAATTTCCTTATTGG +TAGTAGTCATTTGCCCCATCAGCGTGGCGAATACTTCCATATGACGAGGAGAATCAGCGT +TCTTTGCAGTCTCTAGGAAAATCTTAGCTGCATCCATCAACATCTGTTGTTGGAAGTGCA +TGTTTTTACGAACAACTGTATAGTCATCTTCTAAGTCAGGTTTACGATCATTGGGGTTAC +TTTTAACCTCAACTAATTCCAGCTTTTCATATACTGGAATTTCTTCTCCTCCGATACCAG +GGAGGTCCCCGATATCGAGAAGTTTAGCTATATCTAATTGATCTGTCATTGTTATGTCCT +TGGCCCAGGTGGTACAGGAGCAACCGGAATAGGAATATCGGCGGTGTAAGTTTGTTTAGA +CGATCCGTCCCAATCTTGTATTTCAACATCCCGTGGTTCTACTTCAGAATCAACTGATTC +AAATACACCTTCAGGCTCGAGTACTTTACTATTTGCATGGAAATCAAGATAAACAGTACG +AATTTCGCCTTTAATATCGTTCTGAGGTGGGTACATCCACCCTTGAACCTCAAACATAAT +TGACCATTCAAGTCTACGACGACTTATATTATCTCCGTCAATTTGTTCGTCTACTGATAT +AGATTGAAATACTACTCTAATATCTCGGCTAAATTCAATATCTTCACCGTATTGCTCAGT +CATAGTAGTATTGAAGTGTGGCTGAAAATACGGCAGAATCTGTTCAACAATTTGATACAT +GTCATCTTCATAACGTGTAAAAATTCCTAACTCATATATCATTTTATATGGGGTAGGGTT +GAACTGCGAAATCATAGCAGATTGACCATTTTGAATAGCGGTTCTATTTACTATATTCGT +TTTATATGTGCCGTTATACATTACATCCACAAGATGCAAATTCATGCGAGGGAGAATAGT +CTCAACCTTCGCTACATCGTCTTGTGAGTTAATTGAAGTCCACTTATTGAGCTTCATCAT +AAAGTGTTCTTTAGATGCGTATGTAATAGGGACTTTAATGAATCGGGTTCCAGTGTCTTC +CCTTGTTCTAGCTACTTGAATATTGGAGAATAAATCACCCATCAACAATGCGTAGCGGCG +GAAAGACGAATGATAAAAGTGACCAAACATTTTTTCTCCTACGGCCCCGAAGGGCCAATA +TGCTTTATATTATTTATGACATAAAATTATCATCAAATGGACTAGTCTTTTCGGGGTCAT +TAACACCACGGCCATTGATTACAATAAAGTCTTCAACAAACTCAGACGCTTCAGCATTAA +TTGCATCGACTTCTTCATATTGAACTTTAGATATATCAGCAAGTCCATCAAGATTTTTAA +CTGGTTCAAGATCAAGTTCGCTAAATTCAGGTATTTGAATACCGGCATTTCGTTGCAATT +CAGGCTTAAGCTGCTCCCCAGAGTAAATAAATTTGCCTGCAGTAATTTTACGAGTAGCGT +TACGTCCTACTTGATAGAACGGATCATATGGCTCAACCCAGTTAATTTCAAACAAACTGT +TGTCCATTTTGAAATAAATTAAATCACCTTCACGAGGTTCACTTCCATCGCATTGATGCT +TAAACAACCCAGGGTTTATTGTTAAAGTGACTTCATCTGATACTTGCATGCCAAACTTAC +TAAAGAAACTATTTGCTCCTTCATATCCCTCAAATGAGTTCAAATATGCAGCGAATAACC +AAGCTTTGGTAAATTTGTTCTGAAGATCTTCACCGAATAATTGATCTGGTTTAACGTATT +CGCGAGGAATGAAATAACACTCTACCCCTCTCATTTGAATGGCCTCAGCCACTAGCGTAT +CCGCCAACGACTGAGTATTTTCATGACCATTGAAATTTACATAAGGGTTAAGTATATTTG +CTACGTTAGTTTTCTCATAACCAGAACGATTTTCGAGCTTCGCAAATAACTTATTGTTCA +TCATCCGATTAATATTCCAAATGGAGGATCAAGTAAGTATAATTCTTCGCGGAGACGTTC +TTTTTCTCGTTCTGCTTCTTCGAGCAAGCGTTCACCGTTCAGAGTAACACCGCCAGGTAA +TTGAAGACCTTGATGCTTAGCTAATACTTGACCGAGCAATTCTTTTGATAACGTGGTTGC +ATAATCTTTAACCCATCGGTTATTATATGCACCTTGTTTAACCATTGCATCTTCACCAGC +AACTCGCTGAGGTAAAGATCGGTCCATGTTATCCCATTGTTCTGCTACTGACCACTGATC +AGCTGATCCAGCATATCCATAGCCTGCAGTATTTCCAACTGTTTTATCTACACCGGAATA +AGCTAAAGTGAATACTTCAACAATGATAACATCGCGTTTTTGGAAGTTACCCATTACTTT +AAGTTGTTCATTAGCTCCATTATACCAAAAATCTGGTAATGGGCTTAGCATATCTTGCAT +CATACTCATATAAGTCATGAGTTGAGTGAAATAACCAAGATCAGCGCCAAAGGCGTTCGG +GCCATAAAATTTATTACATGAACTTCCCATTCCACCATTAATTCCAGCCATTCCCAAGAG +AAAATCTGTGAACCATGGATAAGTGGCATTACCGTCCATCGAAGTAATAGAACCGATATT +CGTGCGAACTATTTTAGTTACAGCAAAGATATTTGAGCCGCGAAGGTCGAATACACCATG +CTTGTATAATTCATCATCGTCACCAATATAAAATGCGTGATATCCTTTGTTTAAACCATC +ATAATGATATTCGCCAAATAGTTCGAGTGCTCGTTGAATGCAGTTGTAGATCATGTCTTC +TGTTAATTCAACGTTTAAAATTGGTGCGCCTAATCTAGTTAAGATAGCGTCTTTTAATTC +TTTTGGGTTTTGTGGTGCATGAATAGACATAATAAACCTCTAAGGGCCCGAAGGCCCTCT +ATTAAGGAAGAAGAGTTTCTAAATCAACCCAAGCTCCATCTTTACGAACGTAAGCTTTTC +CATCGCGTGGGGCTTCAGGAATATAAGTAACAGCTGCACTTTCAAGCTGGTTTATACGGG +TTTGATGTTTGTTTAAAGTATTTAATACACCAGCAGTTTCAATATTAGGGCTATTTGGAT +TAGTTCCGTTCATTTGATTACTCAAACGAAGAACTTGTCCTTTAAGACCTTCATTGTTAT +TACCGATTTCCACTTGAAGATCTTGAATGGAATCGTTCATAGTACTCTGTTGTTGCTCCA +TTGCAGTCATTTTAAAAATCAATGAAGTCTGCGGAGGTTGAGTACCCTCTGGAACAATAC +CAACAACCTGATTTATCCATGCAACCTGACCACGAAGACCAGATGAAGTATCAGCTCCAA +CTATTTGTTGAAGAGTTGCAATTGAATCAGTGTTAGTTTTTATTTTTCCATTAATAGTCA +ATGGAGCTGAAGGAACACCTATTTGCTGCTCAATAGAATCAAGACGCGGTATTACGCCGT +TTTGTCCACTTAACTTATTATTAATTAAGTTAATTGACGTGATATTGGTTTGAACGAGTG +CGTTGATATTAGAAACACCATCATTCATTCCTATTGAATCCATCACGTCTTCCATTGACA +GTGACACTCCGTTTATACTAGCATTCATACGATTCAATCTAGTATAGACGTTATCAACGG +TAGAAGAAGATCGCGGACCTAATTCTTTACGTAAGTTAGTAACTTCAATAGTTAATGATC +CAACGTCAGAATCATTGAATTGATCTTCTAATATTCCGACACGCTCGATAGTCCTAGAAA +TAGCAGATGAGTTGTCAATAATTCGACGCTTCATGCCAGTAGCTTCAGTACCCTGGACCG +ATTGTCCATTTATATCTTGACCTGGATATTGGCCCATTTCTTTTTTAATCCAGTACAAAT +CTCCACGCACCGGACGATAAAAAGTATCTAAACTTGGATTATAAGTACCAATATCAGTTT +CTATAGAATCCACACGATTATCAAGATCAGTCAAAGAAACTTTATTTTCCGCAAGTGTAG +TATCAATTCGCTCTATAGATTCAGTATTTTTACCGATTTGAGTCATTGCATCGATATTTT +CAGACACCGCAAGAATGCTATTAATTGTTTCAATTTCAGCAGTATGTTTATTTGTGGCTT +CCGCAACTAAACGAATATTTTCGTCTAATACAACCACGTTCATCTGTACCTGATTTGGTC +CTCGATTCAATTGCCCATCGAATCCATATTTAGTAGAAGCCCCATTGAGAGGCTCTCCAT +CTTTAATCCAGTTGATGCGTTGCTGACCTTCAGCTGGAAGTCCATTTACATATGGTAAAT +CTTTTAATTCCATAAGTCCTCTCTTATTTTACGCGAATAACGTAGTTTATCGCGACGTTT +TTCATTCTGTTTTCATTTGCGGTTCTAACTACTCTAGATGAATCAAAGTGCATTCTTAAT +GCAGTCCAACGATTATCCTTTGATCCTGCGTCATAGTTAATACTTCCCCAGCCATCAGCA +TAATATGCTCCATCAGGAGGATAGTTCGACCAAGCGGCCTGATCATCAATAGTCCAGTTA +CCCCAGATTCTTTGTTGAGCATCTTCCTGATATGTTCCAAAACCTCGTCCTGGATCAACA +CCACGCCCAGCATCAAATCCACGAGCTGTCACGCCGCGCATGTCTGGTAAAGCAAAATGA +TCTGATCCATTGCCGCCATAAGTAAAACCTATTCGCTGAAATAAAGTTGGATAACCAGAT +ACAGGCAACCATCTGCCGTTTGCGATCATTAAATTACCGTGATCACCATTGAAAGCAGCC +ATGAACATTCCACCAACTGGAACCGTTGTATCCAATTCTCCCTTTGTCATATATTTCTGA +TTAGGAGATGTATTTCCTTCATAAACACCACCAGTTGATACTGCAGCTCTATCTGACGGA +AGAACGGCTGCTGATCCTGATAAAGCTCTTGAGCCATCTCCTATATTGTTGGCTAATAAA +GTTGTGCCAACACGATTCGTTGTTGCACGTTGACTATTAAATTTAGCAGGTGTTACAGCA +GTATTATTAGAAGATCCATCCATATTAGCTTGCGAAGCTAATTTAATTGCACCAATATTA +CTTTCTGTTGCTTGCCATTGATTTAAAGTAAATGGAGATGCACTAAATCCTTCTCTGATA +TTAGGATCACGTAACTGAGCTAATGTAGCTAATCTAACAACACCTTGTGCTGATTCTGTA +GCTGGTCCATATGCTGGAATAAGAGCTGTGGCGGCGGCGATGGCTTGTTTAGTTTTCAAT +GGAGTCATTGCAGTTGTGTCATCTACACCAGCTTGGGCAGCTGGTGTAGATGACAATTTA +AGAACACCATTTCTAGATTCAGACGACGTTCTATTTGTAAATGTCCAATCAATTACATGC +TTTAAAGCCGATGCTACGATACTTCTGTCGGTCGCTGTACCAGTCAAAGCTTCATCATTT +GTTGCGTATCTAGTTACTCCAAATACGTCGGTTGTAGCTTGAGGGCGCTGTAATCGAAGT +TCGAGAGTTTTTGGAGTTACGGCGACATCTCCAAGAATACCATTATTTACTTCTTCCTGA +GTACCTATTCTAATTTTTCCTTGAACAGTCTCAGTTGCAGAGGGAGTTCCTGACACAGCA +TCTGGACTTAACAAACTTAAAGCCTCTTGGACGTTGTCAACGCTAGAGTTAAAGTTGCTT +CCCGCTGGATCAAATCTCACATAAATCGCTTCATCTGATATGTGTCTATTAGTATTATTC +ATTATGCGATTCTCTTAAAGTAGTGAAGTAACACAGGATCTGTTAAACCTTCTAAAGTTT +CAGTTTTAGTTCCGATACGGTTCCAAACACCATATCCCTCTTTTCCTGGAGTAAGAGTAG +TAGATGATACTGTTATTCCATATGATGACAATGGTTTCAAATTATGCGATTGATTGTCAA +TATAAGTAACAACTAATTCATTACCATTAGTTCCATCTGCGACAGTATTGACAACAAAGT +TATTTGTAACAGCATCTTCTAAAACTAATTTAGCTTTAGCTTTAACTTCTTCTGCGGTAT +CTCCAACAGAAACTAAAACATTAAATCCAAATACTGAAATAATACAATCTGAGGTCTCTC +CGTCTTGACGAGCTACAGTTCCAGTAAAAGTCCATTGATCTGTTTGAGATACACCTTCAG +GAGAAATACCGTCAGTATTAATTTCAATTGAGTTTATTGGACGAATACTGAAACTGCGAA +TATCTTGTATAGCTGATTGAATGTTAGAATATTCAACACCCTTAGCTAATTGAGTAATAG +TGATACTGCCATAAGGCTGTTTATTCATCACATCATTATTTTGTGTCGTATTGAACTGAA +GATAATCAGCTAAGCGGGATACGACACCCGCTTTATTATTCAATAAACTCATTATGCAAT +CCTTAACCAGCGATAAACTGTAATTGAAGGTTGAACTACATCAATTGAAATAGGACTTTG +ATGTGTTTCATTGGTCTTCGCATAATCTTCACGGTATTTAGTGTAAACTGGTCCAGTATC +ATCAGGATCAAACTGGCATCCGCCTACAATAATTGGTCCATTATCATCTGAAATTAAAAC +TTTAGAATCAGTTTGAGTTGGAGGAAGATTATTGTTATTCAATTGTACTGACGTAGAACC +GGTGGTTCCGCCAGCTGTATGTGATGGATTTCCTGTAATGTCTAAATCATTATTATTCAT +TGCAAATCGAGGATCATTTACGTCTGAATTCCATCCTACTGTAACTTGACCTTGTCCCCA +TAGAGTCCATCTTCCGAATCCCATATAAGTAGCTGGGTTGTTCGGGTTAATAGCATTTTC +ATAAATTGTTCCAATTGGATAAATCAAATCAAATATTGCAAATGGACTATTAACAATAAC +TTCGCTAGGAGGAACAGCCTCTGTGTTTGGAATACGAGGTTTGTCGAAATCAGTGATTAC +AACTTGACCAGTAAGACTAATTGGAGATCCTTGAGAAATATACTTAGAATCTGTCTCTTG +GATAATGTCTTCTAAATCAATAGTAGTTCCGATGTTATTATTGAACCAAGTCAAAGTTAT +TATGTCACCATGCTCCATAACTCTGTCAGTTTCAATTCCAGTGATAACGTTATTTTCATC +TATTACGACAAAATAATCAGTTAATCCTGTTCCCCATGTTCCACCGAGTAAAGCACAGCT +TTCTGGAGTATCAGCATCAGCACCATGACAAAATACATCAGGTTGTCCAGTTGTTCCAGC +TTGAACTTGAGTAATTCCGTTAAATTTAACCTCAAGAGAATTTGGATTAATTTTTTCGTG +GGCTGAAATACCAAACATCTGGACAGTGAATTCTTTGGTATTTTTTAAGTCGCCAACAAA +CGATGTTCCTGGGATAGATTTTTCAGTAGTTCTAGAAGAATCTTTCAAGATAATTTGGCG +TTTATTATATGAGCTTCTCCATTGACTTAATCCATCAACATAAGAAACTATGATAACGGT +ATCGCCAATATTACATTTTTGACGCAACCTAATATCTCGACCGTTCAGCGGAACTATTTC +ATTACCCGTTCCAGGAGAGCCATAATCGGAGTTTTCACTAAAATTCGGTCCGTAATAAAG +AATATTACCTCTATGGTAAACTTCAGTATTTGCAATGTTATATTCATTGCCATCAAATAT +ATCTAAAAAATCTACTTGATCCTGAACCTCTACTAAGTATTCTTTGCGTATAACTGACGC +CATATCGCTATTACTAATACGGTCAATTTGCTTATTTTCAACATATTCCCAACGACCAGG +CGGGCAATACACGAGTTCTAAGTCGGAAAGCGGAATATTGAATTCTTTTGGATTAGGATC +ACCCTTTAAAGTATCTCCCGCTGCAGGAATAATAGTAACTGGGTTTTTCTGCCAAGTATT +AAACACGTCACGGAAGCGAACTACAGTGTTGTAATCTTGAACTCGTCCTTTAGGAAGTTC +AACTGTCATTCTTCCGGCAGTTGTATCTAATGCGTATGAATGACCAATCAAAGCTTTTAA +AGTTGACTGATCAGAAGTTTTAAACGTTTTCCAAGCACCAGCTGCATGTGGATTATTGCC +GTCGCCAAGTTGATAATAAAGATCGTTAAAATTATTATTAATTTTTTGACCACCTTGGCG +AAGGTAATCTCCTGAACCATCGTCAACAACGGCGCCAATTTTTAATTCTTGTTTCATTGT +GCTGCTCCAATTTTTTGAGTTGCTATTGATTTAATTGCAACTCTTAGTCCAGGAGCACCG +GTTGCAGTAACAGAAACTAAACCAGTGCTTTCAATTTGAAATGAGTAAACTGCAATATCA +TCGTCTTCGTTTAAAGCTCCGACTCTCATTACTGCGTATTCGGTCGAAATTATTTCACTA +TTGACAGTGTCTACCATGATATTTATTTCTGATGATTTCATTCTTTTCCCGTCGGAAGTA +CGAGATGTAATCAAAAATTTAGCAGCAATGTACTCGCTTTTATCAAATAACATTGCTTTA +ACTGAACCAGATCCTGGGATTTGCCAAGTCCCTTGAACAGGATCATGATAATCTCCGAAC +ATGTTTCTTATTCTGTAGTTCCAAACTGAACGACCACCTTCATCTGAAATACACCAAACA +GTTACTTCAGCATATGGAGCTGTCACTACAAGCGGTCCAGAGAGTCCCACAAAGCTATCT +GATGGAACTATTGTCAATGGTTTAGAAACTGACACAGTGCCGTTTGAGTTAATAAATTTG +ACACCTTCTCCTCGTACTCCTTTGGTTAAAGTTACTACAGCTGGAAGATCGGTGTTATCA +ATATCAAACATAGATCCATTTTGAACTGCCGTAGCGAATTCAGAGTTTGCTAATTTCTGA +TAATAACCAGTTGCATGAATAACTTGACCAGTAGGACCTTGCCCGTCATCTAGTGGCATT +TTTCTTTGATCGCCAAATGCGTTATAGATGGCGTTAATATTAGAGTTTAATTTTTCACCG +CCGTCGAATAGGATATCACCAGTAGAGGCGTTTCCGATTTCACCGACGTCAATGAGTTGT +TTAGGTTCTTGTATGTACATATCAAATCCTCAAGTGATATCTGTCTATGCTTATATTTAT +TCACAAAAAAGGGACCCCGAAGGGTCCCTAATTAGAATTCGAAGATGATGTTAATTTCTT +CTGTTTGGTCCATTGCCATTATAATAGGAGCTCTATTTTCCATGTATATCATTTCACCTG +AATGCCTTTCAAGCTCATGTGGGCTATAATAAAGACCTTCAGCTTTAACATTAGGATCAC +TAGGAAGAGCTTTCTTTTCTAATGGATTGACAATTACCGCAATTTGACGAAATCCAGAGT +TTCCAGGTAATGCAGCATCAGGAAAATATACCGAGTCAAAATATGCTTTAAACCTAATGG +TATATGCTTTTACTCTGTAAATGATATTGAAGTCATTTTGTTGCCATGTCAAGTTGTTTT +GAAATCCCCAACGTTGAGGATCTTCTGCTACTTCTTCTGGCCAAGGAACTACAATATACT +CGTTTGTGCAACGGTTGATACTTACATCAGGTGGAATCTCATACAAATATTCCCAAATGT +ATCCATCACCAGTATCAATTTTCTTCTGATCTGAAATAGCATCTCCTCTACCTCTAGGTG +GAGCAAAACTATCTATAGATGAAGTCCATTTACCACCGAGTTTAATACATTCGTCTTTGG +TATTCATTCCTTGAATTGAGCATGTTCCTTGAGAAGGAATATCAAGTACTCTATATACCA +ACCATCCAGCACCAACTTCGGTAGCATTATATGGAGCGCTGTTAACAACTACTATTTCGC +CAATTTGGAAGTTCTTGGGATTAGGATAACGAATATCACCCCAATCCTTACGTGGAACAA +TAGCATCGAGCATTGATGGCATAACTTTAACGGATCCCATCATATTCGTCCACATATCTA +CTACACCCTGTGTATCATCAATTGGATAAGGTGGAGCGAACCCCACCTCGTTTTCATTTT +CAGACCAAGGTATAGAACGACCAAATGTAATGTAAAGAGAATTTTTGTCATCACCATCAC +CTATTGAGTCGTGAAAGTTTTGCATTTTCTCTGTGCGAAATTTGGATGTAACAATTGCGC +GATAAATTACGCTTGAATCATTCATTTATTTTAACCTGTGTTGGATTTTCAGGATCACGA +GGAATACCAACCTTATCAATAAGTCTATCTTCAACTAGATCTCTGAATTGAGAAAATGTA +ACAGCTGATTGATCAAATAATGGACTCATTGCTTTGCGGCGTTCGGTTGGTGTCTGACCT +TGAAAAATTGAATTATCGTTTTCGGAGTCATAATCTGCTGGAACCGGATATGGAATACCA +GTCATTGGCCCTGCAGTGTAAATAACTAATCCAGTAATAGGATCTTTTTCTGGATTTCCA +TTTGAATCCAATGAAGCAACTCTGTCTGGATAAAAAGTAGGAATACCAGCATCCCATTTA +TAATTCTTATATTTATTGATTATAGTCTGAACGTGCTTCATTGTCAGACCCACATTAATG +AACATAGTCAATAATGTGATACCAATAAATCCAAATCCAACTGGATGAACAAACCGAAGC +ACATCATTACGGTAACGAGAAGTTGGAAGATTAGACTTTATCTTCATAACGTAATAAGAA +CGATTTCGGTTGATATAGTCTATGTTGTTTTCAACTAATTCTTTGCCACGAACTCCTCGA +ACAATCATGCCATCAAATGACAACATACGCTCTGATTTAACTTCCTGTCCAACAATAAGA +CGACCCAAAAGGTTGTGTATTGTAACTTTCCACTGAAGCTTACCTTTGGAATAAGATCGT +TCCAAATAAGTCACGTTACATCTTCCGGTTTGAGTATAAATTGTTTGACCAACTAAATTC +TCGTTGATAGAATCAGATTCAATAATGATGTCGTATTCTGTGCCGGAATTTGACTCAATT +TCAATTTGAACATCTTCATTATAAAGAAGCTTAAACAAAAATTGATATGAAGCTTCAACA +CCTTTGGTTGAATAAAAATCTGAACGACGAGCTTCAAAGAATCTTGATACTGCATCGCGT +TTATCTTTATTCAAATAGATATTGCGTTTGTATATTTCAGACCACAGATATTCCCATGAG +TCTTTTTCACGAGGATACTGATTGCGAATAAGATTCAATAAATTGTTGTACTGAGTTCCA +TAACCGTCAGACAGATATTGAATATAAGCTTCACAGAATTTTTCAAAATTACTGTCATCA +AGCAAATATGAATCTGGCATCATTTTATTGATCAAAGGACGTAGATCTGGATCTTGCTCT +CCTGACCTTTCTTCCGGTTTCCATTCCTCATTACGTTCTTGATTTTGGAGATATGCTTTA +AGAAATATCTCAGAAGGCTTCCAAATAATTTTTACTTCATCACGAACTCGGTAATTGAAC +TCATAGTATGAAATTATTTCGCCAGAAGACTTATAAAACAATACTCCTGATGCGTACTTG +GTAAAATAGTTGAATTCTATATTAGGAGATGTTACGGTACAAACTCCTTTATCCCATATT +TCATGTAATACTCTATCTGGAGAACCAGAACCATGGGTATCAATTATTTTAGCATGAGTG +AAACCTGAATAAACTACGACTACTCTATTAGAGTTATCAATCCAGCATCGAGTTCCAGAT +TTACGAGACCAACTGAAGAATGGTTCGGCATAATACTGCATCGGTCCAGGTGTAAAACTT +TCAAATCCAGAATCTTTATTAGATCTGAAAGACATCATGTGATAATGTTTATCTGATAGC +CATTCACGAGGAAATTCGTATTTTACAGCTCCAAGCAGCTGGTACTTATCTGCTGTTTCA +GGATCATCGACAATTTGGTCTTTTTGGAATTTGAAGTTACTGGAAGAAATAAAGATCTCT +TTTCCGTTTGTAGACATGCTAGTATATCCATGCTCAATACGGCGTCTTTCTTCCTCTGTA +TTACCAAATACTCGTTTAAATGTTCCGTCATCCTGAAGAATATAAACTCCTTTGTCTAAA +GAATCTACAACGTTTTCAGGAATAGTCGGATCCATTAACTCTTCTTTAACTTCTCCAGTA +ATAAGAACAAATACTTTTCCATCAACTGAATCCATCTTATAAACAACTGCTTTACTATTA +CCGGTTATAGTCAGAACTTGGTCTTCAAACAGCTTTTCTCCAAATGTAGGAGATAAAGGA +TTTTGATCTATTGGAGCATTTTTAGTTTTAGCAAATCTAACTTTATCTCTAGCAGCTACA +TAAACGTAATTGTCATTGGCGGTAATAGCTTCTGCTTTACGAGATACGTCTCCTGGCAAT +GTCGCATACGTTCCAAATATTTCTACATCAAATCCAAGCTTTAATTGATCTCCAATTTTA +GCAAATGTGACGTCCTGTGAAGAAAAACGAATTTCATCAGAAGACCAGCGGACATCATTA +CTTTTGCGTCCGTAAAACAATTTATCATAACCAAGCAAATATGTAGTATAATCGGTTTGA +TAATATGGTGTCCGTGAAACTGGATTACCTGCCCTGTCAGAAAGAAGTTTGACGGCTTTC +CAAGTTTGTCCTTTGTCGTTTGACACTTTTACTACAGGCTGAAAGCGCTCAAATAGGTAA +AGAACACCCTCAGATTCCATAAGCATGACTCGATTTACGTCTTTACATACCGCTGTAATA +GATCCTTGAATTTCGTGATATTCTTCTTCTTTTAATACAAAATTAGAAACTGAAGAAACA +ATCTCATAGTTTGGGCTAAATTGAAATGATTCATTCATCAAAGCCGCATAAATGGTATCA +CGGTTAAAATTGACGTAGCTTTGATTATTTTTATTGAACTTTTCCTCAATAAACTTTTTA +GCTAACGTCATTTCAAGCATAGTTTCAAATGTATAAGCGTTTTCGCTAAACATTTGAAAC +TCTTCAGTTTGAACCCAATTTGAAGGATCAAATCCTTGCGCTGCGGTTTGAACGCGCATT +GTGTAATAACGATCAGGAGACACAAATGTATCTTCAAAATATTCGTTAGTAGCAACGTAT +CCTAATTTACGCCACTGGTAATTTGATGGGGGAATAATATTCCCCATCATATCTCTTGTT +TCTGCGAGTTCAACAAAATAGTAAAAGTTAGCACCAACATCATCCCATTTAATATAAACA +TGGTTGGCTGATAATTTAACAATTCTGAGACTCGTTACTGATGGTGCTTTTACTGTCATT +GCGCAATAGGCTCCAATGTTATTGTTGTATACTGTGGACGTAGATCATTCTCAAACACGA +TTAATGAACCGTCTCTAGTATAAATGTTATCTTCCACTGGATCTGCATAAAGTTCGATTG +TTTGAACTTCAAACTGATCAGATGAAAGATCTATAGCTGAAATATTCCAATACGTAGTAT +CTCCAGGATAATTAATTTCACCTATAACGAAATAAAGTGTTTGATCACCAATAATTTCAC +GATCGAAATCTGTTCCTGGATATGGTTGAATATCGGTATTTTCTTGCACATCACCCGGTT +TAAATGGCCCAATTACTACTTTACCTTTGCCATCTTCGTTTCGATCTGTACCAAGAATCC +GAACATTATAATCTCCATTTTTTCCATGAAATGCAAAAGCGTTGGATTTTAATGAACGAT +TAGTCATCTGATTATAATATTTAATTCCAGATTCTGGAGTCTGGAAAAAGTTCTGAATTT +CACGAACCATTTGAATAGTAGCTGATGAACCAATAATACTGTGATCGGCATCATCAATAT +AAGTCATCATTTTAGATTTAGCAAATGATGCGTTAAAAATTTCAACTTCTTCAATGTAGT +AACGATCGATTTGATCCAAAATTTTACCACGTAACCATTGGTCAGATTCTTGCAATTTAT +TCAAAGCGTAAGATACTTTGATATTATGACGAAGGAACAAATAGTTAGGTGAAATAACTG +ACGGAGTTATTGGTGCTAAGTTATACGGCTTCAAGTAATTTTGAATATCTTCACGCTGTA +CAGATGTTAAGTACAATCCAGATTTAGGCTTAATTGCAATAAATGCATAACCAGGTTTAT +AGTTATCAGTGAATGTTTGAACAGCTTGAACGATTGAACCAAATCTTTCTGAAACAAATG +TGTCATAATCGCTAGCTGTCACGCATCGAGCTTGAGTTTCACGCTTGATAGTTCCAAGTT +CACGGATTCGTTCAATATCTTCTGGATCACCGCCGCCATCAGCACCTACATAATCTTTTG +AATTATCTGGGTTTTCAAAAATACGTTGAACTGTGATATAAGTTAAAGTATCTGCATAAG +AAAACTCAGTTGCACCGTTAGCAGCTTCACCGTCAGTTCTGATGTATTCTATTACAATAG +TAGAATCTTGAATAGGTTTTAAACCACCGATATAGTTTGATTCAAGTACTCCACCCGCAA +CAGAAGTCGATTGTTCACCTTCACCAAAGAATATTTCGGTATGCCCATCAACAGTTTCAC +GCATATAATAAATCGTAGAAGTAGAACCAGCATGAACCATTGAACGGCGAGTCCAGTTTG +TCCATTCTGTTCCATTAACAGTTAATTTCACTTCATTACGATCTATATTCGGATCACGAA +TAAGAATCGGCTTCATTTTATCGTAACGCAATTCAGTACGAACGATTCGTCCCTGCACTA +ATTTAACTCGTGGAAAATATTGATTATTCGCATCTTTTACTGCTGTAACTTCTTCTGTCG +TAACAAAAGAATATGGATCAGCCGAAGTATCACGAGCATATGCTAAAAATCTAGTTCCAC +GCGGAATGCGAAGCATATATGGGTTTAATGCGTGAGTACACTCCAACATGATTTCTGTTT +GAGCTGCTGATCTAGATGAAGGAAAATATCCATTATCTTGAGCGCCTTGTACAACAGAAG +AACGAAGGTTTGCAGTCCGCATAAATGATTCATAAACTGAACTGTTACCAAACTGTTGAA +TATAAAGAGTGTTGTATGCTAAAAGATCGAGCAATACGTTTAAACGAGAACCCTCAAAAT +CATAATCTAAGAATTCGTTTTGTCCGCGCAACCAATCAAGTAAATCTCTCTTAATTTCAT +TGAATGTTCCTCCAACGAAAATATCTGGAATGGCATTGGCTGTGCGTGTTAATTGATAGT +TAATTGGATCTGCCATTATTTTATGAACACCTTATATGATGATTGCGAGACTGTGTCTCC +GCATGAAATTGGGTCAGCCATTTGAACAGCTTTCTTTCCAGTAACAAATACTTTGGAAGT +GCGGGGTTGAACTGCCCCGCCATGTGTATCATAAGGCTTTATTGTTTTAGTGTGAGGAGT +TATTTGATCTCCATCTACCAATACAGGAATTCCACCAGTAAAAACCTTACTTTGTGTAGA +ATTAACTTCGGTCGGTGGATAAGCATCGTGACCGGAAGTAACGCATTTATCGTATGATAG +TCCAGCCATCATCCTCTCCTATAAACATATTCTCTTAATTGATTTCCCCATTTTGACCAG +TTACCGTAAACAAGCTGTGTGTATGTTTTAGTGATTTTCTTTTCTACCGGAGGTGTTGTA +GTACCACCTCCTGATTCACCAGAACCACCAGAAGTATCTTCTTCCTGGAAGTCATATATA +CATTCAACTGTGTATGTAAAGGTCCTCTCAAGTTTAGAGGGGGCTCTCCAGAGATAAAGA +TCAACATCTTCTGCGGGAGGTAATTCATCCCAACTTGATGCAGATTTTCTTTCATCTCCT +TCTCTATAAAGAAGTGAATCACTACCAACCGAAAATACACTTTCGTAAGTTCCATGATAT +CTTGCTCCAGAAACATTTATTCCTGGAGTGGGCTGGTAATCAATTATATTTATAGACTTC +AAAGTCTCAGTTAACGGGTCAAGTTGAGCAGTAAAAACCACATCAACTTGACCTCCCTCT +TGGATATCGCCAAGGTCATTATTAGCTGGAAGTATCTGCGCCATTAGCCTAGATCTATAC +GTGAACCATCAATAGTGTATTGTCCAGAAGCTTTAGAACTCATCGTGCTCATTGTTTCTG +ACCAATTACCAGCAACAGTCATGTCTACATTACCAAGGACAGACCATTTGACGTTTCCAT +TTACAGTGTAATCGTGATTGCCTTGAACTTCAGTTTTAGCATCACCCTCGACTAAGATGT +CAGCGTTTCCTTGAACTACAACTTTTATATTTCCTTTAACTAAAAGAGTCCCATTGCCTT +CTACTGTCTTGGTTTCATCACCGCGAATAAAAATGGTATTTGACCCATCAATTTGATGAA +GTTTGTCAGCCATGTTATAGTAAATTTCATTCGCGCCGACGTTAACATTTTTGTCACCAG +AAACAAGAAAATTACCGTCACCGTTTGTTATATCGTAAAGATCATTTACAGTTTTACGAG +TGCGTCGGCCGTCAGGAGCAACTTCTTCATAAGTGCCGGTTGGGTGAATTAGGCGATAAC +GTTCATACCCTGGTGTATCATCGAATTCTTGGATATGTCCAGACTCAGTTTCCATCGTAT +GCACATAAGGATATTGTCCTTCATATGAAGATACAGGTTCTTTAAAAAGAATCCTTGAGT +CATTCGGAATTGGTGGTTCACTCGGATCAGAGCTTGTGCGAACTACCGCTGCCGCAGATA +AATCTTTTTTGCCTGATGTGCTTACTGGAATACCATATGACTCCATATTCCCTGTCAATA +TGATCATTGATACTCGAGATGCACGTCCTTTTGTTTGTTGGAACCACAATGAATCTCGTG +CAGAGTCATATGCTTTTTTCCAATCACCGACGAACATGGCTTGAAGCATATTATTGAACT +TAGCAACACCGCCTACACCCATTTGAAATGACATGTTCTCAAGAGCCATTTTTCTTGATT +TGTTCATCTTAGCATAAACTGGGCCAACTTTCGCGTTAGTTTTAATATCCCTCTGAACTG +CGTCTAAGTCTTCTTTAAATAAAGCGGATGCTTCATCCATTGAAATAGAACCAGGATTTC +CCTTTACTTCTCGTCCAACCTGCTTAGATAGAATCTTGTTGATTTGATTCATATCTCGTA +TTTGTTGCATAACAATAAGGTGACCGATACCGATTGTTGGATATCCTTCGGAATCCCAAT +AAACTTTCAAACGAAGACCTTCGTCACGACGAAGCATAGCTTCGATGGTATAGTCGGGAT +TGTTATCTTCTGGAATATCAGCTAAATCCATATCATCAGGATTTATACCAGTGTCAAGGT +TAGCATCTTGAATTATGTTACTGGTTGAATCATATCCTACTTCTCCGCCTTGGTTCAATA +CGTTAGTATCATTTCCAAGATATCGAGGATATTGGCCGGTTGGATCAGAGAATCCTTCTG +TGGTGTTAGGGCGAACTCTAGAATTAGCTCCATAGGTACCCATCACTAAACCGTTTGTTC +GATATTTGTCCAACCAATGACCATATACATGAGTGCCTTCAACCGGGCCAGTAACTGATC +CACCAATTCCAGATATAGAGGCAGAAGTTATTGGCTGAAGAACACTCATCCATGGAAGAT +CTTCAGTTGGGATACCAGAAATAGATCCTTGAGTCTTTTGAAATGGATGTAATCCAATTA +CTCGAACTCGAACTCTACCTTGCTTAAGAGGGTCCATTCGATCTTCTACTACACCAACGA +ACCATTCTACTGCATTACTAATCATATCCATTATGCTATCTCCATCTCACGAATTAAATC +GCTTAAGAATGAATCTATGTCAGCAGGAGAAATTATCTTGATTACGCGCTTTTGCTCATT +ATCTAGCACCGACGCTTCATAAGTATCAACTGCTGCTAAAGCGCCATTATATTGCGGGTA +TTTCATTTCCAAGTCACCTTTATCATACCAAGTCCCTGGATTATTGGGGTCTTCAATAAG +GTTGAAATATTTTTCGCCTCGATCATCGACATGATAAAGTACTTGGTTTCCACCAACCTT +TTCATATCTTTGATCAGCAGCTTGATATGCAGCTTCTTGTGATGTTATCCATCCATAGTA +AGGGTCGTAGTTTTCATTACAGAAAAGCAACACCCAATAAAGTTGTGTGTTTCCATAGAT +CTGATAAGCTAATTCTTCTGGACGAGGACTACCTTGAATGTAATAAGTACGAAGACGATA +TCCAGAAGCTACTCGTTTGAAGTATGCTTTATAATTCCTGAAAATATCAGTCATTTGAAT +AGTCTTAGCATTTTTGTCCACTGTTTTAGCTTTGTAATCTATGGGATCAAAAAAGGAAAA +TATCATGGTGCCTCCCATTTATAAATAATACTAATATTTATTAGAGGAGAACAATATGGC +TTACTCCGGAAAATTTATGCCAGTTAATCACCAAAAATACCGTGGTGATATAAGAAAAAT +TACATACAGGTCTTCATGGGAAAGTTGGTTCATGAAGTGGCTTGACACAAATCCTCAAGT +AGTTAAATGGAATAGTGAAGAAGTGGTAATTCCATATTTTAGCAATGCTGATGGAAAGAA +ACGTAGATATTTTATGGACTTCTGGGTTAAATTTGACACAGGTCAAGAATTTTTCTTTGA +AGTTAAACCTAAGAAAGAAACTATGCCTCCTCCAAAGCCAGCTAAATTGACTACTGCTGC +CAAGAAAAAATATATCGATGCTCTTTACACGTTTTCAGTGAATTCTGATAAATGGAAAGC +TGCCTTGGCGGTTGCAGAAAAAAATAATATCAACTTTCGTTTGATTACGGAAGATGGACT +TAAGCGTTTAGGGTGGAAAGGCTAATGGCTATTTTTGAATACATCAATGAAGGCGTTGCA +CCAGCTCCAAAGCCGGTTTCACGAAATGAAAAGAAATGGGTTGAACTTGGATTAGAGTTC +AAGAAGGCAAAAGCTAAAGGTGCTACTGCTAAATCATTTGCGGAAGAAAAAGAAATTCCT +TATGCAACATTCACCAAGGCAATGTCCCGGTACGCATCTAAAATCAAGTTCGCGGAAAAG +ATCGCGAAGCTTGAGGGTAAACCAACTCATAAGCTTTCTAAACAAGAACGCCAACTGATA +ATGATCAATAGCTTTAGATCATCGATCAGAGATAAAATTAAAAACGAAGGCGCAGCGGTG +AACAACAAGTCAGCTAAATGGTTCGCTGACACTATTAAAAAGAACATCCGTGGTCATTCA +GTATCTAAACCTACTCCAGGTAGATTATATGCTTATATGTATGATGCTAAGCATAAAGAT +ACACTTCCATATTGGGATAGATTTCCACTGATTGTTTATCTTGGACTTGGTAAGCAGGGA +TCAACTACTTTAATGTATGGCCTGAACTTACACTACATTCCGCCTAAAGCACGTCAGCAG +TTTTTAGAAGAACTGCTAAAGCAGTATGCAAATACACCTACAATTACTAATAAAACAAAA +TTGAAAATTAACTGGAGTCAAGTGAAAGGATTTGCTGGTGCTGACAAGATGATCAAGGCG +TATTTGCCTGGTCATATAAAGGGGAGTTTGATAGAGATCAAGCCCAGTGACTGGGCAAAC +GTAGTTTTACTTCCTCTTCAACAGTTCATATCGAAAGGCAAACGTTTCTCTTCAAATACT +GTCTGGAAATCTTAATTCTATTTCCATCTTCCGGTTGATTAGATGTTATTGATTGACCGG +AAGGACAAATTCATTATAACACTCCCTAGAGATAAGCATATGAATACACAACAGATTTTT +AACCAGACCAATATCACCAACTTCTTGGTGGACATCCCTGATGTCGGTCTGACCAAAGGT +TTTACGCTTAATGCGCAATCAGCTAATATACCAGGCATACGGATTCCCATCACTGATGTG +CCATCGGGCACAATGGGGTTAGGACGAGCAAATCTTCCAGGCTCAACGTTTGAGTTTGAC +CCATTGATGATACGCTTTTTGGTTGATGAAGAACTTGAGTCTTGGCTTCAAATGTATAGA +TGGATGATAGGCATCAATAACTATCAAACTGGTGATAACTTTGCTTGGAGAGATGGTTCA +AGTCCTGAACACGTTTCAGTTCACATCTTAGATAACTCTAAGACTCGCATCGTGCTGTCT +ATCCATTATTACGGATGCTGGATTTCTGATCTTGGCGAAGTGGAGTTCAACACAACAGAA +GATACTGACCCGGCTATCACATGCCAAGCTATTCTTCCTTACAAGTATTTGCAAATAGAA +AAAGATGGTAAAATAATTACTACAAGACAAAACATGACTGAAGCAGCAAACAGTCGTATC +GGAATGCATCCTTCTATGAGGAAATAATGAAACTATTCTTTTTGATTGGCAAAAAACGTA +GTGGTAAAGATACAACAGCCGACTACATCATGGATAACTATAACGCGTTTAAGCATCAGC +TTGCGGGTCCAATTAAAGATGCTTTGACTTTCGGATATCAGTCTGCGGTAATGGCTTATG +ATTGCAACCGAGTTCATCCCATTCTCACTCGTAAAGAATGGGAAGGTGAAGGTTACGACC +GTGAAACGAAACTCAATTTAACTACACAACAAGTATACACAATCATGGAGCATTCTATGT +GGTACTTGAACAATGAATTGAAAATTAAAGGTGTACGTTTCAATTCAGAAGGTGAAATGA +GTGCTGGAGCATTCGGAACTATTAAGCGGGTCATAAATAGTATTGAAGAAGACTGGTCAG +TACGACGTCTCATGCAGACCCTTGGGACTGATATCATGGTCAATCACTTCGATCGCATGT +ACTGGGTTAAATGGTTCTCTGTTGTCTATATGGATTCATTCGATAAAACGTTTGAATACT +TCATAGTTCCAGACACTCGTCAAGACCATGAACTTGATGCTGCTCGGGCGATGGGTGCTA +CAGTAATTCATGTAGTTCGTCCGAATAACGAAAGTTCGAAGGTTGACACACATATCACAG +AAGCTGGATTGCCTATTCGTGAAGGCGATACAGTAATCATTAATGACGGTTCTCTTGAAG +AACTTTATGCTAAAATTGAAAAGGCTATCAAATGACAGACAAAATTAAACAACTCGAAAC +CGAAATCGTTTACCTAAAAGCTCGAGCTTTTGAATTGACCGAAGGTAAAGCACAGCTCGA +ATCTCATATCCAGCAATTGTCTGGAGTTCTGTCTAAAGTAACAGAGCTAGTTGGTATTGT +AAGCGAAGATGGTTCTGTTAAGGTTGAAGAATTGTACGCAGCTATCGAAGCTATGCTCCC +TAAGCAGGGCGAAGCTGAAGTCTAATGAAATTCCAGGACTTTAGTTCTGGACTCTACGTT +GCAGCAAAATTCAGTGAACAAACACTAGATGAAATTGAGAACCTTCAGAGAGATTTGAAG +GTTCCTAATCCAGTTCCTCGGCATAAAATCCATTCAACTATTTGTTACTCCAGGGTAAAT +GTTCCTTATGTAGTTTCAACTGGAAGTTTTGAAGTTGCCACTAAAGGACACCTGGAAATA +TGGGATACACAAGATGGTAGAACTCTGGTTCTTGTTTTAGATTCAGAGTACTTAAAGTTC +CGTCATCAATACGCCAGGGCATTAGGAGCTACTCATGATTTTGATGATTACACTCCACAT +ATTACACTGAGCTACAATGTAGGACCTGCTCATTTTGAGGGTGAGGTCCAAGTCCCTGTT +GTTTTAGACAGGGAATATAAAGAACCACTTAAAACCAATTGGGCGGAAGAGTTGAAATGA +AGTTTACCGATTTCTTAAATGAAGCGATGGAACCTGCAACCTTTCGTCATGTTGCAGTCA +CATGCACCGAAGAAGAATTTAAGCAATCTGTTGCTAAGATCAACTGGTTATCAGGCGAGC +TTGTAGATGGAATCGCTAAGTTTTTCGGACCAGAATGTGACATCGATGATTGGATAAAAT +CAAACAGGAATTTTGTAAAATGAAAACATATCAGGAATTTATTGCTGAAGCCAAGGGGCA +TGATGAACTGCCTATAGTTACTAAAACTATAGACGGAACATTAGCGGATTTCAAATCTCT +GCCATCCGACAAACTTCAAAAGCTAACTGCGTCAGTTTCTGAGACTAAAGGTAAAGTGAC +CTTCTCTGCACGAGGTTCTGCGAATCTCAAGAAACTTCTGAAAGCAGTAGGAGCTTAACC +GTTTACTTTCCTTGAGGGCTATGATACTATAGCCCTATCAACAACAAGGAGAATAAAATG +AAACGCTGTGAAATCATTGGAAACATCACTACCGTAGTAACTCTAGGACTTCTCGGAACT +GCCATCGTCGGATGGCCATTCCTTAAAGCTCCTGAACTCATCACCATTATGCTATCAGCA +ATTAGCACTGGTGCTATCTCATTCGTCATGGATAAAATTGCAAATGAAAAATCTCGATAA +ATTCAAGCAGTACATGAAAGATCACTACGACTTCGATAAAGAAGAGTTGACTGTTTGGGA +CTATGCTGCTGGAGGCGTAGCAGTTGGTCTGGTACTCGCAATGATTAAAAGTATCACGCT +AGTTGTTGGTGTTGCACTGTTTATTGGGCACATGACCTTTAAGAAGTAATTTGCTTAAAG +AGATCAGTGTTATTATTAATCTATCAACTAAACATAAGGATTTACATGAAACGCATTGTA +CTGAATATTGATCACGACGCTACCTTTGAACACTGCCATGGCAAAGTTCCTCCTGTTCTG +CGTTTTGTAGTCCGCACTGATAAAATTTGCGGTTGCCAAGAAATGCTAGATGGTTCAGTT +AAGGTAACAATTGATCAAGGCGAAACTCGTCCGATTATGTCAGTAGTTGTACTGGAAAGT +TTTGAACAAGTTTCATCTGCGATGATGGAGTGAATCCATGAGTGAGAAAAATAAAATGAC +AATAACTCAAGCTTTGAATGAACTGCTTTCGAAAATCGGAAGCATTAAAGCTCATGATTA +TTACTCCGCTTCAAGCGGTATAAATATTCCTATGATCCTGGTAACTCCAAAGTTTACCAA +GACTGATTCAATGGGAAATGAAATTTTTCCAGGAGATATCGTAGCGTTCACTACAACTGG +TAAATCTCCTGGTTCGCAGGTTGGTATTCTGCTTGGTTTCACTGAACAGGGTTATCGAGT +ACTTCCATTCAATACTTCGAGAAATAATCCTGAATGGCGAACTTTAACTCGCGGTATTAA +TACACCATATGATGTATGGTTAGTGAAATCTAAAGGATCTGTCATAATTTAAGAATTTGG +GGAGTTATTTCCGTAGAGGTAGCGGGGCAGACTGTAAATCTGTTGCTCATAGAGCTCGGG +TGGTTCGACTCCATCACTCCCCACCAATTCAGGGTTGCTAGCTCAGTTGGTTAGAGCACC +GGACTTTTAATCCGGGTGTCCGAAGTTCGAATCTTCGGCAACCCACCAAATAAGGTCAGT +TGGCTGAGAGGGTAAGCGGCGGACTGTTAATCCGCGTCGGTAACGACAAGGCAGGTTCGA +TACCTGCACTGACCGCCAAATTAATATTCTTTCTAGACGATGCTGCTTGCAGCCTCCGGA +TTGGGAACGCGAACCATTCTGGACCAGTCGTGACAACGCATCTTATTGCAAGAGAGAATA +TTGATATGGTGAATAAATAATAAAAATAACAAAGAGGATTACCATGAAAACTTACAAAGA +ATTTATTGCTGAAAGCAAATGGTCTGGATTTAATTTCTCTTATGAGATTAAAGGTGACAT +GAAAGCTTTCAAGAAAGTATCAAATGCTGACCTTCAGCGATATCAAGCTGGCATCGAAAA +AACTAAAGATGGACGAGTTATTGTTCATTCTGTTGAAAATAATGGTGCCGAAATGGTCGT +TTATAAAGCGTTTGGTACAGACTCTGAGCAAATGAGAGATCTAAAAGCTCAGCAACGCAA +GCAATACTAATAGAAATCCATGAGACGTCATGGAAAGTCGTCCGAAACTTGTTCCCTGCT +ATGTGGTCCCTCACTGAAACCTACATATTACAGCAAGAAGGGCGCACCTAATTTGGACCT +ATAGTTTCAGCGGTTAAAATACTCGCCTGTCACGTGAGAGTCACGGGTTCGAATCCCGTT +AGGTCCGCCAAATACGAGGCAGTTCTTGAAGATGAGTTAGAGTCCTGTAAGTAAATGCCG +AGGACGAAGTAAGTTGTTCCCACGGGATAAGCTCTATATTCGGAAGATTATACCTTTAGC +GTGTTCTACATCGAGCTACTTGTTTGTACTTCAAGAATCCGGATAGATGCGGGTTAACTT +CAGTTGGTAGAATGACGGGTTCATATCCCGTTACGCGATGGTTCGAGTCCATCACCCGCC +TCCAAACAATTTGGGGTATAGCCAAGTTGGTACGGCAGTAGATTTTGATTCTACGATTCC +CTGGTTCGAGTCCAGGTACCCCAGCCAAATTAATATTCTTTCTAGACGATAAACGGAAGG +CGCGCTCTGTGAAATCGTGACAAACGTTAGAGGCACCTGGAACTGACCGGGGTCCAGTGA +GAGAATATTGATGTGGCCGTAGTTCAGTTGGTAGAACTCGAGATTGTGATTCTCGTAGTC +ATGGGTTCAACTCCCATCGGTCACCCCAATTCGGAAGCGTGGTAGAGTTGGTTTATTACA +CCGGTCTTGAAAACCGGAGGCCGTAGTGATACGGTCCGTGGGTTCGAATCCCACCGCTTC +CTCCATTTCATAAGCTATTTAACTTCAGTTCAAGTATTCTATACAGTTTACATGGTATAA +ATATCAGTATATAATAACCTCAATTCAATAAAAGGATATACCATGAAAAGTTTAAAAGAA +TTTTTAGCAGAAAACACTGCTCAGACTTTAAACGAAGATGCAGACGATGCACTCTTTAAT +CAGATTTCTAAAGCTATGGACATTATTCGTGTCGGCAGTAAGCTCCGTAAAGCTATGGAA +GCTTATGAAAAATCTGGTGATAAAGACGCTTTAGCCGCTATCAAGAAAGCTAAATCAGCA +CTTGATGCTGCAAGTGAAGCTCTGGTAAATGTTTCTTCTTCACTGCCTAAATAATTCGGT +TAGTTGAACCCCTCTGCATCCATCGTATAGCGGCTATTATGACTGGCTTCCACCCAGTAG +ATGAGAGTTCGATTCTCTCTGGATGCTCCAAATTAATCAGTACGTAGCGCAGTCTGGTAG +CGTAGGAGCTTTGGATGCTTCGGGTCGTAGGTTCGAATCCTACCGTACTGACCAATTTAG +TTCTTGTAGCTCAGTGGATAGAGCAACGGTCTTCTAAACCGTGGGTCGTTGGTTCAAATC +CAACCAGGAACACCATTTCGGCCCCTTAGCTCAGTTGGTTAGAGCAGTCGACTCATAATC +GATTGGTCGCTGGTTCGAGTCCAGCAGGGGCCACCAAACAAATGAGGAAAATATTATGCT +TTACTACGAAACCACCTATGACTTAGATAAGTCCCCTCCAAAGCGTATGTCACGCAAATC +ATTCCAAGAACGTTATGGTAGACCATTTAGCTTTAAAGAATTTAATGTAAAATTTGAACA +TGCTCATGTAACAGACGATCCTTACTATAGTGATGTTACGGGTGAAACAAAACCTTTAGT +AGAAGGTTCCCTTGAAGATTTAATTATTGCAGATTATGACGCAGGACAAGAAATATGACA +ACTTTTTACGGCCGTGGTGCTTCAATCATTAAGATCGACAAAGATTATCAAGGCGACTAT +TATAAATTTCGCCCACACCGAGATTTCATGCTAAATTCAGATTACGTATTCACATACGAT +GAAAAGTCTCGCACTTGGAAGTTTTTGAAATATCGTTTTACTATGGAAGGTTTCAACTCT +GAGTTTTTTAACGTAGAGCAACTTCTTAGTTATGTGTATAAAGATATTTGTCTTTGCTCT +AGATTCCATCCAGCTAAAGATACATTCGGAATTATCAATGTAATTCGTAGCGAAAAAGCT +GCTCTTAAAATGATGAAAATGTGGAGAAATTATAATGCTTAATTTTGGTCAAGTAATTCC +CGCTGGTTACGCGATTCAAATTGAAAGTTGGGAAAACGATGGTGATGATTATAACCGTCA +GTATTTCTATGGTCTAACAAAAGCCGATATTGAGCAATTCGCTCATGTTCTGCCTTTATT +TAAAAGCTGTCATGGCTGGAAAGAATCTGGCTTAGGGAACAAAGAATTCTCTGAAGTTGC +CGAAGAACTTGGATATTCCTATGCTGAGCTTCTCCGTGATGGTAAGATCAATCTAGAATT +CGCCAATGATTATCTTGGGTTTAATCCTGATAAAGTTGACTTGCTTTTCCAAGATTGGGA +AGAAGAATTTGAAGAAATGATTGCCGAATGGGCAGAAAAACGCATCGGTCTTCCTCGCAT +TCAAGAAATTCTTGGGTATTCGGATTCTTATGATGATTTTGTCCGAGTATTTGAAAGCGC +TAAAGTTATGTTCTTTGAAAAAGAACTCCGTATTCCTGCTGTTAAATTTGAGAAATTGCT +ATGAACATTAAACAAGACTTTAAGAAAGCTGTCCGTCACTGGATTCGCACTCGTTTCCAT +TCAATTAAAGCGTCAACCAACGGCACGACATACGATCGTTATGATACAAAATATGGATTG +CATAAATGCTATACACAGTCAAAATAAATTCCGGCTATGTTAGCGGTGAAATCTTTAACC +TAGCCATCACCCATAAACAAATTATTCTTGATTTGTTGGCTATTAAAGATGACGAAGACC +GTTTTATAGAACTTCGCATTTATTTTAATAGCATTAACCCCGGGCTTCAAGCTTACTTTA +TACGTAAGATTGGCGCCGTTCAAATAGACGAATACTCGTATGAGGCCTAATGTGGATATT +TCATCTGGCTGCGGATACCCTGCTTCTTCGTTAAGCAATTTCGCTCCTCATGGATTTGAA +ATTGATGGTGTACAATGTGCTTCAATGGAGGGGTTCCTGCAATCCCTCAAATTCTCATCA +ATTGAGATGCAAGAACATGTATGCACATTAGTTGGAAAATCTGCGAAGTTCAAGGGTAAG +AAAAAACGTTGGTGGCCTACTCAAACTCTTTATTGGAAAGGCGTACCAATTCATCGTTCA +TCAGAAGCTTATCAAAATCTTTTGACAAAAGCATATGATGCATTAGCTTTAAATGAAGGA +TTCCGAAGAGCATTATTAGCAACTCGTAATGCTACCTTAACCCACTCCATGGGCAAGAAT +AAAGAATCTGAAACAGTGTTGACAGAACGAGAATTCTGTGGACAACTACATCGTGTACGT +GAATTGATGAAATGATTTTTGCCCTGGGTATCTTAGTGATGTCCAGGGCATTTTTGTTTG +TTCCTAATATGATTTATCATCCTTCCCAAGAAAGTTCCTCTCAACCGTTCTGGTGAATTC +ATTTAGCCGTTTACATTCATTAAAGACTATGGTATAGTATAAACTCAATCAACAACAGAG +ATCATTATGAAACGTACAAAAGTGATTAACAAGTCTCGTTTTCGAAAGGGATTTGTTTTA +GCATCAGTGGTAGCTTCGACATTTGCTTTATCTGGGTGTGAAGTTGCTGATCAAACAGTT +AAGATGTATCAAACTGTAGATGAGTGCTCTACTATTGAAGCAACGGAGTATCAGTCCGCT +GAACAATGTAAAGCGTCGTTTGAAGCTGCTAAGGCTGAGCATGAGAAATCCGCCCCAAAG +TTCCAGGGCTACAACGACTGTTCAGCGGAGTTTGGTAATTGCAATTATGATTCTAGTACT +GGATCATTTATGCCTGCTATGATGGGCTTTATGGTTGGTCAGATGATGGGCAACATGCAA +GCGAACTCAAACTTCCGAGCATCTCAGCCTATGTACAATAATCCGAATGGTGGTTACAAA +GACATCTCGGGTAAAAGTTACTCAAACATTAAGCCAGGTAAACCTTTCCAGGTTACTCGT +TCCGCAATGAGTTCAAAACCTGCTTCGACAATACAAGCCCGTTCTACTACTTCATCTCGC +GGTGGATTTGGTGGATCGGCACGTTCTGGAAGTTTTGGCGGATAATTTTTCTTCAGCCGT +TTACATTGGTATGAGATGATGATACTATTACCTCATACCAAACAAACGGTAAAACTTAAA +TCGGAGAATAAAATCATGGCTAAATTCAATACTGTTACTATCGTTGAAATTACTGATAAC +TTTGGTGAATTTGATCGTTTCCGCGCAGTTCTTAACAGAGGTGAACATGAGTATCATGTA +ACAGCTATAGTTGAATCTAAGAAAATGGCTTCAATCGTAGAATATGTTAATTCTCATTGG +CCTACAGCTGAAGTTATTTTTGGCGAAAAGATTTAATGCTTTAAAACAAGAGTATAAAAT +AATCTCTCTTACACTGAAAGGAAATACTATGTCTATTCTGAAAAAACTGGTTGAATTCAT +TCGTTCTAAACTGGGTACCTTCGTTGCTCGTAACACTACAATCGAAGATCAGTACACTCG +TGCGGCGAACTCGATCATTGATGAAATTCACAAACTGCGCACTCGCTACGTAACTGCTGA +ACGTGAAATCAAAGCAAAACGCGATCTGGCGACTGAAAATGATGCTAAGGCTGAATCTAA +AGAAAAAGAAATTCGTCACATCATGGCCAATAACCCGGCACAAGATGTAACAACCCTGGC +TAAACTCGGTCTTCTGTATCGTCGAACCGCCGCAGCTCTTCGTGGTAAAGCTCAAGAGCT +GGAAGAAATGAAACTGGAAATTACAAAAACAGTAGTTGCTCTGGACGATCAGCGCCAAGA +TCTGGCTGTGAAACTTGAGTACATCCGCGAAACTCAGAAAGCTAACTCTATGGGTCTGGA +CACTGGCGCCGACATCATCGAATCTGCTGAACTGGCTAAAGTAGATGTTCAAACGATCAT +CTCTCGTATCGATACCTTCAATACTACACCTGCTGGCGTTGAGACAACCTCTGCTGATGT +AGCGGAATATCTGGAATCTCTGAAGTAATATAAACGGGGCCTTCGGGCCCCAATCGGATA +ATAGAGGAGAATATCATGTTTTATGAAATCGGTGCAAAAAATAAAGCTGATCCCAAAAAA +TTAGAAGAAAAAGACGTTGTTCCAGTAATACATGAACGTATTCGTAGACAACTTCTGAAG +GGCGGAGTTCCAGGGCATTTAATTGATAAGCTAGCGCCTCATACATTTCCAATGAGCCTC +GATTATGAGATTCATTCAGATCAATTTAAACGAATTAAATGCGGTCATCTTTTAGTTCGT +ACATGGTCTGATTTGCGAAACTTTCTTAGCACTCTTCAATCTGAATGCTATTCTATTCGT +TATACTGGAGTGTATGGTTTTTATTATAGCGACATTTCAATCAATAACTTGGACCGTGAA +ATCTCCATCCCGGGAATGACTTTATTTGAAGCAGCAGGTCACTATGCATTTAACTTAGAA +TGTAAAACTTCTACATTCGGTAAACCATTTCGTATTTCGGTTCTTATTGACCGTGAACGC +GCAACTAAACGCAACGGTTTTGAAAAGAAATATGACCTTGAATGCTACTCATCTGAACGA +GCAGAACGTGTCAACGCTATAGCTAAATTTATTTCTAACTATAATAAAACAGATGCGGTT +GATACGAATCTTGACGATTTCATTAATTTATGTCGTGATGAATTAAAGGTGAAAGAATGA +AATTAGCAAATTACTGTTTTGGACATGGTCTTGCTCGTTATGATGTATGGCCAAAATCTC +CGAGTTATTCTTTAGGTTGGTGGTTTCATTCTTTTATGATTGGATTGATGCTAATTTTTA +TCTCTCTCCCAGCTTCAATGATGTATGTAAAAGAAACTGACCGAGTTATATCTGATATAA +ATGTTGTGTTAATTGCTCTTGTAACCGCTGTTGTAACGGTATTTGTTCCGCATATTACTT +ATTTGACTTATTTTTACTTAAAACGATTGAATTATAATGTCCAAGTATTCGTCCATAACT +TGGACTATAAGAAAGAAAAGAAACGAGAGGCTCTTGAAGCTGAACTACAAGCAGCTAGAG +TAGCTCAGAATAAAAAGACTCGTGAGGCTATGGAATTCGTTATGGAGATGCGGAAATGAA +ATACTACCGTCCTGGACCATCTTATTTGTACCAAGATTCTGAAGATGGAATCGCATTGGT +ATTAATAGCATGTGCTATAGTTTCTATGATATCATCTGTTGTGGTAATGTTTATTTGGGG +CGCTATGCATAACATGGATTCTCCGAATTCAGAAACCGTTGAATGGATGGTAAAGGGATT +TGTAATATCATTCATTGCTTCATATATTTTTGGTAATGGTGAACGTCATTTGAATAATAT +TATGGCTGCAAGGAAGTGGCGCAAAGAAGAAAGGGAAAGACTCGCTGATATCACAGCGAA +AAATAAAGTAAACGAGCATAAACAACTGTTGAAATTTATTGAGAATTGTAAAAATGAAAA +ATGATGCTATGAAGAAAATCCATGACATTCTGGATGAAATGCGTGCCGCCGAAGATGAAA +TTTTAGGTCGTGCTGAAGCTGTAGCAGATGAATTCTGTGAATCATTCTCTTATGGAGAAT +ATGGCTCAGGTCGTACATATTTCCCTAAAGGTACTGATGCTGATAGTATCCCTTGGGATT +TTGAAGACGAAGCTAATATTGAAGATGGAAAATCTACTGTTGGCATTTGGGTTTCTTCGA +GTGAGATGTGCTAATGGATAATGTAGATAAGAAAGAAATTAAAGTAGCTCTTGAAAAACA +CGTCCAGGATGCTATTGATTATGCTAAGAAAATAGCAGATAAGTATGATTTGACTTTTAA +TATGTATCCGGCTTATGGTATGGGTGGAAGCTATTATTCTCCAGGCTACTTAAAACAAGA +CTTGGAACATCATCAGTCTAATGGTTATCCACAATTTGCAATTGTTAACCAATATGAATA +CTACACCAGCCTCGAAAATGGTGGTTGGGTCTCTTCTTCTATGGAATGCTAAGGAAAAAT +TATGTCTCGCTCTAATGAATTGATGGAAAAAGCTGCTGCACTGTCTAAACTGTTCAACGA +AGTTGCTGAACTTGCAAAAGACAATGATTACGGTCTGAAATTTGATACTTCCGATGGTAC +AATGGAATTCAATGACTGGTTGTCTTCATCTTGTTTTGGTGAAGGTGATGATGGCTTCGG +TGTCAATGCAGATGGTAGCATCTGGCAAAGTAGTAGTTGCTAATGAAAGCCCTTCAGGGC +TTTTGTTGTTTTGTACTCTGCCGTTTACATTCATTGGTAGATGTGTTATGATAGACTCGT +AATCAACTAGGAGGAAAAATGAATATATTTGTTAAAGACGGATATCTAAGTTGGGAAAAT +GATTATGGCACTTATGATTGTTGCTCGGTAGTTTACTTGCAAAGAGTGTACAATGACCCA +AATTATAAGCCGTCTTTAAGATCCGCTGCTTGGATTTTAGACCAAATGAAGATTGATGTA +AAAGTCAATGCTTGCGGAGTGTGGGACCATGATAATCGAGATTGGAAAGACGTTAAAGAA +AACTCTCCAATAGATATCGAACTTTTTATTATGCAGTGCAGAATGAGGAAAACGGAATGT +TGCTAGTTATTGGTTCTCGTGCTTTACACCATCATGGTTTAATTGAATCTCGTGATATCA +AAAATTCTGATTGGGACTTCATTGCTGATGCTGGTGAATGGGAAGCATTTAAAGGCCAAA +TGTTTGGAGCCAAAGTCGAAGTATCAAATCCAAACGTCAGCGCGTTTAAATGTATGCACA +ATGGTCGTGAGACTCATTTTGAAGCTTATATTGTTCATCGGTCGCCGAATGACCAACCAA +AAGATTCGAGTGAGTTACTTCTCGAATATGCGGAAGGTAACTGCAAATTCGACCGTTTGA +CTGGGTTTCGTTGGGCCAATCCGAACATGTGCTTGGCTATTAAGCTGTCTCATCGTTACA +AGAAGAACAATCCACACTTCCGTAAAACGATGCAACACATCCGATTCCTGCGTAATAAAG +ACGTGCGTCTTACTGAATATTTGATGGACATCAGCAAGCTGCGTGAAAAGGAAACTTTGA +GTTATGCTCATCCGGTATTGGATACAACAAAAGATAAATTCTTCAAAGATGATATCTACA +CCTACGACCATGATACAATTCATGAAGCGGTTGCATTGATGGATCGTCCAGCTTATACTT +TCTACATGAAAGATGGGTCTCAGGTAATGACTGATAAGAAGAAATTCTTTGAATTGCCTA +AAGAAATCCAATTGGCTGGTGTGTACGAAGAAACTTGCGTATTAGCGTTGGAACGTTCTC +AAATTCCAAACGATTTCAAAAATGTTTCATCTGAACATTCCTTTATGATGGCTTTGGAGA +AGGTTTGTACTTCAATCACGAGCGGTTGGTTCCGCGAATATGCTTGGGAAAACTACCACA +CAATCGTTGCGATGTACAAGAAGCTTGGTGTTAATGATTACATAAAACGCTTTAAAGAAA +ACCAAGATCTCCTGAAACCTTTCACTCGAGGCGAAAATGCATAAGCCATTTGAAGAAAAA +GTATGTCCAGGCTGTGGTAAAGTTTTTAGAATAAGAGCTGGTGTAGGACAACATATTTTT +GCTGAACATATTCGTTATTGTAATGCATATAAAATTTCATGGCAATATCCGCGTTAATTG +AGGAAAATATTATGAATAGCCCAAAAACTTTTGATTCACAAGTACGATTCCGCGGCGAGT +TTGCTGAAGATATCACGAATGATCAAATCAAGAACGAAACGATGTTCTTTAACTCTGATT +TGAATTTTGCTTGGGATAAAGGCGGCCCGATTACTCGTAGCTTTATTGATAATCTTCCTT +TTGATTGGACTAATAGAGATGTGGTATTCGATTCTCGAGTGCACATGTTGATGCCAGGTT +GGTATCCTGCTATTCCTGGTTATCACCACGATGATGTTCCACGTCCTGATATTCCAGTAG +GACAACATTTTATTACTGCTGGACAACCTGATTACGATAATCCTCGTTACCATTCTGAGC +ATATTCTTGGATTGGTTAATGCTGATATTTGTCCTACACATTTTGCTTATGGCGAAGACG +TTACATTTAGCCAAATCCCTGAAGGCGAATTGATTTACCGTCAATGGCACAAAGAAGTTC +TTCAGAAAATTGAATCTGGAGAAATGGTAAAACTTGAAGCCCCGGATCGTACTTTATGTG +AATTCAACTGGCAGTCTTTCCACACTGGGTCCATGGCTGTCGGTAACGGATGGCGTTGGT +TTGGTCGAGTATCTCGTAACACCGACCGTGTTAAAAAGATCACTAACGAAATTCGAGTAA +ACGCTCAGGTTTATCTGGAATTCCCAATGGAGGGATGGTAATGGGATACTCAGCTATGAA +TGTTAGTGAATATACAAAAGAGAATTTCCGTAAACTCTGGAACGAAAAGTACGCTGATAG +AATTTGCTGGATTGTACTAGTTTTAACTGTTATTGTGATTAATGGATTAGTCATTTATTT +CAATGGAATAATTATTGAGCTTATTTTTAGTGTTCCTTTGAGCGGTTTCGCCGGTATGAT +TGTAGCTTGTATTAATGACGAAATACTTAAGGCGATTTATATTAGAATTAAAATCAGACG +TAAACTTAAGGCTGATGCTAAACAAAAAGAAGCTGAAAGCTTTAGCAATTTCATAAATTC +ATGTAGGATTAAACGATGAACTATTTAGAAGAACTTCGTGCTCGTTTAGATGAAAATGAC +ATACCATGGGAAATAGTAAAAGATCCTTATGTTCGAATTTTAACAAACAACGGATATTTT +ACTCTGACATTATGTTCAGCTGTTGGTGAAAACGTAGATGAGCTTGTTCAAACATGTCGT +GCATATGCTAAGGTCTGGATTTACTCTGTTGACCCGTTAGCTGTTTCTGGTAAATTAATG +ATGCGTTTTGCGTGGGTGAAATAATGAGTGTAGCAATTTATGTAGAATCAGAATCTGGAG +ACGAGTATCTTTACTCGTTTGGTGATGGCGAAAGCGAAGAAGCTATTAAAGACGAATTGG +AACGTCAAATGGAAATGTTTTCTCCGATGTGTAATTACATGATCTCTATTTCATCTGGAA +CTTCGCCTTCTGTTGACACCCGATTGGAAGAGTTTATGTCTGAACTTTTTGATAAATCGT +GGAAATTTGAGAGGGAAAATGTCTAAGAAAAAAGAACTATCAGCTGGAATTTTATTCTTC +ACCAAAGATTCTCGTCTTTTCATGGGTCGAGTGACTAACTCTGGTTTAGGTGGTGGCCCG +TCTCGTTGGGATATTCCAAAGGGTCATGTAGAAGAAGGTGAAACTCCTAAGCAAGCAGCT +ATCCGTGAATGCCAAGAAGAAACTGGATTCACTGATTACGACCAGGGTTTACTCTTTGAC +CTAGGCCAACATGACTACGCAAGCAATAAAGATATACATCTGTTCGGATATCCTGTCCCT +ATGGATCACTCGCAATTCAAGAATTGTATTTGCACAGCATATCATACTGCAGAAGACGGA +ACGACATTTCCTGAAATCGATGCATTTGCTCTGATTAAGCCAGAACAGTGGAGTTATGTT +ATGGGTCCATCGCTGTTCAATGTAATGCAGAAACTTTACCCGGCACTAGCCAAACGATAA +ATACTCCTATCAAACGATAGGAGACGACATGAACATTTTTGAAATGCTTCGTATCGATGA +AGGATACGATTCTAAAATCTATAAAGATACACGCGGATATTACACTATTGGGATTGGTCA +CCTTTTGACTAAAGACCCGTCTTTGGCTGTTGCTAAAGCTGCCTTAGATAAATTGGTTGG +TCGTAAGTGTGACGGTGTAATCACTAAGGCCGAAGCTGAAAAAATCTTCGCTAAAGACGT +CGATGATGTTGTAGCTGGTATTCAACGTAACGCTTTACTGAAATCAGTTTATGATTCTCT +TAATGGAGATGATCCGCGTCAAGCAGCATTGATGAACATGGTTTTTCAGATGGGTGTAAC +CGGAGTTGCTGGATTCACTAACTCAATGGCTCTGATTAAATCTAAACAGTGGGATAAAGC +CGCTATCAATTTAGCCCAGTCTAAATGGTACAAACAAACCACCAATCGTGCTAAACGCGT +TATTGCAACATTTAAAACAGGAACATGGGCTGCATATGAAAACCTATAAAGAATTTTTGA +CAGAATCTCAATCTAAGATTGAAGAAAACACTCCGATTGTAGAAGCAACAATCAAAGATA +AAGAAGGTTCAACAAACTTCTCTTTGGTATCAGGAAAAGACGGCACTTTCTTCCAGATCG +GTTCTGAACGCTTCCAGACTTCTAAATTGCAGGATGCTGCAGTAGCTAAAGTTCTGCGTG +GCGGCGGTAAATGGAAAGGAACTGAAGGATCTACACAAATTGGTATTGCTGTAGATAATA +ATAGCGCATTTTTCCGTATCGGCGGTGAGTCTTTTACTCTGAGTTCTAAAGCATTCAAAG +AACTTAAGGCTGCGTTTAAGTAATGTTGTACATCTCCTGTGGTTGTGTTACTATACCTAT +ACTGACACAGGAGAACTAAATGACTCGTATTAACTTAACTCTAGTATCTGAACTGGCTGA +CCAGCATCTTATGGCTGAGTATCGCGAACTACCTCGTATCTTTGGTGCAGTTCGCCATCG +TATTGCTAAAGGCCAAGGCTTCAAGGATATTCCTAAAGATTTCTGTCTGGGCGCTGGACA +TGTTAAATTCTTCTATGACAAAATTGGCTTCTTGTTCTTACGCCAAAATGCAATCATCGA +GGAACTTCTAAAACGCGGATTTAAAATTGCAAATACTCATGTCAACGTAGCTGACATTCC +TCTGTACTTAATGAATGACTTTGTTCCTTCTGCCGAGGATATTGCTTTAAGCCAAGCTCG +ACTGGATGAAAAAATTGCTCAAAGGCCTTTGTGGTATAAACATTATGGTAAGGCTATATA +CAAATAATAAAGGCACGCCGACCCTCTCCTATGAACAATGTTCCTCTAGTGAAGGACCTT +TTCCCACCTGTAATAAGGTCGAGCCCGAGTGCGGTAAGGGGTTTACATTCGGTGAAGGCA +AGGAGCCCAATTCGATTCTAGGAAGGAAAATGGACTACCACGTGCCATGGAATGGCCCTC +AACTAATCAGGAAATAAAATGCAATATTTAACATACCCGTATCTAACCTTGATGCATGCT +TTCAAAGACCGTGCATTCGAACGCTTAGATCCACATAATGATTATTGGAAGTGCTTAACC +CCGATGTCTCGAGTTTCTGAATTTGGAACTCTTCGCCTAGACGGTGGACGACAAACCGGT +AAATCTGAAGCTGCTGCTTTATTCGCTGCCGATTGGCTTCATGATGGTAACGACGTAATT +GTTATTTCAACCAAAGCCGCTCAATCCAGAGAGCTCAAAGAACGAATTGAACGTAAAGCA +AAAGGCATTCAGCGAATAGATTCTAATCTCCGTGGATTTTGTGTACATGATACCATCAGA +AGTTTTCTGGATGAAGACTTTAATAAGTACAGAGGTCTTTCGCTTACACGAGCATTGATT +ATAATTGACGAACCAATGAAGATGCCTGATGTTAAGAAGTTCTATGAATCATACTTCTAT +CTGGCTAATCACTGTTTATGCCAAGGCGATAAACCTTTACCTCTTTTCTTTGTGATGGGA +ATGCAATGATGAAATTTATGTTTATGGATGGTCCCTTTCGGGGAATGGTAGTTCGTACTA +AAGCTACTAAAGCTGAATTGAATTCTGTTCCAGATATTCCTATCGAATTTGTAACCGGGC +CTTTTGAAGGCCTGATTACTCGTAGCTTGATTTGCTATGATCGAACAATGATCGAAGCAA +GAGAGCTTAAAATGCGCCCAGGTTCTCAGGGCTCCTATAATTGTACTTTGGACATATCTT +ATAATGGCTAAAATTGTAATTGAATGTGCTGACCATCTGGTTAAAACTTTCTGTGGATGG +TTCAGTAACCAAGGCGAGCAGGATCTTTTTGAAGCTCACTCAAATGGTAAGTGGAATGAA +GAAATCCAAAAGTGGGAAGAGCAAACGACGTATTTGGCTACTGAGGGTTATGGTATTAAT +GAGCCTATTCGCTTAGTTGAATATGATAAAGAGACCGACGAGAGAGTCCCTTATTTCGAC +GGTGAGAAGCTTAGTGCCATACAGGCGATGGTCCCTAACACAGGTCAAATCTTTGAATTG +AGGATGCCGAAATGATTGAAGATATTAAAGGCTATAAGCCACACACCGATGATAAAATCG +GTAAAGTGAATTGTATCAAAGATGCTGAAGTTCGTCTTGGTTTGATCTTTAAAGCACTAG +AAGAAGAACATGTAGCTGCGTACATGGCTCTTGATGTAGACACTATGAGCGATGAAGAAT +TCGATTTAGCTCATTCTAGAATCACTCAGATTCGTAATGCAATTGATCGTCTGAAAGAAG +CTAGTATGTGGGCATGCCGTTCTGTTTTCCAACCTGAAGAGAAATACTAATGAATGATTT +GATCCAAGCTTTACTCCGTGTAGAAGATGAGTGTGCTGGCATTTTATTAATGGCTAAGTT +TGATCCATTTGGTCAAACTGGAATTGATGAAATCAGAGAAATTCATCGTAACACTGTAAA +ATCTTTAAAAGCATTAGTTCAGCTTAAAACTGAAGAAATCTAACCGTTTACATCTCCTGC +AAGATGTGTTACTATGATCTTACACTTACAGGAGAAACAAAATGAACACAACTGAAGTAT +ACATCCGCAGAAACAAACTCCGTCGTCTCTTTGAAACGGAGTTTCATAAAATCAATGCTA +AAATCAAAGACGCATCGAAAGCAGCTGGGGTTCCTGGCTTCCATTTGAAGTACTCTCAGC +ATTTGCTTGACCGTGCTATTCAACGTGAAATCGATGAAAATTACGTCTTTGAACTTTTCC +ACAAGCTTTCGAATCATGTGGTTGAAGTAAACGCTTTCCTGGAACTCCCTGAGCGCCCGG +ACGTTGAAGAAGATCTGGATCCGAACATCGAGTATCGCCCTCTCCGTCTTGAAATCACTG +ACCAAAAGTTGTGGTTAGGATTTACAGTGTCAAAACCAGTTCCAGGTAAAACGTTTTCTA +CTCCATACACACTGAATTGTCGTATGGCTTTCATCAACACAAATCGTCATGAAGGAAAAA +TTAGTAAAACTGTAATCAATCTATGAGGTAAACATGAAAAAAGCTCTATGCGCAGGTCTC +TTGGCCTTCTGTTCAATGGCCTATGGGTCCGAGCACAACTTCAGTAATGTCCAACTCGAA +AATCTGAATTATGCGTATCAGTTTGGAGAGCAATTTGCAAAGGATGGCAAATACAAGACA +CAAGAAAAGCTATATGACAACAAAGGCCTAGGCTATGTTATGGCTGCTTTACTTTGGCAA +GAATCTTCCGCTGGGTTAAAAACCAAAGGAAAGTCAGGTCATCAGGCGTATGGAATGTTC +CAGAATTATTTGCCGACAATGAGAAATCGAGTCGCCGAAATAGGATGGAAAATGACTGAT +GCTGAAATTATTAGAATGTTGAATAAACGATCCAATTCCGCTTCATGGGCGTATATTGAA +CTTTCTTATTGGTTAAATAGACATAATGGTGATATGCGAAAAGCAATTGCCAGTTACAAT +GCCGGGAACAACTGGAAATCTGGAAACAAATATGCCAGTCAAGTCCTAGAAAAAGCATAT +TACCTGAAGTCAAATAAACTTTTACATATTGAGGTAGAATAATGCAGAAACTAGCTCTGG +TACTTGGTCTACTGATTTCATCTGGTGCTTATGCATCAGGTGGATCACTGGAGGAATCTC +TTAAAATTGCTAAATCATTTTGTGCGACAAACACAGAATGTATTGACATCTTGTCATTGC +AATTAGATGGAGCATATGAAGACGGAGTCAGAGCATCTAAGTCTAAGGTAGAGTGGAATG +TACTCATGAACCGAAAGACTAAGCAATTGAATAACCTTTGCGATAAAGCACCTAACGTTG +AAATCTGTCTGGATTATAGAAATCGTTTGATGGAGCAATATATGAAAGGGTTAACGGAGT +GAAAAAATATTTATGTTTGTTGATGGTTCCGTTTGTGTTGAACGCCTGGGATATTCTCCC +AGGCTATCCTGAAAAGATACTTGCGGTCCAAGGAAAACAAATTGAAACAAGCGGCTCATT +TAAACGAAACGTTGAGTTAATCTTCGTTCCAAGTAAAGAGCTTTTAGGGATATCGTTTTA +TAATTATAAAGATAAGGGTGACCAAGTAACAATTCCTTATGGCACCTACAATATCAGAGG +TTGTGAAATGAAAGCTTCTGGTGAAATTGAAGGACCTTACTTTGTCTCTTCGCTGAACAA +TTACAACATCAGCAAAAAGATTATTCGATCATGCTCTACGTTCTTCATTAGAGTATACGA +TCAGACCGATAACTACTCAACTTACGTGGTGGAAAATGATTAAGCAATATATCAAAGGTG +ATATTGTAAAGATCTTCTTGGAAGGTCATAATGTGGCTCACGGTTGTAACTGCTTCCATA +CAATGGGTGGAGTGGCTGGACAATTAGCGAAGGCTTATCCTCCGATTCTAGCAATTGATG +TTCATGAAACTGAGCATGCAGATCCGGATAAATTAGGGAATTATACCCAAGCCACAGGTG +AACGCGGACAAATTTGCTTTAATCTTTATACTCAATATATGCCCGGTAATAATTTAGATT +ATGGCGCATTGCTATATGCTTTCCAAAGTCTAAATTATTGGGCTAGTAAACGCCAGGAAA +CTCCTACGGTTTATATTCCACGTATTGGAGCTGGTATTGCCGGTGGTGATTGGGAAAAGA +TTAAGACTATTATCGATTGGTTCACTCCTGATGTTGATATTATCGTTGTCGATTGGGATG +GTGAATAATGACACATTATATCCATCCGTTTGACCCGAAGAATAAAGCTAATATTCATCG +GCGATGGATTGAAACCAGAAAAACTAAATGTCCAATCGATAGCCCGCATAACGTAGACCG +TTGGTATATTGGTGAATATGTCGAGTACACTTTTATTGATAAAAAGAAACGTGTACAATA +TGTTGAAGAATATTGTCTAAGGATTAAATGGTTATGATGTCCAAAGAAGAACGCGAGCGT +ATTATTGATGATATTGATGAATTGATTCGTTTAGCTAAACACGCTGGTGTTATGGCTGAA +CTTGGTACAGATGATGAATACGCTATGGCCGCTTCAGCTTTATGTAAACAACGATATAAT +GTATTAAGCAAGGACGGAATCGAATGATTACCAGAGAACAAGCTCGTGTAATTTATGATT +TAGTCAAAGACATCGAAGATGATTCTGCTTTCTACACCGGCCAATCATGCGAAGGGACTT +ATTTTGGATTAGAACAAGCTAAACTTGATTTAAGTAATTCCAAAATAAAATTTGAACAAT +TTATTGAGAGTTTAATTGAATGAGCGGCCAATACAAAATTTTGATAACTAGCAAATGTTA +CGCTTATGGTCAAGGCGAAGCAATCTCTGTTCATACTGTTATTGCTGAATTTGATACTAA +AGAGCAGGCTGATTTAGCATTTTATAATATGAAAAATAATTCTGGACCGAGTGATATCGG +CGTTCGCCAAGCTTATATAAAGTTGTATTAAAATGATTACTAAAGAACAGAAAGAGACTA +TTGTTGATTTAGCTCGGTGTTGGGCCCAAGCTGAATCGGCAGTAGCGTGGGAATACTCCT +CATGGAGTTCGAGCCGATCTGCTAAAAAGCAAGCTGAAGAAGAATCCGCCGAAGCAGAAG +CAGATCTTGAACAGTATCTTGATGGGATTATGGCATAATGCAAACTTATATTAAAGAAAA +GCACCGTTGCAAAGATTGTAAATGGCCTATCGTGTTTTCTTTATGTAATGATGGTTTGAT +GGATACTCCACCATATAAGATGTGGGATTGGTGGTTATATTGCTCAAACAAAACATGTAA +GAATCATGCAGGTGAAGGATTTTTCCAATATACTCCAGAGTGGATTGAATCAGGGGAACC +AAAATAATGGCACAACTTTACTTCAACTATGCGAGTATGAATGCCGGGAAATCGGCTAAC +CTTTTGACAGCTGCTCATAACTATAAAGAACGTGGAATGGGCACTCTGATTCTTAAACCA +GCAGTTGATGACCGTGATTCTGCTTCAGAAGTAGTTTCACGAATTGGTTTAAGACAAGAT +GCTAATATAGTTACTCCTGATATGGACATCCTGGAGTTCTTTAAATGGGCTCAGACTCAG +CGGGACATTCACTGTGTATTCGTTGATGAAGCACAATTTTTAAGTGCAAGACATGTTGGA +GAATTAGCTCGGATTGTAGATTTGTACAATGTCCCAGTTATGTGTTACGGGCTCCGTACA +GACTTCCGGGGTGAATTGTTCGAAGGTTCTAAAACGCTTTTAGCGATAAGTGATAAATTA +GTAGAACTTAAAGGTGTTTGTCATTGCGGACGAAAAGCCACAATGGTAGCCCGCATTGAT +GAAAATGGTAACGCAGTTCGTGACGGTGAAGTGGTTGAAATCGGCGGAAACGATAAGTAC +GTTTCCTTATGCAGAAAACATTGGTTTGAGATGCTCGATATATGATTGAACTGTTAACAA +TTCTGGCTGGAGTAGCTGGGATATTCGCTATAGGCTTTATCCTTTACGTTATTCTGATTT +ATTTGGTGTGCTTATGAAATCAACAACTTATAATAGTTTTTTGCTATTAATGACAGTGTT +ATTATTCATTTGGATTGCTGTTGCAGCTTCTACTCAATCAGAACGAAATAAACAACTAAA +AGATCAAAACACTATTTTATGTGAATCCAAAACTGATGGCGCATTCATTGCCAATAGTCT +CGGATGTTTTATTAAGGTAGAACAATGACAGAACAAGAAGCATGGCAAGAACTTCGGGCT +TTACTTAAAAAGCATAAAGTTGAATTACATTTGTCTGAACCTCCAGTATCTCTTGAAGAG +ATTATTAACATCCGTGAAGGCCATGAAGTTTTGCAGAAAATTGACGAGGTATTAAAATGA +AAACGAGTTTCTTTAAATCTGGATTTTACTATCGATTGTGTAACTACAACCGAGGTAAAT +CAAAATGTCAAGAACTATCCGTCGTAAAGGCTGGCATGTAACAACTTCTTCTAAATGGCA +CAATCAGAAGAATAACGAATTCGCTTATATCAAGCGTTATACCGAATACGTTAAAACCAG +CAAAGATAAAGCAAATCAAGCTAAATATGTCGAAAGATATATCGCTGAAAATAAGAAAGA +ACCAGTTCGTCTTGAGAAGTTGATGAAAGAACGTCATCGTGATTCATTCTGGAAGACTCT +ACGTTGGAGTCGCTATGCTTCACCTATTCCTAGAGTGTTTCACAAGATGGAAATTAAAAA +CTCGTTGAGAAACGACACTGATTATAACTGGGACGAGAAAGCCGCTCGTAAGTGTGAGAA +AGGCATCGCTCAAATGAATTGGGATTAAAATTTCGGAGTACAGATGTACTCCGTTATAAA +TACGTTTACTAACTAATGAGGTGTATATGCAGCATTTAGACGTTAAAAAGCTTCGTAATC +TTACTGTAGAACAACTCGATGAAATCAAACGTGAAATTGGGCATGCTGTTGCAAGTCTAA +ATGAAGAAATTCGTCAAAGTGGCTCACGGGCAGATTATATGCGTAAGCGAAATCTGGAAA +AATACCTCGACAATGTTAAGGCTGTACTTCAGCACAAACGTAACACTGGTCAACGATAGG +AGGCCTTATGGCCTTAAAAGCACTGGCATTATCCGCGCTTATTGGAATCATGATGATTCC +TGCTTCCTACGCAGAGGTCGACTTCAATCCGAAGTTTGATGAATATTTTGAGGGTGCATT +GAAGGTTTACTCTCAATATAAGATATACAATAAGCAGGAAAGTGAGCAGTTCTTCACATT +TGTTAAATCAAAATGGGAAAGGCAACCATGCACTAATAACTGTGAAGCTGATGGAGCTTT +GGTTGCGCAAGAGTATTACACCAACCGATTGGTAGAAGGCAAACATGAAATTTGAAGACT +TTGCTAAAGGCAAAGCATCTGAAGCGGATGCTTATCTTGGGTTGTTGATGGCTTCTCGCT +CTTACTTTCATTCAGCCCACTTTGAAACAGAAAGCTATGCTCGTCATAAAGCATACAACT +TCATATTCGACGAGCTTCCGGATTTGATTGATAAGTTCGGTGAACAATGGCTTGGCTTTT +CCGGAAAGAAATATGCTCCTCAAATTCCAGAGCAAAAATCTCTTCCTACTGACACTATAA +AAATGATCGATTTGATCTTGGCTGAATCCGACAAGATCTACTCCAAAGTCCCTCGTGCTA +TTCAAAGCACCTTAGATGATATCGTTGGAACTTTCTACCAACTCAAGTATCTTCTCTCCC +TGAAGTAACACTCTGCCCTGGCTTCGGTCAGGGCATTTTTGTTTATGCTGTTTACATCCT +CAAAAGACTATGATACTATAGACTAGTAATCAACTAGGAGAACAAAATGAAAAGTTTGGT +AGTCGTAGCTTATCTCTACGTTCAGTACAATAATCCGCTTTTCACTCGTAATGTTATCGA +TTTTATCTGGAGCCAATTATGAACGAAGAGAACAAAATCAAATTGCTCGATTTGATTGAA +AAATTGCGTCAGGCCGATTTAGCATATGTTGCCCGGTATGAAGGTTCCGGCACGGCAATT +CCTCAATACAAAGCTATGCAAGCTGCTCAAAAAGAAATGTTTGATTTTATTCAATCTCTG +TGAGGTTTTATGGAAATCCAAGAAAAAGTTTTAGATTATGGAAGTCGTTTCCAATCTATC +AAACGCACTATTGAATACAACAACGGTCATGATGAAAACATGCTGATGGTTGATTTTGAC +AATGGCGAAGCTGTCGGAGCTTCTTTCAAATTCAATGGAACTTTATCATGTGGAGGCGGT +TCTTATAAGATTGAAGAACTTCGTCGCTTTAAAGCACTTTTGAATAGTTTTGAGGAATTG +TAATGGCTGTAGGATTTGCAAAAGACGGAGCAGAACAACTTGAAGTTGAAGCAGTAGTTC +AAGCTGCAATAGTTCATGCACGGTCTCAATTTAACACTACACGAGAGTCTCTTTATCGTT +GCTTTGATTGCGAAGAGCTAATTCCAGAATCTCGTAGACAAGCAGTTAAAGGTTGTTTGT +ATTGTGTAAAATGTCAAGAAATGCATGACGAAACTTTTAAACGTGAACCCCGTAATTGTT +GGCACAGGAGCATGAGATGAGCTTTCCAAAACTTGAAGTTGGTGATCTAGTTTTAACTCG +TACTTACACTGGTGGACAATCAGTAGAAATTTGTCAATATCGCGCACAGACCGGTAATTT +GATGTACATGGCTTATCATCCAGAAGCTATCTTAAAATGCCAGCTGGAGCGCTTCATTAA +AGATACAGATTCAATGCCTTATAGTGTAGATATTGTACGCAAAAGTGATTCTGAAAAATG +GGCAAAGGTAATGATGAGCATCCAAAAGAGGCCGGAGTGATTATGAATTTTGTTTCGCTT +TACGGGTATGAGATTATAATTTCATTACTGATTTTAATCATCGCAATTTTAATGACGAGA +AAATAATGGCTAAATTAATTTGGGAAGGTGTCGGATACGGCGCAAAAATTGAAGAAAACA +TTCCAGGTTCTAATCAAAAGTGGTACACAGAACTTGATGTTATTTCCAACCAGTCGCATG +TAAGCATTTACGATGTTGATAATGGTGATGAAGTTTCACTCACTAAATCCGAAGCGGAAG +CTTTGGTGAAATATTTAAACTCTGTAATTCCAACTATGAAGGAGCATCATAATGAATATT +AATGAAAAATCTTGGCACTGCCGTTTACATGACTTTGCTTTTGACAAATACTCTCGTCCT +CGCTCTCTTTGCCCGTACTTCTGGAAAGTAGTATTTGCTTTATTTGGTATGACGTCATTG +ATTGTATTATTGTCCATCGCATTTACTTTAGTTGGTTGGGAATTGGCTGCAGGTTGGTTA +GCTAAAATTGGTATTACTTCGGTCTGGGCTATTGGAGCTTCTGGATTTACTATCGGGGCA +GTTGGTATTTTGAGTTTAGTTGGTGTAGTATTTGGCACTCTGTTCGGTTTAGCCAAATTA +AAAGATTTGATTGAAGATAAAATCAAAGAACGTAATTATGAAAAATATATTCAAGAATTA +GAAGCTCGTAAAGACCCAAATTACGTTCCACCTAAAAAGAGTATTCTGATGGAATTCATC +CGAGCTCGTAAAGAAAAATTCTGTCCATCTCTGACTTTCACTGAGGAATAAAAATGATCG +GTATACATAAGTTTGAATCATTCGAAGAGACTGTTCGTTTGGAAATTGGTGATCGTCTGA +AGGTGACCCTCAAGGGAACTACAAAATCTCTGCTAATTAAAGTAATTGGTATCACGAACT +ATGGTAGATGGACTGACGGTGATCGTCTAAACGTTATTATTGGCAAAATTGATGATGTTG +CCGGTCATTCGGTAGTTTACATTAATAACGTAAACGGACAGGTTATTCATTATCTTCCTA +ATGCAATTCACACATACAATGTTCACGATATAACTCACAATGATACTTCGATCGCGTATG +AGGATGAAACGCTTTATCCATCACGAGTTAAAATTGCTCGTAAACAAGTACTGGTTCCTA +TTAAAGTCGGTGATGAATTGACTAAGCCAAATCGTTCTGGTACTTACACAGTAGTTTATG +TAAACAACACTGTAAGTAATATTGTTGTTGAACGAAATACTGATAAGAAAATCGAAGTTA +TCAATTTTAAAGATACCGTCGCGCTAAAAGCCTTCGGCCTTAAATGGAGAAGTTGATGAA +GACAGTTGTAAAAAGTTATTTTGGGTCCCAGCTTTATGGGACCTCCACTCCAGAATCCGA +CACCGATTACAAAGAAATCTTTATCCCACATGCAAAAGATATTCTGATGTGCCGGGCAAT +GAATCACACTAACCTGAATACCAACAACTCTGCCACCAAAAACACTCATGATGATGTAGA +TCATGAGTTGTATTCCTTGAAATATTTCCTGGAATTGGCACAGAATGGTGAAACTGTGGC +ACTGGATATGCTTCATACTCCTCCAGAATTGGTCGTTGCTTCTGACCTTCCTGAAGTGTG +GAAATTTATCCAAGACAATCGTAGTAAGTTCTATACCACCGACATGAAAGCTTATCTTGG +TTATGTGCGTAAGCAAGCAGCTAAGTATGGTGTTAAAGGTTCTCGTTTAGCTGAACTTCG +TCGTGTATTGGAAGTTATCAATAAATTTCCTGAATGGAAATACGAAAATCGTCCGAAAGA +TAAAGCTAACAACAGTCGTTGGAAAGTAGCTGATATTGCAAGTAAACTTCCATTGAGTGA +ATTTTTGTTCTGGGAAGATTTTGTTGATGCTAAATGCGGTAAACAACGTTTCTATCATGT +GCTTGGTCGTAAATTCCAGACAACAATCACTGTAGCTGAAATGAAGTACTCCTTAACTAA +ACTTGAAGCTGAATATGGTGAGCGTGCTCGTAAGGCAGAAGCTAACGAAGGCGTAGACTG +GAAAGCATTGAGTCATGCATTACGTGGTGGACTTCAACTTCAAGAAATCTACTCTACCGG +TGACTTAAAGTACCCACTGAAAAACGCACAAGACATTCTAGACGTTAAACTTGGTAAACT +TCCGTTTGTTCAAGTCCAGCAGATGCTTGAAGATACAGTAGATGAAGTTGAGCGTTTAAG +TATTCAAGCTCATAAGAATGGTATGCCTTCGAAAGTTGATATGACATTTTGGAATGATTT +CCTAGAAAAAGTTTACTTGGAAAACCATGGAGCTTACTACAAATGATATGGTGGTACATA +GTGCCTGTGATAATTGCAGTGATTTACCTCGTAGCTGGTTGGTATATCGTAAACGCTCTC +GTTAAACGAGGGGCAATAGAGACACCTCAAGGCTATATCTTTATATTACTATTATGGTTA +CCTGTCGCGGTCGTCTCGATCATCTGGCGAACCCTAGCATGGTTACTACTGTGGCCAAAG +CGCTTTGCTGAATCCCAGATAAACAAACACTCTTCTTAACCTCCTTCGGGAGGTTTTGTT +GTTTTTGAAAAAATGTTGTACATCTTAACTCAATGTGTTATTATAGACTTATCAAATAAA +TGGTAACCCGGAGAAACAAAATGACAGCAGAACAAATTAAAGAGATGATCGCAGCAGAAG +TAAAACGCGTCATCCGTGAAGAACTTAAAATTGAGTACAAATCATCAGAAGATGCTTTGG +ATATCGATTTATCTCTTGATGGTGAAATTGTATCAACGATTCAACTGTCTAAGTATGATT +TACCGATTTAATTGCAAATATTTTTGCTGAACCGTTTACATCAGTTCAGCAATTTGATAT +TATTACCTCATACCAAACAAATAGTAACTCGGAGAATAAAATGACAACCATCACTATCAA +CAAAGGTATTAACTTCGGTAAAGAAATTTCTGGCACTTTCGAATTAGTCGGAGAATGGTT +CCCAGAAACTCTGAAACCCGAAGATGCTGCTCAAGGTGATGGTAAAGTTTTCGTTATCAT +CGACGGTAAGAAAAAAGGTGTTTGGGTTTACAAATCAGACATTTCTTATAACGGAGTAGC +TAAAAAGATTGAACTGATTGAAAGTGTTGATGATATGAAAGCTCGTATCAATAAACGCTT +TAACGTTATGGGGATGATGACTGCCGGAATCATTAACGGAAACATTCGTTCACTGATTAT +CTCGGGAGCCGCTGGTATCGGAAAAACTTACTCCTTAGATAAAGCATTGAATAAAGCAAA +TGATGAGGATAAAATTGAATACAAATCAGTGAATGGTAAAATCTCGGGTATCGGGTTGTA +CTGTCGCTTATGGGAATCACGCTTCGATAATTCAGTTCTGCTTATTGATGATGTAGATGT +ATTCTCTGATATGGATATTCTGAACCTTCTGAAAGCTGCTTTAGATTCTGGAGAAAAACG +TAAAGTTTGCTGGAGTACTGCTTCATCTTACTTAGATGAAAAAGGTATTCCAAATGAATT +TGAATTTGAAGGAACAGTCGTTTTCATCACTAACGTTGATATTGATAAAGAATTAGAACG +CGGTAGCAAATTAGCTCCACATCTCGCTGCTTTGGTATCTCGTTCGGTTTATTTGGACCT +TGGTGTTCACTCAAACGAAGAAATCATGGTCCGAGTTGAAGAAGTAATTATGAATACTCG +GATGTTGCAAAGCCGCGGTTTACGTAATTCCCAGGTTGTTGAAGTATTAGATTTTATGCA +AGAAAATGTATCTCGTCTTCGTAATGTATCTTTACGTACTGCTCTTTATCTCGCTGATTT +CGTCGCCACTGACGAGAAAAATTGGAAAGATATCGCTGAAGTTACGATGCTTAAATAATA +CTCCGGGAGGAGAAATCCTCCCTAAATTTTTGAGGAAAATATCATGGCACATTTAATCTC +TTATCAAACTAAAATTGTTCTGTTTCGTAATGGTAGCTTTGTATGTGATTCTAAAAGTCG +CGAGTCTCTGAAATATATGTCAGATGCTACAGCAATTTCTTATATTGACTTAAACGGGAG +CTGGGTACAATGAGCTACAAATACTATGTGAGAACCCACGCTTGTATTTTCAAAAGTGTA +TGTGAAAAAGACACTGCTGAATATATTCTTAGTCATACTAGAAATTTAACTGCAATTCTT +TTCACTGATTTACAGAATCCAGCTTCAAATCACATTATGGAAAGAATTCGTTTTAATATT +GAAAATCGTGATGTGCAAGCTTTAGAACGTCGCCTCAAAGAAGGCTATGAATATGCTGAA +GAAAATAAGTGGAGATATTAATGAGTATTTTAATGGGAAATTGGGTGAACAACACCGCTT +ATTATCCACCTGCTCATATCTATGCTGGAATGGTCCAGAGTAAGGCTGAGAAGAACGCAA +TCCGCATCTGTGAAGAACTATACAGATTCAACTTCGGAGATTCCCCTAATGTATTGGGTG +AATTGAGAACGGCCTTTCGTGAATTAGATGTAATGCTCCATATGAAGAATTCTTACCCAT +CTCATATGGAACTTCGTCACGAACACGTTGCTGAAGTATTTGGCACGTTTCTTTATTGGG +CTATTCGTGCTAATACTGAAATGGAGCGAATCTATAAGCAACACCAAGACCTTTGGAAAT +GGTATAACACATCCAAATTAACTAATCGTGAAATTAAAGATTGGTGCAAACAACAACTTG +ATTATAATTTGAATTGCATGATGATTGATGTTTACGATAATTTAGTCAGGAGCAAAGGCT +AATGGGCTACGGGTTGGATGAAGATTGGGAATACGAAGACGAGGAAGAATTAGGCACTCG +TTACAGCATAATGAAAATTGTTTATGAACGAAATGCTTCAGCGAAAGTAGGGGCAGAAAT +GTATTGTCCTTATTGTCGAAAGGTTATTGTAAAGCGTAGTTGGCAACATAAGTTTTGTAG +CACTCCATGCAAAGACAAGTATTGGAACTGCGAACCTAAGCGTGCTCATCGAGCAGAATT +CTTTAAGGGCAAATTATGCAGGTAGAACAATTAAAAGAACTTATTCAATTAGTTTCTAAA +GAACAAATCAAGGAACTTATTCGTAATGAGCTGAGAATTGAAGTTCAGCCAGCGGATTAC +ATGGACCCATGCCGAATTCAACTATGGTGGGATAACGAAGTTATTTCCGAAGAATGTATT +TACTTGAGTGATATTCAACGATGATATCGAAAAAAATTATATTATCTCGTATTGAAATGA +TGAGAAGCAATTATGAAATGGCTATGAAACTTTCCGTGTCAATTGCTTTACGTAATATTG +GTGACGAGCGAAGAAGCAATTTAAAGTTTTGTGCACCTGATGACCGCAGAGCGCAATTAG +TTGAAATTACGAAAGCACTTATTCAAATGGACCATTACCAAATCAAAGATGCTAAGATGC +TAGCTACCGAAAAAGAAATTTGTGCTAAAGCTTTAAGGGAGCATCAAAAACAAACTCCAG +TTTCATCTTGGTTCCATGGTGGAGCAGACAAACCAGCCTATTTTTAACTCTAAGTTCGCC +CCGGCTTCGCTGGGGCATTTTTGTATGTTATCCATATAATCAATCATTCCCTTCTAGAAA +GTTCCTCTCAACCGTTCTGGTGAATCCAAAAATTTTTAAATCAACCGTTTACATCCTCCA +AAGATTGTGGTATGATAGTCTCGTAATCAACTAACGGAGAATAAAATGTTAACTGAAATC +ATCACCTCGCTTATCGAAGAAAATCGTAAAGCTCATCAAGATCGCCGAGCGAAAGTTGAA +AAACGCGCTATGGAATTAAATGCTGGATGGACTAAGACCCGCTACGGTCGTGAAGGATTT +GATAAGGTAGTAGCCCCAACTTGGGGAGTAGATGATCGTCCTCATGCACCTTTTGATGGG +TACCTCTGGGAAAATGAATTAGGAGAAGTTGAGTCTTATCATGGTGGTAGTTATCTTCCA +TACGTTACTGAACTCGACTATCTTGATAAGCCTGAATACACAGGAGATCATGGTTGGTGG +AAGCTGCGTCTCACTTCAGACATGCTTTCTGAACTCATGATTTTAAGGCATGAAACTCAG +TGTATTGAGATTCGTACACCTTACAAAAAGTGGACACTCGAAGATAACACTATTGTGGTA +ATGAGTGAAGTACGTGCTCATAAGACGATTCTTAAGGCAATTCAATCTGCTTCAGAAGAA +TGGTTCAATAACTACTACAGTTCACTTAAAGTTAACAAAGGTGAAGCACCGGTTGGTAAG +CAAGTAGTTAAAGGTAAAGTCGTTTCAACTAAAGTATACCAAGACTATTGGGGTGTATCT +GCTAAAATGATGGTCCGTCTTGAAAACGGAGCTACAGTTTATGGTTCTTTACCTGGAATC +GTAGATATCAACTATCGCGGTACTATCGAATTTAAAGCAACGTTTGAACAAGCAAAAGAT +GACTCAACTCACGCTTTCTTCAAACGTCCATCTTCTGTAAAAATTGAAGAATAAACGCTT +TAAGAGAGTCCGTGTTATAATGGTTTCACGGATTCTCAATCAAATCATACGCCTGACAAT +GAAAGAAGAGGAAAAGATTATGAACTTCAAAACCAAAGATGATTTCTATGTAGAAGTATT +TGAATTGATGGAAGTGGTGAACAAACATTCAAGTACAGTGTTTGCTAATCAGAAGAATAA +GATGCTAATTAGTCTACTCCGTGATCGTCTCGTATCAAAGCATAATATCATTGCAGGTAC +TGAATTGAACTCTGTTTTGGCTAAATACGACCAGTACACTCCATGGACAAAAATCGCTGT +AGTAAAATCTTTAAGCAAATCTAAAATCACAACTTATGTTATGTCTCACATGCGCTTGCG +CAATTATCTGAGCATTGATATTGAAGATGAGCGTAACCGTTTGAGCCAAAACCGTGTTGT +CGTTAATACACATCTCAGTATTATAGCTGAAGTTGTACGGACAATGTCCAGTGATATCAA +AGAAATTATTAACATTGGTCGTAAACTATGTCATGCTATTGATAGTCAGAAATTTGAATT +CATCAATGATTTCTTGGGCTGTGATAAAACTGTTTATCCGAAAGTTCGTGTTGGTGTATC +TGGCCGCCCTGAATTTGATATGGCGATCAGAGTAAGCGCTGGTTATCGAGTCAAAAATAC +TCCAGCCCAACGTAATGTAGTTGCTCGTTTGACAACTCAGCTTAAGAAAGCACTGGAACA +AATTCCATTCATCAATACAATTTCTTTGGTTGAACGTGAAAATGATAAGGTGGTTCACTT +CTGTGTCGACCAAGAATTCTTTAAGCCAAAAGAAGTTGCATTGAGTTCTAAAGAACTTCA +TAATTTTGTTCATGATACTGATGTTCAACATATGTACTTGACACCAATCAAACCATTGGT +TATCGAATCGGTTATGACTCAGCAGCTTAATGAGTTGATTGCTAAAATCGATATTGAAAT +TGAAAAAATCGATGCGGATATTGAATCATGGCAGGAACAAATTGCAACTAAACGCGCCGA +AGCAATTAAGCTTCGTAATCGTCGTCAGAAATTGGCATCTGCTGTAGAGGCTTTAAATGA +ATAATCAATTAAAAGAAGATATTGATTTTGGCACGTGGTTTGAACCCTGGACTGATGTGG +ACCTTGAGAAAGGTCCAGAATGGGAAGCTCCTGCTGGATTCGATAAAGGTCTTATAGATT +GGAAAGCAGTTTTAGAAATGGCTGATCGTCGAGAAGCTGCAGCAAAACAAGTTTCGCCTT +GCCCTAAATGTGGTACAATTCAGGTTCAATTGATAGATTGGCGAACTGATACTTTGAAAA +TGAAGTGTCGTCATTGCAAACATAAATTTGAGAAGAAATTAAAATGACTCGTATTAAAGC +AGCTATTATCGCACTGATTCTTATTGTTATTCCATTAACAATGAACCATTTCAACGATTA +TATGACATACCAAAATTATGATGTTAAAGTTGTTAGTGTAGTATCTGGTATGTCACCCGG +AAAGTACTCATCATTAGAGTTCATTGCCATTTACGAACTTGAAGATGGATATCGGTTTGA +CCGACGGATTTCCGCCGCGTCATCAACGCAACTCAGTCCTGGTCAAAATATTACATTAGA +ACTTCGACCGTTTGATGTTAAGCAAACCCCAATGGAAAATACTATCTGGTTCATTGGTGG +TGTATTAGTCAACTGTGCCGGATTTGTTTTTGGCGCAGCATTTGCTTTAATCGCTATTTC +TCGTCGTGTTAATAATTGGATGAACTCATAATGATTGATTTAAAACTTGATACCAACGCA +GTAATGAAGCTGTTTGATACCGAAGAAGCTCGCGTTAATCTTCAGCAAGCAGTTATTAAT +AATGTGGTCAAAGAACTTGTGCTGAAGAATAGTAAGAACAAAGTACGAGAAACTATTCAG +AAAGAAATTTCTTTGGTTGGGGCTCGTCTTCCTGATGTACAGCCGATGGTTAAAGAACAA +CTCAAATACTTCTTTGAATCTAAAGGCTGGAACAAGGTTCAAGGTACTTTTGAATTAGAA +CGTATCATGCGCGAAGAAGCAAACCGTATTGCTTCTACTCAAGTACTTGAAGCAGTCAAT +GCTCAGGTTGATAAAGCAATGAAAGATCTTGAATATAAAATTGATCAAGTGCTTCGAATG +TCAGAAGTACGCATGGAAGAAATGGTCAGTAAACGTCTTATTGATTCATTCGGCTCTGTA +ATTGATAAAGTTATTGCTGAGCGTCTTAAATCTGTATTTCCAGAGGTGGCAAAATGATTG +ATCACAACCCATTTAAAACCACTGGAATCGCTGAATCTGATGAAATGAAAGCTCTTTTCA +AAGAGCTTCGTGAAATTAATGCTAGAATTTGTTTTCAGTATGCAGAAGAAAAAGGAATTG +AGTTTAACGTAGATACAGTTCTTCGCAATATAAATGCTTTAACTGAATTCGATATCGTAA +TGTTCAGGATGTTTGCATATACCGCATTAGCCAATCAGCCTGAAAATACTCTACCAATTG +ATGAACGAATTATCATTGCAGCTAATGAAGCATACAACAAGGTTATTGAAATTGGCTAAA +CGAAAACAATATATGCTTACAGCCGAAGAAGCATTGATGTCAGTTTATCGTGCTTATTTT +GCTGAACATGGCGATATCCCATCAAGCCCGGCGGTTATTAAAGCAGCAATGACTAAAGCG +CATAATGCATTCCATGCTCGAGTCTCTGAAGCAGCTAGGAAAAAATTTGGAAAAAGGTAT +TATAATAGTCCTAATTACTTCGATGAACTAGACCAAATAAAAAGAGAAATGTTATGCTAA +CAATTTACGGATATGATTCTTCAATTCACCGCTGTGTTCACTGTGACAATGCTAAACGAT +TAGCTGAAGTTAAACGTGAAATGTATGAATTTAGGAATGTAATGCCAGAAAAAGGCGTAT +TCGACGATGAAGTTATTGCTGAACTTCTGACTCGTTTAGGTCGTGACACTCAAATCGGTT +TGACAATGCCTCAGATTTTTGATGGCAATGGCGCTCACATCGGTGGTTTTACTGAACTCA +GAGAATATTTCAAATGAAAGAAGGCGTAGACTACATTCATGATTACAGAGGCACAGCTAT +TGGAGTTGGTGATGTAGTTGCGCTTTATTACGGATATGGCGGCCTGGAAACAGGCGAAAT +TATTCAAGTTAAAAATAATCGTGTTAAAGTTGAAGTAACTTATAGCAATGGCTCAAAAGT +TATTTCTAAATGGAAATACGGCGAATGCATGGTGAAATTATGAGTGATATGAAAGAAGTG +GATTTAGTATTCTCAGCCGGCGATCAAATTGATTTAGAGCATCTGCTTGCAGTTGAAATG +ATTCGTCGTGCTTCTGAAGATATTCAGTACGCAATTGATAATCCTTGGGGTGAATTCCGA +ATTCGTCAGGGTAAAGAAATTCACGGTGTTCAATGGACTTATGTTGGTCTGGAACCTGAA +GATTACGAAGAAGTAATGACTGAAGACGGGCGGATTGACTATAAACCTATCGGTCCTTGG +CACTGGGAGTATGGCGGCCCAGATTTTGAAGTTTCATGCTCGTGGTTGGAAAGTAAAGAT +GAAGACTGATTATATCCAAGTATCTGTCAAAGAGTTAGATCGTTTACGTCGGTGCGAAGA +GCTGCTCTGGGAAGTGGAAAGTTCTTTACCATCGGGTTTAGAGAGCTGGATTGATTATGA +AGAAGAACGTGAATTAAGAGGTGAAGAATGACTCCTGAATTAAAAGCAATTTATAGTGAA +ATTATGGAAGACCATGATGGATACTCCGAGAACTACGACTTCGAAAATTCTGATTACTTA +GAAATAGTTGACGAAGAAGAATGGACTCAAAATCATAAGTATCAATATCGTCAAGTGGTT +TATTATTCCAAGAAGCATGATGTTTATGTTGCTGTAAACGAATCTCGTTCAGGTTCTTAT +CACAGTGATTGGTACTACAGCGATCCTGAAGTTTCATTAGTTGAAAAGCAAGAGCGAGTT +GTTACTCGTACAATCACGGAATGGATTACGCTTTAAAGCCTTGGTACACGGCTCGATGGA +AAACCGTTGAGCCAGAGGAAGAAGAACGCTTTCCTGAAGATGATTATAATGAACCTACTA +CAAATGATCTAATTGATATGGAGTTTGGCTATGAGTTTAGTGAATAAGTGCTTCAAAATT +GTTAAAGAAGATAATGACGGCGGCGTTTTCGATATCTATCCAGAACTTACCATTGGAACT +GAATTCAAAGTTCTTTCTGTGGATAAAGAAAATCCAGATGGTATCACTTCTATCTTGATT +AAGAACGGTCCTTACCTTCATATTGGTTCTCGTGAATCTTGGTATTGGTGCTTCTGGGAA +CAAGACACGATGGGTGAAATTGAAGAAATTGAAGAGCTTTCTTCTGATCAGTACAAGATC +CCAGACACAGCTCATTTGTTCAAAGGACGTGATATCGCATCTCAGCTGTTTAAAGTTGCT +GGTGCTGAAAATTGCGATGCCGAAGAACATGATTTAATGCAGGCGGCAGGCGAATATATC +CGTCAGCTTGAAGCTCAATTGAAATTTTCCGATAAGGCTTTCTAATGCAAATTGAATTAA +AATACGTATCATGTCAAGAATCTGGTTGGCATCTGTCGTTTGAATTTGATGATGGATTTG +GAGTCGCTAAATGGTTCCCTTCTAAGCCGACCAAAGCTCAAATCCGATATTATAAGAAAT +GGGCTCGTATATATTGGTTGTATGATTAACAATAAATAGGTTCATCTGATTAAGAGGTGA +ACCTATGTTATTGACTGGCAAATTATACAAAGAACAAAAACAAAAATTTTATGATGCACA +ACATGGCAAGTGCTTAATTTGTAATCGCGAATTAAATCCTGATGTTCAAAGTAATCACCT +TGATCACGACCATGAATTGAATGGACCAAAAGCCGGTAAAGTTCGTGGGTTGCTATGTAA +TCTGTGCAATGCTGCAGAAGGACAAATGAAGCATAAGTTCAACCGCTCTGGTTTAAAAGG +TCAAAACGTTGACTACCTCGAATGGCTTGAGAATTTGCTTGTCTATCTGAAAAATGATTA +TACTAAAAATGACATTCATCCTAATTTCATTGGCGATAAGTCAAAAGAATTCAGTCGACT +TGGTAAACCCGAAATGATAGCTGAAATGAATGCTTACGGGTTTACCTATTCTGAAGATGA +TTCCAAACCAAAGCTTGTTGCTTCATTCAAAAAGCAACTTCGTAAGAGTTTAAAATGACA +ATTGAATCAGAAATCCAGGGGTTAATTAACCGCACCAATAAAGATCTACTTAACGAGAAT +GCTAATAAAGATTCTCGTGTTTTTCCAACTCAACGAGACCTGATGGCGGGGATTGTTTCA +AAACATATTGCTCGTCAAGTTATCTCTCCTACTGTTCTAAATGCTCATGATAAAGGGCTT +ATTCATTTTCATGACTTAGACTATTCTCCAGCTCTTCCATTCACTAACTGCTGTTTAGTT +GATTTGAAGGGAATGCTTAATAACGGATTTAAACTTGGTAATGCTCAAATTGAGACTCCA +AAGTCAATTGGAGTAGCAACCGCTATCATGGCTCAAATTACGGCTCAAGTGGCTTCTCAT +CAATACGGCGGAACTACATTTGCAAACGTAGATGTTGTGCTTGCTCCTTTTGTAGAGAAG +ACTTTCTTTAAGCATTTACGTGATGCAGAAAGATATGGCATTGAGCATGTTAATGACTAT +GTATACGCAATTGAGAAAACAGAAAAAGACGTATACGATGCATTCCAAGCTTATGAATAT +GAAGTCAATACTTTGTTCAGCTCAAATGGCCAAACACCGTTTGTAACAATTACCTTTGGT +ACTGGCACAAACGAATACGAGCGGATGATTCAAAAAGCTATTCTCAATAACAGAATTAAA +GGTCTTGGACGAGACGGAATCACTCCAATCTTTCCTAAACTTGTTATGTTTGTTGAGGAA +GGAATTAACCTTCATCCTACTGACGTTAACTATGATATCAAACAGCTTGCGTTAGAATGC +GCAAGTAAGAGAATGTATCCAGACATTATTAGTTCAAAGAATAACCGTCTGATTACTGGC +TCTTCTGTTCCAGTTTCTCCGATGGGATGCCGTTCATTTCTTAGTGTATGGAAGAACAAG +CACAACGAAGAAATTCTAGATGGACGCAATAATCTCGGTGTAGTGACTATTAACCTTCCA +CGAGTAGCACTTGATTGTATGGTTGATGGCCGTCCAGATTTAACTAAATTCTTTCATATT +CTTGATGATCGTTTACTTATTTGTAAAGAAGCTCTTTTAGCTCGTATCGAATCACTTCGT +GGTGTAACAGCTTCGGTTGCTCCTATTCTTTATCAAGAAGGCGCTTTCGGTGTTCGTCTT +AAGCCAAATGACGAGATTATTGATATCTTCCGAAACGGTCGTTCTTCAATTTCATTAGGG +TACATCGGAATTCATGAAGTTCAAACTATTCTTGGATTTGAAATCGGTTTACTTTTGCTT +AAATGCATGAATGATTATTTGAAAGAATGGACCAAAGAAACTGGATTTGCTTTTAGTCTT +TATTCAACTCCGGCAGAGAATTTGTGCTATCGCTTCTGTAAGATTGATGCTGAAGTTCAT +GGAGATATCAAAGGAGTTACTGACAAAGGTTGGTATACTAATAGTTTCCATGTTTCAGTC +GAAGAAAAGATTTCTCCATTTGAGAAAATAGATCGAGAAGCTCGTTACCATTATATCGCC +AAGGGCGGTCATATCAGTTATGTAGAACTTCCTGATATGAAAAGCAATCTAAAAGGTCTT +GAAGCTGTGTGGGACTATGCTGTTGAGCATCTTGATTACTTTGGTGTAAACATGCCAGTT +GATAAGTGCTTTACTTGTGGCTCAACTCATGAAATGACTCCAACCGAAGATGGTTTCATC +TGTCATGAGTGCGGCGAATCAGACCCTAAAAAGATGAACACAATAAGACGCACATGCGGT +TATCTTGGCAATCCTTCTGAGCGCGGATTTAATCTTGGTAAGAACAAAGAAATAATGCAT +AGGACTAAACACTGTGAGATATGACAGAATTTATCCTTGTGATTTTGTAAATGGCCCTGG +ATGCAGGGTCGTTCTTTTCGTTACAGGATGCTTGCATAAATGTGAAGGATGTTACAATAA +ATCTACTTGGAACCCAAGCAACGGCCAATTGTTCAATGCAAACACCGTAAAAGAATTAGC +TGATTACATTTCGAAGCCTTATATTCAAGGACTTACTCTCACCGGTGGAGATCCTTTATA +CAGATCTAACCGCGAAGATATTGAAGCTCTTGTAAAATGGGTCAAATCGCGGTTTCCAGA +AAAAGACATATGGATGTGGACTGGTTATAAGTTCGAAGATATCAAGGACTTAGAACTGCT +TAATTACGTAGATGTTATTATTGACGGTAAGTATGAAAAAGACTTACCGACTAAAAAACT +ATGGCGTGGTTCTGATAACCAACGTCTGTGGCAAAAACAAAATGAGGTTTGGACACACGA +TGCAATTACATTACCCTTGGATTCATGATGTACAAGTTCACATGAACCGTTATGTTGAAA +AGATGGGCGAAGAATTTCCATCACTTTATTTTCTAGTTTTCTTTGGAATGTACTAATGAA +AGTGGAAATTTATGGAATACCAGAAGAGGTTCATAGATGTCCTGGGTGTGTAAGCGCTCG +TCATCTTCTTGATTCTCTTGGTATTGAATATACTTTCTATTCTGTCATTAATAAAAGCCA +AAATTCTCTCGGTTTTGATTATGACCGAGAGCGTATAACTGAATGTGCAAAAAGAATAGG +ATGCTTTCCTAATCTTATGCTTCGTTATCCAGTTATCTTTATTGATGACAAAAAAGTTCC +TCGTTTAAAACAACACCTTGAAGATCTAGGTTACGATACTGATCTCTAACACGGTTCTAA +GACACTCTCTCCTCTCTTCCATATGTTTATATGGTCATCATTAAGGGAATCTCTCAGGTT +CCCTCATTTCTTTCAAACAACCGTTTACATCCTGTACTCTCTGTGTTATTATACTTCTAT +CAACTACGGAGAAACAAAATGATTATTAAATCTAAAGTATCACACATTGTTATCGATTTC +AACGTTTCAACTGAACGCGGTCGTACTGATCTCATGGTTGAAATTAAGGGACAGGAAGTT +ATCTTCCGAGCTCGTTCAATTCGCTGTGAAATGTCCTTAAATATCGCTAAACATCATCCA +AACGCAATTAATGATTGTGTTAAAACCTTGATCTCTGATATCTACCAAAGCGAAGCTGAT +CTCGTCGTTCGCGAAGTATTTCATACAGTAGGATATGCATAATGTTCAATATGACTTGGG +AAGAAGCCAAACAGGCTATGCGGGAAGGTAAATCTGTTCATCATCGTTACTTCTGTGATG +AATGGTTCCAAATGACAAACGGTCGTATCGTTGATGAAGCTGGCTACTTCATGGATAAAT +GGTACACAGGCGAAGAGTGGCAAAACACTGGATGGGCAATTTATGACTAATTTAGATCTA +TTTCACAAATCAGCACGGTTTTCATACAACGTTCCCGAAGGACGTCTGTTTGTAGATATC +ACATCTGATATGTTTTTGAAATTTATTCATAACAGGCCCGGAAATAATAACTGTATGGAA +GTTCTGGACGTTAAGAATGGATTCAATACTCTTGATGTTCAACACATCATTGCTAAAATT +GGGGTTGAATTATCAGTAGCAGATGCTTATCTGATAAAAGAACAAGTTACTAAATTCCTG +GCATAAGTTTGCTATAGAGATTTTGATATAAGATCTCTATGTCAAAATAACACACTGAGG +AAAATACTATGTCACAGGCTATCAAAAACGTTCTGAATTCTTTCGCATACCCGAAAGTAG +AAGCTATCATGGCAGCAGGTGCTTACGTAACTCCAGAAATTCTGGACAAGTGGGAAGTTG +AACTTCATGGTACGATGAAAGAGAATGGCCAGAAGATCGGTAAAGCTCGCATTCGTGAAT +TGGTGGTAGCTTACATCATCTCTGAATTTGATATCGATGCTTTCGGTATTCCTACTCCGA +AGAAGAAAGAAATCTCTGATACTGCGATTCGCAAGATGAAGAATCAACGTAAGAAAGGGT +TCTCTGATCTCAAAATTGTTAAGGTCGCGAAATGAGTCTGAATATTCAAAACTGTCCGGC +TGATGTGCGGTTTGTAGTACTCAAAATGGAACGATTAGATTTTTGGCACTCAAAAGTTCA +AATAGTGCATTCTTACATCGGTTCCATTGAGCTTAAAACTTATTATGACGCTGGAGTTCT +TCATAATTGTCGAGTGCTCCCACGACCAAAAGGCGGATTTGATACTCATTATCATTGGTC +TGAACCACCGCTCAGCGAAATGTGGGAAGAAGGCCTGAACATGAAAGAACTTGAGGAATA +TCTTGATGCTTAATTTGCCTGAAGACGTTTCAGTTATAATGACCTTTAAAGAATTTGATG +GTAAAATTCATCGAGTTCGTAAAATGACCCAAGGCTTCATCATCGCAAAGGCTTGTGTTG +CATTCAGAGAGTCAAGACGGGATTTTAGAATATTCCCAATGAACTCTAAAACGAAGTACA +CTAAAGTGTCTACTGAATTAGCTTGGAATGAAGGAATGACTCTTTCCGAATTTGAGGAAT +ATCTGAATGACTAAAACAGAAATTGTTGATGATCTTCAATTAGCCGGATATTTTGCTTGT +GTTAAAGATGACCGCATCTGGATTGAAGGAACAAGTGAAAATGGGATTGATTGGGTTATC +GAAGAAGACTTTGATGCTTGGTGGCTATATGAGTTCACCGGAAAAGATTATCATTCAGTA +GATGCTTTTGGTAATATGGATCACGCTTTGAATGGAGCAAACAAATTATGATCAGAGAAA +TTATTTTATCATTTGAATTTGGCAATGTGACGGTAGAAAAATCCGGTAAGGTTGAATTTT +TCGATCCATGGGCAGAAATCTATTGGACTTCAACCGTCAATGAATTCCAAATGGAGTTCG +CTGAAGTTCAATTAAAATCGTTGTATAAAGAATACGATGACTATTCTCTGATGCCGCCGG +ATATTCAGATGACTGATATGCTTTACATCCGTCGTATGGTTACCAAAACATGGGAAATGC +TTAAATGATTTCTGCACTCAAATTTGATAGTCTAAAACTTGAAGTTGCCAATTATGGAAC +TTTCACAGTAACTCCTTTGATGGGAATTACTTTAGACATTGAGTGGTTTGATGAATTTCA +GTGGGTGTCTCATTGCTCTTTGTTGAATGTTAATGGGTATAAGATTGCGTACGAAAGCCT +TGATAAGTTCTGGAAAGAAAATGAGCTTCATCATGCTGCTGATAATATCTCTTTCGATGA +GTTTTGTCGTATCGGTGAAGCCCTGTTCCAGATGTATTTGATTCTTCGTAACAATTAAGT +GCTTTAAACCTATCGCTCGCAACTATGTTATAATTGATATATAAACTTGAAGCGATGAGG +TATTAATGGCGAACTATGTAAACAACAAAGAATTGTATCAAGCAATATGCGATTGGAAGG +AAAAATGTCGCAATGCCCCCGAAGGAGTAATCGTCCGTCAGAATGATATAATTGGTAAAG +CAATTATGCTTATATCTGAGGGTCTGTCAAAACGTTTTAACTTTTCAGGATACACCCAAT +CTTGGAAAGATGAAATGATTGCAGATGGAATTGAGGCCGCAATTAAAGGTCTCAAGAACT +TCGATGAAGAAAAATACAAAAACCCACATGCATACATCACTATGGCTTGTTTCAATGCCT +TCGTTCAGCGTATCAAGAAAGAACGTAAAGAAGTTGCAAAGAAATATAGCTATTTCGTTC +ACAACGTTTATGACGCCCGTGACGATGATATGGTTGCGTTAGTAGATGAAACGTTCATTC +AGGATATCTACGACAAAATGACGCATTATGAAACCTCCACCTACAAACAGCCAGGGTCTG +ATAAAAAGAGCGATATTGTAGATGAAGGACCGACTTTGGATTTTTTATATGAGGCTAAAG +ATTAACCTCTCCGGATTCTTGGAAGAAGTGCCAGACGCAGATGCTATCCCTTATTTGCTT +AAAATGTATATGAGGGAAGTTCTCGAAATGGACATTCACATTGACCCCAAAGATCCACAT +GATACAGAGTTCAAGTGTGATGGCAAGGACTTGAACTACAACTACCACATATCTGATGAT +GACTTTTATATCACATTAGAATACTTTCCAGAATGAGGAATTATGCTACAACCCGGCGAA +GCATTCCAAGCAGAACTTGAATATCAAGATAAGCTGATTGAAGATCCAGACCACCAGAAA +TTAATGGAAGAAGATCGTCTTGCAGCTATCGAAGAAGCACAAGCTCGAGTAGCTGCAACT +GCTAAATCTCAAGCTGATAAAATTATCAAGAAAAATAGTCGCGAACTTGAACGTTTGAAT +AAACATGCTCAACAATCAGTTCTCGATAACAACTTCGCGGCGTATAAGTATGCGATTGAA +AAATCTCGTAAAATCTTACGTCAACCATTCAATGACGAGCTTATCAAAGTCCAATGGGAA +ACCACTCGTCGTCAGATCTGGGAAATTGTAAATGGCTATAAAGCAGGTTAAATTCAAACG +CCTTAAGGTAAATTCAGGTTTCACTCTTTCTGTTGCTGATGGTGTAATGGCTATTAAAGT +ATCTGAAACCCATTACAAAGTCTTGGGTGAAACAGGTCCTATCAATCCAGTAGTTAAAGC +TACTAAAAAGGAATTAGTCTGGGCTGATACAATTATGGTGAAGCCATGGTGGAAGCTGTA +ATATCAAAAGCTGCTGTAGTATCCCGCAACGGAACTGTTTATTCGGCTGAAGCTTTAGAA +AGAGCTATTGATTATGCAAAGATTCATAACGGCAAGACTGAAATGATGAGGCAGTTCAAG +ATGTCATATGATAAAGCTAAGGCTGAATGTACAATTACATACAAGAAAATTTAAGGGCTT +CGGCCCTTGCTCTTTAAGGTAAAATATGGAACAAATTCATGTAGGTGGAACCGACTTTCT +TGTCGCAGTGGTTATTCATCCAGTTGATAATCAAAACGAATTCAAATATGATGTGACAGT +TCGTCATTATCAGTTTGACCGAATTAAGCACGTCGATATCATTGCATTACGCAAAGAATA +CGATAAAGTTGGATATACGGGTGAGCTTAAACTTGTATTAAAACAAGGCTATGAAGAAGA +TTATCCTTGTAGTTCATTTATTAATAATCCGGCTTTCTTTAGTTCAATGACCGAAGAAGA +ACGAGACGAATTTATTGATAGAGTAAATAAGTCTAAAATCCCAGAAATATTACGTAAGAA +ATAAAGGACCTTCGGGTCCTTTTCTGCTTTTTGGAGCATAGAATACAATATCCTTGAGGT +AAAATATGATTACTTACTTAGGTGTACTCTGTTTAATCGTAGGGTTGTACTTGTTTGGCC +GAGCTTGTTGGGTTGGATTCTTTTCTACACCAGATGGGTTCATTTCTATGATTTTAATTC +TTTCAGCTATGACGGCACTTGAAATATGAAAATTTTGCATACAGGTGATTGGCACCTAGG +AGTAAAGGGTGATGACCCTTGGATTCAAAACATTCAGCGAGATGGAATTCGTCAAAAGAT +TGAATATTCTAAAAAGCATGGAATAAAAACTTGGATTCAATATGGAGACATCTTTGATGT +TCGTAAGGCGATTACTCACAAGACAATGGAATTTGCTCGTGAAATAGCTACAATGTTAGA +AGAAGTAGATATCCACATGATTACCGTCGTGGGAAATCACGACATGCACTATAAGCATAA +GATCACTCCCAACGCTTCAATGGAAGTTCTCGGTAAGTATAAGAACATCACAGTCGTTGA +AAAACCAGTTACAATGGATTTCGATGGTGCTTTGATTGACTTAATTCCATGGCTTTGCGA +AGAGAACGTTGCTCAAATAATGAAGCACGTAAAAGAATCTTCTGCTGAGTATTGTGTAGG +CCACTGGGAGCTTAATGGCTTCTATTTCTATAAAGGGTTAAAATCTCATGGTCTCGAACC +AGACTTCCTCAAATCATATAAGCAAGTGTGGTCAGGACATTTCCACACAATCTCCGAAGC +AGCTAATGTCAAATACATTGGAACCCCGTGGACGCTTACAGCGGGTGACGAGAACGACCC +GCGAGGATTCTGGGTTCAAGACACTCGATTACGCACCTTTGATTTCATCCCTAATGAAAC +AACATGGCACAGAAAAATCTTCTACCCAGTAACTGGGCCAATTGATTTCAACGACTATAA +AGACTTATCAGTTCGTGTCGTTATCACAGAAGTTGATAAAGATCTACCGAAGTTTGAAAG +CGAACTTGAAAAAGTAGTCCATGAACTTCGAACTGTTTCAAAAATCGACAACTCTCTTGA +AGTTGAAGATAGTGAAGAAGTTGAAGTAAAAGGCTTATTAGAAATTATGGAAGAATATAT +CAATGCTCTCCCTGATTTATCTGATGACGACAGAACTGCTGTAATTCTGTACGCCAATCA +ACTCTACACTGAGGTTACTAACTCGTGAAACTCCATGAATTTAATTTAGGTGATGGATGG +TTCGGTAATATCGAATACTGGCCAGAAGATGGCGGGTTTAAAGGCATTATGTTTGTTACA +TCTGAATATTCATTGGGCGTATCATGGCAAGAACATTTTGATGTAATGTATGTGTCTGAA +GATTTTATGTTAGAATGTTGCCGCAACTATATTCGTGAGAATAACACATGAAGACGTTTA +AACTTAACCGAGTCAAGTATCAAAATATTATGTCAGTGGGCGGTCAGCCCATTGATATTC +AACTTGACAAGGTTCAAAAAACTCTAATCACCGGTAAGAATGGTGGTGGTAAGAGTACAA +TGCTTGAAGCAATCACGTTTGCTTTATTTGGTAAACCTTTCCGAGATATCAAGAAAGGAC +AATTAGTTAACTCAGTTAACAAGAAGAACTTACTCGTCGAGCTGTGGATGGAATATGATG +GTAAGTCTTTTTATATCAAACGAGGACAGAAACCAAATGTCTTTGAAATTTCAAGAGATG +GTGTCCGACTTGATGAGTCCGCGAGTGTCAAAGACTTTCAGCTCTACTTTGAAGAACTCA +TCCACATGTCATATTCATCATTTAAGCAAATTGTCGTACTTGGAACGGCGGGATATACTC +CGTTCATGGGCTTATCAACACCAGCACGACGAAAACTCGTTGAAGATTTGCTCGAAGTGT +CTACATTGGCTGAAATGGACAAATTGAATAAGTCTCATATCAGAGAGATTAACTCTCAGG +TATCAGTGATTGACGCAAAGAAAGATGGAATCATTCAGCAGATTAAAATCTATGAAGATA +ACGTTGAACGCCAAAGAAAACTTTCAGGTGAAAACGTTGCACGATTCCAGAGTATGTATG +ATGACTTGGTTCGTGAAGCTAAGTCAATAAAGGCTGAAATTGAAGATGCTACGACTAGAT +TGACTTCAATAGTACTAGATGAAGACCCTCGTGAGTCTTTAACGAAGATTGGTCAAGAAT +CTTTCTTGATTAAGTCCAAGATTGACTCATACAACAAAGTGATTTCTCTGTACTCTTCTG +GCGGTGATTGTCCAACGTGTTTCCAACATTTAGACCAGGGTTCCTCTCTGATCACCAAGA +TCACTGATAAGGTCTCTGAATGTAATCATACAGCGGAGCATATTAACAGTCAGAGAGCCG +TTCTGGAGTCACTAGTGCATGAATATGAAGCCAACCTCAACACCCAGCGTTCACTGGCTC +AAGATATTCGTGCTAAGAAGCAAGTGCTGATTGGAACTGTAGATAAAGCCAAAAAAGTTA +AAGCTGCATTAGAAAAAGCTTCACAAGAATTCATTGACCACGCAGATGAAATTAATTCGC +TTAATGAAGAATTGAATAAAATAATTGATACCAAATCCAATATGGTGATGGAAAAATATC +ATCGTGGCATTTTAACTGAAATGCTCAAGGATTCTGGAATAAAAGGCGCAATCATCAACA +AATACATTCCATTGTTCAATAAGCAGATCAATCACTACTTAAAGATAATGGAAGCTGATT +ATGTCTTTACATTGAATGAAGAGTTCGCTGAAACTATCAAGTCCAGAGGACGAGAAGAAT +TCAGTTATGCTTCATTTAGTCAAGGTGAAAAAGCACGTATTGATATCGCTTTGTTATTCA +CATGGCGAGATATTGCTGAGAAAGTTTCTAACGTTAAGATTAACTGTCTTTTCTTAGATG +AAGTTTTCGATTCTGCAACCGATGTGGAAGGTGTAAAATCAATTACATCAATTCTTAATG +GTATGCTAAACTCTAACGTGTTTATTATATCACACCGCGATCATGACCCTCAATCATATG +GACAACATCTTCAAATGAAGAAAGTTGGACGATTTACGGTGATGGAATGAGTAACTTTGT +AAACGGTCAGAATCTTCTGACCGCACCAGAAATAAAGCGGTATGTATTGAAAAATAATTT +TTCAGGACAAGAGCATCTTGCAACTGAAGAACAACTTCGTGCTGCTTTTAAAAATAAGTA +TGATAAAATAACATCCAATCGCGATTCCGCGTGGACAGTATACGAATATTTTGAATAGGA +ATTATTATGAACCTGAATTATGCAATCGAAGTTAAAGACATCCAACCTAAAAACGTACGT +TGTGACTCTAACCCGAATAATCAAAACAAAATCCGTCGAGCATGGGTAACTATTCTAGGT +GAAGAAGGTGCCGAAGCTATTCGCAAACGTTTCCCTGTTGCTGAAGTACGTCATGCTTAT +TATGCGGCGATTGATAATTCAGTCAATGAAAAGTGGATCTCTATTATGCAGAAACATTAC +CAAGACTCTATCAAAGCCGGCGCTAAAATTGTTCTTGATCGTTGTGGTGGTGAGCGTCTG +GAAGATCAATACTGTCTGGATGCTGATGAACAATTAATTTCAGCTGCTCTGATTGTAGCT +GAAGAAGTAGCTATTGAAATCTCTAAATAAGACTTGAAAGGAAAAATAATGAAATTCACT +AAAGAAACTCTCGCAATTCTGAAAAACTTCTCTACCATCAACTCCGGTGTTATGCTTAAG +CCTGGTAAGTTTATTATGACTCGTGCGGTCAATGGTACAACTTACGCAGAAGCTAATATC +GCTGATGAGATTGATTTTGAAGTTGCGATCTACGATCTTCCGAGTTTCCTGGGTATTCTG +GGGCTGGTAAGTGAAGATGCAGAGATCTCTATGGCAGATGACGGTAATATCAAAATTGCC +GATGCTCGTTCAAAAATCTTCTGGCCGGCAGCTGATGCGTCTACAATCGTATTCCCGAGC +AAGCCAATTCCATTCCCAACCGCTTCTGTTATCGTTGATTTCAAAGGCGAAGATCTTCAG +CAGCTGATGCGTGTATCTCGTGGTCTTCAGATCGATACAATTGCTATCGCAAATAAAGAA +GATAAAATCGTTCTGAGCGGTTATAACAAGGTAGAAGATTCTGCTTTGGTTCGTCCGAAA +TATTCTCTGACTCTGGGCGATTATGACGGAACCAATAACTTCAACTTCGTTATCAATATG +GCGAACATGAAGATGCAACCAGCAAGTTACAAACTTCTGCTGTGGGCAGATGGTAAGAAA +ACTGCCGCTAAGTTTGAAGGTGAAGCTGCAAGTTATGTAGTAGCTATGGAAGCAGATTCT +ACTCACGACTTCTAAGTACCATGGGCCTTCGGGCCCAATCGTTTTGAATAAAAATTTATG +AGGAAATTATGTTAAGCATTAATGAAAAAGAGCACATCCTAGAACAAAAATATCGCCCTT +CAACTATTGAAGAGTGTATCCTTCCAGCTTTCGATCGAGAAGTATTCAATACTATCGTTA +AGAAAGGAAAAATTCCTCATCTTATTCTTCACTCTCCATCACCAGGCACCGGTAAGACAA +CAGTAGCAAAAGCATTATGTAACGATGTCAATGCTGATATGATGTTTGTCAACGGTTCAG +ACTGTAAGATTGACTTTGTCCGTGGGCCATTAACTAACTTTGCGTCTGCTGCTTCAATTG +AAGGCCGTCAGAAAGTTATTGTAATTGACGAATTTGACCGTTCAGGTCTTGCAGAATCAC +AACGTCATATGCGTTCGTTTATGGAAGCATATAGTTCAAACTGCTCAATCATTATCACTG +CGAACAACCTCGATGGAATTATCAAACCTCTTCAAGACCGCTGCCGAGTAATTGAATTTG +GTAAGCCTACTCCTGAAGATGAAGCACCGATGATGAAAGAAATGATTCGTCGTCTGATTG +CGATTTGTAAAAATGAAAATATCGAAATCGCTGATCTTAAAGTTGTAGCAGCTCTCGTTA +AGAAGAACTTTCCACGTTTCCGCAATACAATCGGTCAACTGGATATGTACTCTTCGAAAG +GGGTACTTGACGCTGGTATTCTGAGCGTAGTGACAAAAGAATCTGGTTCAATCACCGATG +TTTTAGATGCTTTAAAAAATCGTGATGTGAAACAACTTCGTGCATTGGCTCCAAAATATT +GCACCGATTATTCTTGGTTCGTTGGCAAACTTACATCAGAACTTTATACTATGCTCAAAG +GCCCTGGTATCATGTCGATGTATGAAATCGTCGGTGAAAATAACCAGTACAAAGGTGTAG +CATCTAACGCAGAACTTCACGTTATGTACATGTTCTTACGTTTGACATCTGAACTTAAAG +ATGAGTGGAAATAATGAGCTTATTCGATGATGACGTTCAACTAAATGAGCACCAAGTAGC +TTGGTATTCAAAAGACTGGACTGAAGTCCAGAAAGTATCTGATCAATTCAAGCAGACTGC +TGAGAACGAATTCTTCGAAATCATTGGGGCAATTAATGAGAAGAAACCTTGCTCCATAGC +TCAAAAGAATTATTCAAGGCATATGGTTGAAAATGCTCTGTCTCAACATCCAGAGTGCAT +GCCGGCAGTTTACGTTATGAACCTCGTTGGTTCCGAGCTTTCAGATGAAGACCACTTCAA +TTATATGATGGCTGCTATTCCTCAAGGTCGTCGTTATGGTAAGTGGGCTAAGTTAATCGA +GGATACCGGAGAGTTACTTGTACTCCGGGTATTAATGAAATATTATACGATTAACTTGAA +TGACGCTCAGGTTTATAGAGATACCCTGGTGTCAAAAGGGAAACTATCCTTGGTACTGAA +AGAAGCTAAGGCTTTGGTTACTGACGAGTTCCTGAAGGAATTGACGAAAAACGTCAAAGA +ACAAAAACAATTCAAAAAACAAGCATTGGAATGGTAAACATGATTGAAATTACTTTGAAA +CAACCTGAAGACTTCCTGAAAGTAAAAGAAACCTTAACTCGTATGGGAATTGCTAACAAC +AAAGATAAGATACTATATCAAAGTTGTCATATTCTTCAGAAACAAGGTCGTTACTACATC +GTACACTTCAAGGAAATGCTTAAACTTGATGGTCGTCCGGTAGTGATTGACGAGGAAGAT +GAAGTACGTCGTGATTCAATTGCTCAACTGCTTGAAGATTGGGGTTTAGTTGATATTGCT +CCAGGACAACGTTCTTATATGTTTGAGATGGCCAATAATTTCCGTGTTATCTCTTTCAAA +CAGAAAGACGAATGGACTCTTAAATCCAAGTACACAATAGGTAATTAATATGGACGATAT +CAATTACAGAAAACTTCGAATCGAGTATGGTCTGAGACAATGGGAGACTATATTCGATCT +ATGCGAAGTCGCTCAAGAAGAATTCCAACGTGAACTCGCCATTCGCAATGGCGCTCAACC +GCGTGATGTTCTCCAAGTCTTTATCAGAACTGAATGCGAAGATGATGACACAGTAGATTA +CAAAATCACTCGTAAAACTATTGAAATTTAAGTAAGGGCCTTCGGGCCCTTCATGCTATT +CTCTCGGATGATAAAATATCTACAACAAAGAGACTAATAACTCGGTCTATAAACTAAGGA +AACTCATGCAATTCTATATTTCAATTGAAACAATCGGTAATGACATTGTTGAACGTTATA +TTGACAATGGTGTTGAAAAAACTCGTCGTGTTGAATACGCTCCGACAATGTTCCGTCACT +GCACTCATAAGACTAAGTTCGTTGACATCTATGGCAAAAACTGTGAACCTCAAAAATTCG +CAAATATGAAAGATGCTCGCGACTGGATTAAACGTATGGAAGACGTCGGTCTTGAAGCAA +TGGGTATGGATGATTTCAAACTGGCTTATTTGTCAGACACTTATGGTTCTGAAATTGTTT +ATGATCGCAAATTCGTTCGTGTTGCGAACTGCGACATCGAAGTAACAGGTGATAAATTCC +CAGACCCAATGAAGGCTGAATATGAAATCGATGCCATTACTCACTATGACTCAATTGACG +ATAAATTCTATGTGTTTGACTTGTTGAATTCATTGTATGGGTCAGTTTCTGAATGGGACA +TTAAGTTAGCTGCTAAGTTAGATTCTAAAGGCGGTGATGAAGTTCCACAGGATATTCTTG +ATCGTGTAGTTTATATGCCGTTTGACACCGAAGCTGAACTACTGATGGAATACATCAATC +TTTGGGAACAGAAACGTCCAGCTATTTTCACAGGTTGGAACATCGAAGGCTTTGATATTC +CATACATCATGAATAGAGTCAAGAATGTTTTGGGTGAACGTTCAATGAAACGATTCTCTC +CAATCAACCGAGTCAAATCGAAAGTTATCACTAACATGTACGGCGATAAAGAAGTATTCT +CGATTGATGGCGTAACAATTCTCGATTATTTAGATTTGTATAAAAAGTACTCATTCACTA +ACCAGCCGTCTTATACTCTGGATTATGTCGCGAAGTATGAGACTAAAAAAGGCAAGCTTC +CATATGACGGACCGATTAATAAACTTCGTGAAACTAACCATCAACGTTATATTAGCTATA +ACATTATGGACGTTGAGTCTGTCGGTGGTATTGACCGCGTTCGTGGTTTCATTGATCTGG +CACTTAGTATGTCTTATTATGCTAAAATGCCGTTCGGTGGGGTTATGTCTCCTATCAAGA +CTTGGGATGCGATCATCTTCAACAGTCTTAAAGAGCAAAATAAAGTAATTCCGCAAGGTC +GTTCTCATGTTAAGCAATCTTTCCCAGGTGCTTATGTATTTGAGCCATTAGCATGCGCTC +GTAAGTACATTATGAGTTTTGACTTAACATCTCTGTATCCAAGTATTATTCGTCAGGTGA +ATATTTCTCCTGAAACGATTGTTGGTCAATTTAAACTTCATCCAATTCATGAGTACATCG +CCGGAACAGCACCGCGTCCATCTGATGAATACTCATGTTCACCTAATGGTTGGATGTATG +ATAAGAACAAAGAAGGCGTAATCCCAACCGAAATCGCGAAGGTATTCTTCCAACGTAAAG +ATTGGAAAAAGAAAATGTTCGCGGAAGAAATGAACGCAGAAGATATCAAGAAAGCTATCG +CTGCTGGGGTGTTTGGTTCAGGAAGCTGTGAAGAAAAACGATATGTTCGTTTCACTGACG +AGGAACGTGCTGCACTGAGTAGTTATTCAAAACTTGTTCTTGAAGCAATGCTTGCTCGTT +GTGAAGCCGCCGCGATTTTGGCTGATACGAACCAGTTGAACCGTAAGATTTTAATCAACA +GTCTTTATGGTGCTTTGGGGAATATCTACTTCCGTTATTACGATCTTCGTAACGCAACTG +CAATCACTCTGTTTGGTCAGGTTGGTATTCAATGGATTGCTCGTAAAGTTAATGAATATT +TGAACAGGGTTTGTGGTACTACTGGTCATGATTTCATTGCAGCCGGCGATACAGACTCAA +TTTATGTTTCTGTCGATAAAGTTATAGAGAAGGTTGGTTTAGATCGTTTCAAAACTACCG +ATGAAGTGGTTGAATTTATGAACCAATTCGGTAAGAAGAAAATGGAACCGATGATCGATA +AAGCTTATCGTGAACTTTGTGAATATATGAACAACAAAGAACACCTTATGCATATGGACC +GTGAAGCAATCTCTTGTCCTCCATTAGGTTCTAAAGGTTGTGGTGGATTCTGGAAAGCTA +AGAAGCGTTATGCATTGAACGTATATGACATGGAAGATAAGCGATTCGCTGAACCACACC +TCAAAATTATGGGTATGGAAACACAGCAATCAAGTACTCCAAAGGCGGTTCAGGCTGCAT +TGGAAGAATCAATTCGTCGTATGCTTCAGGAAGGCGAAGAATCCGTACAGGAATATTTCA +AAACATTTGAAAAAGAATATCGTCAACTTGACTATAAAGTGATTGCCGAAGTTAAGACTT +GTAACGATATTTCTAAATATGACGATAACGGTTGGCCAGGTTTCAAATGCCCGTTCCACG +TTCGTGGTGCTCTGACTTACAATCGAGCAACTGCCGGGTTCAGTGCTACTCCGATTCTCG +AGGGTAACAAGGTGATGGTAGTTCCATTGCGTGAAGGTAACCCATTCGGTGATAAATGTA +TCGCGTGGCCGTCAGGTACTGAACTGCCGAAAGAAATTCGTCAAGATGTTTTGGCATGGC +TTGACTACAGTGCTCTGTTCCAGAAATCTTTTGTTAAACCTCTTACGGGTATGTGTGAGT +CCGCAGGTATGGACTACGAAGAGAAAGCGTCATTAGAAGATATGTTTGACTTCTAACTGT +TTACATCCACATGGAAGTGGATTATAATGTTCTCACATTAACCAAACGGATAACAAAAAT +GACTCATCGCGAAATTCATGCTCTTCGAGCTAAACCCGGAAAAGCTGCCGAAAAGAAAAT +CCTGATGAAGGATTATGAGTTGATGAAATCTGTATTATGGAACTTAGTAATTCTATCATG +CGGGAATGAAAATTCCACTTATAACGGTCTTTACCCTAACGGTGTAGGTGCTGCTTTAAA +AGCTCATCGTGAAAACATTAAAACTCTTGAAGATAAAATAAAAGATATCTGTCATTAATG +AATTGGGCCTTCGGGCCCTAAACGGAGAAACAACATGAAATTGAAAATTGCTTTAATCGC +TGCTGCGCTGGCACTAACTGGTTGTCAGGCTTACCATGGACCTATCGTTGGTGAACATCA +AGTTGGCCAAATTTCTTATAAAGGCGGAACTGGACTTGTCTATACTCGAGCAACTCAACA +AGTTTCGCAAGAATCTTTGAGCGCAGGTGACGAAATGGAAGAACGTCGTCGCAACAGTCC +ATTAAGTAAAGCTATCAATGAATCAGTAGCACGAGGTGATGCGTTTCAAAAAGAGCAAGA +TCGCCGTGAATCTGCGCAAAATAAGTGTGAATTCATTGTTGAAGCTCATGAAGCTGTATT +GACCGAAAACGCTATCAAAACTATGAGTGACAAAGACCGCCTGGCTTTGATTCACTATCG +TTCTTCCGGTAAAGTTCGTGCATTCAATAAGTGCATGCAAAACGCTAACAAATAATTTGA +TATAATAAATCAACTGAGGATATTGTAATGGAAATCATCGCAGGTATTATTTCACTGGTA +GTTTACATGATTCCGGCTATTATCGCGTTTATTCGTGGTCACGGTTCAAAATGGGCTATC +ACCGTAGTTAACTTTCTGTTTGGCTGGACATTCATTGGTTGGATTTGGGCATTTATCTGG +TCTCTGACTGGAAATAAGCCTGCTCAGCAACAGGTTATCATTATTAAAGAGGCAAAATGA +TTGTAACACCTTTGACAGTAGAAGATATTCGTGATGAACTTTGCTATGCGCTGGAAAGTG +AACAGTTTGTAATTGACAAAACTGGTGCAAAGACAATTGAAATTATTGGCGCATCATTTA +TTGCAGATGAAGAATTAATCTTTGGCGCAGTGAATAATGAATATGTTGAACGCGAACTTG +AGTGGTACAAATCTCAATCTTTGTTCGTGAAAGACATTCCTGGCGGTACTCCATCTATTT +GGGAACAAGTTTCATCCAAGAACGGTGAGATTAACTCAAACTACGGCTGGGCAATTTGGT +CCGACGAAAACTGTTCGCAATATAATATGTGTCTTGGCGAGCTTGGAAATAATCCAGATA +CTCGTCGTGCTATTATGATTTACACTCGTCCATCAATGCAGTTTGATTATAACCGTGATG +GTATGAGCGATTTTATGTGCACTAACACTGTGCAATATTTGATTCGTAATAAGCGAGTTC +ATGCTATTGTTTCAATGAGAAGCAATGATGTAGTCTTTGGATTCCGCAATGATTATGCAT +GGCAAAAATATGTTCTTGATAAATTGGTGTCTGATTTAAACGAGGGTGATTCTTCTCGTG +AATATAAAGCTGGTGATATTATCTGGAACGCTGGGTCATTACACGTATACGAGCGTCACT +TCTATTTGGTTGATCATTACCTGAAAACTGGCAAGTCTCACGTGTTGAAGAAAGATTATA +AAGGTGAATGGAAATGATTCAGTTTGTAATTCCAAGTTATAATCGTGCTGGGGCAGTTAC +TGCCCTGGACATGTTCCCTACTGGTTATGTTGCTCATTTAGTAGTTCGTGAGTCTCAGAA +AGAAGAATATGAGACTCACTATGGTGCAATTGCTAAAATTGTAACTATTCCTGATGATGT +TAATGGAATCGCTGGTACTCGACGGTTGATCACCGAAATGTATCAAGGCATGCGTATTTG +GATGCTGGACGATGATACAACAATTCATACAACAGAAACTCGTGAACGAGACAATCGCCG +AATTCTTCATGACGTCGGTATGACTTGGGACGAATTTAATAAGCTTTGCCAGTATGTTGA +AGCTGCGATGGATTGTGGATTTTATCATGGTCATTCTCGTCTTCCAATCTTCAAAATCTC +TGGTGATGATGCAAACTTTCGTGAGAACTCTTATGGATTCACGAACACGTTCTACGACTT +AAGCAAACTTTCTGCTGATGACATTGGATATGGTATAGTAGACCTATCCGAAGATACATA +CGCATTCCTTAAACTCATTAATATGGGTTATCCTCATCTGGCGATTTTCAAATATCTCGT +CAAATCGGGTAAAGGTCAAGCTCCAGGTGGTGTATCGTCTATGCGTAATGCCGCTAAACA +AAACCGAGCATTAGAAAAAATCCATGCAGACTTCCCTACGCAAGCTCGTTGGAAATCAGA +AGGTGACCCAACCAAAACTATGTTTGGTACTGATGAACCTTTGAAAGTACTTCGTATGTG +TGTTGCTAAAAAGCAGAAGTCTGACGCATTCCATAAATTTAGTGAGATTGAACCTAATCT +ATGAAAATTGCTATCATCAACATGGGCAACAACATTCAGGGGTTTAAAACAACCCCTGCT +TCTGAAACCATTTATCTGTCTGAGTGCTTGAAAGATATGGGTCTTGATGTAGACCTAATT +TCAATGAAGAACACTCAATATGGAATTTCTTTTGACTCTGTAGAAGACCCGAACGTATAT +GACCGTCTGTTGGTTGTTAACGCTGCTTTGAACTTTTATGGTGGCGAAGAAAACGCAATG +AACAAAGCGGCTTATATGTTCATGAACAAATATAAGTCAAAGATCTATTATCTCTTCACA +GATATTCGTTTGCCATTTGAACAAGCATGGCGTCGTATGTCAAAGAAAAAATGGTCCAGC +AAGTACAAAGAAGAACAATTCATTGTAACTGCTCCTATGCGTATTGTATCGCAAGGTCGA +GATCTTGAACAAGCAAAACGTATTCACTCTGAACGTCTGGTGGGATGTCAATTCGGTAAA +CTAGAGTTCACTCACTTCGCTTTAGACCGTCATAAGATGTATCACAGCGTCTTTAAAATT +GCACCAGATGGAATTAAAATGCGTGACCTGATTTACGGCGGAACATTCCGTTCTGGCAAC +CGTGAAGCTAAGATGGTTGAATATCTGTTTGATACTGGACTTGATGTAGAATTCTTTGGT +TCAGTTAAAGCTGAACAATTTAAGAATCCAGAATTCCCATGGACTATTCCTCCAGTATTT +CCTGGTAAGGTAGATTCTCGTGAAATGGTTCAACGTAACTCTACTGCTTATGCGACTATC +GTATTAGGCGATAAGACTTACGATAATAACCAGATCACTCCTCGTGTATGGGAAGCACTA +GCATCAACTGCAATTGCATTCTTTGACCATACATTTGACCCTGACATGAATATCATGGAT +GGGAACGAGTTCTTTTACGTTAAAAACCGTCAAGAACTAGTTGCTAAAATTAATCGCATC +AAAGAAGACGAAGATTTCCGAGTTCAAATGCTCGCATATCAGCACTCTATTCTCCAGAAA +TATCTGGATGAAAAGCCACAATGGCAAGCTGAATTTAAGAAAGCTATCGATCTGTAATAC +AAAGAGGGTTTAAAATTTTAATTAGCTTTAAACCCTCGGTTATATAATTAATCATCCTTT +AAACCAGTGAGAAAAATATAATGGAGATCAATGGAAAATATTGAATGTCTGATTTAAAAT +CTCGTCTGATTAAAGCATCCACTTCTAAAATGACCGCGGAACTGACTAAGTCTAAATTCT +TCAATGAAAAAGACGTAATCCGTACTAAAATCCCGATGCTGAATATCGCAATCAGTGGGG +CATTAGATGGTGGTATGCAGTCTGGTTTGACAATCTTCGCTGGTCCTTCAAAACACTTCA +AATCAAATATGTCTCTGACTATGGTCAGTGCTTATATGACGAAGCACCCAGATGCAATTT +GTCTGTTCTACGATTCTGAATTCGGTATCACTCCGGCTTATCTGAAGTCTATGGGTGTAG +ATCCTGACCGTGTAATTCATACACCAGTTCAGTCTGTTGAACAACTTAAAATTGACATGG +TGAACCAGCTCGAAGCTATTGAACGTGGTGAGAAAGTTATTGTCTTTATCGACTCTATCG +GGAACTTGGCTTCCAAGAAAGAGACCGAAGATGCTTTGAACGAAAAATCCGTCGCGGATA +TGACTCGTGCTAAAGCATTGAAATCTCTGTTCCGTATCGTTACTCCATATTTCAGTATCA +AAAATATCCCATGCGTAGCGGTTAACCACACAATCGAAACTATTGAGATGTTTAGTAAAA +CTGTAATGACTGGTGGTACTGGTCCAATGTATTCAGCAGATACCGTGTTCATCATTGGTA +AGCGTCAAATCAAAGATGGTACAGATCTTCAAGGTTATCAGTTCGTTCTGAACGCTGAGA +AATCTCGTACTGTCAAAGAGAAGAGTAAGTTCTTCATTGATGTTAAATTTGATGGTGGTA +TTGATCCATACTCTGGTCTGTTGGATATGGCTCTGGAACTTGGATTTGTAGTTAAACCTA +AGAATGGTTGGTATGCTCGTGAGTATCTTGATATCGAAACCGGTGAAATGGTTCGTGAAG +AGAAATCATGGCGCGCTGCCGCTACATCTTGTGTAGATTTTTGGGGTCCGCTGTTTAAGC +ATCAACCGTTCCGTGACGCAATCAAGCGTAAGTATCAACTCGGTGCTATTGATAGTAACG +CAGTAGTTGATGCTGAAGTTGATGAACTGATCAATTCGAAGACTGAAGTCTTTAAAGCAC +CAGAAGGCTCCTCTGCTCCTTCAGCTGCTCAGTTGGAAGATGATCTGGACAATTTTGATG +ATGTAATGGGGCATCCAACAGAAGGTTTATAATGAGTGATTACGATTTAAGTGATCTTGA +CCTTGAAATCGTAGAAGATACCCCCTCTCAGGAGGGGGAATTCGAAAGGATGGAAAGGAT +ATACCAGCGTTCCGCTGAGATTGTTAAGAAGGCTATGGAGAATGTCATCCAGGAAATCCT +GATAACACTAGAGGATGGTTCAAACCATATCGTATATGTTACCTCATTAACTGTTGTTGA +AGGCGGTGGAGTGTCACTAGAGTTCTCTACGTTATCAGAAGATCGTAAAGCCGAACTAAC +ACCACACGTTGAAAAATGTATTAAAATGCAGATAGAAAACTCTTTTAAAGAGAAGAAGAA +AAACCGTTTCAAATTATTTTAATGAGGCTTCAAGTGGTAGAAACAATATTATCGCATTTG +ATTTTTAACCAAGGCTACTTCGCAAAGGTGTGGCCTTATATGGACTCTGAGTATTTCGAG +CATGGTCCAGCTAAAAACGTATTCACCTTACTACAAAAACATATCAATGAATATTCAAGT +GTTCCATCGTTGAATGCTTTGAATATTGCATTAAGTAATTCTTCGCTGGGTGAATCGGAA +GCTGAAGGCGCACAAAAGCTTTTAGACAAATTAGCCGATACTCCTGAAGACTTGTCATGG +TTAGTTAAAGAGACTGAAAAATATGTCCAGTCTCACGCGATGTACAATGCCACATCAAAA +ATAATTGAAATTCAAACTAACGCTGAATTACCTCCAGAGAAACGTAACAAGAAGCTTCCT +GATATCGGTGCTATTCCAGATATCATGAGACAAGCTCTTTCCATCAGCTTTGACTCTTAT +ATTGGTCATGATTGGATGGATGATTATGAAGCTCGTTGGTTAGCATATCAAAATAAAGCT +CGTAAAGTTCCATTCTTGATGAATATCCTGAACCGAATCACGAAGGGCGGCGCAGAAACA +GGCACACTGAATATTTTGATGGCTGGTGTAAACGTCGGTAAGTCGTTAGGATTGTGTTCA +TTAGCAGCCGATTATCTTCAGACTGGTAAGAACGTTCTTTATATCTCTATGGAGATGGCT +GAAGAAGTATGTGCTAAGCGTATTGATGCTAACTTGCTTGATGTGTCTTTGGATGACATC +GATGATGGTAATGTATCTTATGCTGAATACAAGGGTAAGATGGAAAAATGGCGACAAAAG +AATACTCTCGGTCGTCTGGTCATCAAACAATATCCTACTGGTGGTGCACACGCAAATACA +TTCCGCGCACTTCTGAATGAATTGAAACTCAAGAAGAATTTTGTGCCAGATGTCATTATG +ATTGACTACCTCGGTATCTGTGCTTCATGTCGTATTCGTCAATACACTGAAAATAGTTAC +ACATTAGTTAAAGCGATCGCAGAAGAACTTCGTGCACTTGCGGTTGAAACTGAAACTGTA +GTTTGGTCTGCTGCTCAGGTTGGCCGTGGTGCTTGGGATGCTTCTGATATGAACATGAGT +GATATTGCAGAATCGGCGGGTCTACCAGCAACAGCAGACTTTATGCTAGCAGTGATAGAG +ACAGAAGAACTTGCACAGATGGAACAACAACTCATCAAGCAAATTAAGTCTCGTTATGGT +GACAAAAATAAGAACAATAAGTTTTCTGTTGGTGTTAAGAAAGGTAATCAACGCTGGGTT +GAAATCGCACAAGAAGGCGGTGATAAACCTACACCAGTAAGCGAAACATCTGGTGGTCAG +CAGCGCGTAGCAGAGCAAAATCGTATAGCTAAGGTTGAAGTATCTCGAGCCAAACTCGAC +GCATTAGCCGAAGATATGAAATTCTAACCGTTTACATACACATGGAAGTGTGTTACTATG +ATCTTATACAAACAAGAGGAAAACAGCATGAAAAAGATTATCTTAGCAGCAATTTTATCT +CTTTCAGCTTGCGCTGGAACCCCAGCAATGGCAGCAGATGGGTATTCAAGCATTCCATGT +ATTAAGTTCATTGAAGGCGACTGGAAAGATCAAAAGCCTCGCGTCATTAAAGACTTACTA +GCTGTTGCAGATAAAAATCAGGCAATGCTTGTAGAAGATCTCGATGACAATGACCTGGTA +GTTGCTGGTACTAATCTGTACTGCGAAAATATTCCGGCTAAAGATGTTCTGACTTGGGTG +GGACTGTAATGAATATCATGTTAATGTATCAACCAGCATCAGAAATTGTGCGAGGTATGA +AAGTTGAACACCGAGTTCCTGCATTATGGGAAACATTCCTAGATACTGGTTCTAAATTAA +ACTTACCATTTGGCGAAGTAACAATTTTCCAGACCGGTACTAAACCAACTAAACGCCAGC +TCCGTAAGTTCAAACGTATTCATCGTGTCAATATGGTTAAAAGCATAGCTGAGCATGAAT +TTAATAATTCTTGGGAAGGCATCCATTGTGATGTTATGGGGCTGTAATGCATATTTTTAT +TCTGATTCTGGCTCTGACAACCGGCGATTCCGGTGGCGCTGCAATTGATAAGGTTGAAAT +AAAATCTCAAGATTATGCTGAAGCCAGTAAGATGTGCGACCGAGCGGGTGAAAGTTATCG +AAAAGACGTTAAGTCATTCAACGTTTATCCGGAATATACTTGCATCTACGCTGGTGTTAA +ATAGACCAGGAGGTGTTTATGAGCACTATTAAAGGGGCGATGGACGCAGTATATGCGTAC +AAATTTATTCGCCTGATGTCTAAGCCTTTCACTGAGTGGAAAGCATATGAAGCAAAAATA +ATTGACGAAAAGGGAACTGTGCTAAAGCGTCCTAGCACTCCAGAAGAGAAAGTGGCTTAC +TCTGCCTTTCATGCGAGTGTTCGGTCAATTAAGCGTATGATGTCTACAGTTCCAGGATTA +AACGGCGTCGCGTCGATGATGTCGGCTTGGAGTACAGTAGCATCTCGGTACAATATAACA +GAATCCGAACAAAAAGAGATATTTGAGGCTCTTCCATTGTTCGAGGACATGGTAGCTGGT +GATTCCGGTGGAAGTGTCCAGAATATCGCCTCTGGTACCACGACCGGAGCAATCACAAAT +AAAGGTCCTGAGCAAATCCCCGCAAAACGTAAGCGAATCAAAATCAATCCTAACAAGTTG +TGATAAAATGGCCTTAGAAATAAGGCCAAGGAGAATAATATGTCATGGGTTCACAATGAG +TTCGCATTCCGCGCACTATCTCATCTTCCAAAATTCACTCAAGTAAATAACGCAGCACAA +TTTAAACTTCGATGTCGTTGTCCGGTGTGTGGAGACTCACAAAAGGACGAAAACAAAGCA +CGATTCTGGGCGTACGGTCTACCTGATGATGTGCTATTAAAGTGCTATAACTGTGACTAT +GTAAAGCCGATTGGGATTTATTTGAAAGAGTATGAACCAGATCTTTATCGTGAATTCATT +CTTGAGTTACGAAAAGATAAAATGGTTCAACGTGAAAAACCTGTTGAAAAACCTAAACCT +GTTGTGGAAGAGACTAAAGGAATTAAGCTCATTCACTGTGAACGTCTCGATAAAATGGAC +CCTAATCATCCAATAGTTCGATACATCGCCGGACGTAAAATTCCAAAAGATAAATGGAAT +AGACTTTATTTTACGTTGAAATGGCCAGCATTGGTGAACTCAGTAAATCCAGACACTTAT +AAGACGGAGCGAGATGAGCCTCGATTGGTTATTCCAATATTCAATTCAGAAGGAATAATT +GAATCATTTCAAGGCCGTGCTTTAAGAAAAGATGCTCCTCAAAAGTACATCACTATTAAG +TCCAACGAACATGCGACCAAAATATATGGGACAGACACTGCAAAGCCAGGTAAAAATGTT +TATGTACTTGAAGGCCCAATAGACTCATTGTTCTTAGATAATGCTATTGCGATTACTGGC +GGTGCAATGGATTTGAGTTTAGTTCCATTCAAAGAAGATAGAGTCTGGATAATGGACCAT +GAACCACGAAAAGATGACACAATTAAGCGAATGAAACGTTTGATTGATGCAGGTGAAAGG +GTTGTCTTTTGGGATAAAGCGCCATGGGAAAAGAAAGATATAAATGATATGATTCAAAAG +GAAGGGGCTCGTATTGAAGATATCCAGAGCTATATCGAAAACAATATAGCGTCTGGTTTA +ATGGCACAACTCAGACTCAAGAAGTATAGTAAGATCGGTGTTTAAATTCCAACCATTATA +TGAGAAATAACTTGTTCCAAAGGAACAGGTGGAAGTGTTATCCCGTATGCTAACGCAAAA +GGTATGATAATATAATTCCAAGTTGCTACAGCAGCAGAAATTGCTCCGACCAGAATAATC +TTACCTTTCTGGTCTTTTATTTGAGTCTTTAGAGATTTCTTTTCAGTTTCTTCAGACATA +TATCCTCCTAAGGCTATTTAATATGAATCTGCAACAACACACGTTCTTAAAGCTTGGAGA +AGAGTGCAACGAAGTCGCGATGCTCTGTTCCAAGATAATGCAATTCGGTTTAGACTCCGA +GTATCAAGGAGTTACTAATCGGCAACGATTGCAAAATGAATTAAATGATATAATGGCTTC +TATAGAATATATCAGACAATACTCTGATTTCAAATTTGAATCTTCTGAATATGAAATTCA +CAAAAAGATTGATAAAATGAACCACTTCCGAGATATCTCAGAAGAACTCGGTCTTGTAAC +TAATTAAGAATTGAAAGGAAAAATAATGGCACACTTTAACGAATGTAGTCAACTGATTGA +AGGCGCTGATAAAGCACAAGCAGCTTATTACGATACCCTGGTATCTCAGCACAAAGACCC +ACTACAAGTAATGCTCGATATGCAGAAATCTCTGCAAGTTCGTCTGGCAAATGATAAGCC +TGAGCATAATCGTCATCCTGATTCGCTGGAAACAGCTGGCGAAGTCTTGGCTTGGTTACG +TGCAAACGATGATTACATCACTGACGAAACGCGTGAACTGTATACGGCTCTTGGCGGTAT +GTCCAATGGTGAAAAAGCAGCATCCGCAGTATGGAAACCGTGGAAGGCTCAACATGCTGA +AATGCAGGCTCGTAAGATTTCTGAACTGTCTCCTGAAGACCAGCTCGAAATCAAATTCGA +ACTCATCGACCAGCTTCATTTCTTCCTGAATAAGTTCATGGCTCTGGGTATGGATGCTGA +AGAAATCTTCAAGCTGTATTATCTGAAAAATGCTGAGAACTTTGCTCGTCAAGACCGAGG +CTATTAATGAATCATACAATATATGAAAAAGATGGGGTGTTTTTGCTCCATTCCAGATAG +CGAAGATATTTCACTGAACAATATTCTTATTTCTCATGGATTTGTTCCAACGTCTGATCG +TCAAATTATTGCATGCGAAAACAAAGAAGAACTGAACGAATTCTTAGCCTATTTTTATGG +CGAATATTAAAAGGTGAATTTGATGCAATCGAACTTTGATGTCTATCAATGGAACAAAGA +TGGTCGCCCAACTGAGCCGGCCTATGGTTCTGATGCATATGAACTTCGTCGTCACTGGTT +GTTTGCGGTAAATAATTACTTCATGCTCAATGGCAATCCAACTCGACTACATATCAGAGG +CGGTGGGTATGTAACCGTTGATTCAAAATATTATCGCGGTAAAGACTGGGAGTGGTATCA +ATGATTTCATACGCTGATATTGAAAATGCTTTAACCAATTATTGGGGATTGACCCACCTA +ACTCTCGAAGACCGATGTGAGTGTCTTCGAGAACTGATTGCTGATTCTGAAGACATTATT +GCAGATATAGCAGAAGCTCTTAACAACTATTGATATAAATACTCCTGTAATCAACAAAGG +AGAGTTTATGAGCTATGTAAATATCAAAACCTTCGAGCATACCAATGCCGATGGAGTAGT +TGCCGCGATGGAAGTTTCTGTAGCATTTAAGTTGTACAGTGACGTTCATCGTATTGCTCG +TTCTCATTATCAAATTTTCCCTTCAGAGAAAGCTGCTTACTCTACTGTATTTGAAGAGAA +TCAACGAGACGCATGGATTGCTAAAAACGCCGATATGTTTAAGGGCGTTCCAGCATCTGG +TGGTTGATTTTAGGGACTCCTTCGGGAGTCCCTTTTTTGCTTTTAAATGATGTGATATAA +TTCTTTTATCAAATGAGGATAACACAATGAGAACACCGTTTCAAAATCCACTACCATGGA +TTAAAGCTTGGATCAAAAGTCGACAAGAACCTAATGATTGGGTAGAAGAATTTCATTCTG +AACTACGAAAAAATACTAACGCTGAGTTCAAAGAAAAAGAAATTAAGCATCAGTATGAAG +AAGCTGAAGCACTTGCTGACCAATATCTTGGAGATAAGATAAAATGAAAACATCTGCTTG +GATGAAACCAGTTGAATCAGTCGGTGGTATTACAAAACTTATTACTGATCGTTTAATTCA +CGATCATCTTTTTATCATGAATAGCCCAGATCTTTATGATCTAGTAGATATCTTTATTCA +TTGCTATCGTGAAGAAGGTACTACATTACGTGTTGTATATGAAGCTCATTTTCACTTCGT +TGGTGAGCAAGCTGTTATTCGCTTTGGGACATCTTGGTTATGATTAAATGGCTTAAAACT +TTATTCACTCCGGCTCAACCCGATGATCGTCTGGTTCCTTTATCAGTTAATGATGTTATT +GTTCCTATGCAAGAACCAAAAGAGTATGTTTATATTGGTGATGGAAAGATGGAAGAAGTT +ATTCGTCCAAAGGAAACCGATATGCAATACCTGATTCGTCGTAATCATGAAATTCAGGCA +GAACGATTTAAATCTAGGTCTCAGCCAAAAGCTAATCCAGGCCCAAGTGCTAAGCCGTTG +AATGCAAAAGAACTTAAAACTCGAGTACAGGTTGTTAAAAGTCGTCAACAAACTTCATCG +AGTGCTCAATATAATTACAGTCCAAGCAACCCGGCTCCATTTATTTCAGGAGATTCATAT +GACTCAGGTTACTCATCCTGTGATTCAAGTAGTTCAGGTGGATGCGACTAAGTATAACAG +ACCTACATATAAAAGTCCACTAAAGAAATCTAATTTTGATATGTGGTATCGTTCTATGAA +GGCTGCAGCATTTTTGATAATTGCTGCACCTGCTATGATTAAAGCAAACGATAAGTGGTT +TGAAGAAAATAATATTGAAGAAGGTGCTATCTGTGGAAAAATGCGTAAACATCAGTAAGA +AATACTCAATTGAACTATCCAAAAAAGTAAATGGAAGAACTATAATTCAGCAAAATGATG +TGTTTACAGTTATCATTTCAGCTTTTGCTTCAAACTCTTCAACGAAGCATGAAGACTATT +TCAATGAGCAAATCGATAAACTAATTAATGGATTGAGTTTTCCTGAATCTGCAGTATGCT +TTATTAGACATGAAGCTGACGTTACTCAAAAGCCTGGGACTCCATTTGGTCATATAGAAG +CATTAAATCGTCTTGGATATGATGTACCTCGATATCAGCCCGGTGATAAGTTGTTTATTA +ACACTGAACAAAGAACGATATGGAAAAAGTTCCTCATCATTGATAACAATGATTTTGATG +AGCTCCAAAAATTCATCTGGAACCACTATGAAGATCGTGGATTGATCTTCACTGAATCTG +AATCGGCAAAACTCGCTCGCGAAAGCCTATATGAGCAAATGCGTCTTGATAACCTATCAC +TTCGGTACGGTCGATAATGGATCTGTTTGATATGCTAGAGCCGGCTGAAAAGCCGGTTGT +CGATTTACATAAGGTTGATATTTCAAAAGAAATTTTTGAAGTGCTTAAATCACACGGTAT +CGAATCTACTAAAGCTGCAGAAGATCTAGCTGATCTATTCTGCTTTCCTCCTCCCTGGGC +TCCTTGGGCCTAACCGTTTACATCCTCCTCTCACTATGATATGATAGCTTTCGTAAACAA +ACGGAGGCTATCGTGTTATATCAAAAAGAACACCTCGAAGAAATCCGCGAAAGCGCGGAA +CACAACTCGTCTTACTACGAACAAGCTATTTCACAGTTCAGTGACTACGAACAATCTGTT +TTATGGCAATGCTTCAACGATAAAGCGGACCCTAAATTGCATCTAGATTTAGACCCAATC +GTTCGTCGCAACATCACTTCAGACGTGCCAGTAGAACTATATCGTGGAGTATCAAAGAAA +ACCGCCGCTTGGCTTAGTCATATGGAAGTTGGTCGTATTATTGCTGATAATCGTGTCACT +TCATTCTCGTCTGATTTTGCCACTGCAAGACAGTTCGCTGGCGCTTATTGCTATAACACA +AAAGTCATTCTTTCACTTCGTAATTGTCCATTTGCTTTTAACTTTCAAGAGCATGCAATG +AACTTGGTTCTGGCTGCGCCAGATTCTGAGTTTAGATGTAATGCAATTAATGGTGATGAG +AGAATGGAAAAGTTGGAGATGATTAACGCGGAAGATGAGTGGATGTTCCCTATCGGGACT +CAGTTTGAGATAGTCAGTATTGAAGATTATCAGTTAGACCCGTTATCTCCGGTCTACAAA +ATCTATCATTTGAACTTCTATTCTTTCTGACCGTTTACATTCAACGGAAAGTGTAATAGA +ATAATCCTTGAAACCATTATACCACCTTCGCAAATAAAGCAAATAAGGATTCGTCATGGC +TATGCCTCGTGAAGTTGTTATCGCTCAACGTTTAGTTCAAACTTACAAAAGCGCATCATC +ACGCAGCAAAGAGTTCAATCTGAGTATGGACTATCTTTTGAACATCATGGCACAAGACAC +TTGTGCATACTCAGGTGAAAAATTTCATAAAGAGCCCGGTGATCATCAAATGACACTTGA +GCGTTTTAACAACAAAATCGGATACGTAGAAGGAAATGTGATCCCGGTCAAGTTGAAGTA +CAATCGTCTTCGCGCTAATCATGAAATTGAAGATCTTATTCGTCTCCAAGAAACAACTGC +AGCACGTATTGTTGCTCGTGTAGATGCCAAGAAAGATGTCGCTCCAGTAGTGAAAGAAAA +GCCTATTCAAGACGTTCATCGCATTGATTTACCTGGATTTGAAGACATTAACTTGATCTA +TGTCCCTAAGACTCAACGCGAAGAAATCCGCCGAATTGTTCAGAACATCAAATCTCGTCA +AGCTCATATATTACAAAAAGGTGTAACTAAAGAACATAAAAAATCACTGGAAGTTCGTAT +TCACGGCGGAATCACTCGTATAAAAGCGATAATCAAACAAAAATATAAAGCTCCACAGGT +TGTGACTTCCCGCGCTGCGTCGAAGAAGACTTCTAAAGCAGAAAATACTTCTTATGATTA +TGGTATAATTATCCAGGGTTTGAATCGTTTTCAGAATCTCTCTCGTCTTGATAAAGCTAA +ACTGAAGAAAGGTTTGCCACTTTCCGCCACCTTCTTCCAACTGTTAAGAGGTAAAATGTG +ATGCACTATGGTTACATGTTGGTCTACAAAGACAAATCCGGGTATGAAATCCCGGTATAT +GAATTCTACCGAAATAATCCAATCGGCGGAGCTATGATTTACACTAATAAGAATGACGCT +CGTCATGCTTTAGCTCAAGAAGTTGCTGAGTTACAAGAACGTCTTGATCGCGGAATGAAA +GTTGTTACTCAGAAGAAAAAATGGCTTTTCTTCAAACGTGATATTATTACTTACATTCCA +GTTAAAGATGAAGAAACTCGTCGTCATCTGCAATTGCTCATTAACACCATAAAAGTAAAA +CGAGTTTCAGTAGCCTAGGAGTCATTTTGAGAATTACATTTGAACAATTAACTCGAAGCC +AAAAAAGTACGTTTGATACGACTATCACGGCTATTAAAGAGAAGAAAACTCACGTAACAA +TTAATGGTCCAGCAGGTACTGGTAAAACTACTCTTACTCGCTTTATTGTAGACCATTTAA +TTTCTACTGGAGAAACGGGTATTATCTTAGCTGCACCTACTCATGCGGCTAAAAAGGTGT +TGTCTAAATTGTCTGGTATGGCTGCTGCTACTATTCATAGCATCCTCAAAATTAACCCGA +CGACTTATGAAGAGAATATGCTCTTCGAACAAAAAGAAGTTCCAGATTTGGCACAATGTC +GAGTGCTTATTTGTGATGAAGCTTCTATGTGGGACCGTGAGCTGTTCAAGATTTTAATGG +CCTCAATTCCTAGTTGGTGTACAATCATTGCAATTGGTGATGAAGCTCAGATTCGTCCGG +TATCTCCTGGCGATTCTTCAACTCATAAATCGCCTTTCTTTACTCATAAAGATTTCCTAC +AATTAGAACTCGACGAAGTAATGAGAAGTAACGCTCCGATTATTGAAGTTGCTACTGATA +TTCGTCAAGGCAAATGGATTTATGAGCATACCAGAGATGGTCATGGTGTTCATGGATTCC +AAAGCTCGACTGCATTAAAAGATTACATGATGCAGTATTTTAGCATCGTAAAATCTCCAG +AAGATTTATTTGAAAACCGAATGCTAGCATTCACAAATAAGTCAGTAGACAAATTGAATA +GCATTATTCGTCGTAGGTTGTATCAAACTGAAGATGCTTTCGTTACTGGTGAAATCATCG +TTATGCAAGAACCTCTCATGAGAGAGTTGATGTATGATGGTAAGAAATTCACTGAAACAT +TATTCAACAACGGACAATATGTTCGTATATTAGATGCTCAGTACACATCAACCTTTTTAG +GTGCCAAGGGAGTCTCTGGTGAACACCTAATACGTCATTGGGTGTTAGATGTAGAAACAT +ATGGTGATGATGAAGAGTACGCCAGAGAACAAATAAGGGTCATTAGTGACGAACAAGAAA +TGAACAAATTCCAGTTCTTCTTGGCTAAAGCTGCTGATACTTATAAAAACTGGAATAAGG +GTGGTAAAGCACCTTGGTCTGAATTCTGGGAAGCTAAACGTAAGTTCCATAAAGTTAAAG +CACTTCCTTGTTCTACGTTCCATAAAGCTCAAGGTATTTCTGTAGATACAAGTTTTATCT +ATACTCCGTGTATTCACGTTAGCAATGATAACAAATTTAAGTTAGAATTGCTTTATGTGG +GTGCTACTCGTGGTCGTCATGATGTTTTCTTTGTGTGAGGATTTATGTACAGTTTGAATA +TTGATGATTTTGAAAAATTAATTGATGCTGTTAAGATTAACAAGCCTGATGATAAATGGT +GGCAGTGCCGTCAAACTGAATTAGTGTCAGAGCTAAATGAAATCCGTGACAAAGCTTTAG +CTATTGCATGGTTCCAAGGCGAATGTCCGCTTATCGGAATTAGTGATAATATTGCTCAAC +AAATTTATGATTTGAAGGTAGAACTATGTCGTTAAGAGATTTTATTATTGACTATGAAAC +TTTTGGAAACGTTTCAAATACTGCTGTCATCGATTTGGCAGCGGTAGTATTCGATCCTAA +TCCAGAAGTTATCGAAACATTCGATGAGCTAGTTTCTCGTGGAATGAAATTAAAATTCAA +TTTGAAAGCACAAAAAGGTGTTCGTCTGTTCGGTGCTTCTACAATCGAATGGTGGAAAAA +GCAATCCGCTGAAGCTCGCGCTAACTTAGCCCCATCTCCGGAAGATATTGACCACGTTGA +AGGCTTGTATAAACTTCTGGCATTCTTGAAAGAAAATGGAGTTAATGCTTGGGATTCATT +CGGCTGGTGTCGTGGTCAATCTTTTGACTTCCCTATTCTGGTTGATATTCTCCGCGAAGG +AGAACGCCGTAAAGGTATCGCAGATAAAGATATCGATACGTTTGGTTTAGAACCATGCAA +ATTCTGGAACCAACGAGATGTTCGTACCGCAATTGAAAGTCTTTTGATGACTCGTGGTCT +TACAACTACTCCGCTGCATAAGGGTGTTCTTGACGGATTTATTGCACATGATTCTATTCA +TGACTGTGCTAAGGACATCTTGATGTTAAAATATGCTCAACGTTACGCACTAGGCTTGGA +CGAAGCTCCTGTCGGTGATGAAATTGATCCACTCTCTTTACCTAAAGGTCGAGGTTAATA +TGTTTAAGAAAAATGATAAAGTTAAAGTTATTTCTGGTAAGAATGCAGGCGTAGTAGGTG +TAGTTCTCGGCCATTCAGTTCGTGATGGTTATCGCATTCGTAGCAATAAAGATAAAGTGA +TTTACGCTAAACCTTACTATGTAGTTGAAGATCCGATGGCTGAACGCACTGAAATCCGCG +AAGGCGATGTGGTTATCGTGATGGAACCTTTCGCTGCTCATTATAGCACGGTTTCTAAAA +GTCCGTATGAATGCTTATGTAAAAACAAAAACGCGATGGTATCTCTGATTTACACCGACG +AAGAGTATGGTGAAGTCGCGAACATCGTTTACCAAGGCCAGTATGCAATCATCCCTCTGA +GTGCTCTCCGACGTAATCCTACCCGCATCGCAGGAAAACACATCACAGTATAACTCTGCC +GTTTACTTTCCTTGAGGGCTATGATACTATAGCCCTATCAACAAACATGGAGAAACAAAA +TGAAACAGCAACTCACTCAAGACCAATTCGAAGATATTCTCTTCAACCCTGATCTTACTG +TGGTTCAGAAAGATGTATCTGGTCATCTTGAGCACACTACATACGCTTACGTGTATCAAG +GAGCTTTGGCGGTTTACGCTGCAGTTCGTCATATCACTGAAGCCGGCACAACTTACTGGA +AGGAAGCTATATAATGAAATTTGAACAGAAATTTGAACAAGGTAAATTCTACGCTTTTCG +CGGTGAAAATTCTCGCGATAAATTTGAGAGTAGCCATCACACAAATAAAGCTATCGTAGA +TGCTATTATTGCAAATGGCGGGGTATTTGAAGCAATTCAGGTTAACGGTTGGGGTGCATT +AGAAGTTGCCCGCTTTACATCTACTGGCAAAGTCTATCATGAAAATGATGATTGCCAATT +TTATCTGTCAAGCGATGAAGCTGAATTCTTCATCGAAGTCGAGCCTAATGGAAATCCAGC +GAACCCTAAGCCTTCATTGATTCAAATTGATCAGCAGATGACTGCTGCTAATGATGAAAA +ATTTGAAGATTCTGAAATGATCACCGATGATATTCCAGTGCCTCATACTACTCTGGTGAT +CACTAATCTTTCTGAAGCGATTTCTGCTTACAAAATGTTGAAAGGAATTATTCCAGATGC +CAGTCTATAACTATAAATGTCCAGCATGCGCTCAAGAAATTGAAGTCATTCGTAAAATTT +CTGAGCGTGATAATGAAATGATTTGTCCAGTACTTTGTTGCTCGAATCAAATGGAACGTA +CAGTGGCCGCTCCGAAATCAGTTCATGGCGGCTTTTACGACAACTTAAAATCAGGCGGTA +GCAATCTATGAAATGGGAATTAGGAAAGACTTACGCATTTGCTGATGCTAGTGCATTTCA +GATCGGTGTTAATCGTGAGATTCGCGAAATCATTGAAAGTAATATAGATGGATTGTTTCA +GGTTTCGCGTTTGAGCAAAAATCCACTCACTGATGTTGATTATCATGTATATGAAATTGT +ATTATCTGATGGTCGCATAATTGATGGTGAAGTAGCTCGTGAATTACATGGTTTTGGCAA +ACATGATATTTTCGCAATTTTTGCTAGTGAGCGCAAACATTTTAAAGAAGTAGAATGTAA +TCGCGATTTCAAAATTTCTATAGATGATTGTCAATATGACGAATTTGACGAATTTGACAA +TGAATCAGAAGAAAATGTGATGCCCAAAATCGAAGTTGAAGGTCGCATTGCTATTGGTTC +TATTTCTTCCGAAGAAGAACGCCTTTGGTTGATTGATTCACTAAACAGGATTAAATTCAA +ATGAATTCAAAACCTCGTAACATCATCAAACCAGGTGAAACTAAAATAATCAAACTCGCT +GATGGGCGAGTTTTTAAAATCAAAAAGGCTACGAAATGAAAAAGATTCTGATTACTCTAG +CTGTTGTATTTGCAATGGTAGGTTGCACCGATGCCGATAACGCTGTGAAAGTACTGCAAG +CTAATGGTTTCACTAATATTCAAACCACTGGATATAGCTTCTTCTCTTGTGGTAATGATG +ATTCTCAAGCTACCGGATTTACTGCAATAGGTCCTACTGGTGTTCCGGTTAAAGGTGCCG +TTTGTTCAGGCATTTTTATTAAGAACTCAACTATCAGGTTTGAATGATGGAAATTATTAA +AAGCTTTGGCAAGCACAAGTATGAAGATCGCTTGTGCTTTATTACAACATTAAAGGTTTG +TACTCGTACATTGACAAGTTACACCACTGCTCCAATCACTCCAAAGCATTTGCGTGGTAT +GAAGAAAAATATCCGACTACTAGCTTCAGGAAGACGTTTTCCTCATGAAGTTTTAGGTTG +CAAAAACAAAAATTGTAAGCATTGTAAATAGGGCTTCGGCCCTATGGAGGATATATGATT +TTCGGATTAACAACTGCTCAAAAACAAGCCAAGGCCCATCTTGAAGTTGTTGAACGTGCT +ATTGGTCGTTATCGATTTGCTTGGTGGCCTACTCGAATCACAACAGGTCAGACGATTTGG +CTTCAAAAATATTATGAAGTTGAAATCAGAGATATTGTAATACAACTCGATAAAACTTAT +GGTGTAGATACTAATATTACGTATGCAGTATATGCTTATTCTGATATTTCAAAAGCCGAT +TGGAAAATATTCGAAGCTTATAAACAAAACTACGGTCTGTATTACGCAAACAAATGTCAT +AAGGAAATAAAAGGTAATGAGGCCTTTGATCATTTGATTCGTTACAAAGCTGAATTAAAT +GCTTTACTAAATCCGTGATAATATTTCACCACTAAACGAGGAAATGAAATGATCAATAAC +GAAATTAAAATTCTTAGTGACCGTGAGCATATTATCAAACGTAGTGGTATGTACATCGGT +TCTTCTGCTCATGAATCTCATGACCGTTTTATGTTCGGTAAGTTCGGTGCAGTGAAATAT +GTTCCAGGAGTAATTAAACTGATTGATGAAATCATCGATAACTCCGTAGATGAAGCAATT +CGTACTAACTTCAAGTTTGCGAACAAAATTTCTGTAGACCTTAAAGGTAACAAAATTATT +GTTACCGATAATGGTCGTGGACTTCCTCAGGCTGATGTAGTTACTCCTGAAGGTGATACT +ATTCCAGGTCCAGTAGCTGCATGGACTCGTCCTCGCGCAGGTGGTAACTTCGGTGATGAT +GCTGAACGAAAAACTGGCGGTATGAATGGCGTGGGCAGTGCTCTGACCAACATTTTCTCA +GTCACTTTCGCTGGTGCAACATGTGATGGTAAAAATGAAATTGTAGTTCGTTGTTCTAAC +GGTGCTGAAAATATTTCATGGGAAACTGTTCCAGCAGAAAAGAAAGAGCACATTCAAACT +AAGACTGGCACAATCGTTTCATTTATTCCAGATTTTAGTCATTTTGAAAGTACCGGACTG +ACTCAAATTGATGAAGATATTATTCATGATCGTCTTCAAACTCTGTCAGTAGTATTCCCT +GATATTGAATTCAAATTCATGGGTAAGAAAGTACAGGGAAACTTCAAGAAATACGCAAAG +CAATTTGATGAAGAAGCAATTGTATTTGACGAAGAAAATTGTTCAATGGCGATTGGTCGT +TCTGATGATGGTTTTCGTCATCTGAGTTATGTGAACAATATTCATACAAGCAAGGGTGGT +TCTCACGTTGATTTGATTATCGATGAACTGAGTAATGAACTCATCCCGATGCTTAAACGC +AAGTATAAGCTAGATGTTAATAAAGCTCGTATCAAAGAGTGCTTAACTCTGGTTGTATTT +GTTCGTGATATGAGCAATATGCGTTTTGATTCTCAGACTAAAGAGCGTTTAACCTCTCCA +TGGGGTGAAGTAAAAGCTCATATGAATCTGGACTATAAGAAACTTGCGCAGCAAGTTATG +AAAGCTGAAGCAATTCATATGCCGATTATCGAAGCTATGTTGGCTCGTAAATTGGCGGCA +GAGAAAGCAGCTGAAACTAAAGCTGCTAAGAAAGCCCAGAAAGCTAAAGTAGCAAAACAT +ATCAAACCAGCTAAATATGGTGATGATTCTGTTGAGACTACTTTGTTCTTGACAGAAGGT +GATTCGGCAATCGGTTATCTGATTAATACTCGTGACCGTGATCTTCATGGCGGATATCCG +TTGCGTGGTAAAGTAATGAACACGTGGGGAATGTCGGCTGCTGAAGCGATGAAGAACAAA +GAAATCTTCGATATTTGTGCAATCACGGGATTGGTAATCGGTGAAGATTTTGATACTTTG +AACTATAAGAATATCGCTATCATGACCGATGCCGACGTCGATGGTGTTGGTTCAATTTAT +CCAAGTCTGTTAGGTTTCTTCAGTAATTGGCCTCGTCTATTTGAAGAAGGTCGTATTCGT +TTCGTTAAAACTCCAGTCATCATTATGTCAAAGGGTTCTGAACAGAAATGGTACTATTCT +GCTGCTGAATATGAAGCTGAAAAAGAAAAATTATCTGGTTGGAAACTTCGTTACATCAAA +GGTCTTGGTTCTTTAGAAGAAGACGAGTATGAACGAGTTATTCAACAACCAGTTTATGAT +GTAGTTTCTTTGCCTGATAACTGGAAAGAATTATTTGAAATGGTTATGGGTAATGATGCA +GCTCCTCGCAAAGTCTGGATGAGCGAATAAATATACATGAGCAATTCTGCTCTATAATAA +GGAGAGTTTATGTCTCAAGCTTGGATTACACTCGTAGACGGAAGTTATGGTTACATGTGG +GCCGACGCATTGCCACTTCCTGGTGATTGGGTAACAATTCGTGTAAGGCAGATTGATAAC +TCTTTCAAGAAAGTATATGGACAGGTATCGAGAGCTACCTGGTAATTTTAGGGACTCCTT +CGGGAGTCCCATTTTTGTATGGTGACTTATGAAAATGCTTAATGGAAGTTATGTTAATTT +AGACAACGTAGCAGCAACAATCCGAGACAGTAAGAAGATGTACTTGGATAAACTCAAGAA +TACACCAGATGATCTCTGGTTAAATCAATTCTCGAGGATGATAGAACATATTGCTGTACT +GGTAGAAAGTAACCAAGCCATTCCAGTTAACCTACAACAAACAGCACTGAAGCTGGTCTA +TTCAGCTCGTGAAAAATATACTATAAGAGAATTCGCCGCAATTCTCAGAGAGGTTGGTAA +TGAAAAGTTATAATGTGAATTTGACACTCTTTGATGACGCGGTATTCCGTGAGTACAGAA +TCATTCAGCGGTTCTTCGACATCAATGAAGCTGAAATCTTTAAAGACCGCTTTAAAGAAA +TTCGTATTAAAATTAAAAACGACACTGCAACTAAAGATGAACTCTTGGAAGTTGCAGATT +TAATTAAACGACATAACTGATAGGAACAATATGATTATTTCCCAAGAACAGGAAGTGGTA +TTTGGTTCTGCGAATCAACAAGCAACTGCTTTCGGAATTGAAAACAACGCAAAGGCATTT +ATTCTTCTTTCTGATAAGCTGTACACTAACAAGCCTTATGCGATTGTACGTGAATTGTCG +ACAAACTGTCTTGATGCTCATAAGTTGAATGGTCAGACTCGTCCATTTGAAGTTAAAGTT +CCTACACGTTTAGACCCACGATTCGTTATCCGTGATTTTGGTCCAGGTCTGTCTGATGTT +CAGATTCGCGGAGCTAATGGTAAGCCCGGTCTGTATAACACTTATTTCGCTTCGACAAAA +TCTGATTCAAACGACTTTATCGGTGCAATGGGTCTTGGTTCTAAATCTCCGTTTAGTTAT +ACAAAAACATTCACAATCATTTCGTGCCATGATGGCCGCAAAATGGGTTACACCGCGATC +ATGAAAAACCTCGGTCCAGAAATCATTCCTTTGTTTAATGAACCGATGGGCGAAGATGAT +GTGACTGGTATTGAGATCACCGTCCCGGTTAAGACAGACGATATCTCTAAATGGGAAACG +GAAATTAAGCGAACATTCCGTACGTTCGTTGGTGTTGAACCTAAGATTCTAGGCTCGAAA +GTTGAAATTAATTATTTCCCTGAATTCACTCCAGACAAACAATGGTTCAGTCTTAATTCA +AGCCCATTTGAAAATGATCAATCGTTGTACGCAGTTTATGGTCGAATCGTTTATCCAATT +AAGATGAGCGAAATTCCTGATATCAAAGCCGACTGGCTTCTGAACCGATATGGTCGAGTT +TATGTTCATTTTGATTTAGGTGAACTGGATATTACTCCATCTCGCGAAGAACTTTCCTAT +AATGAAGAGACTATCGAGAACATCCGAAATAAAGTTAACAATCTTGAAAATATAACGTTA +GCTGCTGATTTAGAACGTTTCCAAACAATTGAAAATAAACGTCAACTGTCTCGTGAATTA +CAAGCATTAAATCATCGTCAGCGGACGATTTTAGGCACTCGTTCTATTCTTATCCAAGAT +AAGCCATATCAAGATTGGGTATCGATGTTCCACCATAGCAAAGTAGAGAATTTGGTCTAC +AATGCGAATATGGTAGCTTATTATGTCGGTACAGATGCTGAACGGCGTCGAATTTCTAGT +TCATGGAATGTTCGTAATCGTCTGTCAGCACAAAGTTTGTTCTCGATTGATCACAAGAAA +ATAGTATTCATGATTGATGATAAACCATCTCGTCGTGCTTCTACTATTCGTGGTATGTAT +GCTCTTGATATTCACAAGTACTGCTATGTTATCTTGGTTAATCCAGATAATGAGAAAGAA +GTTCATGTGATGAACGAAATCACTAAGTTGTTTGAAGGTGATGAAGTTATATGCTTTAAA +TGTTCTGAAATGGAACAGGCAAGAGTTAAAGACGCTGAACTGAATGCAGCCAATTCAGAT +AAAGAAGGCGCTAAGCGTCCTAAATCTCCTAACATCCAGAAATGGTCTAAGACTGATGGA +AAATGGGAAGTTGATTCGTTCTGCATGAGTGCTAATGAAGTTCGTGAACTTGAAGGTTAC +GCAATTGGAATATCTCGTGATTCTATCGTCGAGTTTCCAAGCGGTAATGAAACATCATTC +GACCAGACAAATATTAAAGGCGCTTGTGACTATGTGAATGTTTCAGAGTTCTGGATGATT +CGTCCAGCTGCAATGAAATACGCGCAAGATGCTGCATTGGACCCTCTGATGGATGAGTTC +GTCAATAAGTTCATTGAATTGATTGATAAAGTTGATGCTGATGTTATTCCACCTTCGACT +ACATCTCGACGTCAAATTAATAATATTCTGTCAATTAAAGCGTTGACTCCGTTAATTAAG +AATTTTTACGACGTTAAAGATTGGCAATCTTCTGTAGAATTGAACCAGTTTGTCAAGACA +TTTAATGGGTCAATTCATGGTGAAGGTGAAAACGCTGAGAAATTAGCTTTATGTCAGAAG +ATTTATAATAGACTCGTAGAAACCGCTAAGTCAGATTTTGAAATAAAAGCGGAAGAGTTT +GAAGAAAAATATCCGGTCATTTGGTATATGCTTGATGAATACTACATCCATGAAGCTAAA +AATCACAATGATCTTGCTAAAATTGCGGCGCTGTTAGGCGCCATCTAAGAGGTTTATATG +GCCGTACAAATGTTTTCTGATCGTCAAAAGATGTCCATCGTTGATTATGCCGAAGCCGGT +TATACTCGCACATGGATTGCTAATCGTTTCAATTGCTCGACTGATACTGTTCGTCGAGTA +ATTAAAGAAATGAAACCTGTTGAAGTCGAAGAGACTGAAGCTCCGGCTCCAGAAGCTCAA +TACATCTGGAACGCATCAAATAAATTCATCAGCATCACCGATCTGTCTACTCATAAGACT +TATCCAGCTGATCATAAAACGAAGGGCTTTAAAATTGCTCTGCAGCGTCTGATTGATGGT +GACATCCCAGGTGCACTGGAAATTATCAACGCAGAAAAAGGTCTGACTACTTTCGTTAAA +GGCAACGTTAAAATCGATAACGGTGCTTTGTTCTTCAAAGACATCGAAATCAAATCCGGT +TTGACCGAGCGTATTCTGGACTCTATGGAAAAAGGTGAAAACTTCGAGCGTTATCTGCCT +TTCCTGGAAAACCTGATGTTGAACCCAAGCCGTAAAGCTGTTTATCGTCTGTTTGATTTC +CTGGTAGCGAACGATATTGAAATCACTGATGACGGTCATTTCATCGCATGGAAAGTAGTT +ACTAAAGATTATAAAGACTGCCGTACTAATAGTTTTGATAACTCTCCTGGTGTAGTGGTT +AAAATGGAACGCAACCAGGTCGATGAAGATGATGAACGTACTTGTTCTTCTGGTCTTCAT +GTTTGCTCCAAGAGCTACATCTCTTACTTCCAGGGTGGTTCTGATCGCGTTGTTTCCGTT +AAGGTGCATCCACGTGATGTAGTAAGTATCCCGGTTGATTATGGTGATGCTAAAATGCGT +ACTTGTGAATATCTCGTACTTGAAGATGTTACTGCACAATGGGGAGTTCGCTAATGTTAC +CGCATCAACACCGCGTAGTGCAAGAACGTGATGATCTTGCTGTAAAAATCGAAGCTTTAG +GCACGTTTATCGATAATCAAAACCCGGTATTCAAGAATCTTGATATTGAAGACCAATTTC +TGCTTAAACAGCAATTAACTATTATGGTTGAATATCATCGAGTACTCGATGCTCGTATCA +ATCGATTCTAATTAAAGCCCTTCGGGGCTTTTTCTTTGAGGTTACTATGATCAATCCAAT +GAACGTGGGCGATTCAAGCATTAAAGAAATTACTTTGCATGGAAACCATTATGCGGAAAT +TTTATACGCATTAGATGTTATCTTAGACCCAAATGCAGATGGTGTATTGGTATGCGAAGA +TTCATTTCTTGGAAAGGAAAATGTTGATAATGCTTTAATGAATCTTGAATGTTTGAATTA +TAACGATCGAGTCTACCAAGGAGTAGTTCGAGTACGTGACTTTTACATTGGTGGTAAAAA +TGAACAAGCAGAACGTGGGGTTAATGAGACTGAAGAGCCTGCCGACATTTACCCGGTCGA +AGAATGATCTTACAGGTGAAGACAAAGTCAAAATAAAAGGCACTGTCCAGTACTCGATGG +AAAAAGATCCAGACCAGGATTTAGAACAAGTTAAACGTCGATGCATGATAGCTCAAATGG +CTGAGCGAGCAGTTGCAACTTGGGTTGATGGATATGTTGCTAACATGAAAGCTGATTACG +AAGATCCATTGACGTTCGCTTGGGATGTGTTAGCTCATCCAAAGTATTCTGGATTGAGAA +TTGAAGTTAAGACTCATCAGTCTGACGCTAAGTGGATTTCTGTCACTACTGGTTATGGCG +GAGACTATCCGTATGGATCTGGGGTTAACCTAGGACCTTTCTTGACGCATCAAGTAGCGG +ACTGTATTATCATATTAGATGTGGAAGAAAGTTCACCGGGGCTGTTCTCGTTTACCCTAA +AGTTCGTTGGGGACCAAGAAGACCTAAAGAAAGTTGTTCGGCGTAGTAACTACACTGGAT +GGTACTTGAACATTTAATCTTCAGCCGTTTACATCCACATGGAAGTGGTATACTATAGAC +TAGAAATCAACTAACGGAGAACAAAATGAAACGTTACTACTTGAAAAACCGAGCTTCAAA +AGAAATTGTTACTGCTACTTTCGATGCTAATGAAGAAGGTGATTGGACAATTCTTGATTT +TGATGGCGAGCAGCCTTATTTTGGTTCTAAAGAAGAACTGGAAAGAATTCGTTCAGGGTT +ATGTAATGATTCATGGTCTTACGAAATAAGCAAATTCGTTAAACTTGCTATTAAATTAGA +ACTTCTTGACATCATTGAGGTTGAATTATGATCCGTGTAACTGTGATGGTCTGGTTTGAA +CTAGAAAACGGTGAGCCTCGTTTCAAAACCTGGGACGATTATATGCATGGCGGAGACGCT +CAGTATGTTGCTAAAAGACTTGCCGAAACATATCGTTATGGACAGTGTAAGATCTTTGAT +AATCTGGATCGTGTCATTGGAAGTGCAGGATTTGAATAAAAGCGGAGGGTAATGCCCTCC +TGGAGCATAAAACTTTAACTAATGAGAGAAATAATATGAAAACTTTACTGACTGTATCTG +TTCTGGAAAAAGCCGGAGCGACTGTTCTGGGTAAAATCAAAAATGCTGATTGGTTTAACA +GTGAACCCGCTCGTGAAGTTCTGAGTGAACCTGGGTTTTATTTTCTGGTAAATCCGGGTT +CTTACACTACAGCTCGTTTTTATGTAGGTCGTCAACGCTCTAAATGCGGATTCAGCAATG +TTCTATCTCAATTAAGTCGCGGTCGTTCTCAGCTGGGTCGTACTCTTCGTTGTAACGATG +TAATCTACGATGTATTCTTTGTTCCAGTTTCAAAAATGAAAGCTTTGACAACTGGTTACA +ATAAAGGCCAACTTTCTCTGATGTTTACTAAATCTCATAAAGAAGCGTTTCAGAACCTGG +AAGAAATGAATCGTATGCTGAATGATAACTTCCTCTTCGGTCTACAGAGTTACTAATGAA +ACAGATCTGGACTTTGGTGTTTCTGATTATGATTTTGCTTCTGGTGTATTCATCTGGGAG +GGATTAATCCCTCCGCCTCGAGTACTTGGAAGTATGTGCTTTGGAATTGCGATATTAGCT +TTAGAACGAATTTTTCTTTTATGTGGGATTGATTAAATGAATAAATTAGTGGGTGCATTG +GGTTGCGGTATTTGGGCTGGGTTATTTGTAAGTTTTGCAACTGGAGTTGCTACTCCTACG +GTATTTTCTTCTAGCATAATGGCGTTGACTCTGTTCATTGTGACTTTGATTAATTTAGTA +AAATGAAACGTGTAACTGCTTCTATTGTTATTTTGGCCATCATTATGATGGCCGTTTTCT +ATGGTGTAGCTTACGGCATTACTGAGATTTTGCTTTTCTTAGTTAATGTTATGATTGACA +TCGGTTCAATAATTTGGTAATCATATGCAATTAAATCAAAGAAGTCTTCAAAGTATTATT +GATAATGAAGCGAAAGAATTCGCAATTTACACTGTCGAGAACCGTGCTATTCCAAACATG +ATTGACGGGTTTAAGCCGGTTCAACGTTTTGTAATGAAGCGAGCTCTTGACTTAAGCCGT +GGAAATAAAGAAAAATTTCATAAACTTGCTTCTGTTGCAGGTGGAGTAGCGGATCTTGGT +TATCACCATGGTGAAGGCTCAGCTCAAGATGCCGGTGCATTGATGGCTAATACATGGAAC +AACAACTTTCCACTATTAGATGGTCAAGGTAACTTTGGTTCTCGTTTGGTTCAAAAGGCT +GCAGCATCTCGTTATATTTTCTGTCGTGTATCTGACAATTTCCGTAAGGTTTATAAAGAC +TTAGAGATTGCTCCGGCTCACAAAGATAAAGAACATGTTCCACCAGCTTTCTATCTTCCT +GTAATTCCTACAGTTCTTTTGAATGGTGTTCGTGGTATTGCAACTGGTTATGCAACAAAT +ATTCTTCCACATAGTTTTGAGTCTGTTGTTGAATGTACTCGATTAGCTTTGGAAGGAAAA +CTTGATAAAGAACCTGAAGTTAAATTCCCTAAATTCAACGGGAAAGTAATTCCAACTGAA +GACGGTGGAGTTGAATTGCACGGTGTGTATAAATTCACTTCAGCAACTCAGATGTATATC +AGTGAAATTCCATATAAGTTTGACCGTGACACTTACGTGGAAAAAGTACTCGATCCTTTA +GAAGAAAAAGGTCTGATCACTTATACTGATGATTGTTCAAAGGCTGGATTTGGTTTTAAA +GTTAAATTCCGCGGTGTGTATAATCTTCCGGTGTCAACTGAAATGCGTCATGACATGATT +ATGCGTGATTTCAAATTGATTGAGAAATTGTCGCAGTTCATTGTTGTCATTGACGAAAAT +GGTAAGTTGAACGATAAGTTCACTAAAGCGTCTGATTTGATTAAGCACTTCGTTGAAGTT +CGTAAGACTTTCGTCGAAAAACGAATTGAATACAAAACAGCTGAAGTCAAAGAGCAATTA +ACTCTGGCTGTTGCTAAAGCTCAATTCATCAAAGACGTTGTTGACGGAAAAATTGTCATT +CAAGGCAAAACTCGTAAAGCGTTGGTATCTGAACTTGAGAAAGTAGATTTATTCAAAGCT +CATGTTGAAAAACTTGTGTCAATGAACATCTATCACATCACAAGTGACGAAGCCAAGAAA +CTGGTAGAAATTGCAAAAGATCTCAAGAAAGAATACAAGTACTGGCAAGAAACTACACCA +GAAGCTGAGTTCATTAAAGATTTGGAGGAGCTATGCGAGTAGCTATAGTTCTTCTTTTGT +TAACTGTTATTCTTTGGTTCATGCCTGCGTTTATTATAGCGCTTGTTCTTGGAGCTCTCG +TAGTTATTGGATTTATGGGCTTCTTGTTGTCACTTCTTCTGATTTTTCTTTAACTCTACA +CGGCTCTAGTGAATTCTAGAGCCCAGTCCATATAATTAATCATCCTTCCTTGTTGTATCC +TCTCAACCGTTCTGGTTAACTTAAAAATATTTTCATTCAACCGTTTACTTCTGCTTTAAG +ATTTGATATTATTACCTCATACCAAACAAACTGATAGAATCTTGGAGAATAAAATGAAAG +TTACCCTGAAAATCGAAGTTACCAAAATGAAAGCTAAAGACGCTCTGACTTCTAACAAAC +TGATTGTCGATAACGTAGAATACGATATCTGCGGAGTACGTGAAGTTGAACCTGGTACTT +TGACTTTCTTCACAATGATTTTTAGCCCAAAAGCTGAAACAGTGTTTAAGCAGTTCGTTT +TCAACCCAGAAGATGAAGTTACTGTTAAAAACGCAAACTTCAAATAATAACCGTTTACAA +GTGCTATAGTATATGATATTATAGCACTATCAAAACTTAAGGAGAATAAAATGAAAAACG +TAATTATTGCCGCTCTTGAAAACGAAGGTTTAATCATCTGCCATAACGATATCAAAGCTG +TTAAGTTTGTACGTAAAACTTGCAACGATAATGTTTTAGGAGCTATCTATCATGCTATCG +TTTATGACGATAACGAAGATATCTTCAACGTGGTGAGTATTTTTATGAACTGTGAAGACC +TTACTGCTGACTTCGGTGGCTCAGCACACTTTGAAGGTTCTTATGATGAATGTGTTGAAT +ACTTCAACGAATGAGGAAATTATGGAAACTTTAGTTAAAAACTTGAAAATGCTGCTGGAC +AACACTGGTGGTGACTTTGAAGATAAGTTAATGCTCGCTCGTCTTCACTCGTCAAATACT +GATACCAACAGCTATCTGACTGTCTGGCATAACGAGCTATGTGGAGGCTACTATCTAGCT +TGGGTTTACGTCAACAACTATGATATGGTTGTGGTCCTGGATGACGAAATCGAAGATGTC +GCAGAAACTTTGAATCAGGCCAAAGAGTTGTTCAAAGAGTTTTTCCGCTAAGCTGTTTGA +TAGCATCTTTTTCGGAAGGTGCTAAACTAACCGTTTACATCCGCTTTAAACTATGGTATT +ATAGCTACATCAAAACAAATATGGAATTCGGAGAAACAAAATGTCAAAAGTAACTTACAT +CATCAAAGCTTCTGAAAACGTTCTGAATGAAAAAACTGCGGCAATTTTGGTTAAGGTAGC +TAAAGGTAACTTCATCACTTCAGCTGAACTTCGTGAAGAACTCGTTGAAACGATGAATGC +TTCTTCAGTTAACAGCAACATCGGTGTTCTGATTAAGAAAGGACTGATCGAAAAATCAGG +TGATGGATTAGTTATCACAGGTGAAGCACAAGACATCATCTCAAATGCTGCGGTACTTTA +CGCACAAGAAAATGCTCCTGAACTTCTGGAAAAACGAAATACTCGTAAAGCTCGTGCAAT +TACTGGCGAGATGGAAAGCGATAAAGATTTCATGATGGAACTTCTGGCAACTAAAGAAGA +ACTTTTCAAAATCAAAAAGCTGGATGTTTATCGTAGCAACTTTATTGCAGTCCTGGAAAA +ACGTACTTTCGGTATTCGTTCGTTCGAGGTTAGCAATAAAGGTAACTTCCGCATCTCCGG +TTACAAAATGACAGATTCTCAAGTGAAACACTTTGAAGATCTTGGTATGACAGCTAAACA +TTCTAAGAACGGTAACATCTACTTAGACATCGCTCGTACTGAAGAAAATATCGAAAACAT +CATCAACTCCGTTGACACTCTGTAAGGAAACTGAAATGACTATCCAACTGAATAAACTGG +TAGAAGATATTAAAAACACCATGAACCGCTCAGAGATTTTGAATGAACTTCAACGCTGCG +TACAACGAGTTGATGATGAATACCACTTACCAACCAACGCATGGGAAGTCTGGTTCCGAG +GTTCTCATCTCGGCTCGATTGAACTGAAAGCCAAAGGTTGTTATGCAGTTTATAGTTCTC +TTGGTCGTCATTGCGGTGATTGTCAGAACTTTATGCAAGCACTGGCTCGCTTCATTAATT +CATGCGCAGTCATTATCGCCAAGCAGCAAATCGAAGAAACCGAAAAATGGATTGACGAAG +TAACTAAAGAGCCAGAAATTCGTCGTTGGGGCGTTACTCGTAAATCTCGTTGGATTGATA +AAGTCAAAGGGTGGTTCAAATGATGGAAACGATGAATCAAAATAATGAATTGGCGGTTCC +AGATATCTGTTTCAAAATCGCCGATTGGTGGGATGGCCGCAAACTTCAACGTCGTATCGT +CTGTGCAGCTAATCGTTTTGAATTAAAAGCGGGTGGGTATCTGGTTATTCCAGGCTCCAG +ACATTATTCAAAAGATATGGCAGAAGTACTGGACCAAGTGAAAGATAAATTAGTGACTGA +TCACGTTCACGATGAAGACCAAGGATTTATTGATCAATGGGGTGAATATCATAATCGTAA +AGATGCACTGATTATTGCTACACATTCTGGTCAAATTAACACAGTCCGTAAAAAGGGCGC +ACCATACGACACATTATTTTCTGAGGATCTTTATTAATGATTAAGTCAACTCGTCAAGTA +GAAATGGTTCGCTATGAAAGTGCGGCTCTTAAAGCGTTTTACGATAAATGTAAAGAAACT +GGCCTAGACTATTATAGTTCCATTGCTGATGAAATCTCAAATGATTTAATGTGGCAATGT +CAAGACGATGTTTTGAAATTAGTAGATAAAGGCGATTTTGACATTATTTCTATTGGTCGT +CCGATTGAAGAACTTATTAAAGAAATTGAAGAAGTTGTTGAAAACTACGAGCTTGAGGAT +TATTTCTAATGAACGCTAAAGAATTACAAATCGATGCAATCAATAACCGAATTCATGCTT +TAACCCGTGCTAATGAAATGATGCATGAAAATTGGGGCACGTACACCAATGAATCTGGGT +TTAAATTCTGCGAGTCAGAATTGGCTAAGAAACTCACCGGAAAAGATTATGTTTGCCCAT +TCGCATCACCAATTAATGGAATGATTAAACCATTGCTGATGGAACTTTACATTCAAATGA +ATGAGTCCATGATTGAAAGTCTGAAGTATCAACTTAAGGTATTGGGTAATGTACAGACAA +AGAGCGACCAAAGCTGAACAAGAAAATGCTAAGCTTCGAGCTGAATTAGCTAAACGTCCT +GATTACGAATGGTTCGTTGAATTAATCAGACGTCATCTTAAGCAAGATGCTACTGTTCCA +TTACAACACTTGGCTGTGCAAGTTAAACAACTTAAAAATGCAAGAGGACATTTAATTGAA +CAATCCAGTAGCGAAACATGATTTCAATAAAGGTGGAGCTCATAAAGACAAAAAGCGCGC +CTCAAATGATTCCAAGCGCAAACAAAAACATAAGGGAAAAGATCATGAATAGAACTGAAT +TCGAACGTCTTGCATATGACCGGTACCTCGGAGTCGTCACACAAGTTAAAGTAAAGCACT +CCATTGATTTGGTTATTCGCATGATTGGAGAAGATTCTGTCCGGCGCGGAATCTTTGTTT +CAAAAATGCTTGCTTATATTAATGTTATGGCGAAGAATAATCACCATGAATTCACTGATT +GCGATGTCACAGTATCCGAAGATAATCGCGGAATTTATATCGAATCTTGGGATAAAGGTC +ATGTGATTAATATGGCTTGGGCGCTTATGTCTTTTGCCGAATCGCTTGACATGACAATCA +CTACGCACTGAGGATAATATGAGTTTACCAACCAAAGCGTTATTTTACAAAAACGGTAAA +GAAATTAAGCAAGCATTTGTCAACTGTGGCTGGGCTTACGATGAAATATCTGCTATCGAT +CAAGTCATCCGAGCAATAAAGGAACATGATATTGATTATGACGAATTTATTGTGTATGGA +AAGACTTATAATCACCGGGTAGAAGATTTACCAGAATATATTGAGCACCTGCGAGCTGAG +ACGAGAAGAATTAGAACCGAAATGCTTGAAAAAGCAAGAAAAGTTTCTAAATCTTCTGAT +CAAATTATGCAAATGGCTCGACGCGCCATTCCAGAATTGTTAGCGAAAGATATTCTTAGT +GTTCAACCAATGAACGTTGATATTAGAGGTTTGCATGAGTCATAATCTTGAAAATGTAAT +TGAGTTCCAGCGCTCTCTTGAAGGTATCATGAACAAGTTGGCTCTCGGAGATATGGTAGA +CTATAGCTTTGACGAGGCAATCAAAATTTGTCACTGGATGGGGCGTAGGGTTCGTCCGAT +AGGTGCTGAATGGTATATTATTGCAGAGAAGAAAGAAACTCGCTACGCGCTCTGGATTGA +CTCTGGTGACAGAGAATACATTACTCAACCAGAACATACCACTCAGCGTTGGGAAGTATT +GAACTAACCGTTTACATTTTCAGAGTACTGTGATACTATACTCTTATCCTTTAAGAAGTA +GGAAATAAAATGACTAATTTTGAAATTGTTCGTGAAGTTGTTACTATTGCATCTATTTTG +ATTAAATTTGGCCGGGATGATATTGTTGAAAAGCGTGATCACTTCATTGCATTCATTAAT +GAAACTCACGTCGATGATAAAGATTGGAGACGGTTAAATCAAGGAAGCTTCCGCAAGCTG +ATCTATGAATTAACCGTTGATGAGAAAAAATTGCTCGTCGAAGAATTCAACGAAGGATAT +GAAGATATTTACCGCCATCTGGCAATGTACACGAATAATTAACGAGCTCTCCCTAGTGTT +CGCGCGGCTTGGTCGCATATAGATTCAAGTCGTGCGGTATTGATATTTTTATTCTTTTCA +TACCAATACATTGTAACAGTTCCTGCGTATACATTATCTAAATTGAAGAATGGACAACTA +AACATATACTTCAATTCTTGAGTTTTGACAGTTGATGGTAAAAATACAAATTCATTTTCA +GAGTAAAAAACCCTTCCTCCTAAGTGAGTTGAATACTCCTGAGACGTTTTGTCAACAGGA +AATCCACCTAAACTTTTTTCTGAAACAGTAGAAGGAAGTTTCCCTTCATATGCTATCAAG +TCCACGAAGTAGTTCAAGTTTTTTGGTCTAAATGAATATACAGCACTGAATTCAGCACCG +CTTGACACATGTACTATTTGAAGTTGTTCTAGAGCAGTGGTTTCAAAGCGAGCTTCTCGA +TCCTTTTGCTGAATATTAGCGTAGGTCTCATAACTCGAATCTTTGTAAGCATTTAGTATT +GCGTCACTCTTAACCCAAGTCATTCCCAAAATAAAAAACACAATTAACACAAAAACCCGG +GAGAAAAGAACTCTCCCGGTTGCATTATCTTTGAATACTCTATCCCAAACTCCAAAAACG +ATGTCTGTTATTGGCAGACTAATTTTTGAAGCCATAAGTTTTCTCCTTTGGAATATTTAT +ACTCGAGATCCATATATAGTGCCTACGTTTTGCCATGTTGGAGCAGTACCAGCAACAGCT +GCTCCACCACCTGCAGGACCCCATCGCCAGCCGGATTCAAAGTTACCGAATGCCGGGTTA +CCACCAGCAGCTACATCACCACCTGTACCGCCAGTAACCGAACGTCTCGATGATTCGTAT +CTAGAACCTGTCCCTGGAGACGAAATAGAAGCATCTGTACCATAAGCGGTGACGCCTGCG +CCAGACGCTGAGCGTCCGTATATACCAAACGGCCGTCCACCGCCACCACCTGCTGATACG +CCTGAAGATTCCCAGCCTTGACCGCCACCGCCGCCGCCACCGCCGGCTATAGCACCGCCA +TTGTTAATTCTTAAGCGTCCACCAATCCAGTTATGGATACACGGGCCACCTTGTTGAGCT +GCAAATACCCAACCAGTAGTTCCACCAACGCCATATCCACCGCGACCAAAAATTGTTACC +CCGTGAATATTCAACTGGACATATTCATTAGCCAAATCTCCAGGGAATTCAAATAATGGA +ACTGTTGCATCATATGAAACCATGTCTCCACGAACATTAATTACAACTGGAGCATTGCCT +TGTTGACGCATCCAACCAATTAGCCAATCTTTATTATAATTATGGTTAGCTGCCAAGTCA +ATAACAACTTCTCTCGATCGACCTATCAAGTGTGACATCCAAAATGGAGTCCCTAATCTT +ACTTGTGATGCAGCAGTTGACATCCAACGTTGTCCAGTTTCGGCTACTGCACTACTTCCA +ACCCATGGTCCTGTTATAGCCATAAAAACTCCTAGGGCCCGAAGGCCCTTTATTAAATTA +ATGCAGAAATGATTTCTTCAAGCTTCTGAATGCGAGCTTTTAATTCAGAAATCTCATCAG +TGTGTTCGTTAATTGTTGCAGTGTTTAGCGCAATAATACCATTGTAGTTTAGACGAAGTA +ATCCTTCTGGATTATCTTTATCGACTGAAATCAACTCAGGCAATACAGCTTGAACTTCTT +GAGCAATTAAACCGGATGACTGTTCCCAATTAATAGAGCCGTCTTCTTTGAACCCTTTCT +TCTGAAGATATAGATATCCATTCATCTTCTTAAGAGTTTCAGAAGGAGATTCAAATTTAC +GAAGTTCACTCTTAACACGAATATCCGAACGAACATAAAGATCGCGAACATAAGTGGAGT +GTGAATCGCTAGCTTGGTTCAAATCGCCATAAGAAACTAATGATTTACCAGCTTCGGTTC +TAAATCCACCTGTATCTAATACGGCAATTTGCTGAGCATTTGCCCAGTGGGTAACAACAC +CATCACTGGCCCAATATATTCCGGTATCTGTATCACCGATAGTTATACCAGGTTTATTTG +GATATGCTGATGATCCTACTCCGGTTAAGAAATGAGGAGCAGTAACATACCGTTGAGATA +TAAAATCACCTTCAATTGTAAAGGTAAAAACACCTTGACGGTTTCTGTCAGGTTCGTATC +CAAATGTACCGACACGGATAATTCCTCGGCCCCATTGAGCTCCACCGTTACGAAGAGTGC +CGAGTTCAATTTTTGTGTTATACCCATGGGTCGTTGCTATAGTTTTCTGAGCAATACACG +GATAATAATCAGAAACCCCAGAGACATTACCCATATCAACTGTAACAGGAGCGTCATAAG +CCCATTGCTCGGCATAGGCATGACCATATGTCAATGGCCATTGTGTTGCATCAATTAATA +CATGATCTTTATTCAGCAGAAGCTTATTAGTTCTTCCTTGAGATGTTACAGTAACAGAAC +CGTCTTTGCGCAAATTTAGACTATTATAACTATTTGCGTCAGCTTTATTTAAATGAGATG +CAAACGAGACAAGACTATCGTCACCTCTAGATTTACCAACCAACCATGCTGATGTATTAT +CTGCTATATTACCTTGCAAATATCCAGAACTAGCAACTGAGCTGTTTATTTGAAAAACGC +CACCAGCACCATTTGCTTTTACACCATTCTTAAGAATTAATGTTCCATCATTCTTAAGAG +TTGAAATCCAGGTATCAATACCTCCACCGGTCCAATAAGTCAAACCATAAGAATAATTAT +CATTTGTGGCTTCCTGAATCATTGCTTCATGCATGATAGTTCCGCCTTCACGAGCACGGA +AGCGACGCAAGCGGTTTAACGATAATGTTTCACCAGCTGGCTGATTAGTCATGCTTGAAA +CATCATAAGATGTTAAAGGCTTATTATAAGTGTCCACACCTGCAGCAGCATATAATACTG +TACTTGTACTTAATACTAAACCGTTTTTAGTATCTACATAAGCTGCACCGCTACCTCTTA +AATAGTGATTATATTTTCCACCTTGGTTAAGTGCAATATAAGTATTACCATCTCCAGCAG +AATTACCATCTAAGTGAGTATCAAATCTTGCCAATGAAGCGCCATCCGGCGCAATAAGAT +TATTATTACCATAATCAGCATTACTATAGCGAAACACCAAAGAACGAACTGATTCTGTAT +ATCCATCTTGAAATCTTAATGTCGCGGCACCATTTGATACCATAGCATATTTGCCATCAG +CTAACCATTTAATGCCAGTGTCAGAATCACCTATTGCAATGGAGTTATCACCTAACACAG +TAGTTCCATTCCAGACAGTGCTAATAACTAGTTTTTTAGCTACGGGATAACTTAGGAACA +CATCTCCATCTAATACAGACTTTACAGCTCCTGTGACCGGGTTAATACTAAATGTTGCTA +TATCTTTGAAATTAGCTTCTGTATCGCCATAAAAACCGCGAAGAATCAAATCAGTGATAT +TGGCGGTTCCAGCTAGAATTTGCTCAAAACGCGTAACTGAGCCTCCGTAGTTAGTGATAA +CCAAAGGCTTTCCTGCGGCGCTGCCGTTAGGACGTATGATAATTGCTTTAGTGGAAGTAT +CGCCAGAGAACGCAACATTTATTGCGGTCATGTCGCCATTTAAATCATAGCGCCCGGTTT +GAATGTAATTGCCAGTTTGAGTTACGTTACCATCGATATTACCGCCTTTAGCAAATCCTA +AATCGATAATAGCACCATTCTGATCTTTAGTGAACAGCATACGATCGGTTAAGTTAATAG +CCAGTTCACCTTCGGCAAGCTGGACGGCTGTAGGTTTCACTCCAGCCTTTGTAGTTCTTT +TGAACTGAATTTGTTTGATAGTTGCCATATATCCTCAGCTATAATATCCGAAGTCTTGAA +TTGAATCTTTTATAACGATTTGATCGAAGCGTGGAACGTGGCCATTTTCTGTTGCAGGCA +AATTACTAAAAAAATTAGGAGCAGCTAAAGGACCACTCATTGTTTGAAGAGTAGAATTTT +GTAATTGTACTTGCTTAGCATTATCTACCAGTGGTAAACCTACTTGTGTTTTAGTTGGTG +GATTACCAGGAGAAAATGTACGTCCGCTTGAATCATACACCTCATTACCAAGTACATTAC +CCATAGTAATAAGACGATATACTCCATTAGTTTTATCTGGGTCAAAGATAACGAATGGTT +GACCGTCTGGTGTTTCCAGAGCGAATGGACCAGTAACTTGAAGTGAAGCTTTATATGTTA +TATCAGGGCTAGTGCTCTTGGGAACCCCAAGTGTCACTGGATTACCCTGCTTGTCATTAA +AGGTTAGGCCTTTACTGAATCTAACGGATTCTGCATAAGTTCCGCCTTGTGCTTTAGAAA +CGAAATCGTTGTCTATAGCCTGAGGTTTATCTTTTTCGGTATAAACCTTGTATGTTTTGA +AGAGCAGCGTGTCACCGGTTGGAAATAGAGGGAAATTGCCCTTATGCCAAATAGGTGAAC +CACCGACTGTACTGTTTGATTTTAAATCGGCCATAAGTCCTCTCTATTGTTATAAGACTA +TTTATACGACAAAGGGCCTTTCGGCCCTATATATCTTTTTCAAACTCCCGCCAATCTGCG +CTATAAACATGCCCAGAAAGATCTTCTCCTGGTAATGGATTACCATTTGAATCAACTTCA +GGTACAGCCATTTTAAGCAGAGTTGGATCATCTCCAGATACTGCTGAGTTCATTCTTATA +CCATTAACAGAAATTGATGTTGAAGTAGAAGATTCTAATGATCTTGATACTTTAGTCAAG +ATCATCGAACGAACTCCTCCTTCTCCTTCAATATTGGTTCCATTAGTTTTAGATACAACT +ATTGTGTATCGTTCTGCGCCTACTGGAACTTCTATGTTCTTTTCGCTTTTTATCCAACTA +CCTGCGTTTGCTGGATTAGAATCTATATTTTCGCTTTTAATTAAACTTCCAGTATTTGAA +AGCCATCTTAAGCTAACTCGTATATTTCTACCGCCAGGAACCATTTCTGTGCTCTGTAAG +AATTCAAATGACCAAAGCATTTTATCAGAAACTTTTATTCCAACTGAAGCTATAGGAACT +CCCAAAACGTCTGATATTGGATAACGCTTTATCGTATCATTTGAAGAAACATACTCATTA +AAATCTTCTAATGCTCTATAAGAAAATCCGGTTGCGCCTATATCATTTGCCTTATCATAC +ACATACTCAATAGCCGGACGCAGGTCTTCTTGTCTAGATACGCCATCTGAGTAAGTAACG +TTTTCAGCAATCATTCTATTAGCTTCAATAGAATAAAGACCAGCATAAATGGCAAAATAA +TTATCTGTGTGCCATTTACTTGGCCATAAAGTACTCCAAAAGGATTTAAATTTAGCTTCA +ATAGATGGAGATTGATTAAAATCACCAAATGTAGTAAAGACTAAAAGATTTTTGCTTGTT +GAAGGAAATGTAGCAAGATATTCTAAAAATGCTTTATTTGGAGCTCCAGAATCTTCTTGT +AAAAAGTTGAAAGTTTTCTTTGGTGATATAGTATTCATCGCTGGATTGAATTCGCGGACA +TTTATTCCAACACCATCTTGTTGGTCACCAATGCGTTCATCTTGAAAGGTGAAATATCCT +AAAGAAGGGGTCGATAATTCCGACCCCGGAGCGAATGAAAACTTATATTTGATGGAATTA +TTTTCTGAAATAACTTGAGTTTTGGTATATCCTTCACCAAAACTCGCCATCATTTTTTCC +ATTAAGGAATCCAAGTGAAATCTACAGATTGAGTTACCGGGTTCGGAGTTATGCGAACAT +TTCCGATCTGCAACCAATCACGAATGGTTAAGTTATCAAATGCAGAACCACCTGCCGCAA +CAGCACCAATTTCCTTAGAAGTAGGTGGGTTATTGCTGGTGTACATTCTGCCCCATTTAT +CCCATTTTCCAGTCAGGGCGTTGAAGTTACGTATCCAGAAAGTTTGAGCAAAATAAGTAG +CATCGACTGTTGCTGGGCGAGGTGCCCAAATTTGCCAAATGCTATTTTTGTCAATACCAT +GCTGAGTTAACGTACCTGGACCTTTAACTTCGGTGTAATCAATTGCAACTGGATTACCTT +CTTCATTTGTTCCATATACCGGAACAACAAAGCCCGGAAGCTTATTGTACACAGCAGAAG +TTTTAATTGAAGAAGACCATGCGTTCGGATTAGAATCAGTTGGCGGCTGATCTGGAGTTA +CAACACCTTCAGGAATCTGTACGCGAAGCGTAGCATCAATCAAAAGGTTTCCGGTCATTG +AATCGCCAATTTTCTTGACGAAGTTCAATCCAATTTGATTTACCATGTTCTTGGTAGTGA +TTACCGTCGAAGTACCGGTTGAATCTGTCACTGTCAAATTACTAGCATCAGTTGTCTGAA +TATTAGTAGCTTTTGACTGTGAACCAAATTGAATATTTGAAGGGGTAGGATTAATAACCA +TTGTACCACCTTCTACACTAAAGCCTTTCATAGCATCGATTCGATTGCCAGCTTTAAGCG +TTTGTCCAACTTCAACATCACCGTCACGCTTCAGACGAAGCACATCAGCATCTGCAGTGA +AATCCAAGTAAGTTGATTGAGACTGAGCATCGATCAACCAATGGTTCCCTTTGGCATTCA +AATTAATCACAGAGTGACTTGAGTCGTCACCAACGGTCAGGTCTATAGTAGCCATTATGT +TGGTGAACTTCGATGTACTAGAGGACACCAGAGGTGCACTAGTGTTTAACTGCTTGGTCA +AAGTTAATGAACCATTGACAGTCTGATCGATATCACGGCGAATAAACTGCAGTGAATCTA +ATCCATCGAGCTTCTGTGAATCTACAGCAATTGCGTTGATCGGCAAAAAATTCTGAAGTG +TTTTATTCATTTCAAATGGAGATACTGCATATCCAGTTTTGAAATAATTTCCAGCATTCA +GTTTAACATCATCATTCTCATAAAGACCAGTAACAGCATTGAACTTCACGCCAGAACCAG +TTACTTTATCACCAACGAAGGTTAATGCTAATTCAGTCAATTTAACTGGACCACGACGAG +TAGGCGTAGCTTCCCAATCAGCTTGTTCTTGAATAGCATACTTCAGATGACTAGGAGGAA +CTGCTTTATTACCCACAGTACCAGTCGTCGTTTCAACACGAGTCGCTATCTGAATAATAC +CTTCTGTCGTTTCTGAAGTCTTCTTATCTTGAAGCTTTTTAGGAGTAATGATAGTTTGGT +CATCTACACCAGCATCTACCAGAATCTTAGTAGCAACCGCCAAAGTACCACGCTGCGTCT +CAGATGCTTTCTTAATATCAAGTTTATAATGGTCCCAGGAGTTACCTGATTCAACCAATC +CAGACAACGGTTCAACAGAGTGTCTAGCAGGATCAGAGAAATAAGTTTTAATTTTAAACG +GAGTCGAGATGACATCGTCCAATGCGCCAGCATTGAATTCTGGTTGAGTAGCGATACGAG +ACAATCCAGTTAAATCTTGGGTTGCTTTACGTCCAGATAATTTCTTAGGCGTAATGTATC +GGAAATCGTCAGTACCAGCATCAGTTTCTGGCTGAGTAGCTACTTCAGTAAAACCAATTC +TTCCTTCCGTTGCGGTCTTCTTATGTAACTCAACTGGGGTTACAACAGTTGGAACTTTAG +GATCATGCACAGTTCCATTAATTACTTCGTTCTCAGTTGCTAAATAAGTACCACCTTGTG +AGGTATAAGTTGCTTTATTTTCAAACAGAGAAGCTGGAGTTACAGCTTTTGTATTTTCAG +TATTATCGTATACGTTAGTTCCTTTAGTATCACGATCTACACCAGCGACAGTGGTTATTC +CGGTTTTAACTAGAGCTAAAATACCAGTCAATATTTCTGAAGCTTTACGATTATGCAACT +TTTTAGGAGTCACAATCGTAGTATCATCAGTTGAACCATCAGTTTCTGACTGAGTAGCAA +TTTCAGCTACACCACGACGAGTTTCAGTAGCTGTTTTCTCATTCAGCATTGCTGGAGTAA +CAATTACATCATCAAGATGAGGACCAGTGGTTGGAGCTTGAATTTCAGTTAAAGTAACTA +AACGAGCAATACCACGGCGAAGTTTAGTAGCAACTCTTTTAGCTAATGTTTCTGGAGTAA +TTGCTAATTCTTTTTCTGGGTTATTTTCAAGATCTACTTGAGCTTGAGCTTCAGTAGCCA +AAGCAATAACACCTAAACGAGCACGAGTAGCATCGGTCTTTGAATCGACACGTTCTACTG +TCGGCGTGTTGTCAGAAACAACCCAGTACTTCAATCCAGCATCTTCAATATAAGATAAGT +CTAATACTGGAACGTAAGAAGTATCACCATTGAAGCTCAGAGTTTTATTTTGAACCCATG +TAGCATCTGGTGGATATTCTGAACGTTTAGGGAATTGAAGCAATGAAACTGAAGCTGCTA +TAGTATCTGTCCCAGTACAAGTGATATCAACTGTTTGACCTTTGCGCATATAGTTCAAAG +AAATCTTAACAGTATCACCAATAGCTGGTGCCGGAGGCAAAGTTATTACAACTTTTTTAA +TAGTACTATTATTTGTTCCAAATACCATCACATGATCATTCGGACGTACTTCAGTATCTT +CTTTTATGATGCGAAGACGAGTGCGGAGATCACCATCCCAAATTCTCCACAGCTTTTCAA +CCGCGTCAAACACGATAAATCCATCACCAGAAGAACGCACTTCTAAAGAAGTTGTTCCTA +CTGATCCAATAGAAGTATTAGCATCGTAAGTGCTTACAAACATATGGAACAGAGGGTTCA +TTCCATCCATGTCTGTGAAATTAATAATATCTCCATCATTCGCAAACTTAGGAAGAGTAA +CACGAATAGGAGCCCCAGTCGTATAACGACGAACGATGAATTCATTAGCTTGTGCTTCAT +GAATACTCGCCGGGGTAATGGTTGTTGCCGTGCGCTCATTGTCGCTGACATAAAGTTGCC +ACAAACGATTACTGAAAACTAAAACCATTTGACTATACGGATGAGTCATTCGTACACTGC +GAACTTGAGATCCAAGCCAAACGATAGATTGAATAGAAGCATCAATTGTTACATCAGCAT +AACCTGGCTGACCACCTATATCTTTTAAGAAAATAGTATCGCCATCTTGTGGATTATTAG +GAAGAACAAATTTAACAGTTCCACGATTTTCTGTATCAACAGAAATAAAGTCCCCTGATT +TAAGGACTACTGTTCCAGCAGATTCGGTTTTCCATTTTGCGTCAGTACGCAAAGCTGTCC +AATAGAGTTCGTTGAAAGCGCCAGAAGGTTTAATGATTTCGCGATTTGCAACCCAAATGC +GGTTATCATAAATCACTGCGAAATTCTTTGGATACCATCTGGTTTCGTCATATTGTTGTA +ATGTATTTTCTTGGACGAGAAATTCAACGTTAACACCATCGCTCGGAATAGTACGATCAG +CGGTTTGTACGTTAATTACTTTCTCACCCGCCGCATCCAGACCTTCTTGAGCTCTGAATT +TTCTTTTTAAATCGGCCATGATGACTCCTGTTTGCTATTATAGAATGTATTTATAATTGA +TCTGTCACTAATTGAACGAGGTTCAAATGAATTTAAACGAAATGTTCGGGTCTGAAGAAG +ACCAAAAAGAAGGTATTGCTTTTGTCGACTTGTCACAACTTGCACTTGCTGTAGCGTTGA +ACACATTTGCGGATGGTGAAAAGATTCCAGTTCCAATGGTTCGGCATTTATTCTTAACTA +CGTTAAAGAAAAACGTATTACAATTCCGTAAACAAGGTTACACTAAAGTTGTTATTTGTG +TAGATAACGCAAAATCTGGCTACTGGCGTCGTGATTTAGCTTATTACTACAAAAAGAACC +GTGCTAAAGGTCGTGAAGAATCTAAATGGGATTGGGAAGGTTACTTCACCGGTATTCGTA +CTGCAGTTGAAGAATTTGAAAAGTACATGCCTTATGTCGTCATGAATATTGACAAATATG +AGGCGGATGATCATATCGGTGTTCTTGTTCCTTATCTTTCTTTGAAAGGGCACAAAATTA +TGATCGTTTCTTCGGATGGTGACTTTAAACAGCTGCACAAATATCCGAATGTTAAGCAAT +GGTCTCCAATGCATAAGAAACTTGTTAAGATTAAACCTGGTGAAGCTGATCTTGAGTGCT +TAACTAAAGTTCTTAAAGGCGACCGTAAAGATAACGTTGCTTCTGTTAAAGTTCGTTCCG +ATTTCTGGTTCACTAAACTAGATGGTGAAAGAACTCCTCCATTTGCAACTAAACTTTTGG +AACAATGTCTAGATGCTGGTCCAGAAGGAATGAAAGAGCTTTTAACAGAAACAGAATACA +ATCGATACCTCGAAAACAGAGTACTTATCGATTTTGAATATATCCCAGAGGATATTGCTA +AAAAGATCATAGATTATTATGAATCATACAAAATTCCTCCTCGCGGTAAGATCTACACGT +ACTTCGTGAAATCAGGTCTTTCTAAATTAACTTCAAAAATTAATGAGTTTTAAAATGGCT +AAAGAAAAGAAAGTTGCAGTTGAATTTGATGAAGCAATTCATGGTGAAGATCTGCGTAAG +AAAATCAAAGAAGCTTCAGATAATATGCTGAAGATCTCTGGTTATAAAGTACTGATCGCT +GATCTTCGTAATTCAGCCAAAGATGATCTTGGTGTTGAAGGCAAAGTATTTAATCAGTTG +TTAGCAATGTATCATAAAGATACTCGCGATCAATTTGAAGAAGAAAAAGATAAGGTGGTA +GAACTGTATGACTCTGTTTTCACTAAATGACGAATCTGTTCAAGAAAAAGCCAGTGTTGA +TGAATTGCTTGACAAACAACAAAATGGCTTTACTATTGAAGCGTTAGTCAATGAACAGGG +TCTTGGTTATCTAGAAGCCACTACAGCTTGGATGGAGGAAAACTCCATCCCTGAGACTCA +GTTCTCAAAGTATATTCCTTCTGGAATTATTGAAAAAATTCGTTCTGAAGCCATTGATGA +ACACATGCTTCGTCCTAGTGTTTCACGTGGTGAAAAGACTAATACATTAGACTTTCTGCT +ATGATTAAAATCCGCATGCCTCCAAATAATAATCGTTACATCAACGGTAAATCAGTTTAT +CTACTTTACTTGATGTTGAAACAACATTTTGCTGGCAAATATGACGTTATAAAATACAAC +TGGTGCATGCGGGTTTCTGATAAGGCGTATCAAAAACGCCGTGATCGTTATTTCTTCGAG +AAACTCGCAGAGAAACACACCCTTAAAGAACTCTCACTCATTTTCATGAGTAACCTGGTG +GCTAACCAAGATGCATGGATCGGAGATATTTCTGACGCAGATGCTCTGGTATTCTATCGA +GAATACATCGGTAAATTGAAAATGATTAAGAGTCAATTTGAAGATGATGTGAAAAACATC +TATTACTTCTCCAAAAAAGTTGAAGTCAAAACGCTTAATGAGATTTTTGAATATAATAAT +AAGGTAAATACATCTTATATCTTCAAACTTCTTCAAAGCAACATTATCTCATTCGAGACA +TTCATAATTCTGGATTCATTTCTGGACATTATAAATAAACATGACCAAGCAACTGACAAT +CTAGTCTGGTCAAACTATTCAACGAAACTAACTGCATATCGCAAGATATTGCAAGTGGAT +TCGTATGAAGCTAAACAACTGTTCATAAAAACTGTAAAAAACTGTAAATACTAAGGTAAA +AATATGTCTATGTTCAAACGTCGTAACCCTGCTGCTCTGCGTACTCAACTGGATTCTCTG +TCTGGTGGTAATAAATCTTTCGCTGACGCTGATAAAGGTGAATGGAAACTGAAACTCGAT +AACGCAGGCAATGGTCAAGCGGTAATTCGTTTTCTGCCTTCTAAGAACGAAGAAACTGCT +CCGTTTGCAATTCTGATTAACCACGGCTTTAAGAAAAACAATCAGTGGTATATCGAAAAC +TGTACTTCTACTCACGGTGATTACGATTCTTGCCCGGTATGTCAATATCTGTCCAAGAAT +GATTCTTATAACACCAACAACGAAGAATACAAACTTCTGAAACGTAAAACTTCTTACTGG +GCGAATATCTTGGTTGTTAAAGATCCTGCTGCTCCAGAAAATGAAGGCAAAGTATTTAAG +TATCGTTTCGGTAAGAAAATTTGGGACAAAATCAACGCAATGATCGCTGTTGATGAAGAA +ATGGGTGAAACTCCAGTTGATGTAACCTGCCCATTTGAAGGTGCAAACTTCGTCCTGAAA +GTTAAGAAGGTATCTGGTTTCAGCAACTACGACGAATCTAAATTCCTCGGCCAGTCTGAA +ATTCCAAATATTGAAGATGAAGCTTATCAGAAAGTTCTGTTTGATAGCTCTGTTGACCTG +TCAGAAATGACAGCAAAAGACAAATTCAAATCTTTTGATGACAACCTGAAGAAATTCGAG +AAAGTAATGGGTACTGCTGCTATGGGTGGTAATGCGGCTCGTGCCGAACGCCAAGCAGAT +AAAGTAGCTGATGATCTTGAAAACTTCGACGAAGATCTGGCGAACTTCAGTGCCGGTTCT +ACAACTCCAGTAGATATCCCAGAAACTTCATCTAGTTCCGATGATGACCTGGACGATATT +CTGAACGGTTTATAATAAGAAGGAGCCTTCGGGCTCCTTTTGTTGTTTCTGGACTCTGCC +GTTTACATTCGTTGAAAGTAGTGTTATGATAGTCTCGTAATCTACTAGGAGTAATAAAAT +GAAAGCATCAGTAATTCTTTTCGTAACTTGGATCGGTGATTTTGAAGTAGACATTGAAAA +GTATAACTTGTTCATGGATGACGTATCACACGAGTGTGGCGGTTACGAAGTTACTTTGAT +GGGTGAACATGAAAAACTCGTTGATTTTCTGACTGATTGCTATATTCCAGGTATGGAAGA +ACAAGACGTTGAAGAGTTAATGAATTCAATAACTGTTTACAACGAAGAAGAACTGTGATA +CTATAATCTAGTCAACAACTGAGGGAAATATTATGGCACGTTTAGAACTTGATATTGTAG +CTGAAGTTCATCGCAATGAATACGGTTATGCAACTGACTTGATCTTCGATGATGGCTCTC +GTTTTTACGATGTTGATCACGGTCTCGACTTTGATCTTATCGAAGAACACGGCCCAGGTG +GCGGTTGGCCAGTGATTTATCTTCGCGGTTCAGAAGCAAATATTCGTAAGTGGCTAGAAG +ATAACCAGTGGGAAGACATCGATTGGATGCTTGAAGAATTTCTTGAAAAAGGTGAATAAA +TGAGTTTAGTTAAAGTACGTTTATTGAATGACGGCGGGTTCAACGGTTTTGTTGATACCA +AATTTCCCGTAGTAGTTATGGGTCAATTGGAAGAAGACTATGGCGCTGTTATTATTAAGC +CAGACGAGTTAAGACGTGTTGGATATGATGTTGATAACTATATGAATAACACAGTGAATG +GCACAAGAACATTTTTTCTTGATGCAGAAGCTGAATTAATTTAACTTCAGCCGTTTACAA +CGTTGATAGGGTATGATACTATTACACTATCAACTAAATGGTAAACAAAACCTTGGAGAA +CAAAATGAAAACTTTAGAAATCGTAGTTAAAAATGTTGAATTAGCTCGTGAAATCGCTGC +ACAGGTTAAAGCGGAAATCGTTTCAGAAAAATTGATTTCAAAATGCACTCTGATCGTTCT +TAAGGGTTCTTTCGATCAGCTGATGGATTTCAATGACGAAATGTTCTTCGAAACTAACCC +AGGTGCTCATAAAGAATACCTGAAAGAAATCATGGCTTAATGAGGAAAATATAATGGAAA +CTGGTAAACTCTACACTTTTAAACCGTCTATGACTCATCTTTTTGTCGCTGAAGCATGTT +CAAACAAATCTATGGCTGAAGCTATTATGTTTAACGGCGGCTATTTTGAAGTTGAAGCTA +TGATTGTTTTGAATAATGAAAGATATGTCACTGCCGTGAAGTTCCCTAAAACTGGCAAGT +GTTTAAACGATGATGGTAGCGGAGACGAGTATTTTGAAATCTATGAAAATGAGTTCAAGT +ACTTCACTGAGTACAATGAAGTCGAACTTAATGACGGTGTTCGTTCGATGACACTTGATG +TCAACAAAGCTAACGCTGTTGAAATGATTCAACTCATTCAACAAATTTTCTTAAAATAAT +GTTTACATCGGTGTAAGATTGTGTTACTATGATCTTACACCAACAAGGAGAATAAAATGA +AACTTCAACGTCAAAGCATTAAATTAGGTTCTGGTTATCGTGGTAAGTGGAACTTCTGCA +TCCTGGACAACAATCCAGAAGAAATCGAACGTGTAGAAGAAATCCTTTGTGGAATGGACA +CTGGCTTCTCTGTTGGCGGTGAAGCTAAAACCTGGGGTGATTATTGCGACCAATGCCCAT +GCTATGAAGACGGTTATAGTTCTGGCTTTTGGATTGACGTTGAAGATGTCCCGGCTTTCA +AAGCTGCATTCAAACTTGCTAAGGCGAAGAAATAATGGCTGATATTTGGTGTTCTGCTGC +TCCTGTAGTTAATATTCGTTGTCAGTTTGATCATATTCCCGGTGTAACGCATATTTCTAT +GCAATACGAAGATGGACGCGGACAAAAAGTGTTTTGTAAAATTAATTTTTCAGGTGGCTT +CGGTCCAGAAGTAGCTTTAAGCGAAAATGACCTTAATGCGGTATTAACTAATGATACCAA +GTTCGGAACTTTGGGACTATTTAATGAAAATGTTTCAGTTGAGCTATGTGAAGCTATCAA +TAAAGGATTTGTAATGCTTCGTAAAATGGTAATGGCGGCTAAGAAGGTAACATCATAATG +CAAATCACTATGGATAAAGATGAATTTGATAAAGCTATCAAAGAAGCTACTATAAACGGA +AAGGCACTTGCACTCGACGAATTAAATGAAGTTATGACAGCGTCATATGAAGATATGACG +AAGGGGTTTTTCAGTCGAGTTGGACGAGAAGCTAACATCGAAATGACTCGGCGCTTTCTG +AACATCATCAGAGAAAGGATTAGAGCTCTATGAAGATTCTTTCTGATTGGGAATGCAAGT +ATTGCGGAAGCGGACTTTTGTTTGCGGGTGGTATTTGTCCTAACTGTAAAATGAGGCAAG +GATAATGGAACTTAAACGTAAAACCATTGAGTTGATTGAAGTTTCGCCTGAATTCTGCTT +AGATACACTTTACGCTAATACTGCTTATGACACCCCGTTTGTTATTGAAGACAAATACGG +GATGCTTCATACCGTATCATTAGCATGGTTCCAAGGTCGTTATAATAATTATCGCTTTGT +TATTGGTGATGATGTGTTCAATGACTTATCTGATTTTGTTCATCCAAAAGATGTTGCTTT +TATTGAGGTGCGATAATGCTACAATTAGTTTATGCAGTATCTCCGACACGTTCGGTTGAA +GGTCAAAATGAATTAGCTTTTGGCCTTGATGATGGTCTACCATGGGGTCATATTAAGCAA +GACCTCCAGAACTTTAAAGCTCGCACCAAAGACACTATATTGATTATGGGTGCCAAAACA +TTCATGGGATTTGATGAGCCGTTGCCTGGACGCAAGTCAATTGTTGTCCAAGATATGTCT +CGTCCATTAGCTACTGCAAAGAATGGTTTCTTCGCCGATGCTTATGTAAGTGAACTTGAA +TTCACTGGGTTCTTGGGCGGCGATATCATGACAGCCAAAACTTCATATAGTCAATACCTG +ATGTTTGACCGTGATAAAGACTATTCTGTTATTGGTGGAATTGAGCTTATTAAGAAAGCC +ACTCCACACGCTGACCGAATTATTCAGACAACTATTCGTAAAAAGCATCGGGTTAATTCA +ACAGTGCAATTTCCTTACGCTACTTTCTGGTATCCACAAGAAGAAGCCACTGGATTTAAG +TTGACTGAAACTCACTGGTGGGCTATCGACGAATTAACTAATATCTCTGAATCGGTGTAT +GTAAAATGAGCCAAGTAAGAATCGCTTTAATCAAAGAAGATAAAGTTCAAAAGAAATGGA +CCGGTGCCCAGAAGAACATGGTTCATGGAATTTATGAGCTGACTTTTCCTACAAGTTATC +TATGGGTCTGGCAGGGTGAATCTAATTTGACTGTAGTTCCTGGATTTGGGCAAGTTGAAT +TAGGCCGTGATATGAAAGATGTTCTTAATGCTATCGAGACTGGAAACATCCAAGTTAAAA +ATGGCATTACAACTATCATTGGACGCTTTGCTAAGAAAGGCGGAATTTTATTCTTCAATC +CGGAGACAACTCGTGGCGAAATTAGTTTTAGTTAAAGCAGATGACACTCAAAAGAAATGG +GCTTTAGTCCTTCAAGGAAATCCACCTACTATTGAGCAGTTCAAACGCGACCCATCCAAG +TATACTGCTTTACCACATGGATTTTATAATGTACAAGCTCCACTAGAAGATATTTGGGAA +GCTGATAACGGAGTTCGAGTTAAGAGCAATGAAGGCGTATTCAAGATTCTTTACTCTAAT +CGCAAATTCATTGACTTTATCGACCAAGGGCTAGTCAAAGTCTCAGGTGGAGTTCTTAGT +ACTACTGGGCGTTTCGATAAGCGTGGTAGTGAAATTCTCTTTATTGTAGGTAAAGAATGA +AACAATACCAAGAACTAATTCAACATATTTTTGACAACGGATATGAGACTGATGATCGCA +CTGGGACAGGTACTATTGCAGTCTTTGGTACTCAATTGCGTTTTGATTTGCAAGAAGGAT +TCCCGGCAGTTACTACGAAGAAACTAGCATGGAACGCTTGTCGTTCTGAATTACTTTGGT +TCCTTCGTGGGTCAACTAATGTCAATGAACTTCGTCAAATTCAACATGGCTCTCTTATTG +AAGGGAATACCGTATGGGACGATAACTACAATAATCAAGCAATTGATATGGGTTATTCCG +GCGGAGAACTTGGTCCAGTTTATGGTAAGCAATGGCGTGATTTTATGGGTGTTGACCAAT +TGAAAATGGTCATTGATCGTATTAAGCAAATGCCTAACGACCGCCGTCAAATTGTTACAG +CCTGGAACCCAGTTGATATTCCAAAGATGGCTCTTCCTCCATGTCACATGATGTATCAAT +TCAACGTTCGTAATGGATTCTTGGACCTTCAATGGTATCAACGATCTGTCGATGTTTTCT +TAGGACTTCCATTTAATATCGCATCTTATGCCGCTCTGATTCATATTATTGCTAAATGCA +CCGGGTTAAAACCAGGACATCTGGTGTTCACTGGTGGTAACACTCATATCTATGTGGATC +ATATCAGTCAATGCAAAGAAGTTCTCCAGCGTGACCCTCTGGAACTCTGTGATATACAGA +TTAACGGTCTGCCGTATAAATTTAGACATCTTTCTACAGAAGAACAAATCAATCGAATCA +CTAATCTTCGAGCTAAAGATTTAATGCTCATTGATTACAAATCTCACCCAGCTATTAAAG +GTAAAATGGCTATATGAAAACTGTATTTGTGAACGAAAACCGTACTAAAGAATTTGGCGC +AACATTAGAACAAATTAATCCAATTCACCTTGTGGTTGGCTCAAAAGTAATGGTCGATGG +TTGGTTCTACATCGTAGATGATAGTTTTGTTTCTGTTGAACATAATAAAACTCCAGAAAT +GGTAGTGGTGGTCCACAAGGCATGAAACTTTGTCGAGTCGTGAATAAATATAAATCCGAT +TTCGACGTAAATATCCAACGTGGTACCATGTGGGGCAATGACGTTGGTAAAAATGCTGGC +AGCCGTGAGGCTGCCATTGAAGCCTTTAAAGAGGACTTTATCCGTCGCATTCGGTCAGGA +GAAATAAAACGTGAGCACCTAGAAACTCTCAGAGGAATGAGACTAGGTTGTACATGTCAC +CCGCTTAATTGCCATGGTGATATAATAGCTCATATAGTTAACAGACTTTTTAAAGACGAC +TTCAGAGTAGAGGATTTATGCAATTAATTAAGTCATCAGGAATTGGACAAGATTTTATCC +CAGAGAAATTCATCAAGGTTTTGTCTTGGGCAGCTAAAGATACAAACGTAGACCCATATG +AATTGTATGAGCAAGTTAAGCCTCATATAGTTGACCGTATGACTACAAAAGAATTACAAC +GAGCAGCTATTAAAGTTGCGGCCAACCTCATCACTGTAGACGAACCGGACTATCAATATG +TTGCTTCTAATCTGGCTATGTTTGCGCTACGCAAAGAAGTGTACGGACAATTCGAGCCAC +CATCATTTATCGACCATATTTCTTATTGTGTTAATGAACGCAAATATGATCCGGAATTAT +TGTCCAAATACAGCGCAGAAGAAATTACTTATCTGGAATCGCGAATTGTACATGACCGAG +ATTTCGAATTAACTTATGCTGGTGCGATGCAGCTAAAAGAAAAATATCTGGTCAAAGATC +GTTCAACCGGAAAGATCTACGAGACTCCTCAATTTGCTTTTATGCTGATCGGTATGGCAC +TGCACCAAGAAGAAAAAGAAAATCGTTTAGCTCATGTCATTCGTTTTTATGATGCTGTGT +CTACTCGACAAGTTTCTTTGCCTACTCCAATTATGGCTGGTGCTCGTACACCTACTCGTC +AATTCAGCTCTTGTGTTGTTATTGAGGCAGGTGACTCACTTAAGTCAATTAACAAAGCCA +GTGCAAGCATTATTGAGTATATCAGTAAGCGTGCAGGTATTGGCATCAATGCGGGGATGT +TACGTGCAGAAGGTTCAAAAATCGGAAATGGTGAAGTCAAGCATACTGGAGTTATTCCTT +TCTGGAAACACTTCCAAACTGCAGTTAAATCCTGTTCCCAAGGCGGAGTTCGTGGTGGTG +CCGCGACATTGTACTATCCAATTTGGCATCTTGAAGTCGAGAATCTACTCGTACTTAAAA +ACAACAAAGGCGTAGATGAAAACCGTATTCGTCACCTGGATTATGGTCTTCAAATCAATG +ACCTGATGATGGAACGTCTTGGTAAAGATGAGTACATCACTCTGTTCAGTCCAGATGTTT +CTGGCGGTGAACTGTATGATGCTTATTTCCGTGACGAAGATTTGTTCCGCGAGTTGTATG +AAGAATTAGAAAAAGATCCTCTGATTCGTAAGAAGAGAATTAAAGCCACTGAACTGTTTG +AATTATTCATGACAGAACGCTCAGGTACTGCTCGAGTTTATCCAGCATTCACTGACAACA +TGAATAACTACACTCCATTTATTCGTGAAACTTCTCCTATTAAGCAAAGTAACCTCTGTT +TAGAGATTGCTCTTCCTACCACGGATGTTGGTTCTAAAGATGCTGAAATTGCTCTGTGTA +CTTTATCTGCTTTCGTCTTAGGAAACTTTGATTGGCAAGACCAAGACAAGATCAACGAAT +TAGCTGAAGTTCAGGTTCGTGCACTTGACAACCTCTTGGACTATCAGAGCTATCCAGTAG +CTGAAGCATTGAAAGCTAAAGAACGGCGAGCATTGGGTGTTGGTGTAACGAACTATGCTG +CTTGGTTAGCTGATAACTTTGCTACATACGAAGACGCTAACGATTTGACTCATGAATTGT +TTGAGAGATTACAATATGCACTTATCAGAGCCTCAATTAAACTCGCAAAAGAAAAAGGAC +ATTGCGGTTATTATTCAGAAACTCGTTGGTCTCGAGGCGAGTTACCTATTGACTGGTACA +ATAAAAAAATTGACCAACTCGCAGCGCCAAACTATGTCTGCGATTGGGAAGCATTACGTG +CAGACCTTAGGACTTACGGAATTCGTAACTCAACTCTGTCCGCGCTCATGCCGTGTGAAT +CATCTAGCCAAGTTTCGAACTCCACGAATGGTATTGAGCCACCACGCGGGCCGGTAAGTA +TTAAAGAATCTAAAGAAGGTTCTTTCCGCCAGGTAGTTCCGAATATTGAGCATAATGCTG +AACTTTACGATTATGCTTGGCTGATGGCTAAACGTGGTAATAAAGGTTACTTGACTCAAG +TTGCTATTATGACAAAGTGGCTTTGCCAATCAGCTTCAGCGAATACTTATTACGATCCGC +AGAACTTTGCTAAAGGTAAAGTTTCGATGGCTCAGATGCTTGAAGACTTGATTTACTTCT +GGTTCTTCGGTGGTAAGACTTTGTATTACCATAACACCCGCGATGGTTCAGGAACAGACG +ATTACGAAATCGAAACTCCAAAAGCCGATGACTGTGCCGCTTGTAAACTATGATATAATT +GGACCACGGATGGTCCTAGGAGTATATTATGGGATGTAAAGAATTCACTTTACCGCCTCT +TCCAATTCAACCACGGCCAAAGCCGCCAAGGGTTCCTAAATGAGTACAGTTTTTAACACT +AAACCAGTTGACGTATTGAATGAGCCAATGTTTTTTGGTTCAGGTCTAGGTATTGCTCGA +TACGATATTCAACGCCATAAACAATTTGAAGACCTAATTGAAAAGCAGTTGAGTTTCTTC +TGGCGCCCAGAAGAAGTTAACTTAATGACTGACCGAGCTCAATACGAGAAGCTTCCAGTT +CATCAGCAAAATATTTTCATCAACAACTTGAAGTATCAAAGTCTGCTGGATTCAATTCAA +GGTCGTGCACCTGCTGCTGTCTTGTCTGCTCTGATTAGCGATCCTTCTTTGGATACTTGG +AACCAGACTTGGACGTTCTCTGAAACGATTCACTCTCGTTCCTACACTCACATCATGCGT +AACTTGTTCAATGATCCAGCAAAGATCTTTGATGAAATTGTTCTTGATGAAGCAATCATG +AAACGTGCCGAATCAATCGGTGTATATTATGATGACGTTCTGAAGAAAACTCGTGAATGG +GAAAATGCTAAAGAACGGTGCTTTAACCCTGACAACTATGAAATCGTTGATGCAAAACGA +GACTTAATGAAGAGTCTTTATCTCTGTCTTCATGTAATCAATGCTCTTGAAGCTATTAGA +TTTTATGTATCTTTCGCTTGTACTTTCAACTTCCATAAGAACATGGAAATCATGGAAGGT +AACGCAAAGATCATGAAGTTCATTGCTCGTGATGAACAACTTCATCTGAAAGGCACTCAG +TACATTATTCGTCAGCTTCAACAAGGTACCGACGGTGAAGAATGGGCTGAAATTGCTCGT +GAATGCGAACAAGAAGCAGTTAACATCTTCATGGAAGTTAACCGCCAAGAAAAAGAATGG +GCTGTTCACCTCTTTAAAGATGGCGGATTACCTGGGCTGAACGTTGAAATATTATGCAAC +TTTATTGATTACTTGACCATGTCTCGTATGAATAGCTGCGGATTACCATGCCCAATCAAA +GATGCTCCTACTCGTCATCCAATTCCTTGGATCAGGGAATATCTGAACTCCGATGCAGTT +CAATCAGCACCCCAAGAAGTTGAAATCTCTTCTTACTTGGTAGCTCAAATCGATAATGAC +GTCGATTCAGAAGTAATCAAATCTTGGAAAAAATACTTTTAAGGTGAGGGCTTCGGCCCT +CTTTTCTCATGAAAGATATTGCTAACGAATTTTCATTTATAAAATATGCTCAACTGGAGC +TTCTACCAGATGCGACTATCGCTTTAGTAGAAGTGCCCAACAAGAAGAATGTAGTATATG +CTATTTCTGTTGATGATATATTAGTCTACATCGGAAAGACGAAAGACCTTCGGAAACGTA +TCAACTACTACCGGACTGCTATCAACAGAAAGGACCAGACATCTGATTCAGTGAAGTCTG +CTAAAATCCTTGAAGCTTTGATGGAAGGCAAGAAAGTAGAGTTCTATGCTCGGCAGTGCT +TTAATTTGCTTATTAACAATGAACTTGGGCAGATGTCAATTTCCACAATGGACCTTGAAG +AGCCGATGTTTATCAAGAAATTTAATCCATGCTGGAATACTCAGCATAAGAGGAAAACAA +AATGAAACTTGAATTAGACCCAGATGTTCGCCCAGAGTTTTTGTCATACACAAACACATT +CAAATCCAAGTATGGCGAAGTCGAAGTGTGGGCTTGTAAAACTTCTGAATCATTTGGTAT +TAATCAGACCAATGATAAAGCAGACGAAGACATTATCGTCATGGATAAGTATGATTTGCT +TAATCTACAGAAACTCGTCAATCATGCAGTAGAAATTATGGAAGGTGAATAATGAAACAA +CTTTATGAAAATCTGATGGCTCTATGTGATGCGAAGGATGAATCCAAATTCTACTTCACT +GACGATGTGTCTCCATCTGGCAAAGAATATCGAATCTTTAGCTATAACTACGCATCTTAT +TCTGATTGGCTACTGCCTGATGCTTTAGAATGCCGTGGCATTATGTTTGAAATGATGAAT +GGTAAGCCAGTTCGAATTGCTGCTCGTCCAATGGAAAAATTCTTTAACTTGAATGAAACT +CCATTCACAATGAATCTTGACCTCTCCAAAGCGCAGTACATGCTTACAAAGGCTGATGGT +TCTTTAGTTTCTTCATTTATGGACGGTATGCTGCTTCGTTTCAAATCTAAAAGCTCCATT +AAATCTGAGCAAGCTCATGCAGCAACTGCGATTTTGACGATGCCTCAACATGAAGCCTTA +TTGGCTCGTTTGATTGATCTTTCTAATGAAGGTTTCACTGCTAACTTTGAATACGTAGCA +CCAGACAACCGAATCATTCTTCCTTATCAAGAACGCGAACTTATTCTTCTGAACATTCGT +GATAATGATACTGGCGAGTACGTAGATTACGATGATATCTATGCAGATGGTGTTCTTCGT +CGGTACTTGGTAGAAGCTCAACCAGTTCCAGAAGGTGATTTTGTTTCTGATATTCGCAAG +ATGGAAGGCATTGAAGGTTTTGTCTTTGTGATGGAAGATGGTTTACGTTTTAAACTCAAA +ACTGAATGGTACTGTGCGCTGCATCATACCAAAGATTCTATCACTAAGAATGATCGTCTT +TTTGAAGTTATCGTAGCGAATGCTGTTGATGATCTGAAAGGTATGTTCGTTGATGATCCT +TATTCTCTGAACAAGATCAACGTTTTCGAGCAGATCTATGTCAATTATCTTGGTGCTGCT +TTTGAGCTGTGTCATGAAACGTATGCTGCTCTTCGTGGTAAAGATCGCCGCGAGTATGCG +ATTGAAGCACAGACCATCGTGAACAAAGCTCAAATGCCAATGTTGTTCAGCGTTATCATG +GCGATGTACACTGGCGGTTGTGACAACGAGAAGCTTCTGGCAGGTGTCAACAAAATGTTC +CTGAAAAATCTCAAGCAGTTCATTCCAACAGAATACATCTAACCGTTTACATCTCCATTT +GGTTGTGTTACTATGATTCTACACCAACCAAATGGAGAAACAAAATGATCAATCAAATCA +CCGCAAACGAACTGGTAGAAATCTACGAAGGTACTCACCACGATGATATCCGCGTATTCA +AAGGACATCGTCCTATCGGATACATCACTGATCTTCGGGTTGCATACTCTCGTGATCAGA +AACGTCAGAAAGCTCGTAAAGAGTACACTAATCGTATCAACGAAGAACGAGCTGAGAAGA +TGCCAGAAGCTGTAAATGAAATGGTTGACTTTTTGAAAAACAACCTTCGTATGAATGCTG +ATGTAATGATTAACATCTCTCAGCCTAACGTTCATGTAAATGGTTGTAAGTGTTATGTAA +TTGTTGACCCGATTCGTGGTAAGCATCGTCTGGGTGTATCTAACCCGAACAGAACCGCAA +GTGAAATGGCTTTAGATGTTGATCCATCTTTCAAAATCCAAGAATCTCCTGCGGAACATC +ACATCTTAATTAATGGTCTTTCGCAAGATGACATCGTGGAGCATATCCGCAAATTATGTT +CAAAATAACAAATCTTCATGTAGCTGCAATAGCAATATTACTTTTTGGTTACGGAGTAAT +TAAGGTTCAATCTGCTAGAATTGATGGGTTGAAATCTGATCTCCAGACTATTCAGCAAGT +TGCAACTCAACAGGGAGAAGCAATTAAGCAGCTTAAAACTGATTATTCTATCATCATGAA +ATATGATGAACAACGTAAAGCTAATCGCGTTGAAGCTGATACTTCGAACGCTAAAATGAC +TAAAGACTCTAAACGAGAAAATGTGGTCAAAGCTAAGCCGAAGTTAGTTGAAAAGCAAAT +AAATGAGTCGTTCAATAAGTTCGCTCTGGATTTACAGGAGACTACTAGATGAAATGTTTA +GTGGCAGCTGTAATAACTATTGGACTACTATCTGGATGTTCCCAGAGTGTACCAGAGGTT +CCCAGAGATAACACTCTACACCCATCATGGCCTGATCCAATAAAGCCATACACTGGTAAG +TGGCAAGTCAAGATAATTGACGATCGCCCATGGGTTGGAATGCCTTTTGATGAATCTCAG +GAATTCAGAATTTGGATGAATGACGTCAATCGTTGGGCTAAAGATGCTAATGGAATGATA +TGTTACTATCGTAAAGACTTAAACGAAACAAGGTGCAAGCAATGGACCCGTTAGTAGTTC +TAATCTTATGTGTAACATTCTACAAAATTTGTAAACTCGCAATGCGATAAGGGCTTCGGC +CCTTTGGATAATAAAATTTTAATGAGGATATCATAATGACTCCACGTTCTAATGTATTCG +TTAGCAATCCGGAATTATTGACAAAAATTAAAAATGAAGATCTCCGTAACTGTATGAAGG +TCGGTTCCTGCATCCGTACTCCGCTGGAAAAGAAGACCACGTTCAATTATTCTTGGACTG +CCACTGAATACGGAGATCGTGTTATCAGTGTAACATGTTATCAACCTAATTCTAGTATTC +CAGGAAAAACGTTCATGGTTGATATTGCTCTATCAGACTGGTATCGTATTACAAGCTCTG +TATCCAATTTTGACGAATGGCAGGAAGAAGTAATTGAGAAAGACCGCATTCGCTCAGTTA +TTAAGACCTTTGAAGAAGCAGCTAAAATCCATGGTGCTCGTCAGGCTACTGTAGCAAATG +GATTTATCTCTGACGCAGAACCATTCTTGCGTGAATCTGGTAAGATTCTGAAAGATGCTC +GAGCAGCTCTTTATGAGGAATTCGATGTATGACACATTCAATTCTTCGTGCATTACTGAA +AGAATACTTTTCTCGTAGTACTGCATATTGTTACGCTTTACACGAAGATGATAAAGTAAG +AACTGGTTCGTCAGCTGATACAGTTAACTACATTGCGAATGATATGGAGCTTCAAGCCGA +TCGGGTTATTGCAGCTTTTAAATCATTTGAAAACGGCGAACTCATGTATAAAGCATTTAA +TTCAATAATGATTCGTCGAATATGGGCTCATGGTGCTTATATTAGTAGCGTAATGCGAAA +CCGTTCTTCACAATGGTCTAAAGATGCTAAGAAAAGACTTGATGACATTGACGAACAAAT +TGATATTATGATTGGGGTATTGAAATGACTCAAACTGAACATACAAGATTAATTCTGTTG +TTTAACAACTATGAGCAACTTAATAATCAGATGAATTTAGCTTATGCAACAAATTCAGAC +CGTGCTGCGTATTATTATACCGAAGCTGCAAGGGCTCGTGAAGCTTTTATCAAGAAACTA +GCGGAACATATCGAATGAAAAAGTTAATCTTAACTCAGGGATGCCCAGGCTCTGGTAAAT +CTACTTGGGCAAATGAATATGTTCAGAAAAATCCTGGATTCTTCATCCTGACCCGTGATG +ATTTCCGTGAAAAGCTTTTTGGGCTAGAAGCTCGTAATCAATATCGTTATAGTAAAGCAA +AAGAAAAAGCAGTATCAGCTGCTCAATTTGCTGCCGCTGAAGCCTTGCTTAAAATGGAAA +CGACTAAAGGCATTATCATTGCGGATACGAACTTGAATCCCTCGACTGTCAAAGCTTGGA +ACGATTTTGCTTACAATGCATTCAATGACGTGTTGGTGGAAATTGAACGTTTCGACGCTC +CGTGGACTGAACTTCTAAAACGCAACCAATATCGTGGTGATAAAGCAGTTCCGATTGATG +TTCTGCGTAGCATGTATAAACTATCAAGACCACATGATGTTTATGTCCCAGATGAGTCTT +TGCCAAAGGCAGTTATTTTTGACCTTGATGGAACATTAGCCGACAATGATCATCGTTCTC +CTTACGACCTTGAGAAATGTGGTGAAGATGCTCCAAAGAAAATGGTTATTGCTCTTTTAG +AAATGCTTCGTGCTCAAGGGTATAAAATCATTACTGTATCTGGTCGAGAATCTGGTACAA +AAGAAGATGCTACAAAATATCGTCGCATTACTACAGAATGGCTTGATAAGCATACTAATG +GCTCAGATGAGCATTACCAACGAGCACAGGGTGATTCTCGTAAAGACGATGTAGTGAAAG +AAGAAATCTTCTGGAATTGTATCGCTGATCGTTTTAACGTGCAATTAGCAGTAGATGACC +GAGCACAAGTCGTAGAAATGTGGCGTCGTATTGGGGTTGAATGTTGGCAAGTTAACCATG +GAGATTTTTGATGGCTGCTTATCATGAAGGTTGGGCTTTAGTCAGTCCTAAAACTAAATG +CATCGCTCATATGGATGACGGCCAATATGCTGTCTTCACTGAAGAAAAATATGCAATTTT +AAAAGCTGCAGAAGTTCTTAAGCAATATGGTAAAACCCTTACTATCCGCCGTGTCAAAAT +TCCTTTGCCTTGGAGCATGTGATGTATAATAAACACCATGAAATTGAAGAAGAGGCTTAT +GGGCTTCTTCGTAAACTCGTCGGTGCAAAGCTCGATCCTGCATTGATTAATGCTCTTGCT +GAAATTCGAACTGATATCAATACTCGATACAAAAACGAATATCATGTAGAGTTCAAGCCC +GTTGGTGAAGTAGTAACCAACTTCGTTGTTAACGTTAAAGTACATACGGTGCACTAATGA +TTACAGTTCAACCTCACGCAAATACTTTGCAGAACGTCGCAAAAATGGTAGTAGCTCAAG +TAGTGGATAACTTTATGTTCACTCCTAATACTGCCGCGAATCAAATTCTTATTCATTCTG +ATATTGTTGCTGTGATGAATATTCTATGGAAAGATACTGATTTCAGAGTAGTTCCTCATT +TCGATAGCTTCGGCTTTACATTTGACTTCACTATTGATCCTGGAACTCCAAATGCTTTCG +GTTTTTCCGTGAAATATTCTTGGGATAATTCAAATGATTTCCGACAAGAAATTTAAGCAA +GAAGAATTTGTGGGTCTGTGTAAAGATTTTGCACAAGCCTGTGCTAATAAGAAACCTGGG +TCTCAAGTAACGTTAAAACAAGACCCTTTAGTGAATGGAGTGATCGTCACAATTAGTTAT +AATGGTAAACAAAGCCATGTGTCTTTGACATTAACTCGTGATGGCTCTGTTAAAATGGAA +ACAATTCTGGGGTATGTATGATCACATTAGTAAGTGCAATTGAAATGATTCGTGAAGCTG +AGTATCAGCATGTTGGTGACAAGTCTTACTTCAGTGAAGAAGGTGTTTTGGATGTTAAAG +CACTACTAGAGTTTGATCGTTGTTTCCAATCCGTTCCAAGTGACACGTATGATGCGGTTA +TCTTGTCGTGCAAAGATTTAATCAACGTAAATGCTCGTGGTTTAGGTGATGTCTTCTTTG +ATAAAAATAAACGCTTTCCAGATGGTTGTTTTATAATTACTTCACAGGTACTGGCAGTTG +AACAACTGTTCAGTGAAATTTATCGAGTTAAAACTAAAAACTCAACTTATCTGGTGATCA +TGTAAATGAAAGCATCCACGGTCTTACAAATTGCTTATCTGGTTTCTCAAGAATCCAAAT +GCTGTTCCTGGAAAGTGGGTGCAGTGATTGAAAAGAATGGACGTATTATCTCTACCGGTT +ATAATGGTTCACCTGCTGGTGGAGTTAACTGTTGTGATCATGCCGATGAAAAAGGTTGGT +TAGTTAAAAAGCCTTCATCTGGTTTACGTCAAGATGGTCCTATTCCAAAATATGGGTTAT +CTACTAAATTTAGAGCAGAGCATAGTGCTTGGTCTTCTGTGAATGAAATTCACGCTGAAC +TAAATGCTATTTTATTTGCTGCTCGTAATGGCTCTTCAATTGAAGGTGCTACGATGTATG +TGACATTATCACCATGTCCAGACTGTGCTAAAGCAATTGCTCAATCCGGTATTAAGAAGT +TGGTGTACTGTGAAACATACGATAAAAACAGAGAAGGCTGGGATGATATCCTTCGTTCGG +CTGGAATTGAGGTATTCAACGTTCCTAAGAAAAATCTTTCGAAGTTGAATTGGTACAATA +TTGATGAATTTTGCGGAATAGAAGAATGATTACTTTTGAAAAAACTCCTGAAGTCGTAGT +GTCAGATATGACTGAAGAATTTATCTTTACGATGGAAGCAAATAATATCCGCTGCATTAA +AGTTCAGCCAACGTTTGTTATTGAGCATCTTGAAAAACAATTTGGCTGTGAAATCCTGTC +AAAATCTACTACTGATTATGACTATGTAATTAAACAGTTCGTTGAGCTTAAGCCTGAGCT +GGTGTTGGTTCGCGAAGTTAAAGAAGAATGTATTGGTGATGATGTCCGGTATATCTTCCG +TGTTGACTATATCAAGGTTAAAGCATGAAAGCTCGATTAGTGCAAACTACTGGATTGAGC +TCTTACGGTGATATCAACATTTCTTATGCGGTCGAATACAAAAAGGGATTTTTCTCTAAA +TGGAAAACTCTTTATCAAACCGATTATGTTGATAGCACAGATGAAGTTCGCACAACCGAC +CGTCGTTCAAAATGTGAAAAACTTCTGAAAGCATTAAAAGAACGTGGTGCACATAAAATT +AAAACTGTTATAGGTGAATAAGATGAAATTAACTAAAGATCAGAAAATCCAAGTTCGTGA +AACATTAAAAGCAATCCTGTCTAATGGTGAATCTCAGGTAGTATTCGAGAAAGCTGATGG +CACTATTCGTTCCATGCGTTGTACTCGTGATAGTGATTCAATTCCGAGTGATCTGGTAGA +AAGCACTGTTAAACCAGCTCGTGCCGAATCTATTGATATGCTGCCAGTGTATGACACTGA +AAAAGAACAGTGGCGCGGATTTAGCTTTGAAAAACTGATCTCTGTGAATGGTGTTAAAGT +TGAGCATTTGATTCAACTGATCACTCATTAATTGCTTAAAGTAGAGCATGATAATATTAA +TTCATGCTCTTAAACATAAAGGTTAAACATGCAGTCTTTACCAATTAAAGCTTTAGGTGA +ATATGTAATCCTCGTTTCTGAACCAGCTCAAGCAGGTGATGAAAAAGTTTCTTCTTCTGG +AATTTTTCTAGGAAAAGAACACCAAGGTCAATTACCAGAAATGTGTGAAATCTACGCAAT +CGGTGATGATGTACCAAAAGGATTTGTTGAAATTGGAGATTTTACTCCTATTCCAACAGG +AAGCATTCGAAATGTCGTTCATCCTTTGGTTGCAGCAGGGCTTAAGCAACCGAAGGAAAT +CAAACAAAAATTCGTAACTTGTCACTATAAATCTCTTTCTTGCGTTTATAAGTGATATAA +ATATCATTATGAAGAACTGGAAAACTACCTCCAAAAGGACATGACCAGTTATTCTATGTG +GTGGATGCGCAGCTTAAACGCTGGTACCGTCCACCAAATTTTCACCTCATTTGAGGAACG +ACATATGTCAATGCACAAACAACTCGAACACGCTCTTACCCTGCAACGTACCGCTTGGAA +TGCAGGCCACGAAAACTATGGCGCATCTATTGATGTTTACGCCGAAGCATTGGAAGTTCT +TAAAGGTTTCAAACACCTGAATCCAGTTCAAGCCGATCTTCGTGATGCGTTGGTCGAAAA +AGACGAACTGAAATTTGCAAAATCTCTTTGCAGCTCAGCTCGTAAAGCTGTTCGTCATTT +CGTAGTAACTCTGAAGTAATATTTTATAAGCGGCAAGTGCATGCTACCCCGAGGCGATGG +CCAATCGGGAGTACGCCTCAAGGCCTATACATCCATCGGTGTATATCTTATCCTCGAGAA +ATCGGACCCGGACCCTTTAAGCTAACGGTGTGCAACAGATAAGAGCGTATGAAGTTAAGG +TTGTGAGAGCCGGAACGTTAAATATAAAAAACTCACAAAGTACCCTTTGAGGGCTTGCGG +GAGCTACAACTGAAAGAACTGTCGAAAGAAGTTGAAACTCAGAAGAACGTGCTCCCATGT +ATTTCTCCAAAATGGAAGATCATAATGTCAAAAGCTAAAAAAGCAGTTAAATCAGTAAAA +GAAGTTGTTGGTACCTCTAAGCGTGCTGGTTACAAACGTAGCACTAACGCTCGTATTGAT +AAACTAGGCGATCAGCTGGCTTCTCGTGCCCGTAAGGTTCTGGCACATGATGCGGCTTTT +GGTAATCCACGTAAGAAAGCGTAAAGCATAAGTTAGGGACTCCTTCGGGAGTCCCTTTTT +TGTTTGTACTCTGCCGTTTACATCCGTTGAAGAACGTGTTATAGTATAAACTCAATAACC +AATACGGTATAATGGAGAATAAAATGAAAATCAATCTGAATGCTACAGTAAAATGTAAAG +ATCACGATGGATATAAAGCTCAGACAGTAAAAGAACAACAATGGATGCTTACTAAGCAAC +AATTTGAATTTGTGAACTGTATGACTCCTGAAGGTCCATCAGATGATTTCTCATGGAAAA +TTATTCTGATCAACTTCTTCACTGGTGAAGAATATGAGTTGAACACTTTGATTCTCGGTA +AGATTCGTTGTGAAACTTATGTCGATGAAGAAGATGGTTACTCGGAAGATGTTACGTGGT +ATCAAAATGGTCGAATTACTGCTGATAATCTGATAGAAGCAATTAAAGCTAAAGGCGTAG +TTGATCTCACTTACTGGACTAAAACTAAATGAGGCCTTCGGGCCTATTGAGGAAAGCATG +AAACTTGATTTAAGCAAAGAATATGTTCTGGATAATGTTAGCGGCTATCTTTATGATAAT +GGAAGCAATACTCACATCAACAATGAAGTAGTTAAGTTCATTGGTGACCGTAAGTTCACG +ATTAAGACAACTGGTTACAACCGAATTGATGGGATTTCTTTTGATAAAGGCGAAACTTGG +GTATCGTTGAAAGATATCTCAGAACAAGCATCAGTGTATGGCTACATTTTCTCTGCTGAA +GAGATCGACCGTGGAGCTATCAAAGTTGCTCCAGAAGATAAAAGTGTTCGTGAATACATG +GTGATCTACACTGACGAAGATGACATTCCGAAGGTTGCTTACAGTGGTTCAGGGAATATG +TTCACTGAAGAAGAGGCTAAGACTGCTTCACTTGAGCTGTTCACCGAAGGCTACAAAATT +AAGAACGTATTAGTCGTTAAGAAAGCGTTTGAAGCTTTGTCAAAAATTGAAGTAAGTTTC +GTCTAACCGTTTACATCCTCCGTTGTATGTGTTATAGTATAAACTCAATCAACAAACATA +CAACGGAGAATAAAATGAACTTTACTAACTTTAATCGCAAATATGTTCAAGGCGCTTTTG +ATAGCACGGTTTGTCTTTGGGAGCATAAAAACGGCACAGTTTGCGAAATCGATATGTACT +GGACAGACAACTACGTCTACATCAACTTTGAAAATGGAATCACGTTAGATATTTCTTTTA +AAGGTTCAGTCATTAAGGTTGGTTTTCATGATGATGTTCGCACTCGCGATTTAGGAACTC +ATCCGTCTTGGAACGGAGATAATCGTAGAACTCTGGTTAAACTTTATCTTCGCCATATCT +TAGGGCAGAAGACTACTGAAGAACAGCGTGAAGCAATTTGGGACATCGTTTCAAACGAAT +TAAAATTTTAATCTTAAACCGGGGCTTTGGCCCCTCTGAGGAAAATATGCAAATTTATCA +ATTCAACGTTGGCGGTTATAAGGAGTTCATTGATGTACATAGTCATGAATTTGTAGCCGC +ATGGGAAAAGAATATGGAAGTAGCCCAAGACTTCATAGTTCTAGCCTATTCAGAAGAAGG +GCATATCACCAAAGTCAAGAATATGAAAACCGGCGAAGTCTTTACCGCTCTTGAAGGTGA +CTTTAAGAATTATTTCGCGTTCTTCTTGGTTCATGAAATTGGTAATGGCAAACAGGTTTC +AGTAGTTCGAGAAATTGTGTCAGAAGAAATGAGTGCGTATGGAATTAATGAAAATGAATC +TGGCGAATTCATTGTAGTCTCCGAGTACAATCAAAATAAAAGTATTTTAGGTCCTTATAA +CTATGAAGAAGCTCTGAAAAAGGCTAAAAGCCAGATTATGCATGGTGCTATTGGCGTGAC +TGTAAAGATCTACAAAGCAGTAAATGAAGTTGAACTCGCTGTCAATGTAAAATCACTCTG +AGGGAAATAACATGATCGTATCTATCGCTAAATCTGTCGCTGCTAAATTTGAACGCATCA +TCGATTGCCCAATGATTGATATCATTGAAGTTCGCGTTCGTAATCATTCGGTTGAGTATG +AAATTGATGCTCCTGATTTCTTTGAATTTCCAGATTGGGCTGTTGTATTATGAAATTATT +CATTGATCTAATGAAGCATCTCTATCCAACATACAGGTTAACATTTGATGTAATTGATGT +TTCACCATCGGGTTGGATAAAAGATCCGTTTCAAGTAACCAGAGAAATCAAACTCTGGTT +TTGGGAAGATAAAACAACAGAATTCCATAAAGTTGAAGAAGCACTTCCTCCGTTTGGTGT +ATGGAGTGATGTTATTCTGAACAAGGTTGAAAAAGTATGAAACCAAAATACATGATATTC +CAAACGGTTCAACTGAAAGGCTCAGGAATTCCCGGAGTAATAAGCGATGTAGCTAATGGA +ATTCCTCGCTATAAAACTCAGCCCGCTTATGAAGTTGATTGGGTTGATGGAACTCGTTCG +GTTCATATGGAAGAAGAAATTTCTCCGATATCTCAATTAAAGGTGATGTAATGCTTTACT +TAGATTTAGATATTATTGCTCAAATGCCAACTAAATCTGGGTATTTGAATCAGCTAGTTA +CAAAGACACTTATTGAAGGTGGAACAGTAGCTTTCACCTCATTTGAGGCTGAATTATCAG +ATCACACAATTAAAATGATTGAGGAAAAATTATGCTTTTACAAGAACCAAAACCAGTCGT +TGCCACCGACGTCGATGGGATTCTCATCAAGTGGCAATCAGGTCTTCCTTACTTCGCGCA +AAAATATGATTTGCCGTTAGATGAAATCCTGAAAACTATCGCAAGTGACTCTTTTGTTAC +TCCAGCAAAATTGTTCAACTGCTCAGAAGAATTCGCTTCTAAATTACTTTTGAAGTACAA +CAACTCCGACTTTATTCGTTATCTGTCGGCTTATGATGATGCTCTTAAGGTAGTTAATGA +GCTCAAGAAGCATTACGATTTTGTTGCAGTAACTGCATTAGGTAACTCAGTAGATGCTCA +CCTGAATCGCCAGTTTAACTTGAGTGCATTGTTTCCGGGAGCCTTTAAAGATATATACGT +CTGTGATTACAATGAATCTAAGGATCACCTGTTAACCCGAGTGTTGGAAAAGTACGGTGA +TCGTGTAGTCTGTTACGTTGATGATCTTGGAAAACATATTGACTCTGCAATTGAAGTTAT +GTCTCATTTAAAAGACTTCAAAACATTCTATTTGCCTCGTGGTGAACGGGATCATTTGCC +ATCTCATTCTGGAACAGCTCATCATACTGTAAAGAACTGGTACGAAATCAAAGATATTTT +GGTGAGTGATTCTTCATCAAAAATGGTTGAACAATTCAAAAAGATGGTCGATGAACTTAA +TAAGCCGGATCGCCCATCCATTTATGATTTTTGGAAACGCCAAGTTCCAATTTTCGAACC +AACCCAGCCATGGCAACGTCCATATCCGAATTATGGTATTGGAACCGGGATTGAATATCT +CATGAATCAACCTAATGCAGTGGTGAATTGTAAAGTATGATTAATGTAATTTTCTGGAAT +CCGCACACTGACAAGACTCATCGAGTAGAGTCTTTCGCTACTAAGCAAGAGTTGTTTGAA +AAGCGAATTACTTCAAAAGCTAATCAGGAACAGCGTGAAATGCAAGATGCGGTATTCAAT +GGAGCTTTGTACATCACTACTCTTCCAAACGGTGGAGATATCTATCACCGAACTCTGGCA +GAATTGATTCGCGAATATAAGTCAAAGATCTAGCTTATCGAGGGGTGTGATATAATAGTC +ATATCCCTAAACACAAAGAGAAAATTATGATTCTTGAAATTATCAATGAAATCGCGTCTA +TTGGTTCAACAAAAGAAAAAGAAGCAATCATTCGTCGTCATAAAGATAATGAACTTCTGA +AACGTGTTTTCAAATTAACGTATGATGGTAAGTTCCAATACTACATCAAGAAATGGAATA +ATCCAGATTGTTATTTACCTAATGCAACACAATTTTCATTGAACTCTGCCTTAGATGTTT +TAGAGAATTTGTTTGCTACTCGTAAAATAACTGGCAATGCTGCTTTAGATAAACTTTCTG +CTACGCTTCAACGAATGCATGAGTCTGATCGTGAAGTACTTAAGAAAGTATTGCTTCGTG +ATCTTCGTTGTGGTGCATCTCGTTCTATTGCAAATAAAGTATGGAAGGGATTAATCCCAG +AACAGCCACAGATGCTTGCTTCATCGTATGATGAAAAGGGTATTGAAAAGAACATTAAGT +TCCCTGCATTTGCTCAGCTCAAAGCCGATGGTGCAAGGGCATTTGCCGAAGTTCGCGGTG +ATGAATTAGATGATGTAAAAATTCTATCTCGTGCCGGGAATGAGTATCTTGGTTTAGATT +TGCTGAAGCAGCAGCTAATCGATATGACTAAAGAAGCTCGTAAACGACACCCTGGTGGTG +TAATGATCGACGGCGAATTAGTTTATCATACTGTTGTTGCTTCGTCTGGTCCATTAGATG +ATATGTTCGGCGATTTGCCTGAACTTAGTAAAGCTAAAGAATTAAAAGAAGAATCTCGTA +CAATGTCCAACGGATTGGCAAATAAATCTCTGAAAGGCACTATCTCTAAGAAAGAAGCTG +ATGGTATGAAATTCCAAGTTTGGGATTATGTTCCATTGGATGTGGTTTATTCTGAAGGTA +AAGAATCTGGATTTGCGTATGATGTTCGGTTCCGTGCTCTAGAGTTAATGGTTCAAGGCT +TCTCTCAGATGATTCTAATCGAGAACCATGTTGTCCATAACCTCGAAGAAGCTAAGGTAA +TTTATCGCAAATACGTTGATCAAGGCCTTGAAGGTATTATTCTGAAGAACATCGGGGCTT +TCTGGGAAAACACCCGTTCTAAGAATCTTTATAAGTTCAAAGAAGTTATCACTATCGATA +TGCGTATTGTAGGGATTTATCCTCATAGTAAACACCCTGGTAAAGCAGGTGGATTCTATC +TAGAATCAGAATGCGGATTAATCAAAACTAAATCAGGTTCTGGATTAAAAGATAAACCGG +GTCCAGATTCGCATGAGTTAGACCGTACTCGTATTTGGGAAAACCAAAATGATTATATTG +GCGGAATTCTTGAATCAGAATGTAATGGTTGGTCAGCAGCTGAAGGTCGTACAGAATATG +TTAAGCTGTTCCTTCCTATTGCTGTTCGTATGCGTCGTGATAAAAATGTAGCAAATACAT +TCGCTGATATCTGGGGCGATTTCCATGAGGTTACTGGGTTATGAGTTATAAAATTCTTTT +AGAAGTTACCGTGATGTCTTCGACTGGACATGTGGCGGTTAGTACTGAACAGCTGGATTT +TTATAGCTGGGATAATGCTAATATGTATTATGAAGCAGTAGAAGTTTATGAAGAAACGCC +AGATATTAAAGTATGGCGTCAAGTAACAAAACTTTATTAAAGCCCTTCGGGGCTTTTGTT +GTCTATAAATATAGTAAACTATAGAGGACTTTTTATGATCGAATTAAATGAAGTCTTCGA +TGAAGGGAAAGAACGTCTAGCAGTTACGAACCTTTATCCGAAGCTCAAGATTCCACAAAT +TTTTGCAATAGACAACACTAAAGTAGCTTATCGTATGTGCTCATATACTGGTGGTGGAGA +TGCAAATAAAAACATCAAACCCGGTGATAAAATGATGCATGTCATTGCATTAGGAGTTAC +TGATAAAGGCCTTGGTCAACTTAAGACCTTAGGTGATAATCCAATTGCTGTTATTGATAC +AATCTTTAACCACGTAATGGGTATCATGAAGTTTTATCGTTTTGACGCTGCTTTATTTCG +TGTTAAAAAGAATAAAACTGGTGGAGCAGGTCGCCAGATGCAAGTTATTGTTGATCGTCT +AATCAAGAAGAAAGGCGGTGGCAAATTCGTTATGCTTAAAGAGTTGTATGATTTTGATAA +GAAATACAACTACATTTTAGTATACAAGAAGAATGCTGATCTTGTCAATATCCCTGGAAT +GACTGAGATCATGGACTCAATTTATAAGAAAGTAGACACTGATGTAGGTGATGCTTATAT +CAACGTTGAGACCGGCAAACAAGTATCTAAGCTTGAAGCTATCGCGGGTTCAATCGCAGC +AGAAAATGATAAACGCTCAGACCAGGCGGTTGCGTCTCGAGCTAAAATATCTCGTCGTGC +TTTAATGGCTTCTCAATATTCAATCCAAGTGGGATTTGATACTCGTAAAGATGCGGTAGA +ACATGATAAGCGATTAGATGTAATTAACTCTAAACCTCCGGTTTATTTGACAGATAAGTC +TTCTGACCAAGTATCGAATATTCAAATGGCTATTGATAATTTCAGAAATGATTCTCAATC +AATTGCTAAAACCGGCGAAGCGTTTAAGACATTTGACCCGTCATGGAAAATGGATGATGA +TCGTCATTCTACTGGTACAATGAAAGCCCAAGAACTTGTTCTAAGGCTCACTAATATATT +AACCAGTGGAACAGTAGACGATTTCAGTCAACATCCTACTGATAGAAGAGAAGCATTTAA +AACATTAGCGGTCAGAGACATTTATCGTATTGGTGAAGCCTGGTCTAAATTAGAGCCTAA +TGACTATTATGGTGCTATTAAAGAACTTACTCGAGTCGCAATGGAAGACAAAGAATGGTC +TTCTGATGCAAATCGTGAATACGCAGTAAAAGAGATTGTAGAATTAATTTCTAAACAGTT +CTCTGATTTAGCAGCTAGCATGTACAAAAATACATCAGATGTGGATCGTTATACTCCGGT +ACAATTGTCAGGTTTACATGCTTACGTCGGTTCATCTTATAAGTACATCAACGACTATCT +TTTAGGCCTTGATGATTATGGCAAAGAAACTGTTGAAAAATGGATTGAGTCTATCGATTC +TGCGTTTGAAAATGGTGTTCGTCTTCCGAAGGGAACTAAGCTATTTCGAGGTCAACATAC +TAAGCGCGAAGCTATTGAAGTTAGTTTAGAAAACAAGCACTTCTATTTCAAGAATTATGT +GTCAACTTCAATGGCTCCTATTATCTTTGGTGGATATGGACGAGCATATGATGCAATGGA +CCCCGCTGCATTGAACACAGATACATCGACTCCTAAAGAAGTGCTTGACTCTGTTTCAAC +TGTTCGGCCTGATAGTATTACTAACTCTGAAATGGGTGAATTGCGTTTAGCGTTCGTTAT +TTCTGGCGCAGAGAAAATAAAGACTATCGTAACCAATGCTGGAATCTCAGGATTGTCATT +TGAAGCTGAAGTTATTCTTCCTCGTGGTACTGTTCTTAGAATTGATAAAATGTATGGAAC +AGCTCAGAAACTTCAAGCTAATGACTACACAGCATCAAAGAGTGTTCTTATGGAATGCAC +TGTAGTATCTCCAGAACAATTATCTGAAACTACAATTTATGATGGCGATAAATTGTTAGA +AGGTGAATTGGTTGAATCTGATTATTCGTTCAGTTCTTTTATTGGTCAATTAAATGAAGC +TAAAGTTGAAACACCAGATTGGTTAGGTGAAGCTCTAGCATCATTTGTTGACATAAATAA +TTTACCAGAACGATTCATAAATTAATATTTTCACATGGACGTGAATTCAGAGAGGGCTTT +ATGGAAATTTTAAACGAAGTACTAGACGAAAGTAAACTGGATTTACCAGTTACGAACCTT +TATCCAAAGACGAAAATTCCACAAATTTTTGCTATTCAAACTAACTCCGAGGGTTCACTG +CCAGCATTCAGGATGTGTTCATATACATCTGGCGGTGATACCAATAAGAACGTTAAACCT +GGCGACAAAATGATTCATGTTGTTATGCTATCATTGAGCGAAAAAGGATCATTAGTTAAG +CTTAAAAACTTAGGCGGCGATCCAATTGGTGTTATCTCTACTACGTTCAATATCGTTTAT +TCAACGATGAAGCAGTATAAAATGGACGCATGCTTGTTCCGAATGGCCAAAAGCAAAATC +GGTGGACAAGCTCGTCAGATGCAGGTTATTATGGACCGACTCGTACGTTCTCGTACTGGT +GGTAAATTTGTTATCCTGAAAGAACTCTGGGATTATGATAAGAAGTACGCATATATTCTT +ATTCATCGTAAAAATGTTGATCTCTCAACCATCCCTGGCGTCCCAGAGATTGATACTGGA +CTGTTCACTGCAGTTGAAACTAAAGTTGGTGAAGTTTATGTTGAAAAGAAATCAGGTCAA +CAAGTAACTAAAGCCCAAGCCGTTGCTGCTTCTATTGCAGTCGAAAACGATAAGCGTTCA +GATCAAAACGTTATTTCTCGTGCTAAGATAAATCGTCGTCAAGCTATTGCTGCTCAGTAT +TCTGTTGATGCATCTAGCATCCAAGGCGATGATCGTGCTGCTGAAGAATTTAAACGCTTA +GAAGCTAAAGTTCCAGTTAAAAGCTCTAAAGGCGCTGAGTCATCAGACATGGTAGCAAAA +GTTAATACCATCGCTGACCGTCAAGGAAATGAGTATATCGGCAAAGTACTAAACTTCATC +ACTAATCCTGAAACATCTCAGGACACAGATGGTAAAGCATTGACTGCACGAATAGGTCAA +TTGCGCCAGTTATCTAAAATGCCTAAAGGTGCCATGTTATCAGGTGGATTTGAAACTGGT +GGTATGAAGTACTACATGGAAAACCAAAAAGAAATGTACAATGAAGTTCGTTCATTTGCT +CGATTGATAGCTGGGGTGAATACAACTAACTCCTTTCAGACGATGAAAGATTTAGTTAAA +ATGGCTTCAGCTGGAACTAGACCTGAAGATCGTGAACAGTTAATTGCAAATTTAATTGGA +TTAGCTTATAAAGAAATAAGTGCAATCATCAGAGATTCATACCAAACTGCAGCAAGTTTA +TCTAAAGAGAATGATCATTATTCTAAAGATGAAAAACAAGCTATCAGTGAATACTGCGCA +AACGCTTTCGAATACGTGAATATGTTCTTAATCGGTAAGCCGGAAGAAGGGTATTCAACT +TCTGATTCTCTCGAGATCATCGATAATATGGACTCTGCGTTTGAAAAAGGAACTCGTTTA +GACAAAGGTACATTATTGTATCGTGGCCAAAAATTAGACCTTCCTACA
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/merlin.fa.forblastn Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,86 @@ +>Merlin +AGAGCTTAAGAAAATTTTAGGCGATAAGAAGCTTCAAAGTACTCCAATTGGACGTTTTGG +AATGAAAGAAAACGTTGATACTTATATTGAAAGTGTAGTGACAGGGCAGTTAGAAGGTGA +ATTTTCTGTAGCAGTTCAAACTGTAGAAAATGATGAAGTTATTTTAACTTTACCAGCTTT +CGTAATTTTCCGCAAATAAAACAATGGGGAGCTATGCTCCCCATTTTTACAATCCAAGTA +TTTTCGAAGTAGAGTTTCGGGTCGAATTAATGACGTGAGACAACCCTCCAGCAGCTCCTC +CAAGTCTAGATAATCTACTTAAACTTCCATTAAGAGACATTTCACTATTAATTCCAGTTA +TCTCTTCTGATAACTCGTTCCAAGGTATTATTTTCCCTTGGTATTTAATTGATTTCACAC +AGGAAGATGCCATCTGTATTCCAGAAACTGCAACCGTTGAAGAAATATCTAAAATAATTG +AAGATAGCGCAGTTGTAATTGAATCAGAAAAGTTTGAATTACCTGAAGGATTTTCTCCTC +ATTTTGAACTAATTGGAAAGGCTCCAGGATATTGTACTCTTCCTTGGTGGATTTATGAGT +GTAAATTCATTGAACGTATCCGTAGTGCTGGATTTGATTTGACAGAAACATGGGTAGCTCG +TGAAAAGATTCATGGTACTAACTTTTCTATTATCATCACCAAAGACACAGTAACGTGTGC +GAAGCGCACCGGACCTATTCTTGAAGCTGAAGACTTCTTTGGTTACGAGATTATTCTTAA +GAAGTATGATAAGTCTATTAAAGCTCTCCAGGACACAATGAAGAATATGACCACAGAATC +TACCGATTTAATTGCAAATATTTTTGCTGAACCGTTTACATCAGTTCAGCAATTTGATAT +TATTACCTCATACCAAACAAATAGTAACTCGGAGAATAAAATGACAACCATCACTATCAA +CAAAGGTATTAACTTCGGTAAAGAAATTTCTGGCACTTTCGAATTAGTCGGAGAATGGTT +CCCAGAAACTCTGAAACCCGAAGATGCTGCTCAAGGTGATGGTAAAGTTTTCGTTATCAT +TAGAAGATATTAAAAACACCATGAACCGCTCAGAGATTTTGAATGAACTTCAACGCTGCG +TACAACGAGTTGATGATGAATACCACTTACCAACCAACGCATGGGAAGTCTGGTTCCGAG +GTTCTCATCTCGGCTCGATTGAACTGAAAGCCAAAGGTTGTTATGCAGTTTATAGTTCTC +TTGGTCGTCATTGCGGTGATTGTCAGAACTTTATGCAAGCACTGGCTCGCTTCATTAATT +CATGCGCAGTCATTATCGCCAAGCAGCAAATCGAAGAAACCGAAAAATGGATTGACGAAG +TAACTAAAGAGCCAGAAATTCGTCGTTGGGGCGTTACTCGTAAATCTCGTTGGATTGATA +AAGTCAAAGGGTGGTTCAAATGATGGAAACGATGAATCAAAATAATGAATTGGCGGTTCC +GCGTCACTCTTAACCCAAGTCATTCCCAAAATAAAAAACACAATTAACACAAAAACCCGG +GAGAAAAGAACTCTCCCGGTTGCATTATCTTTGAATACTCTATCCCAAACTCCAAAAACG +ATGTCTGTTATTGGCAGACTAATTTTTGAAGCCATAAGTTTTCTCCTTTGGAATATTTAT +ACTCGAGATCCATATATAGTGCCTACGTTTTGCCATGTTGGAGCAGTACCAGCAACAGCT +GCTCCACCACCTGCAGGACCCCATCGCCAGCCGGATTCAAAGTTACCGAATGCCGGGTTA +TGTTGACGCATCCAACCAATTAGCCAATCTTTATTATAATTATGGTTAGCTGCCAAGTCA +ATAACAACTTCTCTCGATCGACCTATCAAGTGTGACATCCAAAATGGAGTCCCTAATCTT +ACTTGTGATGCAGCAGTTGACATCCAACGTTGTCCAGTTTCGGCTACTGCACTACTTCCA +ACCCATGGTCCTGTTATAGCCATAAAAACTCCTAGGGCCCGAAGGCCCTTTATTAAATTA +ATGCAGAAATGATTTCTTCAAGCTTCTGAATGCGAGCTTTTAATTCAGAAATCTCATCAG +TGTGTTCGTTAATTGTTGCAGTGTTTAGCGCAATAATACCATTGTAGTTTAGACGAAGTA +ATCCTTCTGGATTATCTTTATCGACTGAAATCAACTCAGGCAATACAGCTTGAACTTCTT +GAGCAATTAAACCGGATGACTGTTCCCAATTAATAGAGCCGTCTTCTTTGAACCCTTTCT +GAAATAAAACGTGAGCACCTAGAAACTCTCAGAGGAATGAGACTAGGTTGTACATGTCAC +CCGCTTAATTGCCATGGTGATATAATAGCTCATATAGTTAACAGACTTTTTAAAGACGAC +TTCAGAGTAGAGGATTTATGCAATTAATTAAGTCATCAGGAATTGGACAAGATTTTATCC +TTGCTTCTAATCTGGCTATGTTTGCGCTACGCAAAGAAGTGTACGGACAATTCGAGCCAC +CATCATTTATCGACCATATTTCTTATTGTGTTAATGAACGCAAATATGATCCGGAATTAT +TGTCCAAATACAGCGCAGAAGAAATTACTTATCTGGAATCGCGAATTGTACATGACCGAG +CATCTAGCCAAGTTTCGAACTCCACGAATGGTATTGAGCCACCACGCGGGCCGGTAAGTA +TTAAAGAATCTAAAGAAGGTTCTTTCCGCCAGGTAGTTCCGAATATTGAGCATAATGCTG +AACTTTACGATTATGCTTGGCTGATGGCTAAACGTGGTAATAAAGGTTACTTGACTCAAG +TTGCTATTATGACAAAGTGGCTTTGCCAATCAGCTTCAGCGAATACTTATTACGATCCGC +AGAACTTTGCTAAAGGTAAAGTTTCGATGGCTCAGATGCTTGAAGACTTGATTTACTTCT +GGTTCTTCGGTGGTAAGACTTTGTATTACCATAACACCCGCGATGGTTCAGGAACAGACG +ATTACGAAATCGAAACTCCAAAAGCCGATGACTGTGCCGCTTGTAAACTATGATATAATT +GGACCACGGATGGTCCTAGGAGTATATTATGGGATGTAAAGAATTCACTTTACCGCCTCT +TCCAATTCAACCACGGCCAAAGCCGCCAAGGGTTCCTAAATGAGTACAGTTTTTAACACT +GGTCGTGCACCTGCTGCTGTCTTGTCTGCTCTGATTAGCGATCCTTCTTTGGATACTTGG +GACTTAATGAAGAGTCTTTATCTCTGTCTTCATGTAATCAATGCTCTTGAAGCTATTAGA +TTTTATGTATCTTTCGCTTGTACTTTCAACTTCCATAAGAACATGGAAATCATGGAAGGT +AACGCAAAGATCATGAAGTTCATTGCTCGTGATGAACAACTTCATCTGAAAGGCACTCAG +TACATTATTCGTCAGCTTCAACAAGGTACCGACGGTGAAGAATGGGCTGAAATTGCTCGT +GAATGCGAACAAGAAGCAGTTAACATCTTCATGGAAGTTAACCGCCAAGAAAAAGAATGG +TAATCTACAGAAACTCGTCAATCATGCAGTAGAAATTATGGAAGGTGAATAATGAAACAA +CTTTATGAAAATCTGATGGCTCTATGTGATGCGAAGGATGAATCCAAATTCTACTTCACT +GACGATGTGTCTCCATCTGGCAAAGAATATCGAATCTTTAGCTATAACTACGCATCTTAT +TCTGATTGGCTACTGCCTGATGCTTTAGAATGCCGTGGCATTATGTTTGAAATGATGAAT +AAATGAGTCGTTCAATAAGTTCGCTCTGGATTTACAGGAGACTACTAGATGAAATGTTTA +GTGGCAGCTGTAATAACTATTGGACTACTATCTGGATGTTCCCAGAGTGTACCAGAGGTT +CCCAGAGATAACACTCTACACCCATCATGGCCTGATCCAATAAAGCCATACACTGGTAAG +TGGCAAGTCAAGATAATTGACGATCGCCCATGGGTTGGAATGCCTTTTGATGAATCTCAG +GAATTCAGAATTTGGATGAATGACGTCAATCGTTGGGCTAAAGATGCTAATGGAATGATA +TGTTACTATCGTAAAGACTTAAACGAAACAAGGTGCAAGCAATGGACCCGTTAGTAGTTC +CAACTCCGACTTTATTCGTTATCTGTCGGCTTATGATGATGCTCTTAAGGTAGTTAATGA +GCTCAAGAAGCATTACGATTTTGTTGCAGTAACTGCATTAGGTAACTCAGTAGATGCTCA +CCTGAATCGCCAGTTTAACTTGAGTGCATTGTTTCCGGGAGCCTTTAAAGATATATACGT +AAGCGAATTACTTCAAAAGCTAATCAGGAACAGCGTGAAATGCAAGATGCGGTATTCAAT +GGAGCTTTGTACATCACTACTCTTCCAAACGGTGGAGATATCTATCACCGAACTCTGGCA +GAATTGATTCGCGAATATAAGTCAAAGATCTAGCTTATCGAGGGGTGTGATATAATAGTC +ATATCCCTAAACACAAAGAGAAAATTATGATTCTTGAAATTATCAATGAAATCGCGTCTA +TAGAATCAGAATGCGGATTAATCAAAACTAAATCAGGTTCTGGATTAAAAGATAAACCGG +GTCCAGATTCGCATGAGTTAGACCGTACTCGTATTTGGGAAAACCAAAATGATTATATTG +CGATTGATAGCTGGGGTGAATACAACTAACTCCTTTCAGACGATGAAAGATTTAGTTAAA +ATGGCTTCAGCTGGAACTAGACCTGAAGATCGTGAACAGTTAATTGCAAATTTAATTGGA +TTAGCTTATAAAGAAATAAGTGCAATCATCAGAGATTCATACCAAACTGCAGCAAGTTTA +TCTAAAGAGAATGATCATTATTCTAAAGATGAAAAACAAGCTATCAGTGAATACTGCGCA +AACGCTTTCGAATACGTGAATATGTTCTTAATCGGTAAGCCGGAAGAAGGGTATTCAACT +TCTGATTCTCTCGAGATCATCGATAATATGGACTCTGCGTTTGAAAAAGGAACTCGTTTA +GACAAAGGTACATTATTGTATCGTGGCCAAAAATTAGACCTTCCTACA
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/vcf/merlin.vcf Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,23 @@ +##fileformat=VCFv4.0 +##fileDate=20090805 +##source=myImputationProgramV3.1 +##reference=1000GenomesPilot-NCBI36 +##phasing=partial +##INFO=<ID=NS,Number=1,Type=Integer,Description="Number of Samples With Data"> +##INFO=<ID=DP,Number=1,Type=Integer,Description="Total Depth"> +##INFO=<ID=AF,Number=.,Type=Float,Description="Allele Frequency"> +##INFO=<ID=AA,Number=1,Type=String,Description="Ancestral Allele"> +##INFO=<ID=DB,Number=0,Type=Flag,Description="dbSNP membership, build 129"> +##INFO=<ID=H2,Number=0,Type=Flag,Description="HapMap2 membership"> +##FILTER=<ID=q10,Description="Quality below 10"> +##FILTER=<ID=s50,Description="Less than 50% of samples have data"> +##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype"> +##FORMAT=<ID=GQ,Number=1,Type=Integer,Description="Genotype Quality"> +##FORMAT=<ID=DP,Number=1,Type=Integer,Description="Read Depth"> +##FORMAT=<ID=HQ,Number=2,Type=Integer,Description="Haplotype Quality"> +#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003 +Merlin 170 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,. +Merlin 1020 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3 +Merlin 3445 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4 +Merlin 5050 . T . 47 PASS NS=3;DP=13;AA=T GT:GQ:DP:HQ 0|0:54:7:56,60 0|0:48:4:51,51 0/0:61:2 +Merlin 20000 microsat1 GTCT G,GTACT 50 PASS NS=3;DP=9;AA=G GT:GQ:DP 0/1:35:4 0/2:17:2 1/1:40:3
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/tool-data/all_fasta.loc.sample Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,18 @@ +#This file lists the locations and dbkeys of all the fasta files +#under the "genome" directory (a directory that contains a directory +#for each build). The script extract_fasta.py will generate the file +#all_fasta.loc. This file has the format (white space characters are +#TAB characters): +# +#<unique_build_id> <dbkey> <display_name> <file_path> +# +#So, all_fasta.loc could look something like this: +# +#apiMel3 apiMel3 Honeybee (Apis mellifera): apiMel3 /path/to/genome/apiMel3/apiMel3.fa +#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /path/to/genome/hg19/hg19canon.fa +#hg19full hg19 Human (Homo sapiens): hg19 Full /path/to/genome/hg19/hg19full.fa +# +#Your all_fasta.loc file should contain an entry for each individual +#fasta file. So there will be multiple fasta files for each build, +#such as with hg19 above. +#
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/tool_data_table_conf.xml.sample Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,7 @@ +<tables> + <!-- Locations of all fasta files under genome directory --> + <table name="all_fasta" comment_char="#"> + <columns>value, dbkey, name, path</columns> + <file path="tool-data/all_fasta.loc" /> + </table> +</tables>
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/tool_data_table_conf.xml.test Wed Jun 05 08:15:49 2024 +0000 @@ -0,0 +1,7 @@ +<tables> + <!-- Locations of all fasta files under genome directory --> + <table name="all_fasta" comment_char="#"> + <columns>value, dbkey, name, path</columns> + <file path="${__HERE__}/test-data/all_fasta.loc" /> + </table> +</tables>