comparison test-data/multimer_output/msas/B/bfd_uniclust_hits.a3m @ 9:3bd420ec162d draft

planemo upload for repository https://github.com/usegalaxy-au/tools-au commit 7726c3cba165bdc8fc6366ec0ce6596e55657468
author galaxy-australia
date Tue, 13 Sep 2022 22:04:12 +0000
parents
children
comparison
equal deleted inserted replaced
8:ca90d17ff51b 9:3bd420ec162d
1 >chain_B
2 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH
3 >ERR1719244_1811598
4 MVQWSDDETKAIQMIWNSVDVNELGPAALRRCLLVYPWTQRYFGKFGDIATPTAIMQNPGVAQHGITVMNGLKLAGGPGGGPGNQPGGQQELWQRGKQQGQQQLWQQGQHGGKQRGqqQRQGQq-PSPRQSX------------------
5 >tr|W5MMD7|W5MMD7_LEPOC Uncharacterized protein OS=Lepisosteus oculatus OX=7918 PE=3 SV=1
6 MVTLTAEDKNNIRHVWGMVYKDPEGngAVVVIRLFTDHPETKQYFKRFKNLDTLEQMQTNPRIKLHGKRVMNTLNQVIDNLDDWAavkEILTALAERHRDVHKIHIHNFKLLFDVIIKVYGEALGPAFTDAACESWSKVFQLLYSFLQSVYT
7 >tr|G3WE01|G3WE01_SARHA Hemoglobin subunit mu OS=Sarcophilus harrisii OX=9305 GN=HBM PE=3 SV=1
8 --MFSAEEQSHIVQIWNYLsgHEAIFGTELLQRLFTVYPSTKSYFPPL-IPG-----LELTQMQNHGEQILMAVGVAVDNMYDLRTALSGLADLHAYGLRVEPTNFHFLIHCFQVMLASHLQSEYTAEMHAAWDKFLTNVAVVLTEKYH
9 >tr|W5PMJ4|W5PMJ4_SHEEP Uncharacterized protein OS=Ovis aries OX=9940 PE=3 SV=1
10 --SLTRAERTIVVSMWSKIstQADVIGTETLERRVTCVSRGPA-P----GSP------QS-------rgRREAGRKGRNDLEtggqgegAGRTGQRLL-RSRLRACTLSF---PPQFLSHCLLVTLASHFPADFTADAHAAWDKFLSLVSGVLTEKYR
11 >tr|A0A1K0GGD5|A0A1K0GGD5_RAT Globin d1 OS=Rattus norvegicus GN=Glnd1 PE=3 SV=1
12 ----------------------MYGLEKEp-R------------ETEGCLS---RKLPSNLQRSSAPWRLHGFQNLLERSQGA--------QRAKPG------------HGAHSHSSVKMAL--SQTDH------------------rlvL
13 >ERR1719474_978995
14 --------------------------------LLQSSWKQ--FRT----------------------------------------FASLSGIRQEELGAGCQHQDLP----------QIQHHLWISEPSTFQQLLtftrsiktftnhylnirclflqmflslrgCVNKDSASRKKH
15 >ERR1719336_830457
16 ----------------------------------------------------------------------------------SINPQSTVDLGAQYISATPLNYKNHQDIYNSLLSNG------VLVPANVSLIEGMRQDRIDEGEE
17 >tr|F6XB67|F6XB67_XENTR Uncharacterized protein OS=Xenopus tropicalis PE=3 SV=1
18 -MILSEAEKAAILSLWAKAsgNVNALGAEALERILYIWQNLFSYLESP-VI---L-----KILQTGKGASVYKIR-GLDHLSTKHSILPLL-TVKKCLCLRDAGFKILLSHAIEVTLAVHFPDDFDATAQAAWDKFLAAISTALTSQYR
19 >tr|A0A1L8EXG7|A0A1L8EXG7_XENLA Uncharacterized protein OS=Xenopus laevis GN=XELAEV_18045093mg PE=3 SV=1
20 -MSLSQAEKTLILAFWNKASglINTIGPQIVNRLLLAYPQLKTHFGNF-NVTPGS-----SDLNTLGIKIITAVGGATQHMDDLPVHLAILTDLHSLTLRIDPGNYKLMIDCIVISMAASLPQDFTAEVQNAMTNFLIIIGDILASKFC
21 >SRR5260364_139532
22 ------------T----VLapDPnPTPHSASPRRMFLSFPTTKTYFPHF-DLSHGS-----AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKVSGGPGAIWVEGRDGAFLAGQRITRvAGGVAQAAAAGLGPRPH
23 >tr|A0A096M318|A0A096M318_POEFO Uncharacterized protein OS=Poecilia formosa OX=48698 PE=3 SV=1
24 ------HDELIITGVFFTSVSECVPP-----VRNIYRQTTNSIENIGNFKNGETFLTNPPVALYVVNMVEFTSKPLMS-LPLNGFYGILDFLK--AKRKNPNGGKLLADCLTIVIASKMGSGFTPEIQATFQKFLAVVVSALGKQYH
25 >tr|A0A146TSR5|A0A146TSR5_FUNHE Hemoglobin cathodic subunit beta (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
26 IFHFIYFYLSTIHYIFSKIYSFFFFPSSLSIFLIFYPFTHIYFFIFFNLYNSSSITSNPNFSSHFNFFLSFLYKSFNNIYYINTTYKYLIFLHSYKLQFYPYNFNLLSYFLTIFLSFHIFSSFTP----------------------
27 >tr|A0A146Z291|A0A146Z291_FUNHE Hemoglobin subunit epsilon (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
28 IFYFSYHYLIIITSIFSNLYYNYFFPNSLIIFLIFYPFTHIYFSNFFNLYNSYSINTNPNIQSHFTNFLHFLYLSFNNIYNINFTYSYFIFLHSYNLHFYPYNFNLLSYFFTIFISSNIFSVIKE----------------------
29 >tr|H3B4U9|H3B4U9_LATCH Cytoglobin OS=Latimeria chalumnae OX=7897 GN=CYGB PE=3 SV=1
30 --QLSDTEVESIRQIWSNVytNCENVGVLVLIRFFVNFPSAKQYFSQFRHLEDPLDMERSVQLRKHARRVMGAINTVVENVEDQDKiasVLAPVGKAHALKHKVEPVYFKILSGVILEILAEEYAQHFTPEVQKAWTKLMSIICCHVTATY-
31 >tr|L8HVQ9|L8HVQ9_9CETA Cytoglobin OS=Bos mutus OX=72004 GN=M91_06698 PE=3 SV=1
32 --ELSEAERKAVQATWARLyaNCEDVGVAILVRNRFWRkKRASSTLEEFQegaqgrdsslGSSQAQKQPGCPQLRKHACRVMGALNTVVENLHDPEKvssVLSLVGKAHALKHKVEPVYFKILSGVILEVIAEEFANDFPPETQRAWAKLRGLIYSHVTAAY-
33 >ERR1711977_7585
34 -MSLSAKDKTLVKKLWEKAEgkSADIGAEALGRMLVAYPQTKTYFSQWGSDLNPQ----HPQVKKHGAVIMGGVGKAVKNIDDLVRGMGALSELHAFKLRVDPANFKILAHNIIWSWPCTSLQTSPPRPTCPLTSSCRTWLWLCPRDT-
35 >tr|A0A1C4HCU8|A0A1C4HCU8_PROAN Myoglobin (Fragment) OS=Protopterus annectens OX=7888 GN=Mb3 PE=2 SV=1
36 --MASAAQWDTTLKFWEAhVagDLKKHGHEALVRLFLKNKDSQKHFPKFKDLASEAEMRGSDGLKNHGETVFTALGKALQQRDGIANELRPLAVTHSQNHKIPLEEFENICEVIDVYLAEICPD-YAGETRTSVKAVLDVFSQSMTTLY-
37 >tr|A0A146P967|A0A146P967_FUNHE Hemoglobin subunit alpha OS=Fundulus heteroclitus PE=3 SV=1
38 ---LSKKEKKLIKDIWERLTpvAEDIGSEALLRMFTSYPGTKTYFSHL-DISPGS-----AHLNSHGKKIVLAIAGGAKDISQLTVTLAPLQTLHAYQLRIDPTNFKSCFHTVCLSRWpvTWAKSSL----RLHTQQWTSTCQPLQPCSL-
39 >tr|A0A146QLZ2|A0A146QLZ2_FUNHE Hemoglobin subunit alpha-2 (Fragment) OS=Fundulus heteroclitus OX=8078 PE=4 SV=1
40 NIILTSNYNYTFNTFFSKFssNSYSIFSYSLSIILFFYPHTNTYFSHFNYLIPFS-----SPFNNHLstfiflfsxxxXXVMGGVEDDVEKIENMKEGIIRISEMNELNMRVEKEKLKIMEKKIIVV---------------------------------
41 >tr|A0A024R1G3|A0A024R1G3_HUMAN Myoglobin OS=Homo sapiens GN=MB PE=3 SV=1
42 AMGLSDGEWQLVLNVWGKVeaDIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNY-
43 >tr|M3YM80|M3YM80_MUSPF Myoglobin OS=Mustela putorius furo GN=MB PE=3 SV=1
44 -MGLSDGEWQLVLNVWGKVeaDLAGHGQAVLISLCQGLESRKEEKKRDPAHACVSSRRslfVSQDLLFHSDAFLVSLGHRSflaPVSGENGQSQKTQPAHHAQHHRQPWNTEKFISDAIIQVLQSKHAGDFGAEAQAAMKKALELFRNDIAAKY-
45 >tr|A0A1Z5LBJ2|A0A1Z5LBJ2_ORNMO Uncharacterized protein (Fragment) OS=Ornithodoros moubata OX=6938 PE=3 SV=1
46 --ALSAAERALLRALWKKLgcNVGVYATEALERTLEAFPRTKIYFSHM-DLSP-----GSAQVRAHGQSPRPQGGRRADPRRRPPGRPArrpVRSERpARAHAARGPPPLRAAGPLSAGDPRPALPWRLRPRH--------------------
47 >tr|S4RW14|S4RW14_PETMA Uncharacterized protein OS=Petromyzon marinus PE=3 SV=1
48 --ALSGAEKAAIADSWKAVysNYEEAGKAILIKFFTSNPGVQDFFPKFKGLDSADQLSKSAAVRWHAERIINAVNDAVVALDDpekLSLKLKALSKKHAQEFNVDPQYFKVLAVNIVEGVSSA-NGGLGAEAQAAWEKFLSQVSILLKSQY-
49 >tr|Q9Y0D5|Q9Y0D5_MYXGL Hemoglobin OS=Myxine glutinosa GN=Hb PE=2 SV=1
50 --RTTEGERAAVRASWAVLmkDYEHAGVQILDKFFKANPAAKPFFTKMKDLHTLEDLASSADARWHVERIIQAVNFAVINIEDrekLSNKFVKLSQDHIEEFHVtDPQYFMILSQTILDEVEKR-NGGLSGEGKSGWHKVMTIICKMLKSKY-
51 >tr|A0A1W0WKD0|A0A1W0WKD0_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_10224 PE=3 SV=1
52 --GLTSNHIKAVRANWKLIekRLPEYGLELFVAYLNKHPDWIGLLPFLKPADMPR-LQQTPRLKAHGTIVLKKLGELLTMLDSppkLIGELLKQGSTHR-ARGLAPENFQAIQHDLNELFVKICGPE---FDIEGWDAVLTLIMTGIEEGL-
53 >tr|K4FYM0|K4FYM0_CALMI Hemoglobin subunit alpha OS=Callorhinchus milii OX=7868 PE=2 SV=1
54 ---LSKTDKALLSSSVGKIQAQATGSDVLARMFASFPQTKVYFVGFSDYTA-----KGPRVQKHGLTVMTKIIEGIQYLDSLRSFLDALSAKHAHELMVDPVNFGFLGECVLSSLAYQLPD-FSPEMHCAWDKYLCEFAYLLAEKYR
55 >tr|H9GUN8|H9GUN8_ANOCA Uncharacterized protein OS=Anolis carolinensis GN=LOC103282340 PE=3 SV=1
56 --KMTDLDRRHIREIWTAAfeNPEENGRLVIIRFFSDYPASKQYFK---TVPTDGDLKAHPQVAFHGRRIMVAFSQVIENMENWNQACVlleRLVNNHKNIHQVPSGMFQLLFQAMLCTFDDLLGRTFTPEKRVSWEKFFQVIQEEVEAAYD
57 >tr|H2YFM6|H2YFM6_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1
58 --SLTTEEVITLRTTWAEiskLGNATVGLAVLHRLFNDCPEVRPFFGSMlppSELSDMDSLKSNPKVVDHASRVALSINNIIQLLEntdELVSYLSFLGKVHG-ERSIPAKHFSDMGPVLLAVISAVLREDLEGVVMQTWAKAYGAIEAGI-----
59 >UPI000197D711 status=active
60 ---LTPKDIYEAKQCWNKAAslgVNKVGVLLFKNIFTIAPEAAKAF-SFGNDP---NFMNNKEMEEHGVKVVMAFDHAVRSLDNIHalqETADGLRDTHSFF-NLSPEHHVIVKEALLQTLKQGLGDEFTDAQRELWNGIYTAIRNMW-----
61 >KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold119418_1 # 1 # 498 # 1 # ID=119418_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510
62 ---ISPLKLRLVQSSWRQASaDEQAGITAFKFFFEMEPVAIGMF-GLQDIR---DLYNSYELKRIAAKIVKAMTHIVNSFDNFEglrPLIKKLGMMHGEK-GVSPSQYNNFGKAFMQTVEEILGDQFTPETRRAWETFFRILTGAL-----
63 >tr|A0A146PHJ5|A0A146PHJ5_FUNHE Hemoglobin cathodic subunit beta OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
64 -----------------------------ASWFCGFHWTQRYFPHIWRPLPPPAIAAKFPKGAAWKTVMGGLEIAVKNIGQHKAAYAKLSVMHSEKLHVDPTTSGFLLNASQWVWLPSLPPRLHPWFPGGWQKFR------------
65 >tr|A0A1E7FQE1|A0A1E7FQE1_9STRA Neuroglobin OS=Fragilariopsis cylindrus CCMP1102 OX=635003 GN=Ngb1 PE=3 SV=1
66 --------MALVVESWAKIKEIENyeevaGELLFRRIFEIKPDAAAYFKFTDGFETTDeALYKQEVFIKHVKMVILTVTSAVDLLEkeNMdelFRMLKLLGAKH-LSagLKLEKEHYNLVGMALLDTLGKALGDTFTEAVKSAWIGVYAIIASKM-----
67 >tr|A0A150AR53|A0A150AR53_9BACT Uncharacterized protein OS=Flammeovirga sp. SJP92 OX=1775430 GN=AVL50_01545 PE=4 SV=1
68 ---VSNKQIELVQNSFTLITphRGQVSELFFSKLFKIDSSLESSLMV--DPK------------DQERRLIPMLSAVVNGLVDfelIIPILQDFGRTHV-EYNIQEKHYEAVQKALFYALQTVLQEKWTSEVDDAWSNIFSVLTNIMKE---
69 >tr|A0A1Q9P386|A0A1Q9P386_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=hmp PE=4 SV=1
70 ---FSNNDIRVIDELWDLILpiKETITDSFYATLFSLDRTIKPMFKT--DLG------------VQGLRLTDTLTFIIKHMGNiedTIQIVKELGVKHL-EYGTKPYHYDLVLEALLETFDKHLEEKFNSEMRLCWIKLYKFLSELMML---
71 >tr|A0A1G1B2A9|A0A1G1B2A9_9PROT Uncharacterized protein OS=Methylotenera sp. RIFCSPLOWO2_02_FULL_45_14 OX=1801615 GN=A3I83_03315 PE=3 SV=1
72 ---MTPMQIDVVQSTWQKVMpfREDIACLFYKRLFEIEPELSMVFKG--DMH------------DCVKKIMFMIDLAILNLGQleeVMPMLQEIGNKYV-QCGMKVDS-NAVRNTLVSTLEQRLGETFTVNVRSDWIQAYDLLVGVMKD---
73 >sp|Q7SID0|GLBF1_EPTBU Globin-F1 OS=Eptatretus burgeri OX=7764 PE=1 SV=1
74 --TLTDGDKKAINKIWPKIykEYEQYSLNILLRFLKCFPQAQASFPKFSTKK--SNLEQDPEVKHQAVVIFNKVNEIINSMDNqeeIIKSLKDLSQKHKTVFKVDSIWFKELSSIFVSTIDGG----------AEFEKLFSIICILLRSAY-
75 >tr|K1QF07|K1QF07_CRAGI Neuroglobin OS=Crassostrea gigas GN=CGI_10026082 PE=3 SV=1
76 --TISEDEKRLVKDSWNLFVsrgdFSDTGSHMYKVLLQDNPHLKTLFSFMKVNGa----PFDSPMFKSHVRNVFTVIGDAVNHIDDLDSLspiLKDLGVKHQ-GYGAKKEYLEPVGNALLCTIEKHLEDDFTQEVHSAWRTFFAVMSYSFA----
77 >tr|Q3MQ26|Q3MQ26_SPISO Nerve hemoglobin OS=Spisula solidissima OX=6584 GN=nHb PE=2 SV=1
78 --KLTKAEKDAVANSWAALKQdwKTIGADFFVKLFETYPNIKAYFKSFDNMDMSE-IKQSPKLRAHSINFCHGLNsfiQSLDEPDVLVILVQKLTVNHFRR-KIAVDRFQEAFALYVSYAQD---HAKfDDFTAAAWTKTLKVVADVI-----
79 >SRR3989338_1269240
80 --DFNDEEIDIIKDTWDAVLYPey---PEEGfnPVLNFSTKFYRRVFehencknlfeE--V------------DMTSQGEKLVKILSVLLVAVQTkslnqdHIHVLRKMGERHRG-YGVSDDMYEIIGGCLLRTLSEVCADVWDDDAKVVWAKLFGVVSEQM-----
81 >tr|A0A2G8K001|A0A2G8K001_STIJA Globin D, coelomic OS=Stichopus japonicus GN=BSL78_21829 PE=4 SV=1
82 TAQLSEVEKNLIRSSWEQAlkNKKVFGVNVFIKLFIQNPSSQDLFEQLRGIPLE-DLKTHRKMKAHALRVMASLNTLVEQIDEVEiltEMFNNVARTHV-IHKVEKAHYDLLGQVLMEVFSEELGAKFDSATKGAWLKAYVIMENIILDKY-
83 >ERR1712150_314552
84 MTALTEERKLHIKSSWSSVndDvdLAGNGVEFLVKLFTDFPEYMTFFPAFDGKTPE-EIRSSPKAKMHGKVLMTTLDKIVANLDDLEtviASLHRVVGSHF-PRGVTASHFKATLECFGSFLAVQLGDAFNNDVKNAWGVAVQILASVMEAEY-
85 >tr|A0A132AHZ9|A0A132AHZ9_SARSC Cytoglobin-1-like protein OS=Sarcoptes scabiei GN=QR98_0086180 PE=3 SV=1
86 -MSLTNRDKEIIVSTWSLIrkDSDQAGIHLFKRFFEANPDYVKYFP-FGDLdDLE-KILVDPRLKWHASRVMAALSTIVDNLDDPVcfeDSLQKVLSSHL-NRKIQLYHFENLKKALVCLFMDKLGpDIMNDETIEAWSKAYDVILDTYRSRL-
87 >sp|Q8T7J9|GLB_YOLEI Globin OS=Yoldia eightsii PE=1 SV=1
88 -MSFSAAQVDTVRSNWCSMtaDIDAAGYRIFELLFQRNPDYQSKFKAFKGLAVS-ALKGNPNAEKHIRIVLGGLGRILGALNTPEldVIYKEMASNHK-PRGVMKQQFKDMGQAIVTALSEIQSKSGGSFDRATWEALFESVANGIGQYQ-
89 >sp|P0C227|GLB_NERAL Globin OS=Nerita albicilla PE=1 SV=1
90 LKSLSADQKAAIKSSWAAFaaDITGNGSNVLVQFFKDYPGDQSYFKKFDGKKPD-ELKGDAQLATHASQVFGSLNNMIDSMDDPDkmvGLLCKNASDHI-PRGVRQQQYKELFSTLMNYMQSLPGANVAGDTKAAWDKALNAMANIIDAEQ-
91 >tr|A0A1B6EVA8|A0A1B6EVA8_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.22480 PE=3 SV=1
92 LEVITERDKYLAREVWMQVETNyvLISKSLFTNWITEFPEHLNFFKGLLD-SSYDDFLTSPKFEQHMANsVLPNVGIMISNLDRptdFRRHILKLAWIHIRKNiALKIDHFNILKGLILRTLKESLGRGIGRDHEVAMFKVITAGFNLFS----
93 >ERR1719240_1900674
94 -----------------AVArvlVHGL-ANLHRRALERLDLLLELVDAHRVVVL-RLLHRLdgrldrlHVLRRHLVLVLE------EG---------LLGAVHR-RVGLILH----------LHLRLAIGVRRGE----------------------
95 >tr|A0A224XVH8|A0A224XVH8_9HEMI Putative hemoglobin-like flavoprotein (Fragment) OS=Panstrongylus lignarius PE=3 SV=1
96 DIGVCNEDVAGIKETWQTVYNDkEnSGIFLFQVMFEMYPDYEKYFVRFRT-EGQKSLFDNPKFINHVKnRVMDALNDVIVNLENDErlvNILETVGENHK-KRNLRKQEFDNIGKVVIETLRRALGTSFTPKLEEAWTKVINCAMETIGK---
97 >tr|A0A1B6KZX4|A0A1B6KZX4_9HEMI Uncharacterized protein (Fragment) OS=Graphocephala atropunctata GN=g.7772 PE=3 SV=1
98 YFHLSLEDKRLAREAWYnNVEGNyViVAKAVFKELFRRAPQAYNFFKHLVD-VNERDMFESPRFKRHMVqRLMVALETIFYNVYWNDvfeNHMYDQGRKHK-KRGVQPAHVKLLLCVIV-----------------------------------
99 >tr|R7TS60|R7TS60_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_200756 PE=3 SV=1
100 -TFLTDEEVEILKASWNDLNddsdLSSIGKRVFLQAFEMRPEMKKIFP-FDNCWGD-KLLQHPKFQAHAQSFMVIIENSVEQVDNESSDFsdslTLLGQSHSDRIGFTRENVQVFLKAILAVWHDLLKS-SDDRTEKIWSKFLAHVVQIMRNGY-
101 >tr|A0A0X3PJM2|A0A0X3PJM2_SCHSO Globin OS=Schistocephalus solidus OX=70667 GN=GLB PE=3 SV=1
102 --QLTEVQKTQLCVEWKQICKNKedkyaLGTEVFRLLFTKYPHYIRLFKRFRDLPNLDSIMQSAAFKAHAMRFIGAIDAIMENLDDescLVELLKRLAEEHRPR-GITENDFYKTLDVAYDALSPALKsDDARVALRQLFDTALSVIRQSL-----
103 >sp|P02214|GLB_BUSCA Globin OS=Busycotypus canaliculatus OX=57622 PE=1 SV=1
104 --GLDGAQKTALKESWKVLGADGptmmkNGSLLFGLLFKTYPDTKKHFKHFDDA-TFAAMDTTGVGKAHGVAVFSGLGSMICSIDDddcVBGLAKKLSRNHLAR-GVSAADFKLLEAVFKZFLDEATQRKATDAQKDADGALLTMLIKAH-----
105 >ERR1719239_1832466
106 --GLSEKDLVLIRGSWGMLgdlkTRKAHGVELFIQLFRAYPYMCeEYFPWFNDMSDEE-LRTSRKMKAHAHNVMNNIGSYVEVCDDPESlvaLIGKMAETHIP-RNVKALQFKELGDMFLPYLVSMMGAAATTDVQEAWRRLLAALVAVVSQ---
107 >tr|A0A1I8JIG1|A0A1I8JIG1_9PLAT Uncharacterized protein OS=Macrostomum lignano GN=BOX15_Mlig002954g1 PE=3 SV=1
108 --MLNEVEKKIILSGWQQAikDKKALGMDVFMTLFEMFPQHQELFRDFKGKSRAE-LEKMPKMRAHALRVVNTLDGAIQSLDDMEVcasSLELIGASHKS-HHLSAKHFEDLNAALAVVFERRLGKA-FVDNKAVWVKLLQGIIPVIQR---
109 >tr|A7RZB2|A7RZB2_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g204383 PE=3 SV=1
110 -IPLDAKETQLVRKTWAILGDRqvEVGKSLFLRFFEEHPTSKDLFPEFRNISNEK-IAESPALYGHARRVMKSVDNAVASIENVQVysaYLYELGTRHQ-TRQLSEEQLKFMGGAFLFAMRLHLRKEWSRATSKAWEKIFSFMADAMMR---
111 >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1887876_1 # 1 # 366 # -1 # ID=1887876_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459
112 -LPVSDENKDILRESWKRLEEEktTLCKNVFIRLLQLNPNLQDTFPSFKGVALDE-LMNSRSLFLHSKRLMEALEIAISSLDDGQDfteYLTHLGERHT-AISITENHFKIMEKALIFALKDMLGESCTEDVANAWREFFQSMAGTMLA---
113 >ERR1719401_2606804
114 ----------------------------------------------------------QKYQAQGSRSQ---GG---ELS-RRrcvPPAQSRRA----RAGLAGDghqahclWHPPGERSEIRGSLRCCGEGSDPKLEMAWTKVFVVVSTTM-----
115 >ERR550519_2895140
116 ---LSKAERKEAENAWRIFevNLVDNGVDAFLNLVRDHPNRKDAFPWVKPELSEEALRNDPEMKKLAKLVFSAVKPAFKSLGDlqsLTNYYLNIGNELS-LMNIPPVMVSYLSDAFKKTCQKLLGSDYTHSLEASIEYVYDFITSRMFE---
117 >ERR1719402_597456
118 ---------------------------ALIA-------LISS----------------------AAGSGCLCDARARPFSM-------LS--AI-KLIRVVSAFRATAKALLPAFEEELGTKYTDDFRYALTTLINFMADNMEK---
119 >ERR1719423_342041
120 ----TGRQRVAVQASWRLVapDAKRHGIAIFIRLFKKHPETQLVFKSFKGQQ-PESLADNKRLAAHATTVMASVATLVDNLDDidtLLELLHKVAENHKRR-GLPIQYSTIWWRRWG----QHWTAAASRGGATSSepstrssplstsgskDNSFRNVCKMCEGISR
121 >tr|Q53I62|Q53I62_9ANNE Intracellular haemoglobin (Fragment) OS=Alvinella pompejana GN=hb-i PE=2 SV=1
122 ------------ADNIAAVrgDVSTHAMNIFVEYFKKFPQHQNAFADYKGKD-PESLKSLPKFKTHTTKVVSKLLDIVEKASDsgaLQSNCTTLAKMPQHK-GLNQQQFADLGAVLVPYLQKALGGACDSA---AWeqayn----------------
123 >SRR6516164_9760095
124 -IVTTPQQVQLVKQSFAKTTpiAEQAAGLFYGRLFETAPQLRPLFK--GDI------------KTQGRKLMSTIALAVGSLQKlpeLVPIVQDLGRRYV-GYGVKDDQLRYRRRRAAVDARQGaRGRLHTRCEGRVDLGLYDPrrYDEERRSAA-
125 >SRR5690348_1420512
126 -----------------------------RHRAESAPAVSGRS------------------HSAKKEADGDDLHDDRRTERfqkAGPGSQEPRRAPC-RLWCDCGGLSIVGEALLWTLEQGLAAEFKPEVRSAWIKLYDMIATTMQAGA-
127 >SRR5258706_3013648
128 -XMLSEKEITLGRNTWDLIapvT-QEMGIQFYEHLFETSPELKPLFKT--NP------------KDQAMKLMFMLSYFVHRLDKendLRAEIKKLAQRQS-GYGAKPEHYKLIRDTLLCSMQNDLRKPWNKETESSCQ---------------
129 >SRR3712207_8213275
130 -RLMREYRLAVIFFFFSSR--RRHTRYWRDWSSDVCSSDLSLFK--GDI------------TEQGRKLMQMIGVAVRSLDRleqVMPAVQALGARHV-GYGRSEERRVGKEGRSRWGPDHX-----------------------------
131 >SRR4029077_8512364
132 --CVTPQQIDLVQASWKQVVpvSETAAQMFYGRLFFLDPSLRRLVL--RGK------------RGGGERGGAVVLG-RQGEEGeegEGSALIHRDRAQA-AGGP-PPRGPAPGAAA----------------------------RHVRRS--
133 >SRR5437868_6476409
134 -----MDEILLLKTSLQKMGpqLEHAAGTFAVRLFQLNPSL-------GEI------------ATRGRELLQMMGAAVQNLGRldqLAPSARQFGRHYA-NCHIREQDYDAVGEAFLWSLGRGLGRDFTEEMEAAWGKVYWLMTEIIRAG--
135 >SRR5689334_13356078
136 ------------QVSFTQVApiAETATQLFYARLFELDPDLELLFK--GNL------------SEQGASLCKCSHLRSTVLTGwsnFCQSCNRLAHDTS-AMGFETKTTTQWDRRFCGRYGKGWV------------RPSHLRLSX------
137 >SRR5437870_6238790
138 -FDVTPIQVDLIRASWAKVEpiQELAASLFYDRLDRKSTRLNSSHVA-ISY------------AV---------FCLKKKKKKkek---------------YTHEHINNNKV----------------------------------------
139 >tr|A0A136P213|A0A136P213_9CHLR Globin OS=Chloroflexi bacterium OLB13 GN=UZ13_01312 PE=3 SV=1
140 -ESLTEHDKKLVQRSFTHIApqNEDIAAVFYARLFELDPDIEHLFS--TGL------------DVQRAKLMRMMADLVNALDApeaLSQSMRELGKQHV-SYGVHDKHYATVGEALIWALRKVCPAVMTPTVTQAWEKTYALFAELAIS---
141 >tr|A0A0C3QP41|A0A0C3QP41_9GAMM Uncharacterized protein OS=Shewanella sp. cp20 GN=DB48_17865 PE=3 SV=1
142 -MPLTDEQKRLIQKSYAEIDrqNSNFAAIFYDCLFAMAPLIRPMFKS--ER------------PVFEYHFNELISTAATKVFEfeeIKPRLVVLGQKHR-GYGVTPAQFDVVRSALMLSIQDCLRDTCNPAIEQAWSCYYDEIAKVMIAA--
143 >SRR5262245_10239308
144 -GPENARPGNL-RHHYadrgrcsGSLLpeAvqaRSVAGRHVSRRHERAAEE--AAAD-ADG------------RRQGARSA----RSGRGGRRgsrPAPRAIRRDRQAL-RHGRHGS---P------LGARGGTRARFTPSVKKAWATVYGLLATTMKNA--
145 >SRR3981081_1073077
146 -VVATPSPSRRRISDFG-------------RLKML-NSGKPEFGAgeGSSC------------CSGRSHLLVAILRHVAGIA-------------------------------------------------------------------
147 >SaaInlV_135m_DNA_2_1039731.scaffolds.fasta_scaffold157242_1 # 1 # 360 # 1 # ID=157242_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.458
148 --LLSPATRELVRSSFPMVEriAPRAGTMFYGRLFATAPEVLPQFR--RDLS------------QPNFQPaaehrfMQLVLFVrstaeHAGLPGsagHDETVGKLAQRHV-GYTTRAPHYAPLGRALLWTLDECLGADFTPAMRAAWSDTYDVLVASMVAPL-
149 >tr|A0A0P1GRZ8|A0A0P1GRZ8_9RHOB Soluble cytochrome O OS=Thalassobius mediterraneus GN=vhb PE=3 SV=1
150 MNLLSKDEVALIQGAYRALGpsKGFLTNSFYRRLFAIAPQARPLFP--QDM------------DEQLKKLEHMLDLLVDNLHQpmfFMGKLKRLAKRHV-GYGAQPEHYALVGEALIFALNDITPGGLPDKERALWVEIYTAISNTMIET--
151 >APLak6261659701_1056019.scaffolds.fasta_scaffold514158_1 # 3 # 230 # 1 # ID=514158_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
152 -IELNAKNKALVKEGWKLLIEtqFPnevggneralarFFDEFYRKFFEVNPSGKRLFEE-GGM------------AVQSKALVKMMSMVVTSLENpsnLDLTIERLGGRHE-LYGVSRSDYLAFTNAMCETLETVLGDKCNQEMKESWSLVLNNLSEKMLT---
153 >SRR3954466_1768845
154 -SCHDSGTGDARS---ADIRpgradRRQGGGDFLRSVVRGRPHGQAVVP--GRH------------SRAAPQTHRHAGGRGPRLSDLpsiLPAASALAKRHV-DYGARPEHYPVVGAALLWTLERGLGPQWTSEAASAWTAAYATLSSFMIA---
155 >SRR6185295_9741709
156 ----------------------------LTTWVKHLRRSIMVCG--DDM------------MDRRKRFTQVVSATVRGLARvdmLLPAVREFGMRHP-LPGEIEQHHANVASALLWMLEKALRKDFTPEVKAAWIKAYGMLSQTIRQS--
157 >tr|D7G782|D7G782_ECTSI Globin OS=Ectocarpus siliculosus OX=2880 GN=Esi_0008_0247 PE=3 SV=1
158 --VDVEGYKAEIRRTFALVEpiSVQAAGIFYPTLWEVDTSTKPLFKD-TDM------------DKQGEKLMKTLGVAVAMLNKmdtLKPILENLGRKHV-DYGVTPEMYPSVGKALLITFEKGLGEECTPLTTKAWTWVFGIISSICIAAA-
159 >SRR5215207_7597532
160 -QTMTRDQIRLVQASFRNVLpiRELAAALFYDRLFEIDPGTRGLFVD-TDL------------RSQGGKLMAAIGMVVHALDApesMVEKLKELARRHV-NYRQLQESSPPDFHRLhrfgsgrgsqRHVVSKGPGVAPVGQ----HVVPTHFASRvsrRLRAC--
161 >SRR5262249_41212017
162 -NVMTPEQKRLVRDTWKQVApiADAAADMFYRRLFEIDPTTRELFHA-TDM------------VAQRKKLLQMLAFAISGLDNlgaLVSKVEDLGRRTP-AVALPTRTTIPWAPRCCGPWNRVSVTRGHP----RWRRHGPRstnccpascatlprapsscktcgplrrgrplerqgICCVFRKR--
163 >ERR1700730_6579985
164 --RQRLADDGVILRVLQRGLgiELEMEALAREEIGELDPDAarfRPHHA--VGG------------GEVGGRHIELLRRHVDQRPpcHaaaNGSARISLPRGHV-SYGAKPRHYPVVGAALLWTLEKGLGDGWTPEVADAWLTAYSTLSGYMIS---
165 >tr|A0A0N0UYC0|A0A0N0UYC0_9BACT Uncharacterized protein OS=bacterium 336/3 OX=1664068 GN=AD998_10010 PE=3 SV=1
166 ------EQKEIIKSSFPRVLihTLKNSTIVYEKLFMDIPEAKDLFKN-TS------------IDKQGQMLVAAIGKIVKGLDNpdiFEKDLVELATRHV-GYGLKPEYFTHFGNALINMFEVSLVDSWDKDLHDAWVAVYQEVAEIMKSVI-
167 >SRR5918994_1539718
168 -------QQELIRESWQRFEpkIKRASPQFYERLFALDPAVRRLFSG-VNM------------AEQERKLMAMLKEIVPELDRptdLVAAVGRRSPFTP-HpepSGWLDPRYAWMRSRTPLP---CSGEX-------------------------
169 >tagenome__1003787_1003787.scaffolds.fasta_scaffold20949172_5 # 2657 # 2851 # 1 # ID=20949172_5;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.626
170 -------DETALLKGFDLAAdvLDEVIDNFYTELLESYPDLQPLFAH-TNT------------QQQRQKLQDVIYLLIENIHNqdvLESALLSLGERHI-RYGALPEHYPVVAEILESNLKKRLGRSWTKAVSTAWIQLLSAAADVMCRPY-
171 >ERR1700753_815890
172 --XMKSSTMELLSSSFARVcaDKNNAAGIFYARLFTTAPELRAAFQS--DF------------DSVQWKLMSSLVQIVEFYRVgvdPTSYLADLGRSRQ-GYAAQRAQFDAVGDAILFTLAQVLGQGFGADIRAAWVSAYAA----------
173 >tr|A0A1H2YYM1|A0A1H2YYM1_9RHOB Hemoglobin-like flavoprotein OS=Albimonas donghaensis OX=356660 GN=SAMN05444336_103306 PE=3 SV=1
174 AMPLDSTNLARMREMLHILRrdAPDASTDFYQALFERAPELRTLFRD-SDL------------AGQGRKFMAMLGLLVDACEDygrLGNEIRELGRGHA-AYGVEARFFPPMEEALIDTMRSNLGERFTPELEADWRKLYAIVANEMMSP--
175 >tr|A0A1T2B631|A0A1T2B631_9RHOB Uncharacterized protein OS=Thioclava sp. DLFJ4-1 OX=1915313 GN=BMI85_03370 PE=4 SV=1
176 EPLLPAERAARVKASAARLDfeDPSLFRDAFARLFAVHPELDQVLPN--SE------------GGQQLKYAAMMEVILSTLDPpeeQELELPGLGQMHV-LFGAEPDYYVWLSEAVIAGLAAKLGDHWTSELAADWAELFSKVSAQMIAG--
177 >tr|A0A2E1AIS1|A0A2E1AIS1_9CHLR Uncharacterized protein OS=Anaerolineaceae bacterium OX=2024896 GN=CL607_22355 PE=3 SV=1
178 MSPVTSRQKLLL--HYTLLHldADQMGKLFYDHILAAMPEVAPMFTD---L------------ESQRKHFMKMMIRIVHTIDEpdhLNIVLRELGHIHK-RLHLKPRHFSKMGVAFSNSLAEVMGDRYTPEIGEAWRILYNRVAEAMQSP--
179 >SRR5262245_62462516
180 --------IFIFLLFFFFCLcf-CFMFFFFFSSRRRHTRCLSDWSS--DVC------------SSDLQKLLAALALVVRSLHTpekILGPVKKLAVKHV-DYGVRPEHYTYVGNALLRTLKKGFGREFTPELSDAWVEAFRMLAKVMKEA--
181 >tr|A0A2D6AZC8|A0A2D6AZC8_9BACT Uncharacterized protein OS=Flammeovirgaceae bacterium GN=CMB80_28915 PE=4 SV=1
182 SNTMTSESINMISKSWDLLSRdPQLVTRFYNRLFDIAPETRRYFK--DDI------------SKQSEKLAHTLNFLVMNLDRldeIKESIEDLGRHHN-KMKIKAEYYVYVKEALLTTIQETLDEQCESGMVEAWDHALSHVASTMINA--
183 >SRR5262245_55554356
184 --CVTPEHRLLAQQAFATIQplADELGLLFYSRLFELDGALRGLFKH--DL------------ANQAHSLMAMLQLTIEGLDApeqFTRARTTWGYATWTmGFSRTSTRLLRRPCSGRSSMRX------------------------------
185 >SRR6516165_4200192
186 ------AQ--------------------------------------SDL------------VDRGRA------YRLLGLADLvdrrnQAaagGLSLFHRRAV----------------------SAGGVAWADRVLDALSlylcgyelrwpQLDHALGRgavhpdacaSLLRE--
187 >ERR1700733_1486793
188 --------------SQAHGGdiVDLyRDVRLVYRLFRRLPPAEQDAIP-GDH------------RRGRLSRaAGRVAL---------APVRRAARRQ---------DRRREG-DVLELRRDGRGDDRRHVFHRDQElswlSDDV--PR-VVRD--
189 >SRR5215831_4136876
190 --KHDPPTDLARAEQLQVRCA----DRVKGRRSLLRPSLRDRSRGP-AA--------------LPRKIIRAEGKVdgdANEDRQqssSAQchFASCTPTRRaaQ-GLRCLDGSLWGSGCCLLWTLEQGLGSAFTPEVKAAWSEAYRTLAGAMQEG--
191 >tr|W5NBV0|W5NBV0_LEPOC Uncharacterized protein OS=Lepisosteus oculatus PE=3 SV=1
192 -VPLTESQKDLIRESWKVVhqDIARLGIIMFIRLFETHPECKDVFFIFREIDDLQELKMSKELQAHGLRVMSFIEKSVARLAQedkLEQIALELGKCHC-RYNAPPKYYEYVGVQFISAVKPILKDSWSPQVEQAWESLFAYLAAVMKRGYH
193 >ERR1711911_21978
194 ATGLTARQKRIIAKNWDLVRpnLKEAGVGLFIAYLTKHPEMQARFKSFATVP-LNELAANRKLQAHAANIMYSMTMLVDSLNDvecLVQHLATIGRNHR-RRHLKRHHFQDLAVVIVDFLEAALAAHWSAEARQSWTLALNVIVDQICNVL-
195 >SRR5215218_21909
196 -CAMNPEQIGLLAESWKGVAgrRDEIARAFYGVLFDRHPELRSMFAH-TDM------------RAQYEKFALMIDEIVQLRTEprqFVRSAVLLGQRHA-AYGVTRDHYGPAGAALIEALAEALGSAFTPAAREAWTEGYLLMSSIMCR---
197 >SRR5688500_19518083
198 -LLITPAP--------------------PSAIHTRYLHDALPIAH-VDM------------GAQYEKFAAMVDEIVGLRTEphrFVRSAVLLGQRHA-RYGVTRDHYAPAGAALIEVLDRKSTRLNSSHLVVSYA----VSCSIQ-----
199 >SRR5258706_7695680
200 --RHDPPPdpadPPVLRPA----RvqGRETRHLDVQAPVPARPRPTPAVQ-------------------------------------------------------------------------------------------------------
201 >SRR4026207_1847514
202 -PLMTSNQRQLVRQSFDAVRdqAGPFSLLFYGKLFELDPSARRMFHV--DL------------ALQGRKIVDTLATVTESLDRfesIRPRLASLGRQHA-GYGVRPEQYDTITAALLWAIGQALGADFDAPTREAWKLALNAVSTATIEGA-
203 >SRR5260221_10622870
204 --IVNAAQQELVMTKAEGVvlMPGVTGVLLCALLISANPSFRPLFKS--DM------------RIQGVKLMTMLAMVVYNLPEpgqVLPAIRDRSEEHT-SELQSHSDFVCR--LLLLHX--------------------------------
205 >SRR6516225_5669596
206 -NVMTPEQKRLAScfrrggppGSWRRPSppLGIETAQVFRIPCVLPN--AAVHTA-GVS------------DHNNSDTYRAALRPAH---R-AASQTASVRNHE-RIQSETAM--REGL--rrvTYARVLRTGS-hRTPYrnVTP------------------
207 >SRR5215203_7560530
208 -RPMTPDQVSLVRDARRAIesRHAEFSAAFHDALHELDVDTCALFRD-TVT------------GGRACNVGAMLDLLQQASDDpraLIEVAAELGRAHA-HAGVRDVHHHVAGVALHRALHRVLGVEFTPAMYEAWAEAFTLLIAVMERAA-
209 >SRR5215470_20101711
210 -KSMTPQQIALVQCSFKSVApiASKAADLFYDPALRDrsrgaaALPH--------RFV------------G----AEGQADGDASNGHQ--------------QSPSARCHFANRAATLRPA-Q-------------------------------
211 >SRR5919197_1191720
212 --VLTRDQADIVQLTWRAVLpvGDTFAELFYGRLFALDPQLRRLFR--ENL------------VEQGRNLTAMLSVAAANLARpekISVALRQLGRRPT-RSSRARCSRSLLRDLLRLPLDARRA--VADGVARVVVafaRAVVAIP-RVIHG--
213 >SRR5690606_39578087
214 --------------------------------------ADHLSP--LPlP------------TRRSSDLLRMLAFIVKSLDWadrqwredvnpdedLMLVVLALGRRHTELYKIPDESYGAVAEALLWTLDYGLGRSEEHTSELQ--S-------REN----
215 >SRR3954469_10060132
216 -QRMTPEHIHTVQSSWNKVLpaGNGKARLLFERLLQTETSLCGLFQ--LDG------------ATWSANLVQMIDVLVTGLSLgdrSAVLTRRVGGRNT-ACPGIEHHYDLIGTALLRTLAKRLRAEFTPRVEAAWAIVYEELVESMRKA--
217 >SRR6266508_6374850
218 NFAMTKEQIALVKNSWKLFrkvDACLIGDVFYSKLFFDNPQLRQLFP--ASM------------EERYRKMIDMLSVIISRLDRlneMTKDIKVMALRHE-SHGVKPRHCKLLGNALRWTMERGLGNDWNDDVKEAGLACYTKLIETMIQ---
219 >SRR5215475_4417451
220 --PMTPLQRRLLHQSFSRIEpfSQRLGDVFYARFFSTSPAMRALFSR--DI------------KVQQSKFMKVISEIIKLPLlsfsvtdsqdSesLVPGAYWSGMLHG-ALSVKQQDFASMKAALLWALSNCP----------------------------
221 >tr|V4A5G6|V4A5G6_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233247 PE=3 SV=1
222 -ADLTEKDKELVKSSWAKFNegdVIADGAHIYYKLFEKAPEAKEKFGFAKD---GEVSLENKQFKAHVRKVLDVFESVVREIDQlegLLPVLNDLGARHK-SYGVPLKYYEILGSCIMYAWDRKLKM--DADTKKAWGKLYGVVQTEMKKG--
223 >SRR5262249_25899110
224 --MMNTQHIARIRLSFAWIApsADVFGELFVANLRALDPSLSGLLA--AEA------------GPQGWQLISILRSIIGGRDRpdrLFWRLQSFGRRLA-GDGLCAEDYDTIGDALMLTLEQCLGERLTPDVAAAWDATYAALAEVVQL---
225 >ERR1719223_727152
226 ---PSSAQVDAVTASWDKVAalgAETVGVLLFKRIFEIAPALESELS-EKPTA---IIIGDLTLAREMT----EEEKETIDLEEkeePeeveekeEPEEVDEQETTE-GRIISTESF-------------------------------------------
227 >ERR1719336_2939639
228 --PLDERDIDLVQQTLGRVAilgLDNVGWVLFMNTFKIAPAAQGLFE-AGFLQlkplnkpfnDMPELAKSSNMKETGGRVVETLAAAVGLLRDlgtLVPILQDLGKKGV-SCGVIPAHYDIFGEALITSLQLALGANFTDPVKNAYLKVYTIVKNTMIG---
229 >tr|A0A1D8RRN7|A0A1D8RRN7_9GAMM Uncharacterized protein OS=Colwellia sp. PAMC 20917 GN=A3Q34_02175 PE=4 SV=1
230 ---MTAKQINLVQQSWQKVLilSPDVGDLFYQQLFVLRPELATLLKN--DK------------QdKirANKDFICLLSQEINLLQPielTEEKV---NTSVT-TNDV-KNYQADVENALLLALTMILDKELKIALKRAWISTIKRLVGSIVIEL-
231 >ERR1700730_15638689
232 --AMTPKQVALVQDSFAKVAltSEAAAVLFYNRLFDIAPQMKAMFP--DDM------------VEQRRKLMSMLAGVVKGLANLeqvFAGRQRTGKAAC-QLRCEGG--ALSGGRRRVAVDAGEGsGGWLDAGSGGcVGHRlWHAVRLHDFPS--
233 >ERR1712166_353516
234 -VVAQFAALNAVDDKW-----VTQGVLLFKHMFRINPGMKQMFS-FRDIP-DDELYDSMKLKKHGVSVYTYIEKAVDGWGTpeIADALQKLGARHL-PREVKMEHFDVVGESILTSLSDVFGDQFDDKSREIWTRVYGVIV--------
235 >tr|A0A1S2XZ06|A0A1S2XZ06_CICAR leghemoglobin-like OS=Cicer arietinum GN=LOC101502441 PE=3 SV=1
236 MDALTEKQEALVNSSWEAFkkNIPHLSIVFYSSILEKAPESKDMFSFLKNF--DGIPHQNSTLEAHAEKIFDMTRDAAIQLRAkgkIdlaNDvTLEYLASVHV-QKGVTEQHFVVLKEAMLKTIKKAMDDKWSEELSCAWSIPYDQLAATIKKAM-
237 >OlaalgELextract3_1021956.scaffolds.fasta_scaffold1056695_1 # 380 # 499 # -1 # ID=1056695_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.392
238 -MALTATDVEVIQTTFKeVAEnvgAEKAGIILFKNVFDAAPGAAKLFS-FGRVEgfdPAADHSTNPAVVKHATGVITTVAKAVASLTDlsaVLPMLTALGKRHS-KYGVKKEHFGIVGAAFLKTLSTALGDKYTKEVEAAYTKLWGVVSKTFREAG-
239 >SRR5271157_4306781
240 -----VSDVEFLKETWGQItDKSSFAERFYSLLLAVFPVAKPLFSK-TDW------------QSQYSLLMASIDYMVMGIKygrNIQPTLHLLGARHD-YYGVAPVFYIPFNACLLITLQK------------------------------
241 >SRR6266566_5437046
242 --DLTPENCDFMTEHHDL--------RILGRLVATE---------------------------------------------------------Q-EQPVKDPDHDQIeeatrhrprscPTLFIWPNRRSQPLhrvlmRYMPvpgpRSPPSWCGPPSRSRSHGPRttT--
243 >SRR5579859_7196529
244 -GARDD--T-----------gsGQaCSAEFLQGR--------------T-HR------------RSGGDpVLRSPVRNCAAGQSDVsrrHDRTAEKADRHA-CGRCeRSgrLALDPAGreracq--TprrLWRQGcalpgrrrrlvvdAGK-GIGRgvdarrrrrmdhrlrhavrfHDFRSLWQCPG------------
245 >SRR6185312_354929
246 ---MVR--A-----------rgSAkC--WKCRWR--------------D-RA--------------SVSnSLPAPATSSAGSACSNfs-------MNGTA---SSkQPefDRVPRGGrgrgrrrKMTpeqVSLVQqsfakvapiseqaavlFYD-RL-FevapavkamfpadmteqrkKLM----------GTLAV-V---
247 >APLak6261666328_1056055.scaffolds.fasta_scaffold241778_1 # 2 # 196 # 1 # ID=241778_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.415
248 -GAKTAGGL---NLLFL--AivSS----EPENGFVTISPAAKDLFP-A-DL------------TEQRKKLIATLAIVVNRLSNLqsiLPAARTLTKRHV-NYGAKPEHYPVVGSAVLH-AGgrPRLGLDARSRLrsdGCVWHAVRLDDgrnleHEFANL---
249 >SRR3954463_16408791
250 ------QQITLVQESFARLAhdKARFGASFFKRLFKVDPTLEQSFAG-VD------------MQAHALKLVDAISFVVGGLRQpetLVGPVQKLGAARC-CRRCPTSSRTSGPRSSVPPGT-------------------------------
251 >SRR3569832_1984102
252 ----------------------------------LEPKARSMFNF--RAD------------EDleaNPQFMVHARAMVDMIdmavgflgPDldpLIEDLSHLGKRHI-SYGVKPEYFSIMERAVMFAMEELLDDKLTKEDRTSWQLVFHFMITH------
253 >tr|B3SDK5|B3SDK5_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_62364 PE=3 SV=1
254 -SYLNYQERQAIIDSWNAIstEKQKYGTILFLKLFELEPRVKSLFTIF-DFN--EpleDIIQSPHFRSHAMRFMQSLETGVLMGFDkesCDFLFKSLGSRHH-FYDLKSEFLDVIPECILHTIKKGCGNNWSNETADAWKIATKVLCELFREG--
255 >tr|C1C1M6|C1C1M6_CALCM Non-symbiotic hemoglobin 1 OS=Caligus clemensi OX=344056 GN=HBL1 PE=2 SV=1
256 MSILTSNELSLISESWKLVvpDLEHHGLSFFLKLFEEYPTYQEKFFPELH-------QDERKIQRHGAIVLKSVGK-LVAFLEankviaLVDAIKRLATNHS-RRGVLREQFYPACRILLEYLAQALGTHLSTEGALAWKRFLGTFVELMQ----
257 >SRR5450759_1049036
258 --ALTAEaPYSELKnlCVWSKT------NAGMGSLYRSQHELVFVF-K-NGMRPHINNvelgrfgrnrtniwnyAGASSFGstrdselamHPTVKPLSLVADAIlDCSKRggivldafagsgtTLIAAEKTGRR---GYGTELDPFYADT----------------------ivrrFEDAYGL-KAVHVE---
259 >DeetaT_11_FD_k123_441726_1 # 2 # 373 # 1 # ID=403715_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.481
260 --GLTDLQIEMIRSSWEKVTpnKKHHGQLLFHKLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
261 >AP82_1055514.scaffolds.fasta_scaffold664619_1 # 53 # 358 # 1 # ID=664619_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.458
262 ---MSGFALRLVLTQRQKATrkrpiaqyvienhSINFAFHYIDRLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
263 >SRR5210317_1560035
264 ------------------XmtSL----KSSMIGFFRNHQNCAKMFGE--DMR------------DQAQKLAAILQVAFDNLDHvdsLVPILEDVGAKHA-TYAVTPEHYGLVAAALIGTISTELGDAFDERAAESFEAVLGTVANVMISG--
265 >tr|A0A037ZKD6|A0A037ZKD6_9RHOB Uncharacterized protein OS=Actibacterium mucosum KCTC 23349 GN=ACMU_09600 PE=3 SV=1
266 --MAHKGRVQTVRDSFQVVrtDADAFARGFYDRLFAKRPEMRGLFAD--DMS------------AQQAKLVTTLVTAVNMFDTpsqLIKPLKQLGASHA-QMGLSQADYQLVVDTIIETLETTLGSAWDVAHDRAWRGLLDFVSNVMQEG--
267 >SRR5688500_932283
268 --MLSDAEKQAIRESWQLVLpvVETAADLFYRRLAEQNPALRARGQ--DQL------------VAQRKEFVTTFSFVVRGLAWeasewrsdapdeddLFLGMLALGQRGSRLARLIEQHYSATGDTLLWTLTYALGKRFDAKARAAWMRLYTLLAIALR----
269 >SRR5688572_29427622
270 ---------------WALCAprADLLAAAYYQRLFERLPALRIRFP--ADL------------APARQRLVGLLRFVARALYWpaddwrrplpieedLLAILLALSRRHRGLGEVDDAVRAVSREALVAAIGEILAGEANPSIIDTWGKLHDLAADAFVL---
271 >APIni6443716594_1056825.scaffolds.fasta_scaffold11231735_1 # 3 # 137 # 1 # ID=11231735_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.400
272 --LLTADERAVLKLDWSRLTrvdQQDMGMRIFLRIFELEPSTKLSFPELYHL-TGDQLISNTLFRCHGARFMRAVAAAVDNVDALdlvvIPNLIQLGRLHQSVDGLRWRHLEVFEQAMTEVWAVELNLSgswSGSTSAVVWSKVFRLITSKVYEGFQ
273 >tr|A7RWR6|A7RWR6_NEMVE Predicted protein OS=Nematostella vectensis OX=45351 GN=v1g203304 PE=3 SV=1
274 -CDMTYEQKYLIRETWKFLEvsKKEIGVSVYKRFLNMHPGLQTYFSEFKHIKID-NI---NGSHGHPRRLLMAIDNAVTALGDsdsFSAYLVELGRRHH-GMnfRPGPTHFNDLRKCFLSVIEEILATAslWDFQVEEAWNRLFDSITAMILRG--
275 >SRR6516164_7981020
276 -SPLTEAQKRLVRESFESMQeyETSVVVLFYGRLFEIAPETRTLFKI--DI------------REQSRSSWIPSGL------------------------------LSIRLTISWNCRQLLR---------NWDESTSltAFSPITMGN--
277 >SRR6185503_3589201
278 ---MKAEQLELVIDSLTVIQpiADQIAKSFYKHLFEIAPQTKKLFT--GDM------------DRQGIMLITSLSLAVNGLSDmenTLPSVQALGERHY-SYGVKPEYYQPAVESFLWSLEYHLGDQFTPELKESWRTAFQALADTMLSVY-
279 >tr|A0A0P6AJ75|A0A0P6AJ75_9CRUS Globin OS=Daphnia magna PE=3 SV=1
280 MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGALLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDNLSKVLIDFLAAKLGGEFTPEARQAWTATMQGINTVVEA---
281 >tr|A0A0P5NXY2|A0A0P5NXY2_9CRUS Globin (Fragment) OS=Daphnia magna PE=3 SV=1
282 MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGELLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDVSKS-FSNFEC-----PENEVSRKDWTKNLSILQ--------
283 >tr|Q93101|Q93101_9ANNE Nerve myoglobin OS=Aphrodita aculeata PE=2 SV=1
284 MAGLSGADIAVIRSTWAKVQgsgSAtDIGRSIFIKFFELDPAAQNEFPCKGESL-AA-LKTNVLLGQHGAKFMEYITTAvNGLDDYagkAHGPLTELGSRHK-TRGTTPANFGKAGEALLAILASVVGGDFTPAAKDAWTKVYNTISSTMQA---
285 >tr|A0A210Q3Q0|A0A210Q3Q0_MIZYE Neuroglobin OS=Mizuhopecten yessoensis GN=KP79_PYT10061 PE=3 SV=1
286 -TYLTPRQIHLVQDTWDIIkdDLSKLGVIVFLRLFETEPDLKHLFPKIVQMNEQNKLeWDIDrdMLTKHAVSVMEGLGAAVESLDEsefLNSVLISIGQTHV-KRHVKPQMLKRLWPSLNYGLKQVLQSKYNKEVNEAWKKVYFYIVAHMKRG--
287 >ERR1719460_671936
288 --MVDAVVKGDVQRTWELVIPpdsgddhvFAIGKLFFDRIFEVTPGAEALFS-FKGE----DRAESAKFRAHAIKVIKTVGVAVAKLDDletLVPILEDLGKKHV-AYGVVASTTT----SSVWRCCGRSRRGWATNSRPTW----------------
289 >ERR1712223_635401
290 IPKLTAEEKSVLQASWANVNkkIEIAGAQTFIRMFESNPETQNQFRKFQGMDL-VQLEQSAEMAQHGKRVLSIVGMTVDNLDNyqiVWDNLIKVGREHF-TFGALPMYFDLMGPHFVIAVRSCLGNDWYEALEYHWLALFNMIVYAMKFGWN
291 >ERR1712062_404977
292 --ILTNQEISVLKSSWELIAkkIEIAGAHTFLPTFDRDPKCPDN------------------IERHCQRVMSVVGGSIELINDyksLWKHLISLGREHF-GKIREWIFASIAGGSTersgcspssINFLSSKINGNITSKK--CFLQ-YKIVIITQX----
293 >SRR6266567_6698575
294 --------------------LIVFTSTCLWSI----RKPNHSLPKR-IC------------VVKLAHCWLHLTTVVAGVlreDNLVPVLQQLGQRHK-SYGVKAEYYPFFRAVLLETFQHYLGPRFTPKMQQAWEEAFEMISTQMLKGA-
295 >SRR5215217_5048650
296 --RVTARGRAR---HVLLRApvRDRRGRGTTVRRHRHGSAA-----------------------PQ---VRRDARQDRARSGRaatLVPDVAALARRHV-GYGVEDRHYTSVGEALLFALGDTLGDRFTSDVHAAWVEAYALLAALMQR---
297 >APDOM4702015191_1054821.scaffolds.fasta_scaffold152199_1 # 3 # 686 # -1 # ID=152199_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.531
298 ------------------------------------------MS--GDF------------SPEQKRYLEGFTS------GLq------IARTGR-GLG-KPAASVPSGPD-----AEHLIAQDQ-----------------------
299 >SRR5262249_5171126
300 ----EPDSALLVQSTIG-VLvqhQRRFTSELYRRLFGLAPGAQALFRS--DM------------ESQGKMLAHMLEFLVYATSRpetMTLGWRELGRGHD-GCGVGAEYYPAFRQAFLESARVVLDEKHTPQVEKAWADTLDMMIVSMLGP--
301 >APCry1669189000_1035189.scaffolds.fasta_scaffold267513_1 # 3 # 467 # -1 # ID=267513_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658
302 -VVLSDQHKKVIVRNWTILStdLSGRGTRIFLLIFGRNPLIKSIFS-FGHLE-GDELVCDPRFKGHALRFMQAVGAVVDNIDDynnaVKPILNDLGRRHTQFKGFKPIYFNEFQDSILQVSENGTCKQngeiriLNPSaagvnfCTPPLGKFSASEMTCIVSsGA-
303 >tr|W6FSH9|W6FSH9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_a PE=2 SV=1
304 -LDFSDDQKADIKSTWETLYsgnKFQLGVELMANLFKAHPDYQDLFPSLKGIPD---VAGSNELRGHAIRVITGINNFVDALDEeeevMREMLHNMARSHK-PRKLTKTHFNEFAPILLETFEKKVD--MSSKARDAWIALYYSIVDNLFAE--
305 >tr|W6FIG9|W6FIG9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_b PE=2 SV=1
306 -MVVSAEQKALIQGAWTPIYagnRFQLGVDIFAHFFKAHPNYANLFPSLVGVPN---PSTSVELRGHAIRVLTGINYFVAALDEkkpvIMEMIHNMARSHK-PRKLTREHFAQFAPVLFDT----IG--VSGPARDAFLPYYNFIADNLFAE--
307 >tr|A0A023RLQ7|A0A023RLQ7_AERME Globin OS=Aeromonas media WS OX=1208104 GN=B224_3582 PE=3 SV=1
308 ---MTPEQIELVQRAWGRVTalNNTYVQEVYAELFRLSPDLINLFPDPAG--------------MPVTKVSETLNTVITSLEQLdalGFIIRDLGRRHR-QFNVQSHQFGLLKQALTLVLARRLGEHFTPALSEAWSQMYDEIAALMLEGL-
309 >SRR5437899_2276119
310 -------------------YpaVQKSGAAVYRPALVAELRDRPY-E--FDI------------QVQLCVYLARMA--------leIVAALN-----AA-GWICVPKDPSPEM------LKAAWAYALDEDAAGVWKSMIAA----------
311 >ERR1700757_2961956
312 ------------------------------------------------------------------RFNRLAGRERRAPARtr----ARQSR-------QRPGPSRHDPTrLALSD----------VSEAERTDIVVS------------
313 >SRR5215213_1430710
314 ---------------------------------YLYPFLRPMFK--ENI------------QLQARKFSAHVSLVIGNIKDrntLQPMFEEMRNLHL-NHNVKTHHYNYVQEALFYALKNHLVKEWDEHTESAWIKFYNIMASQMAA---
315 >SRR4051794_22176940
316 -NRMTEASLQRIASNYELLAgqMQVLTGAFYKRLFAAMPEAQPLFR--IDI------------DLQSQHLAAALALIVRNIRFfdaLEQPLKELGVHHA-HVGVRPEQYPVVCRTMLETFREGSGQSWSPELEADWKAVLELVSRIMMDG--
317 >SRR5262245_41201456
318 --XMTPHQILLVKTSFQAALtqRERIAGFFFAELFAREPAMWQLLR--GKT------------GMRWPALVDGLAAIVGSIHRihsIEPVLQWLSWQGA-VRGVGEGQYEAVGQALVAALEAGLGEAFGSEHRRAWMVAVGKVADIMARA--
319 >tr|A0A0N9QWL5|A0A0N9QWL5_9ANNE Intracellular single-domain globin (Fragment) OS=Eulagiscinae sp. JPG-2015 OX=1732542 PE=2 SV=1
320 ---VSDAQKALIKSSWAGVDLNAAGVAFLNQMEQKAHDVYAVFKV-G-----GGATSNPKAAALGLKVMTFVDEAVKGIDDMgavGGKLDELAQRHT-KYGAKKAHFPVAGPCFLDALAEVCGGRFSADARAAWSDFYDVIAQHLSA---
321 >tr|C7FFW0|C7FFW0_BRASE Extracellular tetra-domain globin (Fragment) OS=Branchipolynoe seepensis OX=326992 PE=3 SV=1
322 ---VSDAQKAAIKASWAGADLQAAGTGFYVHLAAEAPAVYANFNL-G-----ADPH-GAKSQEQGLRVMKFVNQCVNSIDNMaivQAKIDALAHRHM-SYNVKKSDFVPAKPCFLGALADALGGKFNADARAAWAGFYDIIAAGLST---
323 >ERR1719261_40108
324 -------TIAVVQGTWQEIKdalgdgvAETAGVILFKHIFRIAPQALALFS-FKDCAGgnvCDELFENKTLRKHAAKVVGTVDTAVGMLKktrQADSRPGQSGQEAR-GLwggagalrcgrgGVVGDAVGRVGRRVYDRGPRGLGGGLRHHQNHN-----DRQELRLHGR--
325 >ERR1719238_2294225
326 -----------------------------LKVA----SALREFN-TLRAEGivsEQEFLEM------KAKLLAVGKDELG-RSpsgDTLETLVEAThemdssRRRT-RWtrrarraSRSPTTVGVISCQIK--------KSSTRRTTRRW----------------
327 >ERR550532_3331206
328 ------------------------------PLF----PAAH--R-LCRPDGhdgCS---------------------VFGPDRppgE------------------APSTKDIVVTVIL--------X--------------------------
329 >SRR2546430_16462751
330 ---------------------------------------------------------------------------------flLSVVIA-----CS-CWCRHVSSlqhdrad-------HPVGLCPGIVADWSPALSQNVGEGFQQDCSD-dG----
331 >tr|A0A0P6RCU1|A0A0P6RCU1_9RHOB Flavohemoprotein OS=Phaeobacter sp. 11ANDIMAR09 OX=1225647 GN=AN476_12305 PE=3 SV=1
332 ----ASTCKALVLRSFESErmDLEAFIPLFYSNFFEAYPEARAIFPT--DT------------ERLEAKLLASLTHIAEALESserLDGILSELGQKHR-RMQISDSHFDGFIQSFIRSLATTLGPEWSDQSDEAWSQFLRYVAKRMSFLE-
333 >tr|B7QTL6|B7QTL6_9RHOB Globin, putative OS=Ruegeria sp. R11 OX=439497 GN=RR11_330 PE=3 SV=1
334 ----APADRDLILASVESQkmELDQFVSLFYAKFFERCPDTRPMFPH--DM------------SLQEEKLLMSLTHIIEALEHpakLRLILLDQGERHK-ALQINDDHFAGFIDSFTGALKDTLQEDWSEETRQAWLRFLQYVAYQMGFLK-
335 >SRR6218665_311178
336 -TPIYAGHRDVIRRTWPIIAdqMNANGCQIFLCIFELSPGIKRVFA-FGPAMSGAQIVNHPRLVQHASRFMEAMQVAVQHLDELdtvvSPIFINLGKRHIYFEGINADYFNVFSGAILYTWRQVLGERFSAEVRSAWSRLFDFVIQHLRFGY-
337 >GraSoiStandDraft_9_1057307.scaffolds.fasta_scaffold3427870_1 # 1 # 249 # 1 # ID=3427870_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.747
338 --------ADVIFDSWDAVKripdyDVVVGEMMFRKLFENSPSTLKNFS-FGPRFagKEESLYKSRTFEIHTKAMIKMLEDVLSMIMpDlvpMKKTLKALGARHV-TYGVRPNHYELATEALLSTLESLLGYRWTPQVEEGWKTAIGFITNTMVAG--
339 >tr|A0A2C9KJS1|A0A2C9KJS1_BIOGL Uncharacterized protein OS=Biomphalaria glabrata PE=3 SV=1
340 --YVTPKEKELLRSSWNIVsqDISGVGMNIFKKLFDIETDLMKLFKRMLTKGeTGQVVVDSIRLEGHATGVLRQIGLVVENMDNnsaLTTTLIALGEVHA-NYRVRPEMLPLLWPAIRDALKIACEDEFTHQMELAWKHLYDFVTCHLSEG--
341 >tr|A0A1Y5RHX9|A0A1Y5RHX9_9RHOB Flavohemoprotein OS=Palleronia marisminoris GN=hmp PE=3 SV=1
342 ---MPNDDMRLIQPSIARIFvvRRSIGQAFYERLFERQPTFRTMFPT--DL------------RTQARTFDDMIALIVKKTGDpeaVTPVLLAIGRRYL-TYGLRPQDLRVIGEVLMEVLCAQTPGGLSPDEAAAWERSFSRAAEVVKL---
343 >ERR1719321_586101
344 --ELSYSTVSTVIDSWESVKrqenyAENLGRMIFIKFFDREPEAKTIFGFDGKKMKTdDEFYESRAFLAHGKHFVLILNKAFDMLGPdlemLTDILLDLGGTHRTKYGVKPEYFPVLGDALLECIEEMSDPeRFNDETKACWLEAYNALTEIMTT---
345 >tr|A0A2D6RHV2|A0A2D6RHV2_9GAMM Methyl-accepting chemotaxis protein (Fragment) OS=Colwelliaceae bacterium OX=2026726 GN=CL811_09640 PE=4 SV=1
346 ---MTPKQNIAVIESWKKVQpiASQVSQVFYDDLCEKHPSLKALLG--EELS------------SARDQLVAYLNSLVETLVATdevv-I--EDL-AKH-LRIGLAPEQFSDVGPALLTSLEIGLEKDFTATVKRAWTALNKLIVAAMAQ---
347 >tr|B7J6S4|B7J6S4_ACIF2 Globin domain protein OS=Acidithiobacillus ferrooxidans (strain ATCC 23270 / DSM 14882 / CIP 104768 / NCIMB 8455) OX=243159 GN=
348 ----MAINIQLIQSSGAAVkdLGVQVAEHFYNYMFTHFPEVRKMFPG--------------DMSEQRVRLFNSVILIATNIDTmevLVPYLKELGIGHI-KYDTRPEHYPIVGKSLLNTLKHFLGAAWTQEMAESWIEAYNLASTVCIEA--
349 >tr|A0A1Q9NIM3|A0A1Q9NIM3_9ARCH Bacterial hemoglobin OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=vhb_2 PE=4 SV=1
350 --SLNTKDIQLIKNSWEKLteNKKEVRNTFYTGMFEDDPKLKSLFRE--------------SFLSWD-NLPDSFEFMFKHLENlegEILEMKRLGLKHK-TFSVKPKHFPIGRKSLVKTIKQYMGDKYTEELGAAWTKLFDYMSHYMILG--
351 >ERR1719419_74415
352 --PFTPEQRTLINETWGNISTKEtgsmgmLAKQVYERLFRSAPGIKRLFKD-SDM------------LAISRAFGGMLGVLVSAVNQplqFQHIVKGLGVRHQ-VYGVKPDHFRIMYTSLVRTFAQILGDKFTSEHKKAWSCLYNWVIDAMQRSMR
353 >ERR1740128_1504408
354 ---------------LGVSYlarhIVPVDVRFLKEHVKTLFVLSqR---MPGNFV-NETLETRATLLYETLLVMSNLNYWVENLDELdlvVASIQKMATNHA-GRGIMAAQFETIGAVVVEYLKAGLKEALTEEMAGSREKLISTMVSIIKETN-
355 >ERR1719354_333269
356 -MGLEQSDVEAIQRSWEIVKetakLRVHGVNFFEMRFEMIPDWReKYFSHMGP-------KTSAKFRSHATMIMMTLDSWIENLDDLdlvVDAVLRVGQTHA-DRDILSPQFVEINKVIIVYLETGLGDKFTEEMKESWIKLLDTVVTIIKDGN-
357 >SRR5215207_9441599
358 -----PEQLALVRGTASIIDavGDSFAERFDDHLFARYPAARRLFP--DDT------------TTHRGQLTDEIVFLVAAAADlhaLLERARALGAPPP-LRRtrrrlparrrgTRRRGRGRRGRSVVGRNG---G-SLA-----------------------
359 >SRR5690349_3556304
360 -TYLTGQQVLLLKKSFRQMNPAQIAAQFYGTLFQQHPEVKSMFPA--DTV------------ELGSKLMSVFELVVFSFDEKehgrfglqdvlIKPLRALGRKHD-DKGVKPEYYEIANSLLLKIMKE--SEYFTTEMYQSWQLALEHLTYAMQDK--
361 >tr|A0A2A4JK54|A0A2A4JK54_HELVI Uncharacterized protein OS=Heliothis virescens OX=7102 GN=B5V51_782 PE=3 SV=1
362 -SGMTLKDVYNVQHSWKTINanPLDNGYLMFFRLFEVNPESKTFFKILDNARTETEMRDNVRFRAHVLNIMAALNNSIENLNKpeiVVVWMEKLGTAHR-RSHVQERHFLIFKDVLVNILKNDLK--LSEAVVKSWGRYVTFIYSYILP---
363 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9902871_2 # 1417 # 1767 # -1 # ID=9902871_2;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.538
364 -----ALDTKLIKDSFELAKpiSDKLVKRFYENLYSDYPQSKSLYLD--G-----------QLPESQLAILKAINFIVDNLHNkekLGTFLKTLNERYE-LRLNDSVINQSVCSSFLKTLSEAFGSDWTSELAEQWELTYQMVTSFFQDSK-
365 >OM-RGC.v1.013389558 TARA_082_DCM_0.22-3_C19717715_1_gene515718 COG0552 K03110
366 ---WHGESVTTVQRSWARIQqlgLENCGTLFYNTLFERWPEAKQLFSLSvrlkhrapgESEREGPDPTNSPALRKLWGKLLSVVGSLVSGACNpaeVVPTFHAVGVRHA-GYKLKVAHFDAFGGVMASVLKHLLGEEFTTEVQHAWTLAINFLTANIRAGFV
367 >tr|A7C4X7|A7C4X7_9GAMM Bacterial hemoglobin OS=Beggiatoa sp. PS GN=BGP_4395 PE=3 SV=1
368 ---KQHDTIFEIQSTYEKILphLDEFSRLFYQQLFEIKPAFKILFRQT-DL------------RIQKQMVIRMIEVVVQGINNlenFMSIIQRIHQRHY-ELHLKPEDYRLAGQALVLSLEKYFGDEFTPTLKKIWLDFYESIVATMMN---
369 >UPI0004291969 status=active
370 ---KQSDTVFLVQSTLEKVFpqLDEFTNQFFKKFYELDPSVKEIFYEI-DA------------KNKKQMVVNMIGFLTQGINRfdvIIPSIKEINERHF-GREVKPKYYLIASKALVNVLEDYLGEDFTPEVKQTWIEFYEQIVNFMEA---
371 >ETNmetMinimDraft_35_1059890.scaffolds.fasta_scaffold55614_2 # 1284 # 1421 # 1 # ID=55614_2;partial=01;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.623
372 ---KQSDTIFLVQSTLEKVFpqLDKFTDQFFEKFYQLDPSVKKLFNGV-DS------------KNKRQMVVNMIGFLTQGINRfdvIMPSIKEMNERHF-GRDVKPDHYLVAGKTLVNVLEDYLGKDFTPDVKQTWIEFYEQIVHFVED---
373 >ERR1719506_1011120
374 -GPITAREGQIVQDSWKAVKkvGGESGHAvikdIFYQHLLKDPNVKQLFRN-------------SDMKLQATKLWQTLHVAVDGLSTsgpWFLCCRIWARLTS-STGSKRS------TSMPWVRRsSTrspraWGPRsrrssrWRGRKCTAWLLRRX-----------
375 >Cyp1metagenome_2_1107374.scaffolds.fasta_scaffold42158_11 # 5761 # 5952 # -1 # ID=42158_11;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.578
376 -RFLTVAQQNEIIATWAIIKeshaSEAIGMDVFKGLFISAPETFDMFDSFKKDP---DWQNNVHFKHHCKVVINVIGSFVLLLNQpekLISHLEFLGVKHN-FMTITPLQFELLGAELLKAFNKALGARYNSLTKKSWTIFYNKIAEVMQTN--
377 >SRR5688572_5289639
378 --TVTPDRQQLIRDSWRALEpnGPRLVELAFLHLLQIAPAARPLMTG-HSL------------PCVCRNVASILDQLIAALDEpkqFVPLAIGLGRSNP-GHGINAALYPAMGEALLWALHLQLGEGLTPELQTAWLEYHHLVSAIMRRA--
379 >SRR5690349_12423264
380 --XMTPERQQLVQSSWRKVEpnAARLVELAVLHLVSIAPSVRSHLDG-ATL------------PLLCQRIAAILGRLVETLDEpkqFVPLAISLGRENP-DRGLTAKLYPAMGEALIFALHLQLGDAFTLELQAAWLEFERLATAIMQ----
381 >SRR5215467_4845699
382 --------------------------------ALTWPLRR-------------------------RCWGKLLWpswiiwkmCPGCSRPSrswAPSTLGM---------VLLPRCTTGSADALVATLAKPNGEQWTPAHTDAWGEAYRAIVAMMLAGYP
383 >SRR5262245_32871681
384 -------DPQILRETLELTLaaDDSFPKRFYDRLFTRHPEVIPMFHR--NSP-----------GAQRKMFAQKLIMIVDHVEDpawLARELRTVAQSHV-RYGVRPEMYAWIGEALIETLRDACDSDWSESAERAWRNAYTKIVESIFEV--
385 >tr|A0A1C4TW82|A0A1C4TW82_9ACTN NAD(P)H-flavin reductase OS=Micromonospora haikouensis OX=686309 GN=GA0070558_10167 PE=4 SV=1
386 -----RAVSADLGPSWAATAaaVDRAAANFLDTVSDRLPGLLP--------------------ERDHTVVFAALGRLAGGVDDtagRAAALAVLARAHR-GVGLLPQHADLLGDALLAAVARENRAHWTAALATGWERGLRRAVTAVRRA--
387 >tr|R4LFD5|R4LFD5_9ACTN Globin OS=Actinoplanes sp. N902-109 OX=649831 GN=fhbA PE=4 SV=1
388 -----GMDPaddaalnEvrrLLGNSLSMAGgpME-VAGRLRAALAQAQPTLFATLPG--GP------------VAQVEQLAEGLTWLIHHVDQppaLVAGFGRLGMALA-ECGVAPQQLQLAGAALAEAMRAGmAAHGWRQDFDQAWRSTWQHAYEWIAHG--
389 >tr|A0A1H7FRI4|A0A1H7FRI4_9ACTN NAD(P)H-flavin reductase OS=Nonomuraea pusilla OX=46177 GN=SAMN05660976_00171 PE=3 SV=1
390 -----MLGFQRVRDNFELVAkyGDGVPLYLFSDLFLRVPQLREMFPV--NM------------RSQRERLMGALAFAVEHAGDlaaITPYLHHLARSHR-KFGARPEHYAQWSVSVVNAMRRFSGSAWDDELEREWRDFLTAVSQVMIDA--
391 >tr|A0A210PV81|A0A210PV81_MIZYE Globin OS=Mizuhopecten yessoensis GN=KP79_PYT16126 PE=3 SV=1
392 PLGLTERELKMIKVSWDVLAedKKSNGVKFFMTLFTIFPTSKDLFKHFKDVPLDQLKydgettKSNKKMVAHAMSVMYALESYVDSLDDaycLEELVKKVAISHK-PRGIGPDKFKLLTPVLHAVIEDLVKDDDSvdlETIKSGWTKLIDTVCDIVEK---
393 >tr|A0A1L4CYV2|A0A1L4CYV2_9PROT Uncharacterized protein OS=Silvanigrella aquatica GN=AXG55_04100 PE=3 SV=1
394 -----NIDIQIIRDSFELTKpiGDQIINRFYENLFLEHPELKEFLSR-GDI------------QKQKEILLNTLVTTIDNLDKpesLSSFLIHLGEKHL-NYNMIEMYNDFIGRNFIKTLSQFLGRYWSDELNRQWNEVYKFISLNLKKG--
395 >SRR3954469_16801024
396 -------NYALLRNSFEKLKpvAGKVAERFFDILWNDYPETRDFFKN-TQM------------GPQKFAFFQALVFIVENLDQpesLESYLRGLGASHS-AHGVKKEYYGWGCAALHKTFAQTFADEWNDTLSFEWTKVFAMITSLML----
397 >SRR6266851_5623532
398 ------------ACTSPSVRstT-------------------TCAG-----S------------TRNSGYPAGPnSPTHStriSHDTRTDrigpkLIRVHRRRRA-RDGVRPRHYRSAGDALLGALAAHLGSDWTPAAESAWRRAYNLVAEIMIA---
399 >tr|C3Y526|C3Y526_BRAFL Uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_98913 PE=3 SV=1
400 -TGLTPTQSRLVKESWKMFlsKKRENGFVIFRVLFTDYPVTRKLFKGVEQldLDAPGQLESSITLRAHVTRFMHSFDTYMESLDDpedLKQLLYDTGKSHL-IHDIKPEYFDVLETVLMKSLRIVFGSKLTPQLEEAWQTAYSHLKVTIKQG--
401 >SRR5271166_2850757
402 --RWMRPKRNSCARPSPKSRrsPIKAGAMLYEKMFALDPDLRRLFA--IDI------------ETQGAKLMAVFATAIANLHRldeILPTVRELGRRHV-AFGVKDRDYDTGGVALVQTLEAGLGDAFTPAVRDAWMACYEAITGEMKA---
403 >SRR6478735_6705068
404 SPSLTREQKRHIRETFAIIEpaSDLVARLFYMKSVDLDPSLGVLFKS--PN------------RVQRRKFMAAMKVTVLSLDRlqsLQPILKLLGARQR-EEGVTPGHYETFQDAWVWTLEQALQARFPREAKDAWSSLLGEMTAPQRPR--
405 >tr|F2Q9X8|F2Q9X8_BRAFL Globin OS=Branchiostoma floridae OX=7739 GN=lGb13 PE=2 SV=1
406 --PLDAWQRFYLQKSWKTVArkSDQAARTVFLRMLQDNPGLRQKWPRISLL-TEEEIPTSPYIKFLGERIFDCLDYIIDNLGDLDhviSELTKLGRQHSDMNVMTPEDVWAIEAAFLAGVQECLEDRFTIKYEEIYSRFIVFVIETMVIGFD
407 >tr|A0A226E0J1|A0A226E0J1_FOLCA Hemocyanin OS=Folsomia candida GN=Fcan01_14017 PE=3 SV=1
408 KVQLTPDEMIAIKRNWEVIHqdLTGNGMDMYLHWFAAFPHMQKVFKKFAQVP-RDQLKTNDAFKAQATVTLHWIDDMIEAIDSpsdMAAVMKRLGRMHQ-TRHTNIYDFREMVKRIQEVIGTKVGEGYTPAAESGWTKLFAKLVENIGD---
409 >ERR1700732_4531564
410 -----ASPNGRRNSARASmlISsqPIRRSPRFSATTW-----------------------------WHRPRC-SCSLWVRSEVNRmeeLGGGLCALGERHV-DYGVKRADYNKLASVLIQTLKEFLVDEFTVELQHAWGTVD------------
411 >SRR5258708_12476517
412 ---------VLWEWLVDVGGarWRWFGGRLLEIFLETSPELRSLFHK--DI------------AQETGMLEWMLGSLVKGLNRlleIEGGLRALGRRHR-DYKIDQADHEKVLRALLLTLAEFVGDDFTPQVSRAWKTVYGKIPDTMTDR--
413 >SRR5882672_7954690
414 -----------------------------------------------------------------------------------------------HYGNANRYQGVRPSRCIpGESSR-----HRPHGASQPSVG-Q-----------
415 >SRR5215469_12962076
416 -------------------------------------------------------------------SLSARAGRQAGFGl---SG-----------LGSAAT--taiPTPSTSLTGSTARTTG--cSAPYSR-----TGT-----------
417 >SRR6266704_5570200
418 --GIN-----KTPGMFEKISssMPLGRVA---TVDDIIPFISFLAS--DD-----------------S---KMITGAEAGGNs--fVLVLTNLRNIH------------------------------------------------------
419 >SRR5205807_5077868
420 ---------------------RVGHGRVYPRLYIIARHAAGIYAL-TRP------------VAKPgRPRPVCLVPIHKDIA--vmrVTTDQLLARTPL-GrFGEAAevgqlVHYLVSDAA------RFVS-GATVTIDGAWTAYGGWALR-------
421 >ERR1712137_931585
422 -------------------------------------MGTSLLG-VDCE-GEEFVKT-DSFVPQAKKFIGLCDSFIDMLGPdaelMAKILEAEGRKH-EKLGIKLEHYSTMGEALISGVKTL--DeKFNDETELCWKLVYCGVTNNLGKAN-
423 >SRR5437868_6667390
424 --------------------------------------------------------------------------------------REIAASD---------ESEGVGDAEI-------DERRSNRLGDVHRSALGprpvtvrdnhgtrtaVKEGSIRRGV-
425 >ERR1740124_2148144
426 ----------RTRGAAALLLqgrAQPCGVAQAQEACYVCDEHCRCCSQ-GSgGP---QQacarATGPPAHMPYA----THRCRVCCRIGiraRAPPTQALGKRHV-PYGVLPAHYDVVGQALLATLEGGLGAEWNDQVKASWTAVYGIIAKTMIG---
427 >SRR3954451_929548
428 --SMTPEQMQLVRLTLAQAtaDPLALGRDFYRRLFVLAPDLRARFH--GDID------------AESLKLKETLTLAFGALTDmrlLVATLDGLAKRDV-ARGLSEQHCRAIAQSLIWAIERRVGSDFTHQVCNAWIAFMAVAMTCLHG---
429 >SRR4051794_5741567
430 --SMRPEQMQLDGLTLADAttDRLARGRDFYRRLSVPAPYLRGRCD--GDVD------------AESAKLKETRTLALRMLGNmrfMVATLDAMAKRDV-ARGLSEQHCRAIAQSLIWALERRLGAGFSRQVCTAWTEFLAVVMTCLHG---
431 >SRR6516165_10653891
432 --EPSPNQLHQNRPD---R-RPGGGTLLWPPLRDGSR-NPGAVL--QRR------------GRTGSEANGRSCNRCEQSRRFrgdRPHRTRS----C-KAPRRPEHYALVGSALLWTLEQGLGDEFTPALRAAWAAAYCALSEVMIA---
433 >tr|A0A1X7UGV4|A0A1X7UGV4_AMPQE Uncharacterized protein OS=Amphimedon queenslandica PE=3 SV=1
434 -MSLTSAQVALIESTWKVVKkdLQGAGNIMFLKLFQIDVSVRDKFP-FRDVP-YEELEDSESFLKHSLQVMETIDLAITLLlGGemekLVEALVDLGMAHA-MQGLKPEDFDHVGEALVHALGVALGKEFNDEAKKAWTLLYSVVTAKMKEGL-
435 >SRR6266699_274039
436 -------QGELLETSFQAIVlhGEAFVTAFYERLFTRFPETRAFFAA-TDM------------LEQRKKLQQTLALIVQHIQHpevLGDMLQELGQRHV-TYGIRPEHYPSSERCCWRLSPTFSGSTGRRRTTMPGSRGMRQSAAX------
437 >SRR5438045_5489985
438 --------LITRPTSYYLLSlhdalpISLLADVFYSKLFVKNTGLRKMFP--ADL------------QLQRQKLMNMLHFIISNLDQpelFNKEIEGLGLRQD-RKSTRLNSSHLGISYAVFCLKK------------------------------
439 >tr|A0A1E3GPU1|A0A1E3GPU1_9GAMM Bacterial hemoglobin OS=Methylophaga muralis GN=vhb PE=3 SV=1
440 -AKLQEQDIALVEQNFAVLMefSDALAERFYQRLFTEYPEIMPLFKS--V-----------TIEGQHKKLLASMVLLIQHLRDtemIEDYLQGLGARHQ-QYGVETSHFEMFIENWLSVVAEFADQKWDSKLQQAWRNVLEYVAELMQSPT-
441 >SRR3954464_793235
442 --------VDPFRSRFAFGVerEPEVTHRFYDVLFAKYPQVQPLFGR--RSR-----------ADQERMLRDMLVAIVDHVEDppwPQHHPPPPPPNPP-RPAPTP----------------------------------------------
443 >tr|B7QBW9|B7QBW9_IXOSC Beta chain of the tetrameric hemoglobin, putative OS=Ixodes scapularis OX=6945 GN=8038954 PE=3 SV=1
444 -TEMTSQEKHVVRDTWAIFKkeVQTSGVAIFVVLFFKHPAYQKLFVAFAADP-IAELPQNPRAIAHALTVAYAITSIIDTLDEpetSAELVRKVATNHVRHPTISGAQFEHMGQAVVEVLAEKLGSAMNHQAVGSWQKFFAFVVRVSQGVF-
445 >tr|A0A1B6H4C1|A0A1B6H4C1_9HEMI Uncharacterized protein OS=Cuerna arida GN=g.19114 PE=3 SV=1
446 MRRLTEREKENVRLVWKKVedDYPSYGRSVFVKLFDEYPYFKKFFKATIG--NFEDPFMSPRFQKHMLQvLMPTFGGIMDNLDFpeaVNEAVKRLAVSHR-KKELGiaKEHINILGQVIVSVVKRDTL-GCTEEQEEALEKVISIVMAMFC----
447 >SRR5215813_3453690
448 ------------------------------------------------------------------------IASDSEIQVspwtrt--GTLAISARRCS-SSRISSGigsdtTFSLYGNCV------------SSSATIAWNTHGD----IQLDS--
449 >SRR5579859_1863727
450 -------NISSLQLTILNLLtvEDEFVPRFYNNLFNMYPLARSLFVHTe--I------------SLQYNKLRLMLMMIIRTIHDadgLKIQLQQLGQRHK-YYRVEPEHFAILYIVFVQTVVEYLGPKWTAELEAAWAEAYGTIVRMMDME--
451 >Dee2metaT_7_FD_contig_123_47857_length_200_multi_10_in_2_out_1_1 # 3 # 200 # -1 # ID=100007_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.434
452 ----------VLRDREG---lgDPELVVLQRRHLAEHGAILQPLalLARQr--H------------REDLELVRELLLLECDHRVEhprahpaGVGVEGELGVGHH-TERIKRSlspsalLGRWIDLVVVGAVRR---------------HHQGGVVDLRLVE--
453 >SRR5436853_3450426
454 --------PVLLKDSFNLVRseEHTSELQSLRHLVCRLLLEKKKKnkTTTV-----------NYIE---KEKLGKLEA-SCPVEqti-------GIGDKQR-DYQ--QMHHPERTEAQ-----KX-----------------------------
455 >tr|A0A1W2WRJ7|A0A1W2WRJ7_CIOIN cytoglobin-1-like OS=Ciona intestinalis GN=LOC100183004 PE=3 SV=1
456 -MPFTDEELKLLRNSWDEVKklgMKEVGLHIFTGLLNAAPSLRTLFYTI-DLPDEeeltiDVMRENKKVVAHATRIANAISKFIKFLDQpeeLEKLLTSLGESHA-RRQVDPESFEYVAPVILSVIGGHLKLPSNSPTLQAWVKAYGVLRNGIVS---
457 >tr|A0A1W0WQD3|A0A1W0WQD3_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_08524 PE=3 SV=1
458 -TGLKKRERLVVQQTFEAIsKklgRAVLGRDIFYLFFQLHPAYLQLFKALRDIP-PEQLKTHPRLKAHGLNAIQALAAVIENLEDTettVLLLEKTGRDHV-RRKLQSKHFEDFHSTTVALLKRELGPSFTPFVEQSWNKAFTVVNTVIL----
459 >SRR5438034_562795
460 -------AVETLRNSFERVIerSPNLTRRFYEILFEKYPQTRRMFGL--QS-----------GKGKGNGKGAGARQRLRRChcrlhfgkekaTVvpfPLPVPVPLPAFRD-SYX-------------------------------------------------
461 >SRR3954466_4238475
462 --------IRRLTRSYDQILsaGDCLPELMFAQLFDRAPELRTLFPD--DM------------GRVKHQFARMLHWLIAHLHEpqkLRIALVDLGRRHQ-EYGVKPDVYPHLCEALVDAMATICADDWNEELCRDWRQTFDLMVHHMLRAY-
463 >ERR1719359_2370951
464 -------------RLIVTPEhldGCRAGLLALRVVLLHLGEGLGLLG-SDSSGVSdcgVALgeL------------PLQRLDLLGVLLGpr----L---GL--L-NAGVRGLELSLLGRLlrvglselfVAEGLLLGL----------------------------
465 >tr|A0A212ELK8|A0A212ELK8_DANPL Globin 1 (Fragment) OS=Danaus plexippus plexippus GN=KGM_200313A PE=4 SV=1
466 -SGLSRRDVFAVQKSWAIVYanPLANGSELLKSPYISRIL----ILLVDKVS-EI----------------GSIVKAATDVE-------------------------------------------------------------------
467 >ERR1719343_803772
468 -----------------------RAVDCSFDFSRKSPVPRPSLA-SAKKDfngDANSVYDSRKFLDIGKNFIEIVDQAVDMLGPdlqvVAEVLIDLGKKYHNEYDMRPEYYSVLARALIDELEEILGTDkFNTRTKSCWVQVYGAIAADIAA---
469 >EndMetStandDraft_7_1072992.scaffolds.fasta_scaffold3604113_1 # 1 # 288 # 1 # ID=3604113_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.538
470 -NNLTDDQKNVIKKTWITIEenRTKIGKQTFIRVFELNPQIKKMMPEFMTADPIEELNSSRKLFGHSKTLMTCLENAVKSLDDnerFVAYLVELGRRHQ-VRPLKAPYFEVIHEALMFSLKDVFQSDWTTETSESWSALFRYMSEAMIIGL-
471 >tr|A0A136A626|A0A136A626_9ALTE Uncharacterized protein OS=Paraglaciecola sp. S66 GN=AX660_04410 PE=3 SV=1
472 -MILTVEEKSAIKESFAVLLRenANVAECFYNNLFELAPLIKPLFKS--GR------------ENIENHFHELIGTAVNKIDHfndLRADLIALGKRHK-IYGAQQAHFAVVKAAFILSIQYKLKGQCSPFLENSWAKYIDNISSVMIEGL-
473 >ERR1719461_1916292
474 ------------------------NV-SLFSLFAADPGVQtKYFGHMK---------TDADLEKHGVRVMNSIGAMVRAILDqdddrLITKVHEITRNHQ-PRGINRPLLEFFLSVVLDYLAKALDSHLSKEGGA------------------
475 >ERR1712179_865199
476 ---------------------------------------QrKHFPHMM---------NssigksltKSKLKIHGGRVIREISVMVDCVQAgndeaLMAKIKEITVNHG-VmRDImSIEAYRLVLDGLVAFLGSALGDSLNETGHHAWKKLVNNIITGID----
477 >SRR6266699_3297184
478 ----ALARGSLATPCFRSHRAqhFQARMpykPVGSLEAARQHAREGLFRS--DME------------RQYFKLMDMIAAIVGTLDKremFQSIISHSGRQHA-QFGAKPLHFAAFGDALIWGLEQQFGAAFTPEMKEAWIKLYDDVQREMMR---
479 >ERR1719271_149007
480 --AVSARERRLIERTWEKAKedgCDALGANLLQTLLVAEPQVMQLFP-FKDE---ENVYESLRFKAHASKLAVIIDAAVSLLANpvkLESLLISVATSYEYsFKQMLPEHFPLLGEALIRTLTSIVGgTKFTWQAESAWRKVWTIISTVMIGA--
481 >ERR1719203_2782565
482 ---------ITSKFGWTSNmq--------------KIIQSQTHSKT-QDMQ---RDYYLNQK-KTLEI---------------nvRHPLMKELLRRVE-----DNPEDKVAKdMATMMFNTATLRSGFSLKDTVNFAESIELMMRQTLG---
483 >SRR4029078_13512293
484 ---------------------vKRVAAELfYVKLFELDSTLKLLLA--D-Q------------QVREQKFMQIVDATVNGLEHsegMMSAVRELGIRHP-LFGDSDEHHGPVATSLFWSLKKCLRKDFSGEECPRAVGGHALC---------
485 >tr|A0A147B4Z8|A0A147B4Z8_FUNHE Neuroglobin (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
486 MGELSVKDKELIRGSWESLgkNKVPHGVIMFSRLFELDPALLSLFHYSTKCDSKQDCLSSPEFLDHVTKVMLVIDAAVSHLDDlhsLEEFLLNLGRKHQ-AVGVSTQSFTEVGESLLYMLQCSLGQAYTAPLRQAWLNMYSIVVAVMSRGW-
487 >SRR5262245_48005872
488 ----VSMHTSPLRASVELVEqrRSEAVRYFYAHLFAGHPELRTVFPI--SA------------VEEHDRLFTALLYVVKNVHAlpmLAAELQQVGRDHR-KFALSAEHYQVVGASFLATGAAILAEAWTSEIGSGWQSAYRMAASVMSD---
489 >tr|R7WMM5|R7WMM5_9NOCA Flavohemoprotein OS=Rhodococcus rhodnii LMG 5362 GN=Rrhod_2088 PE=3 SV=1
490 --IFDDRTLRRVRATYKDMAArpdwdSHLAQSFYANLFAENPQLRLLFPA--NL------------EAQTHRMLTAIRYVLDNVEQpdrMLTFLGQLGRDHR-KYGVAREHYEAGGRALLQSLRGSLVtLLWTPTVDAAWSEVVGTIVGTMAD---
491 >SRR5258708_3005780
492 --EPTPTDITIVSDSLAPLTkeqVDNVLAAFYHQLFTRQPSLRQLFKSFRSGDQ----PDQQAMKLQRNKLAEIIALGLKLWEKphqLIPALEKLGRQHH-QYGVRDEYYEDVWIALSEVLSEAFGLDRWEDICESWQRFIFLCARHMLNG--
493 >ERR1719347_1330150
494 YFCLSESNIKALKSCHPHLkdRKEEFGHLFYSNLFSNHPDLKSLFDQ-TEE----------GRQLQAQRLADTVVAFLEKCDDlpsLLPTFKKIGKRHT-TKGVKPEMYQIIIDNLVDTLEEMLGKeVFSAEVKQEVLESISFLSNAFIK---
495 >ERR1719284_1036555
496 ----------DVSASLDLVKrlpnYeQVVGVRLYQKVLAAGPQYVKMFP-SVASsltssNDPEEFLKDPVLLKHLTSYIRMICMAVDLLGPdtelFEEQVRELGAKHS-EYGVSQRYYVVMGKALIQTLEELLGDRFTPSTKQAWEKMYDLMSSTMIKG--
497 >SRR3974390_2763688
498 --XMSPETKELLETTWAKVIpiSDVAAGLFYERLFTLDPSLHRLFEN-------------ADMKEQRRKLVQALHAVIYSVDDlpsLIPTLEILGRNHV-RWGGIGGTPRDLGGQSHPEAVGRI-----PNIR---IVAVAvGRPDIMLV---
499 >APLak6261669570_1056073.scaffolds.fasta_scaffold275140_1 # 52 # 198 # 1 # ID=275140_1;partial=01;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.524
500 ---WSTRRVKVVQRSWETFKstqaeSTTVGLAVFKRFLRRSPAFLQLFP-FRDQP-LETLFLNAKVRLHCKLFADTVSRTVGLLGDsvaVKASLRELGARHSDLYKVRSGHYAAMGSALLEVLEHNLGESWDEETKTAWEETWAYITEQMQKG--
501 >ERR1035437_6084348
502 -SSLDQEMIAIVQVSWENVTPDsrLAASMLAMNLCADDRNIASLFEE--DR------------IKMSRDVMQAVSCIVADLDQpetLVPYFGSLGQLLR-RHGLHESGQQTFATALFLTLGQLLGPRYGPVEHNAWAIAYSFVVRIMIAE--
503 >ERR1035437_3078414
504 -SSLDQEMIAIVQVSWENITPNsrLAASMLAMNLCADDLNIASLFEE--DR------------IKMSREVMQTISSIVAGLDQpetLVPYLGSLGKLIR-RHVLHESGQQTFATAFFLPLGQLLGPLYAPVEHNAGAIPX------------
505 >ERR550534_521252
506 -TSFKPNEIMEMRVMWNGWvggDMASRGFEMFCKMFEMHPETKDVFA-FMKGSSVAQMQSSSKVLFHVTRVMKYIDEVMRHADRLdevVPILRQVGGRHGTqGYNIQSGYFPFLGNALRQLLKDHFKTRYTAVLDGHFQKMWGFIVKQMQAG--
507 >ERR1712105_94955
508 -TEFKPNEIMDMRVMWNGWvsgDLASKGFEMFCKMFEMHPETKNVFA-FMKGSSVAQMQSSAKVLFHVTRVMKYIDEVVKHADKLdevVPIMRQVGGRHGThGYNIQSGYFPHLGEAQRLLLKDFFKDRYTANMDAIFKKLWVFIVKQMQAG--
509 >ERR1719483_559503
510 EGPLLAKDVKAIEESFAMVAalgsAKELGIGFFRLLFTTYPEWLEkYFvPNFGDKP-LEEFLMIPRFEVHAPGVIVELSKWVGSLHDldsLVAAIQENARNHY-RRGLNVDHYKKIAGVLLSYISAGLGDSLTTQMETAWTKFLDTMVNVVEEEM-
511 >tr|A0A195EH31|A0A195EH31_9HYME Cytoglobin-2 OS=Trachymyrmex cornetzi GN=ALC57_03526 PE=3 SV=1
512 -LGLTEKQKKLVQNTWAIVRkdEVSVGVALVIAFFKQYPESQKEFKSFKDVP-LDELPKNKRFQAHCINIVATLGKVIEQMHDpelMEASLINFTEKHK-ARGQTPEQFENLKQVILAAFPSLFGKQYTSEVQEAWKKTLDLIFSRICQ---
513 >tr|A0A158NI97|A0A158NI97_ATTCE Uncharacterized protein OS=Atta cephalotes GN=105620364 PE=4 SV=1
514 -----------------------------------------------------------------MNIT--NGTIHDILSGgkNTQKV--FL--FR-HRGRTKEVVEKEEKIRVAGLDtngshradCPKGTDEGREIGDPVTDSLLQMLQKKEK---
515 >SRR5690606_21296714
516 ----lmEWERVKLVQESWSSITpL-gaKFTQVFYRKLFDEHPAVVGLFPE--SM------------AEQEQLLSRMINPAISCLPAesvFENMMHKLGNRHS-EYGINEKHYRMFTQSLLETIRESLAERWTDELESAWAEVLSGMSRRMN----
517 >GraSoiStandDraft_11_1057310.scaffolds.fasta_scaffold26797_1 # 22 # 990 # 1 # ID=26797_1;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.733
518 --VIvTDSDISGCFSCWQTVVdGkapayiEdsdpnkpsglvWFSNVFYGRLFDVNPEAKKLFRD--NN------------ETKARALGNIISTGLRQIWDranFSKILHGIAVSHC-KLGVKAIQYGLVGDVLLWSFAYTMKNMWDQDLRTSWIAV-------------
519 >SRR5690606_23735845
520 -TSFVSLNANVLQRSFEFLApqSDRLAKRVFEKLLKDYPQYRPLFAKV-EI------------VDLRQRLIQSLALVVKSAQRpetMVRYLSELGIRHA-EYGITDNDYRPFTSVLLGVLAEFSGARWTPEVKTAWEEVX------------
521 >SRR5215469_11104805
522 --TGVAEQHLLDLGGVDVLP--APDDHVFDPA--GDPQVaaviedAQVAGV--QP------------AVWIDGFRGAFGHVEVAEHGLvaarADFPG-LAGRHG-FPSDRV----------------------ADGDLYL-----------------
523 >tr|A0A2T7P4Q7|A0A2T7P4Q7_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_10992 PE=3 SV=1
524 -PSLTADIRRVVQQSWYRLvehrSLDQLGIPVFLEIFHLTPAAKKLFH-Y-SeKTTIEELEGDRRLREHATRFMNAVGAVVDNLDKknsddLDVMLREMGADHTNISTFNQVYCVIFREALLSVWERNLGKaRFRGELKNAWRALITYMMEVMREGYD
525 >SRR5438128_5040868
526 --------------------------------------------------------------------------------------------EY-RWAEGSSelaaEFVRLNVDVIV-----TGRLPAVAAKQADIRHSDCVRDSCGP---
527 >WetSurSiteA1Bulk_404760.scaffolds.fasta_scaffold823987_1 # 3 # 239 # -1 # ID=823987_1;partial=10;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.409
528 ----------------------------------------------------------------------------------MPN--------------------DSDSCHSVDNSAILHAVLDSAVDGIISIDESGTMESVNA---
529 >ERR1711918_283694
530 -----------------------------------------------------------GSECSWMCRC---GIARFEQT----RTTSHKSRRA-TYRvqPDRGILAHPGESCDDHFGGAPWGGLHPEVENAWNVVYGFPSSIMISGPR
531 >SRR5262245_16285966
532 ---------XMVEGTLDAVSLPALSADFYRRAFDTDPELARMFTA--DR------------RVQEARFATELAAIVRSIRchdEFVPAGRALGPVPR-L-RRDGRPLPRDGRRPAGIagrcprsdvearGGRGMAPRLQPDRRDDAERRPRAGQLGVTSG--
533 >ERR1712061_521749
534 ---PVGHMKTAVEQSWERVQalgPVVIGAQEHRDVAVVSRTTST---TSTRI-EESDATAAGSLANPF----------------------------------------------------------------------------------
535 >tr|X6EW29|X6EW29_9RHIZ Adenylate cyclase OS=Mesorhizobium sp. LNHC209A00 GN=X738_26865 PE=3 SV=1
536 --------FALAQRSVGLLLddPSAFAAQFYANMFAIQPELEGLFVN-G-T------------GAQGAMLSHMLRTVVSGLERRkhvPAGLQTMGRKHI-GYGVELDHYDSFRGAMLKTIDDIMGAGLTREIEESWSETLDVILGLMKKG--
537 >SRR5215471_14715706
538 --------PAGGPALARLLRr-------HLRRV--VSSRLAPLFLR-LAF------------NDAISYDPATGSGGANGSIRLpeeLARKEVAGLARA-V------------------------ERLRPVKE-------------------
539 >SRR5205085_9494957
540 --------PASGPALSRLLRrhLRCVVTsraapLFLRLAFNDAISFNPATRA-GGC------------NGSirlaeelEREEIQVLSQGIEQLRPLkerFP-HVS-----------------------------------------------------------
541 >SRR5947207_2391870
542 --IISNRQARRTNDRLQIELaaAQARIGLLYFAQHDRTRAAA---------------------------------ALLEGPDAFdqqRPALRAMGLRHV-AYGVVPAHYDTLATAFLWPLGHRLSPEFSPX---------------------
543 >tr|N1VSG6|N1VSG6_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 GN=LEP1
544 -----PDPILEIQKSFDHVLeyNPHWIDSYIDKLKNFSMenvTENQREGDN-ES------------PISSEEFLNSIESIIEKLGNpisVKKEVSKLANIYE-SLGITKKEFPKLLPILLSSLRENLPSEWNPSLESIWTQAITDLTIETIES--
545 >tr|R8ZTT5|R8ZTT5_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira yanagawae serovar Saopaulo str. Sao Paulo = ATCC 700523 GN=L
546 -----KDQILELQRSLELALqlNPNLARDFYIHFLETKPEFQKFFQNT-DM------------ETQAKKLLAMFGKTIERLGNlnqIQIELQNLGKMHE-EMGIPVTDFGAIAPSLLYALEKSLGDQWNAEWKSIWETALGSLVRLMGMK--
547 >SRR6478609_9341681
548 -------DAELLETSLALVDTpdASLDSRFCALLHERHPAVHPGGGD--TA------------ARQAKLLRSAVISVVDHLDDpvwLTETLGDGTARPS-GWQVAPEMCGAVSECMVAAMVEIGGARWTSQMTDAWVEALDAVSGPMLLGS-
549 >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold1207366_1 # 2 # 214 # -1 # ID=1207366_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.286
550 -----YASHQSQAASLAKAAprPRVAVLGLrlpsgeSPQLARLGRAFAELLG--AEL------------AAGERLLVLPAeRVehMKLELGLdeaEAYPLPTLGRIHR-NLGPDLVVVGTlapqeprgtlsvtveVKDCLTGAVTATAKVTGPAAELFTLASQvggelrrrlgssalsgneraelraqrpaSPEVAQLYADG--
551 >tr|V4A611|V4A611_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233216 PE=3 SV=1
552 -IGFTETQIDTIRSTWPLLSrnMVRVGTDVFVRIFTEVPTVKELFSSF-NIVDVNDLHKMPTFRAHAEMFMQVLHLVVDNLETpyseLNHELMVLGARHATFSGFKPEYFKFYVKCLIQVWELELGEEFILEVRDCWKIVFDFLVDNMTEGYE
553 >SRR6266542_3322184
554 MTVMTPEQIEAVEATTAVLapALDDLAADVYARLDRLAPETAELFTG--GPA------------AEVRGRARDDRARHPAPRRLpGacl--------------PARPPARALRGQA------GALRARRC-----------------------
555 >SRR5918994_1217714
556 ------RDiEAYVRT------gRAA------VPVFESDVLLEDCVTS--AA------------NNDWcgVSTRPRNEVWPGFKVGlerAVPVLEQLGRDHR-RFGAVTAHYDAVGASLLATLRHFFGPAWTPELHQTWSEAYGPVAKVMVTA--
557 >SRR5207302_4688282
558 --VVTLEQFRLIQHSWKLVKdGqfaaftaqtliadplGFWGLQLYDTLFALNPSLKPMFKN--TF-------------TQSQMLTEMVGAALGllpgildqalgeektAIDPqLIPILVDLAERHV-SYNVKAAHYGTVGLGLVTTLERTLGSHFDEQKQATCFELWSMMX--------
559 >SRR5437867_13093015
560 ---------------------nqnpsPLWRA---------------------RL-------------PR-------VSIAFGlrwfNCnTSkSYSRKCSTNLLNV-GYNVKAEHYGTVGLGLVTTSERTLGSHFDAQTKAAWVELWSLICTVMIP---
561 >SRR5882757_3847967
562 ----------------------TSI--------------WPIIIN--TaV------------GirnipQDYRNVARVLRLnqFEF-FTKimvpaAAPYIFTGL---------------RIGIGLSWLAI--------------VAA--------------
563 >ERR1700737_3002051
564 ----------------------RDF--------------HHLDLA--DhH------------Q---------HRVagTQW-AN-gsMSNAVWTGV---------------RLKDVLDRAGV--------------KSGAI------------
565 >SRR3954451_23003713
566 ----------------------LKS------------TTGEVFLE--G--------------klv-DE-------PGpdRAI-VFQnhsLLPWLTVYG---------------NVAIATDKVFGGSGARSKSKAERHDWVMHNLELVQM---A--
567 >SRR5206468_1650083
568 ----------------------TNA------------TMGCVLLE--N--------------rev-NS-------PGaaRRR-QGVcerQDPQRAQRMGDAqpqpradgacqgqA-PG-GDFRRYEAARRHCPRAGHATKSAAARRAVRRAGRADPRAPAGL------
569 >SRR5258705_633045
570 ----------------------TSE------------DAGPVALG--N--------------qev-KQ-------PRtqPPV-VFLdpaLPPRPPALD---------------HWLLRAARDAGGP------QPQ--------------------
571 >SRR5690606_21133184
572 ----------------------INP------------LHGAVRLN--D--------------aap-RV-------GDpeVGY-LLArdaLLPWRTALR---------------NVTLPLEV---RGI----ERREREQSARKVLRDVGL---E--
573 >ERR1700682_1967427
574 ----------------------DRA------------SAGRVVVD--G--------------sev-RG-------PSldRGV-VFQspaLLPWLSALK---------------NVAFAVRSRWPRW-----SDEQVVSHAQKYLDMVHL---T--
575 >SRR5699024_2544359
576 ----------------------LSPSSGKIIVAFSSPTSGKIMMD--V--------------ndwtSYKDSEMTALRLkeIGF-IFQeshLLPYLKIRE---------------QLEFVGREAGMDK-------KHARKRAKEILDLFGL---D--
577 >SRR3954447_21976298
578 ----------------------RAA------------TGGVVRWS--V--------------dplvAAG-----GRARhpLSM-VFQkdtVLPWRTVAQ---------------NVGLFYALN---RD----RRAGAEGVVDDLIRLAGL---E--
579 >SRR6266567_262474
580 --SMTPEQIDLVRKSFDALWpfRRKLADQFYGRFFELAPDTRRLFPN--DME------------RQQLKLMDTIAAIVGTLDQreiFQSIISLTGRKHA-DFGVQTSHFACCFYPKSLEAPAHAGGFLCSSpLNVSWNGARARPYPLMHL---
581 >OM-RGC.v1.004444255 TARA_034_DCM_0.22-1.6_scaffold509117_1_gene597562 NOG05352 ""
582 --PfLQPTKFELVVNLKTA----------------------KALGL--EVP------------PTLLARADEVAGVGGSAKRishWPPR------------------------------------------QSRWAGLPRRPERH------
583 >ERR1719401_1263416
584 ----------NVLTSWNTLKskpnyCDETAALIFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCTVSMLGpdlfELSGVLHEMGRRHQ-RNGVDRSYLPYMSEALFHALAKMLGPQFTEDDKEAWKGVMDYMISEMVIG--
585 >ERR1719401_232394
586 ----------NVLTSWNTLKskpnyCEETATLVFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCIVSMLGpdlfELSGVLHEMGRRHQ-SNGVDPSYLPYMSEAFVCALSKMLGPQFTEDDKEAWEVVMDYMISEMLIG--
587 >ERR1711862_565156
588 ---------------------------------------KIMFH-FPVNMNIETVLKSKIFLQHAKFFVKTLDITIGLLGpdtdIIQDVLLEHSKTYQ-NHGVNSAMYLHMGESILYALEKDLGDvNFTSKDREAWAYFYGTIVGVIVGG--
589 >GraSoiStandDraft_1057264.scaffolds.fasta_scaffold343999_2 # 425 # 754 # -1 # ID=343999_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
590 ---RRRMDAELLETSLALVDtPdDGLTKRFYALLFERYPAVRPVFPEEmhRDI------------ARQAKMLRSAIISVVDHLDDpvwLTETLGELGARHA-GWGVLAEMYDAVTECMVAAMAEIGGDDWTPYMTDAWTEALDAVSGLMLLGYP
591 >ERR1044072_5206314
592 ---MAPPQIAVARSTGPKVSPmqQRLAQVFYERLFELDPTTRAFFGG-------------VDLRHHGLKLTETLSAGIEVLGRdgpAPRGS-----------GSGMAALRDGGGCVVHGAGVLPGPRVHDRSPGGLVGGVLG----------
593 >ERR1719389_1465843
594 ----RCNRKLGGSAKEEKLRrndgtrfvCKI---FKISRFLKQQPDASAVFG-F-DNN-DEDVHKTPKFIDFANHFVEVIDQAVQMLGPdfelLTDFFVDLGDKHSKEYGIKPKFYPILGRVFM-----------------------------------
595 >tr|Q17153|Q17153_9BIVA Hemoglobin (2 domain) OS=Barbatia lima GN=hemoglobin PE=2 SV=1
596 ----QPANKGLIRETWNIVAGdRKNGVELMALLFEMAPDSKKEFRRLGDVSPA-NIPNNRKLNGHGITLWYALANFVDQLDNktdLEDVCRKFAVNHV-LRGVLDVKFAWIKEPLAELLKRKCGQRCTEKHVKAWWKLIDVVCAVLEEH--
597 >tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1
598 ----KPANKGLIRETWNMIAGdRKNGVELMALLFEMAPDSKKDFRRLGDVSPS-NIPNNRKLNGHGITLWYALMNFVDQLDNkidLEDVCRKFAVNHV-NRGVLDVKFAWIKEPLAELLRRKCGQTCTDQHIQAWWKLIDVVCAVLEEK--
599 >SRR5262245_28144535
600 --CVTEEQIARVRACFDELTPrtPEVVDRFLARFFAQNAPLRALFP--RDLS------------ALKQDFAAGFRHVVRHLHRldtIAPMLMDLGSRQA-RAGLTPGHFGMAREVLLTTLRDVAGPRWNEQLRQDWTEALNTVVSLMVVGA-
601 >ERR1039457_5537378
602 ---AGPLNPALIRKSLALITagPPRGAGGFSRALFSFDPGVGGLVPA--G------------DERAER----APVRR-------------AGPDRR-AAX-------------------------------------------------
603 >ERR1719498_600299
604 --------INCVQHAWNVlIIEDRsreflraqesatfvyssciswFYSVFYSRLFNVHPLFRPRLNS--KG------------SKSGKSLVMMIATTINGLRDkdmFQRVVTEMAKNLC-SSGVKPVEYGILG---------------------------------------
605 >tr|A0A2H8TS68|A0A2H8TS68_9HEMI Neuroglobin (Fragment) OS=Melanaphis sacchari OX=742174 GN=ngb_3 PE=3 SV=1
606 --YLNKSQTALVKQSWPMITSNNFWTTFYINLFKRNPLYQLQFDRFANVP-FEELESNVHFLAHSFRTGFAFNTAIEHLEKpdeLHRILMDLGEKHR-KFRLTAEHFEAVKDILLCMIEDRIVLTdvpaRNILLVEAWKPCITLVIGVIM----
607 >SRR5215469_6657410
608 ---------RLCPVSQSQMSSvvGatTSaaHRITMSPIWVSpCYSFTWLAI--NRY------------TWDRFGLMTMIQTAVENMHQldqILPAVRDLGRRHA-GYGVKAADYNTVAGALLGTLEQALGSEFTSAVRNAWIAYYQTLAGEMKA---
609 >UPI00001F6528 status=active
610 ---AIIDGLRDLSESFDTLaadeaatApaATELKaavegqfsgvfGAEYAKQTGKQPDTASYTLE---------------------HSAAALAQYHYIVRNphpLGQknKLDKV-AGEA-RYHALHARYHTMLNAYLERFGyydvflidldgdvvysvfkemdyatNLKTGPWRDSgLGRVFRSALESNDtkSTFFDDFA
611 >ERR1712100_346632
612 ---------------------LFFFFFFFFFFFFFFFFFFFFFS-FKNV---EDLYESPMLKAHGKAVVGAVDAAVHLLDDvskLIPILEELEQFHN-RKKIVAAHYDVVGQAVVNVIGSALNG-LSEEQTNAWVKVYLTIKSVMLA---
613 >ERR550532_3561775
614 ---------------------GDSSVSPSGELCSPKTKTPRICSTVLE-----LTMHSADFQAHSGRVFGGLDTVISCLDDeatLVAELAHLKGQHDER-NIPDAYYRHFYQALEKVMNAMLGPCFNY---EAWDACGDIVFHGITGH--
615 >tr|A0A1I3HEN0|A0A1I3HEN0_9RHOB Nitric oxide dioxygenase OS=Jannaschia pohangensis GN=SAMN04488095_0565 PE=3 SV=1
616 --LVTNTQARLLSRSLRRISenGAPLARSFYAELFSAHPEVRPMFHS--DLS------------TQYAKFEDMLVVLVADVLNpgvILRPLQDLAKRHV-EYGVTREMYPIVGDIMMRTLRTLDAAPLTGDELEAWDVLLGRVNAFLMDE--
617 >tr|A0A1Q3FVI8|A0A1Q3FVI8_CULTA Putative globin 1 OS=Culex tarsalis OX=7177 PE=3 SV=1
618 -TGLTNHQKVALIGAWSLVkkDIISHGRNIFVRFFEENPKYLNYFD-FSQDRTASEIGENKSLHAHALNVMHFIGTLIDyGLYNpamFKCSLSKLMKNHL-KRGVKKEDVTIVCGVIMKYCLEVLDQHQSTTLQVAFASLMKGIADAFD----
619 >tr|A0A2M4DSC8|A0A2M4DSC8_ANODA Uncharacterized protein OS=Anopheles darlingi OX=43151 PE=3 SV=1
620 --------------MWCKPthQNpegSSDYISICVRLFQKYPHYTDYFD-FTDDTKADSLVDNKSLFAQSIHIVKAFGSLIEyGLKDprlFHETLKRIARWHE-QRNVYGCDVLLIGEVMLTYLTQTLGRQTPAMLGEAFQKLFQTISYRFP----
621 >tr|A0A0N8DLE0|A0A0N8DLE0_9CRUS Hemoglobin subunit theta-1 (Fragment) OS=Daphnia magna PE=3 SV=1
622 -LPLNARQKYSMLASWKGISraLEPTGVYMFIKLFEEHKELLSLFTKFHQLTTRDEQANSEELAEHASSVMSTLDESIRSLDNVDtflLYLHQVGQSHYKVEGFQKEYFWKIRNPFLEAVKMTLGDRYTENIENIYKVSINLVIETLVEGYE
623 >ERR1719383_1265545
624 -------------HSWKEVGqapADEVAREIFRNIFAIEPGALELFP-FKNES-EDDLwREGGALTVHALKVVSTIDKAVSRLGNmdaVVPMLRKLGIMHV-GPRPQHLGNG-----APMSLP--------RRPTASWRRG-------------
625 >ERR1719383_514948
626 ----------------------------------------------RGRL-VEGRwRFDSARVKSCVddrqGCVETWQHGRRR-----SNAPQVGNHAR-GLRCAQAHYDVVGQALVTTLASY--CTFTDPVKNAWIKLCGVIKATMVH---
627 >ERR1712000_66502
628 --------FPKVQKSWARVLeieakdeSKSFGPIFYNTLFTDFPFLKEqdFKSA--TM------------AEQKMNLPKFITTALSLLGDmpkAVDALQRLGMRHV-LYGTKDAYYPVVGANIIKTLKQILPANEFDQEtQEEWLTLYGVMQKTMIDA--
629 >SRR5258708_4037766
630 --------PGAVGPAPGLQPprNRPGARRGQPALMQSPSAGGPPPGP-HrpRR------------THRTPPRRAALVLLRRSLRDldeVVPGLRAMGARHV-RYGARPEHYPVVGAVLIDSMAEVAWDAWRPAYGRAWAAAFDVVSGAMLAG--
631 >tr|A0A1Y3AX51|A0A1Y3AX51_EURMA Globin-like protein (Fragment) OS=Euroglyphus maynei GN=BLA29_013533 PE=3 SV=1
632 ----------------------------------------QKFKSFKDIPINfqqnHLIRIDKKLIAHGTYVMYTIGMLVDNLERpdmMRQMLKRLSRNHY-RRRISLKAFERLRDTLLEHLSDILGKEiFHRKTMIAWHKAFGYLLKEIESN--
633 >SRR5688572_8260099
634 -----DQEINIVRQTWNRLAaehGNSVAEEFYKRLFECCPHLKDVFKN--DF------------EVHGKEFIENMDHIIIQLDNpcMIREMQILGIKYA-SYGIRYEDYECMKKALFDALKTKLAEHWTPTVMVSWIWFYSTVSHIMKH---
635 >tr|F2Q9X2|F2Q9X2_BRAFL Globin OS=Branchiostoma floridae GN=lGb7 PE=2 SV=1
636 -MSLSAADKKLVQESWDKVSkpsFADAGERVFLKLFRRNESTKAHFKKFKDIPS-DQLAGQAVVRDHGEKVCKVLDDFIKGLDGsGDEAVKKVGRMHK-GLGMSNEQIDQMKGAIIEVLADAgFGD---ANYKGAWGKLWDRFMAVHRA---
637 >tr|A0A1B0G6S0|A0A1B0G6S0_GLOMM Hemoglobin-like flavoprotein OS=Glossina morsitans morsitans PE=3 SV=1
638 YSTMNSDEVYEIKRTWEIPatTPTESGVAILIRFFTKYPSNLQKFSTFKDMTL-DELKNNPRFKAHANRIMKVFDDSIKTLDDncshLEEIWTKIAQSHF-NRQIEKQSFNELKEVILEVLVAACN--LNDQQTEIWLKLLDFVYEIIFKT--
639 >tr|V5YM54|V5YM54_9DIPT Globin OS=Polypedilum nubifer GN=PnHb18 PE=2 SV=1
640 IVALTEADVEIIKRTWKIPsaNPHDSAALIFSTFLEKYPHNQQKFPAFKDKPL-SDIKNTVEFRAHASRIFNVFSSVIDGLDRdtemmkgIKKIIAEVGKFHA-KKKVTKKAHNEVRSVLVDILIEVCK--LSDEEKAAWTKLLDIFFHVMFEC--
641 >tr|O96457|O96457_9MUSC Hemoglobin OS=Gasterophilus intestinalis GN=glob1 PE=1 SV=1
642 ---MNSEEVNDIKRTWEVVaaKMTEAGVEMLKRYFKKYPHNLNHFPWFKEIPF-DDLPENARFKTHGTRILRQVDEGVKALSVdfgdkkFDDVWKKLAQTHH-EKKVERRSYNELKDIIIEVVCSCVK--LNEKQVHAYHKFFDRAYDIAFAE--
643 >SRR4051794_9566520
644 ---------------KALVEdvAERghrrPMEVFYGARsdhdlydidtmlrmAQSHPWLS-VRPV--VA------------TGpaggPMNSLSGQLPDAVRQYGPwreYDAYLSGPPGMIR--NGVD----ALVGVGV---PSDRIRHDSVEELVAAGDX--------------
645 >SRR5215470_9890699
646 -----DFDRGPIRELLKHLAvePDAAMEYLFARLFAAHPDLRGLFPY--GM------------TQTRAAVFGELAAIIGGLDDqerTEQTLARLALGHR-KFGVKDKHYEPFFDAMFVTAQHAAGAAWTGEMAASWRSALDWFGSVMAA---
647 >SRR5262249_54331370
648 --IRLRK-------EIDNEWllIASgVLSVIFGLILVAQPGTGALA---------------------LLYVIGIYAILYGILGPrpcCV----------N-RFGAQTALDRG-----------------TSTYRELWNIS----VARLIG---
649 >SRR4029079_9820506
650 -VRVDGILVEGLQASLATMQpaAAQIAHGFYTLLFARRPDFRAMFP--EDM------------AAQERKLIATLAFVCEHWRKpaaVSVRLADLGALHQ-GLHVKPEHYPIVCDALVTAVMKHRHEALGPHRAR------------------
651 >ERR1719310_1734953
652 ----SASSVKAVQASWAKAEnigLRVVGELFFKELFEASPAAKELFTA--Q-KFGEDAAGQRRFKAHTLNVMQTLSAAVYGLSDlsaLARTLPAPTYAIL-SLSFTLISFTSL--------------SLTPLI--------------------
653 >ERR1712087_347811
654 --------------------------------------HEELFTA--QKKFGEDAAGKAHFKAHTLNVMQTLAAAVYGLSDlsaLARTLPARIYAIL-SLSFTLITFTSLSLTPLIYHTLTLKGARARNSGRaaPWIRRPT-----------
655 >tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii GN=F775_23753 PE=3 SV=1
656 -MAFSEAQEELVLRSWKAMkpDSESIALKFFLRIFEIAPAAKPMFPFLRDAGEDAPLESHPKLKAHAVTVFVMACESATQLRktgDvkvREATLRRLGATHV-RAGVADAHFEVVKTALLDTIEGAVPEMWTPEMKAAWEEAYDQLAAAIKEEM-
657 >SRR5262245_14739337
658 --PCARARLRPR-------RpaL------Y-AQALPPRRLVPRPVRE--L------------AEAQSRKFMAGLKLGIIALNyedGLTPVIRLVGVRNR-RAGIKVRHHRVMAKALLPTLEQSLETRFTRDTKHAWSSFLTQVTRILSG---
659 >SRR6266699_2273235
660 --FFLPFKE-LTEQHFSILGlrkARRAGLVLAQELFEHAPNVGARHSN--AF------------GGRGYCRRMR---------PRtap------VCDSAR-CWAPSCRRQ---APLALR-------------------------SCRPVR---
661 >tr|A0A084QEN9|A0A084QEN9_STAC4 Uncharacterized protein OS=Stachybotrys chlorohalonata (strain IBT 40285) GN=S40285_06080 PE=4 SV=1
662 ------------------------------------------------------------MEKYPRIDIRSPAGVSIIYKDvssLDPAQEEIRVLHL-HGG---PEDSPIECTLHKiALKSNPPPVYE-ALSYTWGDAsvtreIVL-NGHVVS---
663 >ERR1712224_896978
664 -GCLSHRQSTLIRGSLPMLraQGETITSSFYASLLSAHPELHNIFNS-AN----------QATGRQPRALLNIILAFAAAPNHtaeLIPRLERVCQKHC-SLGIRLTSTTSSASTSS---GPLARSS-------------------------
665 >tr|L8LYK6|L8LYK6_9CYAN Hemoglobin-like flavoprotein OS=Xenococcus sp. PCC 7305 GN=Xen7305DRAFT_00009490 PE=4 SV=1
666 ----MSLQIGLLEQSFNCIRPyGkLFVSSFHENLFQTNPEIKSLFMGV-E------------SQIQKNRIWDTLVLIMENIrhpNLLNNTLQGLGARLF-THGLLPKHYPLVKKAFLATFKQFLGNEWNSELEQAWKNAYTYFHDLMQEG--
667 >ERR1022692_2453048
668 --------XMSLPASFTSICngilGREE--------NSGCPAAKGQFLP--DR------------DAWrRssaLLLFGPLHQASRSTGYvshLHegaArppgrRispDRRPGRQAG-RSGRLRAGPRAGPPQVRGHRRALRRGRRQPAGDTGAFRGRHLDARVMIE---
669 >tr|S0BCU7|S0BCU7_LAMSA Extracellular globin OS=Lamellibrachia satsuma OX=104711 GN=v2hb-B2 PE=1 SV=1
670 ---CTTEDRREMQLMWANVWsaqftgrRLAIAQAVFKDLFAHVPDAVGLFDRV-HGT----EIDSSEFKAHCIRVVNGLDSAIGLLSDpstLNEQLSHLATQHQERAGVTKGGFSAIAQSFLRVMPQV-ASCFNP---DAWSRCFNRITNGMTEG--
671 >tr|A0A1Y1ILY9|A0A1Y1ILY9_KLENI Cytochrome b5 isoform OS=Klebsormidium nitens GN=KFL_008610010 PE=3 SV=1
672 -PHLTTSDVKLVQESWAKVVeahGVGAVTLFYVNLFTLAPHLESLFKKTKN--------------IQEAMFTDMMMTLVGKLHDwewVVSALEASAIRHL-RYGVSVSMFPAVGQALLQTLDMGLGVHWTPEVKAAWIKLWTAIVSVMSVHL-
673 >SRR5579875_3194573
674 -------------------------------------------------------------SRCCSRATPSYGRCSRSRCrgpgrrsAtgsPSSSATCRRPGAR-RSCSRRWPGITAGSASvtgtTGRSSRRSGPAWTAELDAAWLAATDWFVSVLAA---
675 >tr|A0A0L8P0I1|A0A0L8P0I1_KITAU Flavohemoprotein OS=Kitasatospora aureofaciens GN=ADK78_37645 PE=4 SV=1
676 ----GAADQRVITEYLELVTpfGE-LITHLYETMFRRWPYLRSLFPE--SM------------EFQRAHLARAFWYLIENLHRpddIAEVFGRLGRDHR-KLGVRPVHFQAFEAALCEALRRTAGPRWADAVEQAWVRMLRFAVAAMVS---
677 >SRR5688572_1436081
678 --RPAPEVIAAVSASCQAVAdrPVRLAEAFYEHLFEIAPQARTMFP--ADMT------------AQMQRMSDTLVGAIAQLEKfdtaqLEAALRRLGADHRTRHGVEAEQYRYVGHALTRAVRDVAGLAYSGALSSAWIAVYQYIEAHMSAG--
679 >SRR5947208_57978
680 --EMTPEQIALVQHSIEVLGprVDTVVERFYQHLFEIDPSVVELFST--DP----------A--VQRRKFeveLRQIIKAISGFDEFAGRAHDLGIRHS-HYGVRARHYRSVGDSLWWAWQSVMGSAVDSEHSKVGEAAQDV----------
681 >SRR3954454_13764990
682 ----VLDPAMLVQSTFALVArqRQRFSERFYANLFAIAPETEVQFAG-TPP------------ELRDRMFVEILFLVARSMSrvdEIAPALTELGARHV-AYGTLGSQLPLAKRALLAALRELLGDAMTAEVEAAWSETYDAMAEPMARGM-
683 >SRR5579864_8015183
684 ----KPDPIFLVHTSFVHLRprMAEFVSNFFRRLLKDSPELAPIFED-ADS------------VRLKTMVAKIFGTTIAGPEqtdQVEADLAELSRRHK-SYGAIPDFLPLVGRAFIATIRESLPDDTTPQTIEAWELLYANTAALMSKGL-
685 >ERR1719483_919245
686 MAVLSKSESDLIYKSWALAAdeKEKHGGAFMVRLFTEHPEVQaKYFPKM-DMN------DFMLLSKHGSKIMAAVDTLVNYVNDgndekLVKTINHVASSHF-RRGVVTrEAFEIVTEVLMNYLITTLGDHLSPEAQLAWKKLLSVLVEVIA----
687 >ERR1711860_359782
688 ----LFSKSNYVFAS---------LSRNTFKLFKDERSLYeKHFSSF-DVN------DILRIRAHGLKVMKAVNSMVEAVSDendesLIDQIHFVAHGHH-LRGITPrNEFEVRRKILNLDYHLLFHyllkkGCLSQSX--------------------
689 >SRR6266545_1588040
690 -------------CDLEQAVdtCPA----------A---LVIGLRP--ATMG------------TL---------CYMGGLAsa------AVCCWRHV-RVVTCSQFF-------------------------------TTASPQSRQ---
691 >DeetaT_16_FD_contig_41_1516467_length_281_multi_3_in_0_out_0_1 # 3 # 167 # 1 # ID=1772959_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.418
692 ------RRMNLVKQTWRSVEfglGHKATQAFYDRLFANHLDTRRLFAG-VGM------------EGQSRKLYDLLRLAVRSLDDldaIIPTVQEMGRRHARSYGVVRDHYGAVTQAFIEILHQYICSqlghmahsRYLVDVADAWAWCLNLIGNIMAD---
693 >ERR1719433_537024
694 --ALRISIVGREKRA-NCTVtlgRVEQGELQVGATVLLVPPGAECGVQSvevdgREVRSAqagefVCMRLLgcQP---SVGHALSSVD---GPLRSatkLKVRSAQAGEFV------------------------------------------------------
695 >ERR1719161_1849694
696 --ALRVMVLGMTADKVG-AAlegHVEQGTLRAGTRCLAAlsEGQAECNVQIvllngVEVSHAgpgehVRLKVTgaAAKGFTAGQVLSCIS---NPVRAigkFKAKLRLMSLPEM-LS----------CSLLVL----------------------------------
697 >ERR1719277_2163216
698 --EATDAMKGAVQRSWDQIQalgTTVVGEHVYRYFFELVPEAVNCFPVHvrlkyREwiADEPdenGDLRNSAALRNLFAKVLNAIGCTVAGLQDaskLVPLLSSLGARHI-GYGVSEEFWPALGKAINRTLQDLLAEAFTPEVENAWNTVYGFMSQIMVESLR
699 >tr|A0A2G8RXV1|A0A2G8RXV1_9APHY Uncharacterized protein OS=Ganoderma sinense ZZ0214-1 OX=1077348 GN=GSI_12102 PE=3 SV=1
700 PKPLTAEQRKLITAIVPVLEqhGKTITTLMYNQMLEENPALKNVFSKS-----------KQERGQQPEVLARSLYAYASHIEDlgpIMPFVERIAHKHA-SVHVEPAHYDVVAKYLTNAIIQVVGaDVLAGALYDAWIAAYWNLAYVFIDR--
701 >ERR1712080_154454
702 -----DLQKIIVKHQWARSYnegmsREYFGQAIWRAFFKLDPGARRFFTRVRGD-----DISHPKFQAHSLRILGGIDMCLSLIDDvptFEAQMKHLQGQHI-EREVPSYYFDRLGTVLQEVMRAATGYCYDE---VAWGACYKYISDRIKANY-
703 >tr|A0A0S2MLM1|A0A0S2MLM1_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1
704 -----PLDRILVKAEWAMASdgghkDSELGSSIFRALVNIDPALRGTFSAVGGE-----DMGSAQFRAFAFRVVAGIERLIAVLDVdavLSADLAVLHSQHV-ARDVSAANYESMLSAIMSVVPSAvGNSCFSS---PSWSRCLNVIAAAM-----
705 >tr|A0A066YRR6|A0A066YRR6_9ACTN Putative oxidoreductase OS=Kitasatospora cheerisanensis KCTC 2395 GN=KCH_40190 PE=4 SV=1
706 --PPDAADLALAGAVLAALRpvADRAMAHFFALMFLRHPELRAVFPA--A------------MDGPREQLLRVLRECVRHGDDpaaLRDRLGPLARRCR-KYGVLSGHYASAADCLVEALARYG-SGWDERAEAAWRRLLAPVARLLVEA--
707 >ERR1719329_2046659
708 -----------IKTVWAKIMkevgTLNAGTMLFKNVFMLAPETKQLFPKFRHLK-DDLLLSNESFKNQAKLSISALSNAIMSFDDppkLKRMLMDLGRIYE-SKGVSLATLPIVGNALMATIEAALGNDSCIETFNFFALFYNEGSNMLAEGYK
709 >ERR1719265_1860150
710 -------------------------------------QALNYFPRFKMnnlLF-SDALFEDEIFKIHAYKLINAITNAIDLLDEpvkLTETLKHLGRIHE-NKGIPAESFVVIINAFNVTVANLISRDSSIETINFFALFMNEGTNLMTDGX-
711 >SRR3569832_2958212
712 ----PALVRSAPDSAAALRrcRCGGTAEKIAERARADD----------------------------------------PESEKsrgAGADDERIGRTAQ-AIRCSAGRLSSGACCAVGGHGGIGGX--------------------------
713 >HubBroStandDraft_4_1064222.scaffolds.fasta_scaffold919957_1 # 1 # 597 # -1 # ID=919957_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.524
714 -HPMDPSRVMRLRISHGWFAPcgEALVARCFQILGEQTPGTRSLFP--ADTA------------SLHPRILRTLRQVLSNAHEfrtLEPPLARLGEKLQ-RRAGGvehlLPHAAAFRDAFICVLAEAGGRSFTHQMEQDWRMLLDGVLGAMIAG--
715 >tr|Q1GDP0|Q1GDP0_RUEST Globin OS=Ruegeria sp. (strain TM1040) GN=TM1040_2494 PE=4 SV=1
716 -AILRQIEVQLIKVSFNRVFaqKAALAEKFYHHLFLELPDAEVMFT--RDFS------------HQTEMFARVLTTGMQSLGRdreMMVLVDDLLQRHK-HLGLTLDQMYTAQRALHLAFCEVMQAELTAAEVSAWDNAIGRLCRALAAGI-
717 >ERR1043166_6829872
718 -LNLTADEIDRVRTSFDQVWaiSSRMADLFYDRLFAGNPFARSLFPA--QQ------------DERKQNFMLNLAVIVAGLDEradMDRSEERLVQAHA-EAGIRVDQSEVMRDALFWSLEQGLGPAWTPGVAAAWRKAYRLLSEHMAS---
719 >tr|A0A257MW93|A0A257MW93_9GAMM Uncharacterized protein OS=Methylococcaceae bacterium NSP1-2 GN=CG439_2278 PE=4 SV=1
720 ---VKVKNRLLVKLCIDEISpkIDIVSQLFYQELFHLNIHLKTIFSG--NVT------------FLNRKFINMMAtfKNVKHLEAIENSVEKMGERHVLHYRVQLKHFPTLKKALLLALKKHLGERFNAELEAAWHEVFDDVAEIMQRA--
721 >SRR5690554_3276444
722 ----xmSDADRLQVQASVERIRgqMDGFAGCFFDKLFALQPALRELLAT--E-E------------GRRSKLRSMVStlANSRDFDKIAPAIRRLGDRHR-DYGVGVQDYVPVQQALLHAVAQVDPQGQSEQVQQAWSGQFQRISALMEPQ--
723 >UPI00042C7A07 status=active
724 ---MNDTQRLLVKADIDSLGndINALSQIFYRELFHIDINLKSVFPG--NVV------------FLNRKFANMLAtfKNLGHLEKIGASLEKMGERHLANYGVQLENFAPVRAALLIALRSYFKENFDAEREAAWQAVFDKVADIMKAA--
725 >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold510383_1 # 42 # 362 # 1 # ID=510383_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
726 ----mTSKDRALLKECVEYIEsesINELCDIFYKKLFDLDPKIKLILSD--NDV------------VLRRKFFNMFStfKSVKYIDKVSEIILQMGARHK-SYGINEKHLELMKEPLFESLHEVLGDEKFNYYKAGWEIGYQEVENLFKEG--
727 >ERR1700737_3653126
728 MTALTADQIARVKATAPVLAehGVTITKHFYKRMFTNHPEWKNVFNQ-AHQQS----------ASQPQALARAVYAYAAHIDNlraLGSAVSHIANKHA-SLNIRPEKYPTCGKICWRQYPKCWAIPSMNPRSTPGAPLMRNSRRFLSGR--
729 >SRR5919197_656730
730 --LLDDDTIGLLDESLRLIDdrSDVVVNHFYAAQFATPPPRGLLGSR--AR------------GC--------LGRGVR-----RDGPGDVGRRSR-GGGGRAGLV--EGRD-------------------------------------
731 >SRR5919106_2778213
732 ----------------------A-VDRFYAA-VLGDPELAGYFTDvdidrvkrhqvlllsdvlggpesyDGPD------------LGQAHRGlgitdghyDKVVGYLVAVFTDLgadGDTIAAAAEVL----ASVK---PQ----I---VEDQAGSRDSHEX--------------------
733 >tr|F4F3R7|F4F3R7_VERMA Oxidoreductase FAD/NAD(P)-binding domain-containing protein OS=Verrucosispora maris (strain AB-18-032) GN=VAB18032_21340 PE=4 S
734 -------MRDHPAAEVGGIAeavFGRAAARFWDTVQEGCPGLLP--------------------EGDAPLILAGLLRLVGGGDDRpgrLALLTVLGRVYR-EHRLRPDHAALVGA----ALT--VAVPSMPPEAATWRRA----WRlVERA---
735 >tr|A0A2T3A5F4|A0A2T3A5F4_9PEZI Flavohemoglobin OS=Coniella lustricola OX=2025994 GN=BD289DRAFT_370338 PE=3 SV=1
736 --ALTFKEAQLVKSTIPFLReqGEELSNLVYGNLVKRNPELNNKLNVI-HLQDG-------RLARALTVVILRFACNINDMSELIPKFERVCNKHC-TVGVQPMHYELLGALVIEAFESLMGDALTPEIRAAWTKAYSILSHMLIGR--
737 >SRR5439155_13306073
738 -VLLD-------GGTLRAVRmsGDTRSEPWLKDLWERGVAVGELRRHLllpletppGLP------------VPRGRILCNCFDVAESEIDAfla-------------------------T-SNSIAELqarlkCGTNCGSCLPELRRKSLCDIG-----------
739 >ERR1043166_8897093
740 ---GTRDQADIVQLTWHSVLpvGGTFAELFYGRLFALDPEVRRLFKD--DI------------VEQGRNLTAMLSVATANLVKperVGRPPGGLHFRRK-D--VDQRVLEREEERVLHQRemlrPHAVSGVALAELMERHADAP---GGVHRHA--
741 >Wag4MinimDraft_6_1082665.scaffolds.fasta_scaffold479856_1 # 2 # 223 # 1 # ID=479856_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.387
742 -IALE-------DGRLRAVRlaGDTRAASALLELWERQAPVDAEDLPEtPAH------------ASRGRIICNCYDVSETEIAAy----------------------------RSLADLqaalrCGTSCGSCLPELRAKFGVIPR-----------
743 >tr|A0A2B4SBA2|A0A2B4SBA2_STYPI Serine palmitoyltransferase 2 OS=Stylophora pistillata OX=50429 GN=Sptlc2 PE=3 SV=1
744 --QISQKQISLVQETWGLVsgDLEKVGVDFYMRLFKANPDVLQLFS-FRDIDKSsdDIMRADDRLKRQGLVTMQHVDLAVNSLNDlgsIVPALRDLGGRHA-MYKVEEHHYVLVGSVLLDTLNNGLGDNFTVEL--FWAALLNTLDKGLGE---
745 >tr|A0A0C1L0Z1|A0A0C1L0Z1_9BACT Uncharacterized protein OS=Flavihumibacter solisilvae OX=1349421 GN=OI18_18680 PE=4 SV=1
746 -MEMTPRQMQCVRNSWRNFrdlDPAFFSEPFYAKLFADHPAAKKVFGD--NL------------AEHFSFLHEMLSQLVSRIDRPdqlLITCSRIARNNA-ALGMNEKFYEWYGHALIWTLRQGAGADWNMETEQSWISYYKYLVD-------
747 >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3668839_2 # 105 # 377 # 1 # ID=3668839_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.656
748 --------SGPLAASLAIFEprLEAVTARLVDVLAASSPHLLALFPP-SSE------------PS-----AALLGRFLTRIVEtesLGqPLGDGLGLDAY-PIP-TRDQWEHLVESFIWSLSAVAGKAFSPPMARAWRATGERLFSTMFES--
749 >LULI01.1.fsa_nt_gb|LULI01000097.1|_29 # 27187 # 28320 # 1 # ID=97_29;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.310
750 ----------------DEIKgrH---HSMFVDEFERQQPQYKD---------------------------------FWARL---NrGEYQAGEYRRY-GKG-GKEVWIQA----------------------------------------
751 >SRR5947209_9205436
752 --------VLSVLRSpssplF---PyttlfRSRltver--DSERDVLMvaggtGIATMRAL--LD--DLA-------------QWgENPRVHLFYGGRTDDDlyaLDd--LHQLDRKST-RLNSSHANISY---Avfclk-------------------------------------
753 >SRR5690606_15697619
754 --------VRVVAGGwvsralvrqtvpgdrW---RvgapMGElwrdr--DVQRDLVLiaggtGVAPLHAV--VE--DLA-------------GRatQPSSVTLFFGGPTADAlyfLPe--LRELAADLP-WLKLVP--------Vte----------dgsvddgergklPEVVTALGGAWSGHDVLVAGSPGMI--
755 >SRR5919202_1970091
756 --------VQMVPGGqvsstmvrslkvgetV---RlgapLGQaltlyag--ERHRDLIMvavgtGLAPLRAH--LE--RIDQ-----------EwqSTgRAPRVRLFHGARLPWGlyeNRl--LQNLAG-RP-WFTYTP--------Vvsddp----------typgrkgwvGDAAAVS-GPLHGLLALVCGSPEMV--
757 >tr|A0A1D8N423|A0A1D8N423_YARLL Uncharacterized protein OS=Yarrowia lipolytica GN=YALI1_A07937g PE=3 SV=1
758 -FNMTREDINLTKELWAKLMndPEtlessaaygtptaLFCEQFYTNLMASHAELTSIFP---SI------------KKQSVAVAGVFGLAIKSLDHiekLDEFLWSVGKRHNRMIGVEPIHYRWLGEAMIKTFADRFGDSFTLEMETAWIKIYSYLANKLL----
759 >SRR6266851_2503075
760 -----------------------------------------------XM------------RNGSASLPLwPARYGAWTTRRpspNISAPSRSTI-----------ANSVCGRAITNWSARRCSPPSVSSAASGWEAAFNRIATIMIQ---
761 >SRR6059036_2276597
762 --ALFPGTSHWVV---AAGMarP-ESKDHPMLTVAQKTLVQ-------DTFA------------IITPIADDAAALLYKKLFEldpSLERM-------------------------------------------------------------
763 >SoimicMinimDraft_1059729.scaffolds.fasta_scaffold91729_1 # 2 # 175 # -1 # ID=91729_1;partial=10;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.661
764 ----SMEDRLEMIHEWETVWsaeftgrRVLIAQELFSRLFEKDGTTQALFKNVG-G----DDVNSALFKAHCVRITDSIDTIVHMASYtdvEHQLLDHLGDQHAHYDGVLGSHFKLFRECFLEVLPQAIP-CFNS---GAWGRCLKVFQDEIALH--
765 >ERR1700754_2066947
766 -------DPGdrQLARELLAGAagGDDLDALvehDRGAVLEIAREAVPVaLAQ-ADR------------DdQLGHLGA--------------DRlLRGPAERPL-GRGAPLQDVALVvhrddavergqqqRAVALAAGAELVGEIWERQERGSLtARRYGSNRSI------
767 >SRR5208337_544005
768 --TMTPQQTRLLAQSYAKLEnrLYELGSAIFERLFEIDPHSRPLFK--GNMD------------EQKLKLARLFGEFIRIRarsqhflpvtgkagQVVIPGIGSLGARHEMVYGVRPEQYAHMRDAVLYAIRSLLGNDYNDEIGQAWSEIFDMLAHAMQE---
769 >tr|A0A2A6CNA4|A0A2A6CNA4_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_32112 PE=3 SV=1
770 --QCNPRYTALLKSTWSDDfEvLFALGAKMYITAFEgpHGVACKSLFPWVAKYEeAGENYADKSEFRLQALRLVQTIVKALDKVDDlqkLEAYLYAVGHRHV-FYlpvWLDPVYWDVFKasratsylgqstmlksaserDAVQVGVNDHLHKlsKLSTddlaRATLIWTDIIEYIFEYVKEGF-
771 >SRR5437763_1847173
772 ------------------------------------------------------------MVRQKRHMVALLSQVLGGPKQy---QGRDLAEAHR-SLGISGLHYERVGNYLLASLLiaqapydvinavtdvlagqrdKIVAAAWAAELAADWTDAYSLVARVMVE---
773 >ERR1719244_1430206
774 -TGLSRKQRFLLKGSWKGVSrdLESTGVSWFLELFETCPNARGSLRQFSHISLDDDLTENQPFREMTEKVLERLDNALFSIEDadsMRSILLETGDYLRSVVGLNNDIILQSEGPLLSAIQRTLDERYTPQMEVIYTVIVKFMINTMVEX--
775 >ERR1712228_920792
776 -----------------------------------------------HISLNEDLTEVQQFREMTEKVLERLDNALFSIEDadsMRSILLEAGDYLRSVVGLNNDIIMRSEGPLLSAIKDFRREIhttngsdlhsdskihdkYNGRMRPL-----------------
777 >SRR2546430_6350501
778 ----GRResRVRGGQGGWV---sRAIVAEPQRGDVGRSGPAMGRMKVD--RG-------------AGRDVVMVAGGT------GlapMRAIIDDL----A-QWGENPRvhlfyggrgrggPYH------PPSLVSTAAAqPGVPVVavagaeaglshkeagspagggvrHGALAGRG------------
779 >SRR6195952_1380156
780 ----VALAGEAVRAIWFRLAdqEADVAHWFGALLFSLAPHLRAQFPA--QA------------DRAARRLLRASIAAMSAVDRpqeFPAAIGTLARETR-ALGLDASADEPVGVALVGAVREFAGELWAPGADAAWVLAYSLAAEPARR---
781 >ERR1700709_350262
782 ---------------------------------------GDLDAD--AT-------------AERELLVVAGGRRGGVGpaprGepaGpsgAGGGRPPRPARLA-AGVDVRRttvivgartaedLHT------LDRFAVIGEDaPWLAVVgacesdplelglapgpvvegitrAGPWLEHDVVVA--------
783 >ERR1700709_656719
784 ----------------------------------------------------------------ADVVAVAGGP------GasgALALGDDLAAQAA-AGVDVRPttvivggrtpedLHT------LDRFAVIGEDaPWLAVGgacesdpldlelapgtvveaitrAGPWLEHDVVVA--------
785 >SRR5262245_28534727
786 -------efHVKTVPGGWV---sASMVNDTQVGDEWKIGPPIGLLGLV--TH-------------SQRDLLLIGGGV------GvapIMSIVPEL----L-RRRSSNRvslfhgvrypheLYL------NGTLDDLAARdPNLEVVkvvsrdrnyagitgslpdvvaqHRDWSAYDVVVS--------
787 >SRR3569833_3303276
788 ---------------------------------------------------------------------------------pNNTNHDKH----T-HRKRNPPehqniggkrpedLYV------LDDLRRLTAVsKWLTVTgvteegaipggdrgtlahavaqRGVWEYYDILVS--------
789 >tr|A0A161TXB5|A0A161TXB5_9DIPT Globin 11 OS=Chironomus riparius OX=315576 PE=2 SV=1
790 -ATLNADEAKLVKGSWDKVKGQE--DGILYAIFKENPDIQAKFPAFVGKN-LEEIKSNDDFTKHADRIVAAVSKYIELVGNeantpaIKTLLNELGQTHR-SRGATKEQFEKFKSSVAKYLKEHSG-AWSDATGAAWNKAFDEMYAIVFSSL-
791 >tr|V5YNC2|V5YNC2_9DIPT Globin OS=Polypedilum nubifer OX=54969 GN=PnHb4 PE=2 SV=1
792 -ATLTESEANSVKTSWNLVKDKE--DEILYAIFKENPDIQARFPLFVSKN-LEEIKTSADFKTHADKIVKAISTYINLLGNeantpaIKTTLNELGQRHK-DRGATTEQFEKFKVSVLKYVKEHAT-GLTADAENAWNKAFEEMYKIVFANL-
793 >tr|Q23764|Q23764_CHITU Hemoglobin IA (Fragment) OS=Chironomus thummi OX=7154 PE=4 SV=1
794 ---------------------------------------------------------------------------------tILAKAKDFGKSHK-SRTS-PAQLDNFRKSLVVYLKGAT--KWDSAVESSWAPVLDFVFSTLKNEL-
795 >ERR1712170_324299
796 -------------------------------------------------------rVCREKLNVHALCVVAMIDKGISVLDKpcdFVELLLIHGRRHK-NHGVARKTFQTLGNFFIQSFKEVLEDDWTDEIEAAWKIFFRFLNIGLEAGY-
797 >SRR5688572_12388254
798 --SMNEEQIKLVETGFQSITgrGERFISRFYENFFAASPKAEKLFAQT-EW------------PNQSRKMLLTIMMVVDNLRDaahIKKMLHEANLVHQ-KFTLQADDFDALTDAMLRTLREFLTDDWSKEAEDAWRAAFAKINAIMLEA--
799 >ERR1044072_9602616
800 -------LEQSGYTVVGRAAdaRELmLKVRSYVPDVA--------VVD--VR------------MPP------DL--------TddgLRAAAEI-RRSHptV-SVlVLSQHREPAYMLELVGDDASGVGYLL-KDRVRDVTQFVDAVQRVAAGG--
801 >SRR4051794_28399871
802 -------EHEAGTDLLELTD------ALVRAGVPCADAAQEAVAG--VE------------LPHGAQLPAER--------LadrLERRRVD---------lD------------------------------RLLRFGEDAG-HLVLGA--
803 >SRR6266545_7915566
804 -------ELDTLETTFDLLAprGEELMDIFYARLFAAAPGGRAAVRR--HR------------PSPPEGSPPRR---------ARAPAQV---------aA------------------------------QPRCDRPDAA---------
805 >SRR4029453_17830486
806 -------DLQALETSFDLVAsrGDVLMDVFYARLfaaapa------VKPLFAG-TDP------------RRQKAMLLGALVRLRGSLRGppaFVPPLPRPGAGPggE-APlrrhrSPAPEGHAARGPraaAWLPARPAGVRSGaatPRGQARRLWRPAGALPGGRRgpdrLHG--
807 >SRR3546814_7943381
808 --------------------------------------vfirlslsliiilvyRFLFFFF-SSR----------RR-HTRCVLVTGVQTCALPIS----TDELIA-------AWAAAYGQ--------------------------------LADLLIA---
809 >ERR1700737_1149585
810 ---------------------------------------------------------------------------------kqPDGSAEKHFEQAC-ESGRPTGAVSHCRGTPAGCDQGSVGRRRNRRDHFHRGKGYGNLADILMG---
811 >tr|A0A255XUI9|A0A255XUI9_9PROT Uncharacterized protein OS=Elstera cyanobacteriorum GN=CHR90_04515 PE=4 SV=1
812 -PMLSSQSIATVKATAPALRphGLNLVVRTYELLLRDPNI-RMLFDP-A--------------rqvnGDQQHIFAETVIAYVNAMDRldtLKATVKHLTIQQA-LLDAQPQHYDAIAIALIQAIHELFGKDAVREITSAWTEALDVLHQESPG---
813 >ERR1043165_5678211
814 ------------------------------------TAglktrkpkgltdsdmdilvpvtA--------------------------ALFLAGMTAYIGILA----LRELSATRLA-SATAAVEHAF--------------------------------LREQISE---
815 >SRR6476660_7153442
816 QYMLPQRTIDIVKSTAPILEehGETLTAHFYRRMFAYNPEVAPLFNP-A----------HQRAGSQQKALAAAICAYAANIDNlevLGGAVELIAQKHA-SLRILPEHVRITPESEIISSFYLQpADGGGLPLFKP-GQYITVRVPDARG---
817 >tr|A0A2D6MWT2|A0A2D6MWT2_9DELT Uncharacterized protein OS=Deltaproteobacteria bacterium OX=2026735 GN=CL908_18525 PE=3 SV=1
818 -----SEVAERLRSSLEIIAEceATFIRRVYEDLFEQHPKTAELFGG--HS------------RAvRGEMVREVLMYAIEHNEGaswVEENLASLGDQHE-VNGVTLEMYGWFVDSLLRIFAEVSGPDWCAELEGSWRTALELVSDLMSSPE-
819 >SRR3954454_17009507
820 --PFDPATVAVVRASVTKLpsEPIELTREFYRQLFEIAPQARVLFAE--DMT------------DQTERLLSAILAGVRAMDRpelVEDHLRRWGVVHRRMHGVTNDLYVYVGHALIRALHRIFGH-LETSVSSAWIAVYEWMAAVMIDG--
821 >ERR1719446_1443192
822 -----------------------------------------------------------------------------LAQDlsaLCPE---CGFK------VG--TMGVC---QTK------ANDAAIE-----------AKDPPVAT--
823 >SRR6187402_970848
824 --GITTADTLLVQTSWNTVSefSTKIIAGFYKHLFASEPEVRPLFKS--NQS------------VQEKRMALMINTIVNSadsLDEFRGSIAQLAKSHV-HMGVKNEYFPIVVKAIISSVEEQYGKGFTSAHKKAWYKILNQISAIMMEE--
825 >SRR5215510_10546783
826 -----------------CLDrcRLFVVFYLIACiivlffFFQAEDGIRDGHVtgvqT--CA------------LPIWARLLGAIVTAVQTIEDperFDGYLRALGRDHR-KFHVEPAHFGVVGAALLDALREFSGTQWSHAFEQAWRDAYGMMARKMLA---
827 >ERR1719150_2276450
828 -MGLTKAQVAAIQNNWATVSqnMQDVGDALFMRYLTANPGDLSFFPKFQGAGVGPQLHSNEDFQHQTLTVMQFLGQIVAHLGDIPaaeGMLRERVKTHH-PRGISMAQFERLLDLVPRLVQEICGA--SGPTADAWRVAVATLMPSMRDEF-
829 >tr|A0A1K0GS94|A0A1K0GS94_9ACTN Globin OS=Couchioplanes caeruleus subsp. caeruleus OX=56427 GN=BG844_22340 PE=4 SV=1
830 --GMNPaddaelhAVQRLLISSLEQAGgQVEVATRLRAALAQAGPALFARIP--GGP------------LAQVEQLAEGLAWLAQHTDqPpaLVAGFGRLGAVLA-ECGIAPQQLQLAGAALAEAMRAgMAANGWRQDYDQAwrstWQHAYQWIAHGMVAA--
831 >ERR1719193_2756600
832 ----------------------------FM--EKKVPSVIV------FLN-SLSLDDDGALETHALSVMNSVNKVVSRLDQpdrLVQLLHDLGRKHI-SYKANMAFLEPIAKHFILTIKPSVA-EWSPEIEDAWQQAFKVIGHIMQE---
833 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold5203666_1 # 3 # 269 # -1 # ID=5203666_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.315
834 -----------------------------------------------------------DFESQGRALTRMLAWIIQNMSNvsqLVPVLAQMGGRHE-IYGVKDADFGTFATTVANSFRSVLGPEIiDDDAHQAWESCISGIGGLMQL---
835 >SRR5215203_6923026
836 --PGDSGADRAGRAD---AerDQAGLRRGRG-RLLPPAVRRRPLRggavhhrA--GH----------PTgEADRGAGCGDALDQAPRRVPAPgrh-ARPAAPGLRG-----------------PPAALRHRAG---------------------------
837 >SRR5215208_6178010
838 ----GRGRPRPDTAIIRRGVagQPTIRHLFYDRLFEHDPETRLLFR--SDLD------------RQRLRLLTMITAMVGPASDdls---------ATNA-GhAGVPPWRWLSLA-----NARDVADP--------------------------
839 >tr|A0A074ZZ62|A0A074ZZ62_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_01589 PE=3 SV=1
840 -----------MFDELPPATdhLSKK--ITSGRA---LGMICSNAN-VHTLS-NEEIAADTRSKQHILAFMDVLSKAIGALDGgredFCEKLMVLGARHAAIPGMKLEYFKVFKQAILMTWEALMYEEFTEDVRRAWAHLMDYIIGILSEG--
841 >tr|A0A2A2WQA6|A0A2A2WQA6_9ACTN Oxidoreductase OS=Dietzia natronolimnaea OX=161920 GN=CEY15_08520 PE=4 SV=1
842 -----STATPPLLALRDLVTDPRFTDLFARALREADPDFRELFPR--DA------------SGVLGEFVRAMSWALETVEnargdeaevaQVVEFARHLGADHR-KLELSTRHHQRFGEALTSTLRHLAGPGWDDRLSTTLGTVYRVLTTALRE---
843 >tr|A0A2W5I8T1|A0A2W5I8T1_9ACTN Uncharacterized protein OS=Lawsonella clevelandensis OX=1528099 GN=DI579_06450 PE=4 SV=1
844 -----PTYYTVLGPAITLLRehPEDFMRHFLAAALTYDFHFHTFFPS--VN------------DHHASRYTHALRYILEALDqstndpdcldDVIDFLSQLGCDQR-KYQLTAEQYQSLAAALRDTFALLLPYQWSTELNDALLTSFEHAINVMQS---
845 >tr|A0A2N6TBK5|A0A2N6TBK5_9CORY NAD(P)H-flavin reductase OS=Corynebacterium kroppenstedtii OX=161879 GN=CJ202_05310 PE=4 SV=1
846 -----GVHEASLVPVVTVLQtdGSRFVDAVFTHLFARRPSFIRRLPA--DL------------SQLKPSFRRALVHVYAKQAtgngldrRTRRFLRHLAEDHR-SFGVEAPDYVAMGDAIIDAGREIIAPQVTSEEFELFAMATGQIIGLMEE---
847 >tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 OX=1581089 GN=HMPREF3121_11375 PE=4 SV=1
848 ------------MRAAAAFGrqAPTIGPEAFRRLLDAEPRFRHMFGG--SK------------TALRDQFMSALSTALVTRAdvgrfpaATIRRLEQLARENR-KFGVAPRDYATLAEHLLDVFGERLPAgpdsgAQVDALREILDEAMSLI-AAAAV---
849 >tr|A0A1Z5KPX1|A0A1Z5KPX1_FISSO Uncharacterized protein OS=Fistulifera solaris GN=FisN_16Lh317 PE=3 SV=1
850 --VASPACVMKVINRWETARqrngfDEQLDIDTLLALFKMDPQVKPIYG-FAVEKEVkAQGMQRMGVLIYGLQVVKMFDVILSALGPdeelFYDVVTEMGEQHC-KHGLTPDHFTLLCGAVMGVLETIMDTEWTKDVRAAWSQVIECVNAEIVK---
851 >tr|V5YLS5|V5YLS5_9DIPT Globin OS=Polypedilum nubifer GN=PnHb25 PE=2 SV=1
852 -PTFTDAQVATIKGDWNNIK--GQGVEILYHFLNKFPGNYPMFKQFGGKD-LNAAKGTPEFSAQATAIINLLNGVMDKLGSdnagAQAILANLGKTHK-AKGITKEQFQQFREATTELLGNLG---L-GGNLGAWNALFDFVLNVVFTA--
853 >AP82_1055514.scaffolds.fasta_scaffold183032_1 # 1 # 312 # -1 # ID=183032_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.529
854 -HPITSEEAETLRTLWSQVK--HREADILYVIFKENPDIQAHFPAFVGKD-LEALRKSLAFAIHSTRIVSFFSKIATLAGDpsnlpaSKTLMNELGSSHK-SRGIQKEFFNKFRASLDGFMQRQS--SWNDNAAVVWNKASDNFYFVLFAS--
855 >SRR4051812_13904716
856 -AGMSPEEVALLRHSLDEMRadGPQAAEAFYAELFRLDPSARELFHL--PV------------EQQSVVFFHELDallSAVSDLPAFVERSRRLGRMHA-GRGVRPEHFEAAAAALDAMLLAVYADGASPELRRAWRHAYRMAAQLMQEA--
857 >ERR1711860_53158
858 ------IYFSDIKSTWDIVKdeIDQIGMLAFLHLFEAHPEAKTKFKMFEDIPT-DDLKTNEIFQNHAHRVVSVIRKVVGKLDEPsvyLNYLKILGGKHI-MFDADVKYIKQMGYMFLSAIQPTLEKevGITLKYV--FKKTFX-----------
859 >SRR6266536_6175029
860 --LMTPEQITLVQSSFERLGpqLPAMATRFYQELFTRDPALRPLFTT--PLP------------QQEVRFAEALTEIVRAMprlDELLTHTRAPRRPArrlR-GTGCRLPDPRRRPprrargrpgRQVRRPHTRGMGPRLQPcrrdharrrsrgPAHQQLTTTAAPTASQADGG--
861 >UPI00012780C8 status=active
862 -MSLTNETKEIIKATVPIIEknEAELTKKIYPLLFTRNPSMKIFFNR-DH----------LRKGTQPRAFIGSIIEYAKNIDNldaIKPLINDIAEKHA-ALNIKPVQYSIVNICLLEVFGKALGTRGTHVVKRAWKDAIEDLANIIIK---
863 >ERR1017187_3590871
864 -----QVDCAILKQSFAHIEsvAEKAVGYFYARLFVANPELRSMFPL--------------AMDATRKHFLAALAHIVWSMDDpqeLADYLPGAHRHSA-H---VQRRYVDLPGAVrLGGgdrSHRHSHDPGGagRRGRASLVAGX------------
865 >ERR1035438_6477963
866 -----------------------------------------------------------------------GARGSPRPAEpaaLSK--------------------QMIDRPLRAAgaaPSMHNTPPWRfgVRPDRLTIELRADIATVMTQA--
867 >SRR3546814_3749254
868 ------------------------CLFFFFCFFFSSIRRHTRCA----LVT-------GVQTCALPILFNAIAAYASNIENlpaLLPAVEKIAQKHT-SFQIKPEQYNIVGTHLLATLDEMFSP--GQGVLDAWGKAYRSEERRV-G---
869 >SRR6266704_3508957
870 ---------TITRAEFCAGRsnrgsKQAFACECYATLIRLHPEVKPLFTH-TSM------------EKQAKKFMASLTLVLHVLGKpdvLTTTLQRLGRRHQ-TMGVRVEHYPMVAEALLATLKSGYAVVLLT----LFVQSYMFL---VRKGA-
871 >SRR5215207_7267255
872 ------QAV-----------agEPEVRGSILRKAVRIGPDRANLVQ--GGP------------RGSEDEAaQHACDDRWSRLSTrdLRLGCRGFGTTSR-TVRCDAGSVFGGRRSL---nleLGRGARTRADPVQARSVERFLQGGSALHVEG--
873 >SRR5215470_13616785
874 ----------------------------------------CMVTL--CH------------CSFTqtcscGTRRRGICSRFRWLPSatgWCMRWAGSCPTSR-TSTPSAGTcRTWGASTASSAPSPSTTPTWTPELAADWKAAYDLVAQVMIG---
875 >SRR4249920_1577195
876 -----------------------------------------VWPC--TA------------TRCRCSSTRTC-----scgtrrRETCsr-SRWPYSATGSCT-RWP-GSCPTSTTWTTSASTCRTWaaSIASSAPAPAADWKAAYELVAQVMVG---
877 >SRR5258708_22654124
878 ------TLARLLKESWSLVEdrADHLANHFYARLFLIDPNLRDMFPV--QM------------AVQRSRLLGALVEPVQTVPNpsqVVPCFLSLALAQP-TIRLLPGQFEAGRSAPIDP---------------------------------
879 >SRR6266511_448526
880 --------RRRRRRAATSSGraSHRLRDsRLEARARDRSRRVLDDASS--WV------------EVVRLGDAGEPVVLVSAVAAiahRDVRRVELAREGE-RVRL-------QVLNVDAEEDDLAGEHWSVEYDQAWRDAYDRIARVMIM---
881 >SRR5579862_1310240
882 --LMDPLRIRMVQDSLVKLTprEGSIVDLFAAELSGSPHDESETGG--DNIA------------YQrERSVLGIMAAAAPFLHAPeciLDEVVAEI---G-AGRIHPADYDHAANAFLRALKKNLGAEFTADLWEAWLEALWTLCNLLSRT--
883 >tr|Q5DGY4|Q5DGY4_SCHJA SJCHGC09035 protein OS=Schistosoma japonicum OX=6182 PE=2 SV=1
884 -LSINDEQLLLLQSSWSIVkqHIEKIGVITFLGIFEQHSDFRDAFTEFRKRK-FVDVKHDPAMQVHGLRVLSIVDKMITRLPKtddIELKLMTIGSKHC-RYVPTIGLISSVSDQLWGAIEPVLkeEGSWSDELAVTWKTVLDYLTKTVR----
885 >GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6550916_1 # 2 # 442 # 1 # ID=6550916_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510
886 --------LELIQQTWEKVKphGKEWGPKFYNNMWTKYPEVRTKFFP--E-SKP---------EIQGPRLYASLNFMIKNASDietLKQYCFNMGDRHK-KYHCGAEHFQVVGDAFIMTLTEFLGEDFTPELKQQFQLLYDTVAEMTI----
887 >ERR1719360_423992
888 -EPLTQAQKEIIFTSWDAItHKENLGVTIMYRIFTGHQEIKHLWKFADDLKTEEEIRGSKTTQFHAKKVINGVNSAIKAVEAgkeVESlGLDKLGARHF-KYGAKPADFRHFVESLFWAIKTIVPE-VSAEMAAAWTNFVMQIIKQMTN---
889 >tr|A0A194RIW1|A0A194RIW1_PAPMA Neuroglobin OS=Papilio machaon GN=RR48_08766 PE=3 SV=1
890 -SPLSAKQQYCMLASWKGIFrqIEKTGIILFVKLFQENEELLHLFEDFRHLQTVEAQVSSTELAEHATKVMHTLDEGIKGLGDMDsffAYVQHVGSTHTQVPGFVADNFMKIEKPFLDAAKTTLGDRYTPNIENIYKITIRFILENLVKGFE
891 >ERR1719153_450463
892 -MPLSEGTISILKACHPIPvaNREDIGSSFYTLLFQQHPETQNLFPL-SHVSASKGGKPGPQMRS----HPTMPYLIF-HTkqlF------------------------TIIYNTKIQSX--------------------------------
893 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold8273257_2 # 299 # 427 # 1 # ID=8273257_2;partial=01;start_type=ATG;rbs_motif=TAAA;rbs_spacer=15bp;gc_cont=0.364
894 ---------NELQTNIEDVYsaGDV-C-----ALFDSSaNRYRPtrtwlscafqgEVAAL-NM------------LGQDKVynegvFFNASHayrSMYAVLGNFNPAQAD-GFEFF-VCNQDKENYE----RMVLKDNKIAGAMFVGSMKNVWSVKQLIEGQVDV----
895 >ERR1719244_2234371
896 -VVLEDAEVEGVQTLWAEVSgdLGNFGARVFGRLVHDHPTIRKYFPWGRNDKTEEQLVAAPDTQAHAEEVFGALGKIIGaagHLNDYRSFLVYKGMQHI-PRGVKPEHFDYLKDALVDTLKEELGDKVTPAGEEGLNKVYSFVEKAMSKGL-
897 >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3481696_1 # 1 # 387 # -1 # ID=3481696_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.584
898 --VLTSNDIALIRESWAYAkDIPAIQTETLLEHFRIQPRTQALFPKFADVP-LNKLPTNDAFIKQARSCVSFGLNFIVANLDNPSLLkDMLGRVdTyG-KWYVDF--MtkeRQMQTTVdifIQVLSKELGGRLSAAAKAAWTRAMTLVFVEMMS---
899 >ERR1712198_397898
900 -QGLTEEEITEIQSTWKSIIsdkTSEHGVNILIRFFKNYPEYKaQYFQNLNTLS-EDELRESPKLRSHGAGFVLAITQIISDLDNmliVEEVAKKIARNHY-NKGIREPlNYKLMTNTIIDYIKDIGN--LADGTMQNFRKMFDIFIISVRKKY-
901 >SRR3954447_20457037
902 ------------------------------------------------------------------HKVKVEDIIVRGGGNLMVEL--MNTDAA-GS-----PLDTPVRAVTDG------TESTAAAREPI--------RLNPG---
903 >SRR4030088_1427564
904 --------------------------------------RRGRDGG-QP-------------R-RRELRRDGQepdepDASRRGdrgRPCAGPASR-----------------R--RGSAAGCRSSPPSPAWPALSYEQWRETCDTLHGhTQVLG--
905 >ERR1700752_5389668
906 ----------------------------------VVPQVPAARSR-VPL------------R-AASFRRGGLehdpdPKGRVSakqEPV-FGK-------------------D--HGQTIRLSARGQSS---PrRNDAARETTCKEARMtPEQVK--
907 >SRR6478735_7013605
908 --IMTPEAIRAIKTSYAAVatQPRQLASRFYSELFTAAPNLRPIFP--ADLT------------LLQGHFEAAIAMVVRNLDEmtaLREPLRDLGAQHV-HWGARPEDYVTAREALIGAVRGTT-RHDRRSAGRCVSRPTRSARpIGSRR---
909 >SRR5262249_59625092
910 --SRHRDAAVLVRTFTCAPpaPPGRRASRLYEGPFPADPDLRPRFP--ADLT------------LLQNHFEAALALVIRNLDDmnaLREPLRDLGAQHV-HWGARPEDYVTAREALVKAIGALS-ASWTATLEQYWRSAVTSIIvT-MLX---
911 >tr|A0A0P5LQ45|A0A0P5LQ45_9CRUS Di-domain hemoglobin OS=Daphnia magna OX=35525 PE=3 SV=1
912 --LLTANDRRIIRKTRDQAKkDGDVTPPILFRFIKAPPEYQKIFKPFADVP-QAELLGNENFLAQAYTLLAGLHVVIQTLFSqelMANQLNALGGAHQ-PRGATPVMFEQFGGILEEVLSEELGSGFTAEARQAWKNGIAALVAGIA----
913 >tr|A0A0P5UVQ8|A0A0P5UVQ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna OX=35525 PE=4 SV=1
914 --LLTANDRRIIRKTWEPRpRrTEDVPPQDPLPFHQGPPRVPEdVQVLRLCSP-SRACEQRKLLGPRPNTILAGLNVVIQSLSThgaYCQPNQRSRSANK-PRGVPPIMFEQFGNVAEEVLAEALGSSFNAEARQAWKNGMRALVTGIT----
915 >SRR3954451_10251525
916 --------TSARRqqWTFPRCGptspRPQRPGTRARCTSTPTCSCAIPRPA--RC------------SRSRWRT-SGTGSSPPSATWlpgsttstrSCPSCSSSGGTTG-SSGPSrRTTRPSVPacWPRSSTSTTS-GARNSPRAGRrptTASRAPDVLATVMIE---
917 >ERR671928_16913
918 -----------------------------------------------------------------ALYFDGIDTGR-----lrVHQTKLLVQVTGG-PVEYDGRELAVAHGGLDITLEHFD-PGWTPELARDWTQAYQLVAKVMID---
919 >SRR3712207_8140349
920 -------------------------------XMIRRPPRSTLFPYTtlFRS------------AHQRDRLFQALGDVVNYVDDldrLVPILQALGRDHR-KFGTVAEQDRKStrLNSSHANI------SYAVfCLKKKKKDSHPSSTTX------
921 >ETNmetMinimDraft_30_1059905.scaffolds.fasta_scaffold1335019_1 # 137 # 232 # 1 # ID=1335019_1;partial=01;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.573
922 --PITPEEKDGAMRVWKMILnnrsehflalkrenKekdvqdaencmDYFMHNFYIRLFDIHPNSKQLFHR--SI------------HKQGSFFLRFLSMCVAEVSEpekLDKTMENLANIHN-KLGVKAVEYGIAGEALFHTIHKCVGPEFNHEAAVGWTKVYSVFLKYLI----
923 >sp|P15447|GLB4_GLYDI Globin, monomeric component M-IV OS=Glycera dibranchiata PE=1 SV=2
924 -MGLSAAQRQVVASTWKDIAgsdnGAGVGKECFTKFLSAHHDIAAVFG-FSGA-------SDPGVADLGAKVLAQIGVAVSHLGDegkMVAEMKAVGVRHK-GYGykhIKAEYFEPLGASLLSAMEHRIGGKMTAAAKDAWAAAYADISGALISGL-
925 >SRR5256885_11466498
926 --------------------------------------------------------------------------------------------XM-LLFF---------FSSRRRHTRLQGDWsSDVCSSDLWGAAYQQLADILIG---
927 >tr|M3IRU3|M3IRU3_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) GN=G210_0056 PE=3 SV=1
928 -QELTPDQLRLITECIPIMEdlNLTLGSKFYRRTTRRHPHLQSYFNE-TH----------HKLLRQPRAFIFTLIMFAKNIHDltpLRDVIRRIVSKHV-GLQVKPDHYPLLGDVLIETLCDMFPYHmVDDKFKTTWSIVYANLASLLIG---
929 >tr|Q86G74|Q86G74_PHAPT Hemoglobin II OS=Phacoides pectinatus OX=244486 PE=2 SV=1
930 MTTLTNPQKAAIRSSWSKFmdNGVSNGQGFYMDLFKAHPETLTPFKSlFGGLT-LAQLQDNPKMKAQSLVFCNGMSSFVDHLDDndmLVVLIQKMAKLHN-NRGIRASDLRTAYDILIHYMEDHNH--MVGGAKDAWEVFVGFICKTLGD---
931 >sp|P41260|GLB1_PHAPT Hemoglobin-1 OS=Phacoides pectinatus OX=244486 PE=1 SV=4
932 -MSLSAAQKDNVKSSWAKAsaAWGTAGPEFFMALFDAHDDVFAKFSGlFKGAA-KGTVKNTPEMAAQAQSFKGLVSNWVDNLDNagaLEGQCKTFAANHK-ARGISAGQLEAAFKVLAGFMKS------YGGDEGAWTAVAGALMGMIRP---
933 >tr|R1EGH0|R1EGH0_EMIHU Putative nitric oxide dioxygenase OS=Emiliania huxleyi OX=2903 GN=EMIHUDRAFT_435200 PE=3 SV=1
934 -SGMSAETIATVDATAGAVApfALDITKDFYGDMIASLPSvVLTVFNP----AHNVPI-----STHQPEALAASVCAYATNIKDlspLlvpGGAVDAINHRHC-ALNIQPAHYLPVHDHLMGSIAhvlgPKLGDALTPEVAGAWSEAVRFLAKVCIDK--
935 >ERR1711974_215400
936 ----------------AKVseNIDINGGILFQKLLTDNPELKELFW-RANKGQQgDQWRNDKNCQKHGKSVILEIGRCLSAVDDaeeFSSLLYKNGVAHK-SRKTTEEHFPLVGEAVIYMLAEALGEELNDECKAAWLGAYGVITEHMLRGL-
937 >AP12_2_1047962.scaffolds.fasta_scaffold738771_1 # 1 # 321 # 1 # ID=738771_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.648
938 ----------------------------------------------------------------------------------------------------MSFITVPGVAArsSFVwlrestaalrgpalvaliyflgaeaafyigtlsdrifalfwpPNVvLFCALLIVPQRRWWLYIAAAFP--------
939 >SRR5260370_506041
940 -----------------VRD-YSSTCSF--------FFFLQAEDG--IR------------DSS--VTGVQ---TCALPIYqerTEQVLSRLAVDHR-KFGVRDKHYEPFFDAVFATAEHAAGPAWTREMATAWRSALDWFGSVMA----
941 >SRR5580658_2929351
942 -----APLRAIV-EEVLRSGgg----------------------------------------------------------------------NVAA-GTGVRRNASLFHGAREPPGFYD--MpGLRELSSSYPWFQV---VP-VIS----
943 >SRR5258708_13478776
944 -----APLKAII-QGILRA--------------------------------------------------------------------------G-GPLLRRETRPLVGAPRGQKALL--PpHPPGSGSVASRPKG---IS-L------
945 >SRR6266704_2687724
946 ------IARPPDR-RPRCGD-GVLLR-P--------AVHRQSRPA-------------------RAVSLRDDANPRGGLPDadrAGQEP--GRRACD-RAGPRPDRQGPpqirrepeALPAVLR-RAVRDGRAFRRPGPDRRDGRGLA----------
947 >SRR6266536_777504
948 -----DGYREALDASFARVAssGEKAVAYFYGRLFAATPRLRGLFPA--AM------------DYQRDRLLCALLQITQRLSNraaLSEYLVQLGRDHR-PPGVPPAV--PGGAACEHPNPTLA-pGVAPllsgvraagqrvarVPHPRRPRRLGQHVPGAVH----
949 >ERR1719498_564827
950 -RRWTERKRLVIQSSWAALLsahgndRMATGSKIFRKLFTGDTAVLRLFP-FRHQ--ARTLFVSAPFKLHAKLFVDTMTELIANLHDLEkveRDVRELGKRHL-TYGVQPAHFDAMGEALIAVLDESCHhpSdevTLDKEERDAWLGFWGFIAKETQR---
951 >SRR3569832_1708069
952 --------------------EEVAGVVLFQRLFEKCPQTKVLFG-FPiDIDpSSKELVTSKRFLMHASYLIQMLDTALNMLGPdqelLTDIMLELGTIQS-AFCVASVCVIC------KELETHLC--f-------------LRLLCQAX----
953 >SRR6478736_5796684
954 ------------------------------FMMGV---IASGMVV-TG----------AERRGRPKAVQPGNREWITVIQAinaEGQA-----------------------------IP-PFIIGAGQYHLANWYRDSNLPGNWAIA---
955 >tr|T0QF73|T0QF73_9STRA Uncharacterized protein OS=Saprolegnia diclina VS20 GN=SDRG_06019 PE=3 SV=1
956 ---ISKDVQALVLANWAAISsgstPAllKIKpaspvvyfyDYFYGMIFEKAPAVKPLFRS--SI------------IVQGKALINIIQSITSavNAPNVIEKVCDLAYRHN-KYGVKIEYFNLLGKCLLLAMHDCTGDTFTDELREAWRAAYAYMVMVMTP---
957 >ERR1719210_139600
958 --------------------------------FTLL-----DPPGQKrnvaqawsAVVqADVAILVVSANPGEFEAGLAK-------------------------GGQTREHAVLAKSAGVENLVVAVNKMDSVDGEGKWSNLryee------I------
959 >SRR5256886_2416282
960 -------DREADADREADADrdGDAEPEPLTAPALSSPPAV-PLAPP--RD------------EAARQHdEPEPAPPPDQVPGAadpretagppeppeeppp-------DGKGEP-AAG-----PDPAIAAGQEALRAFARE--afTSAAEEAWTQVYLAGSSLMIK---
961 >SRR5581483_8202477
962 -----------PDDPVFDGMqgnvGRvaarylphrEGEAYVAGPVGMVRETIRALTRA--GL------------PRERIHYDDALLAEDKQASAqgvagatahtsrtpessrpgRTGEAGNAGPDGH-IrrvaesdqAGPAGGTAEPGQSGLRDAAADIAPQ--------ADTAHQDGGPHDDQagA---
963 >ERR671911_2215695
964 ----------------ELEPacapDKQLVEHVQRlRVEAGAQVVGR------EEERRSRAgqCPRPTSRVDVRGTHDD--------APlecVAEVLVDCGAHAR-VACKVDergraaleLLDRVVPDDLVVDLHAVDEVDGGGQTgHVGPGTSSRRVstarakpQAGTLPQ--
965 >SRR3954453_16132976
966 -------NLQALEESFDAVAphGDELMDEFYGRLFEAAPAVKPLFAH-TDL------------KRQKAMLLAALVLVRKWRPAraLSGHRR--GAHRL-HGCRRGARVDGRVRGRL------GRGAWRGRRRDDRGR--------------
967 >SRR4051794_7197155
968 ------------------------------PHAAAAPVLPARLAG-RPRPAGAGPISPPARRVGRRVRPLDRVPPPARRDVaraARERLRGRGAARA-AGAGGSDLAPPVRHARVGAAVAVRGDLGGAAGIAAESAPSVLPWTTTRSK--
969 >SRR6188474_1917881
970 -----------------------------------------------------------------------------------------------------------LNFVFEkiktKKLIPMTQKQIELVKSTWSTV-----AAMDH---
971 >ERR1711894_485352
972 ----------------ILLYnYrfLTYVIYYYYRFLAEDPTVASVFSRV-NVD----DQQSGEWHAHMLRIMGGVDILINMMDDvnvLTEEVKHLRAQHVVREGVTHERMKAFLIIMMDELPKVMT-HFNH---DAWKSCLSKKLKRIGG---
973 >tr|A0A0S2MLM2|A0A0S2MLM2_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1
974 ----SEGDADIVIKQWASVMnAavsgenrVVIGRQIFNSLFLKQPAAPALFPY---GS----DLDGAEFGAQMSRVLSGLSNAINSLTDddlNVSIMDHLNKQHVVRDGVTAAAMKDMQVSIEDTLKQLVT-DYND---DAWHDCLGVAIERISV---
975 >ERR1712217_222699
976 --------------------IDNIGEVFSQKLFALSPRRHARA----GM--------------EWGPVVKGIGHAVDNLTNLDavaVKYKRLGVLHR-CIGVKEHEMREMGEAFILSLRDVLGKSFGHQAEAGWRAVYCFVAHAMMA---
977 >DEB0MinimDraft_6_1074348.scaffolds.fasta_scaffold06817_4 # 3572 # 3886 # -1 # ID=6817_4;partial=00;start_type=ATG;rbs_motif=TAA;rbs_spacer=12bp;gc_cont=0.311
978 ------LQRVRITRQWRKAYgtgshRLDFGLKVFKHLFEAHPTARALFADHHSD----N-VYSPEFEAFSERILNEFDIVIALLDDpaaLSAQINHLKAKIT-KRHVTTEQLTVFGKNTLEVIPEYVGNHFD---HSAWTDCLKRLRSALTV---
979 >ERR550532_3441629
980 -----YRQVFQLKNSWKTVSrnLDDTAKENLLKFFRDHPEHKALHKKLTKYEDEASLRESQAFEDAALAVFNTFDEAMDMIekDKVdyaITTLHMAGKSHSAIEGFQPAYFKDMEESFLYAVKLTLGDRFTEATEQNFRRLFEFTTQQMIEGM-
981 >sp|P02210|GLB_APLLI Globin OS=Aplysia limacina PE=1 SV=4
982 -MSLSAAEADLAGKSWAPVfaNKDANGDAFLVALFEKFPDSANFFADFKGKS-VADIKASPKLRDVSSRIFTRLNEFVNNAADagkMSAMLSQFAKEHV-GFGVGSAQFENVRSMFPGFVASVAAP--PAGADAAWTKLFGLIIDALKA---
983 >sp|P09965|GLB_DOLAU Globin OS=Dolabella auricularia PE=1 SV=1
984 --ALSAAEAEVVAKSWGPVfaNKDANGDNFLIALFEAYPDSPNFFADFKGKS-IADIRASPKLRNVSSRIVSRLNEFVSSAADagkMAAMLDQFSKEHA-GFGVGSQQFQNVSAMFPGFVASIAAP--PAGADAAWGKLFGLIIDAMKK---
985 >sp|P21660|GLBP3_GLYDI Globin, polymeric component P3 OS=Glycera dibranchiata PE=1 SV=1
986 -MHLTADQVAALKASWPEVSagdgGAQLGLEMFTRYFDENPQMMFVFGY-SG--RTSALKHNSKLQNHGKIIVHQIGQAVSELDDgskFEATLHKLGQEHKGFGDIKGEYFPALGDALLEAMNSKVHG----LDRTLWAAGYRVISDALIAG--
987 >SRR5690625_2040278
988 --------------------RDGFGARFTEELLSRYTEIREALPD--EPA------------WVARAVTAVTDALIDVADDpgaLVTVLERLGVDNR-TVGVHSAHYAPIGHALILAARAVGGTAWTPDIERAWVDGFDVAAEVMVT---
989 >ERR1711963_100213
990 -TSLSEGTVEVLKACHPLLKdvRRVIGKAFYNRLFKEYPQVKPLFSQ--SD---------AARTHQTLALADALIAFTGRQLLegF-EAKQRGQ-ERS-LRLRSLQAGSWQGLWRLPSRDRGERD---QNEGSQIKPQILTIQ---QD---
991 >tr|A0A0G4EPR9|A0A0G4EPR9_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_12573 PE=3 SV=1
992 ---MSDKERgVLIDKTWGLLkeryTLQEIGEELYDNVFKNAPDLRHLFKR-PKELMA---------LKFGEMISTIC-GLFQtDRESLLETMRDLGIRHV-DYGSRPEYFPLFKACLLDTLENLLEDGeFTAATEASWNDMWDEASEMLIS---
993 >tr|A0A0Q5LAI2|A0A0Q5LAI2_9MICO Uncharacterized protein OS=Frigoribacterium sp. Leaf164 OX=1736282 GN=ASF82_14980 PE=4 SV=1
994 --VITSSHLTALRSTLPLVeaRAAAIADDFYARLFADRPDLLrDQFNR-GD----------QAQGRQQRELALTIVTVARDVVgtqvgsgpagsatgpavpvapwsspapspwavrvAARETLSRLAQRHA-AIGVTRDEHDVFERHLRDAFAAALGDDWSGVVVDAWLALWRQTRDELVA---
995 >tr|A0A1Y1I4E0|A0A1Y1I4E0_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_002310190 PE=3 SV=1
996 -VQLSPFEQQLVQKTWKLLQprLADLGQAVFTHLFQKAPKTRPLYTCPLRLADGDrRTPDGHAIPTHAVEIVSTIGLAACRIGSssrILAVLERLGQRHV-AYGAAPDMFSVFKEAFLVALKKTLGGeHFTAQVHKAWSKALDSVVAHLKKG--
997 >ERR1719296_130621
998 ----SVQTNSDVQKSWEKIQeigILRAGEILYKNIFELAPSARETIPPevlekyrissFLvslNEDeLDDAFIENAIWSDRAANIFNVVGHVVRGQHDfgrLVPMLQELGSRHV-GDGMPEAILKVVVPAFKFALHELLGSMLTEDLEHVWMVGLELVNSHMIQGMR
999 >ERR1740115_393061
1000 -NLLTPETVRVVKETSPRIAsmAPALSSSFFKRFLS-HPDLAAYKASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAWDEAIMVLADICVD---
1001 >ERR1740130_2673129
1002 ------------------------------------------KASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAENHRLTINLFL-LE---
1003 >tr|A0A0K2UHU6|A0A0K2UHU6_LEPSM Uncharacterized protein OS=Lepeophtheirus salmonis PE=3 SV=1
1004 --YLSKKQKDLLKRAWVALhnNLSSVGMTTFIKMFETHPEALKFMiPKLTqeeekktqpnySLDSRLDPWHSEKLREHAHRIMKTVSDVISLLNKdeekIEEMLVALGGKHH-GFGVHIEILELMGPHFISAIYPTLKETWTEELQEAWQCLFNYIIALLHIGF-
1005 >tr|A0A0B6ZHC3|A0A0B6ZHC3_9EUPU Uncharacterized protein (Fragment) OS=Arion vulgaris OX=1028688 GN=ORF61548 PE=3 SV=1
1006 -TGLSARDRKLIKDTADIIfgqlKLQNKGVVFLIAFFKAYPHHQRYFKMFRGIP-PDELKSIPHTENHGRRVMSNVALLVQHIEEpnvIKEQLVDLLIKHN-PRSVKPRQMKDMLNMFVDFTSQQLGAKFTSQHETAWRKLTTHILSVLEE---
1007 >tr|A0A2H2IJL2|A0A2H2IJL2_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1
1008 -------------------------------------------------------MNAVELRRHASVYLKGLGKIIESMRNeeeLGKSMSRIAQAHI-KWNVQRNHVIVSMGKTEIRQRATNSYALKS----------------------
1009 >ERR1719270_1027131
1010 -MSLSTETCNILKICKPLLenNRENIGLTFYKKLFDENPGLKNVFN----MGHQR--GVdd-DKPGRQQFALGQALVAYCLHCESldkLASFVERVANKHV-SFDVQPEQYPVVGGILLATLEEVLGKEtFNEDVKKAVADAYFFLADVFIS---
1011 >ERR1719318_1430785
1012 ----------------------------------------------------M--N-----NAQGNSLANAVVAYCANCDQleaLGPTVAKYTVPTC-KYIFHIS-------S-------TRPLKmFLPI---SX----------------
1013 >ERR1712088_143820
1014 -------------------------------------------------------N-----NAQGNSLANAVVAYCANCDQlelLGPTVAKISSRHV-SLEVTPEQYNVVGGAARQRSlqrssQRCRGRGlLFPG---RHLQGERGKNDRRSQ---
1015 >tr|F6WSS9|F6WSS9_CIOIN uncharacterized protein LOC100181975 OS=Ciona intestinalis OX=7719 GN=LOC100181975 PE=3 SV=2
1016 -MPLTEIEIEGVQESWEKVSsggPKTTGLILMEKLFNTYPASIAVFSHLGIPSKPdgaitvSDLASIGGVSNHAVSLASRIGKLVGLLNNeteLKESSTEVGRIHV-KYGVTSEHVDLLGSVLLSVISENQGLSNTSELIGWWSKTWNIIGNYVK----
1017 >SRR6185503_2239525
1018 ---MDSGHKALIRASFGRALtVADLAVELFsGRLYLLDPALWTLLDLGS--------------RRRQQELVQVLAWAIEHLDRfelLASTLEALARRCV-GNGVREAHFERIAGVLLWTLHQVLGDTYTAGTAAAWRSTSGLIVERMKQ---
1019 >ERR1740129_283753
1020 --PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--TPPEQGCLPKLYEWMSRTMTS---
1021 >ERR1740123_30535
1022 --PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--GGLLARSGAeSVFPPGARA-GD---
1023 >ERR1719193_1971274
1024 --VLTADDIKAIKAIWFPImkNPADLGVALFEKFFLLYPQQKDKFKFMKYD-----DLREKGMRAHGEKVVKKLDEAVLLTlYrsRIKHCFQRIGFSHL-QMGIKEEDMQQLGEAIIATVEDAFVDKLTPEEIGSFKKFIKLFTAEF-----
1025 >ERR1719193_859649
1026 ------------------------------------------WRMLKKR-----H------NRDGGKLLH-PLKTILQTcYksRIKNCFQRIGYIHF-RMGVQEEDMEQLGEAIIKTVEAAWGDEFTPEEYAAFRKFMKKFTAAF-----
1027 >tr|I2G907|I2G907_9HEMI Hemoglobin A OS=Anisops deanei GN=HbA PE=2 SV=1
1028 -FSLTDREVEVINQSWNQIKAqeLVVGLQMFKTLFQRYPQYERLFTHLH--QSGKSLYEGDRFQRHVVgNIMSSINKVIETLNssdNAVKTLQDMGVKHK-KLDVHRKHFESFVPFVVDAMVSVRMSMSQDEVASAWTKMMEGVASNLSKG--
1029 >ERR1712157_679996
1030 MKPLSFTTMDCVLSSWEQVRripnyRETVGLAILQKLIHRMPEGREVLHMQRNLIknSPPGIESDKLLLAHARAIVNGLDTVVEllgpLIDDISEILREIGKSQYHDYGDSMALWNpLMRECVLEVIQETLKDDYTHELKVAWTDFLGEVAKDIHSG--
1031 >SRR5438477_4839339
1032 ------------------------------------HGIEP-IPH--RY------------AAIRRVVSGRE-----------AQARRVGQRHH-AAREDQRR-------LRGL----ERRRG-RPPARHVRL---------AA---
1033 >SRR5262245_20667862
1034 -----------------GRAdpLTLLCEREIARFRG----------------------------------------------------------------I---ELDGIGRA----TALF------DGPARAVRFARAMIARGRAL---
1035 >UPI0003969FE8 status=active
1036 ------RPFEAA---------------DRELLFGRAQDIRAVVEQ--LR------------TDPLVLVTGDSGVGKSSLCRagvLPQIREGALNDVR-RWSVAV---LSPGRWLLDTLGDA----LA-----------------------
1037 >OM-RGC.v1.018126893 TARA_122_DCM_0.45-0.8_C18859060_1_gene481717 COG0677 K02474
1038 ------SELW-------RGRprKTSLPAgssiRTRTAvlvplgrgketapssssanfvlnLTDVPPEAQELRiTA--EV------------DDQRIHFQRRVPADVD----kvVMELPEGSLARKV-R--VEVAAFD---------------------------RR-CS-IAAFRA---
1039 >SRR3954454_16888348
1040 -VISRSAVIRHVLPTP----aepaaVDHIGQQVADRTSQQDRGERVLLNRT--------------aHGLR--ALADGAARLRIAAQSvadvtRTPLVGVLRQLRS-ALGDVSHRLCGLSDHAEAllgAIKDVLGDAATDEILAAWGEAYWLLADVliar------
1041 >SRR3954471_17335278
1042 -VISRSAVIRHVLPTP----aepaaVDQIGQQVADRASDKDGGERVLLNRT--------------aHGLR--ALADGAARLRIAIQSiadvmRTPRVGVLGQLGG-ALGDVPHCLSGLSDDALGccaTCGCYLCR--------SRGGASWSFFCHaalr------
1043 >SRR5215204_1408335
1044 -ATGGPTRWATMRGRWPLMS-------MLESIAQSG-SGRPVWYVH-GAR---------DrrahaMGDHARALAADEHAGK---------HRAVRQRT-------------------------------AG---------------------
1045 >tr|A0A167F9Q7|A0A167F9Q7_9ASCO Uncharacterized protein OS=Sugiyamaella lignohabitans OX=796027 GN=AWJ20_2623 PE=3 SV=1
1046 -VVFTPGEISLLRNIWKEISEnnLDhgrglkssqastfFCQQFYENLLGDHPSLQTLFPSL---------------QSQSAAMAWVLGQIIAQLEDVsqaQSVLIKLAKWHSRLMNLEPVHYEYVGSSLLRTLGDRRGDKFTAQEENAWIKLYTFIANVMLK---
1047 >SRR5262249_41403170
1048 ---------QVLKESWARVEgqQEALAAHFYARLFLARPDLRELFPI--------------QMRPQGRRLLVGRARATEPGGAPDgASSRERGRPRR-RYEVSAEHHAVFRECLVAAVRACSGRDWDAEREQAWREGYDVLARRMVA---
1049 >tr|A0A1Z5JNP0|A0A1Z5JNP0_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_8Lh328 PE=3 SV=1
1050 ---LSSTSLLKVIACWEQSKsrggfDETIGIELMLTLFEMNPQARSQFG-FRTDQ---VIDKNnglqrMGILIHGQRFIRTLDCLFSLLgpddDNLEEVLRDFNKESC-QDGMPLPQFLLLLGILVKVMAHTLGGDWTDEVQFCWMEVITHLEVIVT----
1051 >tr|A0A150GQ95|A0A150GQ95_GONPE Uncharacterized protein OS=Gonium pectorale GN=GPECTOR_12g483 PE=3 SV=1
1052 --GMSLEEMEQLQGSWAFLSkgafpgevkeqLESFSVDFFMALFEQSPGLINLFP-FKDVNG---KPIIEQLKVHGLKVFQTIGAVIDMCNNysvLLRVTTDLVARHI-KYGVLAAHYDVLFQVLVGILTNVLGSQFSGTLAAGWVKLAGFILRVVKDVY-
1053 >SRR5215203_5896321
1054 ----LVRERRLVREAVAMVdDQDRLIRDFYMIVFAMGGAeVIGMFPT--DMR------------RQRHEFGRALVQWVsaDDPDSIAAHLDQLGGDHR-KFDVQPAHYAVTGEALVAAVRGRCGGRFTAAHEEALRGSYGRLATIMIDG--
1055 >SRR5580698_8666230
1056 ----PDLEKMAARSPWLTVtA-------------------------------------------------------------------SLSAEPV-SLGHGPRTEHgtvADVLARLGTWREHD--------------AYVCGSSAMVAA--
1057 >SRR5919204_299658
1058 --------------------------------------------------------------------------SDlrSGPTSRCTHVRC-----R-QQRSPPRHHRClRPRSPAPSWSARlsagfrssscrpstnRPARRRGRGRSTILASYTRLASVMLDG--
1059 >SRR5688500_16794215
1060 ------YDARVLRGSFAQLRprIAQYSPVFYEHFWRDYPETRPLFG--RNMSKPE-------LDTRINHFM---LWVTENADRphfTIDYIQSVARRHV-GYRIRRRHFAYVDNTNIKTLRELLGDSFTPEVERHWRASFRFLTLLM-----
1061 >SRR5947199_2475351
1062 ---------------------DELARAVR---lQ--gSRRIMEEHAC-GAE------------GRQLARLFDERGRLARAPRAVDEPGLELGARvsdgrcglakigdvverivqaedvdavRR-AGGDELADEVIVS-------------rtRADDEtseqrepayrigprtqCSDAFRRGLERPAGAPVQT--
1063 >SRR5919197_1330773
1064 ---------------------RATAGGLYGVLprlR--rgrrRVSVRCNHAG-TDL------------KKQKTMLLGTLVLLRKPLrdlDAIVPKLRELGARHV-ADGDEGGDELLEEQEGKGYGED-EGEgdeafdapLIDEX---------------------
1065 >SRR6266516_4891354
1066 -------------------------------------------------------------------------------GLGDGGRAEGGNRDS-GRGEQLEHLGCVHDVLLSFSESTVSTlphqaarpapaaegagpAITRRetadrapprrhrvggfLRSAGAARARSSIDRMTET--
1067 >SRR6266508_4596506
1068 -------------SAFVRL-tdARRVARCLPSAH---pGDETPSTFPS--ET------------GDPVNLN-----------LEALETSFDLVAPRG-DG-SEATEDDVVGHPGPPA--QVA-PRPRGDRPQAA----------------
1069 >SRR6185295_10958302
1070 --------CILLLVA-----CFLTFKLFFYSMFQDYPEYKNLWPKFRHLN-DEALINTGELSNFCSVYMDGWEKVIGELDDnaaLARELKIIAKTHL-RKGVERshimvakkealcqiriheyCYLQNMMPKMLSLLKEKNGT-LDAEVEEAWKTVFIINADIIE----
1071 >AntAceMinimDraft_18_1070375.scaffolds.fasta_scaffold521461_1 # 3 # 443 # -1 # ID=521461_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.569
1072 --------DD-------------DDDDDDdDRMFHDHPEARALFSRVHGDN-----TYSPDFEAHAQRVLGGLDSCISLMDDpdtLASELGHLKAQHA-DHTdVTAEHFDVSICFSsTDVTSTYTsthckimdrpnYTVFQT--RGQrnltksaSRRAHspvRDHPRGS-----
1073 >SRR5476649_891947
1074 -------------------------------------------------------------ATSTRCCS--ATSRKCCRCSikpTRPTASSsarwptpcWLTQEI-SIawNnWARWHRPSStSMCRCKSsgNTIPWSApRCSRRYVKCWAPRWRPmpsstpgpprtvsWRTCWPV---
1075 >tr|A0A2D8PEV6|A0A2D8PEV6_9RHOB Uncharacterized protein OS=Maritimibacter sp. OX=2003363 GN=CMH11_20945 PE=3 SV=1
1076 ---MTSQNAGLIRASLTELFprREEFAERFYERFFEQAPQVRRMFVH--DSE------------KQKLMLYAAIAMTMRGLEServLHSELMAFGSRHA-RLGVREEHFPIFGSAFLETLIHFLPQWDHPDLARAWWGAFTDMSTPIIA---
1077 >SRR5690242_2028058
1078 -------ELALLLQSYGRIGilIPKISENFYRRLFQLRPNLAALFAN--R--------------DADLKVEEMLRRIVAHASDAaaaKAEVQSSGRSHA-QWPLLPEDYRVAGECLIQAIIEAEGAATGSVVASIWRQAYVEVANLMIC---
1079 >OM-RGC.v1.029911412 TARA_036_DCM_0.22-1.6_scaffold294997_1_gene285712 COG0526 K03671
1080 ----------------DRLRarGEPPSGNPYRGAAPYGPGDEALFF--GRR------------AE--------LEVLIDRVQkTpfvLVAGDAGVGKTS------------LCSAGLLPLVREgalGGPRHWACESIACGEEPLAALAAVLARH--
1081 >ERR1719414_683447
1082 MEDLRFETIRCVVQNWERLKynplFEEFAIAFYQRVLRVCPQAKSFFGSSFCLD------DQA---TMTQEFVRLIDRVLDLLGPesqlMVEVLRDLGSRHE-AYGVTVEMYDIMRDAFLLTLEQFEGEKmFTTKVRQAWMTVCSAVADVMMEA--
1083 >ERR1700744_5993147
1084 ---VGLDDRDALGVLRDAFSqdesgsGNELVRRFYNHWVELDVSVRDLFPP--GME------------DQRAAFAQALNWLYservaQRAEEPVAFLAQLGRDHR-KYGVLPSHYETLQRALYATLRSYLSdpsrSAWSDAVDEAAGQSLNLFTGVMSG---
1085 >tr|A0A1E3QTC6|A0A1E3QTC6_9ASCO Uncharacterized protein OS=Babjeviella inositovora NRRL Y-12698 OX=984486 GN=BABINDRAFT_161163 PE=3 SV=1
1086 --NFTPAEIATLKATWSMEAKDTnsgdiadpkntlFGTTsfwehVYSLVGEEHPEVVHLLPP---------------ITHQTQAFSGMVYLCISNLDNlsrLDEYLASLGRRHSRVFNALRLHFEAMGSGVLKSLYNHYGEAFTADISDVWARFYCFLANSLLQ---
1087 >tr|A0A0A9XWX4|A0A0A9XWX4_LYGHE Globin OS=Lygus hesperus OX=30085 GN=GLB_0 PE=3 SV=1
1088 ---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNLHSnnrrkNKELFEKLATIHA-KRKVSAQQTPYIKHTLMDILH--L--EPHSAMEKAWINVIDTLF--------
1089 >SRR5687767_4837246
1090 -----EKQVLLVKHSWSYQAgqLENLGTLFTKKLVALNPGLKAPMKR--SL------------AETGSySLMVAMNQIVAALPDLhkaQNHIQVIVTEYA-ALGITRSDYENALIAFLLALEKRLGKSWSDEIREAWIFIFSSLYH-------
1091 >tr|A0A0S8AZS8|A0A0S8AZS8_9PROT Uncharacterized protein OS=Betaproteobacteria bacterium SG8_39 GN=AMJ64_12515 PE=3 SV=1
1092 --------TGLITESWNALGagQRAFVEAFYQRFFERYPDYRPLFPL--ELN-----------PRHLEKMVQTIALMADQSQDrgrIAPHMHTLGQAHK-AYDLSARDFDNFKRTFVEVLGERLGRQWSAEAEKAWNDAFDAVLVP------
1093 >tr|Q9NG75|Q9NG75_9CRUS Hemoglobin P polymer OS=Parartemia zietziana PE=2 SV=1
1094 -TGITDAEKQLVQESWELLKPDlmGLGQKVFGRIFTKNPEYQTLFTRvgFGDTP-LTQLMANPAYGAHLIKVMRSFDFVIQNLGKpktLLAYLKNVGADHI-ARNVERRHLQAFSESLIPVMQNELKAKLKPEAVAAWRKGLDRIIGVIDQ---
1095 >SRR5579875_723516
1096 ------------RESFARIAprKEEFVASFYQTLLEKYPHLQRMGAGV-------------DVKRQRKSLLATLQVMLNETDRgeeLRTQFRKPGQRHN-ALQIRAEHYPAFGQTLFETLALY-DPQWTGELRVAWAAALEQCVRFMMEDLN
1097 >SRR5579871_3449338
1098 -VPLSALHRYLVRRTFTHLaiHADEVTALFSQRLVELNPALMIIIV---DEA-----------GTQRYRPLEILARVIALMDRpaaLSIQLKLLQAQQQ-R-SVTPDHLRQMGEALLWVIENRLGDSFTPDISAAWLHFYRFLGE-------
1099 >SRR5215472_5690244
1100 -----HFDVQVIGAALTRLAdpAVDAAEYFCSHLYSISPDAAALFPS--EL------------AAQRELFADAVIRVQHSLESgsgLAEQLATIGRQSR-KFGVTERHYAAFMLAMEKTARHFDTGG-------------------------
1101 >tr|F2UQX2|F2UQX2_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_10302 PE=3 SV=1
1102 ----DDSAMKITQESWAMVEREipNWTDIFYDKMF-SDPNIAKLFP-FS----AGDFKTNEKFQTHTQKVRDTMHTAMTSIrefEKLGPVLKKMGERHA-DYGVIPEHSVNFKEAFLHTLKTGYGDKWNEDLDDAWNQCVDALLE-------
1103 >SRR5699024_1886671
1104 -KTLDPQTIETVKKTAPIIKdnVEEIGKTFYNILFSRHPELYNIFNQ-SNQ----------------ERGlqqealaygVYLAGINIVNFEPIQSLVTRVAKNNR-ALKVRPNNTLLLERR-------------------------------------
1105 >SRR5271157_2714777
1106 MPSRIVDRLTALRAFFAEMEpqLPVIVARSYERLFDVEPAIALLFK--GNA------------REHQLRFLAKLQSIVKLTRSsqlwpasaatgqiLIPEVLDFGRSHA-KIGVLPVHFSLLNDMIAWTCKEIAPLRFTPLVEEGLAFVFDVLGASLTAK--
1107 >tr|R7TLW3|R7TLW3_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_227018 PE=3 SV=1
1108 ----------CAEITWAILseNRDGLGTEVFVRMFESYPDLKSAFGPLRHMNKKDAGY-EDVLRAHGIRVLSIVEQVLSKRHnmeEVLSILHDLGRKHL-TFSAKVEYIDIVSQMFLFAIESALKEKWNNSTEKSWGEIIRFVTYVMKET--
1109 >SRR3990170_2029843
1110 ----------------------------SPCTTTRSPCWTRPCAS--W------------AT-----------APTGSWAtstpPsssRLPSCAR--CSRR-RWTCSATG----CSRRSPAPRHYAEDVWVPELEDAWLRAYAAMSTTMIEG--
1111 >tr|O97381|O97381_ARTSA Hemoglobin C1 polymer OS=Artemia salina OX=85549 PE=2 SV=1
1112 -TGLSGLEKNAILNTWGKVrgNLQEVGKATFGKLFAAHPEYQQMFRFFQGVQL-AELVDSPKFAAHTQRVVSALDQTLLALNRpsdFVYMIKELGLDHI-NRGTDRSHFENYQVVFVEYLKETLGDSVDEFTVKSFNHVFEVIINFLNEGL-
1113 >ERR1719468_1094774
1114 -PPLTSNDRKLIVRSWTIVDqqISQVGLSSFLELFRRAPETLSVFPFLKQLG-PEDMEFYHQLKNHSIRITGVISMLVKQLESeerpadeaIRDLLLDLGRRHF-SYGAKTSHMELLGRVFAESLQPIFEGdPEAKAIQEAWLVFFSVIVFWLQKGFR
1115 >SRR5262245_31323877
1116 ----STDGAGLVMASLARVSdrSDQMIASVYEHLFAHRPELRLLFPS--DL------------KHQRAKLAGALRFVIENLRNpehVVTALEELGQRHI-AYGAKVSDLSSLGEALMSALEAHDPNPWDDLTRKAWHSAYDSIARAMSRGM-
1117 >ERR1041384_2362020
1118 --------------------ANVLGERKvVAVLYSDLRGFGTL-----SE------------TGHAVDVLERLNDYFD----rMVAAITSHGG--------------------------------------------------------
1119 >tr|B6BNK3|B6BNK3_SULGG Putative globin OS=Sulfurimonas gotlandica (strain DSM 19862 / JCM 16533 / GD1) GN=SMGD1_2554 PE=4 SV=1
1120 MQELSQKHIDIIKESAELItaNDLKITNKMYEILFYKYPHLEMLFEN--------------APDNQFMKLAEALSLYAVNIDKiekLIPALELIAIKHV-EVNIRPGHYSMVGMALIEAIEEVLGKMAPIGFIDAWREVYKYVSDILIE---
1121 >SRR6185437_15632065
1122 -----ADDVAIVRDSYGRIGprGAALTIAFFGLLSDRVPRVRKFFPP--DD------------KDKRAVAKDLFDLVVGHLESqlnVRWVLERMGRRGL-LDTITPSDVSAVGGCLLDALAELDE-AWSPATERAWSRVYDWAASAVV----
1123 >tr|A0A0K8S6V4|A0A0K8S6V4_LYGHE Uncharacterized protein OS=Lygus hesperus PE=3 SV=1
1124 ---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNYTQttaekTKSCLRNWQRFTP-NGKFPPSKHLTSS-IHLWTFFTWNHIQPWRKHG-------------------
1125 >tr|A0A0S8CN91|A0A0S8CN91_9BACT Uncharacterized protein OS=Nitrospira bacterium SG8_3 GN=AMK69_14025 PE=3 SV=1
1126 --GLPPSDISRIQRSFRMVAsqGEKMASRFYDLLLERSPELQKFFHP-GNLS------------QQHAKFFNGLHSLILHLEHpqaLRAALVQLGEQHQ-GDGIEIQHYPPVVDTLLQVLTEFSGEGMDGETYDAWAHFLHLVRAIMLENH-
1127 >tr|A0A0Q9HRJ4|A0A0Q9HRJ4_9BRAD Uncharacterized protein OS=Bosea sp. Root381 GN=ASE63_23130 PE=4 SV=1
1128 ----GDRAISLALASLETMGSeaEQADIMFNIRLLETYPDVYRVFC--MDFA------------PEERSFLRALAFILAHAGPfgaIGPTVRALAPSDK-VCRLISSRYHELEETLMWTLRRRLGVAFTAEVENAWRSVLREAPGVS-----
1129 >SRR4051812_34838903
1130 -------------------KPirNRAIKLFFSRLIESHPSLLTVIG--DDYE------------AKARSLRPAVEMIIGCLGNmeaLRPILRSMARSNA-ELGMQEHHYLTAVNTILWTMERCLGSAYSAEVDAAWEDVCWQVCEAM-----
1131 >tr|F2UFM9|F2UFM9_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_06664 PE=4 SV=1
1132 -MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETeRLLYSS-DKS--------KSWNERHMARVGKSVGDVIKSLSNyddVIEHLTTGEPHEQ-ACCL--------TDG--YVIGTGLGNT----PRSLWLACGS-------T---
1133 >tr|K0T9D6|K0T9D6_THAOC Uncharacterized protein OS=Thalassiosira oceanica GN=THAOC_11871 PE=4 SV=1
1134 ----------------MEREdssGSL--PSFVSETEIEPSDVQPaaasgenNVDKGRR------------KTSSSSKRTPSITKRIESFSSfksLSSSFS------------------SKLDDERNAGEAGQAERVEsttapESVASGETQGNAGGQHTLN----
1135 >tr|A0A165S3D1|A0A165S3D1_9GAMM Chemotaxis protein OS=Halioglobus sp. HI00S01 GN=A3709_07715 PE=4 SV=1
1136 -----MTAIMMIDRDFTVTYanEAT-----LQLLRDNQATLSSIYPGF---N----------PDKLI--------------------------------GSCIDGFHKNPEHQRNILADPANLPWRTDIEVADLKFS-LNVTAIVDAQ-
1137 >tr|A0A1I2IR29|A0A1I2IR29_9GAMM Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor (Fragment) OS=Fontimonas thermophila GN=SAMN04488120_104136
1138 -----KGVIQYINRDFIEVS------------------------GF---S----------ESELI----GSPQNIVRHPDmPveaFADFWAT----------------------------LKDGKPWTGLVKNRCKNGDHywvLANATPLRAN-
1139 >CZCB01.1.fsa_nt_gi|955242656|emb|CZCB01016507.1|_3 # 1728 # 2327 # 1 # ID=16507_3;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.493
1140 -----GVSSFEMNQQFSAQSsdSIEKNIAAISELWQKYMATnitdeekvladkfvatrgafvkealLPAVDAL---R----------ANdYEKAKLFSTKARDLYNVAHpalVELIQYQAGHAKL-EYDTSVESYKLTRNWTIASLFLAVGFLACFAYFImrSIANPLSvifRVLDNIKSN--
1141 >SRR5918993_5799879
1142 --AMTPEQINLVQRSLPAILaIRDRATARAgERLAVLDRAPGRLFAG-ADI------------GRQGAVLINAVTAAMQALRsgDYGSVLAALSQYHL-SYGIGPQHFRSAGAALARALEQELGSSFTADLGHAWAAACEWVGRII-----
1143 >SRR3954452_18192940
1144 --XMEPQQIKALKQSLATVLsAQEALAVRFhQHMRRFEQCPRPLFTG-APL------------ARQGVLLTNAIAICA-SLPskNlsQAVAAGALSQYHA-SYGIASHHFHSAADALALALKDELGHIVSDVAIDAWAEACRMLGQAL-----
1145 >SRR6516162_8663010
1146 ---MKAETISTIKATAPVL--KEHGQAITQRMyeiaFDARPDARQLFATT-WM------VSSEEGRKQAGRLAGAVYAYAEHIDDlekLAGGSGAYRaaaRRHE-GPaGNLSGHWSVShgryqgcaKRCCHAGNPRRLARGIX-----------------------
1147 >SRR5690348_5860809
1148 --QLPDGSVRLVKKSFAALEpvSADVMQYFYAWLFVQHPELRAMFPL--AM------------TTHRQRVFDALARVVRSTGSpaeFADQISHLARDHR-KFGVRAAHFKPFFAALLAAIREHSTGTWTSATQQAWEEALDCISAGLQT---
1149 >SRR5258705_5637504
1150 ----------LFSQLYQCSKntGRRSRGFSIDTCSKKHPELASMFNA-RDQSD----------GSQARRLAAGVLAYASNIDRlhmLESAITSIGRKHV-SINVRPEQYPIVGKHPLGAIKTVLGDARHPKFWMHGQRPTPNWQRSX-----
1151 >SRR3984885_15745818
1152 ---------------------SRAtgGGWLPTRSPTGRSARTSR------T------------GCRRGRCDGNTRPTV--ggPAALGGGQCEDSARDG-KLGLSADHADSAGAGRVdlAAVRHPGGAGV------------------------
1153 >tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1
1154 -----PANKNLIRSTWNMMVGdRGNGVELMGLLFQRAPDSKIDFKRLGDVS-AENIPYNRKLNGHGITLWYALMNFVDQLDSkkdLEDVCRKFAVNHV-IRGVLDVKFGWIKEPMAELLRRKCGNDCDDA-IQAWWKLIDVICAVLKES--
1155 >HubBroStandDraft_6_1064221.scaffolds.fasta_scaffold2618798_1 # 2 # 181 # -1 # ID=2618798_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.622
1156 ---CSAEDRSIIQEQWKILFkdvdsskiKIAVGRKLVLNLIQRQPDAKVLFDKF-NVD----EPNSPQFSAYALRLFNRIDLIINLLKDpeaLDAALEFNAERYGNIPNIKKAYFQTAAQILAYALPKVLD-DFNA---LSWQSCTRYILTTVASKVS
1157 >SRR4051794_1382573
1158 --ALDPALLNLVERSRPRVEhkITELADQLYTALLAQVPGLRTLFPL--DP------------NGRRAPLTDPLIWLLQRLDDrdeLVRRLADLGRDHR-KHRITAAHYETAGHALLDALAHIHGPTWTPPLAAAWTRAYTAATHDML----
1159 >SRR3954470_25015505
1160 --EISEEQARMVKNGWQAAvdAPGDFGSDFYRDLFTVAPGVIGLFS--GDMT------------EQQGRLTHTLAETVELVDQpttLLLLLRASGVRHH-HYEVKHAYFSVMRDTLLNTMERRAGAVFDAAHRQAWEAMFDNMATIMQDG--
1161 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514
1162 --LISSKNLGLIRDTWAMARrDSDIAPKIFLRMFAQHPETQLMFPRFANVP-QSQLMTNKDFLQQAYTCLAGLNFMVKNMDDEDlviKLLSRMASPAFYvDFPTPGQQLDETTRLFLDVMQEELGNSFTADARNAWTTVMNQIHNVLVQQ--
1163 >GraSoiStandDraft_30_1057271.scaffolds.fasta_scaffold222668_2 # 490 # 1347 # 1 # ID=222668_2;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.654
1164 --LLSIKDKALVRESWTLAKsNNEIAPAVLLKMFAENPDAINLFPKISKAK-IGDLKGNKDLYNYAYSSFAGLNMIIKSIDEVKtiaTLFKNSDNPSIFlDSRSASLD--------------------------------------------
1165 >tr|W4FW63|W4FW63_9STRA Uncharacterized protein OS=Aphanomyces astaci OX=112090 GN=H257_12922 PE=4 SV=1
1166 --VLTPRHVELIKANWSAVCagtsafdVEQHgspdkffHRTFYATLFKADPSLRGIFRS--SL------------TLQGKSLASIIKVMTGvvSASNLVERMQALASGHL-KFGVKRQDYATLGVTLIQTLEIISGSSWSRHVKEAYLTAYCLLFYLV-----
1167 >tr|A0A024UCA0|A0A024UCA0_9STRA Uncharacterized protein OS=Aphanomyces invadans OX=157072 GN=H310_04772 PE=4 SV=1
1168 --VLTPRHVALIKQNWSAICrgtnafdSTKHgspdkffHRTFYSLLFAVMPSLRCIFRS--SL------------TLQGKSLASIIKVMTGvmSTSNIVERMQTLAEGHL-KFGVRKDDYTTMGVTLIRTLEVISGSIWTKEVKEAYLTAYCFLYYLL-----
1169 >tr|R0JHX0|R0JHX0_ANAPL Hemoglobin subunit alpha-A OS=Anas platyrhynchos GN=Anapl_10052 PE=3 SV=1
1170 -------------------------------MFIAYPQTKTYFPHF-DLS-----HGSAQIKAHGKKVAAALVEAVNHIDDIAGALSKLSRRRKKERfQtkPAPKNLPLAAHrCHQLNIASKGTEHygTNPQLAWLSTGHLVSGRELISSKSS
1171 >SRR5690625_6805322
1172 --------------RSPSHsqtltLSPYTTLFRSRNLLRNHPELKNYFNT-ANQV----------NGFQPRALASIILQFAKNINHi-yeiVPKLERVCQKHC-SLGVQPRSEEHTSELQ------SRGHTVCRLL--------------------
1173 >tr|F2UFM8|F2UFM8_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) OX=946362 GN=PTSG_06664 PE=3 SV=1
1174 -MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETERLLYS-SDKSK-------SWNERHMARVGKSVGDVIKSLsnyDDVIEHLTALGTRHA-RYGLHVDQLDLFINAFLWTLGAGLGDSWDHSVKKAWMHVLPFILSPLKS---
1175 >SRR6267143_1520378
1176 ---VTLEQIQMVQASFAKIAPivGPATDRKLRRCSALVAGFrkeTRLST--GVS------------KNPGRSEVRGTLCGASCCGSlss------------------------------NWVANIRRGI----------SP-LALAIASI-----
1177 >tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii OX=37682 GN=F775_23753 PE=3 SV=1
1178 -STFSEEQEALVLSAWDAMkgDSAAIALKFFLRGRNN-------FVQLAHVE--SPKRRIPVVEERKTDL-----------------IFEIRTKTW-KIGQKSTAYRSW--LLLR--QKSLPa----HAPKGHLSElvpldTIDHTHQET-----
1179 >ERR1700722_6370008
1180 ----------------RGIRPhcPavrqhLPCVLPPH--VRAGSVASHAIPQ--LS------------APLTATLTAALEALVGALGDLQPVLVrapALGLRLA-SYGLQPTDISIAASAFLATLDDELDEVSTNAARAAWGCVFWTVA--------
1181 >tr|A0A0M1J4K8|A0A0M1J4K8_9GAMM Uncharacterized protein OS=Achromatium sp. WMS3 OX=1604836 GN=TI05_18490 PE=4 SV=1
1182 SKDIKPTNIYLYQASLNRAiNTSKFCDRLYFNFMNGNIEIANIFKG-RSK------------ERIQHKLQTTLDLVADNANQvpgNNIYLEMLGRIHT-KRHITPEHFKRWKFAVINTIAECDP-NFDTEICAAWEEVLTALIDKLI----
1183 >SRR5260221_159328
1184 ------QALGLVREGFAAVIarPDVFVSELYQDFFTSNPRYRKYFGS-ADIGySGsADIngTGSPEighaaadITRRNAKTVEAATRIVADLDRpgvLLPYLRKLALEYR-KYGVREAHYRAFAGSVMTALERTIGQAWTYEAAEAWVDELTMVASAMLG---
1185 >ERR1719266_796048
1186 -VSGLGTLSIISQASWKAISGeiHSSGVAVFVEIFKAQKEVQQIFQKLNPNPNSSGIkytkdqALKESLHEHGVKVLSGVDEVLSNLDQpslCLSLIRKTGAFHRKLQGFKPKYFKCFEEPFLAMVQSSMGQRFTPQMEIVYQSVASFFVQTLIEGYN
1187 >ERR1719402_1083666
1188 -TDLSTNQKNMIRDAYAVFekNGEKNGADAFIYLITQHPDLKKVFP-WGDVS-NEELRENQVFKDHVYVVYKGLKVAIDRIDNLKAtasYYVHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQTSFNNLLQFLVGNMKV---
1189 >ERR1719295_364028
1190 --DLTPEEKRCIQRTIPVIlqEAEMIGTKTYLKTFHNYPLSMIYFEPLRDKLVTEVKQTDDYLKKHGVLFVKFIGELVAEMDDpdsVDLKLKSLGRFHD-DLGVLKQYLEAIGPLFVQAIRPVLMtqasipsatncgvgvsspnSLWTRDTKPSWIRFFRVIALQMKRAY-
1191 >ERR1711860_326342
1192 --ELNSDEKTLIVTCSKQLleIQKVLGPQMMQQKFQKV-----------------------WSKEAGEL-KQLYDMR------------------------------------------------------------------------
1193 >SRR5215213_6828293
1194 --------RR-----LG------------------------gRIRC-APdR-----------PQRPPVRPRDATDC---------------VQAHV-PRGA--GRAVHRGRPLpAGGGGPGPGEAVTPEVAAAWEEVYWLFAVQLIG---
1195 >SRR6476659_6585810
1196 ---------------------------------------------------------------------------------------HVAN--A-RFTPC-PTYVDDGAavvtNPGKHRGADAGRAFSENLSVDWNAG-VRTAPPLVA---
1197 >tr|A0A2B4SAV5|A0A2B4SAV5_STYPI Uncharacterized protein OS=Stylophora pistillata GN=AWC38_SpisGene8312 PE=3 SV=1
1198 -------------DTFGPKEsRCREESVCKVRLLELNPNLQDAFPSFRGVS-LDELMNSRSLFLHSKRLMAVVEEAVSSLDDakeLIEDLTNLGERHL-AMSITEKHLKNLQRAGPATNQDAKHRLLANKGTAQIDRHIARMEDTRLP---
1199 >tr|A0A1E4GLJ3|A0A1E4GLJ3_9CAUL Uncharacterized protein OS=Phenylobacterium sp. SCN 70-31 OX=1660129 GN=ABS78_22870 PE=4 SV=1
1200 --ATAFARAADIEASLELLAerDIDPTARVYQRMFELHPQMEPYFW--RDTD--------GKIR--GEMLSLAFAAILDFVGErryADHMIGTEMINHE-GYDVPRDVFATFFAIVRDALRDLLGADWTPVFESAWEEMLAEIESYARQ---
1201 >SRR5699024_10012150
1202 --------XLVCLLSLPCPhpHLNSFPT-RRSSDLSKAPELYNIFNQ-TN----------QERGIQQEALAYSVYAAGENIdqlDNLKELISRVTEKHA-ALGVKAEQYPIVGETLLEAVEDILGSdVATAEVIGAWEKAYNYIADAFIE---
1203 >ERR687884_344007
1204 ------------------------------------------FPR--TT------------TAHNGRAQQSSTANRRaDYPRrapMNNLSRLLKESWT-LVEEQQDKYQVVGDALLEALRTFAGDQWTLEYDQAWRDGYALIAQRMIDG--
1205 >tr|A0A0J1H5I9|A0A0J1H5I9_9GAMM Uncharacterized protein OS=Photobacterium aquae OX=1195763 GN=ABT56_07590 PE=4 SV=1
1206 ------DFHQIFNDSYQRCqRHPQFFQIFYRNFWQQEERFQKMFEN-VDM------------TRQIKMLKLSILMIMLASTSeeAKDNIRRYARRHGPdGIGAQPEDFDIWIDSLLKAVKECD-THYNSDIDKAWRTCFKTGMEIMKQET-
1207 >tr|A0A2E7C7Y6|A0A2E7C7Y6_9GAMM Uncharacterized protein OS=Haliea sp. OX=1932666 GN=CME43_15375 PE=4 SV=1
1208 ------TSKELFLHSVTRClTHETFIHAFYLRLFDASEEIRAKFRF-TDL------------EKQNAMLRRSLLLYAEATAgRteALREVNERATTHDRhHLDIQPHLYAVWIDTIVTTARDFD-LQWNDDIEVAWRTILGHVVQQMIRRY-
1209 >tr|A0A0F6YJJ2|A0A0F6YJJ2_9DELT Uncharacterized protein OS=Sandaracinus amylolyticus OX=927083 GN=DB32_003309 PE=4 SV=1
1210 --------MDTTLDSFRRLRERGFAHRFYEQLFVADRRVPRLFAG-TDL------------ARQRDLLEHGISMLLAYQRgSalGEIAMRRLALLHGPrGLDIDHDLYAIWLRVFLDVAGELD-PEWTPELAAAWHAQLGASIAEMHRRG-
1211 >tr|A0A244CWV0|A0A244CWV0_9GAMM Diguanylate cyclase OS=Pseudoalteromonas ulvae OX=107327 GN=B1199_05805 PE=4 SV=1
1212 ---------------------------------------------M---ET----------VNSKAKVLNKLLIA------tsVVLISFIVSLQLA-GVEMGQSSIIAILVFGIASIG---AMAF-------LYKAVEQIADKLNVIEE
1213 >tr|A0A0L0EW98|A0A0L0EW98_9GAMM Chemotaxis protein OS=Pseudoalteromonas rubra OX=43658 GN=AC626_03140 PE=4 SV=1
1214 ---------------------------------------------M---NS----------QSIQSSLNNKIIIA------gvILVISIVVGIQLG-ASGAENMQLVAVALPLFGVVV---ALGY-------LKMALSAVSAQLGCVYR
1215 >tr|A4BJG5|A4BJG5_9GAMM Probable methyl-accepting chemotaxis protein OS=Reinekea blandensis MED297 OX=314283 GN=MED297_02020 PE=4 SV=1
1216 ---------------------------------------------M---NQ----------LNN--ALSARILIV------gtgPALLLVILNLALA-GSGSA--TVLNL----------------------------------------
1217 >SRR4026208_2063884
1218 -R-SVRTSKGHRQGHPPAIQkhGGAITTAMDARLFE-NEEVKAMFDQAAQES-----------GEQPRRLANAILAYarnIDKLDMLTAAVERMAQRHV-ETGVKAQHYPYVANALPPTIRDGAGG--------------------------
1219 >ERR1712080_92393
1220 TMSLSAGEITAVTASFEAVKadLGTNIGKVLQKLVAEHPDLKPHFPW-HAVP-TADLLGNDGFKTHAAQVGRGFAEAAGNLSNLsacEGYYVSLGDRHK-TRGFAAAQVPMVADAFVAALQ------LTGDDASGWTKLITFVGSSIVSG--
1221 >ERR1719334_3108017
1222 -TGLTPKQAQAIISSWENLNSEC-SSLLFKQLFTIFPELKEYFG-FSKRELVDKILNSEEMIAHMDATWNGLDKLVLSTQTgtrFAAIGKGLGYNHF-KFEIDRQDVHKFMDFFKQVLKDDLKSQFHGDLEEAWNIWCKAVEDVFIMGY-
1223 >SRR5207245_2384740
1224 --NPQPST-HAVTEQVVTLDv-----LPWTSGKLGLGPGKarlsEPLAP--GDT------------LE---SL----------LERQrarIPGfeewVYDArerriheHCTLL-VNGQAEYRRHTAEVEI------------------------------------
1225 >SRR5689334_4915957
1226 -----------------------------TASQRVTP----SLR--GKR------------VPSGQmgdRKVPD-VPIVDAHVHLwdpTAFrmpwLDGNKRLNR-PYGLADYREQTAGLPI------------------------------------
1227 >GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold2022664_2 # 351 # 797 # -1 # ID=2022664_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.631
1228 --------------------------------------------------------------------MPDFPI-VDSHVHLwdpNHFritwLDGNPRLNQ-RFAIPEYREHTAGIEV------------------------------------
1229 >MudIll2142460700_1097286.scaffolds.fasta_scaffold02451_1 # 3 # 1031 # -1 # ID=2451_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.574
1230 ---------------------------------------miGSRAL--AAL------------FPHPKTFMDTKRPVADTHIHLwdpGYLtypwLETVpaiagph----G-PAELQVQEPETDRFRL------------------------------------
1231 >SaaInlV_200m_DNA_2_1039689.scaffolds.fasta_scaffold02144_7 # 4497 # 5432 # 1 # ID=2144_7;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.499
1232 ----------------------------------------------------------------LQCGVATVRSVIDSHVHFwqpQRLrylwLDEVpair----H-PFTPHELNQATQAIDL------------------------------------
1233 >tr|A0A0K2U629|A0A0K2U629_LEPSM Cytoglobin1like [Saccoglossus kowalevskii] OS=Lepeophtheirus salmonis OX=72036 PE=3 SV=1
1234 MTLLTKKETFLIRESWKLVTPEmtKHAVGYYIGMFVSYPKWQDrFFRRIKGIP-LRDLRNNPILAAHSSQVFSAVSNLLNNLENtevIVEGVKKIARTHW-PLNIRGKELEAGLVLLLDYLEASFPGQISKECGDAWNKMFNAMSGVIVD---
1235 >ERR1719474_2118124
1236 --SLNPTQKCVIVATWHSIFlkhMNFMGKQLFVDLFKVEPNILKYFDAFRDVG-LANLLQSRSFQNHGVRIMNLVKFAVENLDNpekLQDHMHALGRLHV-KKGIDSKYLNIMGPTFCQAIRPMVMaeGQWSIDIEGAWIQLFKILAQMMRVAYE
1237 >ERR1719328_19047
1238 -NGMTPEQKQLIDDSFAVLKkdVKGNTIVFYETFFKMNPELVAHFPGVSE-ADLVNLGKNEFIIQRGAKFFNMIETTTHLMESKegcLELVRMLKESVP-EGKVTYDRYKVAKEPFIKMMETALGGNFSAETKAAWRKFFDSLAETTK----
1239 >SRR5581483_4578849
1240 -------QIALLEESFELIAgqSVELADRTLSRLIELDPQFRLLAAR-TEM------------AALRSVLFSVLyvlRRSLHNLNTLAPALETLGALRK-DQELSSEHFGTIGIALLDAMAEVGG---------------------------
1241 >SRR5690349_7596073
1242 -------------------------------------------------------------XMQMTRFTDL-GLRTLMLLasaestgrrvtTRTIAVGANASEHH-VAK----------------------------AVSRLAELGMVMADTLIE---
1243 >SRR2546430_1826610
1244 --SMNTLERQLVRATWIDLaaAPELLAAHVYDRLFTLDPSLRLLFLG-AEL------------SSPGATLTHAIDVAVANLERLEQTVARLGPDGT-IPSVQTET-GILGDALLWAVGSMLGPiACNPAVRGAWAKCCALLV--------
1245 >SRR5262249_54424048
1246 --TMNAYDRELVRSTWVELsaDLEVLAENFFDCLFTLDSSLRLLYLN-TDR------------VASGRALMHVVGLGVANLERLEQIAARAA-DED-VHAIGWKTGGIAGDALLRAVERTLGPaVCSPAVRDAWSRCCATLV--------
1247 >tr|A0A2H1V3P2|A0A2H1V3P2_SPOFR SFRICE_008656 (Fragment) OS=Spodoptera frugiperda GN=SFRICE_008656 PE=4 SV=1
1248 ---LFGSqEFKACCsgMGMGKIGKGGIGPPVtsL--tqrnttqalfhvgflPYLRAAIQwctvqvDNSFDYLGIWT-EpVAFSVDPLLIAWlaykpTVKSEASLPAAVKSLSQtqqIP---------FR-RRSTP-----------------------------------------------
1249 >ERR1719309_231760
1250 -TTLTEEEIQTVKTMWAGLleNSADSGLFIFQNFFELYPEQVHRFSFIRDSQGNpiPNYLKSQAMLQHSAMVMDALDGVITGVFehDplLGQMMYNAGYSHH-SKNIAKDDIEKLSNSILEVIKLVASCegSGKATKVEAWRKLLNIVNERFEQGF-
1251 >ERR1712168_640531
1252 -----------------------SGLVIFDHFLKMYPQQVKKFQ-FIQDKNgaiQYHYIVEPRMRVHSEMVMNAMDAAVVGIlrgHNVKQELEDLGRQHQ-SLRLK---qeeAAKEQEEREKEEEEEEEKeEE-AET--------------------
1253 >tr|A0A1X2H2S4|A0A1X2H2S4_SYNRA Uncharacterized protein OS=Syncephalastrum racemosum GN=BCR43DRAFT_446018 PE=4 SV=1
1254 --PPTAAQLKVIRRSWELVSdtrwpnepqtmspCQAFSIAFYDALFALDRTIESALSNI--ILQGKalsgilsHLVRTRVVLDEAK------------sidETHFARKLQAIGATYI-EFNVQPYFFDLVGPALISALQRRLKEEYTATIEDAWLTAQHYASYHL-----
1255 >sp|Q7M416|GLB1_LIOJA Globin-1 OS=Liolophura japonica OX=13599 PE=1 SV=1
1256 ---ISADQAKALKDDIAVVaqNPNGCGKALFIKMFEMNPGWVEKFPAWKGKS-LDEIKASDKITNHGGKVINELANWINNINSASGILKSQGTAHK-GRSIGIEYFENVLPVIDATFAQQMGGAYTAAMKDALKAAWtGVIVPGMKAGY-
1257 >tr|A0A0P5UDG4|A0A0P5UDG4_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
1258 -NILSENDITTMNNSWSILRkRSDFAPKVFVRYFKAKPEAQKLFPEFASIPL-TDLPNNHDFLNAAYSCVASLDYILPHLKIphPerCPVLMELKNKysnvdlkkfgpixxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxrcpvlMELKNKYSNVDLKKFGPIWMTAMQEEMGNALTNEVRDVWKKAFVAFTD-------
1259 >ERR1712000_676789
1260 -MSLTPQQSAQIRSSLPVLKseGETITSLLYASLLHNHPDLHNLFNSV-NQANG-------RQPRALLSSASVKGTARWESHQLS-------------------------------MISSRGTCWRPSR-RSWGPSGRLSX--------
1261 >SRR4051795_8230555
1262 ------PAVT---------------------SPRVpA------------------------------------------------FgSPCPVIRQQ-RWTGAI-----IGTRQEGSVP----------SAHSTTSGD------------
1263 >SRR4051812_47002672
1264 ------RLSA---------------------TPARtG---P---------E----------TRE------E-----------eTPSMaERTLTAMYD-DR---R-----AA---------------------------------------
1265 >SRR5215203_3322109
1266 --ELSERTIALVKATVPALEahGLAITRRMYERMFH-NEAIRDLFNQ-SHHG---------ETGSQPKALAAAILAYARNIEIlaaWGEAYWYLAEVLI-ARERLIyqglaaapGGWTGWRDFTV--AEKRCESEVITSFVLRPTDGGPVLRHR------
1267 >SRR3954470_353290
1268 ------ARRS------------------------------------------------------------------------SPLaEGDPRYHVH-QWDRGRQPRRSTRCRVTPPVT----------NIRRYLVGP------------
1269 >SRR3954464_15980397
1270 ------RRVW--LA---------LL----DV-LRRsGP-AT---------V----------VRS------C-----------sEMPLfrPGNAPRSAM-GSVPIK-----SVNLNSLPCTDVLGEDATPEILGAWGEAYWFLADLLIA---
1271 >SRR6478735_1414904
1272 ------SGSR---------------------PARLaS---R---------P----------SW-------------------nHRPIgEATLVNRYG-RS---A-----AGSDVE--------------RIERDLSGT------------
1273 >SRR3954468_7455402
1274 ------APPD--RA---------LT----GGGETVpG---V---------R----------ASR------P-----------rTIDRsGRTLVSQSE-RS---A-----EGSGVE--------------EIERDLSGT------------
1275 >SRR3954470_12739883
1276 --------------------------------------------------TS------ACSRTRTSATCStsrtmarqapsprrspPPWSPMRAISTtsaRSPRVERIAQKHV-GLNILPEHYPAVAESLLGAIKDVLGVTHYSRGLTDDPDWYPYLKKHEWL---
1277 >SRR5215831_13609655
1278 ---------KPCNRSKPFFRinAFCSAvslalrlQRLCELPESAHPQRC----A-SCLK----------TANPAKNVVPKRFGTFISIHLrdtYIFAVSKIGQKHC-GLNILPEHYHYVAESLLGAIKDVLGEAATEEVLSAWGEAYWFLADVLMA---
1279 >ERR1719273_448027
1280 --------------------------------------------------------------------------------------------MD-AWTDVYN-------ALTKVLQ----------SLEDNIKGA------------
1281 >tr|A0A0P5DF02|A0A0P5DF02_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
1282 ---KPANDRRIIRKTWDQAk--------------------------------------------------------KDGDVPpqiLFRFI-------K-AHPEYQKMFKSFADVpqae------LLGNGNFLAQA-YTILAGLNvviqslssqelianQINALGA----
1283 >tr|A0A0N8DDV1|A0A0N8DDV1_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1
1284 --------RRIIRKTWDQAkkdgdvppqilfrfikahpeyqkmfksfadvpqaell----------------------------------------gngNFLAQAYTILAGLNVaiq---ALSSrslLPTKSTRSEVPIS-PVeLPPSCSSNSATSLrksllk------SSAAPSTprpdkpGRTVCALWSLASPRTSRTPK----
1285 >tr|A0A0P5CUZ8|A0A0P5CUZ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
1286 ---------------------------------------------------------------------------------MFNPAGKT----S-GVPATPSFP-PSSSIssrrlpa------prSTSSNSLANLTKCSWVR---------G----
1287 >tr|A0A0N5DPZ7|A0A0N5DPZ7_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
1288 -MNLSAKELQLIEQSWLDIeNKDELGKEVFKRVLLSNEKIRTIFDL--HTCPDDELDQNETFKRHLKSLSLFIGICATSVavgsERLVSIARRIGEKHVNFRWVtfDAEYWLLIKGIMVDVIASKQRPKEVEKVRSAWNTLLSFVISEIKH---
1289 >ERR1711868_89060
1290 --GLDKKQLALLQKTWKDISteMEAQGVRLFVEIFQSNNEVIHVFPSLNPNLKGNraNEVIHEAFKNMEAKLLPESMRFFT----------------------------------------------------------------------
1291 >tr|A0A090L154|A0A090L154_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti GN=SRAE_0000030700
1292 --NLSHEQQALIRKSWRRVPKQNIGKIIYQKIYQKCPELKNFLSS--DN---------NCVERHFRYFGDMLQCTVDSLNELdkalYPWLTVIGSGHA-GFAITTAHWDAFGEALISSIKQWILSgKEHKETVRAWMKLSCYLIDTLAAA--
1293 >SRR5256885_864722
1294 --VLTDRQRAIVQSTVPLLEtgGEALITHFYQTMLGEYPEVRALFSMAHQQ------------sGAQPRALAYSVLMYAKHIDRLEalgDLPAQIDRKST-RLNSSHLVISYAVFCLKKKKRTGSDS--------FTRSE-----RLVV----
1295 >SRR5256885_6575144
1296 -----------------------------------------------------------------------XMVMSMRGPALEaagTTGCRSCSAAV-CCSFF--------FQAEDGIRDYkvtgvqTCAlP---------------ISDILIGA--
1297 >tr|A0A016SWG0|A0A016SWG0_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0168.g192 PE=3 SV=1
1298 --QLTSEEMDLLRSSVRIIseNATEVGCNTYEMIFEQSPYVKEFFH-FTKSD--DDAYRQKQTVQLAQKYMQVLIAFVEGIEDpsiLEPVSAKLIEIHRKvddVQ--MAAHWGVFTECTLYNIRKALEKDehFNDmdrldAAVMLWRMVIRGIVRRLKA---
1299 >SRR5262249_10507301
1300 ---------------------------------------------------------------------------------NvkySSHHQQHGPQAR-GVRSTNLAFCCVWRRTEMG----------P-ATAVWSGVHCRDAAGMDG---
1301 >tr|L7MTK4|L7MTK4_SYMRO Neuroglobin OS=Symsagittifera roscoffensis OX=84072 PE=1 SV=1
1302 -MQVSEEQQSLIMEDVQVLlpNYDDFVEDVLQQFMEENPETFQIFPW-ADASkTAKEMRSHPRFKSHAKSIGKVISDCLVDLNGvkkHEPKLSSLGAMHT-KKKVPTELFGKLGGCILTQVVKRVSeAKWSEEKKEAWLKAYGIITV-------
1303 >ERR1712227_290716
1304 --KLSTKTIDLLKGSAAEIKenGTAIATELFKILFERYEVFKDLFPA--DVI------KNG---KMISVLPhalSAFAEFADNMLELDDTINRIVSRHV-SNGVQQWHYPLLEECFIDALDKTLKLDKRPELLQAWKDGFKFLANKVM----
1305 >ERR1711868_248053
1306 --RLTPDTIEALKYTALEIKgrGNDIAKSLFDLLFTRYPVFKDIFPD--ENI------QEG---KMFTVLPialHAFAANCDNIAAIDETLARIVTRHV-DRNVQDWHYPMMEECLIGALRMHLEDDEGMDAMEAWKDGFKYLANKIM----
1307 >SRR5262245_20097952
1308 --EVTPQQIELLEQTLSELRrqSVFAAQLFYCRLFSLRPRLRRLLSG--RP------------DFHGTRLLSVMSAAVAGLSDPghfAGLLSLAARPAVREALLQGDCVRVIGDAVHWMLERHFGGQITVEVREAWRAAHIRITQVIE----
1309 >tr|A0A044TBZ8|A0A044TBZ8_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1
1310 --NFDDAEIQLLRRSWKTIKpeKQT---------VLQCPEVRRFFPFM-NSDLKSCEKKNKRFVFQALRFIQvdmtIFNEIIISSF-----S-------------ndIAILMLVFLECSIHQIRITLLNSkldlWNRKdvdnVIILWWHLNSGICGKIK----
1311 >ERR1719186_618842
1312 -----SVQTREIRGTWVVILaqLQKVGVQCIVDLFELHPFVREHFKEIlvqyGKLDPDNDNALQNVLENHAKLVMNIVHELVVNIDNLdglSERLQKLGLFHV-RNAVPKKYSSTIVAFSHTEMHN--CRdlAFNFPETHELHG--------------
1313 >SRR5688500_15455526
1314 --AITPYDALLLQDSFRAIQqqSGPAAERFFRELFSYDSSLKQLFAS--DRW------------RREEVLMKALGRLVDHLNSpdgVGPHLVELAREHP-AYGLSNYHHLYFGAALFSMLELVLGARFK-LVYGAWFKLFQLAVSEVK----
1315 >SRR5690242_19663030
1316 --VITADDVRMIQESFRRVEsvRASAAERFFRELFCYDEMLRGFFPP--DRW------------SREEQLMSDVRGLSEGLTQpdkLKLAIDALALRLD-GSLRRTPLHLYIGAAWFSTLEMVLGSQFDRRLHAAWYKLFEQVVA-------
1317 >tr|A0A1I5XDG1|A0A1I5XDG1_9PSED Globin OS=Pseudomonas borbori OX=289003 GN=SAMN05216190_1566 PE=3 SV=1
1318 -----ADDAALLEETLEMVSsrSEDLTPDVYARFFSRCPAASGLFTvI-DpatPP----------M--GCGQ----MLFEIISLLRDsaagkPYVAsyMQQIATEHaA-FDVRDPALYREFMHSLADVQATLLGPDWSPAHAQAWDRQIAALLRHLP----
1319 >tr|A0A2D8QSR0|A0A2D8QSR0_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP89_08285 PE=4 SV=1
1320 -----SSKDDVIAESLSLVAerAGDVTSVIYEKYFMRCPSAEEVMSH-LDA----------Q--VLGK----MMEEVYRLLMVndyesENDYLNWEVSNHeT-AYNVEPHMYEGFFSAVIDSVREVMGSQWTPALERVWESKCEELRSEIA----
1321 >SRR5207247_8066543
1322 ------LDVQRLQESFARMAmhGDAVPLFFYSDLFLRHPETRDLFPV--SM------------AAQRDRLVDALGRIVSDvehVDADSGDPSGARPEDA-HIQAVRILsnAQQMADNYVADAQEY-----SSQLSTX-----------------
1323 >ERR1719419_503384
1324 -TDLSPKEILDIQMSWAEIHQEgLVnpDVLMFKLFFEESESGRLKYSHLLkNVNLDnlnwmRDWTKVQKLKDSIDKTGEALGDVIKSLNyhdRVVDKLYSHGVVHA-KFGVTRKEIHTFCECLLMTLKMELGTNLSQEAQASWERLLKMIVEVFC----
1325 >SRR6266536_694904
1326 ---------------------------------------GTRFA--DSHR------------PPRTMERTGplrDRLALRALRlgvgdvvwEDVPSLKRSMCG-----------AAAAGAAPVVAAVASAAPGDPQKHLKRADQVYAKSILLRMS---
1327 >ERR1719230_2183946
1328 -SWFTDDRERLLKRSWQQLQldsCEEAGALLCRNYCSQSPEDAASC----G--------------MDWSAVIKVIGFPIDRMDNLafvKKRLRCLGANHA-KWETKEHQFQSMKYAFLSAPRDVFANEFTSDLELAWDLLYDFVSTEMIAGL-
1329 >tr|A0A090KT29|A0A090KT29_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X0
1330 -TKLTENHRKVIKSSFEIFKknGVPNAHNIFLRMFKEYPDYKNVWSQFKNMS-DEELSQTPLLWKHATTFVFGLERVIRTMDDqemMILMIHSTANQHK-SWGLKKEHFFAMVHLITDILMEEKGEpDEKYAIMEAWESFYDVLGT-------
1331 >tr|Q6BBK1|Q6BBK1_9BIVA Hemoglobin chain I OS=Calyptogena kaikoi GN=Hb-I PE=2 SV=1
1332 ---VSASDIKNVQDTWTKLYdqwEAVHASKFYNKLFKDNEDISEAFVKAGTGS-------GIAMKRQALVFGAILQEFVENLSDptaLSLKIKGLCATHK-TRGItNMELFAFALADLVAYMGTTI--SFTAAQKTSWTAVNDVILHQMSSY--
1333 >tr|A0A0N4TEQ4|A0A0N4TEQ4_BRUPA Uncharacterized protein OS=Brugia pahangi PE=3 SV=1
1334 -IPLTRKQKFVLIKNWKGIErdVTTAGIEMFLKMLTEHPEYYEFFN-FRNIANTakEKQASDERLSAHGAAVMKFIGKAISQIENadaFFMLLENNGRQHAHRGAFRPEMFWASYSFTCYSFSNGFIRNFFSNI--------NLLLTKVEMSY-
1335 >SRR5690625_5362168
1336 VLRSPPpphpaasslSLRDALPLCAGVVaeHAEEITTVFYRDMFEAHPDLLNVFNV-A----------NQAVGEQPKALAASvVAFADRKSTrlnsSHVA----MSSAVS-CLKRRSPERR-RG---------------------------------------
1337 >tr|A0A177B679|A0A177B679_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_02502 PE=3 SV=1
1338 --GLTKTDINMVLGSWESINNDEASSIFYRELFNTYPDTKSLFVKFYSVD-NDKLIDNPAALKQLRVTWTAITTLIDYLKKgrideANKAIDYLIEKHRKIKTFQGPMFNMALEPLLYLVKEKL---TSQAYIDAYKKVFGAIFLTIISKY-
1339 >tr|A0A177AVU9|A0A177AVU9_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_06067 PE=3 SV=1
1340 --HINIKDIERVSTTWDLLDDKKSAIRFYKHLFTIYPQTNKIFVKFHNAK-VDSLGTNAQALKIAKAMWGSASHIIISVSEgnlkeIYKSIDYLIKIHVNVPKFSPTMFELAVKPMVATIQEKI---TDPEILQAYVNIFTVIIEKLKTSY-
1341 >ERR1719397_1495121
1342 ---FGAAQTRMIRSSWSIILaqMQTVGVQCIVDLFNLIPYMREHFKKViadsGRMDPDDDSAMQAMLENHAKLVMNIVHQVIINIDDLdliSPKLFRIGVFHK-NTGILPRYLDIMGPVFCNAVRPILLKhkMWSAETEDSWMEVFKVITSIMKRGY-
1343 >tr|A7BZS6|A7BZS6_9GAMM Globin OS=Beggiatoa sp. PS GN=BGP_3767 PE=3 SV=1
1344 ---------ELIGQSWDKLAGkhEEMVATFYDRFFDKFPHYRKFFP--ESM------------EHQLKRMAETIALLARVTHEtevTHPHLVKVGSRHT-GYCLAREDLDNFKTIFVQVVGEYCGDDWNQEYQESWTEAfEQHIIPYM-----
1345 >ERR1712048_439078
1346 ----------NVTTIWDSIKavpgyEEKFGRMLYEKFYEMEPESFKLFKK-TRQPAAEDVFSDPVFVQHSLEFVRLLDFFIQVLGPdielVEESLVDFGETHQ-DYGVTLDTYSSFGEAMTETVEELLGGngKMDETSRRCWVTAYRYMSMHMTRG--
1347 >ERR1712048_1339107
1348 ----------NVTRWWDEIKripgyEQKLGATLYQKFYDLEPDSFETYTS-NLT-PTEDIYSDSTFLENSATFVHLLDFFVQVLGPdlelVEESLIEFGARNYNDFGItTVDSYSSFGEALL-----------------------------------
1349 >SRR6516162_179054
1350 ----RSQTVMDIEESLHHILerEKLVADLFYMVFLEKYPEVRRHFINV-N------------LRRQAVLLTMALQVVVQYYLKgfptAEAYLKILGEEHN-RRGIEPELYPKFCTALLETLSRFHFHDWSEDLAQQWEEALKLAATEMVEASP
1351 >tr|K2K1I7|K2K1I7_9RHOB Globin-coupled methyl-accepting chemotaxis protein OS=Celeribacter baekdonensis B30 GN=B30_11265 PE=4 SV=1
1352 ---LAVKQISLVRNDFRRLAPvrPEMFKRFYERLFEIAPHTRDLYS--ESL------------TEEAIRVNGLLEIAFLSLDHpqaMFATLHTLGRDFS-GFGIWETQSDLVVDLLVEVFAEFGGEDWGTELEKAWHSVLSFIAQGMKEG--
1353 >tr|A0A291GF03|A0A291GF03_9RHOB Uncharacterized protein OS=Celeribacter ethanolicus GN=CEW89_16165 PE=4 SV=1
1354 ---PSARQIALVRNNFRALSPkrPDIFIPVYDRQVGEDPKAAAQYD--GSL------------CQRARVLDGLIELALLSADHptaLFATLHKMGQDYA-HYGSWREKHPFLIGQIIKAFAEATDTHWTDELADAWEQFLYFMAEGMLEG--
1355 >SRR4051794_12469468
1356 --------------------------------PPTMHDLRILLAG--DA------------GVRREQVGQALSWLVDNLDQprvVAATCADLGPALQ-QVGASPQRLDALGVLVADALRANFGAAWRQEHYDAWHSSARLVTSWMGQ---
1357 >tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1
1358 ---ASADDIALVASVWVFVkpNLEEVGNEFYDQFFAKHQDLKATiFL-------------GTNFLTQAIRVMEMFDAAIEAMCDpvaLMELLVPLGERHA-LYGIRKEHYDIFWPALCIALKEQLGDKLTDDVVQSLHRVYYKVIQVMLE---
1359 >tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1
1360 ---FTPTIVRTIRTTWAAAtkDMDAFGDRLYTAVFALDRTLKeTIFKG-TN------------MSAQAHHIIETLDSCVRIMDQpnhLMSMLRQLGVRHG-AYGVGRHHYPTIGKALISALEGSLEDKFTLEVNKSWTKFFNVIERSMLEG--
1361 >tr|A0A1V9Z083|A0A1V9Z083_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_04708 PE=3 SV=1
1362 ---PTATDEDLMTQSWDDIIgcklrAEierrkapstepspeaptttsaivQFYDTFFSHLYVINPETRSVFRN--SM------------HVQSKALVNIVGAIRHVlhSDDAKNMVAAMAVRHI-QYGVKLEYFDNLGVAMIQTLSKLAGTTWTTAMADAWHTVIAYIICLIVPHY-
1363 >tr|A0A1I7UV11|A0A1I7UV11_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis PE=3 SV=1
1364 MDRLTERQKQIFTETFPVVfkDSRRNGLVLFAKYFSEFPHYKNIWPQFRTLQ-DSALLASNELANHCSVYMSGLKEIVEVMDDeekLTYFMARIARSHV-KWNINKYHITNMLEGVDAVLQRSFGDKLTDEIVNAYHTLYDVIGNLLD----
1365 >tr|A0A0P5Q0G6|A0A0P5Q0G6_9CRUS Uncharacterized protein OS=Daphnia magna PE=3 SV=1
1366 --SMKGRGSCFDQGHLESCKkNGNIAPKAFIRYLKLKPEAQKKFAAFAEVDL-ADLPTNSHFLNQAYTCLAGLNAYSDNLGKNPKSCPYLNSP-AF--KdVKPDELKLFGEVMFNVMEKNWTIIFPRQARKAWKDGLTACDVA------
1367 >tr|A0A258C6P4|A0A258C6P4_9PROT Uncharacterized protein OS=Caulobacterales bacterium 32-67-6 GN=B7Z13_12975 PE=4 SV=1
1368 ------MNTQALLDSLDLVAeHGeDPTPRVYERLFARYPETEALFMG--DTR--------GA--ARGQ----MLRQAIETLLDYlgpnafaANFLRAELHNHS-DIGVPTEIFPRFYQAMAEAFADILGGAWTADMQRAWDDLTAKVEQIVRG---
1369 >ERR1719244_673251
1370 ------GQKDLIIASWREIriCLDEVGFDTFKQLFAHHSDIRAYFPAMKKLSS-NDVEMSRKIKEHSTRIMAVLKLFVDNIYDLekiEPSIEDLGRNHS-FRTLLGLFLSE-------RISGQL--AWR--------RCCFNYLNIS-----
1371 >tr|A0A1I3XAR1|A0A1I3XAR1_9PROT Methyl-accepting chemotaxis protein OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_101121 PE=4 SV=1
1372 -----QAAIQRA-EACLTLSadGLVLEA---------NDRFAALL-G---LA----------PAAVADRPHA--ALLTLAERDgatYRRFLDQLAQGR-------------------------------DTVARLWHQGAggagvllELSAAVMAAD--
1373 >tr|A0A1I3XA39|A0A1I3XA39_9PROT Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_10
1374 -----MAAIDMA-QPMMLLGadGVVQDA---------NAPLAALL-G---VS----------ADALAGRPHA--ALLAEAERDsaaFRRFRDAVAAGQ-------------------------------AGHARLRHAGAggntvtlDLMMQPLAAE--
1375 >tr|E3MNQ8|E3MNQ8_CAERE CRE-GLB-30 protein OS=Caenorhabditis remanei GN=Cre-glb-30 PE=3 SV=1
1376 -SHLTPIDREILNKSWAIVskDMQQVAVNIFQMIFEQAPDAKLMFSFM--MKDYKEDKKSNEFIFHAVRFLQVIESTMTHLDDpsqLDAVFLNLGKIHAkheEQLGFSAHYWSVFKECVLFHFRKAMKAHnkFSkhkemsfAEIDSAiilWREVLRFIIDRMKVGYC
1377 >ERR1740129_566420
1378 --QLSSASVETVRQTAALVgsRAQEIVEAFYRGLRARYLELFQFFNR-TNQTSN----------RQSRALAVALTafaSKIDELSEIHGLLEMISVKHC-ALAVRPRHYMLVHENLLAAMEEVLEDQLTPSGYDAWSDAILYLVRLLTEQ--
1379 >ERR1719183_2765469
1380 ---------------ADIFmpRLEEIVMRMYNLILEEQHECINIFNT-PSLSPG----------QPLAALAACIRgliEDINVRPRLEHRVEMIAQKHC-AINLQAHNYLGLQGMFMSAAEDVLGADMTPQRFSAWSQALLFICRLVIER--
1381 >tr|A0A0L0FUF5|A0A0L0FUF5_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_07147 PE=4 SV=1
1382 --ICKPEELHtkdlgfivtHTNNPW--GstDEQDFGVDFFRDHADQ----------------------------SGLTSFFSSIVIIACEMYqefepSIPQLQKLGEEAK-HLDIPCHMEDNIVGYVASTLSR-SK-QFDAIEECAIFKLIWRVVLFVLE---
1383 >tr|A0A2E9QYM9|A0A2E9QYM9_9DELT Nitric-oxide synthase OS=Deltaproteobacteria bacterium OX=2026735 GN=CL920_22905 PE=4 SV=1
1384 --ALSS--MKEAKRLWEEGvgLHTAPGSEWVHQLVAERPEWNHFFAS-SDPE------------AFGEALFSTIDSAVHQLDDevsMFSSLREDSELFT-AWDVRACAFSALPDVLVDFVV---E-DHQTVGAQALRTFLRRVCTIVSL---
1385 >tr|A0A0K0EIZ9|A0A0K0EIZ9_STRER Uncharacterized protein OS=Strongyloides stercoralis OX=6248 PE=3 SV=1
1386 -VPLTERQKFLLVKNWKGISrrARDAGTNLFVQLLSEHQELGDYFI-FGNVKakDKYEMLADERIQNHGEAVMRILDSVITSVNDPQemfRILEEQGKQHAIKKNFKPELFREVEDALFYSIKLILDERYTDNMDSIYRIIMKTVLKTLE----
1387 >ERR1719158_1160759
1388 -------NKHLIDETMDRVanaNIAELGVICHKKLFSLSEDVQNYFYK--P---------NTMVAYILEKVLFILSNLSHEPVKIAHEIRALGMRHI-KYNIPPVHFPLFGKSLMYTFSSTLEGFWTDDIEDAWGSVFDFVCRCMTR---
1389 >ERR1719158_1490032
1390 ---------------------------------------------------------GGQLSFICRGHSSRIN------------RNALRVRRsrI-TNRSHSNCFSSYT----------RCSISSITCASAWATCLLR---RL-----
1391 >SRR5438270_3151649
1392 ---------------------PQIVDRMYTRLFEVAPRVVKIFEG-KDPT------------KQL-RTVHVLRDSFDDLSALTPELEALGERHA-SWGVQEQDYAIMGPILLEAMAASVDPYWRSEYTTAWAALFQTVEDIMVR---
1393 >tr|I2K200|I2K200_DEKBR Globin, putative OS=Brettanomyces bruxellensis AWRI1499 OX=1124627 GN=AWRI1499_0864 PE=3 SV=1
1394 --QLTREEIDLLRWSWRLVTvdddSTSLGGNTFnAADFSSYLFCIQFYNNFISMD-EKVVEMIPSIRHQASSFADVLNQAIGTLEDLskmQELLTNLGKLHARILGIERSYFKTMGEALIKTFRDWFGNNetfFPLILEEAWIKLYCFLANSIIQ---
1395 >tr|A0A0R3PZJ2|A0A0R3PZJ2_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1
1396 --PFTDEEKSELLRSWKVIeaQKQAVGCDIYEMIFNQL------EP-FLCVSIKAPKELHNKFRIIVICIVGRYEEELSSVNE------------------------------------------------------------------
1397 >tr|A0A183UUV2|A0A183UUV2_TOXCA Uncharacterized protein OS=Toxocara canis PE=3 SV=1
1398 --RLSPRHRNLIIKSWSKTNKSKIARDTFVELFKTSADIRSKFV-FGDV-PIKRLKQEDRFLAHCERFVAALDSVIAHLDEIGaviENAEALGKYDISAepihaamaKDLRNEHWRLFGDILVERIIENDTKqpSGGSEVHAAWKMLGQLLVFHMRLGY-
1399 >ERR1719367_1435250
1400 --------KTQLRSTWNVImsDMASIGVVMFLKMFETHPETLSSFIR--NVYSIKEIEmdewYQENLKLHAIRVMAIVEQVIHRLDEVgsvIKILMKRGLSHK-RLGVQRSMLEKMGRSFVLSIQSPLEEanKWDATVEQSWLSMFRFIEFWMGLVY-
1401 >ERR1712004_299484
1402 ----------ILRESWKHLqsRIESLGVVTFLSLFNASSETLHTYLTPEDIATLKEQDkdkmLIEKLRVHPLRIMSVLEKTVHRLEDHqrcLKMLRQYGRKHQ-RFGVPPFMFATWPGVFYLYSSPYWKNlsNGMRTFHKLGKACFNSLHLEYRE---
1403 >tr|A0A132A213|A0A132A213_SARSC Globin-like protein 2 OS=Sarcoptes scabiei OX=52283 GN=QR98_0035350 PE=3 SV=1
1404 MTEFEREEIEVLREQWDRIVhyhQECFGMKLFQRLLQLHPEYRPLFG-FEE--TVEEIQNTQRLKAHGINVVYMLNMLFDNFDDmdmIDELIFKLVKLHM-MRGIDQIWLDDIIEPFELVLEEF-NAKIQIERIEVLRKAFIFIKNRMQELY-
1405 >SRR4051812_15383594
1406 --PMTSDTIALIRASFRLAaaDPQALSQVFFRRLLLRSPGVQRMFPA--SL------------VRDPQRLVGLIDQVLRLLDRrdmLVEGLQNLGRLQA-PYAALPMHYPLIAGAFREALALRVGTLWSVDMEESWAELQALVIRIMGA---
1407 >SRR4051795_1885912
1408 ----------------------------------------ApRTAR-RRLQ----------PGQPGRRLAAdRAGrvgrGLRQRPAegprtdsrapavadraqarvaghrprpvrRRaRQPVLGHRRRAR-EGGHTGGRRRV----GRGLLADglCPGQPGARPLQRAWRAA-----GDGVAR--
1409 >SRR5690554_337115
1410 -------YVKLLETSFQKAvenvGIEELSTRFFSRFFETFPETNSLFKG-TNIDY----FR----KFKMRVIFDFLIDIVKHPNYAEAHIAQEVMRHQ-MYGLqDKEYYFTLAACLLEAVKSALGDAWTDEDESAWNDILLVFKG-------
1411 >ERR1739838_826584
1412 ----LFGSVWPLPLSWDIIShkVDQDGESRFLHKFESNQETEDPILQQ-FT-------QIDASIFNGKSAMIIVALTLENLENyqaLWRNLIRLGRDHF-GYGAQPMYLDLIGPHFVITIRQTLGYDWYEALEYHWLALFELIVYVMKFGWH
1413 >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
1414 -------NLGLVRECWDSICeqytTNELGEMVYDHLFKMAPNLTMLFTKPR--------------SYMAVKMGDMLSMLVSFADSsesMKQQISWLGLRHV-KYKIRPHHIPLMGPVFLAVVAEAAGVHWSQDTEKAWSVLFNMVCVNMADA--
1415 >SRR5690606_39733342
1416 -TEL--YTLSLHDALPIWVAekIGDPTRLVYERLFAEQPEMETLFI--LDTD--------HSARGH------MLTEALNCIFDLlgQRayapvLIQSELTNQD-RKSTRLNSSHVKISX-------------------------------------
1417 >tr|A0A0D2X3G1|A0A0D2X3G1_CAPO3 Uncharacterized protein OS=Capsaspora owczarzaki (strain ATCC 30864) OX=595528 GN=CAOG_004918 PE=3 SV=1
1418 ----RHETRDAIQSSWALAIqkhddHdvtpvATFVNILFAKLFEVCPETRLVFGH--DMV------------RQGKSLSSILTgmlEFVVHPKKLQSQVKRLAHMHV-GLGVTPDMFEAFGFSLLYTIRVRIGSAWNQQIERVWVDTYGGVSNILSQH--
1419 >SRR5215208_3780459
1420 --PLSPEAISVVRATAPVVAahADQITAHFYPRMFAAHPALLRIFNQ-GN----------QATGEQSKALAGSVVAyAVQLIDPeapsFDHVMRRIAY-KH-VSLVSARSSTRSSASTCSPRSVRFSA--------------------------
1421 >SRR5687768_12147577
1422 ------------------------------------------------------------------GLAHARMDsVSLK--PpanphcaiktwvlacgvpartaeWRPMSN-L-SDAP-SPSLLSDQSLSV----VQ-TTATVVAAHADEITAAWSEVYWLVALQLV----
1423 >SRR6476660_4664138
1424 --M-VVVGVDAHKrtHTCVAVDgsGRKLGEKTVPATT----------------------------VGNASALRWARSTf-GpdltwgiedvrnvsRRLE----------QELV-NAGQR---VVRVPTHLMARTRasartrgksdsidaTAVARAvpREPDLPVAqHDSVS--RELQLL----
1425 >ERR1719193_1089955
1426 -------------------------------------------------------LKRHRRNRHEGIRFQCNYCDYD----AgqkGNIKSHMDRKHP-EIPYDHTEFQEVRVEKSkysreakqqELDLAAmqGADAFNMNPLAGIGNMMPFNAHIL-----
1427 >ERR1719378_1531842
1428 --RFHPgaDGVHRIGGEESQ--aeVRRQRSLSLPKFLDSLSGEKEKFAFNFDSMgnVLPNFHASHAQKIHSMKIMDAIDAVISEIlrDHpIKQRLMDVGYAHY-ELHATSKDIRKLTTAFYKGVKDLIGIDDdNDRHLVAWKDFLNKIEEGFKE---
1429 >ERR1719414_1806212
1430 --DFTLEQIECISTVWANLRqsSADNGLYLLQHFYTLYPEEMQKFDFNLGDRqdFRLNFHRSQLVRDHSMKIMNAFDALISEIvhGRpVKQRMIDIGYEHY-ERDATAQDIRKFTKAIYSGVKDLMDADHdgprraaaghDDRHLAAWKVFLDMLAKGYT----
1431 >ERR1712142_47027
1432 --EFSGEELEYICSVWGNLRmnHPDAGLFLLEKMFLKYPELAKKFDFCRDFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKSHMRQMADGMLEGLKEVIGDAKdSTRKLLAWNKLFDMIVEEFGN---
1433 >ERR550534_2245262
1434 ------------------RDlrHPLGLLLALH---------GGFLSFFHGFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKIHMRQMADGMLEGLKEVIGDAKdSTRKL-------------------
1435 >ERR1719192_2788519
1436 -------RREIIGTMWESFRedSVSSGLFILEHFFSTYPDEMDRFTFASGGQtdketPLAFIMKRERMRIHSAQLMNALDRNGHVYGRspgCMDQAPQSHRG-------------NVCRRTGKSSGIA---------VFKWRVA-------------
1437 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514
1438 -TNLTPQDKQIMKEDWLMINEkKTAVNNLLLKFFRSFPQAQAMFPKLAKVP-LSQLPSNVEFIAIVNSIKNGFKFVIDSADDVGLLRQLAGSQDISvftVPGIPVaQQMQETGRVIVEWVQEEMGDRFAERTRVAWIRGLRSISQAFVSGQ-
1439 >tr|A0A0V1CPF8|A0A0V1CPF8_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16047 PE=3 SV=1
1440 -SKFTDEEVELLARTWKKDDfdwLYRIGTDIYTCVFQLAPELKVFFPYVTECeKKNQSWESSKGFRTQALRFVQILGMAVEKTESrmkdddshLHHRLYKLGETHRRfaLKGFTPTHWKGFVIAVRVAMRRAVEAmpNLtpaeCETAIEAWDKLSRYVVHRMEEGY-
1441 >SRR3954453_266974
1442 --MLTEKSRPVLEATLPVVgeNIGKIAERFYQHMFGEHPELLdGLFNR-GNQAEG------TQQQALAGSVALFASALVSHPNHLPdHLPPRLTTQTP-RPS-------------TWCRGSRT---STPRSAFART---------SIRS--
1443 >SRR6478609_8547471
1444 --VlvdveevlrvvfgFDLPQTDVVRSvVLGNPgq----I--------IAVHKVDV----------------------AAGGRIGPQGGRVVPHPRDVClV-LRRVHPLR------------------------------------------------------
1445 >SRR3989304_146361
1446 ----------DLEASVQRIldRGKNLADLFYCVFLDRYPELRRHFTAV-DL------------SHQAALLTMALQVIAENHLRpspaAAEYLLVLGHRHH-AWGIERDEFRRLRFCSPPPPQPSHGKGGPAARPRQWRAAIDEAVDTMRAGY-
1447 >HigsolmetaGSP17D_1036251.scaffolds.fasta_scaffold61070_2 # 263 # 457 # -1 # ID=61070_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.672
1448 --VLNSIDEDLTTKSWNIVMsgtPtENFkakkldpcfhystslswfYDIFYKKLFELCPDVESMFEN---V----------SLVHQGKLLATVIGSALASLKKpiiLKKRLIALAQSHN-GKGVKAIHYCNMGLALFWSLEEVLGVsVMNEETRTSWVKMYSFMLNIII----
1449 >SRR5215510_2422438
1450 -LQMTKEQIEVVQNTFNKVRPmsGTAAQLFYNRLFDVDPSVRETLL--WTLK------------QGlGADFTPEAEVAWGNAYDFLAAVMQQAAKGA-SMX-------------------------------------------------
1451 >Dee2metaT_27_FD_contig_31_2132282_length_204_multi_2_in_0_out_0_1 # 3 # 203 # -1 # ID=1013462_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.592
1452 -----------------------------------------------SAA------------TSNPQF----VAAV----------------------KKAIDYSGL--------LTVAGQGAVQPagiipSVIAGTLPAADALKQDVAG--
1453 >tr|A0A068XSQ8|A0A068XSQ8_HYMMI Neuroglobin OS=Hymenolepis microstoma GN=HmN_000477400 PE=3 SV=1
1454 --YFSEFEKDVLISTWEALLlyTHEHGAFIFRLAAEMCPELKAAYNV--EFNDDDELVISSCALQYSQAYITLIDEAIRSLEDPQEgfydSVLIAGASHATIPQMKPEFFKVLKRATLTTWEGLLGEEFTEDVANSWQTLLDYVVAVMVEGN-
1455 >ERR1719193_549257
1456 --IFTDDELAILKDVWAHLKhhTAGAGLTILDHFFKRQHWALERFEALRDMY-GNihpDYMKIDLMRFLAVDLMEGIDIFVTGFFErdpeVTDLIADVGYAYV-KKIIIESEIEIFVDSMLAAMEELLGEDtWK-KNMAPWKKLMPVVAEHFSRGFK
1457 >tr|A0A0D6L5L7|A0A0D6L5L7_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_14144 PE=3 SV=1
1458 -MLPASEVKKLVKSSLERVAigkepkEVQGAKDFYKYMFTHHPDLRRYFKG-AESFTAEDVQKSERFDKQGQRILLAVYILADTFDDeptFRAYARETVNRHR-QFKMDPELWSAFFTVYVNFLASRGP--LSDDQRKAWAQLGKVFD--------
1459 >ERR1719254_19301
1460 ---------------------REIVDDFYPRMFANNPETKALFNPA-NQ------FEEPNRQRMALtnAVL-AYASNIDEPEKLADAVAIISHKHA-GLGIQAAHYPVVHKNSGLHRARHGR-rrdaGGRRGLERG-----------------
1461 >ERR1719394_777503
1462 ------------------------------------------------------------------------------------AIRLGDFQHI-CT-TPLPFCRESPQVQALHHSILGPEVVTPEIGQGWSDGVLALAEILYK---
1463 >SRR5262245_29633745
1464 ---------------------------------------------------------------------------LGNHSTrCgRSVESSQSNSTA-DFLNSRRIHDAYSpaiRAAKSKSE-------------------------------
1465 >ERR1719193_348913
1466 --KLEQKDIRAIREGWACItaHpgLEKTGVDWLHLSFELQPGTKHHYKNFTNK-TLEEICQTPYMKILAGKYMSEIGILVEHLEHsnfVLMRLENLGHLHA-KMGVPMETLFT----MNIVMQHYFRELYsrqdvPDDCEGAWSKVT------------
1467 >tr|A0A1Y5FEW2|A0A1Y5FEW2_9PROT Uncharacterized protein OS=Halobacteriovorax marinus OX=97084 GN=A9Q84_13980 PE=3 SV=1
1468 -------------------NIDQFVESFYEHFFSLTPEIFELFKN-SEIG------------KQKNEFKISIHTLLINLsqlDKLDSYFKDLGIRHI-CYNVSERHYKLAKESFLYAIKKTYADHWSKVVETKWEEIIDHVTLKMKEG--
1469 >ERR1712238_458974
1470 ---------------------KELIEMTDYPTFDVEGVVLCFL-------------------------------------------EWEHHKHE-NIMTFRD---HAYKALMTG-------TMAPLHHTPWKDALEDTIESYGLA--
1471 >UPI00054DD732 status=active
1472 ---------------------------------------------------------------------------------------LTCARDF-FltfVGVERCR-PKLLKQEPQTITSKLGm-A-PMLQSAFWSIRVMRIASS------
1473 >SRR3712207_8863908
1474 --FFFQ---------AEDGirDIGVTGVQTCALPIYARPDLLdGLFNR-GNQAEG------TQQVALAGSVAAFASALVKTPEQLpEQLLNRIRSEER-R--------------------------VGKECRSRWSPYHX-----------
1475 >SRR6476659_5675031
1476 --STHRPDQALRGGGRPPHraADNNAKGAATGHRVSGRS---SPAEL-PENSMR------EQQQALAGAVAAFASSLIETPERVpQSLLSRIAHKHA-SLGIRPDQYQVVHDNLMWAIVDVLGDAVTAEVAAAWDEVYWLMGNALINQ--
1477 >tr|M3IW96|M3IW96_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) OX=1245528 GN=G210_5766 PE=3 SV=1
1478 --SLGPVELTQIISSWSKIRnKSQFHQSLYTNLIESNPQIGKIFNN--ND--------KNVISQHALIFGDCFNFVVENIQDnalLDEFLFSFVQENQRFANMATQYLEPMGNSLIRTFRKSLGNNFNSVLELMWIKVYVFIANSILQ---
1479 >ERR1719502_1452556
1480 ---LPPEQSALVRRVWQRLVgTPGAAPILVRQLQSVAPEVAALLS-DA--S-STNGRSNinrGGLhavhtdpHGRAAAVLSEVSELTELLDDsaaLRQRLRQLRAR---MPPVGPEVYPSVGKAFLHFVWEGVGSGYDNATAAAFAALWDQVEETMLE---
1481 >tr|A0A1X6PD63|A0A1X6PD63_PORUM Uncharacterized protein OS=Porphyra umbilicalis OX=2786 GN=BU14_0103s0020 PE=3 SV=1
1482 MGALSDDTVRIVKSTAPVLkvHGGAIVDGFYALLFEQHPAAAAYFNVVPTDGgGGGGGGGRGQSKAQIQRLSMAVllyAESIDQLDTLGPVLERISAKHA-SRGIPAEFYPAVGACLLQSIGRVLGDAATPEIVGAWGEAYGFLADALMA---
1483 >SRR5580704_1734515
1484 ------------APRAELATgvAPDYgSPDDVASRRSQSRACRRTLR--RPTT--------------GAVRGEMLARVIEAILDFIgerryahHLIQCEVVTHE-GYDVPPETFGIFFGVVATTVREQLADAWTDAFDEAWRTLLYDLD--------
1485 >SRR5258708_241677
1486 ------SCGEDPAGSSD-----DHDAD----VVASAGQVEGGVD--LVEH--------------PPALGVPIAAPCQWLVDLEgagacaaNRMAAERVNHE-GVGVPPAALARFFPIVAETCRDLLGEAWTGEIEAAWAGLLTRLA--------
1487 >SRR3954465_11422119
1488 ----PCRSSPTTSGRSPGAs-TRT---------------CStAtRGCW-TGPStgatrpR----------APSRSRWPGPSRsspaHWSRSPSRSpSTCSpgSRTSTTHsasprpppP-PPPPARAERGVVQDNLFWAIVDVLGEAVTPEVAAAWDEVYWLMAYALVN---
1489 >SRR3712207_885952
1490 ------------------------------------------LGR---------------------------GlladGLRAHPPGAgALQR---------PRRAAGDGVAGVggRRGENRERGRREPPPAAGAGTPGVDRAAPPGRCRPGT---
1491 >SRR5215467_2668635
1492 -----------YLHSFPT-rrSSDLPPSALYRHLFTTRPELLDgTSNR-GNQAD----------GNEQQALAGAVGafatALVNTPDRLpENl-LARIAQKHA-SLRITSRSNRLSGQGPIAPL---TEDQ----------HPX------------
1493 >SRR3954465_6877418
1494 --AtaaaTAAASSTDIRATRPASleG-------------HDRPHLDTaEAGR-AQLAD----------GEGDIEVGGVDEvvatqHLLRLHERAvGHlgpPTDARRGAGR-LQGVAAEELGTVRLDLDGELVVRLHDL-----VEDLGRRRRVLALVLVD---
1495 >tr|A0A183INM6|A0A183INM6_9BILA Uncharacterized protein OS=Soboliphyme baturini PE=3 SV=1
1496 -VILSNYQKTLLRDSWLRINktgIRNIGTMIFRRLLTKQRSIKQLFQHITVLEGvfSAGLTPIQAYQHHSLLFVELIDNAIKNIDDLsvlIPTWIEHGAKHARfkAYGFEIEYWDMFGSTMTEAAREWEGWRRHRETIRSWTLLISFIVDRLRQGY-
1497 >SRR3954463_14455484
1498 ---AQ--------------------------PRAARPSALRLSRP-GDGAP--------------FLLRAEvACLasGI-----g-----------TF-GPGLRSHPLARLGRS-----RALRGRAVLArCPPKIWSPLD------------
1499 >tr|A0A1I8CQM9|A0A1I8CQM9_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=3 SV=1
1500 MNKLTEKRCDIIKETWEIYKqdGINNTIKIFFHLFTEHPEYKYIWPQFRGIPDS-SFILSSALRNHAEVYTAGLSIIINNMHNkakMYAHIKKIAYAHV-KWIIHQSHVQNMVPGLMMVLKDKVPH-FDDSIEDAWKTLYGVIGSLLE----
1501 >SRR5258707_573086
1502 --------------------------XMILKSFKPNAAIGC-K----TIPT----------W-----FVP-LPTFTAGLTLPKLyplSVFGMRRYN--LGGLGEPH--QVEAALLWLVEKQFEGVLTREMRQAWVQFCQWLV--------
1503 >NOAtaT_7_FD_contig_111_1754_length_212_multi_2_in_0_out_0_1 # 1 # 210 # 1 # ID=13324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662
1504 --RLKPKDAEYLQDSWKVFlErsggLEGAGKEFYRLLFEKEPDLKKLFQV----P--E--------MSQAAAFMRAISRYVSLLAQpeqLKTAIEMLAFMHV-NLGISETSIFAFAESLLECVEDQLHDWDpgeVEQVMVLLTDLTTYIGRVIA----
1505 >SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold554780_1 # 1 # 420 # 1 # ID=554780_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.669
1506 --VLTSSIYlttgTVVTDFSVIVlDaegsAIEPGEAPYSLRVYFTPASTGTstatIQL----P--S--------GLISDgMLAVGARRLQEETINprrLAGACEAYGATVTSnvlTVNVrksgTASDPCDSTDAISLLFAGGMATWNslgTSVTSADFtmstnvdsdsvTYRLTFEENVFL----
1507 >SRR4051812_4293204
1508 --EPLAAEQELLGQTWSDDFefLYELGASIYQHIFNTIPETRQLFPKIPTINNG----RwceSKEFRAQTLRFVQPLSFAVNNRHDierVAEHLFIIGVKHAKlvERGFRA----EYLDCALVSYFLKIFKFkyFIv---FIGFRT--------------
1509 >ERR1719295_1797159
1510 ----------NIHVTFDLAltsDPKGFAENFYKGLLKEQPDIGQLFLD-----------KNTTFDTQSARFMAMLMHAIKMLDDtdhFTQSLDSLSEAHV-GYGVEVPMLDAFGKSLIAQVKVmnikyfeeqakggggggdekdeSLdimRvGEWTKKQDDSWKWFWSVVVGVMSAG--
1511 >GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold789473_1 # 1 # 552 # -1 # ID=789473_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.562
1512 --RIPPLKGSSLSAGWRTASSsgLS---------------------------------------------RNPRGTVSR-----ESGNTVFQSETF-AGAASPRGGSLL-C--FT--GENEPMGMINNLKT------------------
1513 >ERR1712012_1094824
1514 --SLTTSDIAAIRQSWILAkDaapFEVHGPAFYKLMFETYPSWRFAFNHMGGHLSIEVQIENTRFVKHTVTVFRFIDKCVNDLDNPtqiLENIKMVAKIHA-LQGIGVKDFIIIKAFICSKSD-KVGAGRSKNSFIFFPRFL------------
1515 >ERR1719232_197721
1516 --SLTTSDIAAIRQSWTLAkDaapFEVHGPAFYKLMFETYPSWRLAFTHIGGHLPIEVQIGNSRFVKHTVTVFRFIDKCVNDLDNPtqlMDNIKLVAKIHA-FQGIGVKDFVIIKDVVLNYFSTALGPALTDAAALGWSnfmDLM------------
1517 >tr|A0A085MKY1|A0A085MKY1_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_01110 PE=3 SV=1
1518 --------ASIIKEQISKIEvNEENGGKLYEVFFTVKPEFHKFFD-LKHAPEGKDVAHNQRFKTLGKLFLEKLKRIVMACEDehqLKEEIKGLKMDHD-PRHVGLTELKGAKPILMKFIEQQVG--MTEEQKHAWTEMFKKF---------
1519 >tr|A0A183IBE5|A0A183IBE5_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
1520 --------KHVLMEHMKRLNlTNKLGGKFYHQLFQSlPEAKSQFA---EHFDKLEDVENMKYYQQLGHSLLSLLKELPEHCDDdhaLKQEIMKIKKKHD-EKHVDAKMFKKSKPAILKFLTDNTQ--MTNEEKEAWDHLITHS---------
1521 >ERR1712025_717817
1522 --TLSPEHVDPITESAPSGKakGMVIANNLYRKLFSRHEMFRAMFPE---QS------------QQSGKMIQALPSALydfavncDNMGQMQSVVARIANRHV-QQGVQGFDGTFQFIPKKVDLsliPAGQCEAKLKVALNARQPGtgvgdrFQLHPSEVC----
1523 >ERR1719495_824226
1524 ------QDIENVRKTWEKMIakheLQGVGLVVLTAWMNEHKEIRQVFAK--SFPIIDklekdvldlVQLNDPTLNEHATIMASSFGKMIECLDDteFVQMMIDIGKKHT-GFRVSADSFDTsLNSTLITALMALSEEKEDSPNIKSWKTVVEVMKHYLKQ---
1525 >ERR1719210_734039
1526 --HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VFELKANEDLNRHGMYILGVIKKIVGKNDDteyLEKLFDDLSDLHR-RLGVEASGMDIFGKVFCKVMRPILLEkkKWKPEIKDSWMTFFSSIVKVMKK---
1527 >tr|A0A2T7P177|A0A2T7P177_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_12319 PE=3 SV=1
1528 -----------ITRSWKCFYekVCSFGVYEFLNLLTDLPEYEEAMRLI-KLTSSYKFLSAMDFNAHFLSMLTIIEKCMARLevDDlplLEDILHKVGTDHI-GRGVNPENFDLVIPPMVAGMKQMLEDKWTEKEDIAWTNFFTLMIHIMQE---
1529 >SRR6476620_7243483
1530 --MLSDTSLPVIQATLPVVgeHIEEIAKRFYKHMFDARPDLLdGLFNR-GNQADG------RQQQALAGSIAAFAGMLVDKPDEVpDHLLSRVAHKHV-SLGLSPDQYQIVHDHLFWAIVDVLGDAVTPEVAAAWDEVYWLMGNMLINKE-
1531 >tr|B3RTB3|B3RTB3_TRIAD Predicted protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54902 PE=4 SV=1
1532 -----------------------------------------------------DLIKDPLVRSHGLRFMKAIETMLEIeFDSngCIFLFSAIGNRHC-SYGIEADYLDYVPQAFRFMLTKALGNNYTDKIASVWDEILSHIIKAMQDKV-
1533 >tr|A0A2G9TV92|A0A2G9TV92_TELCI Uncharacterized protein (Fragment) OS=Teladorsagia circumcincta GN=TELCIR_17315 PE=4 SV=1
1534 -------------------------------------------------Q-KNSSSNKQAHRKT-----------------tsdTHQDL-RRTRDQP-CEKCPQSPRYHMLEPVLAVVKE-CNDDIDDETIQAWTTLYLIIAD-LIEIY-
1535 >tr|A0A2R7X9G6|A0A2R7X9G6_ONCFA Uncharacterized protein (Fragment) OS=Oncopeltus fasciatus OX=7536 GN=OFAS_OFAS019380 PE=3 SV=1
1536 ----PPVDINAVQKSWNGIKsslgdkaPEAVGKLVFENLFSNYPYMLEFFKNYGET--KEDILNNKKFMFHAKeRVFKTFDKTVNNLGNeaeLNNIASWLAEVHV-SRGIKPPDF-------------------------------------------
1537 >ERR1712018_1077981
1538 ----------------------LIGCQSFQAFFDRSPEILSHFDKFNAIEI-DGVLVSSALKMHSSRVLAIVEDMVENTGNpekIRTILQDLGRNHY-RQVKPILMhFLX-----------------------------------------
1539 >ERR1719199_1665450
1540 --------KPMIRECAAKVvqmDIVELGLRFYVHLFTINPAASAFFTKPKWMI-----------SAIFGGVLRFYVHLFTINPAASAFFTK-----------------------------------------------------------
1541 >tr|B3RTB2|B3RTB2_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54901 PE=3 SV=1
1542 -------------------------------LIKLSPATKIYFHGV-DFEkRDSYLAKNTFLRNHAARFMEAINVIIGQdMDIfsVESYFRVVGSKHH-SYNLKLEHVQDISDAFLEMARNALKKKFTKSTEAAWRSFFQMVTDAIKN---
1543 >tr|A0A1B6G4Z3|A0A1B6G4Z3_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.45438 PE=3 SV=1
1544 --RLDDNEMELIREGWKCITeSEDN----FRTAFSSKLaqknLAKVHFKHVENVSITDEGFSHEFLMSHSVDVMNTMHLMFNDIRNPeswMPEILRIATLHK-LFGVTLEDLKRFRCCVIEVLQQCLGEdGYTPQIKDVWDRVLECIEI-------
1545 >ERR1719383_1602644
1546 ------------------------------------------FGLH-L---------------------QSTMLVGNDLDpvdERG--PDHCQQALW-TASE-GRTLSHRRREPCRSVLEVLGEdVVTPEIGGAWREAVQALAKILID---
1547 >SRR6185437_4905046
1548 ---------------------------------AENPEMEALFVR--DTA--------AL--VRGQMLAVVMEGFLDFVGDqdYsARLMQIERVNHE-GLGVAGRAPRHCGAAGGRSLTHFPGKP-------------------------
1549 >SRR5512135_1032698
1550 --NMDQETLSTVDASLQRCNRdSRFLDLFYEKLLASSPKVREKFAH-TDFV------------RQKRALRSSLWMMLLVAEdeEkgPARYLRGLTAIHGSsGLDIGAELYDFWLDSLLETVAVCDP-EHDAKVNAAWERVMMVGIHYMCTHYH
1551 >ERR1719336_1989132
1552 ------------------------------------------------------------QDRKGGgGTPGKLKVTAKYNDGtefVDefntvifaigrdactakmgleGVGVALNPKNG-KVlhneler-TSVDNIYAIGDvldgkpeltPVAIQAGKLLARrlAGTSEVTTDYVNVCTTVF--------
1553 >ERR1719278_462770
1554 --HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VYELKANEDLNRHGMYILGVIKKIVGKIDDteyLEKLFDDLSDLPL-LLLQQDRPHHLAKNLPKNVHSGSLYAeppvkvaEVVEELLQVLCV-VDLPHNLL-----
1555 >ERR1719210_1454089
1556 -----------------------------------------------------------------------------------rrclgyacf----ASFHKSQ-TIlklshdrdrferqkknPQQSSSFRRCGTsmgqsesslTAANLTQAPTLRpaEWDPNMYQSL----------------
1557 >ERR1719284_537611
1558 --------------------TEEIHSEFQSLLLQHNLELLSVFNI-PRQS--------DDVIDAEteeiasHHLAGVVLAFAAHVGHVQRmrELDQLAAKHC-SHNVHPFHYVVLHEHLLDAMRKALSTMLTPEVQYSWSQSLLFFAKILID---
1559 >SRR6266536_2537548
1560 -APLSGREREIAMLAAAGLASKDIAERLYLSVRTVNNHLQHAYTKLG-VSGR------AGLAEQEIKFAEKLTEIVramPRLDELLTHTRALGARHV-SYGVRAADYQTLGNALLAALAAVLGGSFDAPTREAWTLAYNLVAETMLDG--
1561 >SRR3954465_13942299
1562 -HPLTGREREIAMLAAKGILSKDIAARLSLAVRTVDNHLQRAYTKLG-ITGR------DQLADVLAHDTTTHPGPX-----------------------------------------------------------------------
1563 >SRR5699024_12637729
1564 --TLPKGDHPLV-----LVsaGIGCTPMVAMLHRLVETA--------------------------------RERQVLVLHADHTpEEHAX------------------------------------------------------------
1565 >ERR671932_89059
1566 --S-PTSCGPARACRSCCCtpTPPRRRSR------------YDgVHEG------------------------LMDLSSFPLPDD--ALFYLCgplpfmravREQLL-DLGVSPRDV--qyeVFGPDLWQADAdeGPGDAPEPgahdllgpEERQGPPPA-WSRPG-------
1567 >SRR3712207_7345787
1568 --V-LDDVRALPNATVHVWyeSGAASALP------------VDgVHAG------------------------TMDVRSEEHTSELqSRQYLVCrlllekk--KTI------------kyeSTXX-------------------------------------
1569 >KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1083625_1 # 3 # 881 # -1 # ID=1083625_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.686
1570 ----------------------------------------MEYEI--------------CLEPSGIRFMADAGQNIVEAAKqhgIpIKHGCASGScgDCK-GTILsgDSEQGPFMPLLLLPTERAA-G-------MAILCKLYP-RSDLRL----
1571 >tr|A0A044RBY2|A0A044RBY2_ONCVO Uncharacterized protein OS=Onchocerca volvulus PE=3 SV=2
1572 --ILSEIQQELIRQSWQTISgklevtEQCFGFFVYRRVFERNASLKQVFHV-EEYDSLESVPNEHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAaesFNEETVRLFCSQVVCTVVDLLETDIDPSCMEAWIDMMRYIGCKLLDGF-
1573 >tr|A0A0R3RKB4|A0A0R3RKB4_9BILA Uncharacterized protein OS=Elaeophora elaphi PE=3 SV=1
1574 --ILSEIQQELIRQSWQTITtklesnKRNFGFFLYQRVFKRNSMLKRAFHV-EEYDLLESVPEKHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAeeyFNEETVRLFCSQMVCTVADHLGGNVDPACMEAWIDMMRYIGCKLLDGF-
1575 >ERR1719384_507171
1576 ------------KKCWNELmkDKVNVGERIFDYILTKEISMSKLFMQ-------------TNIEQQSGIFMVMMDKVVGFLDDkesMNDNLIKLGQLHVEKYGVKTKHFKHFRAAFLKAIKKYLP--WNDRREEVGSSFGLELLIKCRC---
1577 >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1216141_1 # 2 # 73 # -1 # ID=1216141_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.347
1578 ---FPDGVCMATIELTVLPvRpled-----DEKFQIILSEAQGGASFNPNDD--------------G----GKDDGvlTIVIKNTLQDpkgLKVLVESFGFQHL-DFDLTVPRVVVFRDSMVELMEAELQDRFTYKAKDG-----------------
1579 >ERR1712214_179591
1580 -------------------------------------------------------------PGHAgRREGRRSARQPGTGKDRqksTKYLLELGKFHR-FSGIPNDYFGVMGTIFVHAVRPYWEEagCASEQTEVVWMMLFAHIARVMTH---
1581 >ERR1719458_2209728
1582 --HLSDEHKTLVIDSWDFVPgfISEAGYKAFTDFVKLCPYYAEAFPFVKKKEEEF-SHLLCEHARKVTGEFGLLAKLISELKTkppeksndqvIHDIMVPLGRRHV-AF--------------------------------------------------
1583 >ERR1711928_171062
1584 ---VSATQESHP-------------------------------LDLDSHE-IQQQRRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFET----LCFRWIQHD-----------CQQYGX---
1585 >ERR1711928_123369
1586 -------------------------------------------------------RRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFES----LCFRWIQHD-----------CQQYGX---
1587 >ERR1740128_75568
1588 ---VTAQEKTLIRATWDQMMfNSEVAPKFMLRLFSEESQHELGgnFaVEHHLVP-GGadegLLLGSNDGFSNTLDVRVG-----------------------SHlLGNDAi-------DVVHDVFQCFLGGSIGRGDlfnglHHNMGRFVQLVDGX------
1589 >ERR1719219_701605
1590 ---VSAAHKSLTRSTWTLMKfNSNVAPKILYKMFTTYPET-QKMyTRLADIP-ASQLMENKQFLALSHSAFAGFNMIVNNMDDPELIKLQLSKVDFPGtFVYPFpgtsLNTSKPPASSWKYSPKN-SAPLSPRKPLPLELPFELRHQGFGK---
1591 >SRR6476646_9453568
1592 --PMLRTRLQLAEASYHRCAeSGAFYNTFYTHLLASDPRIPPMFAR-TEF------------ERQHRLLKHALGLLIIYAKHAnPAMLERIAQRHQ-EIGVLEDLYPAFVESLVLAVAEH-DPEYTPELADAWREALAPGIAFFIKRH-
1593 >ERR1719347_2568912
1594 --------------------------LPPPTHFLPLPGINRKVRIFQRQFgnQTSEFLTGKALRDHSIRVMDALDSVIVDTlKgkDIHKQMVDIGYSHL-KMGVEPRQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED---
1595 >ERR1712189_147645
1596 ----------------------------------------------KPDF---RIPDWKSTPRSQHQSHGSLDSVIVDMlKgkDIHKQMVDIGYSHL-KMGVEPKQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED---
1597 >ERR1719412_2466027
1598 --NLRPLDVTNIKESWHSVEqqLVEVGIRVFISLLENQPNIKRTFRKYRSKR-HSELRINEDLQKLILYLICGLKRVVKYLNDnkaMGKYLRRIVKKHS-PTEIDFTRINpaELSTVFCSAIKDIVdahqaasaklqsvsetsspectspSTCWTIEVEESWTTLFGSLLNATR----
1599 >ERR1711860_392201
1600 ------------------------GVHVFLVLFESQPQMKRIFRSYRGKK-HSELRLNEDLQQLVMYLISVLKKIVKYLEEsrtIVKYLRRIAKKYS-SPSIDLARFDphILTPIRVRRRHLFSresivfekRLKWPQK---------------------
1601 >ERR1719266_3067024
1602 --QLAPNDIANIQSSWTLIEpiLLKVEMAWLLLFRHIAGFMRNGYNSVV----TGPL--------------------IRHTTNcatS--TSSRMSNX-------------------------------------------------------
1603 >ERR1719264_357726
1604 --EVGLCDALNIQQVWPRIEqyLLPVGTRMYISILDGRCDKIIFCNKACCRKNasksssakstrsvysksvsrtcPNQVILNEELQKFVLLLMGLIRRAAKHLDNpshSAKVIRKVTKKrFG-KLNIDVTKIAfePIALNFIASVREIMtnTRHWNTETEASYYTLIRNLIAYVQ----
1605 >tr|A0A2G2R4B7|A0A2G2R4B7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_07540 PE=4 SV=1
1606 ----------------------SASDKFYNVLQNDLPEFTQLFTN--PE-------------KQHMMFYAALRSIDGLKDNktkLAVYLRSIGVKHK-MLGLTHYHMEIGRNAFEQAIFA-GGKDLTHDQRQFYIDSFSQIEKNM-----
1607 >APLak6261687352_1056175.scaffolds.fasta_scaffold62437_1 # 2 # 238 # 1 # ID=62437_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.447
1608 -VRFPKDVIEEAQQAWMSFtmasTKEAAGEALYSAIFHAAPSLQSLYKIPR--------------PTMALRFMNSINAAVAIAHRpsaLKAQAEALGFQHF-DIDVTPSRGDIFREAILEVLDMELGSRFTTRARMAIGAILNYLIGANI----
1609 >GraSoiStandDraft_15_1057317.scaffolds.fasta_scaffold2262553_1 # 37 # 405 # -1 # ID=2262553_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.610
1610 -LQLSQSELFALGRSFELLlqglgnDRDRVGDAIYGAKTANLVVFKDKFITPR--------------AVLSLALFNGFRVLGHKSADpeeLRLFVETMAFKHL-GLDITLQRVTGVTDSFLELCQQNIKD-MPPGSLLAWRKLMTYTGSCFR----
1611 >Go1ome_3_1110792.scaffolds.fasta_scaffold06098_1 # 3 # 227 # -1 # ID=6098_1;partial=10;start_type=ATG;rbs_motif=AAA;rbs_spacer=15bp;gc_cont=0.524
1612 --VLSAGELAAARAAWDLMKDnVKVAESALVKHFVLHPPVQKLIPALADVP-ISELQGTTCSTPSPTRRC--ASPTTX----------------------------------------------------------------------
1613 >ERR1712142_1087278
1614 INALTETEVKVIIDSWDRIHPDKGAKMLFHQFLTDFPLMKIYFG-YQETESVAEIMESEQIKTRCKVVWDVLTKIVHASGDggkLAELVKEVSVKHL-NFNREKKDI----HCFLHALKVTLTC-FSGHLFRPWNIWCKMV---------
1615 >tr|A0A1I2S201|A0A1I2S201_9CORY Uncharacterized protein OS=Corynebacterium spheniscorum OX=185761 GN=SAMN05660282_00995 PE=4 SV=1
1616 --------------------SGHLEPELQLQLYARHPNAQWLLRAG---------------KAVPAELVELSIHAIAAADAegaldalAEARIRDLGLAQR-RFGFPSELYQDIQEIMVSLLRTTGAD-LPFPVEFAAERTIARVCVLLQE---
1617 >tr|Q8NLZ4|Q8NLZ4_CORGL 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases OS=Corynebacterium glutamicum (strain ATCC 13032 / DSM 20
1618 --------------------AQDFLRAVQAKLLTLAPQARGHFPTA--D------------DATHISIAEMVSALLEGTGEegkvddkTLEFFKEAALDAR-RFGLTPEMHSALGEAVRSELLSLCED-LPFENVLFAERAIAATTAVSVE---
1619 >tr|L1MAU4|L1MAU4_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium durum F0235 OX=1035195 GN=HMPREF9997_02488 PE=4 SV=1
1620 --------------------PDLFRTLAQRYFLDDCPEARFLFPTD--D------------STAHADLAAALIFVFNHSNAdgsltpkLVSILEQLGRDHR-KFQVADNHYERFGNALNRALKIVGAHAptYA---ITAAEKAITATLETMRR---
1621 >tr|W5Y4C7|W5Y4C7_9CORY Putative oxidoreductase OS=Corynebacterium vitaeruminis DSM 20294 OX=1224164 GN=B843_11695 PE=4 SV=1
1622 --------------------REELSAIAFDMFFATQRDARTRIRA-------------------TPAIADALTLLARSCDSegklpldVEKRFLQRATTLC-AHGLRVDDLEPLAESAHRAMLITAGG-QPFELVLPIERALQQLARTVVE---
1623 >tr|A0A1W1UZL1|A0A1W1UZL1_9CORY NAD(P)H-flavin reductase OS=Corynebacterium glucuronolyticum OX=39791 GN=SAMN05660745_01670 PE=4 SV=1
1624 --------------------SPEFHEHVRANFFDKCPETMLVFPLH--K------------ENVHADLGRVLSFVFDRTPVdghltdeMRTLITQLGKDHR-KYNVSPRYFHPFVECLRDSLLTLCSD-LQFKYLNGADTALGEVSTLLAR---
1625 >tr|U3GX34|U3GX34_9CORY Uncharacterized protein OS=Corynebacterium argentoratense DSM 44202 OX=1348662 GN=CARG_08960 PE=4 SV=1
1626 --------------------LSHFGDLAHSALLRRAPGLIS---FF--G------------PNPHTELTTAVLFILTHSTPgpqdsgtqtplspridaaGAGALRALATEHV-AYMpPDPALYLAAADALCEALRDSCAD-QPFQQVLAAEKALREACSLMAT---
1627 >tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae OX=1717 GN=mphP PE=4 SV=1
1628 --------------------VTAHSIQAVADElraHRAEFIQAANQKP-------------------DSPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD-LPFAEVLGAERAITSIANTLT----
1629 >tr|C0E6D0|C0E6D0_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium matruchotii ATCC 33806 OX=566549 GN=CORMATOL_02563 PE=4 SV=1
1630 --------------------GDGFSREVFTTYFRYVPDAQLIVSP-------------------DYPLGDALVGLFHGSDNegnlypeTIEHLRDVTEILA-AHGF--RRYRPLADAISPVLDRYCLD-ISAYDVFIIKRAVRQAAEVMDE---
1631 >tr|A0A0G3GTQ0|A0A0G3GTQ0_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium epidermidicanis OX=1050174 GN=CEPID_01535 PE=4 SV=1
1632 --------------------SPAFRRDVLRDFFSQHPHMRLKFAAN--E------------DHAHTELVFALTYLLENPTD-PELIRTLARDHI-KVSPGQEVVADFFAILHRQIHRYCAD-LPYEEVRQADLKLQEIA--------
1633 >tr|A0A0F6R111|A0A0F6R111_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium kutscheri OX=35755 GN=UL82_09495 PE=4 SV=1
1634 ---------------------------MVASHfYADVPLARLSFRL-------------------QPSLVDTLIAGLSHP--lNITAW---AHDLA-HRGVDRSFYVPLSAALQHAVCHICSA-LPLVDVLAVEHRIDQIMKQLLA---
1635 >SRR5580704_16882803
1636 -------------------------------------------PG--RH------------GCAAPAFLPGAQPYRRCPRgpegPRQPRALSAGTRAR-APKFGERHYEVFRRALIATLQRFAAPRWNETAKHAWETAFNHAATVMIE---
1637 >SRR5690348_1231357
1638 --------------------------------------------------------------------------------arapevrrPRAPLRG------G-QAGADRHASAVCRAELEP------------DRQARMGDRVQPRRRIMID---
1639 >SRR6476620_5060594
1640 ----------PAQVSFWLLEpvADAAMTYFYAQLFAKATWTDREVY-----------------ISGPDHMIVKTA-RVLRERgapdRLIHYDLD-----------------------------------------------------------
1641 >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 OX=582737 GN=TSPGSL018_8354 PE=3 SV=1
1642 ----SSKIITLIEKSWAFVEsrcdLMEVSNKFFERLFQRAPALQNMFTKPK--------------RVQYVMLAKALDLIVRSAGEtkvMNEDIKAIALRHI-KYDIRQEHLNVFGSVLVETLANSVGPeNWDEDISAAWASIYGNIAAVF-----
1643 >tr|A0A1Q9C6P6|A0A1Q9C6P6_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene41206 PE=4 SV=1
1644 ---CVCDLAQCRGRSWAAFFvdi--------QAAYYETSRS--LLFEGP---S-----------QDP----------ALVALQLpahVQALISDGALQGL-GI--PQEHIALLQDCvecsfwtftgqtqqvmatsgsrpgdgladvlFGALFAVILtcLEAKCQQCGLVHQSMSDALGVPDR----
1645 >SRR6476646_8240181
1646 -----------------------------------------------------------------------------NINLLF-ALNRHTCPNL-I------------HEPASEFfFGLQRPATH--HEHIRVENIHHL----IK---
1647 >SRR5688572_19725352
1648 -----------------------------KNLFELNPALRPLLPE---STAE-----------QDRLLTRLLNAEAGALAGTRPP----APRSAEGHGNEgTAPCSVAGEALLWTLQEAYGADFTPQARAAWEALYRFVTGTTKSAP-
1649 >ERR1719229_1707680
1650 ---------------------QQLGVLLFANLFKKQPLCRNLFAD-SDI------------SKQSLRLLDMFGWLLRSLVKeknqMrLRTLKSLGDRHV-KYGIKIEFFGPMLDSLSDALQDWFGTNYNTQTRVALTTLFQSACNEMMKQ--
1651 >SRR5512139_12076
1652 ------TDLELIEASIEQMlDlETEIIGDTYARLFAHCDGARALFGP--NTYG-------P--RAQ--MVN---ETIIAGLDLLrgepwvHEYMTQHGVRHRHSYEVTDAMYRTYAESLLGAIRERLGDRFTPELEAAWS---------------
1653 >tr|A0A2E3FAX6|A0A2E3FAX6_9RHOB Uncharacterized protein OS=Rhodobacteraceae bacterium OX=1904441 GN=CML69_02715 PE=3 SV=1
1654 ---LPNENLELIRHSFPLIFqhKAEITTKFYEGLFRDAPELRRLFSK--EMNVQ---------KDMLVSVLTTLAKA--SFDEglVESMIARMARVHS-GLGITSGQFRTGEAALLSALDQSVGDLLSETTLDAWKTAVRRVISAMID---
1655 >tr|Q9NAV7|Q9NAV7_9ANNE Dehaloperoxidase B OS=Amphitrite ornata OX=129555 PE=1 SV=1
1656 ---------------------RTYAQDIFLAFLNKYPDEKRNFKNYVGKS-DQELKSMAKFGDHTEKVFNLMMEVADRATDcvpLASDASTLVQMKQHS-GLTTGNFEKLFVALVEYMRA-SGQSFD---SQSWDRFG------------
1657 >tr|A0A0G3G1X4|A0A0G3G1X4_9GAMM Uncharacterized protein OS=Thioalkalivibrio versutus OX=106634 GN=TVD_07385 PE=4 SV=1
1658 --------PPNVESSYRRCcADASFLARFRLALRAADGQVSGIFDP-LSA------------RQQEVMLDASIRAALDFSSGdpqGASRVSEMIHVHGRQgrVPVPPALYPVWLESLIQAVRETDP-HWSDALERRWRAQLMPAVDMFVELYL
1659 >ERR1719187_3161387
1660 --ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFINdFIPSQMADKATPNTIKAWEKFMTVFIEHVKE---
1661 >tr|A0A2E0SIT0|A0A2E0SIT0_9PLAN Globin OS=Planctomyces sp. GN=CMJ46_04905 PE=4 SV=1
1662 --PVSMTIVDSVRESYARCrQNPDFFDAFYDHFARKSSEIGPLFSN-TDMQ------------KQNELLSDAIDSLISFSEGdvaARRHLDEIALSHDReHLNIKPEWYPLWMEALRDTIHESDP-GATTQLLADWNTVLQPGVNHIVQQH-
1663 >ERR1719487_109746
1664 ----------EIEISHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALAMRHI-KYDAPPEFFGDFESAMLKTLAQSLKSTFTEAVKEAWKAALQFIASTIV----
1665 >ERR1719327_803055
1666 ----------EIEITHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALATSTC-ASSGSRLA--PRPSSTATSI---GRSPFRCRX--------------------
1667 >ERR1719356_1095802
1668 -------------------LMRDIPNTIVALFAI-TVAVfeddySSMLDQ----P-------FlliAVLGFVTLTvilLLNLLIAQLNTTYV-RIYQEVFGWALI-TRGNQIVEV----LD-ACPMS-VWKPFLETLGLDERLE---FNEGDIG----
1669 >ERR1719326_1696685
1670 --------------ASSTQikeLFADVDLS---------------IHA----P-------Ifa---------sTLQSTISSLNNPTELLPLLEDLGKKRI-KYGVQEEHVVAASASLIFTLK-SIDDQWSPQVEAAWTEACNVMQNVAS----
1671 >tr|A0A0N8ALQ3|A0A0N8ALQ3_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
1672 ---------------------------TKARLN----NCMLLFSE---------K---LAAFLaQASPSWPVWNVVIHPCfs--qelMANQLNVLGGAHQ-PRGATPVMLEQFXXXXSPPSSSSSSRKP-PASRNSSPN--------------
1673 >tr|A0A0P5ANB1|A0A0P5ANB1_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
1674 ---GGNDGVETVSDQSNLFVVfAI-FGQGIDGNASEFDEVLLGAGSLLEEL-DEDGGNDGVAVTpDVFPaglniadlVGGQFSLGISQIfgflevlgdASdqsAHTVLPGLSGL-G-VEGAAQRFSKDFLSDVTELLEHDGVSSFNAEARQAWKNGMRALV--------
1675 >tr|A0A0P5ESR8|A0A0P5ESR8_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
1676 ---------------------------------------------FLEDA-SELLEHDGGSS----TGFMGTTESVQLVghqllaeqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGNIGELAEHCLVL--GVGLDEA-EEDLGSDISV-L--------
1677 >tr|A0A0P5I7S0|A0A0P5I7S0_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
1678 ---------------------------------------------FLEDA-AELLEHDGGSS----TGLMGTTESVQLVghqllagqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGKISEGLEHLLVL--GVVLDE-TEEDLGRHISVLL--------
1679 >ERR1712168_1063860
1680 -----------------------------------------------------------CEKAPPIPDCTSSNTVMMRLFKrdpeVAKLIYDVGVQHQ-TRNINEDEMTKMSKSIYSAVQDINVGPHSDKELAALHNLLEVVSYHFKRG--
1681 >SRR5690349_6204932
1682 -TILTDEHRHFIRTSWEKINkrheKTTLGILMFEKVFAFLPDLRNVFGL-NDSS-VSETDRNENFRRHTSLVVNLIDLIIRNIFEmeaeMGPVLLMYGRRHFLKHDLVFQENQLVafAQGLCEFFEEEVDHdddnSLASETKAAWNIF-------------
1683 >SRR4051812_9455799
1684 -GTLTPLRCQLLQKSWEAIIakygMFKPGMIMFQNIFKIQPELMEIFQI-SPEK-LGNFGDlPDEKFRHGRIFTNVLNLSVKNCVEleteVAPVLHLYGRRHVSKHNVDMAHHFLLvfAQGITSFLINEVK---------------------------
1685 >tr|A0A1Y3EGL3|A0A1Y3EGL3_9BILA Globin OS=Trichinella nativa GN=D917_02219 PE=3 SV=1
1686 --FLTKSQRQNVVRSWEKVpNKRALGEEIYIQIFMHKPMLKSLFP-FRTVP-VDQLRNNALFTRQAAIFADFIDCVVGYLaiNNgnlIMELSERVGVNHALMTSVnfDPEWWVLFANSVLDCIRQYCEPKFiclpisrhiTRKIMIAWRILLKEVVDRMSEAF-
1687 >SRR5260370_37911868
1688 --------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVLTRIVERHSRpDLAVPPALYAPFVDSLIATGEQHDP-AFTPEVEHAWRSTAQTVVAYMTSRSX
1689 >SRR5229473_1098235
1690 --------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVQTRIAERHSRrDLAVPPALCAPFVDSLIATGEQHDP-AFTRRWNTPGGAPPKRS----SPTX-
1691 >tr|A0A1I8EE37|A0A1I8EE37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1
1692 ---LSKSQRITIENSWKRATksnaREQVGIQLFARILTARPEMKHLFG-LQKIP-EGRLKYDPRFRRHAIVFIKSFDYIVKNVAykeKLEQHFQALGERHTIlqGRGFDPGYWDTFNDCMRQTVS-LWGKDKDHRTANTWHTLISFVLQNMKIG--
1693 >ERR1719264_1394560
1694 --------ISVVAANFKTVKSnQVLANTLFEHLFELEPSSKALFES-KDL------------TQLKTKFAGFIGQGLKMLqgKNAKKSSGSLPRCTW-RWE-------------------------------------------------
1695 >ERR1712226_1819570
1696 ---------------------------------QYDPSSRQVFEN-SNL------------TEHKQRFIGFIGKGIDTTiEGDREEWKDLVDMHV-DIGVTFKHFLAFEDAFLNTLHDLYADTFSDELLCAWIYVL------------
1697 >ERR1719326_1666808
1698 --------LDIVTKSYETVAAnSTFADILFERFFSYDESAKKLFGN-ADM------------ATHKKKLVGFIGKGLKMAqsSDPDGEMRKMAAFHK-EKKVEISHFIFFEESIIYALRGTLGVAFQDELADAWTLVI------------
1699 >ERR1712071_441310
1700 ---IRRQGEDgrqrpvrhrqrtqrnpqtrlLSLESWTQKDrSPERPSQqvvghpkadccSSNRRFSHPPHGRRRPPW---LP-IQDANRLRAFPHQLHHQGRELP-----cRD--pKLsrX--------------------------------------------------------------
1701 >ERR1719432_409132
1702 ---LRHQEHRrarrfrqqqerCPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPRIADVA-VSDLMNNRKFLSISYSAFAGFNFILNNMDDPEI--IKLQLSKV----DFPGMfvfpfpgtsqqHQ---dtsr-IVLEVFREELGAAFTAEAASGWTSLLNFVSQALIK---
1703 >ERR1712179_658195
1704 ---VSGNSK-nAVRATFDQMRfNSEVAPKiml---KLFTAYPETQKMFHRIADVA-VSDLMNNRKFLHQLLCL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgyFSHR----IGSVQGRAGRRlh----RR--SRLWMDFSAELRQP---
1705 >ERR1712137_151953
1706 ---LRHQEHRrarrfrqqqerRPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPPHrrcprlgfdeqPQIP-VHQLLCLRRIQLHPQ--QHG---RSRDHQTPT--------VQG----RLPRHvrlplpwylsaAP---gyfs-HRIGSFREELGAAFTAEAASGWTSLLNFVSQALIK---
1707 >ERR1711946_32375
1708 ------------------------------------------------------DEQPQIPVHQLLFL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgtYPPS----HSNHTARERTAfqvlFLPQDT--SRIVLEVFRE-------
1709 >ERR1719222_1795957
1710 ---VSAKAKSLIRDSWVQMKfNGEIAPKIYLKTFAAHPKTLAMFPQFAKVP-NRVRPHPYEpLLATAGIDYDVKLWIPSPGSEHNInveELMARNArmleetrDTI----TVPATfmirmlas--------MSNFRR-AGNRSTNDE--------------------
1711 >ERR1719222_245222
1712 -------ARSlgrtqesHPLDLDSHEIqqqRRTQNPLQDVHHLSRDPENVHPFGRYTR------------FSAHGEQTVLGFESLCFRwiqhdcqqYGCSRA--DQVAVVQG----RLPRHfrlslpwhfsaTRANPRIILEVFAEELGSTFTKEAAAAWNSLLNFVTKGLEN---
1713 >ERR1711911_103569
1714 ---------------------------------sraDQVAVVQGRLPRHFR---------------------LSLPW----------------------------HfsatranhphhlgsIR--RRTRLHFHQGSRCrleLPfelRHQGFRKQHRRLATHR---SRP---
1715 >tr|A0A0B2VDB7|A0A0B2VDB7_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_13543 PE=3 SV=1
1716 --SMNDDTKGAICEQWHTILalydgdISRVGVAVYQRIFDAEPQLREVFGIPSFV---TDLSEYEPFQRSGKLFMSVVDLCVRNIYALdaemGPVLVMYGRRHYHQqsRGFHLRYMPIFTQCMKEFVSDCLNEKQkTSDSEDGWSLLFDYIAAKIVDG--
1717 >tr|A0A0N0P721|A0A0N0P721_LEPSE Adenylate cyclase-like protein OS=Leptomonas seymouri GN=ABL78_2595 PE=4 SV=1
1718 ---------FTVQGTWNILEkegmLERFAQQLYDELLTQNARLRVYFYGV-DL------------DEQSKSLVRMIGTAVHFYEKpqvTVEMFTKAGARHR-GYGVNGEVFEEMRDAFFRVFPKFVGADVFSAAEEEWQKFWKLMLDLLQH---
1719 >tr|S9WKS4|S9WKS4_9TRYP Adenylate cyclase-like protein OS=Angomonas deanei GN=AGDE_06844 PE=4 SV=1
1720 ---------NTVLHSWKLLEdggkMDDFGDALYADLLNSNPYIRVFFYGV-QL------------SEQPKALMRMLGTAVYSLNNpnkVDDLFVKTGAKHR-GFGVTTETFQSMETSFFKIFPEFIGEDVYEKTKKEWHDFWKYIIKKLDQ---
1721 >tr|A0A2C9KGE7|A0A2C9KGE7_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 PE=3 SV=1
1722 --LVTDSDIQALRSSWATLTAgpdgrNVFGNNFVLWMLKTIPNMRERFEKFNAHQSDEALKNDNEFVKQVKLIVGGLQSFIDNLENpgqLQATIERLAAIHLKmRPSIGAGYFGPLQNNIHDFIEDTLKVGADDAAPKSWTRLLTAFNDVLNSY--
1723 >tr|A0A2E2XNM9|A0A2E2XNM9_9GAMM Uncharacterized protein OS=Cellvibrionaceae bacterium OX=2026723 GN=CL693_20675 PE=4 SV=1
1724 -------DIDWIESSLELLAphADRLGGLVYPRFFVHFPEAETLFGG-GELG-----------KSTQESMIVPLLMGLKDIADGKtymLTIERWLEDHR-EYGVTLPMYSVMLDSLLLGMREAVGDLWTTEMDGAWQEVLARLLLLVEGVY-
1725 >tr|L7L9M1|L7L9M1_9ACTN Uncharacterized protein OS=Gordonia hirsuta DSM 44140 = NBRC 16056 OX=1121927 GN=GOHSU_25_00750 PE=4 SV=1
1726 -------IRQAVLESLARYEesHGDPTRAIYERFYRVHPEAIEELAF-D--------------TVLENRMMAGILALLADVADGSidpGGAVYWVSDHV-AWEVSETMIMGMFGAVRDTVREGLGPEWTARMDADWAGLLAALAPAMRDAV-
1727 >ERR1712232_1039451
1728 ---------------------------------------------------------SEEMRTHATKVMTFVGNGVASIGNPEkcerfrAECIALGKKNQ-ERGISSQDYDIATQPFVDAVEHSwlqagwrqtdaSGSIWPPGAQGAYTKFYGHMAATIKDG--
1729 >tr|A0A0D6M6J3|A0A0D6M6J3_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=ANCCEY_05408 PE=4 SV=1
1730 --------------------------------MPSCVRTAVTLP-----------------YLEIFEPFVVIEGAVMSLDNlpaLDPILDNLGRRHG-KLEVNGKfrtyYWSTFLECSICIFRKTLTN--------------------------
1731 >SRR2546427_1691122
1732 --------VVLLQTTFLRAAemrigKRNITDFIYEDLFLKRPQLKPMFTN--Q-----------V--LQRHKLGKMLGSIFIHLRDqdwIDEHLRDLGAMHW-RAGATPEVYPWIKDSVLAVLEEGMAPsGWNLRCQREGAGALGVSAQGMLMGY-
1733 >tr|A0A183IHG0|A0A183IHG0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
1734 --HFSLREKELLSVSMKKLEqlEEDNAVKIFIRLFQENPAYKSLFPKLRFMG-DADIVNSTALVAHTQLILKMIKTFINGFQNestCAVVLKRAETAHR-KFDIKPSQVSTLFPILMEILDIS-----HNETQAAWKKLFETFSI-------
1735 >tr|A0A1B6JRB7|A0A1B6JRB7_9HEMI Uncharacterized protein OS=Homalodisca liturata GN=g.2446 PE=3 SV=1
1736 -ASLTDRDLRLGRATWFKNvDaTPDFGMVIFKELFRQYPDVESYFLHLRGN--AGSIFDSRTFRSHMTeRVVPKLKEVFEALDKpehLNEVMTKLGLYHA-KLGVSGHLVENMLSVILDALKSVMHTKMQPDEETAVRTC-------------
1737 >SRR6185369_2033738
1738 ---------------------------LRRVFI-QVASDRSDVSK-TNF------------KFQKLMLRQSLLEMLCfdrGMSGTREEIERLGLRHKV-LGVTPEMYAMWLDSLCEAIKQHDP-SYTPELEQLWRVAMLKSIKE------
1739 >tr|A0A0P5RQ13|A0A0P5RQ13_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1
1740 -TKLTPHQIRDVQRTWEHLRanRNAMVSSIFVKLFKETPRVQKHFAKFANVA-VDALPENGEFNKQIAPVAARLDTIISAMDDklqLLGNINYMRYPHQPPRAIPRQTFEDFARLPIESLEAS---GVSGDDMDSWKGVLTIFVNGVSMRY-
1741 >SRR3954451_11513015
1742 --AASPCAQQLRQGCRDRPA---ACQLVLSSGVRDRPGCEIAVQ--GRH------------GEAGPQADGGADGLIDAIDRLDTI--------------------------------------VPAVEAAWTEAYTILATTMKD---
1743 >tr|A0A1S3CW24|A0A1S3CW24_DIACI uncharacterized protein LOC103506299 OS=Diaphorina citri OX=121845 GN=LOC103506299 PE=3 SV=1
1744 --GLTPKMVGLLKCLGVAIKPeaHRHGVNIFKKLFLMDKTVQRMFPKFACD-DMCGLDENPDFHKHVDAVMKSILYMMESSGsvpDMKSTLALQVKIHK-DLCIPDRHFITFGYAINEYLKETLGAKYSEDVECAVAYFWKFVASEMTAKP-
1745 >ERR1719244_808981
1746 ------------------------------------------------------------------------------------KAPRTRRPPRAALQRENALFQALSRAFLKAIKVYLP--WSDRREAAWQLLWQRIITQMTL---
1747 >tr|A0A2T5C1R0|A0A2T5C1R0_9BACT Hemoglobin-like flavoprotein OS=Mangrovibacterium marinum OX=1639118 GN=C8N47_108138 PE=4 SV=1
1748 ---MTEADITVIEKSYAQIEAalPRMAKYFFNRANELDSDLDPLFEE--DK------------SKHGEAFVALFGKAVEHLNSPealLPEIKKMEAKLK-YYKFNEEVLNTVGVVFVDTLSFGFGNNFTQDIIDPWVKAYKTYSS-------
1749 >tr|A0A1Z4LAZ9|A0A1Z4LAZ9_NOSLI Nitric oxide synthase oxygenase OS=Nostoc linckia NIES-25 GN=nos PE=4 SV=1
1750 --AVPPELLLKMADSWQVMsqNKQQMGIEFYQMLFEKYPFVLPIFGR-ADMD------------YLSLHLFQALEFLVNCLKTgssdeMLRELRFLGQVHG-SADVPTCAYPAITECMIALMERHVP-DLTPQVRQGWVTLLERVINIVK----
1751 >tr|A0A096P8B0|A0A096P8B0_OSTTA Flavoprotein pyridine nucleotide cytochrome reductase OS=Ostreococcus tauri GN=OT_ostta17g00030 PE=4 SV=1
1752 ---------------------------------------------------------------------------------------masvgsgat----DDD-GVDVPVSRCPFAhGTVTVDPYPGYVH-G---KNPRVCPRGCVPRPPSKP----
1753 >ERR1712071_238239
1754 -----ERSFTYWKDSAMMELa--------KWNARLQTPR----------------VYEVKwRRKKRNIPGRVGWRVLGAELWVRSSCRRRIRNRPYQEYFVSyvsiSQQLEETARLIIDALDEELGVRFTSYTRGVWSR-aFHFANSIMAESF-
1755 >tr|A0A2D4BL26|A0A2D4BL26_PYTIN Uncharacterized protein OS=Pythium insidiosum GN=PINS_002968 PE=4 SV=1
1756 ---------------------TTLYDVFYAHLEQHSPELKPVFRS--SV------------HIRGKVLVHISVGMRTLIASenFVDKVLPLTKTHR-RFGVKPEHYEPLGRALLHAMQVVAL------ITRDRGRVEEPTSIILI----
1757 >tr|G8YSE7|G8YSE7_PICSO Piso0_001107 protein OS=Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL Y-12695) GN=Piso0
1758 --EITEQDIYRLSSSWNTIHtnsryhNDSFVSRLYANLLAANPKLLPVFSG--EN----------GLQEHSALFGELLSLTMIYLNDmptLKICIAAYARENPLFTEQCCEIVEPMGSALVLTLRQWLGKgVFDNELQELWIKVYVMLANTLL----
1759 >ERR1719431_737524
1760 ---LDMSQISDLQRCWSTLQlhmgEQAIAAAFYNDIITNFPSIQKYFKNIWTESTFtRTIGNMNDVRKHASLVVSRLTNYMGNLHHLsevNEDLKELGMIHAARYHITEEVVEQFVSSMATTVADLLTKedLFDPVLCGAWKRFFFMILTFLSEG--
1761 >tr|A0A0G4H5Q5|A0A0G4H5Q5_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) OX=1169540 GN=Vbra_6604 PE=3 SV=1
1762 ---------------------SEIGIVFLHNLFSNAPTLQKLFVR---PS-----------ATYGRIFGQILKMLLAHLDDPAEvwqNNKELALRHI-KHGVRPSHVPLFSKLIVETFASIGGEEWTAEHTAAWQALWEVTGSELT----
1763 >ERR1719431_2380502
1764 --ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFiNDFIPSQMADKATPNTIKAWEKFMTVFIEHVKEG--
1765 >tr|A0A1W2GS79|A0A1W2GS79_9BACT Uncharacterized protein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4044 PE=4 SV=1
1766 MKDLNIRERKNIRDTWKVLAPniHEFAFSFYSNLHSLDSSLVPLFEN--EF----------GIIKQGDKALYVLGFVVASLDNLmvaregiKKALEGVFMEHQ---HIKRADEQKVMKAFLQAMKSTLRGVWTNEIAISWYRLLSLISAVSI----
1767 >tr|U1JU51|U1JU51_9GAMM Uncharacterized protein OS=Pseudoalteromonas citrea DSM 8771 GN=PCIT_01118 PE=4 SV=1
1768 -MSISPYQYQLLTQSFTTLKPNFhcFCVSLH-TQLKNYNLELA-------------LPSSSkYLLNIEHNIQLFLSEGIALLPQQsalVDLIKRHKPHFD-ALKLSEQDIAVLCHTMLETLQLHLGRQFTLALRNAWRKALHMFANIIKS---
1769 >tr|A0A290TM25|A0A290TM25_PSEO7 Uncharacterized protein OS=Pseudoalteromonas piscicida GN=PPIS_a0207 PE=4 SV=1
1770 -MSITPYQYQLLTQTLASIRPNFhgFCTSWY-NQIQHYDLRMQ-------------IPTNVgQLIIWEHQIFDFVQNCVMRIPQQsnlLHYLQKQRGTLL-FMGTSEKDISVLLFTFYSNAKKSSWQAFYHSSKKRLEQSTVTHRKY------
1771 >tr|A0A2G1B531|A0A2G1B531_9GAMM Globin OS=Pseudoalteromonas sp. 3D05 GN=CSC79_14765 PE=4 SV=1
1772 -MGISTLEKQLLLNSLHVVKPNFhcFSYTFQ-MHVKREPLDML-------------CLSNSKINEKTYILYCVLERIVMHLDNLrtvTPFIEHYAKNLS-NMGMSHQDTDILCNSFLATLKIHLKGCYPPKLESIWQHAINIFKSIVTG---
1773 >tr|A0Y309|A0Y309_9GAMM Uncharacterized protein OS=Alteromonadales bacterium TW-7 GN=ATW7_05751 PE=4 SV=1
1774 ----MNSHKSVLLKSIGIIKPNFhaFTARFH-KKLVESDISMN-------------TLTAEQFNEKSYILYCTLERIIKNIDNPssvAPFLSHHLQFLK-KLNIQQSDIKPLTDIFYVTLVEHLGRFFNEESHLAWRKVLTYFERYTND---
1775 >tr|A0A0K1PX98|A0A0K1PX98_9DELT Uncharacterized protein OS=Labilithrix luteola OX=1391654 GN=AKJ09_04675 PE=4 SV=1
1776 ---------VVLKESWHLSYrrAPDLAARFYEELSWKYPSARRLLDHVFGAQN--------DI---AVCLSTVAGDLLDNVDDpdaFSAAIVALANAHV-SLDIPPHVVAWMEEVLLDTLEGAAGDDWTPEMRTTWRNAYEDLASRLAR---
1777 >SRR4051794_15895678
1778 ------------------------XmvgitqfyTEFYARLDTLDSSGKfdAIlsahtsgTNK---------------IAAKGEILIRIIKFALSIQGdnpavql----QLYLLGKS-HVQKRIRPWQYSIFVEAMIFTISSRLGTEATHEVMEAWVNIFAFILRSMLPQA-
1779 >SRR6478672_7358577
1780 --------------------------------------------------------------------------SRmp--CNSstlkRRPSatscTESPTSTSP-WESAPSST-PSSASTYSPRSLRFWATPSPPRSPPRGGEVYWLFALQLV----
1781 >tr|A0A1Z5JZN5|A0A1Z5JZN5_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_19Hh029 PE=3 SV=1
1782 MEDISPDVVSAVQDSWERIKdsspawEDDFGDRFLKSIFTKAPLsYKLLFP-FGTT-SGPAMFESEDFIEAARTASTLMDMSVSLLecemDALFGQLLEIGLEHANFPRIQTSHWSMMRDALLRTLASYssaLSEDCKdlEKVLSAWSLVFDNLSNEMVE---
1783 >ERR1700744_2408068
1784 -----------------------------------HPEAESLFRR--GPS--------MR--CPTGRP----------RSGTPG----GscwtkliASAlSA-RHKSRRLKSSLPLEEIRADVGFLL--DRVVVAIDavgdervvRNDRVLVRLDRVQS----
1785 >tr|A0A2S3QTP4|A0A2S3QTP4_9PROT Uncharacterized protein OS=Halobacteriovorax sp. DA5 OX=2067553 GN=C0Z22_01530 PE=3 SV=1
1786 -------DKDLIIESFARIEpnLKNFTNAFFDNVVILEPGMQKVFAH-AD-------------REQLKaSFIRALSITINNLKNpeyLKYYLQGLGGNQI-KYEVSETYFPIFEEAFIQTLMLFHMNSWTPKLETAWRDCFYYIAEYIS----
1787 >ERR1719216_352717
1788 ----------IIKSSWRIIQnkvIARHGTDFFIEIFDSQF---------KP-P----IGVTPVFQGHGEKMIQVVGKAIETLRDgKspteqesqelWDMLIENGRLYL-GYGALPMYFDVLGTFDCKHSKDNVIVntGNCGKQEM------------------
1789 >tr|A0A2D7G1P9|A0A2D7G1P9_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP96_10880 PE=4 SV=1
1790 -------EQTCIERVLDCAAedQPDFQQRLYDRFYQLAPSAEALMIHIDEE-------------VQGKMLAEVIRLFLsPDVaVTDQQYLLFETKNHAQAYFVEPEMYRALNQALFETLKVGAGRIWSSEVESAVHNRLSKMLHGILEAL-
1791 >tr|A0A2E1GZ77|A0A2E1GZ77_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ03_04085 PE=4 SV=1
1792 -------DQAWIETAFDCAAvdNLNFNVDVYQTFYRAEPSVASLMAHIDEL-------------VQNKMLSEVIRLLLnPNIeSEEAGYLNFEVKTHIQGYGVSPLMFLSFNRAVYEVLQSSAARVWEDDLAVAVTRRFAVLSDALTEAL-
1793 >tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ23_00915 PE=4 SV=1
1794 -------MQSSIHALLEQVAttDIDFDKKCFERFFQISEEGKTLMAHMDRV-------------HRGKMMAEIYRLMMaRDLDDEADYLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY-
1795 >tr|A0A2G2R0S2|A0A2G2R0S2_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_09030 PE=4 SV=1
1796 --IVTPDQAIIIQESFARLStsSDSLIQDILGTIAEGNSDLAVTI-TF----------KSQNLVE---QISTALSHIIDQLhtaDNVAEYVAHFGELLL-AQNVQDENYSSFGEALLSGLENALQNDFTAEVRDAWTSGWAMLSGIMRE---
1797 >SRR5258705_7404034
1798 ----------------------CPTSSSRPVLWAAvrdCAGGQTLVPR--RY------------DGTRLQADGDAGRCGQQSGQSRsrvAGGERSCQASR-RPWREGGYYTPVGAALLWTLEQGFRI--------------------------
1799 >tr|F0W0M6|F0W0M6_9STRA Uncharacterized protein AlNc14C5G666 OS=Albugo laibachii Nc14 OX=890382 GN=AlNc14C5G666 PE=3 SV=1
1800 ---------------------------------LNAPELKPVFKT----------------SKHARnVVLQHIVGGLRTMlahDVHIERVRALTRTHL-QFGVKMEYFDLLGQAVIFSMRHCSGSHWSSEIEEAWRRLYGHCSVILL----
1801 >SRR5271163_4883858
1802 ----------RTDSLYAQLGgkttIASIVDRFYEKVL-ADPDLKPFFAK-ANM------------AGIKQRQAQFLTQALGGPIDA--RNHETRPAHA-SLLSDTRHFERAATHLAVTLSEM-----------------------------
1803 >ERR1711911_155006
1804 --DIIRKNCLMLYTNFTATKiaFKWILLCLNCRYFEIKPEAQKLFPAFANVPL-KDLPKNYA-------FLAAVNTCFANVHYLIekagrnprdcPVFSKVV---A-KYD--ARDVKQFGDIMMNSLKSELGSQFTDEIEESWNLALEEIAKMVS----
1805 >tr|A0A286GHZ2|A0A286GHZ2_9BACT Sulfite reductase, alpha subunit (Flavoprotein) OS=Spirosoma fluviale GN=SAMN06269250_4620 PE=4 SV=1
1806 --ALTPDMIRLMRQVGDQLsaDARVIGTDFYHALFQTHPDIIPYFNR-TDID------------SLTEHLMQAVGFLVRSLASgvdITKELRELSQIHT-NFSVPPDAYPKLVEPLLTVMRKH-VPGFSTEQEHAWVILLNRVTNVLRQ---
1807 >ERR550539_353004
1808 --------------------------------------------------------------AMMQHLVKNLHDISRF---dsdIRELLTRLGQQWL-QKRVPLDFAVLLGNEYLEAvlpffHSNV-GATLALKLEVSLAYLYKEAMHFLLL---
1809 >LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3583117_1 # 3 # 191 # -1 # ID=3583117_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
1810 --ALAPEAVTKMRAGAEAMlaHPQEAGVFFYETLFDARPDLVSLFRT-ANMD------------ALSRHLIDTVVFLSRAADDltgLRDDLRNLARVHQ-VNQIPPSEYAHLAAPLLETLSRF-GHPLDAQMIRGWEVLFDRVSRIVAE---
1811 >ERR1719359_219123
1812 ------------------IdEepmaEVVSGeDALV----AIA-DLlyQKL-------------------------------------SGdeaMAQFLENVDLT--QlanNLRSLlalvfngsdWPEMHLS--gSLiddgYEDFSSILQETL----qaSPg-DDALL--ESLDKL----
1813 >ERR1719487_376807
1814 ------------------EeEgateEVASGeEALV----AIA-DMlyQKL-------------------------------------SGdqaMAEFLENVDLA--QlakNLRTLlaavfegndWPEINLS--aSIidegYEDFSSVLQETL----qtCLg-DNAML--ESLDKL----
1815 >ERR1712100_485805
1816 ---SVGHVVLVV---GRCSfEcrniVVVEGlDGSLDRLLALRkvvgiglGLPilQQL-------------------------------------G-VLRHVGNVA-------------------lKVlrchFLQFSNHVLEVRSRLRldefclvgdivievilrDHgggkHeRD---------------
1817 >ERR1719171_2780585
1818 --NLSEEMITEVQKSWSEVLrrvdsKTEIGRIIYDSLFDRLPHLRKMFKT-NRL-------------TVAMRFANSVHSLVGILNNkeqTEEYVYNMALRHV-QYwsgdgSIAQANMSAFLKAVLIVFDNALDDKWTQRMEEAWGALFSYVGEAMVA---
1819 >ERR1719265_1594411
1820 --------VDTIVKDWAGLDLEKLGDTTFGMMVQNNPEIKTIFGG--DVHPG---VAQQGLKSQAATFVGFMSYAMTWLKKkdfivLEQKMVELGQRHV-HYGVNVSHFVSFQEAMFTALREQLGTRFE-DNKYAWTFT-------------
1821 >ERR1740139_1939294
1822 ----DSDTIAVVKQTWKAITalPeqqEYVGMRLLHNlhpcyetsltfllvielyylsYLRVVPSARAFFPPTSD-----SLIDDESFRESASNLMMCIDKAINTLENqrhlrFKALLQTYGKKLS-RLHIPPSCYTMAWFALIETLQDVLEDRFTELMLAYWIDIIDPINT-------
1823 >SRR5690606_18427011
1824 ---VSHRN---AHEKHQPCHaKL-------------RPLLRE-----------------PRLLRRLLY--DLSGqLTRR-A--GEVRPERHG-----GAEASAX---------------------------------------------
1825 >SRR5690606_42132731
1826 ---MPMKNTNRVMQSYGRCCaSPGFFDDFYTTFLASSPAVREKSAQ-SDMA------AQKHLLRAGIP--NLVPLARG-M--PDTKLDRKSTRLN-----------------------------------------------------
1827 >ERR1719487_109746
1828 -MIMSAEAVQVVQDSFHRVDscvqiRDALEDVFFPHLFASSTQIKELFAD---V----------DLNMQAPMFANILNSTISSLNNpteLRPLLADFGEKCK-KYGVQGEHIATAGESLIFTMKSI-DDQWDAEVEAAWMAACSAMENAA-----
1829 >tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1
1830 -----PMEVALVQSTWQRFLesPnlTTEFSAIFQRMFQMVPTAMQAFRYV-NSTDLDSLVANKDLQKVVTMMMSEVNATLQLLDQpqaLISLIRSHGARHA-TYGVTRQWEETMLNAILYAVETKLSPsGFNQSEKNAWRSVLDMLGRNF-----
1831 >tr|A0A0C9M7G1|A0A0C9M7G1_9FUNG Type 11 methyltransferase OS=Mucor ambiguus OX=91626 GN=MAM1_0030c02374 PE=3 SV=1
1832 --PPTQAQIDIVRYTWERVSeihldtddPtvsatHAFGLAFYDALFKLDPSLEPLFSNIFQQAralagMVSYIARSPKVTGPNKpksatSLsegcgmstaklekvptireinarkrketnATTFEELVSSAatskpkaeDDeeqLLYKLRELGARHY-FYNVEPKFLALVGPAALSALKTRLGKDFLPEVAEAWTRAHAYAAYHM-----
1833 >ERR1719365_124985
1834 -SEMSGKQKKIVWRTWNSMLgkqesdYNDFGINFVLWLFDNFPKMRNKFDELYGR-SRNSLIVDQHFIAHTENVVKELDRLIKDLPFprlLSKRISKLADSHLNqEP--------------------------------------------------
1835 >tr|C9CRM3|C9CRM3_9RHOB Uncharacterized protein OS=Silicibacter sp. TrichCH4B OX=644076 GN=SCH4B_0097 PE=4 SV=1
1836 ---ISSRDIDLLQSSCATAFlkKGVLASAFYNKLFEIEPAYVNKFS---NIN------------KQKIMFEAMLAYCISGITSgykVEALTARLRSYHM-HLEISDIDIANARSALMYALGSVLGEDFHSDLKQAWDAAFSSVSEALR----
1837 >SRR5688500_3946624
1838 ---VDSRTIALIKESFTPIAgrTLELADRFFNNLFTRQTSVRGFFPA--DVTEQ---------KRQLPGVIQTILENGDKLENLEPQLREVGREYA-KQGALPTHYGAVARTFVDTVREMSGIGWQARYTRAWTSLFDSLTKAIV----
1839 >GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6338290_1 # 1 # 129 # -1 # ID=6338290_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
1840 -------------------ReagLEQYAGALLRSGFDDLEtllaiedadmkdLGIPaCHVVRlrkklqelqRQRSGTRGDFDASNP---VVAFL-----ENAGLGQya--KLLLQNGFDdmDV-LLDIEDADLKDLGvprghaIKLKKGLRELQLQQYAQEDPMPLHAAA------------
1841 >SRR4051794_36238122
1842 --------RRTAKASYLRLQgggrERAFFAAFYENLLVSCPDVKPFFVP-ERMA------------HQ----QSMLNRAIQLLLDFdracgCPQLRQLADGHA-GYQLTRWHYDQFVEALIRTIEQS-G-ITNPAELSAWRTTVMPAIEFM-----
1843 >ADurb_Met_03_Slu_FD_contig_21_1037173_length_469_multi_2_in_0_out_0_1 # 1 # 468 # 1 # ID=69395_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.588
1844 --------RRTALASYLRFQspdkVQKFSRGLYEHLFDRHEELERLFKP--DLK------------AQ----YEALNRALQALVDFrpedpdsAKAIETIATRHR-GYSISKAHLVTFLDAVAVGLACA-D-ERDPETHDAWHEVLVAAFKPF-----
1845 >SRR3569833_2455512
1846 --------MKDVQARFGRCClHPNFLDTFYNAFMATSPEVARLFKN-TDF------------TRQKKMLQMSLNLLIShamGIGIVDGYLHQLAAKHSRhHLNPEPQHTTPPPNSLMKAVNQHDP-KYTPSLDHARRTGHGHGIELI-----
1847 >SRR5439155_1005251
1848 --------KATtalAKASYDRCCqAPEFLQVFYRNFLAACPEAVPRFAG-TNF------------DQQTRLLRHAIGLLLIfpnQPNKEPNLLARLARGPGPcRRQGCA--CGQ---DRSDRTARTDGAsrqrrcraPCSRRpdarGSRKWVRAAP-----------
1849 >SRR5262245_66279004
1850 ---LEPTDRIRAKQSYLKHcmGKNDFYRKFYERFFQGPEGTmakEMFAD--KDL------------NQQYVKLDQSLHYLLNFGDQdmMEpTVLTTTATIHQ-TKGVAPEQLERFIECLIDTLSKDYQV--SGIEVDAWKNVCGP----------
1851 >ERR1719277_2718232
1852 --VLTDETIAIVKSTAPAMKehAYKISETMYQNMFAEKPEIRKLFTP-EDQ----KVQPGQTQKKQPLNLARAIQAYATHIDDldkKKSRIGRRIDrvrkKEC-SIESKNG---FNGK-RSEIVKEELTELERKNVVLrakmdSMEREvkllkKKFLSDIS-----
1853 >ERR1719209_1562507
1854 -----------------------------------------------GDHsh-AQSYH-----EVHEHLWRSLAFSVLNQVlsrDkRIKQDLFNLGYTHH-ERGLKEDDMLQLEYAVIDGIHDHLV---TDVHERAWRKVFQLIRIHF-----
1855 >ERR1719487_2840864
1856 -----------VRQSWAMIQaiqtS-sagGFGDALFFNISVMSSEIWSLFSV--SKE------------VMAVTFTDAFTLIVSYIADpvgLAEELFGEADGVG-DVGDDQGEGiregdghDLLGHGEQ--TPDLAAHDGDVEEERVAE---------------
1857 >ERR1719171_2815737
1858 ----------------------agaendeelrensgvedsfasgsvptTFNEMFLFNLTVMGAGARK------NKA------------ImWMTEVLTSFDTIVANVANskrLQEECDVLGLRIS-KYPLDFVKLPEFKACMLSSLRSLLPRTWSGTHEVAWSWLWENIERML-----
1859 >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 OX=905079 GN=GUITHDRAFT_143733 PE=3 SV=1
1860 --------SARIASSWTELvkksDYAEIGRRIYGS-VKANDTLEPLFR-FTNQ------------TVQGTKFVDMLSSIVENINNPqtiFEKVNELAPMHH-RKGVKAAHMPIMKGIIVSLLKHVLGDEFTNEDEEAWNWIWQYLTQILD----
1861 >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold759411_1 # 1 # 798 # -1 # ID=759411_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.594
1862 ----------IAAQFWEEHiSyksladKLEIGCAIYFGMMVHNKEMKRILKKNlhhHQ-----------SIENSSVKFLDMMGWLLRSLlrSDidLCGSLQQLGAFHR-NMGVNINHFDPMLKSMHETFSYYFPIKYGIQIKYAIDQIFTLAARIMTG---
1863 >ERR1719396_104066
1864 ---------FNIIESWELLRfhpslKEDLGTAIFRELFKEHPELREHFGL--PLVGLDALCKNQTFLSLSNQFVDVFARTMDTLGPdeelMDESIRELGEKCV-SIGIETSHLSLLRKPILSAVEKILLEDFDD---ESWKKFYSILATDLAE---
1865 >tr|A0A0P5AEE1|A0A0P5AEE1_9CRUS Di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1
1866 --KLtp--HQIQDVQRSWENI-rngLNALVSS-IFVKLFKETPRIQKFFAKF------ANVAVD------SLAGn----------------AEYEKQI-ALVD--TPTPNVEFPV--------------------------------------
1867 >tr|A0A0P4WPK3|A0A0P4WPK3_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
1868 --KLap--HQIRDVQTSWENIRgdRNSIVPPSSSSSSRRLPAPRSTSSN--SLA-LPSMP--------------------CpKManttnklllGDklqLLCNINYMRYTHQPPRAIPRERFEDFARLLLDVLSSK---GVSADDMDSWRGVLTIFVDGVS----
1869 >ERR1719510_2339612
1870 --SLTDNEVILIKSSWTYLKPhiNTILIESFMSLFAENSDVKEKFYSFKNHAIEdlnkkrgVGLASTNGLQRHIPRVSRAITKVVNSIENldrVSRYLEMLGKIHQ-QIGIEVQELMMLGAFFINSSKRHLPSSMQAdrHYSDSWLHLFTVISTMMRKGF-
1871 >tr|A0A2V3J537|A0A2V3J537_9FLOR Flavohemoprotein OS=Gracilariopsis chorda OX=448386 GN=BWQ96_00611 PE=4 SV=1
1872 ----DPETEALIKNTLPIFtkHSQQIAVQLYANLFEQHPQLKPMFC-LEFLQTPGQCKKSPgtGMSPQAKILSDSIVNFCANLDNIdmmNNAIERICAKHV-SRHVKSDHYPAVAGAFSRAVRQVLKNELSESDLKAWDTAVSALAGVLV----
1873 >tr|A0A2G5SLB2|A0A2G5SLB2_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-17 PE=4 SV=1
1874 -TEMSDEEVSAIREVWIRAKTDNVGKKILQTLIEKRPKFAEYFG-IQSeSLDIRALNQSKEFHLQAHRIQNFLDTAVGSLGFcpissVYDMAHRIGQIHFY-RGVNfgADNWLVFKKVTVDQVTTGATDsSKekdkdetnsngtangkvdteanpipvgiadinnvysgeNCLARLGWNKLMTVIVREMKRGF-
1875 >tr|A0A2P8XQA5|A0A2P8XQA5_BLAGE Uncharacterized protein OS=Blattella germanica OX=6973 GN=C0J52_27026 PE=3 SV=1
1876 ---LAREEKKFITESWHAFmrLPPANSVDAFVKFLQENPKYIKFFKSVDGIP-LEDLRYSFRVPKHVTAVLLYVNSMVHCLDNADAMfflSLQVGLMHS-NMGLTVEDFKLFNGYMVNILEDELG--LNDEGVAVWNKVLEIFM--------
1877 >tr|T1FHE7|T1FHE7_HELRO Uncharacterized protein OS=Helobdella robusta OX=6412 GN=20208246 PE=3 SV=1
1878 -----------------------------GTLLQSNPLVKNTFEKFRQMDPMSDFTDSSVFSTHAMVVMSAFEDIFDNLDDseIVKDILEQGKSHG-KFseDFAPETFWAIEEPFMSSMKDILGRKMSSQLEKIYKKTIKFILSVLIKGLR
1879 >SRR5580658_3791175
1880 -------DPALVREAWSFVSdrADQLVMNFYAELFYVFKEAPTMFPS--NMT--------RQRQEFGRAVVQWIIS--DDQEGL-----------------------------------------------------------------
1881 >SRR3990167_4175368
1882 -TGLTDGEKGMIQQSWNLLSKVEFTKILYKKIFELAPHVRCLFQN--SIES-----------QHENfsIMMDMmINEHINDELDLFAVVLQLAKRHF-HYKVKTDYYSIFRDGFLWSLEQTLSIEtlnktITnestnqpTTIKSIWLKFVNYLISVMV----
1883 >LauGreDrversion2_5_1035112.scaffolds.fasta_scaffold830278_1 # 2 # 232 # -1 # ID=830278_1;partial=10;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.316
1884 -------------------------MAFWN----KHPEPAAQFVA---P----------TQdtltdefepeeeqGISKEQLLSALNAAQT----ALMMIDR----D------FNITYLNqKSVDLLKTHEALFQSIWPNFQATeefllGYCIdlfhanpshqrqmlsnpsNLPYTTTITVKDV-
1885 >SoimicmetaTmtHMA_FD_contig_51_4416696_length_1368_multi_2_in_0_out_0_1 # 1 # 216 # -1 # ID=2511055_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.685
1886 --------VALHTVEFAVADPsaRATI--------------------------------------------------ATHGLtpdDMAMLLSK---RE------------LIGPAFPALLDEFYGKVVEN----------------------
1887 >tr|Q5D2M7|Q5D2M7_9TREM Myoglobin 1 OS=Paragonimus westermani OX=34504 GN=myo1 PE=2 SV=1
1888 MAPLTQAEVDGVVSELNPfLAsdakKVELGLGAYKALLTAKPEYIQLFSKLHGLT-IDNVFQSEGIKYYARTLVEDLVKMLTAAAKddeLQKVLVHSGHQHT-TRKVTKQQFLSGEPIFIDFFNKTLSK---PENKAAMEKFLKHAFPVIANN--
1889 >tr|A0A1S8X4B3|A0A1S8X4B3_9TREM Globin OS=Opisthorchis viverrini OX=6198 GN=X801_02811 PE=3 SV=1
1890 MAPLTQSQIAGIHKELLPiLSndeaKTSFGVGAYKAFLGAHPEYIQYFSKLNGLT-IDNVFESEGIKYYGRTLVDEIVKMLTAGADdekLKQVLHDSGKAHT-ARNIDNATFMvsklfmflkrvsemrlarglygpfpifaqSGLPVFVDYFNKSLTV---PENQTAMEAFLNHVFPNISKD--
1891 >ERR1719167_330163
1892 -IDLTDKERELIQHTWWRFREEpYCRLRIMTHYFSANSSIKKKFQR-KNEENAAngNlmtAMVSWNIRRFSIRLVEFMDKVVRDLETEnyqdiYDISELQGAKHYRlKRMVEPGDMEALGQSIQTTISEHFGEKFNRSHILAWRRLFIVICSRF-----
1893 >tr|A0A0T6BC68|A0A0T6BC68_9SCAR Uncharacterized protein OS=Oryctes borbonicus OX=1629725 GN=AMK59_2266 PE=3 SV=1
1894 -TGLTSQQKSLIQSTFNVIRPhiLNVGIDLFVRVLEVEPEHHRVLP-FSHIP-IADLHESFEFKFHCLAVVYSCSAIIDHLHDdgiLIPLMKKYASDL--KASIPLDIFQMIHDPLLEALDVHDDVKISEEALEAVRTLLRNLTNFLI----
1895 >ERR1719199_1566639
1896 ---------------------------IFQHSGIQRPVFSTSSSSR-R-------------LCRP-CDLSMAFRPSDVLHSstrLKAQVETMGFGHL-HLDVTPARCKLFHGALVDFFVVELGDKLTPLAAEGWKRVLTYVASGLM----
1897 >ERR1719362_342361
1898 --RLSASAVTFLRSSWEHVPKDSFGMEFMKRACSEEPSLSDVFDC-P-V-------------ARPDNLAKVVQMLLDQAEielvprleRLAHGIAALSFKFG---KLRMSHLAPMKRALVRTVVAFAPGNQKAMTNRAWEAFFYAIAAVVA----
1899 >ETNmetMinimDraft_19_1059907.scaffolds.fasta_scaffold284136_1 # 1 # 639 # -1 # ID=284136_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.595
1900 --RLPKACVSLLRQSWKQVPQASFRKEFFDRLYIEDSSLQQIFQH-PMV-------------EVPENAWNVVQLMLDLLNvenvprleRFVHALAGLAFRHG---RFRLAHLAPIKRALVRTVTSHASKQEKKKLSQAWEAFFYALAAVAA----
1901 >SRR5262245_21272653
1902 ------QNVEVFRASLKRCLaAPYFMSRFYDLFMGSSDEVREHFGD-TDFK------VETRVLADSLYLMAVIAQ-GEAEAPAWTEMSRLAKRHSKaELDICPELYDLWLKCLIEAARLHD-AQFSEAVEQAWRATLAPGIEYLSSRRX
1903 >tr|A0A2A4SWC3|A0A2A4SWC3_9GAMM Uncharacterized protein OS=Thiotrichales bacterium GN=COB61_05140 PE=4 SV=1
1904 ------MEFQDIRTSMGRAItHGDLFGRFYDIFLASNPKIKSMFVG-TNLE------TQKALLRQGVNLALMFAE-GKAIGK--SAMNRLRDSHSKsHLGIEPSMYRYWLDSFIKALKEFD-PDFDSALEKQWRQALGAAIEHIAAGYS
1905 >tr|A0A1R1LTH4|A0A1R1LTH4_9GAMM Globin OS=Motiliproteus sp. MSK22-1 GN=BGP75_17400 PE=4 SV=1
1906 ------DFEHIFDSSYsrvlAVTYnKQGFFETFYQRFVVADEKVSELFKN-TDMA------RQQKLLESSVYFLRDFYT--TSYAD--DVLQKIAILHSKrVLDIPPALYDLWLEVLLSTVSDFD-PLFDENIELAWRLVLSAGITFMKFKHN
1907 >tr|A0A2A2KP63|A0A2A2KP63_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_06989 PE=3 SV=1
1908 -SGLTREEKRIIQVCWFKCNqkqLRKCAEDIFADILHMDDDLLRLFR-L-DHIQSNRLRDAEFFKSHASNFAIVLSLVVTNLQEhVeqaCEALQNLGRQHAA-F--LDKFFQSMyWDTFTDCFERNPPPAFRKgSEREAWSRMILFIIAQMKIGFQ
1909 >tr|A0A1I7TYQ0|A0A1I7TYQ0_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=3 SV=1
1910 -SGLTRDDKRIIETCWFKCSqkqLRKSSCDMFWDILHTDEDILRLFR-L-DHVSPNRLKDNEYFKSHASNLALVLNLVVTNLQDnFeqaQDALQALGYQHLH-L--IDRtHFQSMyWDIFTDCFERNPPPSFRKgAEREVWSRMILFIMGQMKTGYQ
1911 >SRR5215204_501118
1912 --RVTRRDWQRLLENWERLQpsADRFATVFFDTLFAWEPQARQLFGG-------------ATLETQFLRFAHLLTSLVSAQDHpdeLDRRIDAVIRCFA-GGDPPRKREDAIRVAVAAMLNDVYAAGITPETRASWQSAYIGVITTIRS---
1913 >tr|H3NRG3|H3NRG3_9GAMM Uncharacterized protein OS=gamma proteobacterium HIMB55 GN=OMB55_00005550 PE=4 SV=1
1914 ----SQSDIAIISESLTLCgdCLEDITPHVYRRFFELDASAASLMEYS-DEH------------MRGR----MFASVLELFlsddpFESDGFLAWELDNHVSSYSVTKSMYESLFKAFFEVAEETLGEDWSGDFERAWTNRIARIMAEVS----
1915 >tr|A0A2V1ABH2|A0A2V1ABH2_9ASCO Uncharacterized protein OS=[Candida] duobushaemulonis OX=1231522 GN=CXQ87_003270 PE=4 SV=1
1916 --QLSTADRNKVRASWGDAMaakdykTEQVIHEMFSSLIEQSEDARDLFEN--KK----------VRAQQETLFAEIMGFTMMYLHNitvLDECMNEFIREnpHIVRCGV--RYLEPMGAVLIQYLRQTLGPQFHAGLETLWVQTYIYIANCIL----
1917 >ERR1719396_219344
1918 -------------NTAAAVAPkaLDITKTFYGGMLQDYPELLAYFNPAHNVP---------ISENQPMALAGSIVAYASNIRDLSPllvpngPLMAICHRHC-ALCITPPQYNVVHENVMKSIAKVLGASSRRRSRPPGARRSSSSRR-PA----
1919 >ERR1719396_178111
1920 --------------------------------------------------------------------AHGPGRLHRRLREQHPglvpaagaqrPADGDLPPAL-RLVYHPPAVQRGARERDEVHRQGPGGVVTPEIAAAWSEAVLFLSKACI----
1921 >SRR3546814_8055804
1922 ---------------------KDITPFFYDRFFALYPEQRANFYHFES--------------TSGTMVNEMITSVLALASNearSEEHT-----------sELQSLMRISYAVFCLKKKNKT-----------------------------
1923 >SRR3546814_13566968
1924 ---------------------FTIYTTLSLNVVLPFVTHRSNFDHVES--------------TSESMVIEMITLVLALASKeawLTNSFQNFVAALR-SYgDIPPDAYARLLDVLVVTLAQVAGSRWTDEFETAWRWYVSGM---------
1925 >ERR1719171_2136978
1926 ---------EAIRITVPMLEeigLENVGQVFYGHLFTESPQIQMHFIK------------------PNRMLAYIVRKAIFMVRDlhpkpkeVMAELKPLALRHI-KYDAPPELFADFLVSFTKTLEENLKEGFTTDCAEGWESATNFLANTITR---
1927 >ERR1719171_2291403
1928 ---------PRIcgelwrkqtfklrfnilgkqihspgiPRFFQKMEnvgGLLVSalllaMCFYDPEIvAHEEQIGIHIID------------------RNDAIYYVLEACNACILWllvtnVFGFSvQLSAFKHC-VSQMaeDLAKFGTFAVVFLMAFGCAIhiTMPYDPDFEDMWVTILTLFAI-------
1929 >UPI000297C1C9 status=active
1930 --ELDEYSIGEVRNGWENLERRCGtPKAAA-EEFLHKVSAAIPKTE--HM------------QKRASTVWSKLNGLLASMHDqsmFTGQLEYLALRHM-NQDISAAEIETFKGLLLEFCASKLGGMMTPEFQYGVSRLVDAVGASYQ----
1931 >ERR1719334_589756
1932 -IMLSPAAIQAIKSSWQHV--KNVGFQFFGHLLfsfwlGNQPRALEIYCLHyhGDKR-KGVVELLPRFRRLGEIYAKRIDTWVSHLDDPftlFLILYEHGFNPP-KKavGINEKDFELMVPSLMDAISSAMGSKMTHRLFEQWKSFWKYVLTQIAEG--
1933 >tr|A0A0E9N6V9|A0A0E9N6V9_9BACT Uncharacterized protein OS=Flavihumibacter petaseus NBRC 106054 OX=1220578 GN=FPE01S_06_00290 PE=4 SV=1
1934 --QMNQQEIQLVCQSWQQAAeePLRLAILFFDRLFEEAPELRQVFRT--PMS------------EKTRQLLVFFGFHINRLASgsIrRPSFEAYVW----EELLTDAQKGFLMETLSDTVAALLKPDWTPALQGAWGSFRK-----------
1935 >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
1936 --------NDLVLSSWDIVRqrteVQELGEKFWKYLNCMSPEQTNLFRR--SL------------SMWGhllHHIVNMLLISITDPEEYYDLMFELTIRHI-RYGVRSEYLNPFGNALFATFEEILSDVWEEKTTKAWKLVWKRATCNMSRG--
1937 >ERR1719242_319529
1938 ------EYKNVLQSTWTKLlqKKEEIGKRIYESIvFDTTC-TT----T-GTSLSTSIIFENTNIGQSASRFMDMLDTVICKLDEpdaLVQKLEALSAFHSSNFNVQKRHYIDFEKGFMKAIKWELGAQRTILHDRAWRWFWNFLISKMC----
1939 >KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1947561_2 # 429 # 647 # 1 # ID=1947561_2;partial=01;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.584
1940 -------------------------------LFETNSDIKTMFAKLKDYETVAELRSSKILEDHSMKVICTIDDAIANLDDMeyvNRMLQTIAQAHSTRFpNFDPEFFM------------------------------------------
1941 >SRR4029077_13489679
1942 -----------VQADVHAISvm--LNLMQPFRALRRRVDQFAKLWL--DPL------------WKTGRKAARIPA--TSTSITGRtgfAGRGRTGKAAC-----------------------------------------------------
1943 >SRR5579859_1650388
1944 ------------------------------------------------------------------NFLQALHTILLKMQRhdpsVFQFVQQLGARHE-KYGVTREHFRLVGGFFLTVLQRYVGVLWTRPMQRTWEALFGVLTDVMLFGY-
1945 >tr|A0A0N4ZKI8|A0A0N4ZKI8_PARTI Uncharacterized protein OS=Parastrongyloides trichosuri PE=3 SV=1
1946 --GLTYYQIQAIQRAWRHMSkagQVSCGRQIITKIYKNNTEIRNIFQTYVTIENLS-INQMepveWGVLKHGEEIVNLLDYVIKNLNNIemvEEKCEEVGRSHRKmkQYGMKEEHWDSLGEALSETIRENYG---------------------------
1947 >ERR1719326_2865515
1948 --NMPPEAIEQVKATWTKLLsmttHIELGSLMYDALFEKLPKIRSMFVS-------------PRL-ATASRGETNIDRIFGSFSKSas--------------YMrdpssMX-----------------------------------------------
1949 >GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold4300996_1 # 1 # 264 # 1 # ID=4300996_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.629
1950 ------------------------TQAFYEEYFRLCPDSRDLMKHV-DEH------------VQGRMLASVHELLMLPDPDEQaRFIAFETQTHR-SYGARRYMYDRLFRALRSVVRDVSGDDWNPAWTTPGIAASRPCSRAST----
1951 >ERR1719174_1428107
1952 ---------------------------------------------------------------------------VVDCQDqrsTLGYPPSAST----SVRCCVEQVARRaflwrkswfLTTLTIFIAGQ-AiLKYSHLDNLATERLLVFLFRAFI----
1953 >ERR1719284_2194575
1954 ----------------------------------------------------------------------------SWREStssMRPCPPSLKL----LGIASL-------------------HSLKLDEKLEFGNGdIGLPGGIQI----
1955 >ERR1719277_1813735
1956 ----------------------------------------------------------------------------------CMCAAETRIAHL-IGRASVANMHNLRNAVGSEVCLLSSlAIRFEANHVGWAHVsvadvVAVCSSISL----
1957 >ERR1719310_1375130
1958 --MLPQEQSQQLQQAWALVinmsgNRDALADLIYSAFFYRLGePR-APLRNPA--------------GSRSLPFLHGHQHLRRQLRrPwssaqfrrnveLRSHVLGYHRPSG-EHHSX-----------------------------------------------
1959 >ERR1719310_407492
1960 --ILPLEQSEQLQQAWALVinmsgNRDALADLIYSAFFGASASLEYLFVTPR--------------AVAAFRFFTGINTFV-AFCgDpaqLRRNSQLRSHvpGHY-NSSCEHHPX-------------------------------------------
1961 >MEHZ01.5.fsa_nt_MEHZ011529165.1_2 # 173 # 307 # -1 # ID=206391_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
1962 --YMSIDtgnleaakvmlqdlvtiradrsryyyclddlFKWHPDIVWKLTv--------------DAPELLrtmldGMIWRSRV--------------VvngnrrvnyylkhllvDEHGKFSNAM-SCIVKLQDpEIAIHPILVQ----LGDLVWNDLVYWrflrgklslVCTAGIFMVSQSMl-QYVESAGSFEERVATFICRLVV----
1963 >tr|A0A067CC73|A0A067CC73_SAPPC Uncharacterized protein OS=Saprolegnia parasitica (strain CBS 223.65) GN=SPRG_06598 PE=4 SV=1
1964 --ILNTAYLLDCSKSWKLIVtantdrMRQYgksgivlfYDEFFFRLFQRDFTLEEVFP---DI------------GKRGEVLVKAMTFMLKSSaENpkqIVNKCHYLGHRHRSFGGVRPHHWAQYTSTVIEVIMYWLGEYASPDVGAAWSNIVGFFLMHILESF-
1965 >ERR1712194_94606
1966 -----------VQDTWISATctfeyKECLGTQLLYNLMHIEPSFLDAAPFFDNTVLLGDGFDDESLIQCAIYIVQCITELVTMLDKyHEPKFRILINSHLSrlaKYNIYPSSFAKVAQALLMTLSDVMQEEFTKKVESYWMSVLIILF--------
1967 >tr|A0A2M8U0Y4|A0A2M8U0Y4_9PROT Uncharacterized protein OS=Ferrovibrio sp. OX=1917215 GN=CTR53_17535 PE=4 SV=1
1968 -SPLSPAHLGLVRATFQILAadRDRLTEMFYARAVALDPHIQRPQ-----LV--------SNMVAQRLQFMLVLTDVVQQLDDLpslAQTAATFARRHG-TYGASDPRFRTARAALAWAVDRILETERNSAIQLAWNAAFDLVEALV-----
1969 >tr|A0A1I8F573|A0A1I8F573_9PLAT Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1
1970 -------------------------------------------------------STNQKPPSDGDRLLYWINVQ------ptAQPQLLRGASEGC-VRLFSPRILTRSCISSNLCVRAGRGRNS----SSTeTTSAEGADAVVAA----
1971 >SRR2546429_8650734
1972 ------DAQYLLTESLAVLRpyADELVAEFADRLATGHPALGAIFEP--RL----------------LTVLLELAATYDRPQGLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTLRDFPGAAWTPAHHGARVRAYAFAAATMM----
1973 >SRR2546423_13669166
1974 ------DDQYLLTESLAVLTpcADELAAEFADRLATGHPALRAIFEP--RL----------------LTVLLELAATYDRPQRLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTPRDFAGAPGAPAPHRAGGRADAVAAAPPK----
1975 >SRR5690348_18181078
1976 ------------------SrrRHTRWTGDWSSDVCSSDLETRALFRT--EGS------------ELVKG--SMLAMTVEAIIDFAgersGkfrMIACEVMSHD-AYGTSRELRSEERRVGKEC--RFGWVAYPX----------------------
1977 >ERR1719323_1074371
1978 --LIPFEQRTLITEVWNVLQestIRYVSNTMFLpLIVRSNKSLQKCFAALDQSLHGMELVECygSkfDRTKHGSLFLSKlLIRVVPNMDQmdrVLPYLAELGALHQ-RHGVAKQHIDLLGLAFCAAIRGVVAgggvkGGHLHETTKAWITLIQAVCTGMKMGY-
1979 >tr|A0A1I8C1X6|A0A1I8C1X6_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=3 SV=1
1980 --DLSPHQIGLIKRAWKNLlksvNENEIAIKLLLRIFQLDPRNLAYFSL-NEYSPFDeyLIKENNIFINHVKTFESTLINVMTHPGNatkLSKHLQQLGGRHVNYTGVTykCSYWKCFIQSLIDVLTLNKDKNTSEDLHEAILILGEFCVEQMKIGY-
1981 >tr|A0A0N5CQY3|A0A0N5CQY3_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1
1982 --QLNAPQLLLVRKTWAHARSqGalEPAMSIFRNSFFKCSEIRSLIMN------GPKNEGHERLKSHAKAFTEIMDQLICGLETkelIMYELRAAGRSHIFLprdatdnkskgCTFRLAHFEHFASAMIErTLEWGEKKDRNETTQTAWTKIVLFVTEQLREGYQ
1983 >SRR4051812_28599342
1984 ------------------------------------------------------------------------------WVRprsRGGRSPRSRSSRS-SARRWPSGRPRPPSTS--RPDMRSGPSscgmsrarwqsifpapsrtgcasPIGVLGDP-----------------
1985 >SRR6516225_8820395
1986 ---------------YSVHCegKTNFYRLFYKRFFDKPPKWRTFFRK-HKIS----------MARQY----KLLDQAVASLANFHigaepTSLSHVARVHA-NLQLGREQYAMFTDSFLESISEM-GEK-DED---------------------
1987 >SRR3569833_2822653
1988 ----------------------------------APPERHTVLHE--AI------------VTNPVEVAGAIGWVVEHLHRteeVATACGELGPALARLLAGHEQHLDACGRSIIDAIRTGLADRWKPEFDGATSSAWELVAEWLRRG--
1989 >SRR4051812_2284027
1990 ----------------------------------TLPEMRTVLHD--AA------------IADPHALGRAVVWLMDNLTRpfvVTAGCELIGPALGDLLAEHPRDLEAFEPALTDAFRTALGTAWKPDHVTALHQAWDLTVKW------
1991 >tr|L8JU91|L8JU91_9BACT Uncharacterized protein OS=Fulvivirga imtechensis AK7 GN=C900_03083 PE=4 SV=1
1992 --TMEIGKITLVQNSYGRCL---SSGKLLETfyenFLSSSRDVADKFR-------------NTDFEQQRKLLRHGINLMIMYAaGNIagQTGLKRIKESHSRgRMNIEPRFYALWKAALIKAIAEHD-RDFNVEIKAAWNEVLDKGIVLITEGY-
1993 >tr|A0A1Z9IBY6|A0A1Z9IBY6_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium TMED162 GN=CBD22_07770 PE=4 SV=1
1994 MVGVTQTQEQLIEQSLTHYAarHGDPYDAAFQKLYAAAPHYEGLFVL--DTD--EGLR-----RNMMRTTLEMIATYIDDAYAAENLVTGARLVHL-TYEITDD-FDLFFQITRDVIAEGCADIWSDAHAAAWNTMLKDF---------
1995 >ERR1719295_1776256
1996 --YLQPQEIVHIQGSWATVErqLFNLGARVFISLMENQPNIKRTFRQYRNKR-HSELRINEDLQKLIMLLLCGMKRVVKYLNDtkaLTKYLKRMAKRHSPTeidfARINPAEVASVFCAALREIAPAEKDQWTQEVEDSWTSLIGGLLAA------
1997 >ERR1712029_417561
1998 -------------------------------------------------H-GSDWKV-VQVDRIILI-FRTIT--------vIIVRVQSVEKDHI-hT--------RKSF---------TQVLKVETVVEDSWTSLIGGLLAA------
1999 >ERR1712071_338654
2000 ---PTAEEIALIRESWPIVKkNKNVFVEFVLEHFRVHPKTQDLLPEFANLAI-ADMPSNKFFVQLTEtYVVMAMQEIIDNLDNagvLTDLLQCLNSNWYVDyVSLDRQN-RETLRIRRVGQEQKSYSRNMESneiQQQRCPQNLRQAVH-------
2001 >ERR1712179_849736
2002 ---PSAGV-------------------------------------------------------------------------PVNKLEENEDFQVLAyYSSAVATFivtnLDQEDILTHILVQQTKP--------------EQFVD-------
2003 >tr|A0A077ZE79|A0A077ZE79_TRITR Globin OS=Trichuris trichiura OX=36087 GN=TTRE_0000613901 PE=3 SV=1
2004 -------EWYNFKNFWKTVQrnKDNCAKLMFFKYLEQNPDLLQAYAKLRNMEMNeETAFNNSDFEHLANQYLDVFDEAITTIEsnpgDvssVVEELQNVGKRHRRIscieassfavtttvskDWLSVAILQKLQEGFMEMARQVLQDRFTEKCENSFGKFFDFVAKNLQQGF-
2005 >tr|Q7M422|Q7M422_9DIPT Hemoglobin V OS=Tokunagayusurika akamusi OX=28383 PE=1 SV=1
2006 -VGLSDSEEKLVRDAWAPIHGDlqGTANTVFYNYLKKYPSNQDKFETLKGHP-LDEVKDTANFKLIAGRIFTIFDNCVKNVGNdkgFQKVIADMSGPHV-ARPITHGSYNDLRGVIYDSMH------LDSTHGAAWNKMMDNFF--------
2007 >ERR1719253_2317543
2008 ---ILSPAGRVLRLRGPGFLpprcrfgrlspnhccsrvspdriavarrPPPRPRSRPTSSPSPRTSTRGc-WAATRSC----------CSSSTrpttspsprt--SLR--------PSPAPSrptPPTSPTC-LPS-WSPAGPWRPSVTA----------TSPSPSTRCSTSWCTTTSwrpsprswatssrrrsrpagprPSSSSPRP---
2009 >ERR1719253_507459
2010 ---LSQSAIDVVVSVAGRDArrARPRAGPRR----------TDp-WRRRRRA----------ARGG-gpgrragevqtraaegASTLGHGLVR------RGRalgHGLVRHGRGHC-HDS-------------------------------------------------
2011 >tr|A0A016TEH5|A0A016TEH5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=Acey_s0110.g162 PE=3 SV=1
2012 ----------------------DTAGEYHKQLFTLHPEIAKYYDA-EDID-PDSIPKAQKFIMLGQQELQFFFRLPDVVDNerqWRSALSSFKE-TFGDNNVPMSEFNKVTDAFLAAMQKNAGG-VTPEQKKEWEELLAKAYADMK----
2013 >tr|A0A0B2W4R6|A0A0B2W4R6_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_05310 PE=3 SV=1
2014 ----------------------DTAGEFHKQLFKKHPDMAAFYDA-EDLD-PDSIPKSQKFIMHGMSELQFFFKLPQAFSDerkWRSALSSFKD-QYEDVGVPMKEFNKTTDAFLAAMEKNAGG-VTAEQKKDWEELLAKAYADMK----
2015 >ERR1711965_451221
2016 -----------------------------------AGAVR---------P------------RP--------AAVI---GFPFPLFP-LLETADMtsvAVGAHPRLRA-----L-----LRDR-G---AWYLTGPQELASVIGRLERLER
2017 >SRR5882757_2588511
2018 --SLSSRQQILARRFFDAVEAsdKPLAAMFHERLSEIDDRLDGLLL--EEE---------GCLLREAMVIVRTLSRNVDRLNRMVPIFRAFGRTCA-AQGIASANYEKIAPVLFWIAQECVGSEFSVEMGRALTALYDQLSREMKD---
2019 >SRR5262245_14724532
2020 --------EDVVKKAYQRHCYrqPEFYRSFYENFFSRVPKARAMFK---DMA-----------RQHE-----MLDFALGQLLNysqqqSEpTTLTQFVERHS-RLGLTADDFKRFGEALIATFDSELRGdCEHHRTMAALEIVI------------
2021 >tr|A0A183IYP9|A0A183IYP9_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
2022 ------------------------------GLFTSSPEIRSLFPTLVDW--GDDIKTCQKFRNQGLKFVHVISLSLTTLHDkehLDTLLKEIGTRHVEfmPGGIKMEYWDIFEKAMVKCILQQIRwtDDFDEAIQskaaIAWRILCAYIVQKI-----
2023 >tr|A0A0C2M2P6|A0A0C2M2P6_THEKT Uncharacterized protein OS=Thelohanellus kitauei OX=669202 GN=RF11_12769 PE=3 SV=1
2024 --FLTLEERLKLKESWIKIYqkiqdlPdVDITFEIFVRLMERRPEMSKNFE--KDV------YKYSRMKSHSDKMLVILNNMIRNLDDeqkMLKYLSGMVRRHR-NYGIRQGDCKMWEEIFLDIISR------------------------------
2025 >tr|A0A1I7YD88|A0A1I7YD88_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1
2026 --LLTLRQRKILQRSWNKSQrtgLDNIGAHIFLKIYAKDSSVGYLFN-LGNCP-HSELKYRKFFQDHAMTFTRSLDFVMNHLDDLErvsKFCVELGKTHVKfmRRGFKTSFWDIFAEALTECAIDWEGGLRCRDVLNGWRTLVSFVIEEMRKGF-
2027 >SRR5262245_33555564
2028 --------------------------TFYEHLFEGAPELRSLFPI--NM------------AAQERKLLLTISVVVKNLDRdeeLKRLALHLRDVHE-GIRIEEGHIEAFLGSLAHAFQQVHGSPFPRH---DWLTLRRAV---------
2029 >SRR3954452_7277257
2030 ------------------------------HLFQANPEIRMLFPI--NM------------AAQARKLLLTISVVVKHLDReteLQRVALHMRDVHS-HIRIDEGHIELFLASLAHAFQQVNGGAFPHQ---DWKNLRRAI---------
2031 >tr|W4XW92|W4XW92_STRPU Uncharacterized protein OS=Strongylocentrotus purpuratus PE=3 SV=1
2032 ---------------------------------STHPEDSLHLHQ--GCCSHLASRESCRFVDQAMQVMQTIGNAIQNFDNKelfNTNMKELGLLHC-PVRDDtlavIHNHEVFKDALYNTLRKSLTESLTPEMTFAWKAF-------------
2033 >KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold7330878_1 # 87 # 278 # 1 # ID=7330878_1;partial=01;start_type=ATG;rbs_motif=GGxGG;rbs_spacer=5-10bp;gc_cont=0.391
2034 ------------------------------------MASQTQFvygDE--DTVMACLTKESCRFLEHAMSVFQSVGGLVTSFADPpsdRKFNLDLGLKDQ-PKDVQDRHYKVFMKCLLKSVRFHLADSYDLAMHFAWKAF-------------
2035 >SRR3982751_838383
2036 ------GINDQLRESAAMLTsgGteatDAVIRDFYIALFRNAPSLIAIFPG--NPAQGDFG-SDHRGAKQRELLLGALAGLADLYdpgdaermTHLDSVLKRFGRSHAAFtrpdgtvSGATLDEYKAVKDALFSTLVRAAGDRWRAEYTVAWSQAFDYAAASMLL---
2037 >SRR5690606_20444479
2038 ---------DIVKQSFERSkQRKTLATIFYQNLFFLKPKIKNYIKQ-TDF------------AHQEKAIMDEMEFLMAFLDDkdrhARQQILRIAGTHSAkNLNIHPHDYYYWLEALIMTAKEC-DHLWRDDFQYYWRECLSFPLTFIISQYY
2039 >tr|M6F3R8|M6F3R8_9LEPT Uncharacterized protein OS=Leptospira kirschneri serovar Bulgarica str. Nikolaevo OX=1240687 GN=LEP1GSC008_4081 PE=4 SV=1
2040 KMNISENQIRSLNESFDIVNLDriKFAELFFIYLKENHPKYENIFSRI-QL-------------EDVKHFMNSARNISLSsVQYsqLERAIQNFGVECL-KICNQAEEIPILEKAWLFALEKWLGPWYSHEVEKSWQEVFKMIHTSS-----
2041 >tr|V6I1Y8|V6I1Y8_9LEPT Uncharacterized protein OS=Leptospira alexanderi serovar Manhao 3 str. L 60 OX=1049759 GN=LEP1GSC062_2771 PE=4 SV=1
2042 GMNISENQIRNLNESFDIINLDriKFAEIFFVYLKEKNPKFENIFSKI-QL-------------EEAKSFMNSARNIALSgAQNvqLEKAIQDFKMECI-KICNRTEEIPLLEKAWLFALEEWLGPWYSHRVEESWQKIFQMLYSEE-----
2043 >ERR1719272_197188
2044 --SLSATQRASILASWRQLCGEDGGATfcasLLGGAFEAVPETRALAGV-PEAAPEPeAvpeaeaavaapapapakgkagatavpeaaaaveeaaeeaveSAESVALRAAAAHAAVAMEIMAQQLSapeALKESLTELGVKAA-SRGLGcGAPFDRLGEALQTTLQASLGDeAFPEALAEAWRQLYAQASQEIQLQY-
2045 >SRR5262249_23394332
2046 -------------------------ELFFSRLFAIEPGLRHCFDG--C------------FLGRRRAFEWMIGAAVRGRPDLRSFIQALEFMVAPSDATVHQECERLRDAFISSLSGSLGPRFTVEMMNGWLAVFELLH--------
2047 >SRR5438034_714626
2048 --SMTEASIIAFNESFERCMaSGRFFDVFYDHFLRSSPEIAAKFQG-TYF------------NRQKRMLNQRPATTVGQpr-------------RSAReSRKTPAAQFVStcqampsaFVSELTKSGSTX-----------------------------
2049 >SRR5258708_7736634
2050 ------------------------------RFTGTSDAIREKFKN-SDF------------AVQHQAMADSLYLMAVSvqggPEN-LARHDMKRLYPKHqRMEITASMYDVWLDCFVATARIH-DPECTPAIESAWRECLTPGIAAMKSGA-
2051 >SRR5690242_5369812
2052 --LVTEDDLALFLDSFDSCVaNKEFVARFYEIFLSTSPEIRALFAK-TDF------------HHQRRALKASLHVVAACaarrRAD-YSALDELADR--HrELRIEPRHYAVWQESLLAAVSEC-AERWDPDVERVWREGLSEAIAHMAS---
2053 >SRR5512134_285705
2054 --ALTPTHATLVRESWARLAPGrAAAVhRFRARLEAVSPRTAARFTCL-DH------------EAQRDGLMIELDQAIAAtgsDDDLVPALARIARRFR-ESGPASSEYPMVRDALLEVLAEADRGIAPPELRRAWGSLFGLLAALV-----
2055 >ERR1719232_1195758
2056 -------ETVIIKDTWETIHkqVKAIGMEAFEKLFALNSDMSAYLPQTDDLDQDETRRLSDKVKSHAKLTMETLEQVIAAIPDMTEvynVITKMKKLHP-----QTGLLEVIGPVFCNTTRHFLliQGRWSLDVQRAWLALFGEVSAMIRASY-
2057 >ERR1719189_1497217
2058 -------GRQADEQ----VGreEAGPGHRGHRP----AQDDPAHLRgarDCGQRVRGRARRHGDRGV-QGRGQGEQS-QH--------------HRHQG-----S------HGQ----------lHGRHX-----------------------
2059 >ERR550519_213
2060 -------NIVLLRDTWSVIHrqVNTLGMETFQKLFEINSEVSHYVSpscpDLDPd----CIDSTTQAIKAHATHTITILHNTVSNLCNLgd--lagE------------------MNRLGKLHCDLGIDHGil----------------------------
2061 >ETNmetMinimDraft_22_1059887.scaffolds.fasta_scaffold1682169_1 # 3 # 206 # -1 # ID=1682169_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.363
2062 ---------GTVFSQWRRMKIEDFGECMY-RSLVQDASLEKLFRR-------------ERMRTQSLLFAAFIQVALCWLEErdfrkVERDMISLGLRHR-SYGIQPSYVCVFQIALLQTLCQNLNG-LSLQAEISWSVVWSHF---------
2063 >SRR6266567_3650358
2064 ----------------------------------------------------------------------------------RAPSKAWGsgtspmascqstipssersfwkpsatywesaglqrtmmpgrkptkgsarscwkgpthrsqpeqssrqchrydlwererqdkikkgeatldtkqaaqkgfeQQHA-VVIGGSMAGLLAARVLSTHFGQVSVieRDHLPDGA-------------------
2065 >SRR5579885_1989414
2066 ------------------------------------------------------------------------------------------xmsnqqssrsgfgGQHA-VVIGASMAGLLASRVLSEHFEQVTVieRDQLPQEV-------------------
2067 >SRR5579864_4130097
2068 ------LQIELLETSFQAIApcGEAFVTAFYERLFMRFPQTRAFFAS-AE------------RNIKHVLAKPTIVTTLQPTRSascRTTRIT------F-PSSVGTAGVPISRS------TGYAGs---------------------------
2069 >ERR1719414_1806988
2070 ----TVAQAEKVVAQWDAADQDAFIVAMYQAMMKTHPEWRALFNK-PTGA---PTPAEAEWKKQFDLTKAVLDRGLRsratDVDALKERMHAMAGRHV-NYGVTQTHFQALKPILTDVLAATVTG----ADMDAWSAVTYFMLDSI-----
2071 >tr|A0A090RS91|A0A090RS91_9VIBR Uncharacterized protein OS=Vibrio sp. C7 OX=1001886 GN=JCM19233_1279 PE=4 SV=1
2072 -----------------------FLTFFLQHFCSTNPRFAERFCGV-DS------------EQQTKMLKASIILVQnaAENPYIRNNVKSLAKRHKEmNLNIKPEELVAWRESLLATVANFD-PLFDDDIDQACAQRWN-----------
2073 >tr|A0A139A347|A0A139A347_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 OX=1344416 GN=M427DRAFT_73171 PE=4 SV=1
2074 --MLSAEQARLLKKNWKDIGASsvanpmmFVVAQFYRRLLRK-KGYKRIFEGI-DI------------ETQYFKMQGALTACVEfaeNLDKFADTIRRIGARHA-RYNMTPNMMNDVVDSLVPSLKEFsldHGITWNEEIEEAYDEWLEQVTGYF-----
2075 >SRR5262249_57009646
2076 -------------------------------------------------------FRKTDFPRQTRVAADTLFlmaVAAGARDHavAWRGRDRLPGTPPPpGLHSSPRHHPAQLVCPL-----------------------------------
2077 >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 GN=TSPGSL018_8354 PE=3 SV=1
2078 -----------------------VGAGFLKLYAQRNPWAVEQFS-FG-LR-----------PQHAEKMGLALELIVNSATRpqvLQHQLRVLALGHV-QMGIKPEMFKSFEEALFAFLGQVLGAhnTFDEETEGAWRWMWGIVNAVFTQ---
2079 >tr|A0A090LKP0|A0A090LKP0_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_2000335800 PE=4 SV=
2080 -EELPKADKDIIISTYNILL--QADPELFSKAWimsaSRSTSIRKAFS----LIDP----NSTHIEVDFTKFSAVIERFFTriiceeKLVNesFEKSCINLGKKHVDfvPIGFHSNYWDIFMNCMIDVIAETVIIAFNEdnkqqqQVQKCWNKFVGRIVFLMQSGF-
2081 >tr|A0A0M3JT43|A0A0M3JT43_ANISI Uncharacterized protein OS=Anisakis simplex OX=6269 PE=3 SV=1
2082 -RSFTTPQLTSVFNAHFSMI--QLNPDVIKDCWiktsKRSSSIKKAFG----MLEH----EEPETNASFMNLPITIQAFFKelifelDCDSvkIRQRCEQLGARHVDfsERGFHSNFWDIFQVCTIEVIAEC--NLGLNedqhrSYELAWIHLLSSVVKSMRNGY-
2083 >tr|A0A0A9Z6R2|A0A0A9Z6R2_LYGHE Neuroglobin OS=Lygus hesperus OX=30085 GN=NGB PE=4 SV=1
2084 --SLEEDEIERIKKSWVLVKEndfrfiDILRQEMLCDI----MMYELYFNPG-R-KADVCVSELTEFKNHPKNVYSTLDFIVGDLENenvIIEKMIEIGKNHG-RLGISRKHISFMTSTIYQAVECTIGPcMFDRLVDQSWEKFLTSFND-------
2085 >SRR3990167_8699843
2086 -------------------------RLFYAHLFAKAAHLKPLFG---DSE-----------DTQNFKVIKMFELIIDNVEDLtqvQPICLDMAKRHS-FYGVKNDFYQYIDEAFVWCIQQQLSLSIQDPIIHAWYAATKYISSIMID---
2087 >SRR5690606_19766530
2088 ----VSDQYTDLQQSFGRCLrDKNFIERFYEVFMASNAEVAAMFAR-TDF------------QKQRLALRRGISVAIFHAAGssVvKRSMQQMADVHSRSgrCPVAPHLYPYWIDSLLTVIAETDA-EADEALLARWREAMGVTIGTFIGAYN
2089 >tr|A0A023F5X6|A0A023F5X6_TRIIF Putative globin (Fragment) OS=Triatoma infestans OX=30076 PE=2 SV=1
2090 --ALTADEKEILKESWKNRgiNKSTLAMMWFTKLFKANAEEIVEQNR-GQV--VEELFMDEANFDYVDKLADIFNIVVKNIHKstLcTKLIWEIGMYHC-CLDLRDGYFELMKETLLDTLKENMQPPLTSEQIEAWKKFIGVMFDIVHE---
2091 >tr|A0A0N4YMT1|A0A0N4YMT1_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis PE=4 SV=2
2092 ---LSLEVHDLARAHWIQLHkLNRQSnliQNALLYIVENYKHTRPIWQ-FGlGIDEstkdwKTLLFNNFYFRHHSASIQAAITMVMENMDDrdcMKKLLNEIGAHHF-FYDACEPHLELFEQGMIHSLRTTLVGhvKIDESTEQSWTLFLKDLKTFMGEG--
2093 >ERR1719326_703414
2094 --------------------------------------------E-HPM-------------IPITMTEES----VKLVQDsl-SRVDSLVQV-----RDALQDvFFPHLF---------------------------------------
2095 >ERR1719487_2229452
2096 -----------------------AALSL--------P-------T-EQE-------------SPVTMTAEA----VQMVQDsl-RRVDSAVQV-----RDAMEDvFFPHLF---------------------------------------
2097 >ERR1712176_999243
2098 -------------------------------------------------------------------------SY-AHRDTfdqladAPRTI--FYTQK---------QGHPECSEMVEKMKNIVGDE-------------------------
2099 >tr|W8BTT7|W8BTT7_CERCA Uncharacterized protein OS=Ceratitis capitata OX=7213 PE=2 SV=1
2100 -LGLTITERRSLQNGWSIIKqkQRRAALTIYVNLFTEHENLYEVFRSDGV-------LNIEFASQHQKEVLTVFQMIIEQVDNarfVKTMLKELALRHE-AASVTNTQWQLYTNEVRKYFLETLADAISPTFVHALDKLMNFVCN-------
2101 >tr|A0A1A9YF90|A0A1A9YF90_GLOFF Uncharacterized protein OS=Glossina fuscipes fuscipes OX=201502 PE=4 SV=1
2102 -MGFTPLEIVALQNIWRLFKkrFKYHSMQIFLAFFNQNHKLIERFRLpSGK-------FQLNYLCQHSEKMLLLYENVIDkCLDNmanFHGIMADVTVSHR-HSGVTYEDVSLKSEHVRRYILDYFANQSSPTLVSALAKLSEHFND-------
2103 >ERR1719370_117345
2104 ---------------------------------------------------------NATRMFPAKAALQESVEVmVDVLERrgmWGSGIRDAGISHH-KLGIKRRDMEKLATSILAAISDLLGDcDLDRKllQLNAWKKLLNAIADEFSA---
2105 >ERR1719234_1549997
2106 -----------------------------------------------------SLWhrssiQLEGASNHNKALMNAIDSVmVEVLERrpmSKSGIRDAGISHH-KFGIKRLDMDKLTTAILAAISDVLGDcDLDRKmlQLNAWKKFLNAIGDEFSV---
2107 >ERR1711972_141202
2108 --SISETEKTYCIKEWVKIcsDRSKTGTLLLSHVYQENPQLLTH-PAWKDLS-QDQLKENQHFKNLAEKTMGSVEQILTHIDNVDkvaSMFEQQGKDYK-SAGKSMSH---IMACLETFLPLDHPSlEVTEEYRGITQEILGIIKQSLMKGYR
2109 >tr|A0A0N5DD39|A0A0N5DD39_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
2110 --NLTPHQKQLLVQSWPQVQlynRIHGGDAMFARFCEKNSIARETFQKIAVVQSfASNEASESVLKKHEQYLVQLLSEAVENLNNdCEPLLReclDYGAQHVT-LHelLNETVWEQLAEAIIDRIHKVNLVRRHKDLSKAWTMLIILLIDKIREGY-
2111 >JI8StandDraft_2_1071088.scaffolds.fasta_scaffold105816_3 # 981 # 1154 # 1 # ID=105816_3;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.718
2112 -----------------------------RNLFKIHPELKHALNI--EIK-KSGIQH-----VPLASIVFSYAANIDNADKFLVIIRHIVDKYS-SLGITVNDCPIIGSLLLDAIKESLGYAATTHLLAAWAEAFGLFTNALVQ---
2113 >ERR1719199_1194134
2114 -------HAGYIEKSRESVlnlDAAQLGADIHVKFLNVYPAAASLFQK--TLR----------M-LITTKIMGTLMAVISDPTGTLEDVRAVGVRHT-KYGISERYLLPFGAMLWEIVGTMLPGMWSDEHSAAWAFYLDFIASTMTRA--
2115 >SRR5882724_2518483
2116 -----EEVRRKARKSYRELQDSAFYCNFYAELFRAAPDVRQLFRNI-NM------------DEQYEKLHAAVGKLLNfrPTDDPNP-MSRHAESHE-RLGLQPKHFEGFRDAFLTALSSRK--TADNYAMDAWRAIFDAGIAYMTTK--
2117 >tr|A0A2P8AX05|A0A2P8AX05_9ACTN Terephthalate 1,2-dioxygenase, reductase component 1 OS=Micromonospora sp. MH33 OX=1945509 GN=tphA1I PE=4 SV=1
2118 ----------PDPQRLLAALgaPDQAADHFWSYMEDRSVRV---LP-----------------QQFAPMFFSTLAEMVARRGDpaaRRAELALMGRMYL-RFGLYPYHHTVVAAAMVDTVRRFAGASWEPDLAGYWEvgcrRSLRLAE--------
2119 >tr|E5XPI8|E5XPI8_9ACTN Uncharacterized protein OS=Segniliparus rugosus ATCC BAA-974 OX=679197 GN=HMPREF9336_01410 PE=4 SV=1
2120 ----------TFVRSFHlELFgaAPELAARFPPGLGEHRGGF---VR-----------------M------AEHILETFAEGADpprLIDLLGQLGRDHR-KHRLDERDYRLAQAAFAKALVATARG---SGDGAFAAraaaLVCQVME--------
2121 >tr|A0A246RU09|A0A246RU09_9ACTN Uncharacterized protein OS=Micromonospora wenchangensis OX=1185415 GN=B5D80_01060 PE=4 SV=1
2122 -------------------------MREADELRSALPDR---LA-----------------AHDAELLIATLRRLATD-PEpaaQAVTLTVLGHAFR-RFALLPHAKLISALAGAD-------------------VPVELLR--------
2123 >tr|A0A085M5J8|A0A085M5J8_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_06691 PE=3 SV=1
2124 -TCLTKRQRRCILKSWRKVqNKAQLGEEIYIQIFMQKPVLKSLFP-FRAT-PVNELHDNVLFTRQAVIFIDFIDNVVAYVGinNgrlLQELCTRVGISHALMtrVNFDPEWWYLFANSVLDGMQKFCLPNFSCEpiatyigsqSMLAWRILLKHVVEMMSDAF-
2125 >tr|A0A2C9LD65|A0A2C9LD65_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106067556 PE=3 SV=1
2126 --QLSHKDKLFILNSWLNFrNgkrEEDIGMEAALEMYSIYPEIKDIFTIYRDARM-KHLTDKEMIRTHSQQVASVVDKCVMRMDDAHAfamIAVDEGSVHI---KIQERFMRCYVDCYIREIKKYSKLKWSRANQMAWEVFFDTIVVNMKNGW-
2127 >ERR1712086_1089461
2128 -------MG---KEHGDGDSsadaNTAAGLDVMQGKKPEQKESKRWFSlgssaakgkqerS-----------KEEKEEKIADKALEMSAEMYKDPTRIQGETMGLGLRHI-MYNVDPAFFDALVTAYVEEMAVRTT---------------------------
2129 >tr|B3LWC8|B3LWC8_DROAN Uncharacterized protein OS=Drosophila ananassae GN=Dana\GF16358 PE=3 SV=2
2130 --GFTCVEKAALRNAWRLIEPfqRRFGKDNFYNFLTTHQDLIHNFRL--DPRSSDSPINLSKLHGHALAMMKLLARLVQTLDiNLqfRLALDENLPAHL-RRGIDPSYMKMLATALKRYILESsvIQNHNSSTLTSALTQLVSII---------
2131 >tr|B5DW13|B5DW13_DROPS Uncharacterized protein OS=Drosophila pseudoobscura pseudoobscura GN=Dpse\GA26483 PE=3 SV=1
2132 --GFTLCEKVALRQAWNLIRPreRRFGQDVFYTFLNEWYWSISKFKK-------GEDINIALLHAHALTFIRFVGALINESDPImfQVMINENNQTHS-RCRVGADYIAMLGQALTDYILKVLDKVRSPSLEQGLQRIVEKF---------
2133 >ERR1719162_2542559
2134 --------------------RSDIGMCVWNRVFVEDPKAENFFKQ-SN----------Q---RLIYIVTMAIKYSVEFYGDpekTKMAIEALALKHI-MYQVQPRMFMLFVTCYDEEIKARTDD---KLVQSGMHWSISIIASIMA----
2135 >tr|A0A0V1BAT0|A0A0V1BAT0_TRISP Globin-like host-protective antigen OS=Trichinella spiralis OX=6334 GN=T01_2203 PE=3 SV=1
2136 ---------------------MENGGQLLANVFKANPELRKFYDV-EDID-PDDTKKSRLIQQAGGNLLNSVTFMVNNYDNErsfKQEIKEQICDLR-EKGMKLEDARKLKTGFVNYVKSKLSQPMTAKEEKEWDMFFQRFFDALKQ---
2137 >SRR6476620_89806
2138 --------------------RHATRQQRRPDVF----------HER-QRTAGE------D--lnVLRERDVGQVH--ESLARAgvavIDGVVPRIGCEVV-DLSSEMQNG--------FPQGVIL-SAAVGVGDDDG----------------
2139 >tr|A0A2W4R8Q8|A0A2W4R8Q8_9CHLR Uncharacterized protein OS=Chloroflexi bacterium OX=2026724 GN=DIU68_09390 PE=4 SV=1
2140 --RLSRQQKRIIQRTFSAVAvrHDLVARLTIERLRElsRTPAS-TC---FGNTP------------EDRRRLMHLLALLVQRMDDRGA-LHDACVAQTRQMGCDPFeggSTSLLAEAFIGALQSALAGRFEAKTEAAWREFFQMVERVLR----
2141 >tr|A0A0L0FDI4|A0A0L0FDI4_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12917 PE=4 SV=1
2142 ---KTDSEVELIRSSWRALLaGDGtaaqmpllrFVEQYYKRLFRLFPDSRGVFKT-RDTQ--------------SKSLSLLLSIIINVADEpeLemNAKKKKLEMMYK-EYGMNSLLAVIAGRVLIQSLQAFLEAsnKFQASVKDAWVKCYTSIADQL-----
2143 >ERR1719203_545915
2144 ---------LILKDTWAVIveQIHELGLPTFVKLFRLSANLRYYYPKHnRPES--TEV--QENINTHFDQLVAVVDDVVRCLPDLsthIQYLRNLGPVHC-DVEVQPRLLELMGPVFAILSDLYCWskadgvirLKWPGYYYFDILLDScemVTIQLLLDLX--
2145 >ERR1719232_1194111
2146 ---------IMLKDTWSGIieQMHELGLTAVVRLFKINYNLRFYNSPNvRYHP-TTHTNvkvlrgttaapatpaavasgstaaataagpsakdqatgksNLEDLSIVFNLLVSIIDHMISSLPNGsspTSHAGRNGksngtkakftlsaATMK-QLQILRQPTDWVGPVFCNTVRPLLLvqGKWSYQVEIAWRLLFRHLVRKNRTFD-
2147 >tr|V6U182|V6U182_GIAIN Flavohemoprotein (Fragment) OS=Giardia intestinalis OX=5741 GN=GSB_151570 PE=3 SV=1
2148 -MPLSEDTIKAVEATADLVAaqGLDFTRAFYERMLTRNEELKDVFNLshQRDLRQPKALLDSL--VAYARS-IRKINELhelqeqglpvpAERLAELqgfFAVAERIAHKHA-SVGIQPAQYQIVGAHLLATIEERVTA--DKAILAAWSKAYDFLAHLFV----
2149 >tr|A0A1R1LGI5|A0A1R1LGI5_9GAMM Uncharacterized protein OS=Motiliproteus sp. MSK22-1 OX=1897630 GN=BGP75_23395 PE=4 SV=1
2150 --------LDKIYSTLQLLDdekSEKLINETYSIFFNAHPEAVLLWSK--DDPE-----------SRSKMFNGVILTIIDNLTRpdiFKNNLLSDVKDHD-EYGVDKEMYGGFFLSLTEALKKTLGSEFNQEMELAWKHQLAHIRE-------
2151 >ERR1740121_1123239
2152 --------------------------------------------------------------------------------------------------vWIVVGSA----------SVrHR--LrAFGSASGSSSgRRLSGidY---------
2153 >ERR1740121_2035324
2154 ------------------FTplt-----Cqwa-----TPHDGPAQHVL-------------------------CEDGHFahFATDKCesAgHgA-RVQCPSDMPEMcaDttcgggqehccrpaggCTGgERPCPT--------TASASgSA--SgsaSGSASSRRLAgIDYE-----------
2155 >ERR1719271_1314470
2156 ----------------------------------------------------------------------------------------------ghRqdeqhglQVPwCHQIPAVRGDC--PGLALQpCR--V---------HrREWC-----------
2157 >ERR1719240_2235476
2158 ------------YE---DEE-------------------------------------------------------------------------GAqvdvmkgEDALVATADLLYQKMSEDAN---MQT-lLGNIELAELAsKLQKALa---------
2159 >ERR1740122_169377
2160 -----K------GE--ADKSgnAEAAGGgqGDTPETGAAQDTAAGV-------------------------TDEHS--------KaLGIEISS--FDELkvDqkciaaaIDAwKLFISTAESREAAGEAV---YNA-lFEGAPS--LQALFVTPRAE------
2161 >ERR1719243_286169
2162 ------------------------------------SHPVNV-------------------------LVSDTMwkGY----t-vRgIRRVNYY--VKYMmlTrdgnvsqALGwFKDAADCKIISH-PVNVLVsDT--MwKGIVRKQFLGgRLWFII---------
2163 >ERR1719158_147189
2164 ------------RV--CYLYplvhcNILAVLrelnfdGAAESLCLDAPALLPT-------------------------MLDGLIwrSR----vTeNgQRRVNYY--IKYFivDaeggfskTTEvMTDNGDPTIVCR-PVVSLVtDM--IwGRVAFRTFLYgKAWFLF---------
2165 >ERR1740121_2502219
2166 ---------------------KSFALEVFKRLFAMVPHSESFFKQ-----------SNTRLIFIVSRALDMCMNIYKEPTRLVNEITALGIRHI-MWNIPTTYFDPFVQCMLDEAIVRYGAS--QQAIEGLEWSMRIIASIMV----
2167 >SRR5262245_17232684
2168 ---VEEETRALARYSYLQWlDDDEFFSAFYESFFAGATGAKGKFR---NV------------EQQRLKLRDAMTAVLNFYpGNEPTSLHRLIAVHA-ARDVTGTEIEQFERSFLEVLHQRLVERKIaeqlgpdvvAKIEQGWRELLHPVVQYVMGV--
2169 >ERR1712137_24889
2170 ---LPRESITVIRDTWAMVErNVDIAPKMLLKMFQLYPMTQNLIPLLRGVS-LEDMPTNKRFLQLAYGSQFAMSAIVDKLHRpdmLEEIIG--GGMHAFVDGLSTS-FQMAaTTAlFNKIMTEELGSAYTAEAQEAFIATGDMMTSIMV----
2171 >SRR5262245_32700325
2172 --WLNSNQRDLIRRNWDSssK-RYELCRRIYCRVFARRPEIRRIFSIGYDW----------WRLEI-VTFADFVQSIVDNLDDAkrvRQSAFEFGRDHAKwrRFGFRSDFWVQLAESTTREcvyLDAAVH--PPDESLETWTKFVSIVF--------
2173 >SRR5271165_4656598
2174 ------------------------------XMFYKKPDLKPTFIeIGHhidpendggLT----------WEV-EAQRFTNLLTDLIGNLNNLdrfEELSFDWGRNCVQwrEFGFKPEFWLHFSEAMTTEclyMDQAVH--SVGEVIEAW----------------
2175 >SRR2546423_8132340
2176 ----------------------DVADEMFtARLLELEPQWQRVLS---DEP-----------TEWGRRLLRAIRQAVASFTClggFAEALRELGGVPA--AHVGYRDYERQGAAFVGRLEHSLDKPMAGAMRESWQRVFRLLAE-------
2177 >tr|A0A2A3E2S2|A0A2A3E2S2_APICC Globin OS=Apis cerana cerana OX=94128 GN=APICC_08732 PE=3 SV=1
2178 -------------------------------------------------------------EAHCQNTASGCIDALDDVDLMEAILHTIGERHG-RRGQDRQQFIDMKGVIIEVMKDTLKSKFTIEIEAAWDRYP------------
2179 >tr|A0A1W0WMU5|A0A1W0WMU5_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_09357 PE=4 SV=1
2180 --ALTHVQINLVRESWRWLNFnrplQETAVRFFlDFYFKQNPDCLPMFG-MKTVD-----HYNKAFSIHALTVMHAIKYAVEYIGNpeqFQRLFRTVGQTHL-RFGLTDLHVERFLEQWLAFLRANDAKVFDAATVEAWNLAGRIVVSQI-----
2181 >ERR1711911_15016
2182 --------VDLVRKILDKAKqNGNVAPKVFFKYFKAKPASMKAFPAISGLA-LSDLPRNGAFLSNVYTCFAGLKAYTLETDV-STRCPVFAKA---SGKYKSEDIDLFTSILKGVVAEELGADYDDVAKEAFEQFLDAVALTVT----
2183 >SRR5690554_6373173
2184 -----------------------LYLSCYDIFMGQSADIGAQLFN-TRMS------------AQHGLLRGGIMWLIMHARGMsDSNIRALGKSHSRdQLYFHPSHYALWLDALMETLYKHVP-EFNLQLELAWRRTLEPSIDKIISMY-
2185 >ERR1711879_742838
2186 -----------------------FFEDFYSIFMTKSPDVLNMFAN-TDME------------AQRALLRSGILWLGMHARGMpDTKIRALGESHSKkKDEHQPHVLFHVAGRSDGNAFPPRP-G----LHSRTGANLAPYPTAHVT---
2187 >ERR1719461_1661620
2188 ---------------------IEVGCYTFTQLFSQYPM-MDYLAKFDGLEV-EGVCIGEALRAHADAIGSVVAEIqenAGNPERIRMSLAQAGHRRF-LEGVERAQLDMLGPNMAETViIKDTWevISKQVKSigMESFEKLFSLNSDMSaYLPQ-
2189 >ERR550519_213
2190 ---------------------IQVGCDTFTQLFQKYPQVNNYIAEFDDMEV-GGIKVGPALRAHASAVRSVVTEIqenAGNPERIRSSLAAAGHQQL-MAGVERKQLDVLGPVLCHVIRPLVWekGIWSVEVEKSWTHLFDIVACLMKLGY-
2191 >tr|A0A173LPQ6|A0A173LPQ6_9ACTN Phenol hydroxylase P5 protein OS=Dietzia timorensis GN=BJL86_2914 PE=4 SV=1
2192 ---------------------PDFRRALEDALNTEAPYLRADLPR--NLD---------GPFA---TFVKLYRFLLTrvedsggdraKVDDVLDLCRELGHDLA-KYNVVEEQYERFGHALNAALARVAGEEWTGELSKVQNQFYVIIARALHK---
2193 >tr|A0A0M3HYR2|A0A0M3HYR2_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=4 SV=1
2194 -PSLTPSQVQTIRKSWKHINtkgLYTVIRRCFQQLECMCPSVSNAFNSA-NNQLSANISTVRTLVEHTKFMLILIDRIVENDQDSIIELRRIGASHVVlkeSFGFGENELEKFGEMLAEAFLKLDGIRQSKETSRAWRLVIASMIDQLRAGF-
2195 >tr|A0A1I8CNT8|A0A1I8CNT8_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 PE=4 SV=1
2196 -IGLSNYQQKLILQCWPNIYttgnSSTFATNIYPNLCTRNQKAKALLQK-AD---GVAVFSQSeidCTSMHSKLTLEIIDSVVRNFDSnpisLIGYLNEIGHAHRSlkSIGMPSSMWDDLGDSILEGVRRNDLVRKHKELRRAWLAIIAFLTDNLKQGQ-
2197 >tr|A0A0N5AJ93|A0A0N5AJ93_9BILA Uncharacterized protein OS=Syphacia muris OX=451379 PE=4 SV=1
2198 --QLTVAQSVLVRKTWAHARnqgSMEPAMSIFRNSFFKSPDIRALMMA-GS-----KNTGYERLKRHAILFTNVMDKLIAGRvEEidsVIEELKNAGKEHACitreQYACpfRTSLLDQFAAAMIErTLEWGEKKDRTEVTQTAWTKIVLFIMEQMKAGFH
2199 >tr|A0A0H5S8S8|A0A0H5S8S8_BRUMA BMA-GLB-3 OS=Brugia malayi OX=6279 GN=Bma-glb-3 PE=4 SV=1
2200 --QLSSYQIHLLQQSWQRLRcSPNFFINVFRTVISKNTIAKELFRKT-SIIDGFTSYKCYDVKEHADSLIELIDFALREIHSsikvVQDRCMLMGAAHCNTCeNSMSSSWDQFGDSLAESIAKAEAIRGKRKCLKAWNALLSFIVDRIKGGY-
2201 >tr|A0A0N4XUJ2|A0A0N4XUJ2_NIPBR Globin-like protein 9 (inferred by orthology to a C. elegans protein) OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1
2202 -ASLSFSQKQALTTSWRLLRpqAAGFFRKILLELEIVSNTVKQIFYKAQFVDAfNKDEENIATMDAHIKLMVKFFDDILASLDDeteCVERMKRIGSCHAVlvrSCGFSSDIWERLGEISMERICAHEIVQKTREASRAWRVLLACIIDELRCGF-
2203 >tr|A0A2A2LCK8|A0A2A2LCK8_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_21707 PE=3 SV=1
2204 -STLSFSQKQALSLSWRALRpqAAALFRKVFLELEIASVKVKQIFYKASLVDAfNRDEENSATMEVHIKLLIKFFDDLIPLLDDekeAVDLIRRIGSTHAIlakSCSFTSDIWERLGEITMERVCTHETLQKTREASRAWRTLLACVIDELRSGF-
2205 >tr|A0A261C2G6|A0A261C2G6_9PELO Uncharacterized protein (Fragment) OS=Caenorhabditis latens OX=1503980 GN=FL83_09405 PE=3 SV=1
2206 -ASLTFSQKQALNLSWRLLKpqASACFRKIFLELEIASPKVKQIFYKAALVDAfNKDEDNSATMEVHIKLTTKFFDELLSTLDDeneFVAKIRGIGSAHAIlakGSNFSSDIWERLGEIAMERVCSHEVVTKTREASRAWRTLIAILIDELRGGF-
2207 >tr|A0A1Y0I5V1|A0A1Y0I5V1_9GAMM Uncharacterized protein OS=Oleiphilus messinensis GN=OLMES_1782 PE=4 SV=1
2208 -----TQDQRLFWNSFDRCLsspqrDQQFAEDFYQRLYSSDRAIAEIFDR-VSV------------SDQLHAVRQAVYLLQEMTplKQAEITLDKIQAIHH-QheIRLSNAMLDKWLECLLASVELAD-PEFNETVKQAWIDILTPAVHIL-----
2209 >tr|A0A1I7TWD1|A0A1I7TWD1_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=4 SV=1
2210 --RLSKIQKRAIRFTWHRLQtrnggkrVENVFEEVFDKLVKNLPNIRDMFST--RMF-LCAMsrGTTSTLRDHSKSCVKMIEAVIKNFDTeKskrtdtgtENDPRVIGRAHSIlkPYGLAGNYWEKFGEVMIDVVLAQEAVRDLPGAGQAWVIFTACLVDQMRAGFD
2211 >SRR5439155_18881238
2212 ----------------------PVLQGFQQAVSGFFTEVGRQFPK-NR------------FRQTPRKTQTSFLLVMGNIApgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSTQVEAAWRYTMGAGILFLKA---
2213 >SRR5256885_16048310
2214 -----------------------FFFNDTATTEIYT-LSLHDALP-IY------------FRKQRRMLQTSFYMLVEYIAlgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSRSEERRVGKECRSR----WS---
2215 >tr|A0A1I7ZQR2|A0A1I7ZQR2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1
2216 -IPLTAAQIHLVRTLWRQIFlskgPTVIGSTIFHKFFFKCPKVKEQFRR---CPLPRNFPNHDSFaKAHCKAMSELVDQVIENLENldtMTADLERVGRLHAEVmnGELSTKIWNDIAETFIDCTLEWgDRRCRTETVRKAWALIIAFMIEKIKLG--
2217 >SRR2546427_190033
2218 --NMTYAELAHFDDSLTRCTrEPRFLERFCALFFASSDEVLQKFSQ-TDV------------QKQRRVLQASLYIQLSASPIvtnGSLIFCNPSVTWSIiQVQRSPAMRTLRthSSCPLVGYPLKA-GQCGVGHVPX-----------------
2219 >SRR5213596_3505323
2220 -----------------------FLCVIFGLLRRGPSQVHTD----RLA------------EATEDVTGVVPQILMLEADGkpeGAVHLAPLAALHSQqHLDIPPHLYDLWLDCLIQAVRESD-PQCTPETESVWRRMMANGLAFMKVRYH
2221 >SRR3569833_2178475
2222 --------------------HPNNHNTNKKTNKTTTHKKTQKNKN-TK------------NTQQKKKLQMSLNLLISHAMGigiVDGYLHQHAEKHSRhHLNVEPHHYTARLNSHMKAVKQHD-PKYSPALEQAWRTGLGHGIELIKS---
2223 >ERR1719347_979638
2224 -PIVTDEEMASINELWSCLRadAMHSSRFIFARFFEAHPEFLEPMPFVKDYYGniSPKYMDTQEMQDYCLKFMSTLDAVMTRVFArdkeALQVMRDIGYSHH-EFGLTSDMTVKFMNKMHDSVLELWGTEASRRDSKALDNIFKTIATEINVG--
2225 >SRR5437762_8994925
2226 ------PAAS--------------SDHHIPSQLAAGTRAKDRKGG-VEY------------PGHVCRGQRRCARDRPHILAspelCIPRACRTKSA------------AFCAVCENRCCETC-RSPPAKKPETARRSAERTG---------
2227 >SRR5690625_2752079
2228 ------SDYSDVQASYGRCVrNRDFIPGFYQRLLSKDKRIAAIFKR-TNW------------SVQNRALRRGISIALTWAGGskiVDRQLEEMADAHS-RKGrvpVDPVLYVFLREALKIGRASCR-ERVGVTVGDGcvpqdESGAATGG---------
2229 >tr|A0A085LV25|A0A085LV25_9BILA Uncharacterized protein (Fragment) OS=Trichuris suis GN=M513_10305 PE=3 SV=1
2230 --EFTAKEFAIAELTWAKLKvrfNNQVGMEIFRQIFASCPKVKNLFGV-QNRE-DQKALCDQRMARHTAIFQDIIELLIVDLSQrsdsLTQSLITLGAQHWFftQRGFRPEFWVIFGNTLVNLIRSLPLSlSQRYLARRTWIKLIVYLLDCVMFGY-
2231 >tr|A0A0N5DS84|A0A0N5DS84_TRIMR Uncharacterized protein OS=Trichuris muris PE=3 SV=1
2232 --EFTPKEFAIAELTWAKLKlrfNNQVGLEIFRQIFASCSQVKGLFGL-QNKE-DHTALGDQRMARHTAIFQDIIELLIVDLSKrsdsLTQSLITLGAQHWFfnQRGFRPEYWVIFGNVLVNLIRSLPLSlSQRYLARRTWVKLIVYLLDCVLFGY-
2233 >tr|A0A183BUR6|A0A183BUR6_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1
2234 -TGLSAHQIQILQKIWERSPeseISDCARNIMSHLLRSNAQMYQFFDLLGH--SDREIANSPIFARQSANFAVLLDFVLANLLEevqkVCLALQHLGAQHARlRWPIETHHWALFCRCFEDNPPKEV--FLNAEGHDLWKTMINFIIVQMRVGYD
2235 >tr|B1KNW6|B1KNW6_SHEWM Uncharacterized protein OS=Shewanella woodyi (strain ATCC 51908 / MS32) OX=392500 GN=Swoo_3305 PE=4 SV=1
2236 -----------FNDSYDFVLrnEELFFSTFYEIFVSSSPQVKAAFKH-TNM------------AKQNEMVRESFGFIICFFVtKiADEQLVKLAIDHKDKFHVDSELYAVFVNSVLAALEKIYP-KYNNECAVAWRITMAPGIEFMKH---
2237 >tr|A0A176H0Y0|A0A176H0Y0_9GAMM Uncharacterized protein OS=Oleiphilus sp. HI0069 OX=1822245 GN=A3741_11335 PE=4 SV=1
2238 -----------FDDSYDFILsnDSNFFDSFYTHFFNSSNLIKNAFAY-IDM------------DKQKQMLRESIKHLVKFYCtNkESEYLKTIARHHADKVRADEYMYKLFVDSFIQAIEDTYP-NFCEEAALVWRCALKPGIDFMNS---
2239 >tr|A0A090LM85|A0A090LM85_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X000017100 PE=4 SV=
2240 --NLTTSQIMSIKKSWKHINtkgLFNVLRRCYQRCQSCCPNVAKVFST-ENIKK-QQNIYSCGVSEHTKYFISLLDRIIDNEPNIEHELRNVGKEHAKlyeEYKLSITDIERLGEIIADVFLKLDGIRQNKETSKSWRILIASIIDEVSVGYE
2241 >tr|A0A183CLY2|A0A183CLY2_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1
2242 --LLTRTQRVLIENSWKRVKkaavEGGMGAKVFHNVLVAQPDMKLLFGL-EKVP-QGRLKYEGQFRRHAGLLNRTLEYVIKNVQytdKLGQHFRALGKKHCQmngGRAFPTNYWDTFLECILQSVLETDGSisgRYhrCREAALAWRNLVGL----------
2243 >tr|A0A0M4CP70|A0A0M4CP70_SPHS1 Uncharacterized protein OS=Sphingopyxis sp. (strain 113P3) OX=292913 GN=LH20_00550 PE=4 SV=1
2244 ----ERSDAALMEATLAAVAetGIDIRHTLFERFFSAYPERHPAFLNL-DA-------------ASRRMTDETLQILFGLATDegwVWPLVAELVATHR-NYGmLPTDEYDAFIDLAIDELGRAAGRAWTGAHAAAWRRQGEIL---------
2245 >tr|A0A1Y5Q3I5|A0A1Y5Q3I5_9SPHN Uncharacterized protein OS=uncultured Sphingopyxis sp. OX=310581 GN=SPPYR_3232 PE=4 SV=1
2246 ----PARDIAAMEASLAAVAdaGVEIRHALFDRFFDAFPDRRASFMIV-DA-------------SSRRMTDETLAMMLGLAKGegwVWPLVAELVFTHR-AYGpLPIAEYDAFIDMTVEELGTAAGAAWSAPAAAAWQRQAEAL---------
2247 >tr|A0A2N3CVZ2|A0A2N3CVZ2_9PROT Uncharacterized protein OS=Alphaproteobacteria bacterium HGW-Alphaproteobacteria-17 OX=2013663 GN=CVT78_05625 PE=4 SV=1
2248 ----SARDAGQMEASLIAVAdaGIDIRHKLFERFFAAYPERRASFISV-DA-------------ASRRMTDETLQMMFGLAKGedwVWPLVAELVFTHR-SYGaLPIAEYDAFIDMTVEELGLAAGAAWSDETAAALQRHAEAL---------
2249 >tr|A0A0D6LRF9|A0A0D6LRF9_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_06233 PE=4 SV=1
2250 --PFFRIDNRLVPDSAVAtDMV-QAQIHSYVYSSLQSTVSREMFQKM---SIVEGFRTNQccDLNMHAKVLCDLFDSIVSDLQQaskiVQARCMDVGGSHV---HMNekccGSLWDQLGECLAEVITKVECVRSKRECTKAWIMLISYVVDGMKCGY-
2251 >tr|A0A1I7RN92|A0A1I7RN92_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1
2252 --GLTDDQCEQLATAFSNIPdKYYAFEQMFLNLfMKEDPQLAVVFGF-EGIR-PEELRRMSPFRTHVCKFQRFMTTVLDMLPKknreeeLIQIIRMVGRQHCNvkLLSFTAQKWLSFKNGMLNALAKG---GESHKYYSSWNILISFMISEMKDAY-
2253 >tr|A0A183BTK8|A0A183BTK8_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=4 SV=1
2254 --QLDDTECEQLSTVFAAMPdKYHLFEACLRPMpMPeVDPQIALTFGM-ANIA-EIELRRKTPFRYSV--------------QKrgreeeLVQIIRMVGRQHCQvkQLSFTAARWLSFKSALTWTFSRG---EQKDKLHVQWSLLISFLICEIKDAY-
2255 >SRR5688572_1577071
2256 ---LARHDWHVLLDRWQRLQpnADRFATAFFDTLFGQQPAFLQIFAS-APL------------DAQFLRFAHLLSEIVSAADDadeLPRCVELVVQRFA-NDDCETDRSRAVRAAINAMLTEVSAAHMTPHMRASWHAAYVAVTAIL-----
2257 >SRR5690348_16468503
2258 --------------------ADAAMTYFYAELSSAARATWAdrdIYMS----------------GPDHMIVRT--ARALVErg------------------APSRLIHYDLVDPRVTEGQX-------------------------------
2259 >SRR5258708_24656334
2260 --------------------ADAAMTYFYAQLFAMDTEIRAMFPA--AM------------DVQRRRFFEGSAGSPLPsraRpttIASCLTCRNSGPHHM-IAETAP----------------------------------------------
2261 >SRR6185437_6364830
2262 --------------------ADAAMTYFYAQLFAMNTEIR-aVFPP--RP------------GPVKRMSRT--SSGACRrtrRs------------AAR-RPRPRPCHTSAGPAR-------------------------------------
2263 >tr|A0A016TZT5|A0A016TZT5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0066.g3721 PE=3 SV=1
2264 ----ANKSKKLVIAEWPRLLehEPNLFKIVWSSSAARSTSIKQAFGI-TD---NESPLENESFMKLSPTIQAFFYKLVIsmQLDEdmVRSACEQLGARHVDfiARGFNSNFWDIFLVCMAEAIDATLSSYITDeakraEMILAWQRVFNMIVHHMRTGYN
2265 >tr|A0A0R3Q1W4|A0A0R3Q1W4_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=3 SV=1
2266 ----ANRDKKLVIQEWPRLLeqQPHLFQIVWNASSTRSNSIKKAFGI-GD---DESPQENAVFMRLSETIAAFFEKIVItmQLDDdiVRSTCEQLGARHVDfiARGFNSNFWDIFLVCMAETIDETLSSYMTDegkraEMILAWQRVFNMVVHHMRTGYN
2267 >ERR550534_360735
2268 ---------ADAKASWANVDTAAFGKAFFKNWMASDPEVKNVFKK-SSFP-----------QGPAQFLVERFDILLGVLDDevaLSQQLMSVAKTHM-DKGVDPEHLVTFQDSFVKTLAGF-DSDWSRERSESWAYVLSHVIT-------
2269 >ERR550539_1411929
2270 ---------SLVETSWANVEKEAFGKAFFKNWMAIEPHVDEIFKK-SSFP-----------QGPAQFLVERFDILLDVLEDevaLSNELTVVAKTHM-ERGVEPDDIVTFQDAFLKTLPGF-DSDWTRDRSEAWAYVLSHVIT-------
2271 >ERR1719192_2654783
2272 ---------GAQS---APTPPKPVGQTwtkRLSEKLSSEPEVADVFKK-SSFP-----------QGPAQFLVERFDILLDVMDDeasLSKELQVVAKTHM-DKDVSPDDLVTFQDAFLKTLPGF-DSEWTRDRSEAWAYVLSHVIT-------
2273 >ERR1719242_19104
2274 ----------------------------------------------------------------------------------------------------------TPLIGMA--AQS-PLSWEQEK-----YVKLgQRWT-------
2275 >tr|A0A0C2FEY2|A0A0C2FEY2_9BILA Uncharacterized protein (Fragment) OS=Ancylostoma duodenale GN=ANCDUO_24724 PE=4 SV=1
2276 --SLMPSQVSVIRKSWRHINTKGLITVLSrvfQRFNA----ID-------GQE--YAKVYDMTIYGIIEF--------------------------------------------------------------------------------
2277 >tr|A0A0C2G6K1|A0A0C2G6K1_9BILA Globin OS=Ancylostoma duodenale GN=ANCDUO_17195 PE=4 SV=1
2278 --CLSYKHRKLLRATFQQMNsSGaflKLMEQVFRRLEAKYPDIRSIFLTTAFVNSLSRERSSPPLvrteHDHCKCLVALFEKIMDNLSDdtQLMVIRQYGEKHAQmkESGMSGGMIESFGEIAVAVIASQYSYWIQKPVDDVTrrkgrDEGLVYLNDYEYIIL-
2279 >tr|E1NZ07|E1NZ07_CAEEL GLoBin related OS=Caenorhabditis elegans OX=6239 GN=glb-29 PE=4 SV=1
2280 --NLSVKQKKLLRQSFNAMNsGGtflKLMEKIFRRLETKCPDMRSIFLTTAFVNSLSRERQTPPLvkteYDHCKCMVGIFERLIENLENIneqLTMIRHYGEKHAQmaESGFTGAMIEQFGEISVFVIGSQDVVKFNHETVKAWRLLLACVTDEMKVGFD
2281 >ERR1719431_1401903
2282 -----------------QLTtnSIRSGFCGRLCETTRyNPDCtsSNTFSMRfRKR--RKNFHSPMINTEISRRILWRRKRLMTRLFKrdpeATKRIYDVGFHHQ-MMSITEHDMTMLSSSIYSAVQDILGKKASDKDLAAWRHLLGLVSYHFKRG--
2283 >tr|A0A1Q9NTV3|A0A1Q9NTV3_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_3 OX=1841598 GN=hmp PE=4 SV=1
2284 ----TSKEADILTQSLKALEekTDDLPKLFYYHFLEPtsNKEIISLFNK-SDM------------TKQYMMFHQSLAIIVSSIKDshlLNQILKDLVKRHK-NYGVKYAHVQIFSSAFYKTIEEIFPK--DEKVKILWIKLINFVLSKFNE---
2285 >ERR1719238_586270
2286 ----PKEVIAEVRRCWEAFIkasgsKEAASEHLYAALYDAVPSVQHLFVT--PR------------VVQAMRFMTQLQTFITLLDQPkqsKVTMEAIGFAHM-QRDITVELCVLVRDAILDLLQVELGDNLSSSAAAGFKGLLNWM---------
2287 >tr|A0A2A2L6E6|A0A2A2L6E6_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_22934 PE=3 SV=1
2288 --KLTKLQKKALKFTWSRLQtrnggkrVESVFEDVFDRVVRYLPQTREMFNT--RAF-LCAIsrNETSSLRDHARMTVRMIDVAVRNLEVetrkrsdtgSDMDPLLIGIVN-----WRGSRYS---CRIINRI--------------------------------
2289 >tr|A0A2G5VGS5|A0A2G5VGS5_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-26 PE=4 SV=1
2290 ------SERSIKLRKYDYEKddgSK--------KLL---SFYKKVREK-------------FTFKRSGSEMVAVVVSVMQSLDEpdkISKMCQEIGQLHA-KYrrskGMKIDYWDKLGEAITETIREYQGWKIHRESLRAATVLVSYVVDQLRFGY-
2291 >tr|A0A1I8EM37|A0A1I8EM37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1
2292 -PSLTSAQIHLIRNIWRQVYitkgPTVIGSTLLHGIYFKSKKIKDQFFR-CPFP--HRFPNrDSFNKAHAKAVGEMLDKIVDNLENlesMSGYLFSIGATHANliRRQVSKEIWNLMAEAFIDCTLDWGdKKGRTEASRKAWAFIISFAIEKIKRG--
2293 >SRR5690606_37396704
2294 ---FSDTDTYILHTGLKWIEeaPETFAAKLYQRLLRDHPECQASLHAI-GL------------ESFNRNFIHFLKMVKEELLErhtIHVAPREFLALHALpvEKVRHSNYVIKMGRTFLDIFAELAEDAWSPALESTWNKAIEEVK--------
2295 >GraSoiStandDraft_42_1057292.scaffolds.fasta_scaffold716659_1 # 2 # 607 # -1 # ID=716659_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.685
2296 ------TEIQILENGLRWIKesQDRFGDKFYHRLLREHPEVNPLLQSI-DP------------WSFNKDFVQSVDAIIGEIRAqgdVISPLKDFWPELSStaMTPLKPSELIKVAETFLDLISELAEDAWSPALEYVWRKAIKTVM--------
2297 >SRR5215207_8455447
2298 --------------DFDTVVCSSFAERFYSRLFTHEGGehLRALFPDN--I------------QPQHAQFTTMLGDILAYNFRigRSLLGD-TFRKHI-DFNIRESDVDVFRKAFVEEVGSTFLH--LG----------------------
2299 >ERR1711972_144950
2300 ---------SQVLQSWEQVKllgLESVGEMLRANTFELDPQVVALFRIPGVVSTGEGMLQRMALRRLFSKVLRFVGSVVAGRYDyqrLVETLSR-----------------------LGATRAAGGATEVHFKI-------------------
2301 >tr|A0A238BIH0|A0A238BIH0_9BILA Globin OS=Onchocerca flexuosa OX=387005 GN=X798_07861 PE=3 SV=1
2302 --------LFTLKNYWKTVRrnERDCAKMMLAKYLKQNPDNKEKYPKLKNIDVntVDVATANSGFETVAANYLKVFDDVITTVEEkpgdvsdACSRLTAVGKMHRTkVNGMDGSEFQLLEEPFLYMISEILQDRYNDKAENLFRKFYQFCLKYILEGFN
2303 >SRR5215467_3799544
2304 ----------QVSESYWRCCtNPLFIEELYQTLFSKCGEIKQLFEQ-KNVS----------MKRQYAMLRYALDIFVDYPHDMTATFPDIARKHT---GLDPRFYETFIEALIETVGKCDPK-WVPSLEHAWRERMT-----------
2305 >OlaalgELextract3_1021956.scaffolds.fasta_scaffold865191_2 # 285 # 404 # 1 # ID=865191_2;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.492
2306 -----RHEWHVLLERWQKLQpnADRFATVFFDTLFAADPELRQFFGG-ASL------------EAQFLRFAHLMTEIVSAAGDpeeldhrVEVVVQRFARDDS-A----TDQSRAMKLAIAAMLEEVAASDMTRQMRADWKAAYAAVGAM------
2307 >ERR1712159_177610
2308 ---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNITFFNRA-HFTS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLRRISNKHV-GFGIKPQHYPIFFENLFVAFKEVLGTKATPELISSWKELVSLVQ--------
2309 >ERR1712159_799488
2310 ---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNVPFFNRA-HFAS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLEESPTNML-DSESNHNTTRSSS-----------KTCSLPSKKS------------------
2311 >SoimicmetaTmtLAA_FD_contig_31_10253239_length_247_multi_1_in_0_out_0_1 # 3 # 245 # -1 # ID=589621_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.671
2312 --GLSEYERGLVVNSWKALTkpdfspldGTSSLSNFYDAVWTKWLKIDEF---------ANKMFRSRGFKGRVQHLLRIMGVIIKCAEDPlrgLEQLRSIGVQHC-IWGINSQSFASLALSIIHGLDQANGKEINAELKELWLAL-------------
2313 >tr|A0A1V9ZGT6|A0A1V9ZGT6_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_12918 PE=3 SV=1
2314 -PVLTPTNVDICRRTWDLIQtagtdkMRqygkpgiiLFYDEFFYRIFERDTTIREVFPKV---------------QQRAEVLIKAINFILSTRAGtpasvmeTVNACRFLGHKHRAFAKVRPHHFAVYTNTCIEVIMYWLGEFGSHEVGTAWSHTVGFILRHILEAF-
2315 >tr|A0A1I7XNU2|A0A1I7XNU2_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1
2316 ---------NTT------DSglqlEGIVVQNCFIYILSKYKHLRPIWQFGKKIEDneenwTLALYEDFYFRHHCASIQAGLTMIMENKDDpesIKKLLNEIGAHHF-FYDACEPHLELLDQ----------------------------VKGHVSDG--
2317 >tr|A0A2A6BP14|A0A2A6BP14_PRIPA Glb-18 (Fragment) OS=Pristionchus pacificus GN=PRIPAC_48995 PE=4 SV=1
2318 ---STPEDKKLMEKTWSEEFdvLLTLGSDIYNYIFKNMSACKRLFPWIIKYEdEGVDWKKTTEFKDQALKFVQVIDTVVWGIIDgdkSEPFLYDVGQRHVQyaSRGFKASYWDVFLDAMQYAQDQRIPKmnnlnaQEKQRAKQIWHDVAAYIIKHMKSGF-
2319 >UPI0002C4E217 status=active
2320 --------------------------DFGTAFFEYCPDLKGQFPS--NYA------------L----VTKMIQKFINNViegKNLERLARHYGRTHW-RYDLEERHFLGFAEALADTINIRIGNFGTIELMKIWREEATMICKMLEDQY-
2321 >SRR5215831_15107384
2322 ----------------------LFFSKFYTNLFGRADDIEDRFKEL-DM------------ERQYRILNLAIHKLLEFRPEqpaTQKQLRDLSLRHA-KLGLTNHAPAWNR-IH-LDLRGIGA--DGRSsGVAAADKALAX----------
2323 >tr|A0A085LU76|A0A085LU76_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_10599 PE=3 SV=1
2324 --NLTTHQKQLLVQSWPKVQtynRIHGGDAIFARFCEKNSIGRIFQETFQKiavvQSFAINEASESVLKKHEQYLLQLLTQAVENLNNdrepLLRECLAYGAQHI-TLQelLNETVWDQLTEAIIERIHMVSFVRRHRNLSKAWTMLITLLVEKIREGY-
2325 >tr|A0A2E0SMS8|A0A2E0SMS8_9PLAN Uncharacterized protein OS=Planctomyces sp. OX=37635 GN=CMJ46_12130 PE=4 SV=1
2326 MSQISERQYHLIHDSYRRCMlADDFLVMFHRNFMEKSPQIPKFFAD-HTL------------QQQHRILAKSVARLVSFVDGkpqaeqdMRDTMRI---LHDGNLRLTPEHYAFWATALMETICTI-DEACNDEVAVAWEQTISYGTGVLK----
2327 >tr|A0A0B2VQV3|A0A0B2VQV3_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_12261 PE=4 SV=1
2328 --NFNKRERVCLRETFQKLAdPkELIGAIFVDIVNDIAPELKKVFGV--DRAPKAAMLKMPKLGGHVARFTDLIDQLTNMVGyteNVlgaWQLVRKTGRAHT-KQYFletnqsarGTNYFALVANTFILEFTPYLTGekeepnvdekkkvrfasTYTStMISDVWARFFKVITAQLTDAF-
2329 >tr|A0A1I7YWT2|A0A1I7YWT2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=4 SV=1
2330 --SFTKKERICLRETYQRLQdPkEIIGRIFLDIVNDVAPEVKKVFGV--ERVPRPNMLKMPKLGGHVARVNDIFDQTTSMLGyteNVlgaWQLIRKTGRAHT-KQQFllenlnqlEKNYFQVVIDYFQEQFLPYLTGekegqerkkvrfaqNYTTiLIEDVWKRFFSILIAQMTDSF-
2331 >SRR5512138_1182700
2332 --------HRRVQGSYSTFQatdrADRLYRTFYANLFASVPEARRMFAH-TDWS------------RQYNAINEALKLLLDFDADpqraadAAKQIGSVALKHQ-QYGLGERELRAFEGALLHALRSC-G-ECKPATLEDWRMILAPGFHHMRG---
2333 >SanBayMetagenome_1026888.scaffolds.fasta_scaffold228792_1 # 28 # 387 # -1 # ID=228792_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.353
2334 ----EPNQRALAKASYRTWIepDTRFFEDFYRRFFATTAAKrahsVHKFK---DR------------KEQHDKLRNGMAAVLNFYpGNEPTSLRYVIDVHR-RKKVTEPELKQFSATFLELVSERLNRKLtgtgsaarRKEIMDAWTALFDQVLKHFRE---
2335 >tr|A0A0V1CBX7|A0A0V1CBX7_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16916 PE=3 SV=1
2336 --ELNDNDRQAIRQTWQKIGdHTLWAQRLFAKILVACPAFSKATSF-HSL-AGKHLLNDAKFRSFCQRFADFWQNLVQLLCvsdDpadwqqAVDSIRGLGQRHSLNRKVTfeAPIWLMIKNEIVLSITGY-SDICRSKDCLSWNKLLMFTVAEMKSAF-
2337 >SRR5262249_4116633
2338 ---------------------TKFFRSFYEILRE-SPEIHDMFTSP--FS----------VAKQAQKLNNAMEKILNFRTYMnTSSIGREVQRHR-KLNIKPEHYGPFRDDFVKALKKAkIDDGYS---EDAWCAVLDPALDYMRT---
2339 >ERR1719347_1935341
2340 -TGLSQNEVTLIWSHWESLKphKRRLAKRILKVYIKEHPRARELFPNWVDIP-TVELVKLTSFSRKAVDTWEAFSRAWECIDDaplCRKVCYAFGKKHI-ECnarikghgQIDEHHVKNFIRIFLRIILVSAR----EGSEEAWRKATEFFSINFVRG--
2341 >ERR1712142_116161
2342 -THLSQNEITIIWSHWESLKphKLKLAKKILKVYLKEHPKARELFPpHWKGIS-MADLVKLHSFRRKANDTWEAFTRVWECIDDpklCQRVCFTFGKKHV-EWnarlrqtrgQIDEHHLKNFMHCFSKTVLDNSR----AGSSEAWRKATDYFSLHFLRG--
2343 >ERR1719313_2808357
2344 --SLSDATHELLQKTWQAAKPegpg--LGEAWYEELRsdtSYVDDLGVILNF--PV-------------CRPENVSRVVQALLDLLPRecqetpepglmlpvprFTKLLLAAATLAQ-----------------------------------------------------
2345 >tr|A0A0D6M2N5|A0A0D6M2N5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=ANCCEY_04360 PE=4 SV=1
2346 --NITPFEIRYLKYSWEKASsTMDIGCELVARLLNDN---RTRFRALIEshsgdLLgsanfAAEDVKKFRRARSVAHGVVMFFNQVISELDEpnsadfIAVISQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALQVKkttsfacgktisMsDKKAREVWYKVIQFVIQNMKRGF-
2347 >tr|A0A1I7W801|A0A1I7W801_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1
2348 --NISSQEIQYLKYSWERASsASDIGCELVARLLNDN---RTRFRALIEshsghLLgssnfTADDVKKFKRARAVASGVVMFFNQVISKLDEpdaadkISLLSQSLGASHF-RMKvwFQAENWLCVKNCLLDAIMTALRKNggssllcgkrhmHnIKRATDVWYKVIQFVIQNMKRGF-
2349 >tr|A0A0K0DKR1|A0A0K0DKR1_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1
2350 --LLSTLVANNLQIYFSRANnATDVGCELVAGLLNDN---RTRFRALIEshsndWLgsatfTAEDVKKFKRAHSVANGVVMFFNQVISKLDEedaverIALQSQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALMTKpfmvcgksitMnQKKSREIWYKVIQFVIQNMKKGF-
2351 >tr|A0A1I8BDP5|A0A1I8BDP5_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=4 SV=1
2352 ----MRYTNYLSKIVLARTLnQVDIGNEIVIHLLNDK---RSLFKNLLEqsspyEKeikniyDKKSLSkYSPRSLEISNGVTKFFKNLSLLLnqkgmEIeekedkLVEICKNNGKMHY-QMKvwFQAENWICLENSVIETIIKGNNLEkenFeSNQTIIVWSKLMQAIIGWMKQGF-
2353 >tr|A0A158P8J3|A0A158P8J3_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1
2354 --NLRKEQVRALRMTWTRLCepprsnckgIVNLVERVWEKLDRKDSSVRNIFYNAAFvetMHDRCERRrskgSIATLRDHTHFFVSLVSQVIQSLDLnpenILNHVDTIgKSNHAylKQYGFRSQHWEKIGEYFVDVVVIQDCVRGFPEACRAWTILVAALVDRLRAAP-
2355 >SRR5262245_41417288
2356 ------------RASYPRCMaSGNLHARIYEAFFAACPEAKPLFDN-TDL------KRQYQLLHQAIVLMLAFH---VSPNrEEPTILSRVAARHS-ELGVhiPPAWFDAFSAAIQQSLEAA-DTQFSDKTREAWAAVLADGIGYMQ----
2357 >ERR1711884_327085
2358 --------------------------------------------------------------------------------------------------SNESFSvIFKHLAFIKYL-HItktglFDELFGQHVCRIRRiLPFKLIIRL-SSNF-
2359 >ERR1719471_2433215
2360 -----------------------------------------------------------------KGIMKVVSKVLCHLNDlsrVEDYLRVVGRLHD-SAGVEIAYLSVTGDAFCTSLKRLgtHADIWNDEVKQTWNAFFRVVVDLMSAGY-
2361 >SRR6266436_7042579
2362 -----------------------------------------------------------------------------------------------RVFITAqysCRYHSFSATFYVMAGdkerwkVYM-SHQQMSLhARSKDGLYSRRttQGY------
2363 >SRR5437870_11165056
2364 ----------------------------------------------------------------------------------------------------AqysCLNHILSATFYVMAGdkerlkVYM-SHQQMSLhARSKDGLYSRRttQGY------
2365 >SRR4051812_43285676
2366 -------EVEVARDSYKRILddevkEEKFFRSFYQRFFRKCPDAAKEFAA-KEFPRRVAlsGRggnaREGKWPRQYRLIKQAVVLLltFKLLDDteGLTILTDIADKHE-RYP--QEFYDSFRDALIDTVISLDKDsgsgLQRYELRDAWEKSIQPGIDYIMN---
2367 >SRR5262249_5830581
2368 -------DVEVARDSYRRILddverQREFFHTFYGLFLRRCPEAAAVFEA-KGYPALAQlgGPrvedSAGRGPQPPNPLKSAIVMLiaFNILGEkeEPTILDNLVDKHK-GFP--KRYYVAFQDALLETVVQFDDPsrcgMPPDELQHAWKQAIQPGGDYLID---
2369 >tr|A0A2T7PRA6|A0A2T7PRA6_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_02930 PE=4 SV=1
2370 ---FEPHDKTIVAESWKLLRsiFPDLIESAFVEMCRRVPRLKLQFGNV-DVDDD--EERHMNFLKHVWDVSFFFDQLLLYLPfksKLEECSFHIGLVHA-SVEVPAWYVDLFLVEFIRAAQETVQLEWTPAMENAWAVFLRYLCYYMKDA--
2371 >tr|A0A2A6C3W4|A0A2A6C3W4_PRIPA Glb-17 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_39254 PE=3 SV=1
2372 -MELTDEEVAAVRNVWIRAKTEDIGKKILQTLIEKRPKFAEYFGILCQSDklDMNSLKESKEFHLQAHRIQNFLDTAVGSLGYcpvtsIYDMAHRIGQIHF-YRGVNfgADNWLVFKRVTVDQVTKGvtstqasqanlLegtkepevveqhpmadvQNPFSGEnclARLGWNKLMTVIVREMKRGF-
2373 >tr|S9VAV3|S9VAV3_9TRYP Uncharacterized protein OS=Angomonas deanei GN=AGDE_12480 PE=4 SV=1
2374 -------------AAWSHLLtspnGGEFCSTLYEKLCQNLTYIPDYIRNLKD---------EE---RVIDHYINVITKTLELYENphvMIDELPKIAARHR-GFGVSSDAFFVMRNIFMELLPEYMDPKVYEQSKKDWLKFWRLVLDLMVSGS-
2375 >ERR1719354_143580
2376 ------------------------------------------------------------------AFWDILDHICGHLDRlenLIPQLRDFALQCF-NSGLFSDDYNILGECLVTILSTNFD-PWEETHSDSWAWCLDLVMSTLVT---
2377 >tr|A0A1I3QX19|A0A1I3QX19_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter neptunius OX=588602 GN=SAMN04487991_1987 PE=4 SV=1
2378 ----DEQMIALVKASLKELQphAGAVFATFQSKLAQRAPELAYRYDEV-DP------------ERQGELLFEKLAIAlggVRFLDRLVPALGGVGLDAG-SASLTSCDFARLSEVLIAAFAEVSGNRFDPCIGAAWTTLFEELSWHMFE---
2379 >SRR3954469_11252496
2380 ------------------------------------------------------------------------------------DGGAIRRHHV-RSGIGGPDYGRFGDAIPAVMVDVGGNDLPKPIGGSWGDAFWAVIGRTKQR--
2381 >tr|E0VF27|E0VF27_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236389 PE=3 SV=1
2382 -----------VLNDWPKIRknYKKIFIDSFINYFAENPNYKLLFPSFSNVS-EDDLPFNHCFRLHCFAVYKAINFLMSNWlGEyeedDSKILPVIGKTHF-DRGITLEMMNLYKHSIVYSCNNHLKPNL--KRKLSWQTVFDHIFDY------
2383 >ERR1719461_240742
2384 -----------AVASWNNIDdKTAFGKAFFSNWLESNPRIKDVFAQ-SSFK-----------QGPAQFLVERFDILLGVIEDeeqLAEELYQVAKTHK-KVGVDQSDLYSFQASFMKLFLPS-TLItaqrsqtlgltpFLtssSLLWSRWQLSLPV----------
2385 >ERR1712165_596852
2386 ----------------------------RLFLPSTLTSLQRLETH-----------------GLTPF---------------------------------------------------------------SHVITAP----------
2387 >SRR5580704_4499342
2388 ------------------------LGDFYRRLLQHHPQLAAYFEGV-NI------------DFQVQKLVVVLSTIARDLPDrsvLDRVLFHQGVAHV-ERGIGRGEFNEFIALLANVVSCKTTLVGAAESYAVWYQELSAVATSML----
2389 >tr|A0A0G4HY87|A0A0G4HY87_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_33490 PE=3 SV=1
2390 ------NRIHLLQSSLAACLkmstkEEFVGRLMYDTLMRTLPEPGIIAKR--GR------------TMMSRAFNDtvaALVAFVSEPSHMETYMDWLALRHV-HYKIDTTLFPQFRQAMLVSLEQVMADQWNAEIERAWSEAYEMTSQALQ----
2391 >SRR4051794_14672716
2392 --------------------SPAFAESFYTHLCR-SDAVRDLFVTAHRKRVPAALnrQESpaIPDETQRRKLVDGLKAVLNFRPGcSPSSIDSVAARHV-DLHLTTDHFDVFEKSFLETLEQHVTRSEdreeMEEITHAWEKLFATVRDEMLD---
2393 >ERR1740139_220892
2394 -------TRAALLKSWEMVQeaGTvPAANLLMKHLRERDAEALRVNTSH-ARP-KTGETEEDAVRKLAVRTVQILGSAATGMSDtvsLVQHLHKVGAGFA-GTGIKEGYFAMVRDASPFALRELLGDRFTADIASACRITGPFLASLIIAGLR
2395 >ERR1712194_173361
2396 -------TRAVLLKSWEVLAevGTaTAANVLTKHMRELDAEALRSYTSQ-AQP-KDGETEDDVVQKLAVRTVQMFGTAvtA---NDtasLIQHLHKVGAGFA-GTGIEEGYFSLVDKASPLALRELMGDRYTADIASACSMTGDFLTSFVREGFR
2397 >ERR1719446_598571
2398 --------------------KKAYGLNAFNRFFCKAATIGNSFQHI-QC-------------ASVCSgnarSPAVSGYLQGAYTlgeCGHLTWPQTHHVQH-FYRLLX----------------------------------------------
2399 >ERR1719240_1501566
2400 ------------------------------------------------------------------------------------------------VQHFYRILRLLLEACCEELADWVKD---PAAVEGVEWALTQIAAIMI----
2401 >ERR1719235_1367256
2402 ---LPGVTVEFLRSSLARISEDEFGDMFVQKLRETGDmlsegTIEGVLNT--PI-------------VRPTNLRKMIVYAL-----------------------------------------------------------------------
2403 >SRR3989338_2963815
2404 ---------TPLYHLYKENVppqkERELGLLFYKLLFDSNPELLDFFANV-DLD------------HLSDHLVQTIRLFLESRnslVSLVPAMKALGIIHQ-RAMIPSWAFPLVIENMAKLFSILLGDRFTVELASALVLSFDLLTSF------
2405 >SRR3990167_6716616
2406 ---------NPIYStlknIWlETVStpeiKSAVGELFYKNLFQYHPELLEYFNNV-DMD------------SLALHLSQALDFVFQSInkiGDYksqwRTVLEHLGEVHR-AALIPTWGYPIIGQQILKIFPYNEKAGFSTKQL--etaLATLYREIVII------
2407 >SRR5436309_231744
2408 -------------------------------------EIGQLFEG-RKVT----------MEDQYRKLDRAMFSILSFNRRlKATTLDPQVASHS-EFGLKREYFQFFREAFLAALRETQAS--DDYSREAWSALLNPALAYMSD---
2409 >ERR1719183_3286062
2410 --------AISLRDSWVHIEvlkeeddSGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFSTLVHAMGDpqkFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDSMVRNFR
2411 >ERR1719183_785787
2412 --------AISLRDSWVHIEvlkeeddTGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFAVLVQSMADpakFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDCMVRNFR
2413 >tr|A0A0N4UGY4|A0A0N4UGY4_DRAME Uncharacterized protein OS=Dracunculus medinensis OX=318479 PE=4 SV=1
2414 --RLSDKQKLWIKLGYKKWRsksKMVPGEWVHAYAIKKYPTMKALFKK--HEN---------LARVYTQTITKIIEMAVESVdslDDsLGPLLISYASENgileERgmasiftirndklllfLEGFDRRFWGYVAEALCALSRDFPLKRHKWDTISAWRIIVLFIVKKLEYGF-
2415 >tr|A0A2A6D1B3|A0A2A6D1B3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_35146 PE=4 SV=1
2416 --TLNHQQRKLIKNGYDSWRkksCISSGRWVHSFVSSKDDRLKEIMEG--NEE---------TTRIHEETITHLLDMAVESLeslDDsLGPLLISYTGPQgvfeEK-DGFDRLYWSRVSEGMCQLARNFPSKANKYETVCAWRIVVLFICNKIELGF-
2417 >tr|A0A2A6B4U3|A0A2A6B4U3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_54703 PE=3 SV=1
2418 --GLTKDKTDLMANLWPSHYgtLYDMGIAAWDKLFAHNPGLKKHFGF-AENDPSSSWKNDERIKKMVLSLQQLLTEAVNTLGfgDtealtsFVNNLRELGGLHRAiADGVNPDAFTLLFAILPEVIVDVTSnrskdgplsSENRSELLAIWRAITRFMANQVMTGW-
2419 >SRR5687767_14811217
2420 --------------------SREFMSRFYRRLFAARPELRSQFKNV---------------TTQHDMLAEAIRDLVLFRpGDQEARFLDYVETHR-RMNITVHDIEAFRLAFVAEVIATSMQngnAQARSHGDAWNAALKLGLGVMAK---
2421 >SRR4029453_11133516
2422 -------------------------HLIILKLQRIAMQGAflSVIPAtgFSEH----------FITNSCEFLPK---PQSSSREKalgenEPNILSRIAEMHNKnNYNISPESYKAFVSALTATICGSAPEipePFAPqckisvneknLIKNAWQKALKPGIDYMIMRYS
2423 >SRR5262245_37180117
2424 --------INKVHESLKRCRlQPGFFRDFYQQLVKNDAIQ-AIFTKrgLDVL----------KSDKQQWLLREGLDLLISYADEpkspGLHVLSRVAESHSI-YRVGIEMYDGFLEALLVTVRRHDLEfqdP---skddskVIEAAWRRALKPGLDYLKSQRP
2425 >SRR5262245_45185474
2426 ---------------------PTFLEAFYKLFTA-DEVVGKRF--vkFDDI----------EWKRQHGLLQQALDACFDFASLlsmqnlrelpEPNAMTKYVVRHGPgrgNLGITSTEYDAFVEALITTVCGNPGNgqaPYDPecadaerkdVIEFAWRRLMKLIVEHFKKVAR
2427 >GraSoiStandDraft_39_1057311.scaffolds.fasta_scaffold195098_2 # 276 # 1100 # -1 # ID=195098_2;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.692
2428 ----SFDVFEIAKDSFNRCMgadgGALFFKTFYERLLSKLPVP-yaRQLSQkgVGTS----------SSHRQYDMLRQGIFILLQFGQHklyerEPNILSTVAVLHDQhHHNIPPNLYAAFTGALIDTVAGAPPAiptAFDKqcetdmdIITDAWEKALAPGIRYMTEKYF
2429 >tr|M1PA46|M1PA46_9CORY Flavohemoprotein OS=Corynebacterium halotolerans YIM 70093 = DSM 44683 GN=A605_12675 PE=4 SV=1
2430 --------------------SGEFRDEVHRRFYLDVLEARQVFPL--TLR------------ETHVDLASSLAWVLERtssdgtLPDdVLARIRRLGVDHR-RHGFPAEVYPAFLTALRGGLRTVTAEHggVDDPLVDAAGDVFARVCGAMADA--
2431 >tr|A0A097IIH9|A0A097IIH9_9CORY 2-polyprenylphenol hydroxylase OS=Corynebacterium doosanense CAU 212 = DSM 45436 GN=CDOO_12240 PE=4 SV=1
2432 --------------------SEKFRDLVHEQLFSTELQSRQVFPS--SRA------------RSHLDLAPALAWVLERstidarVPDeVMRTARRLGLSHR-RHGFPSEIYTPFADMLVHALREVNFRAdpqLSAGLIIPAETIIRNVCNAMRAS--
2433 >tr|A0A0G3HGP7|A0A0G3HGP7_9CORY Uncharacterized protein OS=Corynebacterium uterequi GN=CUTER_09860 PE=4 SV=1
2434 --------------------PDEFRSRTLTGFFAAEFQARQLFGL--HAT------------QAHDGLPEVIAWALERcgidghVPSeVLDRLQRLALVNR-RFGFAPSAYSSYAEAITTALKDLAYVHfgeVNIlpSQMFAATLALDTCARYMQRA--
2435 >tr|K0YDT0|K0YDT0_9CORY Uncharacterized protein OS=Turicella otitidis ATCC 51513 GN=HMPREF9719_01398 PE=4 SV=1
2436 --------------------RTAFRDATVDYLLRRLPRLRRVAPL--RQR------------HRAEALAERAVGLVARspqgmLRGeDAADLERAGRANR-RLGVPLRVYPVLAQALKAGLRAAFEAAgepYTA-AARDAEALAEAACASLARG--
2437 >SRR6478735_8357209
2438 -----------------------REIAFLVARGLPsKEIAEQLFLSVR---------------TVQNHLQR----IFTKLG-VTSRGEVAGVLQG-LEGPSSX---------------------------------------------
2439 >ERR1712130_811490
2440 ----------------------------------------------EAAlagmKAVEDLGGKFDRTKHGSLFLSVvLTRVVPHLDQrdrVLPYLVELGALHQ-REELQDITLICWVLHIalPSGVWSRVeecVGGYC--TRQPRLGLVWSLPS-------
2441 >SRR5436309_12080688
2442 ------------------------MHRFHAHLEQLNPRLRYHLPP--ALL------------RYVrFELLQAVRQQT--PMEVGSGLRRFGVHLR-AQGFEGPDLDTLGAAWLVALDEVLGDRFDSEAREQWLRFYKVLRSA------
2443 >tr|A0A0N4Y9E2|A0A0N4Y9E2_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1
2444 ----------RIQHSFKTASfhltvnqlrsRPTIGDAILKRAISNRPEMRTFLNRLTE----------QQVEHMGKQFYSLIAVSVENIERpeavryfs-RLPFFAMFETYATlcQLGFRPDYFAPLADAAIAECVKLDGGaHKRCETLLAWSQLISAIFTSVRDGY-
2445 >tr|A0A183LHE9|A0A183LHE9_9TREM Uncharacterized protein OS=Schistosoma margrebowiei PE=3 SV=1
2446 --------------------KIKVGKEIFRQLLIKNPHYMKMYKPLQSVT-LPQALNLDYLTKMAICYVDNIMKIVRNFNEeekLQETVKYLAAIHT-NRGLTVAHFVSILPIFTDTIVSYME---------------------------
2447 >tr|A0A183WH41|A0A183WH41_TRIRE Uncharacterized protein OS=Trichobilharzia regenti PE=4 SV=1
2448 -----------------------------------------MYKPIQSVT-LPQALNSDYLTTMAIRYVDSIVDIVENFNDeenLQQKIKYLAGKHT-NCGLTVAHFVVSLQILCICVHIWQT---------------------------
2449 >ERR1700755_1321676
2450 ------------------------------------------------------LN-SKG-HRQRDELLNALVSILSKYDPdrpdsqpmieLEADAMGWGRRHASfaalggrPA--GPDQYRVVRDVLWQLLIDASDGRWDAGHTEALVDAYHWVQTIMMW---
2451 >tr|A0A0V1KYG9|A0A0V1KYG9_9BILA Uncharacterized protein OS=Trichinella nativa GN=T02_16304 PE=4 SV=1
2452 --SLSAGELKLLRWLWKQMKqvhQGLASAKLFQIIFATCPEIKRFFGL-AKDT-IDMIINSLSYDNE----------------QLAQLMIAFGCQHSFytRRNFDPKYWNVFGDAMLHLVDDLPLKAFKrYRAKSIWFRFVYFVISHMQLGY-
2453 >tr|A0A1I7VKJ4|A0A1I7VKJ4_LOALO Uncharacterized protein OS=Loa loa OX=7209 PE=4 SV=1
2454 ---------------------------------------------------------------------NALKKIIESLKNeqiPYEVLQRISVKHA-RHNIQTHHIQKMIKPLVENVRRALGR-QDENAERAWETLFQTIAII------
2455 >SRR4051812_9951159
2456 MTPLPPEVAQTIRSSCRPLLerQEQFHGDFHASLVDLMPEVPMMREP--A------------GEQVSRWLVECVLWAVNADEPvpmIGATLQGVGLDAH-RLGFPRAGYQAVGHALLRTVRGASQNDWSGTLSSSWIGYHSWLCEYWVSG--
2457 >ERR1711890_22380
2458 -MHLSDTEKSAVVSSWSNVN-SSLLDSVLLQLVQENADMRAAMSR-GDLA-EDSIREQETFKADVTKLTCCITKLVTRLGNTGEVSSCPatCLKNC-P-YLQPKHVPLFISSFCD------KLELTEDAKKGWKFIMEKTAERI-----
2459 >ERR1712018_299478
2460 ----------------SDVA-ENHLEDVLLQLVRENSELRSSFSW-GNLP-EDCLRDDDKFKEDVKRLNTCISKVVDILSSSGDApLACPvsSFTSC-P-YLKSVDMPLFIKCFNS------GNKFSENAKSGWTAIFEMAGKKM-----
2461 >SRR5262249_47865225
2462 ---MNHRQVELVRSSYERIRrvRHLFADLFNRRLTLIAPVLERLLPP--ET------------ARRDAAALELVEFVVAGLDRLDVLLPALAVQARVwrLKGVEAADYDVAGMALAWTVEQVLV---------------------------
2463 >SRR5215470_9720857
2464 ----------EAKRSYRQFArDISFYRELSKRLFRKIPGIEKKFRH-RTM------------EEQYKVLRDSLWLLLSYASapdQqEPTILSRIAHTYA-R--FPKEWFDTFREVILDVVAQRDP-----SSVRAWKHAMAPGLEYL-----
2465 >ERR1719487_1476365
2466 -------YKTILDRCYERMTtqldLVAMVTLFQGIFFGRDIRIQSYFSKP-N-------------ATLRYVVLRIINFLVNVYHkpaAITGELRALGVSHV-KWEIPPDLFVPLGEALFITLEICLGG--------------------------
2467 >ERR1719271_344116
2468 -----------------------IRKDIYSTFFTQAPAGQDYFKQS-N----------TYLHVVADKIMVMTLELYQNPVKMVDDISALGLRHV-GYAIPTELFGPFVSACVEVLMTRTSD---EATIESFRWSLGLTSKML-----
2469 >LSQX01.3.fsa_nt_gb|LSQX01333836.1|_8 # 4697 # 5665 # -1 # ID=41498_8;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.475
2470 -----------------------LRQEFFLNFFKLAPSGQDFFKQS-L----------TRLYFIADKIIELCLEIYRQPRAMVEDISGLGLRHV-GYAIPPELFGPFVGSAVEMFSLATTN---ETAIDGFKWAMQLVSKIL-----
2471 >SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold703673_1 # 2 # 517 # 1 # ID=703673_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.653
2472 -----------------------SSSIIVSSFMRDssrPCRRVRTIKQS-N----------TRLHFIAESATNMSLKLLQDPWRMVDDVSALGLRHV-GYGIPTEMFGPFTEAAVDALRGHVDE---TLALEAFNWSLSIISQML-----
2473 >tr|A0A1I7RTA6|A0A1I7RTA6_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1
2474 -TGMTRHHKMILQKIWMRASeadINECSRNMMSHLLRSNQQLYQMFNLV-GMT-DKEIQQSIPFNRQAANFAMVFDFVITNLTDdlnrVAFALEFLGQHHA-DLGFTIdqPFWALFNRVFEDNPPKLV--FQNPEGHQVWKLMVNFVVRQVKNGY-
2475 >tr|E3MDQ4|E3MDQ4_CAERE CRE-GLB-31 protein OS=Caenorhabditis remanei OX=31234 GN=Cre-glb-31 PE=4 SV=1
2476 -------DVERIRAVWMDhINgNDDYFQEVIHRICKRNDGIRCAMLTQnAQHA-ESAAEEDFVLSNIADRISQFFHQLIEddvllNTVELKKCCYDLGRQHS-AYSkkqFKISFWEEFTLTMMDVLEQNYP-QTTKEEQKAWLHFQRFVNENMLDGY-
2477 >tr|A0A0B2VIR8|A0A0B2VIR8_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_08540 PE=4 SV=1
2478 ---QTSTRIALLQSSWTSVQtmtSGQFGARIVYSMLRKDPSLFDVFTTVqydgeetplrqtsgliarfynfGSIPdktppnngEetplrqtsgliarkSFDLLTCPQYYEVGDRIMNFMGELIQMMQDgqseqaIIERIRLVGATHY-ERNVmfSSCVWREFKASTLAIVGESTFEseSIRVETLKAWSSFVSLIIREMKNG--
2479 >tr|A0A1Y5SIU2|A0A1Y5SIU2_9RHOB Uncharacterized protein OS=Roseisalinus antarcticus OX=254357 GN=ROA7023_01630 PE=4 SV=1
2480 -------QAELVADSLSRVGdkVIWLASDYYEALFDASPQLHGVLPH--QM------------SEQTNMLGHALAHALANLRDpdgAAPMAQDAGLADR-SARMPPRMRRTIVRTLVHALSLWHGPTWTKDHARAWNEGLLGVA--------
2481 >tr|A0A0N5CYF2|A0A0N5CYF2_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1
2482 --ALSTVQRQIVKECMDKA-KDDIAERIYRRIFERRSDFRKFILA---LPD-------KQRWALTDSLHNYLKSAVNQIKDgsaVRKISEDFGAFHVQyrSFGFRPDFFVSTADAVTTEFVLLDAaVHQASDTLCAWSTLTGFMFSSVRDGY-
2483 >tr|A0A0K6SA08|A0A0K6SA08_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_8920.t1.CR2 PE=3 SV=1
2484 ---------------------AAMAEKFFELVPKRAPNLRMIFEKRQDI-----------YKHHFGEI---TKRLLAYLDSpeeVWKEDPELAIKHI-EFGVMPCDVPVFANVFLQILAELAGPAWTQRHRDTWDKLFSIVSGALA----
2485 >tr|A0A0G4H7J1|A0A0G4H7J1_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_24983 PE=3 SV=1
2486 ---------------------AVFSREFFKRLSTFAPSVHAVFVKSEEK-----------YTRTIKDL---LGRLLAYIDDpsaIWSDDEELAMRHV-IFGVMPTDIPLYNRVMVQTMAGIAGGEWNLQHDAVWTKMMGLATETLS----
2487 >SRR5215468_7630418
2488 ----SPEVMRVIRFSAGLLAelQDMFVRQLHSEVTALIPGLAA------NG------------RIFCERMVRSLLWAATAgqpPHAAAGALRQVGAANR-RDGFPEERYADVARALVLALRNVSGSSWDNSIGSAWISYFRWAEPHLRAG--
2489 >SRR5215469_6664897
2490 ----APAAGRVGCQSAIRLSrnQDAFIRQLYDDFKELDPDSaqtqAP------DL------------LVFCERMVRALLWVALTdqpLRVVADELRQVGAQNW-YES-------------------------------------------------
2491 >SRR4051812_31756681
2492 ----APSVMRLLASCTADLGpqQPELAEALYQRLLELLPEVatlAE------RG------------RPLSDRILHAVLYPTEPgrtPLNVATVVQQVGAQNY-LDGLVGEHYSSVTHAVLHAAREMYRGEWSSALSSAWVEYLLWLRGHLLAG--
2493 >ERR1712232_311801
2494 --------------------RREMSMAIWNRMFKKDPEAERVFKQ-SN----------ERLIFIVEKAFENAAKIYQSPSETREYIQGFLVLMK-LLLMAL--LGRFLSSRAPWL--------------------------------
2495 >tr|A0A0G4HD16|A0A0G4HD16_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6316 PE=4 SV=1
2496 ---LTFEQKeEIVRSAWTTLSstyqLQEIGRVLYETICEEAPGLSSRYTKPGE--------------VMALRFGEMLATLIHlfldFPNDLQQKMEELAIRHV-NYNVDLEYLPVFEISILRTVQELYCeGEFDVEVAT------------------
2497 >tr|A0A2W4YK05|A0A2W4YK05_9SPHN Uncharacterized protein OS=Altererythrobacter marensis OX=543877 GN=DI636_06370 PE=4 SV=1
2498 --------AALIERGLERAAqqLGDITPLVMREFYRRIPEAEASFRHH-APHDPH--------GLEAEMVGNTLHYIMRWHEAPmeiRIDMDTSVPHHRVALDVPPDWYRGMIEAAIDVILSSVPSSA-SDERTAWKQLRDQLVSL------
2499 >tr|A0A1Y6FH01|A0A1Y6FH01_9SPHN Uncharacterized protein OS=Altererythrobacter xiamenensis OX=1316679 GN=SAMN06297468_2444 PE=4 SV=1
2500 --------STLAERSFERLAeqRGDITQDVLERYYRRYPDGRASFEHH-GLGNRA--------ELEGRMVSTTAFLLMQWAQDPggtRIEQGTTIVHHQDTLEIGPRLYLGLIDAVLEVLFETIPDES-AEERAFWLSLRGEIADF------
2501 >tr|A0A2E8LSZ4|A0A2E8LSZ4_9ACTN Uncharacterized protein OS=Actinobacteria bacterium OX=1883427 GN=CL510_01665 PE=4 SV=1
2502 --------SELAQRSLERLSevGGDVTRPVLDAYYARHPDARASFEHH-GLGHTA--------ELEGRMVAESLYLLLTWIEDPataRIDHGTAIVHHNDSLHIPPRWYLGLVDAALDVLLRTVPEDS-PDERALWVALREEFAAF------
2503 >tr|A0A1E4JTP1|A0A1E4JTP1_9SPHN Uncharacterized protein OS=Sphingopyxis sp. SCN 67-31 OX=1660142 GN=ABS88_06340 PE=4 SV=1
2504 --------LELLDRSLTRAAdaIGDITPVVMARYYARHPDAAASFERH-GMGRTS--------ALEHEMVDNCLYCLMYCLERPteiEILLENSVPHHQFTLQVSFDWYRGLVDATIDVIAESVPADA-ADERQVWDEIRSVLGGV------
2505 >tr|A0A2E0VIY1|A0A2E0VIY1_9GAMM Uncharacterized protein OS=Porticoccaceae bacterium OX=2026782 GN=CMK32_09515 PE=4 SV=1
2506 --------NDLILNSFESAAesLGDITPHVYRRFFLQYPEAESLFNIK-GAQFQD--------ELKVQMVRDAIYAYLEYLETPeevEIVFKYTIPQHV-DLDIPIRYFIALLEAVADVVCDSVDDRTQADTKASWSELLQEFRQM------
2507 >ERR1711865_325941
2508 ---------------------SQFGLNAFNRLFDTEPRSEDHFKT-SN----------A---RLSMLATKSLELSMQMYKEptrVMNEVTSLGLRYI-FPAHD-----------------------------------------------
2509 >SRR2546421_6426420
2510 ------------------------------XMIRRPPRstlfPYTTLFR-SDF------------ERQNKLLRHAFGLLLIFPNQartEPSVLTRVAERHSRrDLDIPRSEEHTSElqsRSDLVCRLLLEKKK-KNQV--------------------
2511 >tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae GN=mphP PE=4 SV=1
2512 --------------------VTAHSIQAVADELRAHraeFIQAANQ------------------KPD-SPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD--------------------------
2513 >tr|D9QCQ3|D9QCQ3_CORP2 Oxidoreductase OS=Corynebacterium pseudotuberculosis (strain C231) GN=CpC231_1874 PE=4 SV=1
2514 --------------------KDAFHTQVFANF--YHsnPYARATI------------------APS-EQLVPAVISLIGHLENngfisdeVKQKFLEHTKLLD-ARGF--HHYTALASAVRSALQTMCTD--------------------------
2515 >ERR1719474_106261
2516 ----STASLELVLDFWRCTVhrlsvhdRAMMGGDLFRGMSRQDAACRALLESL--N------PTSERMDLWGLRFLDTTGWMLRRANaaDLDASLKAMGAEDR-ARGLTVAYYRVLVERLHSELAARFPTKYSETVQAAMEEVIWSFVRR------
2517 >ERR1719499_858439
2518 ------------------------GRAIIEGMNHE-------------N------TSPNQMDMRTVRLLDTLGWMIRMSciPtmDLKVLYAAWNGMAA-EVGYSAEYHVSWIQYIEAQLTERFPSEYTDSVRSAVRELLRWSIPN------
2519 >ERR1719410_2598304
2520 -------------------------------------------------------------PSHALKILNVFGYVIRNLIHpsnhlkLFKQLQSLGTVHR-AHSLNNEMYEAMLKSFNYAMEEKFANHYKIRIRFCLSQLYRVIVDIMTG---
2521 >ERR1719216_785110
2522 -------------------------------------------------------------PKHTIKIITTFGYIIKNLIYskehtkIFKQLQSLGEMHQ-CHSMInTDIYMELLNAWHFAMEEKFQNKYKNNTRFCFNQLYRLIVDTLMG---
2523 >tr|E0VF51|E0VF51_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236397 PE=3 SV=1
2524 --------VKIVTPTWESIKedFDWYCTKIEETFFQNDTTKKELFTL-PKFEeELTDDVVNKRLFKHSSAVLNFMECIVQFMNGneeTKPVLFVLGRNHY-TIGVNEKLFLEMKDAICSVIKYKIG----TENAKAWDTILQYI---------
2525 >tr|A0A0M3IFG8|A0A0M3IFG8_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=3 SV=1
2526 -TGLSMHQKAILTARWRQLPqgiVFDLGKRVFGTLFQKDPNLLVVINL-EHLQGTDAWRDHVNFHMHAQRFTHALSQCMRHLVEpivAADRLQEFGATYAEmedsenfnRSRIPHSYWDRLISAMTSTAKEFHEnpsqksrrnslsvddalvatnerldLQIDSANISAWSALATFVSNQIRFGYE
2527 >ERR1719199_711328
2528 ---FKPSHISLIQNQMSALIsefgsIEGAGEFLITQICALDEYVAKLFSG-AAL------------RVQGFKFLGQIARWVTYLADpetVEADLYNLGIRHL-GY-VTQQDFAKFLPaviqCMQKSLKDVLDEQWSALAAESWKMFLGYAGGH------
2529 >ERR1712070_698694
2530 ---------------------------------------------------------------LCFIIARVIDIAAQlfvEPDVCIAEVLQLGLRHI-MYKVPADFFGPFAGIIADEIEARCD---------------------------
2531 >sp|O76243|GLBB_CERLA Body wall hemoglobin OS=Cerebratulus lacteus OX=6221 PE=1 SV=3
2532 -----------------------VVDAFYVELFTAHPQYQDRFA-FKGVA-LGDLKGNAAYQTQASKTVDYITAALAGSAD----AAGLASRHV-GRNVGAPEFTHAKACLAKACA-------------------------------
2533 >tr|A0A2C9LKZ0|A0A2C9LKZ0_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106051185 PE=4 SV=1
2534 --GISLADIKVITNQWEDVLrcSDLFGKLLVLYVLDNCPKVNALHPGLHAR--LTDARD-SVEKQIGLRVIQSISCVIHNLNRapaVESMVRDTFKKLQ-QHGYTKNTILECSEAFLSFMNQYFSKRWLKQHSDAWFKVLKALL--------
2535 >SRR5690606_9602430
2536 -------------------------RAFYPILYSSVSGAQELFEA--TVG------------TDNRKMLQILAKLFGfisNVNhsSefMkSDAFIERGKYYA-DHGISETMMRGFSSALVLTLRRTLGELFTISHVRAWGIFLDTISHAL-----
2537 >SRR4051812_40179264
2538 -------------------------RIFFPILYSTVPSSQELIEE--AVG------------TDSIKMLQLLVKIFRiisDINhdPevMkSEAFLERGKFYA-DHNISENMLRGFNSALTLSLRRSLGERFTISHVRAWGAFLEMISHSL-----
2539 >SRR5690242_7041980
2540 -------------------------RAFYPILFSTVSSSQEIFEE--HIG------------SDQTRMTETLRHVLEffiSVNlnPqiLsSDKVIERAKKYA-DLGISENMLKGFSFSFLKALKQVLGGALSAEAMREMVRLLDNISIQI-----
2541 >tr|A0A0G4HHE4|A0A0G4HHE4_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6802 PE=3 SV=1
2542 -------------------------DALLGILFEASPTMRSVFVKNGDL--------------YADLIEHLLRRIIAYADDpgaLWTDDQHLALDHI-NFGMSMSDLPLFGASLMNCLAGVLGENWCDEWQRAWEKAWQICCQSL-----