view test-data/multimer_output/msas/B/bfd_uniclust_hits.a3m @ 11:81d1ef460bb3 draft

planemo upload for repository https://github.com/usegalaxy-au/tools-au commit 01b5c58f98b79c9b11757013dc6e69ea06dbd709
author galaxy-australia
date Fri, 16 Sep 2022 05:13:45 +0000
parents 3bd420ec162d
children
line wrap: on
line source

>chain_B
MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH
>ERR1719244_1811598
MVQWSDDETKAIQMIWNSVDVNELGPAALRRCLLVYPWTQRYFGKFGDIATPTAIMQNPGVAQHGITVMNGLKLAGGPGGGPGNQPGGQQELWQRGKQQGQQQLWQQGQHGGKQRGqqQRQGQq-PSPRQSX------------------
>tr|W5MMD7|W5MMD7_LEPOC Uncharacterized protein OS=Lepisosteus oculatus OX=7918 PE=3 SV=1
MVTLTAEDKNNIRHVWGMVYKDPEGngAVVVIRLFTDHPETKQYFKRFKNLDTLEQMQTNPRIKLHGKRVMNTLNQVIDNLDDWAavkEILTALAERHRDVHKIHIHNFKLLFDVIIKVYGEALGPAFTDAACESWSKVFQLLYSFLQSVYT
>tr|G3WE01|G3WE01_SARHA Hemoglobin subunit mu OS=Sarcophilus harrisii OX=9305 GN=HBM PE=3 SV=1
--MFSAEEQSHIVQIWNYLsgHEAIFGTELLQRLFTVYPSTKSYFPPL-IPG-----LELTQMQNHGEQILMAVGVAVDNMYDLRTALSGLADLHAYGLRVEPTNFHFLIHCFQVMLASHLQSEYTAEMHAAWDKFLTNVAVVLTEKYH
>tr|W5PMJ4|W5PMJ4_SHEEP Uncharacterized protein OS=Ovis aries OX=9940 PE=3 SV=1
--SLTRAERTIVVSMWSKIstQADVIGTETLERRVTCVSRGPA-P----GSP------QS-------rgRREAGRKGRNDLEtggqgegAGRTGQRLL-RSRLRACTLSF---PPQFLSHCLLVTLASHFPADFTADAHAAWDKFLSLVSGVLTEKYR
>tr|A0A1K0GGD5|A0A1K0GGD5_RAT Globin d1 OS=Rattus norvegicus GN=Glnd1 PE=3 SV=1
----------------------MYGLEKEp-R------------ETEGCLS---RKLPSNLQRSSAPWRLHGFQNLLERSQGA--------QRAKPG------------HGAHSHSSVKMAL--SQTDH------------------rlvL
>ERR1719474_978995
--------------------------------LLQSSWKQ--FRT----------------------------------------FASLSGIRQEELGAGCQHQDLP----------QIQHHLWISEPSTFQQLLtftrsiktftnhylnirclflqmflslrgCVNKDSASRKKH
>ERR1719336_830457
----------------------------------------------------------------------------------SINPQSTVDLGAQYISATPLNYKNHQDIYNSLLSNG------VLVPANVSLIEGMRQDRIDEGEE
>tr|F6XB67|F6XB67_XENTR Uncharacterized protein OS=Xenopus tropicalis PE=3 SV=1
-MILSEAEKAAILSLWAKAsgNVNALGAEALERILYIWQNLFSYLESP-VI---L-----KILQTGKGASVYKIR-GLDHLSTKHSILPLL-TVKKCLCLRDAGFKILLSHAIEVTLAVHFPDDFDATAQAAWDKFLAAISTALTSQYR
>tr|A0A1L8EXG7|A0A1L8EXG7_XENLA Uncharacterized protein OS=Xenopus laevis GN=XELAEV_18045093mg PE=3 SV=1
-MSLSQAEKTLILAFWNKASglINTIGPQIVNRLLLAYPQLKTHFGNF-NVTPGS-----SDLNTLGIKIITAVGGATQHMDDLPVHLAILTDLHSLTLRIDPGNYKLMIDCIVISMAASLPQDFTAEVQNAMTNFLIIIGDILASKFC
>SRR5260364_139532 
------------T----VLapDPnPTPHSASPRRMFLSFPTTKTYFPHF-DLSHGS-----AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKVSGGPGAIWVEGRDGAFLAGQRITRvAGGVAQAAAAGLGPRPH
>tr|A0A096M318|A0A096M318_POEFO Uncharacterized protein OS=Poecilia formosa OX=48698 PE=3 SV=1
------HDELIITGVFFTSVSECVPP-----VRNIYRQTTNSIENIGNFKNGETFLTNPPVALYVVNMVEFTSKPLMS-LPLNGFYGILDFLK--AKRKNPNGGKLLADCLTIVIASKMGSGFTPEIQATFQKFLAVVVSALGKQYH
>tr|A0A146TSR5|A0A146TSR5_FUNHE Hemoglobin cathodic subunit beta (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
IFHFIYFYLSTIHYIFSKIYSFFFFPSSLSIFLIFYPFTHIYFFIFFNLYNSSSITSNPNFSSHFNFFLSFLYKSFNNIYYINTTYKYLIFLHSYKLQFYPYNFNLLSYFLTIFLSFHIFSSFTP----------------------
>tr|A0A146Z291|A0A146Z291_FUNHE Hemoglobin subunit epsilon (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
IFYFSYHYLIIITSIFSNLYYNYFFPNSLIIFLIFYPFTHIYFSNFFNLYNSYSINTNPNIQSHFTNFLHFLYLSFNNIYNINFTYSYFIFLHSYNLHFYPYNFNLLSYFFTIFISSNIFSVIKE----------------------
>tr|H3B4U9|H3B4U9_LATCH Cytoglobin OS=Latimeria chalumnae OX=7897 GN=CYGB PE=3 SV=1
--QLSDTEVESIRQIWSNVytNCENVGVLVLIRFFVNFPSAKQYFSQFRHLEDPLDMERSVQLRKHARRVMGAINTVVENVEDQDKiasVLAPVGKAHALKHKVEPVYFKILSGVILEILAEEYAQHFTPEVQKAWTKLMSIICCHVTATY-
>tr|L8HVQ9|L8HVQ9_9CETA Cytoglobin OS=Bos mutus OX=72004 GN=M91_06698 PE=3 SV=1
--ELSEAERKAVQATWARLyaNCEDVGVAILVRNRFWRkKRASSTLEEFQegaqgrdsslGSSQAQKQPGCPQLRKHACRVMGALNTVVENLHDPEKvssVLSLVGKAHALKHKVEPVYFKILSGVILEVIAEEFANDFPPETQRAWAKLRGLIYSHVTAAY-
>ERR1711977_7585
-MSLSAKDKTLVKKLWEKAEgkSADIGAEALGRMLVAYPQTKTYFSQWGSDLNPQ----HPQVKKHGAVIMGGVGKAVKNIDDLVRGMGALSELHAFKLRVDPANFKILAHNIIWSWPCTSLQTSPPRPTCPLTSSCRTWLWLCPRDT-
>tr|A0A1C4HCU8|A0A1C4HCU8_PROAN Myoglobin (Fragment) OS=Protopterus annectens OX=7888 GN=Mb3 PE=2 SV=1
--MASAAQWDTTLKFWEAhVagDLKKHGHEALVRLFLKNKDSQKHFPKFKDLASEAEMRGSDGLKNHGETVFTALGKALQQRDGIANELRPLAVTHSQNHKIPLEEFENICEVIDVYLAEICPD-YAGETRTSVKAVLDVFSQSMTTLY-
>tr|A0A146P967|A0A146P967_FUNHE Hemoglobin subunit alpha OS=Fundulus heteroclitus PE=3 SV=1
---LSKKEKKLIKDIWERLTpvAEDIGSEALLRMFTSYPGTKTYFSHL-DISPGS-----AHLNSHGKKIVLAIAGGAKDISQLTVTLAPLQTLHAYQLRIDPTNFKSCFHTVCLSRWpvTWAKSSL----RLHTQQWTSTCQPLQPCSL-
>tr|A0A146QLZ2|A0A146QLZ2_FUNHE Hemoglobin subunit alpha-2 (Fragment) OS=Fundulus heteroclitus OX=8078 PE=4 SV=1
NIILTSNYNYTFNTFFSKFssNSYSIFSYSLSIILFFYPHTNTYFSHFNYLIPFS-----SPFNNHLstfiflfsxxxXXVMGGVEDDVEKIENMKEGIIRISEMNELNMRVEKEKLKIMEKKIIVV---------------------------------
>tr|A0A024R1G3|A0A024R1G3_HUMAN Myoglobin OS=Homo sapiens GN=MB PE=3 SV=1
AMGLSDGEWQLVLNVWGKVeaDIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNY-
>tr|M3YM80|M3YM80_MUSPF Myoglobin OS=Mustela putorius furo GN=MB PE=3 SV=1
-MGLSDGEWQLVLNVWGKVeaDLAGHGQAVLISLCQGLESRKEEKKRDPAHACVSSRRslfVSQDLLFHSDAFLVSLGHRSflaPVSGENGQSQKTQPAHHAQHHRQPWNTEKFISDAIIQVLQSKHAGDFGAEAQAAMKKALELFRNDIAAKY-
>tr|A0A1Z5LBJ2|A0A1Z5LBJ2_ORNMO Uncharacterized protein (Fragment) OS=Ornithodoros moubata OX=6938 PE=3 SV=1
--ALSAAERALLRALWKKLgcNVGVYATEALERTLEAFPRTKIYFSHM-DLSP-----GSAQVRAHGQSPRPQGGRRADPRRRPPGRPArrpVRSERpARAHAARGPPPLRAAGPLSAGDPRPALPWRLRPRH--------------------
>tr|S4RW14|S4RW14_PETMA Uncharacterized protein OS=Petromyzon marinus PE=3 SV=1
--ALSGAEKAAIADSWKAVysNYEEAGKAILIKFFTSNPGVQDFFPKFKGLDSADQLSKSAAVRWHAERIINAVNDAVVALDDpekLSLKLKALSKKHAQEFNVDPQYFKVLAVNIVEGVSSA-NGGLGAEAQAAWEKFLSQVSILLKSQY-
>tr|Q9Y0D5|Q9Y0D5_MYXGL Hemoglobin OS=Myxine glutinosa GN=Hb PE=2 SV=1
--RTTEGERAAVRASWAVLmkDYEHAGVQILDKFFKANPAAKPFFTKMKDLHTLEDLASSADARWHVERIIQAVNFAVINIEDrekLSNKFVKLSQDHIEEFHVtDPQYFMILSQTILDEVEKR-NGGLSGEGKSGWHKVMTIICKMLKSKY-
>tr|A0A1W0WKD0|A0A1W0WKD0_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_10224 PE=3 SV=1
--GLTSNHIKAVRANWKLIekRLPEYGLELFVAYLNKHPDWIGLLPFLKPADMPR-LQQTPRLKAHGTIVLKKLGELLTMLDSppkLIGELLKQGSTHR-ARGLAPENFQAIQHDLNELFVKICGPE---FDIEGWDAVLTLIMTGIEEGL-
>tr|K4FYM0|K4FYM0_CALMI Hemoglobin subunit alpha OS=Callorhinchus milii OX=7868 PE=2 SV=1
---LSKTDKALLSSSVGKIQAQATGSDVLARMFASFPQTKVYFVGFSDYTA-----KGPRVQKHGLTVMTKIIEGIQYLDSLRSFLDALSAKHAHELMVDPVNFGFLGECVLSSLAYQLPD-FSPEMHCAWDKYLCEFAYLLAEKYR
>tr|H9GUN8|H9GUN8_ANOCA Uncharacterized protein OS=Anolis carolinensis GN=LOC103282340 PE=3 SV=1
--KMTDLDRRHIREIWTAAfeNPEENGRLVIIRFFSDYPASKQYFK---TVPTDGDLKAHPQVAFHGRRIMVAFSQVIENMENWNQACVlleRLVNNHKNIHQVPSGMFQLLFQAMLCTFDDLLGRTFTPEKRVSWEKFFQVIQEEVEAAYD
>tr|H2YFM6|H2YFM6_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1
--SLTTEEVITLRTTWAEiskLGNATVGLAVLHRLFNDCPEVRPFFGSMlppSELSDMDSLKSNPKVVDHASRVALSINNIIQLLEntdELVSYLSFLGKVHG-ERSIPAKHFSDMGPVLLAVISAVLREDLEGVVMQTWAKAYGAIEAGI-----
>UPI000197D711 status=active
---LTPKDIYEAKQCWNKAAslgVNKVGVLLFKNIFTIAPEAAKAF-SFGNDP---NFMNNKEMEEHGVKVVMAFDHAVRSLDNIHalqETADGLRDTHSFF-NLSPEHHVIVKEALLQTLKQGLGDEFTDAQRELWNGIYTAIRNMW-----
>KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold119418_1 # 1 # 498 # 1 # ID=119418_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510
---ISPLKLRLVQSSWRQASaDEQAGITAFKFFFEMEPVAIGMF-GLQDIR---DLYNSYELKRIAAKIVKAMTHIVNSFDNFEglrPLIKKLGMMHGEK-GVSPSQYNNFGKAFMQTVEEILGDQFTPETRRAWETFFRILTGAL-----
>tr|A0A146PHJ5|A0A146PHJ5_FUNHE Hemoglobin cathodic subunit beta OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
-----------------------------ASWFCGFHWTQRYFPHIWRPLPPPAIAAKFPKGAAWKTVMGGLEIAVKNIGQHKAAYAKLSVMHSEKLHVDPTTSGFLLNASQWVWLPSLPPRLHPWFPGGWQKFR------------
>tr|A0A1E7FQE1|A0A1E7FQE1_9STRA Neuroglobin OS=Fragilariopsis cylindrus CCMP1102 OX=635003 GN=Ngb1 PE=3 SV=1
--------MALVVESWAKIKEIENyeevaGELLFRRIFEIKPDAAAYFKFTDGFETTDeALYKQEVFIKHVKMVILTVTSAVDLLEkeNMdelFRMLKLLGAKH-LSagLKLEKEHYNLVGMALLDTLGKALGDTFTEAVKSAWIGVYAIIASKM-----
>tr|A0A150AR53|A0A150AR53_9BACT Uncharacterized protein OS=Flammeovirga sp. SJP92 OX=1775430 GN=AVL50_01545 PE=4 SV=1
---VSNKQIELVQNSFTLITphRGQVSELFFSKLFKIDSSLESSLMV--DPK------------DQERRLIPMLSAVVNGLVDfelIIPILQDFGRTHV-EYNIQEKHYEAVQKALFYALQTVLQEKWTSEVDDAWSNIFSVLTNIMKE---
>tr|A0A1Q9P386|A0A1Q9P386_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=hmp PE=4 SV=1
---FSNNDIRVIDELWDLILpiKETITDSFYATLFSLDRTIKPMFKT--DLG------------VQGLRLTDTLTFIIKHMGNiedTIQIVKELGVKHL-EYGTKPYHYDLVLEALLETFDKHLEEKFNSEMRLCWIKLYKFLSELMML---
>tr|A0A1G1B2A9|A0A1G1B2A9_9PROT Uncharacterized protein OS=Methylotenera sp. RIFCSPLOWO2_02_FULL_45_14 OX=1801615 GN=A3I83_03315 PE=3 SV=1
---MTPMQIDVVQSTWQKVMpfREDIACLFYKRLFEIEPELSMVFKG--DMH------------DCVKKIMFMIDLAILNLGQleeVMPMLQEIGNKYV-QCGMKVDS-NAVRNTLVSTLEQRLGETFTVNVRSDWIQAYDLLVGVMKD---
>sp|Q7SID0|GLBF1_EPTBU Globin-F1 OS=Eptatretus burgeri OX=7764 PE=1 SV=1
--TLTDGDKKAINKIWPKIykEYEQYSLNILLRFLKCFPQAQASFPKFSTKK--SNLEQDPEVKHQAVVIFNKVNEIINSMDNqeeIIKSLKDLSQKHKTVFKVDSIWFKELSSIFVSTIDGG----------AEFEKLFSIICILLRSAY-
>tr|K1QF07|K1QF07_CRAGI Neuroglobin OS=Crassostrea gigas GN=CGI_10026082 PE=3 SV=1
--TISEDEKRLVKDSWNLFVsrgdFSDTGSHMYKVLLQDNPHLKTLFSFMKVNGa----PFDSPMFKSHVRNVFTVIGDAVNHIDDLDSLspiLKDLGVKHQ-GYGAKKEYLEPVGNALLCTIEKHLEDDFTQEVHSAWRTFFAVMSYSFA----
>tr|Q3MQ26|Q3MQ26_SPISO Nerve hemoglobin OS=Spisula solidissima OX=6584 GN=nHb PE=2 SV=1
--KLTKAEKDAVANSWAALKQdwKTIGADFFVKLFETYPNIKAYFKSFDNMDMSE-IKQSPKLRAHSINFCHGLNsfiQSLDEPDVLVILVQKLTVNHFRR-KIAVDRFQEAFALYVSYAQD---HAKfDDFTAAAWTKTLKVVADVI-----
>SRR3989338_1269240 
--DFNDEEIDIIKDTWDAVLYPey---PEEGfnPVLNFSTKFYRRVFehencknlfeE--V------------DMTSQGEKLVKILSVLLVAVQTkslnqdHIHVLRKMGERHRG-YGVSDDMYEIIGGCLLRTLSEVCADVWDDDAKVVWAKLFGVVSEQM-----
>tr|A0A2G8K001|A0A2G8K001_STIJA Globin D, coelomic OS=Stichopus japonicus GN=BSL78_21829 PE=4 SV=1
TAQLSEVEKNLIRSSWEQAlkNKKVFGVNVFIKLFIQNPSSQDLFEQLRGIPLE-DLKTHRKMKAHALRVMASLNTLVEQIDEVEiltEMFNNVARTHV-IHKVEKAHYDLLGQVLMEVFSEELGAKFDSATKGAWLKAYVIMENIILDKY-
>ERR1712150_314552
MTALTEERKLHIKSSWSSVndDvdLAGNGVEFLVKLFTDFPEYMTFFPAFDGKTPE-EIRSSPKAKMHGKVLMTTLDKIVANLDDLEtviASLHRVVGSHF-PRGVTASHFKATLECFGSFLAVQLGDAFNNDVKNAWGVAVQILASVMEAEY-
>tr|A0A132AHZ9|A0A132AHZ9_SARSC Cytoglobin-1-like protein OS=Sarcoptes scabiei GN=QR98_0086180 PE=3 SV=1
-MSLTNRDKEIIVSTWSLIrkDSDQAGIHLFKRFFEANPDYVKYFP-FGDLdDLE-KILVDPRLKWHASRVMAALSTIVDNLDDPVcfeDSLQKVLSSHL-NRKIQLYHFENLKKALVCLFMDKLGpDIMNDETIEAWSKAYDVILDTYRSRL-
>sp|Q8T7J9|GLB_YOLEI Globin OS=Yoldia eightsii PE=1 SV=1
-MSFSAAQVDTVRSNWCSMtaDIDAAGYRIFELLFQRNPDYQSKFKAFKGLAVS-ALKGNPNAEKHIRIVLGGLGRILGALNTPEldVIYKEMASNHK-PRGVMKQQFKDMGQAIVTALSEIQSKSGGSFDRATWEALFESVANGIGQYQ-
>sp|P0C227|GLB_NERAL Globin OS=Nerita albicilla PE=1 SV=1
LKSLSADQKAAIKSSWAAFaaDITGNGSNVLVQFFKDYPGDQSYFKKFDGKKPD-ELKGDAQLATHASQVFGSLNNMIDSMDDPDkmvGLLCKNASDHI-PRGVRQQQYKELFSTLMNYMQSLPGANVAGDTKAAWDKALNAMANIIDAEQ-
>tr|A0A1B6EVA8|A0A1B6EVA8_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.22480 PE=3 SV=1
LEVITERDKYLAREVWMQVETNyvLISKSLFTNWITEFPEHLNFFKGLLD-SSYDDFLTSPKFEQHMANsVLPNVGIMISNLDRptdFRRHILKLAWIHIRKNiALKIDHFNILKGLILRTLKESLGRGIGRDHEVAMFKVITAGFNLFS----
>ERR1719240_1900674
-----------------AVArvlVHGL-ANLHRRALERLDLLLELVDAHRVVVL-RLLHRLdgrldrlHVLRRHLVLVLE------EG---------LLGAVHR-RVGLILH----------LHLRLAIGVRRGE----------------------
>tr|A0A224XVH8|A0A224XVH8_9HEMI Putative hemoglobin-like flavoprotein (Fragment) OS=Panstrongylus lignarius PE=3 SV=1
DIGVCNEDVAGIKETWQTVYNDkEnSGIFLFQVMFEMYPDYEKYFVRFRT-EGQKSLFDNPKFINHVKnRVMDALNDVIVNLENDErlvNILETVGENHK-KRNLRKQEFDNIGKVVIETLRRALGTSFTPKLEEAWTKVINCAMETIGK---
>tr|A0A1B6KZX4|A0A1B6KZX4_9HEMI Uncharacterized protein (Fragment) OS=Graphocephala atropunctata GN=g.7772 PE=3 SV=1
YFHLSLEDKRLAREAWYnNVEGNyViVAKAVFKELFRRAPQAYNFFKHLVD-VNERDMFESPRFKRHMVqRLMVALETIFYNVYWNDvfeNHMYDQGRKHK-KRGVQPAHVKLLLCVIV-----------------------------------
>tr|R7TS60|R7TS60_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_200756 PE=3 SV=1
-TFLTDEEVEILKASWNDLNddsdLSSIGKRVFLQAFEMRPEMKKIFP-FDNCWGD-KLLQHPKFQAHAQSFMVIIENSVEQVDNESSDFsdslTLLGQSHSDRIGFTRENVQVFLKAILAVWHDLLKS-SDDRTEKIWSKFLAHVVQIMRNGY-
>tr|A0A0X3PJM2|A0A0X3PJM2_SCHSO Globin OS=Schistocephalus solidus OX=70667 GN=GLB PE=3 SV=1
--QLTEVQKTQLCVEWKQICKNKedkyaLGTEVFRLLFTKYPHYIRLFKRFRDLPNLDSIMQSAAFKAHAMRFIGAIDAIMENLDDescLVELLKRLAEEHRPR-GITENDFYKTLDVAYDALSPALKsDDARVALRQLFDTALSVIRQSL-----
>sp|P02214|GLB_BUSCA Globin OS=Busycotypus canaliculatus OX=57622 PE=1 SV=1
--GLDGAQKTALKESWKVLGADGptmmkNGSLLFGLLFKTYPDTKKHFKHFDDA-TFAAMDTTGVGKAHGVAVFSGLGSMICSIDDddcVBGLAKKLSRNHLAR-GVSAADFKLLEAVFKZFLDEATQRKATDAQKDADGALLTMLIKAH-----
>ERR1719239_1832466
--GLSEKDLVLIRGSWGMLgdlkTRKAHGVELFIQLFRAYPYMCeEYFPWFNDMSDEE-LRTSRKMKAHAHNVMNNIGSYVEVCDDPESlvaLIGKMAETHIP-RNVKALQFKELGDMFLPYLVSMMGAAATTDVQEAWRRLLAALVAVVSQ---
>tr|A0A1I8JIG1|A0A1I8JIG1_9PLAT Uncharacterized protein OS=Macrostomum lignano GN=BOX15_Mlig002954g1 PE=3 SV=1
--MLNEVEKKIILSGWQQAikDKKALGMDVFMTLFEMFPQHQELFRDFKGKSRAE-LEKMPKMRAHALRVVNTLDGAIQSLDDMEVcasSLELIGASHKS-HHLSAKHFEDLNAALAVVFERRLGKA-FVDNKAVWVKLLQGIIPVIQR---
>tr|A7RZB2|A7RZB2_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g204383 PE=3 SV=1
-IPLDAKETQLVRKTWAILGDRqvEVGKSLFLRFFEEHPTSKDLFPEFRNISNEK-IAESPALYGHARRVMKSVDNAVASIENVQVysaYLYELGTRHQ-TRQLSEEQLKFMGGAFLFAMRLHLRKEWSRATSKAWEKIFSFMADAMMR---
>WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1887876_1 # 1 # 366 # -1 # ID=1887876_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459
-LPVSDENKDILRESWKRLEEEktTLCKNVFIRLLQLNPNLQDTFPSFKGVALDE-LMNSRSLFLHSKRLMEALEIAISSLDDGQDfteYLTHLGERHT-AISITENHFKIMEKALIFALKDMLGESCTEDVANAWREFFQSMAGTMLA---
>ERR1719401_2606804
----------------------------------------------------------QKYQAQGSRSQ---GG---ELS-RRrcvPPAQSRRA----RAGLAGDghqahclWHPPGERSEIRGSLRCCGEGSDPKLEMAWTKVFVVVSTTM-----
>ERR550519_2895140
---LSKAERKEAENAWRIFevNLVDNGVDAFLNLVRDHPNRKDAFPWVKPELSEEALRNDPEMKKLAKLVFSAVKPAFKSLGDlqsLTNYYLNIGNELS-LMNIPPVMVSYLSDAFKKTCQKLLGSDYTHSLEASIEYVYDFITSRMFE---
>ERR1719402_597456
---------------------------ALIA-------LISS----------------------AAGSGCLCDARARPFSM-------LS--AI-KLIRVVSAFRATAKALLPAFEEELGTKYTDDFRYALTTLINFMADNMEK---
>ERR1719423_342041
----TGRQRVAVQASWRLVapDAKRHGIAIFIRLFKKHPETQLVFKSFKGQQ-PESLADNKRLAAHATTVMASVATLVDNLDDidtLLELLHKVAENHKRR-GLPIQYSTIWWRRWG----QHWTAAASRGGATSSepstrssplstsgskDNSFRNVCKMCEGISR
>tr|Q53I62|Q53I62_9ANNE Intracellular haemoglobin (Fragment) OS=Alvinella pompejana GN=hb-i PE=2 SV=1
------------ADNIAAVrgDVSTHAMNIFVEYFKKFPQHQNAFADYKGKD-PESLKSLPKFKTHTTKVVSKLLDIVEKASDsgaLQSNCTTLAKMPQHK-GLNQQQFADLGAVLVPYLQKALGGACDSA---AWeqayn----------------
>SRR6516164_9760095 
-IVTTPQQVQLVKQSFAKTTpiAEQAAGLFYGRLFETAPQLRPLFK--GDI------------KTQGRKLMSTIALAVGSLQKlpeLVPIVQDLGRRYV-GYGVKDDQLRYRRRRAAVDARQGaRGRLHTRCEGRVDLGLYDPrrYDEERRSAA-
>SRR5690348_1420512 
-----------------------------RHRAESAPAVSGRS------------------HSAKKEADGDDLHDDRRTERfqkAGPGSQEPRRAPC-RLWCDCGGLSIVGEALLWTLEQGLAAEFKPEVRSAWIKLYDMIATTMQAGA-
>SRR5258706_3013648 
-XMLSEKEITLGRNTWDLIapvT-QEMGIQFYEHLFETSPELKPLFKT--NP------------KDQAMKLMFMLSYFVHRLDKendLRAEIKKLAQRQS-GYGAKPEHYKLIRDTLLCSMQNDLRKPWNKETESSCQ---------------
>SRR3712207_8213275 
-RLMREYRLAVIFFFFSSR--RRHTRYWRDWSSDVCSSDLSLFK--GDI------------TEQGRKLMQMIGVAVRSLDRleqVMPAVQALGARHV-GYGRSEERRVGKEGRSRWGPDHX-----------------------------
>SRR4029077_8512364 
--CVTPQQIDLVQASWKQVVpvSETAAQMFYGRLFFLDPSLRRLVL--RGK------------RGGGERGGAVVLG-RQGEEGeegEGSALIHRDRAQA-AGGP-PPRGPAPGAAA----------------------------RHVRRS--
>SRR5437868_6476409 
-----MDEILLLKTSLQKMGpqLEHAAGTFAVRLFQLNPSL-------GEI------------ATRGRELLQMMGAAVQNLGRldqLAPSARQFGRHYA-NCHIREQDYDAVGEAFLWSLGRGLGRDFTEEMEAAWGKVYWLMTEIIRAG--
>SRR5689334_13356078 
------------QVSFTQVApiAETATQLFYARLFELDPDLELLFK--GNL------------SEQGASLCKCSHLRSTVLTGwsnFCQSCNRLAHDTS-AMGFETKTTTQWDRRFCGRYGKGWV------------RPSHLRLSX------
>SRR5437870_6238790 
-FDVTPIQVDLIRASWAKVEpiQELAASLFYDRLDRKSTRLNSSHVA-ISY------------AV---------FCLKKKKKKkek---------------YTHEHINNNKV----------------------------------------
>tr|A0A136P213|A0A136P213_9CHLR Globin OS=Chloroflexi bacterium OLB13 GN=UZ13_01312 PE=3 SV=1
-ESLTEHDKKLVQRSFTHIApqNEDIAAVFYARLFELDPDIEHLFS--TGL------------DVQRAKLMRMMADLVNALDApeaLSQSMRELGKQHV-SYGVHDKHYATVGEALIWALRKVCPAVMTPTVTQAWEKTYALFAELAIS---
>tr|A0A0C3QP41|A0A0C3QP41_9GAMM Uncharacterized protein OS=Shewanella sp. cp20 GN=DB48_17865 PE=3 SV=1
-MPLTDEQKRLIQKSYAEIDrqNSNFAAIFYDCLFAMAPLIRPMFKS--ER------------PVFEYHFNELISTAATKVFEfeeIKPRLVVLGQKHR-GYGVTPAQFDVVRSALMLSIQDCLRDTCNPAIEQAWSCYYDEIAKVMIAA--
>SRR5262245_10239308 
-GPENARPGNL-RHHYadrgrcsGSLLpeAvqaRSVAGRHVSRRHERAAEE--AAAD-ADG------------RRQGARSA----RSGRGGRRgsrPAPRAIRRDRQAL-RHGRHGS---P------LGARGGTRARFTPSVKKAWATVYGLLATTMKNA--
>SRR3981081_1073077 
-VVATPSPSRRRISDFG-------------RLKML-NSGKPEFGAgeGSSC------------CSGRSHLLVAILRHVAGIA-------------------------------------------------------------------
>SaaInlV_135m_DNA_2_1039731.scaffolds.fasta_scaffold157242_1 # 1 # 360 # 1 # ID=157242_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.458
--LLSPATRELVRSSFPMVEriAPRAGTMFYGRLFATAPEVLPQFR--RDLS------------QPNFQPaaehrfMQLVLFVrstaeHAGLPGsagHDETVGKLAQRHV-GYTTRAPHYAPLGRALLWTLDECLGADFTPAMRAAWSDTYDVLVASMVAPL-
>tr|A0A0P1GRZ8|A0A0P1GRZ8_9RHOB Soluble cytochrome O OS=Thalassobius mediterraneus GN=vhb PE=3 SV=1
MNLLSKDEVALIQGAYRALGpsKGFLTNSFYRRLFAIAPQARPLFP--QDM------------DEQLKKLEHMLDLLVDNLHQpmfFMGKLKRLAKRHV-GYGAQPEHYALVGEALIFALNDITPGGLPDKERALWVEIYTAISNTMIET--
>APLak6261659701_1056019.scaffolds.fasta_scaffold514158_1 # 3 # 230 # 1 # ID=514158_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
-IELNAKNKALVKEGWKLLIEtqFPnevggneralarFFDEFYRKFFEVNPSGKRLFEE-GGM------------AVQSKALVKMMSMVVTSLENpsnLDLTIERLGGRHE-LYGVSRSDYLAFTNAMCETLETVLGDKCNQEMKESWSLVLNNLSEKMLT---
>SRR3954466_1768845 
-SCHDSGTGDARS---ADIRpgradRRQGGGDFLRSVVRGRPHGQAVVP--GRH------------SRAAPQTHRHAGGRGPRLSDLpsiLPAASALAKRHV-DYGARPEHYPVVGAALLWTLERGLGPQWTSEAASAWTAAYATLSSFMIA---
>SRR6185295_9741709 
----------------------------LTTWVKHLRRSIMVCG--DDM------------MDRRKRFTQVVSATVRGLARvdmLLPAVREFGMRHP-LPGEIEQHHANVASALLWMLEKALRKDFTPEVKAAWIKAYGMLSQTIRQS--
>tr|D7G782|D7G782_ECTSI Globin OS=Ectocarpus siliculosus OX=2880 GN=Esi_0008_0247 PE=3 SV=1
--VDVEGYKAEIRRTFALVEpiSVQAAGIFYPTLWEVDTSTKPLFKD-TDM------------DKQGEKLMKTLGVAVAMLNKmdtLKPILENLGRKHV-DYGVTPEMYPSVGKALLITFEKGLGEECTPLTTKAWTWVFGIISSICIAAA-
>SRR5215207_7597532 
-QTMTRDQIRLVQASFRNVLpiRELAAALFYDRLFEIDPGTRGLFVD-TDL------------RSQGGKLMAAIGMVVHALDApesMVEKLKELARRHV-NYRQLQESSPPDFHRLhrfgsgrgsqRHVVSKGPGVAPVGQ----HVVPTHFASRvsrRLRAC--
>SRR5262249_41212017 
-NVMTPEQKRLVRDTWKQVApiADAAADMFYRRLFEIDPTTRELFHA-TDM------------VAQRKKLLQMLAFAISGLDNlgaLVSKVEDLGRRTP-AVALPTRTTIPWAPRCCGPWNRVSVTRGHP----RWRRHGPRstnccpascatlprapsscktcgplrrgrplerqgICCVFRKR--
>ERR1700730_6579985 
--RQRLADDGVILRVLQRGLgiELEMEALAREEIGELDPDAarfRPHHA--VGG------------GEVGGRHIELLRRHVDQRPpcHaaaNGSARISLPRGHV-SYGAKPRHYPVVGAALLWTLEKGLGDGWTPEVADAWLTAYSTLSGYMIS---
>tr|A0A0N0UYC0|A0A0N0UYC0_9BACT Uncharacterized protein OS=bacterium 336/3 OX=1664068 GN=AD998_10010 PE=3 SV=1
------EQKEIIKSSFPRVLihTLKNSTIVYEKLFMDIPEAKDLFKN-TS------------IDKQGQMLVAAIGKIVKGLDNpdiFEKDLVELATRHV-GYGLKPEYFTHFGNALINMFEVSLVDSWDKDLHDAWVAVYQEVAEIMKSVI-
>SRR5918994_1539718 
-------QQELIRESWQRFEpkIKRASPQFYERLFALDPAVRRLFSG-VNM------------AEQERKLMAMLKEIVPELDRptdLVAAVGRRSPFTP-HpepSGWLDPRYAWMRSRTPLP---CSGEX-------------------------
>tagenome__1003787_1003787.scaffolds.fasta_scaffold20949172_5 # 2657 # 2851 # 1 # ID=20949172_5;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.626
-------DETALLKGFDLAAdvLDEVIDNFYTELLESYPDLQPLFAH-TNT------------QQQRQKLQDVIYLLIENIHNqdvLESALLSLGERHI-RYGALPEHYPVVAEILESNLKKRLGRSWTKAVSTAWIQLLSAAADVMCRPY-
>ERR1700753_815890 
--XMKSSTMELLSSSFARVcaDKNNAAGIFYARLFTTAPELRAAFQS--DF------------DSVQWKLMSSLVQIVEFYRVgvdPTSYLADLGRSRQ-GYAAQRAQFDAVGDAILFTLAQVLGQGFGADIRAAWVSAYAA----------
>tr|A0A1H2YYM1|A0A1H2YYM1_9RHOB Hemoglobin-like flavoprotein OS=Albimonas donghaensis OX=356660 GN=SAMN05444336_103306 PE=3 SV=1
AMPLDSTNLARMREMLHILRrdAPDASTDFYQALFERAPELRTLFRD-SDL------------AGQGRKFMAMLGLLVDACEDygrLGNEIRELGRGHA-AYGVEARFFPPMEEALIDTMRSNLGERFTPELEADWRKLYAIVANEMMSP--
>tr|A0A1T2B631|A0A1T2B631_9RHOB Uncharacterized protein OS=Thioclava sp. DLFJ4-1 OX=1915313 GN=BMI85_03370 PE=4 SV=1
EPLLPAERAARVKASAARLDfeDPSLFRDAFARLFAVHPELDQVLPN--SE------------GGQQLKYAAMMEVILSTLDPpeeQELELPGLGQMHV-LFGAEPDYYVWLSEAVIAGLAAKLGDHWTSELAADWAELFSKVSAQMIAG--
>tr|A0A2E1AIS1|A0A2E1AIS1_9CHLR Uncharacterized protein OS=Anaerolineaceae bacterium OX=2024896 GN=CL607_22355 PE=3 SV=1
MSPVTSRQKLLL--HYTLLHldADQMGKLFYDHILAAMPEVAPMFTD---L------------ESQRKHFMKMMIRIVHTIDEpdhLNIVLRELGHIHK-RLHLKPRHFSKMGVAFSNSLAEVMGDRYTPEIGEAWRILYNRVAEAMQSP--
>SRR5262245_62462516 
--------IFIFLLFFFFCLcf-CFMFFFFFSSRRRHTRCLSDWSS--DVC------------SSDLQKLLAALALVVRSLHTpekILGPVKKLAVKHV-DYGVRPEHYTYVGNALLRTLKKGFGREFTPELSDAWVEAFRMLAKVMKEA--
>tr|A0A2D6AZC8|A0A2D6AZC8_9BACT Uncharacterized protein OS=Flammeovirgaceae bacterium GN=CMB80_28915 PE=4 SV=1
SNTMTSESINMISKSWDLLSRdPQLVTRFYNRLFDIAPETRRYFK--DDI------------SKQSEKLAHTLNFLVMNLDRldeIKESIEDLGRHHN-KMKIKAEYYVYVKEALLTTIQETLDEQCESGMVEAWDHALSHVASTMINA--
>SRR5262245_55554356 
--CVTPEHRLLAQQAFATIQplADELGLLFYSRLFELDGALRGLFKH--DL------------ANQAHSLMAMLQLTIEGLDApeqFTRARTTWGYATWTmGFSRTSTRLLRRPCSGRSSMRX------------------------------
>SRR6516165_4200192 
------AQ--------------------------------------SDL------------VDRGRA------YRLLGLADLvdrrnQAaagGLSLFHRRAV----------------------SAGGVAWADRVLDALSlylcgyelrwpQLDHALGRgavhpdacaSLLRE--
>ERR1700733_1486793 
--------------SQAHGGdiVDLyRDVRLVYRLFRRLPPAEQDAIP-GDH------------RRGRLSRaAGRVAL---------APVRRAARRQ---------DRRREG-DVLELRRDGRGDDRRHVFHRDQElswlSDDV--PR-VVRD--
>SRR5215831_4136876 
--KHDPPTDLARAEQLQVRCA----DRVKGRRSLLRPSLRDRSRGP-AA--------------LPRKIIRAEGKVdgdANEDRQqssSAQchFASCTPTRRaaQ-GLRCLDGSLWGSGCCLLWTLEQGLGSAFTPEVKAAWSEAYRTLAGAMQEG--
>tr|W5NBV0|W5NBV0_LEPOC Uncharacterized protein OS=Lepisosteus oculatus PE=3 SV=1
-VPLTESQKDLIRESWKVVhqDIARLGIIMFIRLFETHPECKDVFFIFREIDDLQELKMSKELQAHGLRVMSFIEKSVARLAQedkLEQIALELGKCHC-RYNAPPKYYEYVGVQFISAVKPILKDSWSPQVEQAWESLFAYLAAVMKRGYH
>ERR1711911_21978
ATGLTARQKRIIAKNWDLVRpnLKEAGVGLFIAYLTKHPEMQARFKSFATVP-LNELAANRKLQAHAANIMYSMTMLVDSLNDvecLVQHLATIGRNHR-RRHLKRHHFQDLAVVIVDFLEAALAAHWSAEARQSWTLALNVIVDQICNVL-
>SRR5215218_21909 
-CAMNPEQIGLLAESWKGVAgrRDEIARAFYGVLFDRHPELRSMFAH-TDM------------RAQYEKFALMIDEIVQLRTEprqFVRSAVLLGQRHA-AYGVTRDHYGPAGAALIEALAEALGSAFTPAAREAWTEGYLLMSSIMCR---
>SRR5688500_19518083 
-LLITPAP--------------------PSAIHTRYLHDALPIAH-VDM------------GAQYEKFAAMVDEIVGLRTEphrFVRSAVLLGQRHA-RYGVTRDHYAPAGAALIEVLDRKSTRLNSSHLVVSYA----VSCSIQ-----
>SRR5258706_7695680 
--RHDPPPdpadPPVLRPA----RvqGRETRHLDVQAPVPARPRPTPAVQ-------------------------------------------------------------------------------------------------------
>SRR4026207_1847514 
-PLMTSNQRQLVRQSFDAVRdqAGPFSLLFYGKLFELDPSARRMFHV--DL------------ALQGRKIVDTLATVTESLDRfesIRPRLASLGRQHA-GYGVRPEQYDTITAALLWAIGQALGADFDAPTREAWKLALNAVSTATIEGA-
>SRR5260221_10622870 
--IVNAAQQELVMTKAEGVvlMPGVTGVLLCALLISANPSFRPLFKS--DM------------RIQGVKLMTMLAMVVYNLPEpgqVLPAIRDRSEEHT-SELQSHSDFVCR--LLLLHX--------------------------------
>SRR6516225_5669596 
-NVMTPEQKRLAScfrrggppGSWRRPSppLGIETAQVFRIPCVLPN--AAVHTA-GVS------------DHNNSDTYRAALRPAH---R-AASQTASVRNHE-RIQSETAM--REGL--rrvTYARVLRTGS-hRTPYrnVTP------------------
>SRR5215203_7560530 
-RPMTPDQVSLVRDARRAIesRHAEFSAAFHDALHELDVDTCALFRD-TVT------------GGRACNVGAMLDLLQQASDDpraLIEVAAELGRAHA-HAGVRDVHHHVAGVALHRALHRVLGVEFTPAMYEAWAEAFTLLIAVMERAA-
>SRR5215470_20101711 
-KSMTPQQIALVQCSFKSVApiASKAADLFYDPALRDrsrgaaALPH--------RFV------------G----AEGQADGDASNGHQ--------------QSPSARCHFANRAATLRPA-Q-------------------------------
>SRR5919197_1191720 
--VLTRDQADIVQLTWRAVLpvGDTFAELFYGRLFALDPQLRRLFR--ENL------------VEQGRNLTAMLSVAAANLARpekISVALRQLGRRPT-RSSRARCSRSLLRDLLRLPLDARRA--VADGVARVVVafaRAVVAIP-RVIHG--
>SRR5690606_39578087 
--------------------------------------ADHLSP--LPlP------------TRRSSDLLRMLAFIVKSLDWadrqwredvnpdedLMLVVLALGRRHTELYKIPDESYGAVAEALLWTLDYGLGRSEEHTSELQ--S-------REN----
>SRR3954469_10060132 
-QRMTPEHIHTVQSSWNKVLpaGNGKARLLFERLLQTETSLCGLFQ--LDG------------ATWSANLVQMIDVLVTGLSLgdrSAVLTRRVGGRNT-ACPGIEHHYDLIGTALLRTLAKRLRAEFTPRVEAAWAIVYEELVESMRKA--
>SRR6266508_6374850 
NFAMTKEQIALVKNSWKLFrkvDACLIGDVFYSKLFFDNPQLRQLFP--ASM------------EERYRKMIDMLSVIISRLDRlneMTKDIKVMALRHE-SHGVKPRHCKLLGNALRWTMERGLGNDWNDDVKEAGLACYTKLIETMIQ---
>SRR5215475_4417451 
--PMTPLQRRLLHQSFSRIEpfSQRLGDVFYARFFSTSPAMRALFSR--DI------------KVQQSKFMKVISEIIKLPLlsfsvtdsqdSesLVPGAYWSGMLHG-ALSVKQQDFASMKAALLWALSNCP----------------------------
>tr|V4A5G6|V4A5G6_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233247 PE=3 SV=1
-ADLTEKDKELVKSSWAKFNegdVIADGAHIYYKLFEKAPEAKEKFGFAKD---GEVSLENKQFKAHVRKVLDVFESVVREIDQlegLLPVLNDLGARHK-SYGVPLKYYEILGSCIMYAWDRKLKM--DADTKKAWGKLYGVVQTEMKKG--
>SRR5262249_25899110 
--MMNTQHIARIRLSFAWIApsADVFGELFVANLRALDPSLSGLLA--AEA------------GPQGWQLISILRSIIGGRDRpdrLFWRLQSFGRRLA-GDGLCAEDYDTIGDALMLTLEQCLGERLTPDVAAAWDATYAALAEVVQL---
>ERR1719223_727152
---PSSAQVDAVTASWDKVAalgAETVGVLLFKRIFEIAPALESELS-EKPTA---IIIGDLTLAREMT----EEEKETIDLEEkeePeeveekeEPEEVDEQETTE-GRIISTESF-------------------------------------------
>ERR1719336_2939639
--PLDERDIDLVQQTLGRVAilgLDNVGWVLFMNTFKIAPAAQGLFE-AGFLQlkplnkpfnDMPELAKSSNMKETGGRVVETLAAAVGLLRDlgtLVPILQDLGKKGV-SCGVIPAHYDIFGEALITSLQLALGANFTDPVKNAYLKVYTIVKNTMIG---
>tr|A0A1D8RRN7|A0A1D8RRN7_9GAMM Uncharacterized protein OS=Colwellia sp. PAMC 20917 GN=A3Q34_02175 PE=4 SV=1
---MTAKQINLVQQSWQKVLilSPDVGDLFYQQLFVLRPELATLLKN--DK------------QdKirANKDFICLLSQEINLLQPielTEEKV---NTSVT-TNDV-KNYQADVENALLLALTMILDKELKIALKRAWISTIKRLVGSIVIEL-
>ERR1700730_15638689 
--AMTPKQVALVQDSFAKVAltSEAAAVLFYNRLFDIAPQMKAMFP--DDM------------VEQRRKLMSMLAGVVKGLANLeqvFAGRQRTGKAAC-QLRCEGG--ALSGGRRRVAVDAGEGsGGWLDAGSGGcVGHRlWHAVRLHDFPS--
>ERR1712166_353516
-VVAQFAALNAVDDKW-----VTQGVLLFKHMFRINPGMKQMFS-FRDIP-DDELYDSMKLKKHGVSVYTYIEKAVDGWGTpeIADALQKLGARHL-PREVKMEHFDVVGESILTSLSDVFGDQFDDKSREIWTRVYGVIV--------
>tr|A0A1S2XZ06|A0A1S2XZ06_CICAR leghemoglobin-like OS=Cicer arietinum GN=LOC101502441 PE=3 SV=1
MDALTEKQEALVNSSWEAFkkNIPHLSIVFYSSILEKAPESKDMFSFLKNF--DGIPHQNSTLEAHAEKIFDMTRDAAIQLRAkgkIdlaNDvTLEYLASVHV-QKGVTEQHFVVLKEAMLKTIKKAMDDKWSEELSCAWSIPYDQLAATIKKAM-
>OlaalgELextract3_1021956.scaffolds.fasta_scaffold1056695_1 # 380 # 499 # -1 # ID=1056695_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.392
-MALTATDVEVIQTTFKeVAEnvgAEKAGIILFKNVFDAAPGAAKLFS-FGRVEgfdPAADHSTNPAVVKHATGVITTVAKAVASLTDlsaVLPMLTALGKRHS-KYGVKKEHFGIVGAAFLKTLSTALGDKYTKEVEAAYTKLWGVVSKTFREAG-
>SRR5271157_4306781 
-----VSDVEFLKETWGQItDKSSFAERFYSLLLAVFPVAKPLFSK-TDW------------QSQYSLLMASIDYMVMGIKygrNIQPTLHLLGARHD-YYGVAPVFYIPFNACLLITLQK------------------------------
>SRR6266566_5437046 
--DLTPENCDFMTEHHDL--------RILGRLVATE---------------------------------------------------------Q-EQPVKDPDHDQIeeatrhrprscPTLFIWPNRRSQPLhrvlmRYMPvpgpRSPPSWCGPPSRSRSHGPRttT--
>SRR5579859_7196529 
-GARDD--T-----------gsGQaCSAEFLQGR--------------T-HR------------RSGGDpVLRSPVRNCAAGQSDVsrrHDRTAEKADRHA-CGRCeRSgrLALDPAGreracq--TprrLWRQGcalpgrrrrlvvdAGK-GIGRgvdarrrrrmdhrlrhavrfHDFRSLWQCPG------------
>SRR6185312_354929 
---MVR--A-----------rgSAkC--WKCRWR--------------D-RA--------------SVSnSLPAPATSSAGSACSNfs-------MNGTA---SSkQPefDRVPRGGrgrgrrrKMTpeqVSLVQqsfakvapiseqaavlFYD-RL-FevapavkamfpadmteqrkKLM----------GTLAV-V---
>APLak6261666328_1056055.scaffolds.fasta_scaffold241778_1 # 2 # 196 # 1 # ID=241778_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.415
-GAKTAGGL---NLLFL--AivSS----EPENGFVTISPAAKDLFP-A-DL------------TEQRKKLIATLAIVVNRLSNLqsiLPAARTLTKRHV-NYGAKPEHYPVVGSAVLH-AGgrPRLGLDARSRLrsdGCVWHAVRLDDgrnleHEFANL---
>SRR3954463_16408791 
------QQITLVQESFARLAhdKARFGASFFKRLFKVDPTLEQSFAG-VD------------MQAHALKLVDAISFVVGGLRQpetLVGPVQKLGAARC-CRRCPTSSRTSGPRSSVPPGT-------------------------------
>SRR3569832_1984102 
----------------------------------LEPKARSMFNF--RAD------------EDleaNPQFMVHARAMVDMIdmavgflgPDldpLIEDLSHLGKRHI-SYGVKPEYFSIMERAVMFAMEELLDDKLTKEDRTSWQLVFHFMITH------
>tr|B3SDK5|B3SDK5_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_62364 PE=3 SV=1
-SYLNYQERQAIIDSWNAIstEKQKYGTILFLKLFELEPRVKSLFTIF-DFN--EpleDIIQSPHFRSHAMRFMQSLETGVLMGFDkesCDFLFKSLGSRHH-FYDLKSEFLDVIPECILHTIKKGCGNNWSNETADAWKIATKVLCELFREG--
>tr|C1C1M6|C1C1M6_CALCM Non-symbiotic hemoglobin 1 OS=Caligus clemensi OX=344056 GN=HBL1 PE=2 SV=1
MSILTSNELSLISESWKLVvpDLEHHGLSFFLKLFEEYPTYQEKFFPELH-------QDERKIQRHGAIVLKSVGK-LVAFLEankviaLVDAIKRLATNHS-RRGVLREQFYPACRILLEYLAQALGTHLSTEGALAWKRFLGTFVELMQ----
>SRR5450759_1049036 
--ALTAEaPYSELKnlCVWSKT------NAGMGSLYRSQHELVFVF-K-NGMRPHINNvelgrfgrnrtniwnyAGASSFGstrdselamHPTVKPLSLVADAIlDCSKRggivldafagsgtTLIAAEKTGRR---GYGTELDPFYADT----------------------ivrrFEDAYGL-KAVHVE---
>DeetaT_11_FD_k123_441726_1 # 2 # 373 # 1 # ID=403715_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.481
--GLTDLQIEMIRSSWEKVTpnKKHHGQLLFHKLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
>AP82_1055514.scaffolds.fasta_scaffold664619_1 # 53 # 358 # 1 # ID=664619_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.458
---MSGFALRLVLTQRQKATrkrpiaqyvienhSINFAFHYIDRLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
>SRR5210317_1560035 
------------------XmtSL----KSSMIGFFRNHQNCAKMFGE--DMR------------DQAQKLAAILQVAFDNLDHvdsLVPILEDVGAKHA-TYAVTPEHYGLVAAALIGTISTELGDAFDERAAESFEAVLGTVANVMISG--
>tr|A0A037ZKD6|A0A037ZKD6_9RHOB Uncharacterized protein OS=Actibacterium mucosum KCTC 23349 GN=ACMU_09600 PE=3 SV=1
--MAHKGRVQTVRDSFQVVrtDADAFARGFYDRLFAKRPEMRGLFAD--DMS------------AQQAKLVTTLVTAVNMFDTpsqLIKPLKQLGASHA-QMGLSQADYQLVVDTIIETLETTLGSAWDVAHDRAWRGLLDFVSNVMQEG--
>SRR5688500_932283 
--MLSDAEKQAIRESWQLVLpvVETAADLFYRRLAEQNPALRARGQ--DQL------------VAQRKEFVTTFSFVVRGLAWeasewrsdapdeddLFLGMLALGQRGSRLARLIEQHYSATGDTLLWTLTYALGKRFDAKARAAWMRLYTLLAIALR----
>SRR5688572_29427622 
---------------WALCAprADLLAAAYYQRLFERLPALRIRFP--ADL------------APARQRLVGLLRFVARALYWpaddwrrplpieedLLAILLALSRRHRGLGEVDDAVRAVSREALVAAIGEILAGEANPSIIDTWGKLHDLAADAFVL---
>APIni6443716594_1056825.scaffolds.fasta_scaffold11231735_1 # 3 # 137 # 1 # ID=11231735_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.400
--LLTADERAVLKLDWSRLTrvdQQDMGMRIFLRIFELEPSTKLSFPELYHL-TGDQLISNTLFRCHGARFMRAVAAAVDNVDALdlvvIPNLIQLGRLHQSVDGLRWRHLEVFEQAMTEVWAVELNLSgswSGSTSAVVWSKVFRLITSKVYEGFQ
>tr|A7RWR6|A7RWR6_NEMVE Predicted protein OS=Nematostella vectensis OX=45351 GN=v1g203304 PE=3 SV=1
-CDMTYEQKYLIRETWKFLEvsKKEIGVSVYKRFLNMHPGLQTYFSEFKHIKID-NI---NGSHGHPRRLLMAIDNAVTALGDsdsFSAYLVELGRRHH-GMnfRPGPTHFNDLRKCFLSVIEEILATAslWDFQVEEAWNRLFDSITAMILRG--
>SRR6516164_7981020 
-SPLTEAQKRLVRESFESMQeyETSVVVLFYGRLFEIAPETRTLFKI--DI------------REQSRSSWIPSGL------------------------------LSIRLTISWNCRQLLR---------NWDESTSltAFSPITMGN--
>SRR6185503_3589201 
---MKAEQLELVIDSLTVIQpiADQIAKSFYKHLFEIAPQTKKLFT--GDM------------DRQGIMLITSLSLAVNGLSDmenTLPSVQALGERHY-SYGVKPEYYQPAVESFLWSLEYHLGDQFTPELKESWRTAFQALADTMLSVY-
>tr|A0A0P6AJ75|A0A0P6AJ75_9CRUS Globin OS=Daphnia magna PE=3 SV=1
MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGALLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDNLSKVLIDFLAAKLGGEFTPEARQAWTATMQGINTVVEA---
>tr|A0A0P5NXY2|A0A0P5NXY2_9CRUS Globin (Fragment) OS=Daphnia magna PE=3 SV=1
MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGELLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDVSKS-FSNFEC-----PENEVSRKDWTKNLSILQ--------
>tr|Q93101|Q93101_9ANNE Nerve myoglobin OS=Aphrodita aculeata PE=2 SV=1
MAGLSGADIAVIRSTWAKVQgsgSAtDIGRSIFIKFFELDPAAQNEFPCKGESL-AA-LKTNVLLGQHGAKFMEYITTAvNGLDDYagkAHGPLTELGSRHK-TRGTTPANFGKAGEALLAILASVVGGDFTPAAKDAWTKVYNTISSTMQA---
>tr|A0A210Q3Q0|A0A210Q3Q0_MIZYE Neuroglobin OS=Mizuhopecten yessoensis GN=KP79_PYT10061 PE=3 SV=1
-TYLTPRQIHLVQDTWDIIkdDLSKLGVIVFLRLFETEPDLKHLFPKIVQMNEQNKLeWDIDrdMLTKHAVSVMEGLGAAVESLDEsefLNSVLISIGQTHV-KRHVKPQMLKRLWPSLNYGLKQVLQSKYNKEVNEAWKKVYFYIVAHMKRG--
>ERR1719460_671936
--MVDAVVKGDVQRTWELVIPpdsgddhvFAIGKLFFDRIFEVTPGAEALFS-FKGE----DRAESAKFRAHAIKVIKTVGVAVAKLDDletLVPILEDLGKKHV-AYGVVASTTT----SSVWRCCGRSRRGWATNSRPTW----------------
>ERR1712223_635401
IPKLTAEEKSVLQASWANVNkkIEIAGAQTFIRMFESNPETQNQFRKFQGMDL-VQLEQSAEMAQHGKRVLSIVGMTVDNLDNyqiVWDNLIKVGREHF-TFGALPMYFDLMGPHFVIAVRSCLGNDWYEALEYHWLALFNMIVYAMKFGWN
>ERR1712062_404977
--ILTNQEISVLKSSWELIAkkIEIAGAHTFLPTFDRDPKCPDN------------------IERHCQRVMSVVGGSIELINDyksLWKHLISLGREHF-GKIREWIFASIAGGSTersgcspssINFLSSKINGNITSKK--CFLQ-YKIVIITQX----
>SRR6266567_6698575 
--------------------LIVFTSTCLWSI----RKPNHSLPKR-IC------------VVKLAHCWLHLTTVVAGVlreDNLVPVLQQLGQRHK-SYGVKAEYYPFFRAVLLETFQHYLGPRFTPKMQQAWEEAFEMISTQMLKGA-
>SRR5215217_5048650 
--RVTARGRAR---HVLLRApvRDRRGRGTTVRRHRHGSAA-----------------------PQ---VRRDARQDRARSGRaatLVPDVAALARRHV-GYGVEDRHYTSVGEALLFALGDTLGDRFTSDVHAAWVEAYALLAALMQR---
>APDOM4702015191_1054821.scaffolds.fasta_scaffold152199_1 # 3 # 686 # -1 # ID=152199_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.531
------------------------------------------MS--GDF------------SPEQKRYLEGFTS------GLq------IARTGR-GLG-KPAASVPSGPD-----AEHLIAQDQ-----------------------
>SRR5262249_5171126 
----EPDSALLVQSTIG-VLvqhQRRFTSELYRRLFGLAPGAQALFRS--DM------------ESQGKMLAHMLEFLVYATSRpetMTLGWRELGRGHD-GCGVGAEYYPAFRQAFLESARVVLDEKHTPQVEKAWADTLDMMIVSMLGP--
>APCry1669189000_1035189.scaffolds.fasta_scaffold267513_1 # 3 # 467 # -1 # ID=267513_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658
-VVLSDQHKKVIVRNWTILStdLSGRGTRIFLLIFGRNPLIKSIFS-FGHLE-GDELVCDPRFKGHALRFMQAVGAVVDNIDDynnaVKPILNDLGRRHTQFKGFKPIYFNEFQDSILQVSENGTCKQngeiriLNPSaagvnfCTPPLGKFSASEMTCIVSsGA-
>tr|W6FSH9|W6FSH9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_a PE=2 SV=1
-LDFSDDQKADIKSTWETLYsgnKFQLGVELMANLFKAHPDYQDLFPSLKGIPD---VAGSNELRGHAIRVITGINNFVDALDEeeevMREMLHNMARSHK-PRKLTKTHFNEFAPILLETFEKKVD--MSSKARDAWIALYYSIVDNLFAE--
>tr|W6FIG9|W6FIG9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_b PE=2 SV=1
-MVVSAEQKALIQGAWTPIYagnRFQLGVDIFAHFFKAHPNYANLFPSLVGVPN---PSTSVELRGHAIRVLTGINYFVAALDEkkpvIMEMIHNMARSHK-PRKLTREHFAQFAPVLFDT----IG--VSGPARDAFLPYYNFIADNLFAE--
>tr|A0A023RLQ7|A0A023RLQ7_AERME Globin OS=Aeromonas media WS OX=1208104 GN=B224_3582 PE=3 SV=1
---MTPEQIELVQRAWGRVTalNNTYVQEVYAELFRLSPDLINLFPDPAG--------------MPVTKVSETLNTVITSLEQLdalGFIIRDLGRRHR-QFNVQSHQFGLLKQALTLVLARRLGEHFTPALSEAWSQMYDEIAALMLEGL-
>SRR5437899_2276119 
-------------------YpaVQKSGAAVYRPALVAELRDRPY-E--FDI------------QVQLCVYLARMA--------leIVAALN-----AA-GWICVPKDPSPEM------LKAAWAYALDEDAAGVWKSMIAA----------
>ERR1700757_2961956 
------------------------------------------------------------------RFNRLAGRERRAPARtr----ARQSR-------QRPGPSRHDPTrLALSD----------VSEAERTDIVVS------------
>SRR5215213_1430710 
---------------------------------YLYPFLRPMFK--ENI------------QLQARKFSAHVSLVIGNIKDrntLQPMFEEMRNLHL-NHNVKTHHYNYVQEALFYALKNHLVKEWDEHTESAWIKFYNIMASQMAA---
>SRR4051794_22176940 
-NRMTEASLQRIASNYELLAgqMQVLTGAFYKRLFAAMPEAQPLFR--IDI------------DLQSQHLAAALALIVRNIRFfdaLEQPLKELGVHHA-HVGVRPEQYPVVCRTMLETFREGSGQSWSPELEADWKAVLELVSRIMMDG--
>SRR5262245_41201456 
--XMTPHQILLVKTSFQAALtqRERIAGFFFAELFAREPAMWQLLR--GKT------------GMRWPALVDGLAAIVGSIHRihsIEPVLQWLSWQGA-VRGVGEGQYEAVGQALVAALEAGLGEAFGSEHRRAWMVAVGKVADIMARA--
>tr|A0A0N9QWL5|A0A0N9QWL5_9ANNE Intracellular single-domain globin (Fragment) OS=Eulagiscinae sp. JPG-2015 OX=1732542 PE=2 SV=1
---VSDAQKALIKSSWAGVDLNAAGVAFLNQMEQKAHDVYAVFKV-G-----GGATSNPKAAALGLKVMTFVDEAVKGIDDMgavGGKLDELAQRHT-KYGAKKAHFPVAGPCFLDALAEVCGGRFSADARAAWSDFYDVIAQHLSA---
>tr|C7FFW0|C7FFW0_BRASE Extracellular tetra-domain globin (Fragment) OS=Branchipolynoe seepensis OX=326992 PE=3 SV=1
---VSDAQKAAIKASWAGADLQAAGTGFYVHLAAEAPAVYANFNL-G-----ADPH-GAKSQEQGLRVMKFVNQCVNSIDNMaivQAKIDALAHRHM-SYNVKKSDFVPAKPCFLGALADALGGKFNADARAAWAGFYDIIAAGLST---
>ERR1719261_40108
-------TIAVVQGTWQEIKdalgdgvAETAGVILFKHIFRIAPQALALFS-FKDCAGgnvCDELFENKTLRKHAAKVVGTVDTAVGMLKktrQADSRPGQSGQEAR-GLwggagalrcgrgGVVGDAVGRVGRRVYDRGPRGLGGGLRHHQNHN-----DRQELRLHGR--
>ERR1719238_2294225
-----------------------------LKVA----SALREFN-TLRAEGivsEQEFLEM------KAKLLAVGKDELG-RSpsgDTLETLVEAThemdssRRRT-RWtrrarraSRSPTTVGVISCQIK--------KSSTRRTTRRW----------------
>ERR550532_3331206
------------------------------PLF----PAAH--R-LCRPDGhdgCS---------------------VFGPDRppgE------------------APSTKDIVVTVIL--------X--------------------------
>SRR2546430_16462751 
---------------------------------------------------------------------------------flLSVVIA-----CS-CWCRHVSSlqhdrad-------HPVGLCPGIVADWSPALSQNVGEGFQQDCSD-dG----
>tr|A0A0P6RCU1|A0A0P6RCU1_9RHOB Flavohemoprotein OS=Phaeobacter sp. 11ANDIMAR09 OX=1225647 GN=AN476_12305 PE=3 SV=1
----ASTCKALVLRSFESErmDLEAFIPLFYSNFFEAYPEARAIFPT--DT------------ERLEAKLLASLTHIAEALESserLDGILSELGQKHR-RMQISDSHFDGFIQSFIRSLATTLGPEWSDQSDEAWSQFLRYVAKRMSFLE-
>tr|B7QTL6|B7QTL6_9RHOB Globin, putative OS=Ruegeria sp. R11 OX=439497 GN=RR11_330 PE=3 SV=1
----APADRDLILASVESQkmELDQFVSLFYAKFFERCPDTRPMFPH--DM------------SLQEEKLLMSLTHIIEALEHpakLRLILLDQGERHK-ALQINDDHFAGFIDSFTGALKDTLQEDWSEETRQAWLRFLQYVAYQMGFLK-
>SRR6218665_311178 
-TPIYAGHRDVIRRTWPIIAdqMNANGCQIFLCIFELSPGIKRVFA-FGPAMSGAQIVNHPRLVQHASRFMEAMQVAVQHLDELdtvvSPIFINLGKRHIYFEGINADYFNVFSGAILYTWRQVLGERFSAEVRSAWSRLFDFVIQHLRFGY-
>GraSoiStandDraft_9_1057307.scaffolds.fasta_scaffold3427870_1 # 1 # 249 # 1 # ID=3427870_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.747
--------ADVIFDSWDAVKripdyDVVVGEMMFRKLFENSPSTLKNFS-FGPRFagKEESLYKSRTFEIHTKAMIKMLEDVLSMIMpDlvpMKKTLKALGARHV-TYGVRPNHYELATEALLSTLESLLGYRWTPQVEEGWKTAIGFITNTMVAG--
>tr|A0A2C9KJS1|A0A2C9KJS1_BIOGL Uncharacterized protein OS=Biomphalaria glabrata PE=3 SV=1
--YVTPKEKELLRSSWNIVsqDISGVGMNIFKKLFDIETDLMKLFKRMLTKGeTGQVVVDSIRLEGHATGVLRQIGLVVENMDNnsaLTTTLIALGEVHA-NYRVRPEMLPLLWPAIRDALKIACEDEFTHQMELAWKHLYDFVTCHLSEG--
>tr|A0A1Y5RHX9|A0A1Y5RHX9_9RHOB Flavohemoprotein OS=Palleronia marisminoris GN=hmp PE=3 SV=1
---MPNDDMRLIQPSIARIFvvRRSIGQAFYERLFERQPTFRTMFPT--DL------------RTQARTFDDMIALIVKKTGDpeaVTPVLLAIGRRYL-TYGLRPQDLRVIGEVLMEVLCAQTPGGLSPDEAAAWERSFSRAAEVVKL---
>ERR1719321_586101
--ELSYSTVSTVIDSWESVKrqenyAENLGRMIFIKFFDREPEAKTIFGFDGKKMKTdDEFYESRAFLAHGKHFVLILNKAFDMLGPdlemLTDILLDLGGTHRTKYGVKPEYFPVLGDALLECIEEMSDPeRFNDETKACWLEAYNALTEIMTT---
>tr|A0A2D6RHV2|A0A2D6RHV2_9GAMM Methyl-accepting chemotaxis protein (Fragment) OS=Colwelliaceae bacterium OX=2026726 GN=CL811_09640 PE=4 SV=1
---MTPKQNIAVIESWKKVQpiASQVSQVFYDDLCEKHPSLKALLG--EELS------------SARDQLVAYLNSLVETLVATdevv-I--EDL-AKH-LRIGLAPEQFSDVGPALLTSLEIGLEKDFTATVKRAWTALNKLIVAAMAQ---
>tr|B7J6S4|B7J6S4_ACIF2 Globin domain protein OS=Acidithiobacillus ferrooxidans (strain ATCC 23270 / DSM 14882 / CIP 104768 / NCIMB 8455) OX=243159 GN=
----MAINIQLIQSSGAAVkdLGVQVAEHFYNYMFTHFPEVRKMFPG--------------DMSEQRVRLFNSVILIATNIDTmevLVPYLKELGIGHI-KYDTRPEHYPIVGKSLLNTLKHFLGAAWTQEMAESWIEAYNLASTVCIEA--
>tr|A0A1Q9NIM3|A0A1Q9NIM3_9ARCH Bacterial hemoglobin OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=vhb_2 PE=4 SV=1
--SLNTKDIQLIKNSWEKLteNKKEVRNTFYTGMFEDDPKLKSLFRE--------------SFLSWD-NLPDSFEFMFKHLENlegEILEMKRLGLKHK-TFSVKPKHFPIGRKSLVKTIKQYMGDKYTEELGAAWTKLFDYMSHYMILG--
>ERR1719419_74415
--PFTPEQRTLINETWGNISTKEtgsmgmLAKQVYERLFRSAPGIKRLFKD-SDM------------LAISRAFGGMLGVLVSAVNQplqFQHIVKGLGVRHQ-VYGVKPDHFRIMYTSLVRTFAQILGDKFTSEHKKAWSCLYNWVIDAMQRSMR
>ERR1740128_1504408
---------------LGVSYlarhIVPVDVRFLKEHVKTLFVLSqR---MPGNFV-NETLETRATLLYETLLVMSNLNYWVENLDELdlvVASIQKMATNHA-GRGIMAAQFETIGAVVVEYLKAGLKEALTEEMAGSREKLISTMVSIIKETN-
>ERR1719354_333269
-MGLEQSDVEAIQRSWEIVKetakLRVHGVNFFEMRFEMIPDWReKYFSHMGP-------KTSAKFRSHATMIMMTLDSWIENLDDLdlvVDAVLRVGQTHA-DRDILSPQFVEINKVIIVYLETGLGDKFTEEMKESWIKLLDTVVTIIKDGN-
>SRR5215207_9441599 
-----PEQLALVRGTASIIDavGDSFAERFDDHLFARYPAARRLFP--DDT------------TTHRGQLTDEIVFLVAAAADlhaLLERARALGAPPP-LRRtrrrlparrrgTRRRGRGRRGRSVVGRNG---G-SLA-----------------------
>SRR5690349_3556304 
-TYLTGQQVLLLKKSFRQMNPAQIAAQFYGTLFQQHPEVKSMFPA--DTV------------ELGSKLMSVFELVVFSFDEKehgrfglqdvlIKPLRALGRKHD-DKGVKPEYYEIANSLLLKIMKE--SEYFTTEMYQSWQLALEHLTYAMQDK--
>tr|A0A2A4JK54|A0A2A4JK54_HELVI Uncharacterized protein OS=Heliothis virescens OX=7102 GN=B5V51_782 PE=3 SV=1
-SGMTLKDVYNVQHSWKTINanPLDNGYLMFFRLFEVNPESKTFFKILDNARTETEMRDNVRFRAHVLNIMAALNNSIENLNKpeiVVVWMEKLGTAHR-RSHVQERHFLIFKDVLVNILKNDLK--LSEAVVKSWGRYVTFIYSYILP---
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9902871_2 # 1417 # 1767 # -1 # ID=9902871_2;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.538
-----ALDTKLIKDSFELAKpiSDKLVKRFYENLYSDYPQSKSLYLD--G-----------QLPESQLAILKAINFIVDNLHNkekLGTFLKTLNERYE-LRLNDSVINQSVCSSFLKTLSEAFGSDWTSELAEQWELTYQMVTSFFQDSK-
>OM-RGC.v1.013389558 TARA_082_DCM_0.22-3_C19717715_1_gene515718	COG0552	K03110
---WHGESVTTVQRSWARIQqlgLENCGTLFYNTLFERWPEAKQLFSLSvrlkhrapgESEREGPDPTNSPALRKLWGKLLSVVGSLVSGACNpaeVVPTFHAVGVRHA-GYKLKVAHFDAFGGVMASVLKHLLGEEFTTEVQHAWTLAINFLTANIRAGFV
>tr|A7C4X7|A7C4X7_9GAMM Bacterial hemoglobin OS=Beggiatoa sp. PS GN=BGP_4395 PE=3 SV=1
---KQHDTIFEIQSTYEKILphLDEFSRLFYQQLFEIKPAFKILFRQT-DL------------RIQKQMVIRMIEVVVQGINNlenFMSIIQRIHQRHY-ELHLKPEDYRLAGQALVLSLEKYFGDEFTPTLKKIWLDFYESIVATMMN---
>UPI0004291969 status=active
---KQSDTVFLVQSTLEKVFpqLDEFTNQFFKKFYELDPSVKEIFYEI-DA------------KNKKQMVVNMIGFLTQGINRfdvIIPSIKEINERHF-GREVKPKYYLIASKALVNVLEDYLGEDFTPEVKQTWIEFYEQIVNFMEA---
>ETNmetMinimDraft_35_1059890.scaffolds.fasta_scaffold55614_2 # 1284 # 1421 # 1 # ID=55614_2;partial=01;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.623
---KQSDTIFLVQSTLEKVFpqLDKFTDQFFEKFYQLDPSVKKLFNGV-DS------------KNKRQMVVNMIGFLTQGINRfdvIMPSIKEMNERHF-GRDVKPDHYLVAGKTLVNVLEDYLGKDFTPDVKQTWIEFYEQIVHFVED---
>ERR1719506_1011120
-GPITAREGQIVQDSWKAVKkvGGESGHAvikdIFYQHLLKDPNVKQLFRN-------------SDMKLQATKLWQTLHVAVDGLSTsgpWFLCCRIWARLTS-STGSKRS------TSMPWVRRsSTrspraWGPRsrrssrWRGRKCTAWLLRRX-----------
>Cyp1metagenome_2_1107374.scaffolds.fasta_scaffold42158_11 # 5761 # 5952 # -1 # ID=42158_11;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.578
-RFLTVAQQNEIIATWAIIKeshaSEAIGMDVFKGLFISAPETFDMFDSFKKDP---DWQNNVHFKHHCKVVINVIGSFVLLLNQpekLISHLEFLGVKHN-FMTITPLQFELLGAELLKAFNKALGARYNSLTKKSWTIFYNKIAEVMQTN--
>SRR5688572_5289639 
--TVTPDRQQLIRDSWRALEpnGPRLVELAFLHLLQIAPAARPLMTG-HSL------------PCVCRNVASILDQLIAALDEpkqFVPLAIGLGRSNP-GHGINAALYPAMGEALLWALHLQLGEGLTPELQTAWLEYHHLVSAIMRRA--
>SRR5690349_12423264 
--XMTPERQQLVQSSWRKVEpnAARLVELAVLHLVSIAPSVRSHLDG-ATL------------PLLCQRIAAILGRLVETLDEpkqFVPLAISLGRENP-DRGLTAKLYPAMGEALIFALHLQLGDAFTLELQAAWLEFERLATAIMQ----
>SRR5215467_4845699 
--------------------------------ALTWPLRR-------------------------RCWGKLLWpswiiwkmCPGCSRPSrswAPSTLGM---------VLLPRCTTGSADALVATLAKPNGEQWTPAHTDAWGEAYRAIVAMMLAGYP
>SRR5262245_32871681 
-------DPQILRETLELTLaaDDSFPKRFYDRLFTRHPEVIPMFHR--NSP-----------GAQRKMFAQKLIMIVDHVEDpawLARELRTVAQSHV-RYGVRPEMYAWIGEALIETLRDACDSDWSESAERAWRNAYTKIVESIFEV--
>tr|A0A1C4TW82|A0A1C4TW82_9ACTN NAD(P)H-flavin reductase OS=Micromonospora haikouensis OX=686309 GN=GA0070558_10167 PE=4 SV=1
-----RAVSADLGPSWAATAaaVDRAAANFLDTVSDRLPGLLP--------------------ERDHTVVFAALGRLAGGVDDtagRAAALAVLARAHR-GVGLLPQHADLLGDALLAAVARENRAHWTAALATGWERGLRRAVTAVRRA--
>tr|R4LFD5|R4LFD5_9ACTN Globin OS=Actinoplanes sp. N902-109 OX=649831 GN=fhbA PE=4 SV=1
-----GMDPaddaalnEvrrLLGNSLSMAGgpME-VAGRLRAALAQAQPTLFATLPG--GP------------VAQVEQLAEGLTWLIHHVDQppaLVAGFGRLGMALA-ECGVAPQQLQLAGAALAEAMRAGmAAHGWRQDFDQAWRSTWQHAYEWIAHG--
>tr|A0A1H7FRI4|A0A1H7FRI4_9ACTN NAD(P)H-flavin reductase OS=Nonomuraea pusilla OX=46177 GN=SAMN05660976_00171 PE=3 SV=1
-----MLGFQRVRDNFELVAkyGDGVPLYLFSDLFLRVPQLREMFPV--NM------------RSQRERLMGALAFAVEHAGDlaaITPYLHHLARSHR-KFGARPEHYAQWSVSVVNAMRRFSGSAWDDELEREWRDFLTAVSQVMIDA--
>tr|A0A210PV81|A0A210PV81_MIZYE Globin OS=Mizuhopecten yessoensis GN=KP79_PYT16126 PE=3 SV=1
PLGLTERELKMIKVSWDVLAedKKSNGVKFFMTLFTIFPTSKDLFKHFKDVPLDQLKydgettKSNKKMVAHAMSVMYALESYVDSLDDaycLEELVKKVAISHK-PRGIGPDKFKLLTPVLHAVIEDLVKDDDSvdlETIKSGWTKLIDTVCDIVEK---
>tr|A0A1L4CYV2|A0A1L4CYV2_9PROT Uncharacterized protein OS=Silvanigrella aquatica GN=AXG55_04100 PE=3 SV=1
-----NIDIQIIRDSFELTKpiGDQIINRFYENLFLEHPELKEFLSR-GDI------------QKQKEILLNTLVTTIDNLDKpesLSSFLIHLGEKHL-NYNMIEMYNDFIGRNFIKTLSQFLGRYWSDELNRQWNEVYKFISLNLKKG--
>SRR3954469_16801024 
-------NYALLRNSFEKLKpvAGKVAERFFDILWNDYPETRDFFKN-TQM------------GPQKFAFFQALVFIVENLDQpesLESYLRGLGASHS-AHGVKKEYYGWGCAALHKTFAQTFADEWNDTLSFEWTKVFAMITSLML----
>SRR6266851_5623532 
------------ACTSPSVRstT-------------------TCAG-----S------------TRNSGYPAGPnSPTHStriSHDTRTDrigpkLIRVHRRRRA-RDGVRPRHYRSAGDALLGALAAHLGSDWTPAAESAWRRAYNLVAEIMIA---
>tr|C3Y526|C3Y526_BRAFL Uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_98913 PE=3 SV=1
-TGLTPTQSRLVKESWKMFlsKKRENGFVIFRVLFTDYPVTRKLFKGVEQldLDAPGQLESSITLRAHVTRFMHSFDTYMESLDDpedLKQLLYDTGKSHL-IHDIKPEYFDVLETVLMKSLRIVFGSKLTPQLEEAWQTAYSHLKVTIKQG--
>SRR5271166_2850757 
--RWMRPKRNSCARPSPKSRrsPIKAGAMLYEKMFALDPDLRRLFA--IDI------------ETQGAKLMAVFATAIANLHRldeILPTVRELGRRHV-AFGVKDRDYDTGGVALVQTLEAGLGDAFTPAVRDAWMACYEAITGEMKA---
>SRR6478735_6705068 
SPSLTREQKRHIRETFAIIEpaSDLVARLFYMKSVDLDPSLGVLFKS--PN------------RVQRRKFMAAMKVTVLSLDRlqsLQPILKLLGARQR-EEGVTPGHYETFQDAWVWTLEQALQARFPREAKDAWSSLLGEMTAPQRPR--
>tr|F2Q9X8|F2Q9X8_BRAFL Globin OS=Branchiostoma floridae OX=7739 GN=lGb13 PE=2 SV=1
--PLDAWQRFYLQKSWKTVArkSDQAARTVFLRMLQDNPGLRQKWPRISLL-TEEEIPTSPYIKFLGERIFDCLDYIIDNLGDLDhviSELTKLGRQHSDMNVMTPEDVWAIEAAFLAGVQECLEDRFTIKYEEIYSRFIVFVIETMVIGFD
>tr|A0A226E0J1|A0A226E0J1_FOLCA Hemocyanin OS=Folsomia candida GN=Fcan01_14017 PE=3 SV=1
KVQLTPDEMIAIKRNWEVIHqdLTGNGMDMYLHWFAAFPHMQKVFKKFAQVP-RDQLKTNDAFKAQATVTLHWIDDMIEAIDSpsdMAAVMKRLGRMHQ-TRHTNIYDFREMVKRIQEVIGTKVGEGYTPAAESGWTKLFAKLVENIGD---
>ERR1700732_4531564 
-----ASPNGRRNSARASmlISsqPIRRSPRFSATTW-----------------------------WHRPRC-SCSLWVRSEVNRmeeLGGGLCALGERHV-DYGVKRADYNKLASVLIQTLKEFLVDEFTVELQHAWGTVD------------
>SRR5258708_12476517 
---------VLWEWLVDVGGarWRWFGGRLLEIFLETSPELRSLFHK--DI------------AQETGMLEWMLGSLVKGLNRlleIEGGLRALGRRHR-DYKIDQADHEKVLRALLLTLAEFVGDDFTPQVSRAWKTVYGKIPDTMTDR--
>SRR5882672_7954690 
-----------------------------------------------------------------------------------------------HYGNANRYQGVRPSRCIpGESSR-----HRPHGASQPSVG-Q-----------
>SRR5215469_12962076 
-------------------------------------------------------------------SLSARAGRQAGFGl---SG-----------LGSAAT--taiPTPSTSLTGSTARTTG--cSAPYSR-----TGT-----------
>SRR6266704_5570200 
--GIN-----KTPGMFEKISssMPLGRVA---TVDDIIPFISFLAS--DD-----------------S---KMITGAEAGGNs--fVLVLTNLRNIH------------------------------------------------------
>SRR5205807_5077868 
---------------------RVGHGRVYPRLYIIARHAAGIYAL-TRP------------VAKPgRPRPVCLVPIHKDIA--vmrVTTDQLLARTPL-GrFGEAAevgqlVHYLVSDAA------RFVS-GATVTIDGAWTAYGGWALR-------
>ERR1712137_931585
-------------------------------------MGTSLLG-VDCE-GEEFVKT-DSFVPQAKKFIGLCDSFIDMLGPdaelMAKILEAEGRKH-EKLGIKLEHYSTMGEALISGVKTL--DeKFNDETELCWKLVYCGVTNNLGKAN-
>SRR5437868_6667390 
--------------------------------------------------------------------------------------REIAASD---------ESEGVGDAEI-------DERRSNRLGDVHRSALGprpvtvrdnhgtrtaVKEGSIRRGV-
>ERR1740124_2148144
----------RTRGAAALLLqgrAQPCGVAQAQEACYVCDEHCRCCSQ-GSgGP---QQacarATGPPAHMPYA----THRCRVCCRIGiraRAPPTQALGKRHV-PYGVLPAHYDVVGQALLATLEGGLGAEWNDQVKASWTAVYGIIAKTMIG---
>SRR3954451_929548 
--SMTPEQMQLVRLTLAQAtaDPLALGRDFYRRLFVLAPDLRARFH--GDID------------AESLKLKETLTLAFGALTDmrlLVATLDGLAKRDV-ARGLSEQHCRAIAQSLIWAIERRVGSDFTHQVCNAWIAFMAVAMTCLHG---
>SRR4051794_5741567 
--SMRPEQMQLDGLTLADAttDRLARGRDFYRRLSVPAPYLRGRCD--GDVD------------AESAKLKETRTLALRMLGNmrfMVATLDAMAKRDV-ARGLSEQHCRAIAQSLIWALERRLGAGFSRQVCTAWTEFLAVVMTCLHG---
>SRR6516165_10653891 
--EPSPNQLHQNRPD---R-RPGGGTLLWPPLRDGSR-NPGAVL--QRR------------GRTGSEANGRSCNRCEQSRRFrgdRPHRTRS----C-KAPRRPEHYALVGSALLWTLEQGLGDEFTPALRAAWAAAYCALSEVMIA---
>tr|A0A1X7UGV4|A0A1X7UGV4_AMPQE Uncharacterized protein OS=Amphimedon queenslandica PE=3 SV=1
-MSLTSAQVALIESTWKVVKkdLQGAGNIMFLKLFQIDVSVRDKFP-FRDVP-YEELEDSESFLKHSLQVMETIDLAITLLlGGemekLVEALVDLGMAHA-MQGLKPEDFDHVGEALVHALGVALGKEFNDEAKKAWTLLYSVVTAKMKEGL-
>SRR6266699_274039 
-------QGELLETSFQAIVlhGEAFVTAFYERLFTRFPETRAFFAA-TDM------------LEQRKKLQQTLALIVQHIQHpevLGDMLQELGQRHV-TYGIRPEHYPSSERCCWRLSPTFSGSTGRRRTTMPGSRGMRQSAAX------
>SRR5438045_5489985 
--------LITRPTSYYLLSlhdalpISLLADVFYSKLFVKNTGLRKMFP--ADL------------QLQRQKLMNMLHFIISNLDQpelFNKEIEGLGLRQD-RKSTRLNSSHLGISYAVFCLKK------------------------------
>tr|A0A1E3GPU1|A0A1E3GPU1_9GAMM Bacterial hemoglobin OS=Methylophaga muralis GN=vhb PE=3 SV=1
-AKLQEQDIALVEQNFAVLMefSDALAERFYQRLFTEYPEIMPLFKS--V-----------TIEGQHKKLLASMVLLIQHLRDtemIEDYLQGLGARHQ-QYGVETSHFEMFIENWLSVVAEFADQKWDSKLQQAWRNVLEYVAELMQSPT-
>SRR3954464_793235 
--------VDPFRSRFAFGVerEPEVTHRFYDVLFAKYPQVQPLFGR--RSR-----------ADQERMLRDMLVAIVDHVEDppwPQHHPPPPPPNPP-RPAPTP----------------------------------------------
>tr|B7QBW9|B7QBW9_IXOSC Beta chain of the tetrameric hemoglobin, putative OS=Ixodes scapularis OX=6945 GN=8038954 PE=3 SV=1
-TEMTSQEKHVVRDTWAIFKkeVQTSGVAIFVVLFFKHPAYQKLFVAFAADP-IAELPQNPRAIAHALTVAYAITSIIDTLDEpetSAELVRKVATNHVRHPTISGAQFEHMGQAVVEVLAEKLGSAMNHQAVGSWQKFFAFVVRVSQGVF-
>tr|A0A1B6H4C1|A0A1B6H4C1_9HEMI Uncharacterized protein OS=Cuerna arida GN=g.19114 PE=3 SV=1
MRRLTEREKENVRLVWKKVedDYPSYGRSVFVKLFDEYPYFKKFFKATIG--NFEDPFMSPRFQKHMLQvLMPTFGGIMDNLDFpeaVNEAVKRLAVSHR-KKELGiaKEHINILGQVIVSVVKRDTL-GCTEEQEEALEKVISIVMAMFC----
>SRR5215813_3453690 
------------------------------------------------------------------------IASDSEIQVspwtrt--GTLAISARRCS-SSRISSGigsdtTFSLYGNCV------------SSSATIAWNTHGD----IQLDS--
>SRR5579859_1863727 
-------NISSLQLTILNLLtvEDEFVPRFYNNLFNMYPLARSLFVHTe--I------------SLQYNKLRLMLMMIIRTIHDadgLKIQLQQLGQRHK-YYRVEPEHFAILYIVFVQTVVEYLGPKWTAELEAAWAEAYGTIVRMMDME--
>Dee2metaT_7_FD_contig_123_47857_length_200_multi_10_in_2_out_1_1 # 3 # 200 # -1 # ID=100007_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.434
----------VLRDREG---lgDPELVVLQRRHLAEHGAILQPLalLARQr--H------------REDLELVRELLLLECDHRVEhprahpaGVGVEGELGVGHH-TERIKRSlspsalLGRWIDLVVVGAVRR---------------HHQGGVVDLRLVE--
>SRR5436853_3450426 
--------PVLLKDSFNLVRseEHTSELQSLRHLVCRLLLEKKKKnkTTTV-----------NYIE---KEKLGKLEA-SCPVEqti-------GIGDKQR-DYQ--QMHHPERTEAQ-----KX-----------------------------
>tr|A0A1W2WRJ7|A0A1W2WRJ7_CIOIN cytoglobin-1-like OS=Ciona intestinalis GN=LOC100183004 PE=3 SV=1
-MPFTDEELKLLRNSWDEVKklgMKEVGLHIFTGLLNAAPSLRTLFYTI-DLPDEeeltiDVMRENKKVVAHATRIANAISKFIKFLDQpeeLEKLLTSLGESHA-RRQVDPESFEYVAPVILSVIGGHLKLPSNSPTLQAWVKAYGVLRNGIVS---
>tr|A0A1W0WQD3|A0A1W0WQD3_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_08524 PE=3 SV=1
-TGLKKRERLVVQQTFEAIsKklgRAVLGRDIFYLFFQLHPAYLQLFKALRDIP-PEQLKTHPRLKAHGLNAIQALAAVIENLEDTettVLLLEKTGRDHV-RRKLQSKHFEDFHSTTVALLKRELGPSFTPFVEQSWNKAFTVVNTVIL----
>SRR5438034_562795 
-------AVETLRNSFERVIerSPNLTRRFYEILFEKYPQTRRMFGL--QS-----------GKGKGNGKGAGARQRLRRChcrlhfgkekaTVvpfPLPVPVPLPAFRD-SYX-------------------------------------------------
>SRR3954466_4238475 
--------IRRLTRSYDQILsaGDCLPELMFAQLFDRAPELRTLFPD--DM------------GRVKHQFARMLHWLIAHLHEpqkLRIALVDLGRRHQ-EYGVKPDVYPHLCEALVDAMATICADDWNEELCRDWRQTFDLMVHHMLRAY-
>ERR1719359_2370951
-------------RLIVTPEhldGCRAGLLALRVVLLHLGEGLGLLG-SDSSGVSdcgVALgeL------------PLQRLDLLGVLLGpr----L---GL--L-NAGVRGLELSLLGRLlrvglselfVAEGLLLGL----------------------------
>tr|A0A212ELK8|A0A212ELK8_DANPL Globin 1 (Fragment) OS=Danaus plexippus plexippus GN=KGM_200313A PE=4 SV=1
-SGLSRRDVFAVQKSWAIVYanPLANGSELLKSPYISRIL----ILLVDKVS-EI----------------GSIVKAATDVE-------------------------------------------------------------------
>ERR1719343_803772
-----------------------RAVDCSFDFSRKSPVPRPSLA-SAKKDfngDANSVYDSRKFLDIGKNFIEIVDQAVDMLGPdlqvVAEVLIDLGKKYHNEYDMRPEYYSVLARALIDELEEILGTDkFNTRTKSCWVQVYGAIAADIAA---
>EndMetStandDraft_7_1072992.scaffolds.fasta_scaffold3604113_1 # 1 # 288 # 1 # ID=3604113_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.538
-NNLTDDQKNVIKKTWITIEenRTKIGKQTFIRVFELNPQIKKMMPEFMTADPIEELNSSRKLFGHSKTLMTCLENAVKSLDDnerFVAYLVELGRRHQ-VRPLKAPYFEVIHEALMFSLKDVFQSDWTTETSESWSALFRYMSEAMIIGL-
>tr|A0A136A626|A0A136A626_9ALTE Uncharacterized protein OS=Paraglaciecola sp. S66 GN=AX660_04410 PE=3 SV=1
-MILTVEEKSAIKESFAVLLRenANVAECFYNNLFELAPLIKPLFKS--GR------------ENIENHFHELIGTAVNKIDHfndLRADLIALGKRHK-IYGAQQAHFAVVKAAFILSIQYKLKGQCSPFLENSWAKYIDNISSVMIEGL-
>ERR1719461_1916292
------------------------NV-SLFSLFAADPGVQtKYFGHMK---------TDADLEKHGVRVMNSIGAMVRAILDqdddrLITKVHEITRNHQ-PRGINRPLLEFFLSVVLDYLAKALDSHLSKEGGA------------------
>ERR1712179_865199
---------------------------------------QrKHFPHMM---------NssigksltKSKLKIHGGRVIREISVMVDCVQAgndeaLMAKIKEITVNHG-VmRDImSIEAYRLVLDGLVAFLGSALGDSLNETGHHAWKKLVNNIITGID----
>SRR6266699_3297184 
----ALARGSLATPCFRSHRAqhFQARMpykPVGSLEAARQHAREGLFRS--DME------------RQYFKLMDMIAAIVGTLDKremFQSIISHSGRQHA-QFGAKPLHFAAFGDALIWGLEQQFGAAFTPEMKEAWIKLYDDVQREMMR---
>ERR1719271_149007
--AVSARERRLIERTWEKAKedgCDALGANLLQTLLVAEPQVMQLFP-FKDE---ENVYESLRFKAHASKLAVIIDAAVSLLANpvkLESLLISVATSYEYsFKQMLPEHFPLLGEALIRTLTSIVGgTKFTWQAESAWRKVWTIISTVMIGA--
>ERR1719203_2782565
---------ITSKFGWTSNmq--------------KIIQSQTHSKT-QDMQ---RDYYLNQK-KTLEI---------------nvRHPLMKELLRRVE-----DNPEDKVAKdMATMMFNTATLRSGFSLKDTVNFAESIELMMRQTLG---
>SRR4029078_13512293 
---------------------vKRVAAELfYVKLFELDSTLKLLLA--D-Q------------QVREQKFMQIVDATVNGLEHsegMMSAVRELGIRHP-LFGDSDEHHGPVATSLFWSLKKCLRKDFSGEECPRAVGGHALC---------
>tr|A0A147B4Z8|A0A147B4Z8_FUNHE Neuroglobin (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
MGELSVKDKELIRGSWESLgkNKVPHGVIMFSRLFELDPALLSLFHYSTKCDSKQDCLSSPEFLDHVTKVMLVIDAAVSHLDDlhsLEEFLLNLGRKHQ-AVGVSTQSFTEVGESLLYMLQCSLGQAYTAPLRQAWLNMYSIVVAVMSRGW-
>SRR5262245_48005872 
----VSMHTSPLRASVELVEqrRSEAVRYFYAHLFAGHPELRTVFPI--SA------------VEEHDRLFTALLYVVKNVHAlpmLAAELQQVGRDHR-KFALSAEHYQVVGASFLATGAAILAEAWTSEIGSGWQSAYRMAASVMSD---
>tr|R7WMM5|R7WMM5_9NOCA Flavohemoprotein OS=Rhodococcus rhodnii LMG 5362 GN=Rrhod_2088 PE=3 SV=1
--IFDDRTLRRVRATYKDMAArpdwdSHLAQSFYANLFAENPQLRLLFPA--NL------------EAQTHRMLTAIRYVLDNVEQpdrMLTFLGQLGRDHR-KYGVAREHYEAGGRALLQSLRGSLVtLLWTPTVDAAWSEVVGTIVGTMAD---
>SRR5258708_3005780 
--EPTPTDITIVSDSLAPLTkeqVDNVLAAFYHQLFTRQPSLRQLFKSFRSGDQ----PDQQAMKLQRNKLAEIIALGLKLWEKphqLIPALEKLGRQHH-QYGVRDEYYEDVWIALSEVLSEAFGLDRWEDICESWQRFIFLCARHMLNG--
>ERR1719347_1330150
YFCLSESNIKALKSCHPHLkdRKEEFGHLFYSNLFSNHPDLKSLFDQ-TEE----------GRQLQAQRLADTVVAFLEKCDDlpsLLPTFKKIGKRHT-TKGVKPEMYQIIIDNLVDTLEEMLGKeVFSAEVKQEVLESISFLSNAFIK---
>ERR1719284_1036555
----------DVSASLDLVKrlpnYeQVVGVRLYQKVLAAGPQYVKMFP-SVASsltssNDPEEFLKDPVLLKHLTSYIRMICMAVDLLGPdtelFEEQVRELGAKHS-EYGVSQRYYVVMGKALIQTLEELLGDRFTPSTKQAWEKMYDLMSSTMIKG--
>SRR3974390_2763688 
--XMSPETKELLETTWAKVIpiSDVAAGLFYERLFTLDPSLHRLFEN-------------ADMKEQRRKLVQALHAVIYSVDDlpsLIPTLEILGRNHV-RWGGIGGTPRDLGGQSHPEAVGRI-----PNIR---IVAVAvGRPDIMLV---
>APLak6261669570_1056073.scaffolds.fasta_scaffold275140_1 # 52 # 198 # 1 # ID=275140_1;partial=01;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.524
---WSTRRVKVVQRSWETFKstqaeSTTVGLAVFKRFLRRSPAFLQLFP-FRDQP-LETLFLNAKVRLHCKLFADTVSRTVGLLGDsvaVKASLRELGARHSDLYKVRSGHYAAMGSALLEVLEHNLGESWDEETKTAWEETWAYITEQMQKG--
>ERR1035437_6084348 
-SSLDQEMIAIVQVSWENVTPDsrLAASMLAMNLCADDRNIASLFEE--DR------------IKMSRDVMQAVSCIVADLDQpetLVPYFGSLGQLLR-RHGLHESGQQTFATALFLTLGQLLGPRYGPVEHNAWAIAYSFVVRIMIAE--
>ERR1035437_3078414 
-SSLDQEMIAIVQVSWENITPNsrLAASMLAMNLCADDLNIASLFEE--DR------------IKMSREVMQTISSIVAGLDQpetLVPYLGSLGKLIR-RHVLHESGQQTFATAFFLPLGQLLGPLYAPVEHNAGAIPX------------
>ERR550534_521252
-TSFKPNEIMEMRVMWNGWvggDMASRGFEMFCKMFEMHPETKDVFA-FMKGSSVAQMQSSSKVLFHVTRVMKYIDEVMRHADRLdevVPILRQVGGRHGTqGYNIQSGYFPFLGNALRQLLKDHFKTRYTAVLDGHFQKMWGFIVKQMQAG--
>ERR1712105_94955
-TEFKPNEIMDMRVMWNGWvsgDLASKGFEMFCKMFEMHPETKNVFA-FMKGSSVAQMQSSAKVLFHVTRVMKYIDEVVKHADKLdevVPIMRQVGGRHGThGYNIQSGYFPHLGEAQRLLLKDFFKDRYTANMDAIFKKLWVFIVKQMQAG--
>ERR1719483_559503
EGPLLAKDVKAIEESFAMVAalgsAKELGIGFFRLLFTTYPEWLEkYFvPNFGDKP-LEEFLMIPRFEVHAPGVIVELSKWVGSLHDldsLVAAIQENARNHY-RRGLNVDHYKKIAGVLLSYISAGLGDSLTTQMETAWTKFLDTMVNVVEEEM-
>tr|A0A195EH31|A0A195EH31_9HYME Cytoglobin-2 OS=Trachymyrmex cornetzi GN=ALC57_03526 PE=3 SV=1
-LGLTEKQKKLVQNTWAIVRkdEVSVGVALVIAFFKQYPESQKEFKSFKDVP-LDELPKNKRFQAHCINIVATLGKVIEQMHDpelMEASLINFTEKHK-ARGQTPEQFENLKQVILAAFPSLFGKQYTSEVQEAWKKTLDLIFSRICQ---
>tr|A0A158NI97|A0A158NI97_ATTCE Uncharacterized protein OS=Atta cephalotes GN=105620364 PE=4 SV=1
-----------------------------------------------------------------MNIT--NGTIHDILSGgkNTQKV--FL--FR-HRGRTKEVVEKEEKIRVAGLDtngshradCPKGTDEGREIGDPVTDSLLQMLQKKEK---
>SRR5690606_21296714 
----lmEWERVKLVQESWSSITpL-gaKFTQVFYRKLFDEHPAVVGLFPE--SM------------AEQEQLLSRMINPAISCLPAesvFENMMHKLGNRHS-EYGINEKHYRMFTQSLLETIRESLAERWTDELESAWAEVLSGMSRRMN----
>GraSoiStandDraft_11_1057310.scaffolds.fasta_scaffold26797_1 # 22 # 990 # 1 # ID=26797_1;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.733
--VIvTDSDISGCFSCWQTVVdGkapayiEdsdpnkpsglvWFSNVFYGRLFDVNPEAKKLFRD--NN------------ETKARALGNIISTGLRQIWDranFSKILHGIAVSHC-KLGVKAIQYGLVGDVLLWSFAYTMKNMWDQDLRTSWIAV-------------
>SRR5690606_23735845 
-TSFVSLNANVLQRSFEFLApqSDRLAKRVFEKLLKDYPQYRPLFAKV-EI------------VDLRQRLIQSLALVVKSAQRpetMVRYLSELGIRHA-EYGITDNDYRPFTSVLLGVLAEFSGARWTPEVKTAWEEVX------------
>SRR5215469_11104805 
--TGVAEQHLLDLGGVDVLP--APDDHVFDPA--GDPQVaaviedAQVAGV--QP------------AVWIDGFRGAFGHVEVAEHGLvaarADFPG-LAGRHG-FPSDRV----------------------ADGDLYL-----------------
>tr|A0A2T7P4Q7|A0A2T7P4Q7_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_10992 PE=3 SV=1
-PSLTADIRRVVQQSWYRLvehrSLDQLGIPVFLEIFHLTPAAKKLFH-Y-SeKTTIEELEGDRRLREHATRFMNAVGAVVDNLDKknsddLDVMLREMGADHTNISTFNQVYCVIFREALLSVWERNLGKaRFRGELKNAWRALITYMMEVMREGYD
>SRR5438128_5040868 
--------------------------------------------------------------------------------------------EY-RWAEGSSelaaEFVRLNVDVIV-----TGRLPAVAAKQADIRHSDCVRDSCGP---
>WetSurSiteA1Bulk_404760.scaffolds.fasta_scaffold823987_1 # 3 # 239 # -1 # ID=823987_1;partial=10;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.409
----------------------------------------------------------------------------------MPN--------------------DSDSCHSVDNSAILHAVLDSAVDGIISIDESGTMESVNA---
>ERR1711918_283694
-----------------------------------------------------------GSECSWMCRC---GIARFEQT----RTTSHKSRRA-TYRvqPDRGILAHPGESCDDHFGGAPWGGLHPEVENAWNVVYGFPSSIMISGPR
>SRR5262245_16285966 
---------XMVEGTLDAVSLPALSADFYRRAFDTDPELARMFTA--DR------------RVQEARFATELAAIVRSIRchdEFVPAGRALGPVPR-L-RRDGRPLPRDGRRPAGIagrcprsdvearGGRGMAPRLQPDRRDDAERRPRAGQLGVTSG--
>ERR1712061_521749
---PVGHMKTAVEQSWERVQalgPVVIGAQEHRDVAVVSRTTST---TSTRI-EESDATAAGSLANPF----------------------------------------------------------------------------------
>tr|X6EW29|X6EW29_9RHIZ Adenylate cyclase OS=Mesorhizobium sp. LNHC209A00 GN=X738_26865 PE=3 SV=1
--------FALAQRSVGLLLddPSAFAAQFYANMFAIQPELEGLFVN-G-T------------GAQGAMLSHMLRTVVSGLERRkhvPAGLQTMGRKHI-GYGVELDHYDSFRGAMLKTIDDIMGAGLTREIEESWSETLDVILGLMKKG--
>SRR5215471_14715706 
--------PAGGPALARLLRr-------HLRRV--VSSRLAPLFLR-LAF------------NDAISYDPATGSGGANGSIRLpeeLARKEVAGLARA-V------------------------ERLRPVKE-------------------
>SRR5205085_9494957 
--------PASGPALSRLLRrhLRCVVTsraapLFLRLAFNDAISFNPATRA-GGC------------NGSirlaeelEREEIQVLSQGIEQLRPLkerFP-HVS-----------------------------------------------------------
>SRR5947207_2391870 
--IISNRQARRTNDRLQIELaaAQARIGLLYFAQHDRTRAAA---------------------------------ALLEGPDAFdqqRPALRAMGLRHV-AYGVVPAHYDTLATAFLWPLGHRLSPEFSPX---------------------
>tr|N1VSG6|N1VSG6_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 GN=LEP1
-----PDPILEIQKSFDHVLeyNPHWIDSYIDKLKNFSMenvTENQREGDN-ES------------PISSEEFLNSIESIIEKLGNpisVKKEVSKLANIYE-SLGITKKEFPKLLPILLSSLRENLPSEWNPSLESIWTQAITDLTIETIES--
>tr|R8ZTT5|R8ZTT5_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira yanagawae serovar Saopaulo str. Sao Paulo = ATCC 700523 GN=L
-----KDQILELQRSLELALqlNPNLARDFYIHFLETKPEFQKFFQNT-DM------------ETQAKKLLAMFGKTIERLGNlnqIQIELQNLGKMHE-EMGIPVTDFGAIAPSLLYALEKSLGDQWNAEWKSIWETALGSLVRLMGMK--
>SRR6478609_9341681 
-------DAELLETSLALVDTpdASLDSRFCALLHERHPAVHPGGGD--TA------------ARQAKLLRSAVISVVDHLDDpvwLTETLGDGTARPS-GWQVAPEMCGAVSECMVAAMVEIGGARWTSQMTDAWVEALDAVSGPMLLGS-
>SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold1207366_1 # 2 # 214 # -1 # ID=1207366_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.286
-----YASHQSQAASLAKAAprPRVAVLGLrlpsgeSPQLARLGRAFAELLG--AEL------------AAGERLLVLPAeRVehMKLELGLdeaEAYPLPTLGRIHR-NLGPDLVVVGTlapqeprgtlsvtveVKDCLTGAVTATAKVTGPAAELFTLASQvggelrrrlgssalsgneraelraqrpaSPEVAQLYADG--
>tr|V4A611|V4A611_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233216 PE=3 SV=1 
-IGFTETQIDTIRSTWPLLSrnMVRVGTDVFVRIFTEVPTVKELFSSF-NIVDVNDLHKMPTFRAHAEMFMQVLHLVVDNLETpyseLNHELMVLGARHATFSGFKPEYFKFYVKCLIQVWELELGEEFILEVRDCWKIVFDFLVDNMTEGYE
>SRR6266542_3322184 
MTVMTPEQIEAVEATTAVLapALDDLAADVYARLDRLAPETAELFTG--GPA------------AEVRGRARDDRARHPAPRRLpGacl--------------PARPPARALRGQA------GALRARRC-----------------------
>SRR5918994_1217714 
------RDiEAYVRT------gRAA------VPVFESDVLLEDCVTS--AA------------NNDWcgVSTRPRNEVWPGFKVGlerAVPVLEQLGRDHR-RFGAVTAHYDAVGASLLATLRHFFGPAWTPELHQTWSEAYGPVAKVMVTA--
>SRR5207302_4688282 
--VVTLEQFRLIQHSWKLVKdGqfaaftaqtliadplGFWGLQLYDTLFALNPSLKPMFKN--TF-------------TQSQMLTEMVGAALGllpgildqalgeektAIDPqLIPILVDLAERHV-SYNVKAAHYGTVGLGLVTTLERTLGSHFDEQKQATCFELWSMMX--------
>SRR5437867_13093015 
---------------------nqnpsPLWRA---------------------RL-------------PR-------VSIAFGlrwfNCnTSkSYSRKCSTNLLNV-GYNVKAEHYGTVGLGLVTTSERTLGSHFDAQTKAAWVELWSLICTVMIP---
>SRR5882757_3847967 
----------------------TSI--------------WPIIIN--TaV------------GirnipQDYRNVARVLRLnqFEF-FTKimvpaAAPYIFTGL---------------RIGIGLSWLAI--------------VAA--------------
>ERR1700737_3002051 
----------------------RDF--------------HHLDLA--DhH------------Q---------HRVagTQW-AN-gsMSNAVWTGV---------------RLKDVLDRAGV--------------KSGAI------------
>SRR3954451_23003713 
----------------------LKS------------TTGEVFLE--G--------------klv-DE-------PGpdRAI-VFQnhsLLPWLTVYG---------------NVAIATDKVFGGSGARSKSKAERHDWVMHNLELVQM---A--
>SRR5206468_1650083 
----------------------TNA------------TMGCVLLE--N--------------rev-NS-------PGaaRRR-QGVcerQDPQRAQRMGDAqpqpradgacqgqA-PG-GDFRRYEAARRHCPRAGHATKSAAARRAVRRAGRADPRAPAGL------
>SRR5258705_633045 
----------------------TSE------------DAGPVALG--N--------------qev-KQ-------PRtqPPV-VFLdpaLPPRPPALD---------------HWLLRAARDAGGP------QPQ--------------------
>SRR5690606_21133184 
----------------------INP------------LHGAVRLN--D--------------aap-RV-------GDpeVGY-LLArdaLLPWRTALR---------------NVTLPLEV---RGI----ERREREQSARKVLRDVGL---E--
>ERR1700682_1967427 
----------------------DRA------------SAGRVVVD--G--------------sev-RG-------PSldRGV-VFQspaLLPWLSALK---------------NVAFAVRSRWPRW-----SDEQVVSHAQKYLDMVHL---T--
>SRR5699024_2544359 
----------------------LSPSSGKIIVAFSSPTSGKIMMD--V--------------ndwtSYKDSEMTALRLkeIGF-IFQeshLLPYLKIRE---------------QLEFVGREAGMDK-------KHARKRAKEILDLFGL---D--
>SRR3954447_21976298 
----------------------RAA------------TGGVVRWS--V--------------dplvAAG-----GRARhpLSM-VFQkdtVLPWRTVAQ---------------NVGLFYALN---RD----RRAGAEGVVDDLIRLAGL---E--
>SRR6266567_262474 
--SMTPEQIDLVRKSFDALWpfRRKLADQFYGRFFELAPDTRRLFPN--DME------------RQQLKLMDTIAAIVGTLDQreiFQSIISLTGRKHA-DFGVQTSHFACCFYPKSLEAPAHAGGFLCSSpLNVSWNGARARPYPLMHL---
>OM-RGC.v1.004444255 TARA_034_DCM_0.22-1.6_scaffold509117_1_gene597562	NOG05352	""
--PfLQPTKFELVVNLKTA----------------------KALGL--EVP------------PTLLARADEVAGVGGSAKRishWPPR------------------------------------------QSRWAGLPRRPERH------
>ERR1719401_1263416
----------NVLTSWNTLKskpnyCDETAALIFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCTVSMLGpdlfELSGVLHEMGRRHQ-RNGVDRSYLPYMSEALFHALAKMLGPQFTEDDKEAWKGVMDYMISEMVIG--
>ERR1719401_232394
----------NVLTSWNTLKskpnyCEETATLVFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCIVSMLGpdlfELSGVLHEMGRRHQ-SNGVDPSYLPYMSEAFVCALSKMLGPQFTEDDKEAWEVVMDYMISEMLIG--
>ERR1711862_565156
---------------------------------------KIMFH-FPVNMNIETVLKSKIFLQHAKFFVKTLDITIGLLGpdtdIIQDVLLEHSKTYQ-NHGVNSAMYLHMGESILYALEKDLGDvNFTSKDREAWAYFYGTIVGVIVGG--
>GraSoiStandDraft_1057264.scaffolds.fasta_scaffold343999_2 # 425 # 754 # -1 # ID=343999_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
---RRRMDAELLETSLALVDtPdDGLTKRFYALLFERYPAVRPVFPEEmhRDI------------ARQAKMLRSAIISVVDHLDDpvwLTETLGELGARHA-GWGVLAEMYDAVTECMVAAMAEIGGDDWTPYMTDAWTEALDAVSGLMLLGYP
>ERR1044072_5206314 
---MAPPQIAVARSTGPKVSPmqQRLAQVFYERLFELDPTTRAFFGG-------------VDLRHHGLKLTETLSAGIEVLGRdgpAPRGS-----------GSGMAALRDGGGCVVHGAGVLPGPRVHDRSPGGLVGGVLG----------
>ERR1719389_1465843
----RCNRKLGGSAKEEKLRrndgtrfvCKI---FKISRFLKQQPDASAVFG-F-DNN-DEDVHKTPKFIDFANHFVEVIDQAVQMLGPdfelLTDFFVDLGDKHSKEYGIKPKFYPILGRVFM-----------------------------------
>tr|Q17153|Q17153_9BIVA Hemoglobin (2 domain) OS=Barbatia lima GN=hemoglobin PE=2 SV=1
----QPANKGLIRETWNIVAGdRKNGVELMALLFEMAPDSKKEFRRLGDVSPA-NIPNNRKLNGHGITLWYALANFVDQLDNktdLEDVCRKFAVNHV-LRGVLDVKFAWIKEPLAELLKRKCGQRCTEKHVKAWWKLIDVVCAVLEEH--
>tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1
----KPANKGLIRETWNMIAGdRKNGVELMALLFEMAPDSKKDFRRLGDVSPS-NIPNNRKLNGHGITLWYALMNFVDQLDNkidLEDVCRKFAVNHV-NRGVLDVKFAWIKEPLAELLRRKCGQTCTDQHIQAWWKLIDVVCAVLEEK--
>SRR5262245_28144535 
--CVTEEQIARVRACFDELTPrtPEVVDRFLARFFAQNAPLRALFP--RDLS------------ALKQDFAAGFRHVVRHLHRldtIAPMLMDLGSRQA-RAGLTPGHFGMAREVLLTTLRDVAGPRWNEQLRQDWTEALNTVVSLMVVGA-
>ERR1039457_5537378 
---AGPLNPALIRKSLALITagPPRGAGGFSRALFSFDPGVGGLVPA--G------------DERAER----APVRR-------------AGPDRR-AAX-------------------------------------------------
>ERR1719498_600299
--------INCVQHAWNVlIIEDRsreflraqesatfvyssciswFYSVFYSRLFNVHPLFRPRLNS--KG------------SKSGKSLVMMIATTINGLRDkdmFQRVVTEMAKNLC-SSGVKPVEYGILG---------------------------------------
>tr|A0A2H8TS68|A0A2H8TS68_9HEMI Neuroglobin (Fragment) OS=Melanaphis sacchari OX=742174 GN=ngb_3 PE=3 SV=1
--YLNKSQTALVKQSWPMITSNNFWTTFYINLFKRNPLYQLQFDRFANVP-FEELESNVHFLAHSFRTGFAFNTAIEHLEKpdeLHRILMDLGEKHR-KFRLTAEHFEAVKDILLCMIEDRIVLTdvpaRNILLVEAWKPCITLVIGVIM----
>SRR5215469_6657410 
---------RLCPVSQSQMSSvvGatTSaaHRITMSPIWVSpCYSFTWLAI--NRY------------TWDRFGLMTMIQTAVENMHQldqILPAVRDLGRRHA-GYGVKAADYNTVAGALLGTLEQALGSEFTSAVRNAWIAYYQTLAGEMKA---
>UPI00001F6528 status=active
---AIIDGLRDLSESFDTLaadeaatApaATELKaavegqfsgvfGAEYAKQTGKQPDTASYTLE---------------------HSAAALAQYHYIVRNphpLGQknKLDKV-AGEA-RYHALHARYHTMLNAYLERFGyydvflidldgdvvysvfkemdyatNLKTGPWRDSgLGRVFRSALESNDtkSTFFDDFA
>ERR1712100_346632
---------------------LFFFFFFFFFFFFFFFFFFFFFS-FKNV---EDLYESPMLKAHGKAVVGAVDAAVHLLDDvskLIPILEELEQFHN-RKKIVAAHYDVVGQAVVNVIGSALNG-LSEEQTNAWVKVYLTIKSVMLA---
>ERR550532_3561775
---------------------GDSSVSPSGELCSPKTKTPRICSTVLE-----LTMHSADFQAHSGRVFGGLDTVISCLDDeatLVAELAHLKGQHDER-NIPDAYYRHFYQALEKVMNAMLGPCFNY---EAWDACGDIVFHGITGH--
>tr|A0A1I3HEN0|A0A1I3HEN0_9RHOB Nitric oxide dioxygenase OS=Jannaschia pohangensis GN=SAMN04488095_0565 PE=3 SV=1
--LVTNTQARLLSRSLRRISenGAPLARSFYAELFSAHPEVRPMFHS--DLS------------TQYAKFEDMLVVLVADVLNpgvILRPLQDLAKRHV-EYGVTREMYPIVGDIMMRTLRTLDAAPLTGDELEAWDVLLGRVNAFLMDE--
>tr|A0A1Q3FVI8|A0A1Q3FVI8_CULTA Putative globin 1 OS=Culex tarsalis OX=7177 PE=3 SV=1
-TGLTNHQKVALIGAWSLVkkDIISHGRNIFVRFFEENPKYLNYFD-FSQDRTASEIGENKSLHAHALNVMHFIGTLIDyGLYNpamFKCSLSKLMKNHL-KRGVKKEDVTIVCGVIMKYCLEVLDQHQSTTLQVAFASLMKGIADAFD----
>tr|A0A2M4DSC8|A0A2M4DSC8_ANODA Uncharacterized protein OS=Anopheles darlingi OX=43151 PE=3 SV=1
--------------MWCKPthQNpegSSDYISICVRLFQKYPHYTDYFD-FTDDTKADSLVDNKSLFAQSIHIVKAFGSLIEyGLKDprlFHETLKRIARWHE-QRNVYGCDVLLIGEVMLTYLTQTLGRQTPAMLGEAFQKLFQTISYRFP----
>tr|A0A0N8DLE0|A0A0N8DLE0_9CRUS Hemoglobin subunit theta-1 (Fragment) OS=Daphnia magna PE=3 SV=1
-LPLNARQKYSMLASWKGISraLEPTGVYMFIKLFEEHKELLSLFTKFHQLTTRDEQANSEELAEHASSVMSTLDESIRSLDNVDtflLYLHQVGQSHYKVEGFQKEYFWKIRNPFLEAVKMTLGDRYTENIENIYKVSINLVIETLVEGYE
>ERR1719383_1265545
-------------HSWKEVGqapADEVAREIFRNIFAIEPGALELFP-FKNES-EDDLwREGGALTVHALKVVSTIDKAVSRLGNmdaVVPMLRKLGIMHV-GPRPQHLGNG-----APMSLP--------RRPTASWRRG-------------
>ERR1719383_514948
----------------------------------------------RGRL-VEGRwRFDSARVKSCVddrqGCVETWQHGRRR-----SNAPQVGNHAR-GLRCAQAHYDVVGQALVTTLASY--CTFTDPVKNAWIKLCGVIKATMVH---
>ERR1712000_66502
--------FPKVQKSWARVLeieakdeSKSFGPIFYNTLFTDFPFLKEqdFKSA--TM------------AEQKMNLPKFITTALSLLGDmpkAVDALQRLGMRHV-LYGTKDAYYPVVGANIIKTLKQILPANEFDQEtQEEWLTLYGVMQKTMIDA--
>SRR5258708_4037766 
--------PGAVGPAPGLQPprNRPGARRGQPALMQSPSAGGPPPGP-HrpRR------------THRTPPRRAALVLLRRSLRDldeVVPGLRAMGARHV-RYGARPEHYPVVGAVLIDSMAEVAWDAWRPAYGRAWAAAFDVVSGAMLAG--
>tr|A0A1Y3AX51|A0A1Y3AX51_EURMA Globin-like protein (Fragment) OS=Euroglyphus maynei GN=BLA29_013533 PE=3 SV=1
----------------------------------------QKFKSFKDIPINfqqnHLIRIDKKLIAHGTYVMYTIGMLVDNLERpdmMRQMLKRLSRNHY-RRRISLKAFERLRDTLLEHLSDILGKEiFHRKTMIAWHKAFGYLLKEIESN--
>SRR5688572_8260099 
-----DQEINIVRQTWNRLAaehGNSVAEEFYKRLFECCPHLKDVFKN--DF------------EVHGKEFIENMDHIIIQLDNpcMIREMQILGIKYA-SYGIRYEDYECMKKALFDALKTKLAEHWTPTVMVSWIWFYSTVSHIMKH---
>tr|F2Q9X2|F2Q9X2_BRAFL Globin OS=Branchiostoma floridae GN=lGb7 PE=2 SV=1
-MSLSAADKKLVQESWDKVSkpsFADAGERVFLKLFRRNESTKAHFKKFKDIPS-DQLAGQAVVRDHGEKVCKVLDDFIKGLDGsGDEAVKKVGRMHK-GLGMSNEQIDQMKGAIIEVLADAgFGD---ANYKGAWGKLWDRFMAVHRA---
>tr|A0A1B0G6S0|A0A1B0G6S0_GLOMM Hemoglobin-like flavoprotein OS=Glossina morsitans morsitans PE=3 SV=1
YSTMNSDEVYEIKRTWEIPatTPTESGVAILIRFFTKYPSNLQKFSTFKDMTL-DELKNNPRFKAHANRIMKVFDDSIKTLDDncshLEEIWTKIAQSHF-NRQIEKQSFNELKEVILEVLVAACN--LNDQQTEIWLKLLDFVYEIIFKT--
>tr|V5YM54|V5YM54_9DIPT Globin OS=Polypedilum nubifer GN=PnHb18 PE=2 SV=1
IVALTEADVEIIKRTWKIPsaNPHDSAALIFSTFLEKYPHNQQKFPAFKDKPL-SDIKNTVEFRAHASRIFNVFSSVIDGLDRdtemmkgIKKIIAEVGKFHA-KKKVTKKAHNEVRSVLVDILIEVCK--LSDEEKAAWTKLLDIFFHVMFEC--
>tr|O96457|O96457_9MUSC Hemoglobin OS=Gasterophilus intestinalis GN=glob1 PE=1 SV=1
---MNSEEVNDIKRTWEVVaaKMTEAGVEMLKRYFKKYPHNLNHFPWFKEIPF-DDLPENARFKTHGTRILRQVDEGVKALSVdfgdkkFDDVWKKLAQTHH-EKKVERRSYNELKDIIIEVVCSCVK--LNEKQVHAYHKFFDRAYDIAFAE--
>SRR4051794_9566520 
---------------KALVEdvAERghrrPMEVFYGARsdhdlydidtmlrmAQSHPWLS-VRPV--VA------------TGpaggPMNSLSGQLPDAVRQYGPwreYDAYLSGPPGMIR--NGVD----ALVGVGV---PSDRIRHDSVEELVAAGDX--------------
>SRR5215470_9890699 
-----DFDRGPIRELLKHLAvePDAAMEYLFARLFAAHPDLRGLFPY--GM------------TQTRAAVFGELAAIIGGLDDqerTEQTLARLALGHR-KFGVKDKHYEPFFDAMFVTAQHAAGAAWTGEMAASWRSALDWFGSVMAA---
>SRR5262249_54331370 
--IRLRK-------EIDNEWllIASgVLSVIFGLILVAQPGTGALA---------------------LLYVIGIYAILYGILGPrpcCV----------N-RFGAQTALDRG-----------------TSTYRELWNIS----VARLIG---
>SRR4029079_9820506 
-VRVDGILVEGLQASLATMQpaAAQIAHGFYTLLFARRPDFRAMFP--EDM------------AAQERKLIATLAFVCEHWRKpaaVSVRLADLGALHQ-GLHVKPEHYPIVCDALVTAVMKHRHEALGPHRAR------------------
>ERR1719310_1734953
----SASSVKAVQASWAKAEnigLRVVGELFFKELFEASPAAKELFTA--Q-KFGEDAAGQRRFKAHTLNVMQTLSAAVYGLSDlsaLARTLPAPTYAIL-SLSFTLISFTSL--------------SLTPLI--------------------
>ERR1712087_347811
--------------------------------------HEELFTA--QKKFGEDAAGKAHFKAHTLNVMQTLAAAVYGLSDlsaLARTLPARIYAIL-SLSFTLITFTSLSLTPLIYHTLTLKGARARNSGRaaPWIRRPT-----------
>tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii GN=F775_23753 PE=3 SV=1
-MAFSEAQEELVLRSWKAMkpDSESIALKFFLRIFEIAPAAKPMFPFLRDAGEDAPLESHPKLKAHAVTVFVMACESATQLRktgDvkvREATLRRLGATHV-RAGVADAHFEVVKTALLDTIEGAVPEMWTPEMKAAWEEAYDQLAAAIKEEM-
>SRR5262245_14739337 
--PCARARLRPR-------RpaL------Y-AQALPPRRLVPRPVRE--L------------AEAQSRKFMAGLKLGIIALNyedGLTPVIRLVGVRNR-RAGIKVRHHRVMAKALLPTLEQSLETRFTRDTKHAWSSFLTQVTRILSG---
>SRR6266699_2273235 
--FFLPFKE-LTEQHFSILGlrkARRAGLVLAQELFEHAPNVGARHSN--AF------------GGRGYCRRMR---------PRtap------VCDSAR-CWAPSCRRQ---APLALR-------------------------SCRPVR---
>tr|A0A084QEN9|A0A084QEN9_STAC4 Uncharacterized protein OS=Stachybotrys chlorohalonata (strain IBT 40285) GN=S40285_06080 PE=4 SV=1
------------------------------------------------------------MEKYPRIDIRSPAGVSIIYKDvssLDPAQEEIRVLHL-HGG---PEDSPIECTLHKiALKSNPPPVYE-ALSYTWGDAsvtreIVL-NGHVVS---
>ERR1712224_896978
-GCLSHRQSTLIRGSLPMLraQGETITSSFYASLLSAHPELHNIFNS-AN----------QATGRQPRALLNIILAFAAAPNHtaeLIPRLERVCQKHC-SLGIRLTSTTSSASTSS---GPLARSS-------------------------
>tr|L8LYK6|L8LYK6_9CYAN Hemoglobin-like flavoprotein OS=Xenococcus sp. PCC 7305 GN=Xen7305DRAFT_00009490 PE=4 SV=1
----MSLQIGLLEQSFNCIRPyGkLFVSSFHENLFQTNPEIKSLFMGV-E------------SQIQKNRIWDTLVLIMENIrhpNLLNNTLQGLGARLF-THGLLPKHYPLVKKAFLATFKQFLGNEWNSELEQAWKNAYTYFHDLMQEG--
>ERR1022692_2453048 
--------XMSLPASFTSICngilGREE--------NSGCPAAKGQFLP--DR------------DAWrRssaLLLFGPLHQASRSTGYvshLHegaArppgrRispDRRPGRQAG-RSGRLRAGPRAGPPQVRGHRRALRRGRRQPAGDTGAFRGRHLDARVMIE---
>tr|S0BCU7|S0BCU7_LAMSA Extracellular globin OS=Lamellibrachia satsuma OX=104711 GN=v2hb-B2 PE=1 SV=1
---CTTEDRREMQLMWANVWsaqftgrRLAIAQAVFKDLFAHVPDAVGLFDRV-HGT----EIDSSEFKAHCIRVVNGLDSAIGLLSDpstLNEQLSHLATQHQERAGVTKGGFSAIAQSFLRVMPQV-ASCFNP---DAWSRCFNRITNGMTEG--
>tr|A0A1Y1ILY9|A0A1Y1ILY9_KLENI Cytochrome b5 isoform OS=Klebsormidium nitens GN=KFL_008610010 PE=3 SV=1
-PHLTTSDVKLVQESWAKVVeahGVGAVTLFYVNLFTLAPHLESLFKKTKN--------------IQEAMFTDMMMTLVGKLHDwewVVSALEASAIRHL-RYGVSVSMFPAVGQALLQTLDMGLGVHWTPEVKAAWIKLWTAIVSVMSVHL-
>SRR5579875_3194573 
-------------------------------------------------------------SRCCSRATPSYGRCSRSRCrgpgrrsAtgsPSSSATCRRPGAR-RSCSRRWPGITAGSASvtgtTGRSSRRSGPAWTAELDAAWLAATDWFVSVLAA---
>tr|A0A0L8P0I1|A0A0L8P0I1_KITAU Flavohemoprotein OS=Kitasatospora aureofaciens GN=ADK78_37645 PE=4 SV=1
----GAADQRVITEYLELVTpfGE-LITHLYETMFRRWPYLRSLFPE--SM------------EFQRAHLARAFWYLIENLHRpddIAEVFGRLGRDHR-KLGVRPVHFQAFEAALCEALRRTAGPRWADAVEQAWVRMLRFAVAAMVS---
>SRR5688572_1436081 
--RPAPEVIAAVSASCQAVAdrPVRLAEAFYEHLFEIAPQARTMFP--ADMT------------AQMQRMSDTLVGAIAQLEKfdtaqLEAALRRLGADHRTRHGVEAEQYRYVGHALTRAVRDVAGLAYSGALSSAWIAVYQYIEAHMSAG--
>SRR5947208_57978 
--EMTPEQIALVQHSIEVLGprVDTVVERFYQHLFEIDPSVVELFST--DP----------A--VQRRKFeveLRQIIKAISGFDEFAGRAHDLGIRHS-HYGVRARHYRSVGDSLWWAWQSVMGSAVDSEHSKVGEAAQDV----------
>SRR3954454_13764990 
----VLDPAMLVQSTFALVArqRQRFSERFYANLFAIAPETEVQFAG-TPP------------ELRDRMFVEILFLVARSMSrvdEIAPALTELGARHV-AYGTLGSQLPLAKRALLAALRELLGDAMTAEVEAAWSETYDAMAEPMARGM-
>SRR5579864_8015183 
----KPDPIFLVHTSFVHLRprMAEFVSNFFRRLLKDSPELAPIFED-ADS------------VRLKTMVAKIFGTTIAGPEqtdQVEADLAELSRRHK-SYGAIPDFLPLVGRAFIATIRESLPDDTTPQTIEAWELLYANTAALMSKGL-
>ERR1719483_919245
MAVLSKSESDLIYKSWALAAdeKEKHGGAFMVRLFTEHPEVQaKYFPKM-DMN------DFMLLSKHGSKIMAAVDTLVNYVNDgndekLVKTINHVASSHF-RRGVVTrEAFEIVTEVLMNYLITTLGDHLSPEAQLAWKKLLSVLVEVIA----
>ERR1711860_359782
----LFSKSNYVFAS---------LSRNTFKLFKDERSLYeKHFSSF-DVN------DILRIRAHGLKVMKAVNSMVEAVSDendesLIDQIHFVAHGHH-LRGITPrNEFEVRRKILNLDYHLLFHyllkkGCLSQSX--------------------
>SRR6266545_1588040 
-------------CDLEQAVdtCPA----------A---LVIGLRP--ATMG------------TL---------CYMGGLAsa------AVCCWRHV-RVVTCSQFF-------------------------------TTASPQSRQ---
>DeetaT_16_FD_contig_41_1516467_length_281_multi_3_in_0_out_0_1 # 3 # 167 # 1 # ID=1772959_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.418
------RRMNLVKQTWRSVEfglGHKATQAFYDRLFANHLDTRRLFAG-VGM------------EGQSRKLYDLLRLAVRSLDDldaIIPTVQEMGRRHARSYGVVRDHYGAVTQAFIEILHQYICSqlghmahsRYLVDVADAWAWCLNLIGNIMAD---
>ERR1719433_537024
--ALRISIVGREKRA-NCTVtlgRVEQGELQVGATVLLVPPGAECGVQSvevdgREVRSAqagefVCMRLLgcQP---SVGHALSSVD---GPLRSatkLKVRSAQAGEFV------------------------------------------------------
>ERR1719161_1849694
--ALRVMVLGMTADKVG-AAlegHVEQGTLRAGTRCLAAlsEGQAECNVQIvllngVEVSHAgpgehVRLKVTgaAAKGFTAGQVLSCIS---NPVRAigkFKAKLRLMSLPEM-LS----------CSLLVL----------------------------------
>ERR1719277_2163216
--EATDAMKGAVQRSWDQIQalgTTVVGEHVYRYFFELVPEAVNCFPVHvrlkyREwiADEPdenGDLRNSAALRNLFAKVLNAIGCTVAGLQDaskLVPLLSSLGARHI-GYGVSEEFWPALGKAINRTLQDLLAEAFTPEVENAWNTVYGFMSQIMVESLR
>tr|A0A2G8RXV1|A0A2G8RXV1_9APHY Uncharacterized protein OS=Ganoderma sinense ZZ0214-1 OX=1077348 GN=GSI_12102 PE=3 SV=1
PKPLTAEQRKLITAIVPVLEqhGKTITTLMYNQMLEENPALKNVFSKS-----------KQERGQQPEVLARSLYAYASHIEDlgpIMPFVERIAHKHA-SVHVEPAHYDVVAKYLTNAIIQVVGaDVLAGALYDAWIAAYWNLAYVFIDR--
>ERR1712080_154454
-----DLQKIIVKHQWARSYnegmsREYFGQAIWRAFFKLDPGARRFFTRVRGD-----DISHPKFQAHSLRILGGIDMCLSLIDDvptFEAQMKHLQGQHI-EREVPSYYFDRLGTVLQEVMRAATGYCYDE---VAWGACYKYISDRIKANY-
>tr|A0A0S2MLM1|A0A0S2MLM1_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1
-----PLDRILVKAEWAMASdgghkDSELGSSIFRALVNIDPALRGTFSAVGGE-----DMGSAQFRAFAFRVVAGIERLIAVLDVdavLSADLAVLHSQHV-ARDVSAANYESMLSAIMSVVPSAvGNSCFSS---PSWSRCLNVIAAAM-----
>tr|A0A066YRR6|A0A066YRR6_9ACTN Putative oxidoreductase OS=Kitasatospora cheerisanensis KCTC 2395 GN=KCH_40190 PE=4 SV=1
--PPDAADLALAGAVLAALRpvADRAMAHFFALMFLRHPELRAVFPA--A------------MDGPREQLLRVLRECVRHGDDpaaLRDRLGPLARRCR-KYGVLSGHYASAADCLVEALARYG-SGWDERAEAAWRRLLAPVARLLVEA--
>ERR1719329_2046659
-----------IKTVWAKIMkevgTLNAGTMLFKNVFMLAPETKQLFPKFRHLK-DDLLLSNESFKNQAKLSISALSNAIMSFDDppkLKRMLMDLGRIYE-SKGVSLATLPIVGNALMATIEAALGNDSCIETFNFFALFYNEGSNMLAEGYK
>ERR1719265_1860150
-------------------------------------QALNYFPRFKMnnlLF-SDALFEDEIFKIHAYKLINAITNAIDLLDEpvkLTETLKHLGRIHE-NKGIPAESFVVIINAFNVTVANLISRDSSIETINFFALFMNEGTNLMTDGX-
>SRR3569832_2958212 
----PALVRSAPDSAAALRrcRCGGTAEKIAERARADD----------------------------------------PESEKsrgAGADDERIGRTAQ-AIRCSAGRLSSGACCAVGGHGGIGGX--------------------------
>HubBroStandDraft_4_1064222.scaffolds.fasta_scaffold919957_1 # 1 # 597 # -1 # ID=919957_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.524
-HPMDPSRVMRLRISHGWFAPcgEALVARCFQILGEQTPGTRSLFP--ADTA------------SLHPRILRTLRQVLSNAHEfrtLEPPLARLGEKLQ-RRAGGvehlLPHAAAFRDAFICVLAEAGGRSFTHQMEQDWRMLLDGVLGAMIAG--
>tr|Q1GDP0|Q1GDP0_RUEST Globin OS=Ruegeria sp. (strain TM1040) GN=TM1040_2494 PE=4 SV=1
-AILRQIEVQLIKVSFNRVFaqKAALAEKFYHHLFLELPDAEVMFT--RDFS------------HQTEMFARVLTTGMQSLGRdreMMVLVDDLLQRHK-HLGLTLDQMYTAQRALHLAFCEVMQAELTAAEVSAWDNAIGRLCRALAAGI-
>ERR1043166_6829872 
-LNLTADEIDRVRTSFDQVWaiSSRMADLFYDRLFAGNPFARSLFPA--QQ------------DERKQNFMLNLAVIVAGLDEradMDRSEERLVQAHA-EAGIRVDQSEVMRDALFWSLEQGLGPAWTPGVAAAWRKAYRLLSEHMAS---
>tr|A0A257MW93|A0A257MW93_9GAMM Uncharacterized protein OS=Methylococcaceae bacterium NSP1-2 GN=CG439_2278 PE=4 SV=1
---VKVKNRLLVKLCIDEISpkIDIVSQLFYQELFHLNIHLKTIFSG--NVT------------FLNRKFINMMAtfKNVKHLEAIENSVEKMGERHVLHYRVQLKHFPTLKKALLLALKKHLGERFNAELEAAWHEVFDDVAEIMQRA--
>SRR5690554_3276444 
----xmSDADRLQVQASVERIRgqMDGFAGCFFDKLFALQPALRELLAT--E-E------------GRRSKLRSMVStlANSRDFDKIAPAIRRLGDRHR-DYGVGVQDYVPVQQALLHAVAQVDPQGQSEQVQQAWSGQFQRISALMEPQ--
>UPI00042C7A07 status=active
---MNDTQRLLVKADIDSLGndINALSQIFYRELFHIDINLKSVFPG--NVV------------FLNRKFANMLAtfKNLGHLEKIGASLEKMGERHLANYGVQLENFAPVRAALLIALRSYFKENFDAEREAAWQAVFDKVADIMKAA--
>SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold510383_1 # 42 # 362 # 1 # ID=510383_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
----mTSKDRALLKECVEYIEsesINELCDIFYKKLFDLDPKIKLILSD--NDV------------VLRRKFFNMFStfKSVKYIDKVSEIILQMGARHK-SYGINEKHLELMKEPLFESLHEVLGDEKFNYYKAGWEIGYQEVENLFKEG--
>ERR1700737_3653126 
MTALTADQIARVKATAPVLAehGVTITKHFYKRMFTNHPEWKNVFNQ-AHQQS----------ASQPQALARAVYAYAAHIDNlraLGSAVSHIANKHA-SLNIRPEKYPTCGKICWRQYPKCWAIPSMNPRSTPGAPLMRNSRRFLSGR--
>SRR5919197_656730 
--LLDDDTIGLLDESLRLIDdrSDVVVNHFYAAQFATPPPRGLLGSR--AR------------GC--------LGRGVR-----RDGPGDVGRRSR-GGGGRAGLV--EGRD-------------------------------------
>SRR5919106_2778213 
----------------------A-VDRFYAA-VLGDPELAGYFTDvdidrvkrhqvlllsdvlggpesyDGPD------------LGQAHRGlgitdghyDKVVGYLVAVFTDLgadGDTIAAAAEVL----ASVK---PQ----I---VEDQAGSRDSHEX--------------------
>tr|F4F3R7|F4F3R7_VERMA Oxidoreductase FAD/NAD(P)-binding domain-containing protein OS=Verrucosispora maris (strain AB-18-032) GN=VAB18032_21340 PE=4 S
-------MRDHPAAEVGGIAeavFGRAAARFWDTVQEGCPGLLP--------------------EGDAPLILAGLLRLVGGGDDRpgrLALLTVLGRVYR-EHRLRPDHAALVGA----ALT--VAVPSMPPEAATWRRA----WRlVERA---
>tr|A0A2T3A5F4|A0A2T3A5F4_9PEZI Flavohemoglobin OS=Coniella lustricola OX=2025994 GN=BD289DRAFT_370338 PE=3 SV=1
--ALTFKEAQLVKSTIPFLReqGEELSNLVYGNLVKRNPELNNKLNVI-HLQDG-------RLARALTVVILRFACNINDMSELIPKFERVCNKHC-TVGVQPMHYELLGALVIEAFESLMGDALTPEIRAAWTKAYSILSHMLIGR--
>SRR5439155_13306073 
-VLLD-------GGTLRAVRmsGDTRSEPWLKDLWERGVAVGELRRHLllpletppGLP------------VPRGRILCNCFDVAESEIDAfla-------------------------T-SNSIAELqarlkCGTNCGSCLPELRRKSLCDIG-----------
>ERR1043166_8897093 
---GTRDQADIVQLTWHSVLpvGGTFAELFYGRLFALDPEVRRLFKD--DI------------VEQGRNLTAMLSVATANLVKperVGRPPGGLHFRRK-D--VDQRVLEREEERVLHQRemlrPHAVSGVALAELMERHADAP---GGVHRHA--
>Wag4MinimDraft_6_1082665.scaffolds.fasta_scaffold479856_1 # 2 # 223 # 1 # ID=479856_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.387
-IALE-------DGRLRAVRlaGDTRAASALLELWERQAPVDAEDLPEtPAH------------ASRGRIICNCYDVSETEIAAy----------------------------RSLADLqaalrCGTSCGSCLPELRAKFGVIPR-----------
>tr|A0A2B4SBA2|A0A2B4SBA2_STYPI Serine palmitoyltransferase 2 OS=Stylophora pistillata OX=50429 GN=Sptlc2 PE=3 SV=1 
--QISQKQISLVQETWGLVsgDLEKVGVDFYMRLFKANPDVLQLFS-FRDIDKSsdDIMRADDRLKRQGLVTMQHVDLAVNSLNDlgsIVPALRDLGGRHA-MYKVEEHHYVLVGSVLLDTLNNGLGDNFTVEL--FWAALLNTLDKGLGE---
>tr|A0A0C1L0Z1|A0A0C1L0Z1_9BACT Uncharacterized protein OS=Flavihumibacter solisilvae OX=1349421 GN=OI18_18680 PE=4 SV=1
-MEMTPRQMQCVRNSWRNFrdlDPAFFSEPFYAKLFADHPAAKKVFGD--NL------------AEHFSFLHEMLSQLVSRIDRPdqlLITCSRIARNNA-ALGMNEKFYEWYGHALIWTLRQGAGADWNMETEQSWISYYKYLVD-------
>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3668839_2 # 105 # 377 # 1 # ID=3668839_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.656
--------SGPLAASLAIFEprLEAVTARLVDVLAASSPHLLALFPP-SSE------------PS-----AALLGRFLTRIVEtesLGqPLGDGLGLDAY-PIP-TRDQWEHLVESFIWSLSAVAGKAFSPPMARAWRATGERLFSTMFES--
>LULI01.1.fsa_nt_gb|LULI01000097.1|_29 # 27187 # 28320 # 1 # ID=97_29;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.310
----------------DEIKgrH---HSMFVDEFERQQPQYKD---------------------------------FWARL---NrGEYQAGEYRRY-GKG-GKEVWIQA----------------------------------------
>SRR5947209_9205436 
--------VLSVLRSpssplF---PyttlfRSRltver--DSERDVLMvaggtGIATMRAL--LD--DLA-------------QWgENPRVHLFYGGRTDDDlyaLDd--LHQLDRKST-RLNSSHANISY---Avfclk-------------------------------------
>SRR5690606_15697619 
--------VRVVAGGwvsralvrqtvpgdrW---RvgapMGElwrdr--DVQRDLVLiaggtGVAPLHAV--VE--DLA-------------GRatQPSSVTLFFGGPTADAlyfLPe--LRELAADLP-WLKLVP--------Vte----------dgsvddgergklPEVVTALGGAWSGHDVLVAGSPGMI--
>SRR5919202_1970091 
--------VQMVPGGqvsstmvrslkvgetV---RlgapLGQaltlyag--ERHRDLIMvavgtGLAPLRAH--LE--RIDQ-----------EwqSTgRAPRVRLFHGARLPWGlyeNRl--LQNLAG-RP-WFTYTP--------Vvsddp----------typgrkgwvGDAAAVS-GPLHGLLALVCGSPEMV--
>tr|A0A1D8N423|A0A1D8N423_YARLL Uncharacterized protein OS=Yarrowia lipolytica GN=YALI1_A07937g PE=3 SV=1
-FNMTREDINLTKELWAKLMndPEtlessaaygtptaLFCEQFYTNLMASHAELTSIFP---SI------------KKQSVAVAGVFGLAIKSLDHiekLDEFLWSVGKRHNRMIGVEPIHYRWLGEAMIKTFADRFGDSFTLEMETAWIKIYSYLANKLL----
>SRR6266851_2503075 
-----------------------------------------------XM------------RNGSASLPLwPARYGAWTTRRpspNISAPSRSTI-----------ANSVCGRAITNWSARRCSPPSVSSAASGWEAAFNRIATIMIQ---
>SRR6059036_2276597 
--ALFPGTSHWVV---AAGMarP-ESKDHPMLTVAQKTLVQ-------DTFA------------IITPIADDAAALLYKKLFEldpSLERM-------------------------------------------------------------
>SoimicMinimDraft_1059729.scaffolds.fasta_scaffold91729_1 # 2 # 175 # -1 # ID=91729_1;partial=10;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.661
----SMEDRLEMIHEWETVWsaeftgrRVLIAQELFSRLFEKDGTTQALFKNVG-G----DDVNSALFKAHCVRITDSIDTIVHMASYtdvEHQLLDHLGDQHAHYDGVLGSHFKLFRECFLEVLPQAIP-CFNS---GAWGRCLKVFQDEIALH--
>ERR1700754_2066947 
-------DPGdrQLARELLAGAagGDDLDALvehDRGAVLEIAREAVPVaLAQ-ADR------------DdQLGHLGA--------------DRlLRGPAERPL-GRGAPLQDVALVvhrddavergqqqRAVALAAGAELVGEIWERQERGSLtARRYGSNRSI------
>SRR5208337_544005 
--TMTPQQTRLLAQSYAKLEnrLYELGSAIFERLFEIDPHSRPLFK--GNMD------------EQKLKLARLFGEFIRIRarsqhflpvtgkagQVVIPGIGSLGARHEMVYGVRPEQYAHMRDAVLYAIRSLLGNDYNDEIGQAWSEIFDMLAHAMQE---
>tr|A0A2A6CNA4|A0A2A6CNA4_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_32112 PE=3 SV=1
--QCNPRYTALLKSTWSDDfEvLFALGAKMYITAFEgpHGVACKSLFPWVAKYEeAGENYADKSEFRLQALRLVQTIVKALDKVDDlqkLEAYLYAVGHRHV-FYlpvWLDPVYWDVFKasratsylgqstmlksaserDAVQVGVNDHLHKlsKLSTddlaRATLIWTDIIEYIFEYVKEGF-
>SRR5437763_1847173 
------------------------------------------------------------MVRQKRHMVALLSQVLGGPKQy---QGRDLAEAHR-SLGISGLHYERVGNYLLASLLiaqapydvinavtdvlagqrdKIVAAAWAAELAADWTDAYSLVARVMVE---
>ERR1719244_1430206
-TGLSRKQRFLLKGSWKGVSrdLESTGVSWFLELFETCPNARGSLRQFSHISLDDDLTENQPFREMTEKVLERLDNALFSIEDadsMRSILLETGDYLRSVVGLNNDIILQSEGPLLSAIQRTLDERYTPQMEVIYTVIVKFMINTMVEX--
>ERR1712228_920792
-----------------------------------------------HISLNEDLTEVQQFREMTEKVLERLDNALFSIEDadsMRSILLEAGDYLRSVVGLNNDIIMRSEGPLLSAIKDFRREIhttngsdlhsdskihdkYNGRMRPL-----------------
>SRR2546430_6350501 
----GRResRVRGGQGGWV---sRAIVAEPQRGDVGRSGPAMGRMKVD--RG-------------AGRDVVMVAGGT------GlapMRAIIDDL----A-QWGENPRvhlfyggrgrggPYH------PPSLVSTAAAqPGVPVVavagaeaglshkeagspagggvrHGALAGRG------------
>SRR6195952_1380156 
----VALAGEAVRAIWFRLAdqEADVAHWFGALLFSLAPHLRAQFPA--QA------------DRAARRLLRASIAAMSAVDRpqeFPAAIGTLARETR-ALGLDASADEPVGVALVGAVREFAGELWAPGADAAWVLAYSLAAEPARR---
>ERR1700709_350262 
---------------------------------------GDLDAD--AT-------------AERELLVVAGGRRGGVGpaprGepaGpsgAGGGRPPRPARLA-AGVDVRRttvivgartaedLHT------LDRFAVIGEDaPWLAVVgacesdplelglapgpvvegitrAGPWLEHDVVVA--------
>ERR1700709_656719 
----------------------------------------------------------------ADVVAVAGGP------GasgALALGDDLAAQAA-AGVDVRPttvivggrtpedLHT------LDRFAVIGEDaPWLAVGgacesdpldlelapgtvveaitrAGPWLEHDVVVA--------
>SRR5262245_28534727 
-------efHVKTVPGGWV---sASMVNDTQVGDEWKIGPPIGLLGLV--TH-------------SQRDLLLIGGGV------GvapIMSIVPEL----L-RRRSSNRvslfhgvrypheLYL------NGTLDDLAARdPNLEVVkvvsrdrnyagitgslpdvvaqHRDWSAYDVVVS--------
>SRR3569833_3303276 
---------------------------------------------------------------------------------pNNTNHDKH----T-HRKRNPPehqniggkrpedLYV------LDDLRRLTAVsKWLTVTgvteegaipggdrgtlahavaqRGVWEYYDILVS--------
>tr|A0A161TXB5|A0A161TXB5_9DIPT Globin 11 OS=Chironomus riparius OX=315576 PE=2 SV=1
-ATLNADEAKLVKGSWDKVKGQE--DGILYAIFKENPDIQAKFPAFVGKN-LEEIKSNDDFTKHADRIVAAVSKYIELVGNeantpaIKTLLNELGQTHR-SRGATKEQFEKFKSSVAKYLKEHSG-AWSDATGAAWNKAFDEMYAIVFSSL-
>tr|V5YNC2|V5YNC2_9DIPT Globin OS=Polypedilum nubifer OX=54969 GN=PnHb4 PE=2 SV=1
-ATLTESEANSVKTSWNLVKDKE--DEILYAIFKENPDIQARFPLFVSKN-LEEIKTSADFKTHADKIVKAISTYINLLGNeantpaIKTTLNELGQRHK-DRGATTEQFEKFKVSVLKYVKEHAT-GLTADAENAWNKAFEEMYKIVFANL-
>tr|Q23764|Q23764_CHITU Hemoglobin IA (Fragment) OS=Chironomus thummi OX=7154 PE=4 SV=1
---------------------------------------------------------------------------------tILAKAKDFGKSHK-SRTS-PAQLDNFRKSLVVYLKGAT--KWDSAVESSWAPVLDFVFSTLKNEL-
>ERR1712170_324299
-------------------------------------------------------rVCREKLNVHALCVVAMIDKGISVLDKpcdFVELLLIHGRRHK-NHGVARKTFQTLGNFFIQSFKEVLEDDWTDEIEAAWKIFFRFLNIGLEAGY-
>SRR5688572_12388254 
--SMNEEQIKLVETGFQSITgrGERFISRFYENFFAASPKAEKLFAQT-EW------------PNQSRKMLLTIMMVVDNLRDaahIKKMLHEANLVHQ-KFTLQADDFDALTDAMLRTLREFLTDDWSKEAEDAWRAAFAKINAIMLEA--
>ERR1044072_9602616 
-------LEQSGYTVVGRAAdaRELmLKVRSYVPDVA--------VVD--VR------------MPP------DL--------TddgLRAAAEI-RRSHptV-SVlVLSQHREPAYMLELVGDDASGVGYLL-KDRVRDVTQFVDAVQRVAAGG--
>SRR4051794_28399871 
-------EHEAGTDLLELTD------ALVRAGVPCADAAQEAVAG--VE------------LPHGAQLPAER--------LadrLERRRVD---------lD------------------------------RLLRFGEDAG-HLVLGA--
>SRR6266545_7915566 
-------ELDTLETTFDLLAprGEELMDIFYARLFAAAPGGRAAVRR--HR------------PSPPEGSPPRR---------ARAPAQV---------aA------------------------------QPRCDRPDAA---------
>SRR4029453_17830486 
-------DLQALETSFDLVAsrGDVLMDVFYARLfaaapa------VKPLFAG-TDP------------RRQKAMLLGALVRLRGSLRGppaFVPPLPRPGAGPggE-APlrrhrSPAPEGHAARGPraaAWLPARPAGVRSGaatPRGQARRLWRPAGALPGGRRgpdrLHG--
>SRR3546814_7943381 
--------------------------------------vfirlslsliiilvyRFLFFFF-SSR----------RR-HTRCVLVTGVQTCALPIS----TDELIA-------AWAAAYGQ--------------------------------LADLLIA---
>ERR1700737_1149585 
---------------------------------------------------------------------------------kqPDGSAEKHFEQAC-ESGRPTGAVSHCRGTPAGCDQGSVGRRRNRRDHFHRGKGYGNLADILMG---
>tr|A0A255XUI9|A0A255XUI9_9PROT Uncharacterized protein OS=Elstera cyanobacteriorum GN=CHR90_04515 PE=4 SV=1
-PMLSSQSIATVKATAPALRphGLNLVVRTYELLLRDPNI-RMLFDP-A--------------rqvnGDQQHIFAETVIAYVNAMDRldtLKATVKHLTIQQA-LLDAQPQHYDAIAIALIQAIHELFGKDAVREITSAWTEALDVLHQESPG---
>ERR1043165_5678211 
------------------------------------TAglktrkpkgltdsdmdilvpvtA--------------------------ALFLAGMTAYIGILA----LRELSATRLA-SATAAVEHAF--------------------------------LREQISE---
>SRR6476660_7153442 
QYMLPQRTIDIVKSTAPILEehGETLTAHFYRRMFAYNPEVAPLFNP-A----------HQRAGSQQKALAAAICAYAANIDNlevLGGAVELIAQKHA-SLRILPEHVRITPESEIISSFYLQpADGGGLPLFKP-GQYITVRVPDARG---
>tr|A0A2D6MWT2|A0A2D6MWT2_9DELT Uncharacterized protein OS=Deltaproteobacteria bacterium OX=2026735 GN=CL908_18525 PE=3 SV=1
-----SEVAERLRSSLEIIAEceATFIRRVYEDLFEQHPKTAELFGG--HS------------RAvRGEMVREVLMYAIEHNEGaswVEENLASLGDQHE-VNGVTLEMYGWFVDSLLRIFAEVSGPDWCAELEGSWRTALELVSDLMSSPE-
>SRR3954454_17009507 
--PFDPATVAVVRASVTKLpsEPIELTREFYRQLFEIAPQARVLFAE--DMT------------DQTERLLSAILAGVRAMDRpelVEDHLRRWGVVHRRMHGVTNDLYVYVGHALIRALHRIFGH-LETSVSSAWIAVYEWMAAVMIDG--
>ERR1719446_1443192
-----------------------------------------------------------------------------LAQDlsaLCPE---CGFK------VG--TMGVC---QTK------ANDAAIE-----------AKDPPVAT--
>SRR6187402_970848 
--GITTADTLLVQTSWNTVSefSTKIIAGFYKHLFASEPEVRPLFKS--NQS------------VQEKRMALMINTIVNSadsLDEFRGSIAQLAKSHV-HMGVKNEYFPIVVKAIISSVEEQYGKGFTSAHKKAWYKILNQISAIMMEE--
>SRR5215510_10546783 
-----------------CLDrcRLFVVFYLIACiivlffFFQAEDGIRDGHVtgvqT--CA------------LPIWARLLGAIVTAVQTIEDperFDGYLRALGRDHR-KFHVEPAHFGVVGAALLDALREFSGTQWSHAFEQAWRDAYGMMARKMLA---
>ERR1719150_2276450
-MGLTKAQVAAIQNNWATVSqnMQDVGDALFMRYLTANPGDLSFFPKFQGAGVGPQLHSNEDFQHQTLTVMQFLGQIVAHLGDIPaaeGMLRERVKTHH-PRGISMAQFERLLDLVPRLVQEICGA--SGPTADAWRVAVATLMPSMRDEF-
>tr|A0A1K0GS94|A0A1K0GS94_9ACTN Globin OS=Couchioplanes caeruleus subsp. caeruleus OX=56427 GN=BG844_22340 PE=4 SV=1
--GMNPaddaelhAVQRLLISSLEQAGgQVEVATRLRAALAQAGPALFARIP--GGP------------LAQVEQLAEGLAWLAQHTDqPpaLVAGFGRLGAVLA-ECGIAPQQLQLAGAALAEAMRAgMAANGWRQDYDQAwrstWQHAYQWIAHGMVAA--
>ERR1719193_2756600
----------------------------FM--EKKVPSVIV------FLN-SLSLDDDGALETHALSVMNSVNKVVSRLDQpdrLVQLLHDLGRKHI-SYKANMAFLEPIAKHFILTIKPSVA-EWSPEIEDAWQQAFKVIGHIMQE---
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold5203666_1 # 3 # 269 # -1 # ID=5203666_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.315
-----------------------------------------------------------DFESQGRALTRMLAWIIQNMSNvsqLVPVLAQMGGRHE-IYGVKDADFGTFATTVANSFRSVLGPEIiDDDAHQAWESCISGIGGLMQL---
>SRR5215203_6923026 
--PGDSGADRAGRAD---AerDQAGLRRGRG-RLLPPAVRRRPLRggavhhrA--GH----------PTgEADRGAGCGDALDQAPRRVPAPgrh-ARPAAPGLRG-----------------PPAALRHRAG---------------------------
>SRR5215208_6178010 
----GRGRPRPDTAIIRRGVagQPTIRHLFYDRLFEHDPETRLLFR--SDLD------------RQRLRLLTMITAMVGPASDdls---------ATNA-GhAGVPPWRWLSLA-----NARDVADP--------------------------
>tr|A0A074ZZ62|A0A074ZZ62_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_01589 PE=3 SV=1
-----------MFDELPPATdhLSKK--ITSGRA---LGMICSNAN-VHTLS-NEEIAADTRSKQHILAFMDVLSKAIGALDGgredFCEKLMVLGARHAAIPGMKLEYFKVFKQAILMTWEALMYEEFTEDVRRAWAHLMDYIIGILSEG--
>tr|A0A2A2WQA6|A0A2A2WQA6_9ACTN Oxidoreductase OS=Dietzia natronolimnaea OX=161920 GN=CEY15_08520 PE=4 SV=1
-----STATPPLLALRDLVTDPRFTDLFARALREADPDFRELFPR--DA------------SGVLGEFVRAMSWALETVEnargdeaevaQVVEFARHLGADHR-KLELSTRHHQRFGEALTSTLRHLAGPGWDDRLSTTLGTVYRVLTTALRE---
>tr|A0A2W5I8T1|A0A2W5I8T1_9ACTN Uncharacterized protein OS=Lawsonella clevelandensis OX=1528099 GN=DI579_06450 PE=4 SV=1
-----PTYYTVLGPAITLLRehPEDFMRHFLAAALTYDFHFHTFFPS--VN------------DHHASRYTHALRYILEALDqstndpdcldDVIDFLSQLGCDQR-KYQLTAEQYQSLAAALRDTFALLLPYQWSTELNDALLTSFEHAINVMQS---
>tr|A0A2N6TBK5|A0A2N6TBK5_9CORY NAD(P)H-flavin reductase OS=Corynebacterium kroppenstedtii OX=161879 GN=CJ202_05310 PE=4 SV=1
-----GVHEASLVPVVTVLQtdGSRFVDAVFTHLFARRPSFIRRLPA--DL------------SQLKPSFRRALVHVYAKQAtgngldrRTRRFLRHLAEDHR-SFGVEAPDYVAMGDAIIDAGREIIAPQVTSEEFELFAMATGQIIGLMEE---
>tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 OX=1581089 GN=HMPREF3121_11375 PE=4 SV=1
------------MRAAAAFGrqAPTIGPEAFRRLLDAEPRFRHMFGG--SK------------TALRDQFMSALSTALVTRAdvgrfpaATIRRLEQLARENR-KFGVAPRDYATLAEHLLDVFGERLPAgpdsgAQVDALREILDEAMSLI-AAAAV---
>tr|A0A1Z5KPX1|A0A1Z5KPX1_FISSO Uncharacterized protein OS=Fistulifera solaris GN=FisN_16Lh317 PE=3 SV=1
--VASPACVMKVINRWETARqrngfDEQLDIDTLLALFKMDPQVKPIYG-FAVEKEVkAQGMQRMGVLIYGLQVVKMFDVILSALGPdeelFYDVVTEMGEQHC-KHGLTPDHFTLLCGAVMGVLETIMDTEWTKDVRAAWSQVIECVNAEIVK---
>tr|V5YLS5|V5YLS5_9DIPT Globin OS=Polypedilum nubifer GN=PnHb25 PE=2 SV=1
-PTFTDAQVATIKGDWNNIK--GQGVEILYHFLNKFPGNYPMFKQFGGKD-LNAAKGTPEFSAQATAIINLLNGVMDKLGSdnagAQAILANLGKTHK-AKGITKEQFQQFREATTELLGNLG---L-GGNLGAWNALFDFVLNVVFTA--
>AP82_1055514.scaffolds.fasta_scaffold183032_1 # 1 # 312 # -1 # ID=183032_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.529
-HPITSEEAETLRTLWSQVK--HREADILYVIFKENPDIQAHFPAFVGKD-LEALRKSLAFAIHSTRIVSFFSKIATLAGDpsnlpaSKTLMNELGSSHK-SRGIQKEFFNKFRASLDGFMQRQS--SWNDNAAVVWNKASDNFYFVLFAS--
>SRR4051812_13904716 
-AGMSPEEVALLRHSLDEMRadGPQAAEAFYAELFRLDPSARELFHL--PV------------EQQSVVFFHELDallSAVSDLPAFVERSRRLGRMHA-GRGVRPEHFEAAAAALDAMLLAVYADGASPELRRAWRHAYRMAAQLMQEA--
>ERR1711860_53158
------IYFSDIKSTWDIVKdeIDQIGMLAFLHLFEAHPEAKTKFKMFEDIPT-DDLKTNEIFQNHAHRVVSVIRKVVGKLDEPsvyLNYLKILGGKHI-MFDADVKYIKQMGYMFLSAIQPTLEKevGITLKYV--FKKTFX-----------
>SRR6266536_6175029 
--LMTPEQITLVQSSFERLGpqLPAMATRFYQELFTRDPALRPLFTT--PLP------------QQEVRFAEALTEIVRAMprlDELLTHTRAPRRPArrlR-GTGCRLPDPRRRPprrargrpgRQVRRPHTRGMGPRLQPcrrdharrrsrgPAHQQLTTTAAPTASQADGG--
>UPI00012780C8 status=active
-MSLTNETKEIIKATVPIIEknEAELTKKIYPLLFTRNPSMKIFFNR-DH----------LRKGTQPRAFIGSIIEYAKNIDNldaIKPLINDIAEKHA-ALNIKPVQYSIVNICLLEVFGKALGTRGTHVVKRAWKDAIEDLANIIIK---
>ERR1017187_3590871 
-----QVDCAILKQSFAHIEsvAEKAVGYFYARLFVANPELRSMFPL--------------AMDATRKHFLAALAHIVWSMDDpqeLADYLPGAHRHSA-H---VQRRYVDLPGAVrLGGgdrSHRHSHDPGGagRRGRASLVAGX------------
>ERR1035438_6477963 
-----------------------------------------------------------------------GARGSPRPAEpaaLSK--------------------QMIDRPLRAAgaaPSMHNTPPWRfgVRPDRLTIELRADIATVMTQA--
>SRR3546814_3749254 
------------------------CLFFFFCFFFSSIRRHTRCA----LVT-------GVQTCALPILFNAIAAYASNIENlpaLLPAVEKIAQKHT-SFQIKPEQYNIVGTHLLATLDEMFSP--GQGVLDAWGKAYRSEERRV-G---
>SRR6266704_3508957 
---------TITRAEFCAGRsnrgsKQAFACECYATLIRLHPEVKPLFTH-TSM------------EKQAKKFMASLTLVLHVLGKpdvLTTTLQRLGRRHQ-TMGVRVEHYPMVAEALLATLKSGYAVVLLT----LFVQSYMFL---VRKGA-
>SRR5215207_7267255 
------QAV-----------agEPEVRGSILRKAVRIGPDRANLVQ--GGP------------RGSEDEAaQHACDDRWSRLSTrdLRLGCRGFGTTSR-TVRCDAGSVFGGRRSL---nleLGRGARTRADPVQARSVERFLQGGSALHVEG--
>SRR5215470_13616785 
----------------------------------------CMVTL--CH------------CSFTqtcscGTRRRGICSRFRWLPSatgWCMRWAGSCPTSR-TSTPSAGTcRTWGASTASSAPSPSTTPTWTPELAADWKAAYDLVAQVMIG---
>SRR4249920_1577195 
-----------------------------------------VWPC--TA------------TRCRCSSTRTC-----scgtrrRETCsr-SRWPYSATGSCT-RWP-GSCPTSTTWTTSASTCRTWaaSIASSAPAPAADWKAAYELVAQVMVG---
>SRR5258708_22654124 
------TLARLLKESWSLVEdrADHLANHFYARLFLIDPNLRDMFPV--QM------------AVQRSRLLGALVEPVQTVPNpsqVVPCFLSLALAQP-TIRLLPGQFEAGRSAPIDP---------------------------------
>SRR6266511_448526 
--------RRRRRRAATSSGraSHRLRDsRLEARARDRSRRVLDDASS--WV------------EVVRLGDAGEPVVLVSAVAAiahRDVRRVELAREGE-RVRL-------QVLNVDAEEDDLAGEHWSVEYDQAWRDAYDRIARVMIM---
>SRR5579862_1310240 
--LMDPLRIRMVQDSLVKLTprEGSIVDLFAAELSGSPHDESETGG--DNIA------------YQrERSVLGIMAAAAPFLHAPeciLDEVVAEI---G-AGRIHPADYDHAANAFLRALKKNLGAEFTADLWEAWLEALWTLCNLLSRT--
>tr|Q5DGY4|Q5DGY4_SCHJA SJCHGC09035 protein OS=Schistosoma japonicum OX=6182 PE=2 SV=1
-LSINDEQLLLLQSSWSIVkqHIEKIGVITFLGIFEQHSDFRDAFTEFRKRK-FVDVKHDPAMQVHGLRVLSIVDKMITRLPKtddIELKLMTIGSKHC-RYVPTIGLISSVSDQLWGAIEPVLkeEGSWSDELAVTWKTVLDYLTKTVR----
>GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6550916_1 # 2 # 442 # 1 # ID=6550916_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510
--------LELIQQTWEKVKphGKEWGPKFYNNMWTKYPEVRTKFFP--E-SKP---------EIQGPRLYASLNFMIKNASDietLKQYCFNMGDRHK-KYHCGAEHFQVVGDAFIMTLTEFLGEDFTPELKQQFQLLYDTVAEMTI----
>ERR1719360_423992
-EPLTQAQKEIIFTSWDAItHKENLGVTIMYRIFTGHQEIKHLWKFADDLKTEEEIRGSKTTQFHAKKVINGVNSAIKAVEAgkeVESlGLDKLGARHF-KYGAKPADFRHFVESLFWAIKTIVPE-VSAEMAAAWTNFVMQIIKQMTN---
>tr|A0A194RIW1|A0A194RIW1_PAPMA Neuroglobin OS=Papilio machaon GN=RR48_08766 PE=3 SV=1
-SPLSAKQQYCMLASWKGIFrqIEKTGIILFVKLFQENEELLHLFEDFRHLQTVEAQVSSTELAEHATKVMHTLDEGIKGLGDMDsffAYVQHVGSTHTQVPGFVADNFMKIEKPFLDAAKTTLGDRYTPNIENIYKITIRFILENLVKGFE
>ERR1719153_450463
-MPLSEGTISILKACHPIPvaNREDIGSSFYTLLFQQHPETQNLFPL-SHVSASKGGKPGPQMRS----HPTMPYLIF-HTkqlF------------------------TIIYNTKIQSX--------------------------------
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold8273257_2 # 299 # 427 # 1 # ID=8273257_2;partial=01;start_type=ATG;rbs_motif=TAAA;rbs_spacer=15bp;gc_cont=0.364
---------NELQTNIEDVYsaGDV-C-----ALFDSSaNRYRPtrtwlscafqgEVAAL-NM------------LGQDKVynegvFFNASHayrSMYAVLGNFNPAQAD-GFEFF-VCNQDKENYE----RMVLKDNKIAGAMFVGSMKNVWSVKQLIEGQVDV----
>ERR1719244_2234371
-VVLEDAEVEGVQTLWAEVSgdLGNFGARVFGRLVHDHPTIRKYFPWGRNDKTEEQLVAAPDTQAHAEEVFGALGKIIGaagHLNDYRSFLVYKGMQHI-PRGVKPEHFDYLKDALVDTLKEELGDKVTPAGEEGLNKVYSFVEKAMSKGL-
>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3481696_1 # 1 # 387 # -1 # ID=3481696_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.584
--VLTSNDIALIRESWAYAkDIPAIQTETLLEHFRIQPRTQALFPKFADVP-LNKLPTNDAFIKQARSCVSFGLNFIVANLDNPSLLkDMLGRVdTyG-KWYVDF--MtkeRQMQTTVdifIQVLSKELGGRLSAAAKAAWTRAMTLVFVEMMS---
>ERR1712198_397898
-QGLTEEEITEIQSTWKSIIsdkTSEHGVNILIRFFKNYPEYKaQYFQNLNTLS-EDELRESPKLRSHGAGFVLAITQIISDLDNmliVEEVAKKIARNHY-NKGIREPlNYKLMTNTIIDYIKDIGN--LADGTMQNFRKMFDIFIISVRKKY-
>SRR3954447_20457037 
------------------------------------------------------------------HKVKVEDIIVRGGGNLMVEL--MNTDAA-GS-----PLDTPVRAVTDG------TESTAAAREPI--------RLNPG---
>SRR4030088_1427564 
--------------------------------------RRGRDGG-QP-------------R-RRELRRDGQepdepDASRRGdrgRPCAGPASR-----------------R--RGSAAGCRSSPPSPAWPALSYEQWRETCDTLHGhTQVLG--
>ERR1700752_5389668 
----------------------------------VVPQVPAARSR-VPL------------R-AASFRRGGLehdpdPKGRVSakqEPV-FGK-------------------D--HGQTIRLSARGQSS---PrRNDAARETTCKEARMtPEQVK--
>SRR6478735_7013605 
--IMTPEAIRAIKTSYAAVatQPRQLASRFYSELFTAAPNLRPIFP--ADLT------------LLQGHFEAAIAMVVRNLDEmtaLREPLRDLGAQHV-HWGARPEDYVTAREALIGAVRGTT-RHDRRSAGRCVSRPTRSARpIGSRR---
>SRR5262249_59625092 
--SRHRDAAVLVRTFTCAPpaPPGRRASRLYEGPFPADPDLRPRFP--ADLT------------LLQNHFEAALALVIRNLDDmnaLREPLRDLGAQHV-HWGARPEDYVTAREALVKAIGALS-ASWTATLEQYWRSAVTSIIvT-MLX---
>tr|A0A0P5LQ45|A0A0P5LQ45_9CRUS Di-domain hemoglobin OS=Daphnia magna OX=35525 PE=3 SV=1
--LLTANDRRIIRKTRDQAKkDGDVTPPILFRFIKAPPEYQKIFKPFADVP-QAELLGNENFLAQAYTLLAGLHVVIQTLFSqelMANQLNALGGAHQ-PRGATPVMFEQFGGILEEVLSEELGSGFTAEARQAWKNGIAALVAGIA----
>tr|A0A0P5UVQ8|A0A0P5UVQ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna OX=35525 PE=4 SV=1
--LLTANDRRIIRKTWEPRpRrTEDVPPQDPLPFHQGPPRVPEdVQVLRLCSP-SRACEQRKLLGPRPNTILAGLNVVIQSLSThgaYCQPNQRSRSANK-PRGVPPIMFEQFGNVAEEVLAEALGSSFNAEARQAWKNGMRALVTGIT----
>SRR3954451_10251525 
--------TSARRqqWTFPRCGptspRPQRPGTRARCTSTPTCSCAIPRPA--RC------------SRSRWRT-SGTGSSPPSATWlpgsttstrSCPSCSSSGGTTG-SSGPSrRTTRPSVPacWPRSSTSTTS-GARNSPRAGRrptTASRAPDVLATVMIE---
>ERR671928_16913 
-----------------------------------------------------------------ALYFDGIDTGR-----lrVHQTKLLVQVTGG-PVEYDGRELAVAHGGLDITLEHFD-PGWTPELARDWTQAYQLVAKVMID---
>SRR3712207_8140349 
-------------------------------XMIRRPPRSTLFPYTtlFRS------------AHQRDRLFQALGDVVNYVDDldrLVPILQALGRDHR-KFGTVAEQDRKStrLNSSHANI------SYAVfCLKKKKKDSHPSSTTX------
>ETNmetMinimDraft_30_1059905.scaffolds.fasta_scaffold1335019_1 # 137 # 232 # 1 # ID=1335019_1;partial=01;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.573
--PITPEEKDGAMRVWKMILnnrsehflalkrenKekdvqdaencmDYFMHNFYIRLFDIHPNSKQLFHR--SI------------HKQGSFFLRFLSMCVAEVSEpekLDKTMENLANIHN-KLGVKAVEYGIAGEALFHTIHKCVGPEFNHEAAVGWTKVYSVFLKYLI----
>sp|P15447|GLB4_GLYDI Globin, monomeric component M-IV OS=Glycera dibranchiata PE=1 SV=2
-MGLSAAQRQVVASTWKDIAgsdnGAGVGKECFTKFLSAHHDIAAVFG-FSGA-------SDPGVADLGAKVLAQIGVAVSHLGDegkMVAEMKAVGVRHK-GYGykhIKAEYFEPLGASLLSAMEHRIGGKMTAAAKDAWAAAYADISGALISGL-
>SRR5256885_11466498 
--------------------------------------------------------------------------------------------XM-LLFF---------FSSRRRHTRLQGDWsSDVCSSDLWGAAYQQLADILIG---
>tr|M3IRU3|M3IRU3_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) GN=G210_0056 PE=3 SV=1
-QELTPDQLRLITECIPIMEdlNLTLGSKFYRRTTRRHPHLQSYFNE-TH----------HKLLRQPRAFIFTLIMFAKNIHDltpLRDVIRRIVSKHV-GLQVKPDHYPLLGDVLIETLCDMFPYHmVDDKFKTTWSIVYANLASLLIG---
>tr|Q86G74|Q86G74_PHAPT Hemoglobin II OS=Phacoides pectinatus OX=244486 PE=2 SV=1
MTTLTNPQKAAIRSSWSKFmdNGVSNGQGFYMDLFKAHPETLTPFKSlFGGLT-LAQLQDNPKMKAQSLVFCNGMSSFVDHLDDndmLVVLIQKMAKLHN-NRGIRASDLRTAYDILIHYMEDHNH--MVGGAKDAWEVFVGFICKTLGD---
>sp|P41260|GLB1_PHAPT Hemoglobin-1 OS=Phacoides pectinatus OX=244486 PE=1 SV=4
-MSLSAAQKDNVKSSWAKAsaAWGTAGPEFFMALFDAHDDVFAKFSGlFKGAA-KGTVKNTPEMAAQAQSFKGLVSNWVDNLDNagaLEGQCKTFAANHK-ARGISAGQLEAAFKVLAGFMKS------YGGDEGAWTAVAGALMGMIRP---
>tr|R1EGH0|R1EGH0_EMIHU Putative nitric oxide dioxygenase OS=Emiliania huxleyi OX=2903 GN=EMIHUDRAFT_435200 PE=3 SV=1
-SGMSAETIATVDATAGAVApfALDITKDFYGDMIASLPSvVLTVFNP----AHNVPI-----STHQPEALAASVCAYATNIKDlspLlvpGGAVDAINHRHC-ALNIQPAHYLPVHDHLMGSIAhvlgPKLGDALTPEVAGAWSEAVRFLAKVCIDK--
>ERR1711974_215400
----------------AKVseNIDINGGILFQKLLTDNPELKELFW-RANKGQQgDQWRNDKNCQKHGKSVILEIGRCLSAVDDaeeFSSLLYKNGVAHK-SRKTTEEHFPLVGEAVIYMLAEALGEELNDECKAAWLGAYGVITEHMLRGL-
>AP12_2_1047962.scaffolds.fasta_scaffold738771_1 # 1 # 321 # 1 # ID=738771_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.648
----------------------------------------------------------------------------------------------------MSFITVPGVAArsSFVwlrestaalrgpalvaliyflgaeaafyigtlsdrifalfwpPNVvLFCALLIVPQRRWWLYIAAAFP--------
>SRR5260370_506041 
-----------------VRD-YSSTCSF--------FFFLQAEDG--IR------------DSS--VTGVQ---TCALPIYqerTEQVLSRLAVDHR-KFGVRDKHYEPFFDAVFATAEHAAGPAWTREMATAWRSALDWFGSVMA----
>SRR5580658_2929351 
-----APLRAIV-EEVLRSGgg----------------------------------------------------------------------NVAA-GTGVRRNASLFHGAREPPGFYD--MpGLRELSSSYPWFQV---VP-VIS----
>SRR5258708_13478776 
-----APLKAII-QGILRA--------------------------------------------------------------------------G-GPLLRRETRPLVGAPRGQKALL--PpHPPGSGSVASRPKG---IS-L------
>SRR6266704_2687724 
------IARPPDR-RPRCGD-GVLLR-P--------AVHRQSRPA-------------------RAVSLRDDANPRGGLPDadrAGQEP--GRRACD-RAGPRPDRQGPpqirrepeALPAVLR-RAVRDGRAFRRPGPDRRDGRGLA----------
>SRR6266536_777504 
-----DGYREALDASFARVAssGEKAVAYFYGRLFAATPRLRGLFPA--AM------------DYQRDRLLCALLQITQRLSNraaLSEYLVQLGRDHR-PPGVPPAV--PGGAACEHPNPTLA-pGVAPllsgvraagqrvarVPHPRRPRRLGQHVPGAVH----
>ERR1719498_564827
-RRWTERKRLVIQSSWAALLsahgndRMATGSKIFRKLFTGDTAVLRLFP-FRHQ--ARTLFVSAPFKLHAKLFVDTMTELIANLHDLEkveRDVRELGKRHL-TYGVQPAHFDAMGEALIAVLDESCHhpSdevTLDKEERDAWLGFWGFIAKETQR---
>SRR3569832_1708069 
--------------------EEVAGVVLFQRLFEKCPQTKVLFG-FPiDIDpSSKELVTSKRFLMHASYLIQMLDTALNMLGPdqelLTDIMLELGTIQS-AFCVASVCVIC------KELETHLC--f-------------LRLLCQAX----
>SRR6478736_5796684 
------------------------------FMMGV---IASGMVV-TG----------AERRGRPKAVQPGNREWITVIQAinaEGQA-----------------------------IP-PFIIGAGQYHLANWYRDSNLPGNWAIA---
>tr|T0QF73|T0QF73_9STRA Uncharacterized protein OS=Saprolegnia diclina VS20 GN=SDRG_06019 PE=3 SV=1
---ISKDVQALVLANWAAISsgstPAllKIKpaspvvyfyDYFYGMIFEKAPAVKPLFRS--SI------------IVQGKALINIIQSITSavNAPNVIEKVCDLAYRHN-KYGVKIEYFNLLGKCLLLAMHDCTGDTFTDELREAWRAAYAYMVMVMTP---
>ERR1719210_139600
--------------------------------FTLL-----DPPGQKrnvaqawsAVVqADVAILVVSANPGEFEAGLAK-------------------------GGQTREHAVLAKSAGVENLVVAVNKMDSVDGEGKWSNLryee------I------
>SRR5256886_2416282 
-------DREADADREADADrdGDAEPEPLTAPALSSPPAV-PLAPP--RD------------EAARQHdEPEPAPPPDQVPGAadpretagppeppeeppp-------DGKGEP-AAG-----PDPAIAAGQEALRAFARE--afTSAAEEAWTQVYLAGSSLMIK---
>SRR5581483_8202477 
-----------PDDPVFDGMqgnvGRvaarylphrEGEAYVAGPVGMVRETIRALTRA--GL------------PRERIHYDDALLAEDKQASAqgvagatahtsrtpessrpgRTGEAGNAGPDGH-IrrvaesdqAGPAGGTAEPGQSGLRDAAADIAPQ--------ADTAHQDGGPHDDQagA---
>ERR671911_2215695 
----------------ELEPacapDKQLVEHVQRlRVEAGAQVVGR------EEERRSRAgqCPRPTSRVDVRGTHDD--------APlecVAEVLVDCGAHAR-VACKVDergraaleLLDRVVPDDLVVDLHAVDEVDGGGQTgHVGPGTSSRRVstarakpQAGTLPQ--
>SRR3954453_16132976 
-------NLQALEESFDAVAphGDELMDEFYGRLFEAAPAVKPLFAH-TDL------------KRQKAMLLAALVLVRKWRPAraLSGHRR--GAHRL-HGCRRGARVDGRVRGRL------GRGAWRGRRRDDRGR--------------
>SRR4051794_7197155 
------------------------------PHAAAAPVLPARLAG-RPRPAGAGPISPPARRVGRRVRPLDRVPPPARRDVaraARERLRGRGAARA-AGAGGSDLAPPVRHARVGAAVAVRGDLGGAAGIAAESAPSVLPWTTTRSK--
>SRR6188474_1917881 
-----------------------------------------------------------------------------------------------------------LNFVFEkiktKKLIPMTQKQIELVKSTWSTV-----AAMDH---
>ERR1711894_485352
----------------ILLYnYrfLTYVIYYYYRFLAEDPTVASVFSRV-NVD----DQQSGEWHAHMLRIMGGVDILINMMDDvnvLTEEVKHLRAQHVVREGVTHERMKAFLIIMMDELPKVMT-HFNH---DAWKSCLSKKLKRIGG---
>tr|A0A0S2MLM2|A0A0S2MLM2_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1
----SEGDADIVIKQWASVMnAavsgenrVVIGRQIFNSLFLKQPAAPALFPY---GS----DLDGAEFGAQMSRVLSGLSNAINSLTDddlNVSIMDHLNKQHVVRDGVTAAAMKDMQVSIEDTLKQLVT-DYND---DAWHDCLGVAIERISV---
>ERR1712217_222699
--------------------IDNIGEVFSQKLFALSPRRHARA----GM--------------EWGPVVKGIGHAVDNLTNLDavaVKYKRLGVLHR-CIGVKEHEMREMGEAFILSLRDVLGKSFGHQAEAGWRAVYCFVAHAMMA---
>DEB0MinimDraft_6_1074348.scaffolds.fasta_scaffold06817_4 # 3572 # 3886 # -1 # ID=6817_4;partial=00;start_type=ATG;rbs_motif=TAA;rbs_spacer=12bp;gc_cont=0.311
------LQRVRITRQWRKAYgtgshRLDFGLKVFKHLFEAHPTARALFADHHSD----N-VYSPEFEAFSERILNEFDIVIALLDDpaaLSAQINHLKAKIT-KRHVTTEQLTVFGKNTLEVIPEYVGNHFD---HSAWTDCLKRLRSALTV---
>ERR550532_3441629
-----YRQVFQLKNSWKTVSrnLDDTAKENLLKFFRDHPEHKALHKKLTKYEDEASLRESQAFEDAALAVFNTFDEAMDMIekDKVdyaITTLHMAGKSHSAIEGFQPAYFKDMEESFLYAVKLTLGDRFTEATEQNFRRLFEFTTQQMIEGM-
>sp|P02210|GLB_APLLI Globin OS=Aplysia limacina PE=1 SV=4
-MSLSAAEADLAGKSWAPVfaNKDANGDAFLVALFEKFPDSANFFADFKGKS-VADIKASPKLRDVSSRIFTRLNEFVNNAADagkMSAMLSQFAKEHV-GFGVGSAQFENVRSMFPGFVASVAAP--PAGADAAWTKLFGLIIDALKA---
>sp|P09965|GLB_DOLAU Globin OS=Dolabella auricularia PE=1 SV=1
--ALSAAEAEVVAKSWGPVfaNKDANGDNFLIALFEAYPDSPNFFADFKGKS-IADIRASPKLRNVSSRIVSRLNEFVSSAADagkMAAMLDQFSKEHA-GFGVGSQQFQNVSAMFPGFVASIAAP--PAGADAAWGKLFGLIIDAMKK---
>sp|P21660|GLBP3_GLYDI Globin, polymeric component P3 OS=Glycera dibranchiata PE=1 SV=1
-MHLTADQVAALKASWPEVSagdgGAQLGLEMFTRYFDENPQMMFVFGY-SG--RTSALKHNSKLQNHGKIIVHQIGQAVSELDDgskFEATLHKLGQEHKGFGDIKGEYFPALGDALLEAMNSKVHG----LDRTLWAAGYRVISDALIAG--
>SRR5690625_2040278 
--------------------RDGFGARFTEELLSRYTEIREALPD--EPA------------WVARAVTAVTDALIDVADDpgaLVTVLERLGVDNR-TVGVHSAHYAPIGHALILAARAVGGTAWTPDIERAWVDGFDVAAEVMVT---
>ERR1711963_100213
-TSLSEGTVEVLKACHPLLKdvRRVIGKAFYNRLFKEYPQVKPLFSQ--SD---------AARTHQTLALADALIAFTGRQLLegF-EAKQRGQ-ERS-LRLRSLQAGSWQGLWRLPSRDRGERD---QNEGSQIKPQILTIQ---QD---
>tr|A0A0G4EPR9|A0A0G4EPR9_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_12573 PE=3 SV=1
---MSDKERgVLIDKTWGLLkeryTLQEIGEELYDNVFKNAPDLRHLFKR-PKELMA---------LKFGEMISTIC-GLFQtDRESLLETMRDLGIRHV-DYGSRPEYFPLFKACLLDTLENLLEDGeFTAATEASWNDMWDEASEMLIS---
>tr|A0A0Q5LAI2|A0A0Q5LAI2_9MICO Uncharacterized protein OS=Frigoribacterium sp. Leaf164 OX=1736282 GN=ASF82_14980 PE=4 SV=1
--VITSSHLTALRSTLPLVeaRAAAIADDFYARLFADRPDLLrDQFNR-GD----------QAQGRQQRELALTIVTVARDVVgtqvgsgpagsatgpavpvapwsspapspwavrvAARETLSRLAQRHA-AIGVTRDEHDVFERHLRDAFAAALGDDWSGVVVDAWLALWRQTRDELVA---
>tr|A0A1Y1I4E0|A0A1Y1I4E0_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_002310190 PE=3 SV=1 
-VQLSPFEQQLVQKTWKLLQprLADLGQAVFTHLFQKAPKTRPLYTCPLRLADGDrRTPDGHAIPTHAVEIVSTIGLAACRIGSssrILAVLERLGQRHV-AYGAAPDMFSVFKEAFLVALKKTLGGeHFTAQVHKAWSKALDSVVAHLKKG--
>ERR1719296_130621
----SVQTNSDVQKSWEKIQeigILRAGEILYKNIFELAPSARETIPPevlekyrissFLvslNEDeLDDAFIENAIWSDRAANIFNVVGHVVRGQHDfgrLVPMLQELGSRHV-GDGMPEAILKVVVPAFKFALHELLGSMLTEDLEHVWMVGLELVNSHMIQGMR
>ERR1740115_393061
-NLLTPETVRVVKETSPRIAsmAPALSSSFFKRFLS-HPDLAAYKASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAWDEAIMVLADICVD---
>ERR1740130_2673129
------------------------------------------KASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAENHRLTINLFL-LE---
>tr|A0A0K2UHU6|A0A0K2UHU6_LEPSM Uncharacterized protein OS=Lepeophtheirus salmonis PE=3 SV=1
--YLSKKQKDLLKRAWVALhnNLSSVGMTTFIKMFETHPEALKFMiPKLTqeeekktqpnySLDSRLDPWHSEKLREHAHRIMKTVSDVISLLNKdeekIEEMLVALGGKHH-GFGVHIEILELMGPHFISAIYPTLKETWTEELQEAWQCLFNYIIALLHIGF-
>tr|A0A0B6ZHC3|A0A0B6ZHC3_9EUPU Uncharacterized protein (Fragment) OS=Arion vulgaris OX=1028688 GN=ORF61548 PE=3 SV=1 
-TGLSARDRKLIKDTADIIfgqlKLQNKGVVFLIAFFKAYPHHQRYFKMFRGIP-PDELKSIPHTENHGRRVMSNVALLVQHIEEpnvIKEQLVDLLIKHN-PRSVKPRQMKDMLNMFVDFTSQQLGAKFTSQHETAWRKLTTHILSVLEE---
>tr|A0A2H2IJL2|A0A2H2IJL2_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1
-------------------------------------------------------MNAVELRRHASVYLKGLGKIIESMRNeeeLGKSMSRIAQAHI-KWNVQRNHVIVSMGKTEIRQRATNSYALKS----------------------
>ERR1719270_1027131
-MSLSTETCNILKICKPLLenNRENIGLTFYKKLFDENPGLKNVFN----MGHQR--GVdd-DKPGRQQFALGQALVAYCLHCESldkLASFVERVANKHV-SFDVQPEQYPVVGGILLATLEEVLGKEtFNEDVKKAVADAYFFLADVFIS---
>ERR1719318_1430785
----------------------------------------------------M--N-----NAQGNSLANAVVAYCANCDQleaLGPTVAKYTVPTC-KYIFHIS-------S-------TRPLKmFLPI---SX----------------
>ERR1712088_143820
-------------------------------------------------------N-----NAQGNSLANAVVAYCANCDQlelLGPTVAKISSRHV-SLEVTPEQYNVVGGAARQRSlqrssQRCRGRGlLFPG---RHLQGERGKNDRRSQ---
>tr|F6WSS9|F6WSS9_CIOIN uncharacterized protein LOC100181975 OS=Ciona intestinalis OX=7719 GN=LOC100181975 PE=3 SV=2
-MPLTEIEIEGVQESWEKVSsggPKTTGLILMEKLFNTYPASIAVFSHLGIPSKPdgaitvSDLASIGGVSNHAVSLASRIGKLVGLLNNeteLKESSTEVGRIHV-KYGVTSEHVDLLGSVLLSVISENQGLSNTSELIGWWSKTWNIIGNYVK----
>SRR6185503_2239525 
---MDSGHKALIRASFGRALtVADLAVELFsGRLYLLDPALWTLLDLGS--------------RRRQQELVQVLAWAIEHLDRfelLASTLEALARRCV-GNGVREAHFERIAGVLLWTLHQVLGDTYTAGTAAAWRSTSGLIVERMKQ---
>ERR1740129_283753
--PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--TPPEQGCLPKLYEWMSRTMTS---
>ERR1740123_30535
--PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--GGLLARSGAeSVFPPGARA-GD---
>ERR1719193_1971274
--VLTADDIKAIKAIWFPImkNPADLGVALFEKFFLLYPQQKDKFKFMKYD-----DLREKGMRAHGEKVVKKLDEAVLLTlYrsRIKHCFQRIGFSHL-QMGIKEEDMQQLGEAIIATVEDAFVDKLTPEEIGSFKKFIKLFTAEF-----
>ERR1719193_859649
------------------------------------------WRMLKKR-----H------NRDGGKLLH-PLKTILQTcYksRIKNCFQRIGYIHF-RMGVQEEDMEQLGEAIIKTVEAAWGDEFTPEEYAAFRKFMKKFTAAF-----
>tr|I2G907|I2G907_9HEMI Hemoglobin A OS=Anisops deanei GN=HbA PE=2 SV=1
-FSLTDREVEVINQSWNQIKAqeLVVGLQMFKTLFQRYPQYERLFTHLH--QSGKSLYEGDRFQRHVVgNIMSSINKVIETLNssdNAVKTLQDMGVKHK-KLDVHRKHFESFVPFVVDAMVSVRMSMSQDEVASAWTKMMEGVASNLSKG--
>ERR1712157_679996
MKPLSFTTMDCVLSSWEQVRripnyRETVGLAILQKLIHRMPEGREVLHMQRNLIknSPPGIESDKLLLAHARAIVNGLDTVVEllgpLIDDISEILREIGKSQYHDYGDSMALWNpLMRECVLEVIQETLKDDYTHELKVAWTDFLGEVAKDIHSG--
>SRR5438477_4839339 
------------------------------------HGIEP-IPH--RY------------AAIRRVVSGRE-----------AQARRVGQRHH-AAREDQRR-------LRGL----ERRRG-RPPARHVRL---------AA---
>SRR5262245_20667862 
-----------------GRAdpLTLLCEREIARFRG----------------------------------------------------------------I---ELDGIGRA----TALF------DGPARAVRFARAMIARGRAL---
>UPI0003969FE8 status=active
------RPFEAA---------------DRELLFGRAQDIRAVVEQ--LR------------TDPLVLVTGDSGVGKSSLCRagvLPQIREGALNDVR-RWSVAV---LSPGRWLLDTLGDA----LA-----------------------
>OM-RGC.v1.018126893 TARA_122_DCM_0.45-0.8_C18859060_1_gene481717	COG0677	K02474
------SELW-------RGRprKTSLPAgssiRTRTAvlvplgrgketapssssanfvlnLTDVPPEAQELRiTA--EV------------DDQRIHFQRRVPADVD----kvVMELPEGSLARKV-R--VEVAAFD---------------------------RR-CS-IAAFRA---
>SRR3954454_16888348 
-VISRSAVIRHVLPTP----aepaaVDHIGQQVADRTSQQDRGERVLLNRT--------------aHGLR--ALADGAARLRIAAQSvadvtRTPLVGVLRQLRS-ALGDVSHRLCGLSDHAEAllgAIKDVLGDAATDEILAAWGEAYWLLADVliar------
>SRR3954471_17335278 
-VISRSAVIRHVLPTP----aepaaVDQIGQQVADRASDKDGGERVLLNRT--------------aHGLR--ALADGAARLRIAIQSiadvmRTPRVGVLGQLGG-ALGDVPHCLSGLSDDALGccaTCGCYLCR--------SRGGASWSFFCHaalr------
>SRR5215204_1408335 
-ATGGPTRWATMRGRWPLMS-------MLESIAQSG-SGRPVWYVH-GAR---------DrrahaMGDHARALAADEHAGK---------HRAVRQRT-------------------------------AG---------------------
>tr|A0A167F9Q7|A0A167F9Q7_9ASCO Uncharacterized protein OS=Sugiyamaella lignohabitans OX=796027 GN=AWJ20_2623 PE=3 SV=1 
-VVFTPGEISLLRNIWKEISEnnLDhgrglkssqastfFCQQFYENLLGDHPSLQTLFPSL---------------QSQSAAMAWVLGQIIAQLEDVsqaQSVLIKLAKWHSRLMNLEPVHYEYVGSSLLRTLGDRRGDKFTAQEENAWIKLYTFIANVMLK---
>SRR5262249_41403170 
---------QVLKESWARVEgqQEALAAHFYARLFLARPDLRELFPI--------------QMRPQGRRLLVGRARATEPGGAPDgASSRERGRPRR-RYEVSAEHHAVFRECLVAAVRACSGRDWDAEREQAWREGYDVLARRMVA---
>tr|A0A1Z5JNP0|A0A1Z5JNP0_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_8Lh328 PE=3 SV=1
---LSSTSLLKVIACWEQSKsrggfDETIGIELMLTLFEMNPQARSQFG-FRTDQ---VIDKNnglqrMGILIHGQRFIRTLDCLFSLLgpddDNLEEVLRDFNKESC-QDGMPLPQFLLLLGILVKVMAHTLGGDWTDEVQFCWMEVITHLEVIVT----
>tr|A0A150GQ95|A0A150GQ95_GONPE Uncharacterized protein OS=Gonium pectorale GN=GPECTOR_12g483 PE=3 SV=1
--GMSLEEMEQLQGSWAFLSkgafpgevkeqLESFSVDFFMALFEQSPGLINLFP-FKDVNG---KPIIEQLKVHGLKVFQTIGAVIDMCNNysvLLRVTTDLVARHI-KYGVLAAHYDVLFQVLVGILTNVLGSQFSGTLAAGWVKLAGFILRVVKDVY-
>SRR5215203_5896321 
----LVRERRLVREAVAMVdDQDRLIRDFYMIVFAMGGAeVIGMFPT--DMR------------RQRHEFGRALVQWVsaDDPDSIAAHLDQLGGDHR-KFDVQPAHYAVTGEALVAAVRGRCGGRFTAAHEEALRGSYGRLATIMIDG--
>SRR5580698_8666230 
----PDLEKMAARSPWLTVtA-------------------------------------------------------------------SLSAEPV-SLGHGPRTEHgtvADVLARLGTWREHD--------------AYVCGSSAMVAA--
>SRR5919204_299658 
--------------------------------------------------------------------------SDlrSGPTSRCTHVRC-----R-QQRSPPRHHRClRPRSPAPSWSARlsagfrssscrpstnRPARRRGRGRSTILASYTRLASVMLDG--
>SRR5688500_16794215 
------YDARVLRGSFAQLRprIAQYSPVFYEHFWRDYPETRPLFG--RNMSKPE-------LDTRINHFM---LWVTENADRphfTIDYIQSVARRHV-GYRIRRRHFAYVDNTNIKTLRELLGDSFTPEVERHWRASFRFLTLLM-----
>SRR5947199_2475351 
---------------------DELARAVR---lQ--gSRRIMEEHAC-GAE------------GRQLARLFDERGRLARAPRAVDEPGLELGARvsdgrcglakigdvverivqaedvdavRR-AGGDELADEVIVS-------------rtRADDEtseqrepayrigprtqCSDAFRRGLERPAGAPVQT--
>SRR5919197_1330773 
---------------------RATAGGLYGVLprlR--rgrrRVSVRCNHAG-TDL------------KKQKTMLLGTLVLLRKPLrdlDAIVPKLRELGARHV-ADGDEGGDELLEEQEGKGYGED-EGEgdeafdapLIDEX---------------------
>SRR6266516_4891354 
-------------------------------------------------------------------------------GLGDGGRAEGGNRDS-GRGEQLEHLGCVHDVLLSFSESTVSTlphqaarpapaaegagpAITRRetadrapprrhrvggfLRSAGAARARSSIDRMTET--
>SRR6266508_4596506 
-------------SAFVRL-tdARRVARCLPSAH---pGDETPSTFPS--ET------------GDPVNLN-----------LEALETSFDLVAPRG-DG-SEATEDDVVGHPGPPA--QVA-PRPRGDRPQAA----------------
>SRR6185295_10958302 
--------CILLLVA-----CFLTFKLFFYSMFQDYPEYKNLWPKFRHLN-DEALINTGELSNFCSVYMDGWEKVIGELDDnaaLARELKIIAKTHL-RKGVERshimvakkealcqiriheyCYLQNMMPKMLSLLKEKNGT-LDAEVEEAWKTVFIINADIIE----
>AntAceMinimDraft_18_1070375.scaffolds.fasta_scaffold521461_1 # 3 # 443 # -1 # ID=521461_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.569
--------DD-------------DDDDDDdDRMFHDHPEARALFSRVHGDN-----TYSPDFEAHAQRVLGGLDSCISLMDDpdtLASELGHLKAQHA-DHTdVTAEHFDVSICFSsTDVTSTYTsthckimdrpnYTVFQT--RGQrnltksaSRRAHspvRDHPRGS-----
>SRR5476649_891947 
-------------------------------------------------------------ATSTRCCS--ATSRKCCRCSikpTRPTASSsarwptpcWLTQEI-SIawNnWARWHRPSStSMCRCKSsgNTIPWSApRCSRRYVKCWAPRWRPmpsstpgpprtvsWRTCWPV---
>tr|A0A2D8PEV6|A0A2D8PEV6_9RHOB Uncharacterized protein OS=Maritimibacter sp. OX=2003363 GN=CMH11_20945 PE=3 SV=1
---MTSQNAGLIRASLTELFprREEFAERFYERFFEQAPQVRRMFVH--DSE------------KQKLMLYAAIAMTMRGLEServLHSELMAFGSRHA-RLGVREEHFPIFGSAFLETLIHFLPQWDHPDLARAWWGAFTDMSTPIIA---
>SRR5690242_2028058 
-------ELALLLQSYGRIGilIPKISENFYRRLFQLRPNLAALFAN--R--------------DADLKVEEMLRRIVAHASDAaaaKAEVQSSGRSHA-QWPLLPEDYRVAGECLIQAIIEAEGAATGSVVASIWRQAYVEVANLMIC---
>OM-RGC.v1.029911412 TARA_036_DCM_0.22-1.6_scaffold294997_1_gene285712	COG0526	K03671
----------------DRLRarGEPPSGNPYRGAAPYGPGDEALFF--GRR------------AE--------LEVLIDRVQkTpfvLVAGDAGVGKTS------------LCSAGLLPLVREgalGGPRHWACESIACGEEPLAALAAVLARH--
>ERR1719414_683447
MEDLRFETIRCVVQNWERLKynplFEEFAIAFYQRVLRVCPQAKSFFGSSFCLD------DQA---TMTQEFVRLIDRVLDLLGPesqlMVEVLRDLGSRHE-AYGVTVEMYDIMRDAFLLTLEQFEGEKmFTTKVRQAWMTVCSAVADVMMEA--
>ERR1700744_5993147 
---VGLDDRDALGVLRDAFSqdesgsGNELVRRFYNHWVELDVSVRDLFPP--GME------------DQRAAFAQALNWLYservaQRAEEPVAFLAQLGRDHR-KYGVLPSHYETLQRALYATLRSYLSdpsrSAWSDAVDEAAGQSLNLFTGVMSG---
>tr|A0A1E3QTC6|A0A1E3QTC6_9ASCO Uncharacterized protein OS=Babjeviella inositovora NRRL Y-12698 OX=984486 GN=BABINDRAFT_161163 PE=3 SV=1
--NFTPAEIATLKATWSMEAKDTnsgdiadpkntlFGTTsfwehVYSLVGEEHPEVVHLLPP---------------ITHQTQAFSGMVYLCISNLDNlsrLDEYLASLGRRHSRVFNALRLHFEAMGSGVLKSLYNHYGEAFTADISDVWARFYCFLANSLLQ---
>tr|A0A0A9XWX4|A0A0A9XWX4_LYGHE Globin OS=Lygus hesperus OX=30085 GN=GLB_0 PE=3 SV=1
---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNLHSnnrrkNKELFEKLATIHA-KRKVSAQQTPYIKHTLMDILH--L--EPHSAMEKAWINVIDTLF--------
>SRR5687767_4837246 
-----EKQVLLVKHSWSYQAgqLENLGTLFTKKLVALNPGLKAPMKR--SL------------AETGSySLMVAMNQIVAALPDLhkaQNHIQVIVTEYA-ALGITRSDYENALIAFLLALEKRLGKSWSDEIREAWIFIFSSLYH-------
>tr|A0A0S8AZS8|A0A0S8AZS8_9PROT Uncharacterized protein OS=Betaproteobacteria bacterium SG8_39 GN=AMJ64_12515 PE=3 SV=1
--------TGLITESWNALGagQRAFVEAFYQRFFERYPDYRPLFPL--ELN-----------PRHLEKMVQTIALMADQSQDrgrIAPHMHTLGQAHK-AYDLSARDFDNFKRTFVEVLGERLGRQWSAEAEKAWNDAFDAVLVP------
>tr|Q9NG75|Q9NG75_9CRUS Hemoglobin P polymer OS=Parartemia zietziana PE=2 SV=1
-TGITDAEKQLVQESWELLKPDlmGLGQKVFGRIFTKNPEYQTLFTRvgFGDTP-LTQLMANPAYGAHLIKVMRSFDFVIQNLGKpktLLAYLKNVGADHI-ARNVERRHLQAFSESLIPVMQNELKAKLKPEAVAAWRKGLDRIIGVIDQ---
>SRR5579875_723516 
------------RESFARIAprKEEFVASFYQTLLEKYPHLQRMGAGV-------------DVKRQRKSLLATLQVMLNETDRgeeLRTQFRKPGQRHN-ALQIRAEHYPAFGQTLFETLALY-DPQWTGELRVAWAAALEQCVRFMMEDLN
>SRR5579871_3449338 
-VPLSALHRYLVRRTFTHLaiHADEVTALFSQRLVELNPALMIIIV---DEA-----------GTQRYRPLEILARVIALMDRpaaLSIQLKLLQAQQQ-R-SVTPDHLRQMGEALLWVIENRLGDSFTPDISAAWLHFYRFLGE-------
>SRR5215472_5690244 
-----HFDVQVIGAALTRLAdpAVDAAEYFCSHLYSISPDAAALFPS--EL------------AAQRELFADAVIRVQHSLESgsgLAEQLATIGRQSR-KFGVTERHYAAFMLAMEKTARHFDTGG-------------------------
>tr|F2UQX2|F2UQX2_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_10302 PE=3 SV=1
----DDSAMKITQESWAMVEREipNWTDIFYDKMF-SDPNIAKLFP-FS----AGDFKTNEKFQTHTQKVRDTMHTAMTSIrefEKLGPVLKKMGERHA-DYGVIPEHSVNFKEAFLHTLKTGYGDKWNEDLDDAWNQCVDALLE-------
>SRR5699024_1886671 
-KTLDPQTIETVKKTAPIIKdnVEEIGKTFYNILFSRHPELYNIFNQ-SNQ----------------ERGlqqealaygVYLAGINIVNFEPIQSLVTRVAKNNR-ALKVRPNNTLLLERR-------------------------------------
>SRR5271157_2714777 
MPSRIVDRLTALRAFFAEMEpqLPVIVARSYERLFDVEPAIALLFK--GNA------------REHQLRFLAKLQSIVKLTRSsqlwpasaatgqiLIPEVLDFGRSHA-KIGVLPVHFSLLNDMIAWTCKEIAPLRFTPLVEEGLAFVFDVLGASLTAK--
>tr|R7TLW3|R7TLW3_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_227018 PE=3 SV=1 
----------CAEITWAILseNRDGLGTEVFVRMFESYPDLKSAFGPLRHMNKKDAGY-EDVLRAHGIRVLSIVEQVLSKRHnmeEVLSILHDLGRKHL-TFSAKVEYIDIVSQMFLFAIESALKEKWNNSTEKSWGEIIRFVTYVMKET--
>SRR3990170_2029843 
----------------------------SPCTTTRSPCWTRPCAS--W------------AT-----------APTGSWAtstpPsssRLPSCAR--CSRR-RWTCSATG----CSRRSPAPRHYAEDVWVPELEDAWLRAYAAMSTTMIEG--
>tr|O97381|O97381_ARTSA Hemoglobin C1 polymer OS=Artemia salina OX=85549 PE=2 SV=1
-TGLSGLEKNAILNTWGKVrgNLQEVGKATFGKLFAAHPEYQQMFRFFQGVQL-AELVDSPKFAAHTQRVVSALDQTLLALNRpsdFVYMIKELGLDHI-NRGTDRSHFENYQVVFVEYLKETLGDSVDEFTVKSFNHVFEVIINFLNEGL-
>ERR1719468_1094774
-PPLTSNDRKLIVRSWTIVDqqISQVGLSSFLELFRRAPETLSVFPFLKQLG-PEDMEFYHQLKNHSIRITGVISMLVKQLESeerpadeaIRDLLLDLGRRHF-SYGAKTSHMELLGRVFAESLQPIFEGdPEAKAIQEAWLVFFSVIVFWLQKGFR
>SRR5262245_31323877 
----STDGAGLVMASLARVSdrSDQMIASVYEHLFAHRPELRLLFPS--DL------------KHQRAKLAGALRFVIENLRNpehVVTALEELGQRHI-AYGAKVSDLSSLGEALMSALEAHDPNPWDDLTRKAWHSAYDSIARAMSRGM-
>ERR1041384_2362020 
--------------------ANVLGERKvVAVLYSDLRGFGTL-----SE------------TGHAVDVLERLNDYFD----rMVAAITSHGG--------------------------------------------------------
>tr|B6BNK3|B6BNK3_SULGG Putative globin OS=Sulfurimonas gotlandica (strain DSM 19862 / JCM 16533 / GD1) GN=SMGD1_2554 PE=4 SV=1
MQELSQKHIDIIKESAELItaNDLKITNKMYEILFYKYPHLEMLFEN--------------APDNQFMKLAEALSLYAVNIDKiekLIPALELIAIKHV-EVNIRPGHYSMVGMALIEAIEEVLGKMAPIGFIDAWREVYKYVSDILIE---
>SRR6185437_15632065 
-----ADDVAIVRDSYGRIGprGAALTIAFFGLLSDRVPRVRKFFPP--DD------------KDKRAVAKDLFDLVVGHLESqlnVRWVLERMGRRGL-LDTITPSDVSAVGGCLLDALAELDE-AWSPATERAWSRVYDWAASAVV----
>tr|A0A0K8S6V4|A0A0K8S6V4_LYGHE Uncharacterized protein OS=Lygus hesperus PE=3 SV=1
---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNYTQttaekTKSCLRNWQRFTP-NGKFPPSKHLTSS-IHLWTFFTWNHIQPWRKHG-------------------
>tr|A0A0S8CN91|A0A0S8CN91_9BACT Uncharacterized protein OS=Nitrospira bacterium SG8_3 GN=AMK69_14025 PE=3 SV=1
--GLPPSDISRIQRSFRMVAsqGEKMASRFYDLLLERSPELQKFFHP-GNLS------------QQHAKFFNGLHSLILHLEHpqaLRAALVQLGEQHQ-GDGIEIQHYPPVVDTLLQVLTEFSGEGMDGETYDAWAHFLHLVRAIMLENH-
>tr|A0A0Q9HRJ4|A0A0Q9HRJ4_9BRAD Uncharacterized protein OS=Bosea sp. Root381 GN=ASE63_23130 PE=4 SV=1
----GDRAISLALASLETMGSeaEQADIMFNIRLLETYPDVYRVFC--MDFA------------PEERSFLRALAFILAHAGPfgaIGPTVRALAPSDK-VCRLISSRYHELEETLMWTLRRRLGVAFTAEVENAWRSVLREAPGVS-----
>SRR4051812_34838903 
-------------------KPirNRAIKLFFSRLIESHPSLLTVIG--DDYE------------AKARSLRPAVEMIIGCLGNmeaLRPILRSMARSNA-ELGMQEHHYLTAVNTILWTMERCLGSAYSAEVDAAWEDVCWQVCEAM-----
>tr|F2UFM9|F2UFM9_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_06664 PE=4 SV=1
-MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETeRLLYSS-DKS--------KSWNERHMARVGKSVGDVIKSLSNyddVIEHLTTGEPHEQ-ACCL--------TDG--YVIGTGLGNT----PRSLWLACGS-------T---
>tr|K0T9D6|K0T9D6_THAOC Uncharacterized protein OS=Thalassiosira oceanica GN=THAOC_11871 PE=4 SV=1
----------------MEREdssGSL--PSFVSETEIEPSDVQPaaasgenNVDKGRR------------KTSSSSKRTPSITKRIESFSSfksLSSSFS------------------SKLDDERNAGEAGQAERVEsttapESVASGETQGNAGGQHTLN----
>tr|A0A165S3D1|A0A165S3D1_9GAMM Chemotaxis protein OS=Halioglobus sp. HI00S01 GN=A3709_07715 PE=4 SV=1
-----MTAIMMIDRDFTVTYanEAT-----LQLLRDNQATLSSIYPGF---N----------PDKLI--------------------------------GSCIDGFHKNPEHQRNILADPANLPWRTDIEVADLKFS-LNVTAIVDAQ-
>tr|A0A1I2IR29|A0A1I2IR29_9GAMM Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor (Fragment) OS=Fontimonas thermophila GN=SAMN04488120_104136 
-----KGVIQYINRDFIEVS------------------------GF---S----------ESELI----GSPQNIVRHPDmPveaFADFWAT----------------------------LKDGKPWTGLVKNRCKNGDHywvLANATPLRAN-
>CZCB01.1.fsa_nt_gi|955242656|emb|CZCB01016507.1|_3 # 1728 # 2327 # 1 # ID=16507_3;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.493
-----GVSSFEMNQQFSAQSsdSIEKNIAAISELWQKYMATnitdeekvladkfvatrgafvkealLPAVDAL---R----------ANdYEKAKLFSTKARDLYNVAHpalVELIQYQAGHAKL-EYDTSVESYKLTRNWTIASLFLAVGFLACFAYFImrSIANPLSvifRVLDNIKSN--
>SRR5918993_5799879 
--AMTPEQINLVQRSLPAILaIRDRATARAgERLAVLDRAPGRLFAG-ADI------------GRQGAVLINAVTAAMQALRsgDYGSVLAALSQYHL-SYGIGPQHFRSAGAALARALEQELGSSFTADLGHAWAAACEWVGRII-----
>SRR3954452_18192940 
--XMEPQQIKALKQSLATVLsAQEALAVRFhQHMRRFEQCPRPLFTG-APL------------ARQGVLLTNAIAICA-SLPskNlsQAVAAGALSQYHA-SYGIASHHFHSAADALALALKDELGHIVSDVAIDAWAEACRMLGQAL-----
>SRR6516162_8663010 
---MKAETISTIKATAPVL--KEHGQAITQRMyeiaFDARPDARQLFATT-WM------VSSEEGRKQAGRLAGAVYAYAEHIDDlekLAGGSGAYRaaaRRHE-GPaGNLSGHWSVShgryqgcaKRCCHAGNPRRLARGIX-----------------------
>SRR5690348_5860809 
--QLPDGSVRLVKKSFAALEpvSADVMQYFYAWLFVQHPELRAMFPL--AM------------TTHRQRVFDALARVVRSTGSpaeFADQISHLARDHR-KFGVRAAHFKPFFAALLAAIREHSTGTWTSATQQAWEEALDCISAGLQT---
>SRR5258705_5637504 
----------LFSQLYQCSKntGRRSRGFSIDTCSKKHPELASMFNA-RDQSD----------GSQARRLAAGVLAYASNIDRlhmLESAITSIGRKHV-SINVRPEQYPIVGKHPLGAIKTVLGDARHPKFWMHGQRPTPNWQRSX-----
>SRR3984885_15745818 
---------------------SRAtgGGWLPTRSPTGRSARTSR------T------------GCRRGRCDGNTRPTV--ggPAALGGGQCEDSARDG-KLGLSADHADSAGAGRVdlAAVRHPGGAGV------------------------
>tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1
-----PANKNLIRSTWNMMVGdRGNGVELMGLLFQRAPDSKIDFKRLGDVS-AENIPYNRKLNGHGITLWYALMNFVDQLDSkkdLEDVCRKFAVNHV-IRGVLDVKFGWIKEPMAELLRRKCGNDCDDA-IQAWWKLIDVICAVLKES--
>HubBroStandDraft_6_1064221.scaffolds.fasta_scaffold2618798_1 # 2 # 181 # -1 # ID=2618798_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.622
---CSAEDRSIIQEQWKILFkdvdsskiKIAVGRKLVLNLIQRQPDAKVLFDKF-NVD----EPNSPQFSAYALRLFNRIDLIINLLKDpeaLDAALEFNAERYGNIPNIKKAYFQTAAQILAYALPKVLD-DFNA---LSWQSCTRYILTTVASKVS
>SRR4051794_1382573 
--ALDPALLNLVERSRPRVEhkITELADQLYTALLAQVPGLRTLFPL--DP------------NGRRAPLTDPLIWLLQRLDDrdeLVRRLADLGRDHR-KHRITAAHYETAGHALLDALAHIHGPTWTPPLAAAWTRAYTAATHDML----
>SRR3954470_25015505 
--EISEEQARMVKNGWQAAvdAPGDFGSDFYRDLFTVAPGVIGLFS--GDMT------------EQQGRLTHTLAETVELVDQpttLLLLLRASGVRHH-HYEVKHAYFSVMRDTLLNTMERRAGAVFDAAHRQAWEAMFDNMATIMQDG--
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514
--LISSKNLGLIRDTWAMARrDSDIAPKIFLRMFAQHPETQLMFPRFANVP-QSQLMTNKDFLQQAYTCLAGLNFMVKNMDDEDlviKLLSRMASPAFYvDFPTPGQQLDETTRLFLDVMQEELGNSFTADARNAWTTVMNQIHNVLVQQ--
>GraSoiStandDraft_30_1057271.scaffolds.fasta_scaffold222668_2 # 490 # 1347 # 1 # ID=222668_2;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.654
--LLSIKDKALVRESWTLAKsNNEIAPAVLLKMFAENPDAINLFPKISKAK-IGDLKGNKDLYNYAYSSFAGLNMIIKSIDEVKtiaTLFKNSDNPSIFlDSRSASLD--------------------------------------------
>tr|W4FW63|W4FW63_9STRA Uncharacterized protein OS=Aphanomyces astaci OX=112090 GN=H257_12922 PE=4 SV=1
--VLTPRHVELIKANWSAVCagtsafdVEQHgspdkffHRTFYATLFKADPSLRGIFRS--SL------------TLQGKSLASIIKVMTGvvSASNLVERMQALASGHL-KFGVKRQDYATLGVTLIQTLEIISGSSWSRHVKEAYLTAYCLLFYLV-----
>tr|A0A024UCA0|A0A024UCA0_9STRA Uncharacterized protein OS=Aphanomyces invadans OX=157072 GN=H310_04772 PE=4 SV=1
--VLTPRHVALIKQNWSAICrgtnafdSTKHgspdkffHRTFYSLLFAVMPSLRCIFRS--SL------------TLQGKSLASIIKVMTGvmSTSNIVERMQTLAEGHL-KFGVRKDDYTTMGVTLIRTLEVISGSIWTKEVKEAYLTAYCFLYYLL-----
>tr|R0JHX0|R0JHX0_ANAPL Hemoglobin subunit alpha-A OS=Anas platyrhynchos GN=Anapl_10052 PE=3 SV=1
-------------------------------MFIAYPQTKTYFPHF-DLS-----HGSAQIKAHGKKVAAALVEAVNHIDDIAGALSKLSRRRKKERfQtkPAPKNLPLAAHrCHQLNIASKGTEHygTNPQLAWLSTGHLVSGRELISSKSS
>SRR5690625_6805322 
--------------RSPSHsqtltLSPYTTLFRSRNLLRNHPELKNYFNT-ANQV----------NGFQPRALASIILQFAKNINHi-yeiVPKLERVCQKHC-SLGVQPRSEEHTSELQ------SRGHTVCRLL--------------------
>tr|F2UFM8|F2UFM8_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) OX=946362 GN=PTSG_06664 PE=3 SV=1 
-MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETERLLYS-SDKSK-------SWNERHMARVGKSVGDVIKSLsnyDDVIEHLTALGTRHA-RYGLHVDQLDLFINAFLWTLGAGLGDSWDHSVKKAWMHVLPFILSPLKS---
>SRR6267143_1520378 
---VTLEQIQMVQASFAKIAPivGPATDRKLRRCSALVAGFrkeTRLST--GVS------------KNPGRSEVRGTLCGASCCGSlss------------------------------NWVANIRRGI----------SP-LALAIASI-----
>tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii OX=37682 GN=F775_23753 PE=3 SV=1
-STFSEEQEALVLSAWDAMkgDSAAIALKFFLRGRNN-------FVQLAHVE--SPKRRIPVVEERKTDL-----------------IFEIRTKTW-KIGQKSTAYRSW--LLLR--QKSLPa----HAPKGHLSElvpldTIDHTHQET-----
>ERR1700722_6370008 
----------------RGIRPhcPavrqhLPCVLPPH--VRAGSVASHAIPQ--LS------------APLTATLTAALEALVGALGDLQPVLVrapALGLRLA-SYGLQPTDISIAASAFLATLDDELDEVSTNAARAAWGCVFWTVA--------
>tr|A0A0M1J4K8|A0A0M1J4K8_9GAMM Uncharacterized protein OS=Achromatium sp. WMS3 OX=1604836 GN=TI05_18490 PE=4 SV=1
SKDIKPTNIYLYQASLNRAiNTSKFCDRLYFNFMNGNIEIANIFKG-RSK------------ERIQHKLQTTLDLVADNANQvpgNNIYLEMLGRIHT-KRHITPEHFKRWKFAVINTIAECDP-NFDTEICAAWEEVLTALIDKLI----
>SRR5260221_159328 
------QALGLVREGFAAVIarPDVFVSELYQDFFTSNPRYRKYFGS-ADIGySGsADIngTGSPEighaaadITRRNAKTVEAATRIVADLDRpgvLLPYLRKLALEYR-KYGVREAHYRAFAGSVMTALERTIGQAWTYEAAEAWVDELTMVASAMLG---
>ERR1719266_796048
-VSGLGTLSIISQASWKAISGeiHSSGVAVFVEIFKAQKEVQQIFQKLNPNPNSSGIkytkdqALKESLHEHGVKVLSGVDEVLSNLDQpslCLSLIRKTGAFHRKLQGFKPKYFKCFEEPFLAMVQSSMGQRFTPQMEIVYQSVASFFVQTLIEGYN
>ERR1719402_1083666
-TDLSTNQKNMIRDAYAVFekNGEKNGADAFIYLITQHPDLKKVFP-WGDVS-NEELRENQVFKDHVYVVYKGLKVAIDRIDNLKAtasYYVHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQTSFNNLLQFLVGNMKV---
>ERR1719295_364028
--DLTPEEKRCIQRTIPVIlqEAEMIGTKTYLKTFHNYPLSMIYFEPLRDKLVTEVKQTDDYLKKHGVLFVKFIGELVAEMDDpdsVDLKLKSLGRFHD-DLGVLKQYLEAIGPLFVQAIRPVLMtqasipsatncgvgvsspnSLWTRDTKPSWIRFFRVIALQMKRAY-
>ERR1711860_326342
--ELNSDEKTLIVTCSKQLleIQKVLGPQMMQQKFQKV-----------------------WSKEAGEL-KQLYDMR------------------------------------------------------------------------
>SRR5215213_6828293 
--------RR-----LG------------------------gRIRC-APdR-----------PQRPPVRPRDATDC---------------VQAHV-PRGA--GRAVHRGRPLpAGGGGPGPGEAVTPEVAAAWEEVYWLFAVQLIG---
>SRR6476659_6585810 
---------------------------------------------------------------------------------------HVAN--A-RFTPC-PTYVDDGAavvtNPGKHRGADAGRAFSENLSVDWNAG-VRTAPPLVA---
>tr|A0A2B4SAV5|A0A2B4SAV5_STYPI Uncharacterized protein OS=Stylophora pistillata GN=AWC38_SpisGene8312 PE=3 SV=1
-------------DTFGPKEsRCREESVCKVRLLELNPNLQDAFPSFRGVS-LDELMNSRSLFLHSKRLMAVVEEAVSSLDDakeLIEDLTNLGERHL-AMSITEKHLKNLQRAGPATNQDAKHRLLANKGTAQIDRHIARMEDTRLP---
>tr|A0A1E4GLJ3|A0A1E4GLJ3_9CAUL Uncharacterized protein OS=Phenylobacterium sp. SCN 70-31 OX=1660129 GN=ABS78_22870 PE=4 SV=1
--ATAFARAADIEASLELLAerDIDPTARVYQRMFELHPQMEPYFW--RDTD--------GKIR--GEMLSLAFAAILDFVGErryADHMIGTEMINHE-GYDVPRDVFATFFAIVRDALRDLLGADWTPVFESAWEEMLAEIESYARQ---
>SRR5699024_10012150 
--------XLVCLLSLPCPhpHLNSFPT-RRSSDLSKAPELYNIFNQ-TN----------QERGIQQEALAYSVYAAGENIdqlDNLKELISRVTEKHA-ALGVKAEQYPIVGETLLEAVEDILGSdVATAEVIGAWEKAYNYIADAFIE---
>ERR687884_344007 
------------------------------------------FPR--TT------------TAHNGRAQQSSTANRRaDYPRrapMNNLSRLLKESWT-LVEEQQDKYQVVGDALLEALRTFAGDQWTLEYDQAWRDGYALIAQRMIDG--
>tr|A0A0J1H5I9|A0A0J1H5I9_9GAMM Uncharacterized protein OS=Photobacterium aquae OX=1195763 GN=ABT56_07590 PE=4 SV=1
------DFHQIFNDSYQRCqRHPQFFQIFYRNFWQQEERFQKMFEN-VDM------------TRQIKMLKLSILMIMLASTSeeAKDNIRRYARRHGPdGIGAQPEDFDIWIDSLLKAVKECD-THYNSDIDKAWRTCFKTGMEIMKQET-
>tr|A0A2E7C7Y6|A0A2E7C7Y6_9GAMM Uncharacterized protein OS=Haliea sp. OX=1932666 GN=CME43_15375 PE=4 SV=1
------TSKELFLHSVTRClTHETFIHAFYLRLFDASEEIRAKFRF-TDL------------EKQNAMLRRSLLLYAEATAgRteALREVNERATTHDRhHLDIQPHLYAVWIDTIVTTARDFD-LQWNDDIEVAWRTILGHVVQQMIRRY-
>tr|A0A0F6YJJ2|A0A0F6YJJ2_9DELT Uncharacterized protein OS=Sandaracinus amylolyticus OX=927083 GN=DB32_003309 PE=4 SV=1
--------MDTTLDSFRRLRERGFAHRFYEQLFVADRRVPRLFAG-TDL------------ARQRDLLEHGISMLLAYQRgSalGEIAMRRLALLHGPrGLDIDHDLYAIWLRVFLDVAGELD-PEWTPELAAAWHAQLGASIAEMHRRG-
>tr|A0A244CWV0|A0A244CWV0_9GAMM Diguanylate cyclase OS=Pseudoalteromonas ulvae OX=107327 GN=B1199_05805 PE=4 SV=1
---------------------------------------------M---ET----------VNSKAKVLNKLLIA------tsVVLISFIVSLQLA-GVEMGQSSIIAILVFGIASIG---AMAF-------LYKAVEQIADKLNVIEE
>tr|A0A0L0EW98|A0A0L0EW98_9GAMM Chemotaxis protein OS=Pseudoalteromonas rubra OX=43658 GN=AC626_03140 PE=4 SV=1
---------------------------------------------M---NS----------QSIQSSLNNKIIIA------gvILVISIVVGIQLG-ASGAENMQLVAVALPLFGVVV---ALGY-------LKMALSAVSAQLGCVYR
>tr|A4BJG5|A4BJG5_9GAMM Probable methyl-accepting chemotaxis protein OS=Reinekea blandensis MED297 OX=314283 GN=MED297_02020 PE=4 SV=1
---------------------------------------------M---NQ----------LNN--ALSARILIV------gtgPALLLVILNLALA-GSGSA--TVLNL----------------------------------------
>SRR4026208_2063884 
-R-SVRTSKGHRQGHPPAIQkhGGAITTAMDARLFE-NEEVKAMFDQAAQES-----------GEQPRRLANAILAYarnIDKLDMLTAAVERMAQRHV-ETGVKAQHYPYVANALPPTIRDGAGG--------------------------
>ERR1712080_92393
TMSLSAGEITAVTASFEAVKadLGTNIGKVLQKLVAEHPDLKPHFPW-HAVP-TADLLGNDGFKTHAAQVGRGFAEAAGNLSNLsacEGYYVSLGDRHK-TRGFAAAQVPMVADAFVAALQ------LTGDDASGWTKLITFVGSSIVSG--
>ERR1719334_3108017
-TGLTPKQAQAIISSWENLNSEC-SSLLFKQLFTIFPELKEYFG-FSKRELVDKILNSEEMIAHMDATWNGLDKLVLSTQTgtrFAAIGKGLGYNHF-KFEIDRQDVHKFMDFFKQVLKDDLKSQFHGDLEEAWNIWCKAVEDVFIMGY-
>SRR5207245_2384740 
--NPQPST-HAVTEQVVTLDv-----LPWTSGKLGLGPGKarlsEPLAP--GDT------------LE---SL----------LERQrarIPGfeewVYDArerriheHCTLL-VNGQAEYRRHTAEVEI------------------------------------
>SRR5689334_4915957 
-----------------------------TASQRVTP----SLR--GKR------------VPSGQmgdRKVPD-VPIVDAHVHLwdpTAFrmpwLDGNKRLNR-PYGLADYREQTAGLPI------------------------------------
>GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold2022664_2 # 351 # 797 # -1 # ID=2022664_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.631
--------------------------------------------------------------------MPDFPI-VDSHVHLwdpNHFritwLDGNPRLNQ-RFAIPEYREHTAGIEV------------------------------------
>MudIll2142460700_1097286.scaffolds.fasta_scaffold02451_1 # 3 # 1031 # -1 # ID=2451_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.574
---------------------------------------miGSRAL--AAL------------FPHPKTFMDTKRPVADTHIHLwdpGYLtypwLETVpaiagph----G-PAELQVQEPETDRFRL------------------------------------
>SaaInlV_200m_DNA_2_1039689.scaffolds.fasta_scaffold02144_7 # 4497 # 5432 # 1 # ID=2144_7;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.499
----------------------------------------------------------------LQCGVATVRSVIDSHVHFwqpQRLrylwLDEVpair----H-PFTPHELNQATQAIDL------------------------------------
>tr|A0A0K2U629|A0A0K2U629_LEPSM Cytoglobin1like [Saccoglossus kowalevskii] OS=Lepeophtheirus salmonis OX=72036 PE=3 SV=1 
MTLLTKKETFLIRESWKLVTPEmtKHAVGYYIGMFVSYPKWQDrFFRRIKGIP-LRDLRNNPILAAHSSQVFSAVSNLLNNLENtevIVEGVKKIARTHW-PLNIRGKELEAGLVLLLDYLEASFPGQISKECGDAWNKMFNAMSGVIVD---
>ERR1719474_2118124
--SLNPTQKCVIVATWHSIFlkhMNFMGKQLFVDLFKVEPNILKYFDAFRDVG-LANLLQSRSFQNHGVRIMNLVKFAVENLDNpekLQDHMHALGRLHV-KKGIDSKYLNIMGPTFCQAIRPMVMaeGQWSIDIEGAWIQLFKILAQMMRVAYE
>ERR1719328_19047
-NGMTPEQKQLIDDSFAVLKkdVKGNTIVFYETFFKMNPELVAHFPGVSE-ADLVNLGKNEFIIQRGAKFFNMIETTTHLMESKegcLELVRMLKESVP-EGKVTYDRYKVAKEPFIKMMETALGGNFSAETKAAWRKFFDSLAETTK----
>SRR5581483_4578849 
-------QIALLEESFELIAgqSVELADRTLSRLIELDPQFRLLAAR-TEM------------AALRSVLFSVLyvlRRSLHNLNTLAPALETLGALRK-DQELSSEHFGTIGIALLDAMAEVGG---------------------------
>SRR5690349_7596073 
-------------------------------------------------------------XMQMTRFTDL-GLRTLMLLasaestgrrvtTRTIAVGANASEHH-VAK----------------------------AVSRLAELGMVMADTLIE---
>SRR2546430_1826610 
--SMNTLERQLVRATWIDLaaAPELLAAHVYDRLFTLDPSLRLLFLG-AEL------------SSPGATLTHAIDVAVANLERLEQTVARLGPDGT-IPSVQTET-GILGDALLWAVGSMLGPiACNPAVRGAWAKCCALLV--------
>SRR5262249_54424048 
--TMNAYDRELVRSTWVELsaDLEVLAENFFDCLFTLDSSLRLLYLN-TDR------------VASGRALMHVVGLGVANLERLEQIAARAA-DED-VHAIGWKTGGIAGDALLRAVERTLGPaVCSPAVRDAWSRCCATLV--------
>tr|A0A2H1V3P2|A0A2H1V3P2_SPOFR SFRICE_008656 (Fragment) OS=Spodoptera frugiperda GN=SFRICE_008656 PE=4 SV=1
---LFGSqEFKACCsgMGMGKIGKGGIGPPVtsL--tqrnttqalfhvgflPYLRAAIQwctvqvDNSFDYLGIWT-EpVAFSVDPLLIAWlaykpTVKSEASLPAAVKSLSQtqqIP---------FR-RRSTP-----------------------------------------------
>ERR1719309_231760
-TTLTEEEIQTVKTMWAGLleNSADSGLFIFQNFFELYPEQVHRFSFIRDSQGNpiPNYLKSQAMLQHSAMVMDALDGVITGVFehDplLGQMMYNAGYSHH-SKNIAKDDIEKLSNSILEVIKLVASCegSGKATKVEAWRKLLNIVNERFEQGF-
>ERR1712168_640531
-----------------------SGLVIFDHFLKMYPQQVKKFQ-FIQDKNgaiQYHYIVEPRMRVHSEMVMNAMDAAVVGIlrgHNVKQELEDLGRQHQ-SLRLK---qeeAAKEQEEREKEEEEEEEKeEE-AET--------------------
>tr|A0A1X2H2S4|A0A1X2H2S4_SYNRA Uncharacterized protein OS=Syncephalastrum racemosum GN=BCR43DRAFT_446018 PE=4 SV=1
--PPTAAQLKVIRRSWELVSdtrwpnepqtmspCQAFSIAFYDALFALDRTIESALSNI--ILQGKalsgilsHLVRTRVVLDEAK------------sidETHFARKLQAIGATYI-EFNVQPYFFDLVGPALISALQRRLKEEYTATIEDAWLTAQHYASYHL-----
>sp|Q7M416|GLB1_LIOJA Globin-1 OS=Liolophura japonica OX=13599 PE=1 SV=1 
---ISADQAKALKDDIAVVaqNPNGCGKALFIKMFEMNPGWVEKFPAWKGKS-LDEIKASDKITNHGGKVINELANWINNINSASGILKSQGTAHK-GRSIGIEYFENVLPVIDATFAQQMGGAYTAAMKDALKAAWtGVIVPGMKAGY-
>tr|A0A0P5UDG4|A0A0P5UDG4_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
-NILSENDITTMNNSWSILRkRSDFAPKVFVRYFKAKPEAQKLFPEFASIPL-TDLPNNHDFLNAAYSCVASLDYILPHLKIphPerCPVLMELKNKysnvdlkkfgpixxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxrcpvlMELKNKYSNVDLKKFGPIWMTAMQEEMGNALTNEVRDVWKKAFVAFTD-------
>ERR1712000_676789
-MSLTPQQSAQIRSSLPVLKseGETITSLLYASLLHNHPDLHNLFNSV-NQANG-------RQPRALLSSASVKGTARWESHQLS-------------------------------MISSRGTCWRPSR-RSWGPSGRLSX--------
>SRR4051795_8230555 
------PAVT---------------------SPRVpA------------------------------------------------FgSPCPVIRQQ-RWTGAI-----IGTRQEGSVP----------SAHSTTSGD------------
>SRR4051812_47002672 
------RLSA---------------------TPARtG---P---------E----------TRE------E-----------eTPSMaERTLTAMYD-DR---R-----AA---------------------------------------
>SRR5215203_3322109 
--ELSERTIALVKATVPALEahGLAITRRMYERMFH-NEAIRDLFNQ-SHHG---------ETGSQPKALAAAILAYARNIEIlaaWGEAYWYLAEVLI-ARERLIyqglaaapGGWTGWRDFTV--AEKRCESEVITSFVLRPTDGGPVLRHR------
>SRR3954470_353290 
------ARRS------------------------------------------------------------------------SPLaEGDPRYHVH-QWDRGRQPRRSTRCRVTPPVT----------NIRRYLVGP------------
>SRR3954464_15980397 
------RRVW--LA---------LL----DV-LRRsGP-AT---------V----------VRS------C-----------sEMPLfrPGNAPRSAM-GSVPIK-----SVNLNSLPCTDVLGEDATPEILGAWGEAYWFLADLLIA---
>SRR6478735_1414904 
------SGSR---------------------PARLaS---R---------P----------SW-------------------nHRPIgEATLVNRYG-RS---A-----AGSDVE--------------RIERDLSGT------------
>SRR3954468_7455402 
------APPD--RA---------LT----GGGETVpG---V---------R----------ASR------P-----------rTIDRsGRTLVSQSE-RS---A-----EGSGVE--------------EIERDLSGT------------
>SRR3954470_12739883 
--------------------------------------------------TS------ACSRTRTSATCStsrtmarqapsprrspPPWSPMRAISTtsaRSPRVERIAQKHV-GLNILPEHYPAVAESLLGAIKDVLGVTHYSRGLTDDPDWYPYLKKHEWL---
>SRR5215831_13609655 
---------KPCNRSKPFFRinAFCSAvslalrlQRLCELPESAHPQRC----A-SCLK----------TANPAKNVVPKRFGTFISIHLrdtYIFAVSKIGQKHC-GLNILPEHYHYVAESLLGAIKDVLGEAATEEVLSAWGEAYWFLADVLMA---
>ERR1719273_448027
--------------------------------------------------------------------------------------------MD-AWTDVYN-------ALTKVLQ----------SLEDNIKGA------------
>tr|A0A0P5DF02|A0A0P5DF02_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
---KPANDRRIIRKTWDQAk--------------------------------------------------------KDGDVPpqiLFRFI-------K-AHPEYQKMFKSFADVpqae------LLGNGNFLAQA-YTILAGLNvviqslssqelianQINALGA----
>tr|A0A0N8DDV1|A0A0N8DDV1_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1
--------RRIIRKTWDQAkkdgdvppqilfrfikahpeyqkmfksfadvpqaell----------------------------------------gngNFLAQAYTILAGLNVaiq---ALSSrslLPTKSTRSEVPIS-PVeLPPSCSSNSATSLrksllk------SSAAPSTprpdkpGRTVCALWSLASPRTSRTPK----
>tr|A0A0P5CUZ8|A0A0P5CUZ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
---------------------------------------------------------------------------------MFNPAGKT----S-GVPATPSFP-PSSSIssrrlpa------prSTSSNSLANLTKCSWVR---------G----
>tr|A0A0N5DPZ7|A0A0N5DPZ7_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
-MNLSAKELQLIEQSWLDIeNKDELGKEVFKRVLLSNEKIRTIFDL--HTCPDDELDQNETFKRHLKSLSLFIGICATSVavgsERLVSIARRIGEKHVNFRWVtfDAEYWLLIKGIMVDVIASKQRPKEVEKVRSAWNTLLSFVISEIKH---
>ERR1711868_89060
--GLDKKQLALLQKTWKDISteMEAQGVRLFVEIFQSNNEVIHVFPSLNPNLKGNraNEVIHEAFKNMEAKLLPESMRFFT----------------------------------------------------------------------
>tr|A0A090L154|A0A090L154_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti GN=SRAE_0000030700 
--NLSHEQQALIRKSWRRVPKQNIGKIIYQKIYQKCPELKNFLSS--DN---------NCVERHFRYFGDMLQCTVDSLNELdkalYPWLTVIGSGHA-GFAITTAHWDAFGEALISSIKQWILSgKEHKETVRAWMKLSCYLIDTLAAA--
>SRR5256885_864722 
--VLTDRQRAIVQSTVPLLEtgGEALITHFYQTMLGEYPEVRALFSMAHQQ------------sGAQPRALAYSVLMYAKHIDRLEalgDLPAQIDRKST-RLNSSHLVISYAVFCLKKKKRTGSDS--------FTRSE-----RLVV----
>SRR5256885_6575144 
-----------------------------------------------------------------------XMVMSMRGPALEaagTTGCRSCSAAV-CCSFF--------FQAEDGIRDYkvtgvqTCAlP---------------ISDILIGA--
>tr|A0A016SWG0|A0A016SWG0_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0168.g192 PE=3 SV=1
--QLTSEEMDLLRSSVRIIseNATEVGCNTYEMIFEQSPYVKEFFH-FTKSD--DDAYRQKQTVQLAQKYMQVLIAFVEGIEDpsiLEPVSAKLIEIHRKvddVQ--MAAHWGVFTECTLYNIRKALEKDehFNDmdrldAAVMLWRMVIRGIVRRLKA---
>SRR5262249_10507301 
---------------------------------------------------------------------------------NvkySSHHQQHGPQAR-GVRSTNLAFCCVWRRTEMG----------P-ATAVWSGVHCRDAAGMDG---
>tr|L7MTK4|L7MTK4_SYMRO Neuroglobin OS=Symsagittifera roscoffensis OX=84072 PE=1 SV=1
-MQVSEEQQSLIMEDVQVLlpNYDDFVEDVLQQFMEENPETFQIFPW-ADASkTAKEMRSHPRFKSHAKSIGKVISDCLVDLNGvkkHEPKLSSLGAMHT-KKKVPTELFGKLGGCILTQVVKRVSeAKWSEEKKEAWLKAYGIITV-------
>ERR1712227_290716
--KLSTKTIDLLKGSAAEIKenGTAIATELFKILFERYEVFKDLFPA--DVI------KNG---KMISVLPhalSAFAEFADNMLELDDTINRIVSRHV-SNGVQQWHYPLLEECFIDALDKTLKLDKRPELLQAWKDGFKFLANKVM----
>ERR1711868_248053
--RLTPDTIEALKYTALEIKgrGNDIAKSLFDLLFTRYPVFKDIFPD--ENI------QEG---KMFTVLPialHAFAANCDNIAAIDETLARIVTRHV-DRNVQDWHYPMMEECLIGALRMHLEDDEGMDAMEAWKDGFKYLANKIM----
>SRR5262245_20097952 
--EVTPQQIELLEQTLSELRrqSVFAAQLFYCRLFSLRPRLRRLLSG--RP------------DFHGTRLLSVMSAAVAGLSDPghfAGLLSLAARPAVREALLQGDCVRVIGDAVHWMLERHFGGQITVEVREAWRAAHIRITQVIE----
>tr|A0A044TBZ8|A0A044TBZ8_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1
--NFDDAEIQLLRRSWKTIKpeKQT---------VLQCPEVRRFFPFM-NSDLKSCEKKNKRFVFQALRFIQvdmtIFNEIIISSF-----S-------------ndIAILMLVFLECSIHQIRITLLNSkldlWNRKdvdnVIILWWHLNSGICGKIK----
>ERR1719186_618842
-----SVQTREIRGTWVVILaqLQKVGVQCIVDLFELHPFVREHFKEIlvqyGKLDPDNDNALQNVLENHAKLVMNIVHELVVNIDNLdglSERLQKLGLFHV-RNAVPKKYSSTIVAFSHTEMHN--CRdlAFNFPETHELHG--------------
>SRR5688500_15455526 
--AITPYDALLLQDSFRAIQqqSGPAAERFFRELFSYDSSLKQLFAS--DRW------------RREEVLMKALGRLVDHLNSpdgVGPHLVELAREHP-AYGLSNYHHLYFGAALFSMLELVLGARFK-LVYGAWFKLFQLAVSEVK----
>SRR5690242_19663030 
--VITADDVRMIQESFRRVEsvRASAAERFFRELFCYDEMLRGFFPP--DRW------------SREEQLMSDVRGLSEGLTQpdkLKLAIDALALRLD-GSLRRTPLHLYIGAAWFSTLEMVLGSQFDRRLHAAWYKLFEQVVA-------
>tr|A0A1I5XDG1|A0A1I5XDG1_9PSED Globin OS=Pseudomonas borbori OX=289003 GN=SAMN05216190_1566 PE=3 SV=1
-----ADDAALLEETLEMVSsrSEDLTPDVYARFFSRCPAASGLFTvI-DpatPP----------M--GCGQ----MLFEIISLLRDsaagkPYVAsyMQQIATEHaA-FDVRDPALYREFMHSLADVQATLLGPDWSPAHAQAWDRQIAALLRHLP----
>tr|A0A2D8QSR0|A0A2D8QSR0_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP89_08285 PE=4 SV=1
-----SSKDDVIAESLSLVAerAGDVTSVIYEKYFMRCPSAEEVMSH-LDA----------Q--VLGK----MMEEVYRLLMVndyesENDYLNWEVSNHeT-AYNVEPHMYEGFFSAVIDSVREVMGSQWTPALERVWESKCEELRSEIA----
>SRR5207247_8066543 
------LDVQRLQESFARMAmhGDAVPLFFYSDLFLRHPETRDLFPV--SM------------AAQRDRLVDALGRIVSDvehVDADSGDPSGARPEDA-HIQAVRILsnAQQMADNYVADAQEY-----SSQLSTX-----------------
>ERR1719419_503384
-TDLSPKEILDIQMSWAEIHQEgLVnpDVLMFKLFFEESESGRLKYSHLLkNVNLDnlnwmRDWTKVQKLKDSIDKTGEALGDVIKSLNyhdRVVDKLYSHGVVHA-KFGVTRKEIHTFCECLLMTLKMELGTNLSQEAQASWERLLKMIVEVFC----
>SRR6266536_694904 
---------------------------------------GTRFA--DSHR------------PPRTMERTGplrDRLALRALRlgvgdvvwEDVPSLKRSMCG-----------AAAAGAAPVVAAVASAAPGDPQKHLKRADQVYAKSILLRMS---
>ERR1719230_2183946
-SWFTDDRERLLKRSWQQLQldsCEEAGALLCRNYCSQSPEDAASC----G--------------MDWSAVIKVIGFPIDRMDNLafvKKRLRCLGANHA-KWETKEHQFQSMKYAFLSAPRDVFANEFTSDLELAWDLLYDFVSTEMIAGL-
>tr|A0A090KT29|A0A090KT29_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X0
-TKLTENHRKVIKSSFEIFKknGVPNAHNIFLRMFKEYPDYKNVWSQFKNMS-DEELSQTPLLWKHATTFVFGLERVIRTMDDqemMILMIHSTANQHK-SWGLKKEHFFAMVHLITDILMEEKGEpDEKYAIMEAWESFYDVLGT-------
>tr|Q6BBK1|Q6BBK1_9BIVA Hemoglobin chain I OS=Calyptogena kaikoi GN=Hb-I PE=2 SV=1
---VSASDIKNVQDTWTKLYdqwEAVHASKFYNKLFKDNEDISEAFVKAGTGS-------GIAMKRQALVFGAILQEFVENLSDptaLSLKIKGLCATHK-TRGItNMELFAFALADLVAYMGTTI--SFTAAQKTSWTAVNDVILHQMSSY--
>tr|A0A0N4TEQ4|A0A0N4TEQ4_BRUPA Uncharacterized protein OS=Brugia pahangi PE=3 SV=1
-IPLTRKQKFVLIKNWKGIErdVTTAGIEMFLKMLTEHPEYYEFFN-FRNIANTakEKQASDERLSAHGAAVMKFIGKAISQIENadaFFMLLENNGRQHAHRGAFRPEMFWASYSFTCYSFSNGFIRNFFSNI--------NLLLTKVEMSY-
>SRR5690625_5362168 
VLRSPPpphpaasslSLRDALPLCAGVVaeHAEEITTVFYRDMFEAHPDLLNVFNV-A----------NQAVGEQPKALAASvVAFADRKSTrlnsSHVA----MSSAVS-CLKRRSPERR-RG---------------------------------------
>tr|A0A177B679|A0A177B679_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_02502 PE=3 SV=1
--GLTKTDINMVLGSWESINNDEASSIFYRELFNTYPDTKSLFVKFYSVD-NDKLIDNPAALKQLRVTWTAITTLIDYLKKgrideANKAIDYLIEKHRKIKTFQGPMFNMALEPLLYLVKEKL---TSQAYIDAYKKVFGAIFLTIISKY-
>tr|A0A177AVU9|A0A177AVU9_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_06067 PE=3 SV=1
--HINIKDIERVSTTWDLLDDKKSAIRFYKHLFTIYPQTNKIFVKFHNAK-VDSLGTNAQALKIAKAMWGSASHIIISVSEgnlkeIYKSIDYLIKIHVNVPKFSPTMFELAVKPMVATIQEKI---TDPEILQAYVNIFTVIIEKLKTSY-
>ERR1719397_1495121
---FGAAQTRMIRSSWSIILaqMQTVGVQCIVDLFNLIPYMREHFKKViadsGRMDPDDDSAMQAMLENHAKLVMNIVHQVIINIDDLdliSPKLFRIGVFHK-NTGILPRYLDIMGPVFCNAVRPILLKhkMWSAETEDSWMEVFKVITSIMKRGY-
>tr|A7BZS6|A7BZS6_9GAMM Globin OS=Beggiatoa sp. PS GN=BGP_3767 PE=3 SV=1
---------ELIGQSWDKLAGkhEEMVATFYDRFFDKFPHYRKFFP--ESM------------EHQLKRMAETIALLARVTHEtevTHPHLVKVGSRHT-GYCLAREDLDNFKTIFVQVVGEYCGDDWNQEYQESWTEAfEQHIIPYM-----
>ERR1712048_439078
----------NVTTIWDSIKavpgyEEKFGRMLYEKFYEMEPESFKLFKK-TRQPAAEDVFSDPVFVQHSLEFVRLLDFFIQVLGPdielVEESLVDFGETHQ-DYGVTLDTYSSFGEAMTETVEELLGGngKMDETSRRCWVTAYRYMSMHMTRG--
>ERR1712048_1339107
----------NVTRWWDEIKripgyEQKLGATLYQKFYDLEPDSFETYTS-NLT-PTEDIYSDSTFLENSATFVHLLDFFVQVLGPdlelVEESLIEFGARNYNDFGItTVDSYSSFGEALL-----------------------------------
>SRR6516162_179054 
----RSQTVMDIEESLHHILerEKLVADLFYMVFLEKYPEVRRHFINV-N------------LRRQAVLLTMALQVVVQYYLKgfptAEAYLKILGEEHN-RRGIEPELYPKFCTALLETLSRFHFHDWSEDLAQQWEEALKLAATEMVEASP
>tr|K2K1I7|K2K1I7_9RHOB Globin-coupled methyl-accepting chemotaxis protein OS=Celeribacter baekdonensis B30 GN=B30_11265 PE=4 SV=1
---LAVKQISLVRNDFRRLAPvrPEMFKRFYERLFEIAPHTRDLYS--ESL------------TEEAIRVNGLLEIAFLSLDHpqaMFATLHTLGRDFS-GFGIWETQSDLVVDLLVEVFAEFGGEDWGTELEKAWHSVLSFIAQGMKEG--
>tr|A0A291GF03|A0A291GF03_9RHOB Uncharacterized protein OS=Celeribacter ethanolicus GN=CEW89_16165 PE=4 SV=1
---PSARQIALVRNNFRALSPkrPDIFIPVYDRQVGEDPKAAAQYD--GSL------------CQRARVLDGLIELALLSADHptaLFATLHKMGQDYA-HYGSWREKHPFLIGQIIKAFAEATDTHWTDELADAWEQFLYFMAEGMLEG--
>SRR4051794_12469468 
--------------------------------PPTMHDLRILLAG--DA------------GVRREQVGQALSWLVDNLDQprvVAATCADLGPALQ-QVGASPQRLDALGVLVADALRANFGAAWRQEHYDAWHSSARLVTSWMGQ---
>tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1
---ASADDIALVASVWVFVkpNLEEVGNEFYDQFFAKHQDLKATiFL-------------GTNFLTQAIRVMEMFDAAIEAMCDpvaLMELLVPLGERHA-LYGIRKEHYDIFWPALCIALKEQLGDKLTDDVVQSLHRVYYKVIQVMLE---
>tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1
---FTPTIVRTIRTTWAAAtkDMDAFGDRLYTAVFALDRTLKeTIFKG-TN------------MSAQAHHIIETLDSCVRIMDQpnhLMSMLRQLGVRHG-AYGVGRHHYPTIGKALISALEGSLEDKFTLEVNKSWTKFFNVIERSMLEG--
>tr|A0A1V9Z083|A0A1V9Z083_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_04708 PE=3 SV=1
---PTATDEDLMTQSWDDIIgcklrAEierrkapstepspeaptttsaivQFYDTFFSHLYVINPETRSVFRN--SM------------HVQSKALVNIVGAIRHVlhSDDAKNMVAAMAVRHI-QYGVKLEYFDNLGVAMIQTLSKLAGTTWTTAMADAWHTVIAYIICLIVPHY-
>tr|A0A1I7UV11|A0A1I7UV11_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis PE=3 SV=1
MDRLTERQKQIFTETFPVVfkDSRRNGLVLFAKYFSEFPHYKNIWPQFRTLQ-DSALLASNELANHCSVYMSGLKEIVEVMDDeekLTYFMARIARSHV-KWNINKYHITNMLEGVDAVLQRSFGDKLTDEIVNAYHTLYDVIGNLLD----
>tr|A0A0P5Q0G6|A0A0P5Q0G6_9CRUS Uncharacterized protein OS=Daphnia magna PE=3 SV=1
--SMKGRGSCFDQGHLESCKkNGNIAPKAFIRYLKLKPEAQKKFAAFAEVDL-ADLPTNSHFLNQAYTCLAGLNAYSDNLGKNPKSCPYLNSP-AF--KdVKPDELKLFGEVMFNVMEKNWTIIFPRQARKAWKDGLTACDVA------
>tr|A0A258C6P4|A0A258C6P4_9PROT Uncharacterized protein OS=Caulobacterales bacterium 32-67-6 GN=B7Z13_12975 PE=4 SV=1
------MNTQALLDSLDLVAeHGeDPTPRVYERLFARYPETEALFMG--DTR--------GA--ARGQ----MLRQAIETLLDYlgpnafaANFLRAELHNHS-DIGVPTEIFPRFYQAMAEAFADILGGAWTADMQRAWDDLTAKVEQIVRG---
>ERR1719244_673251
------GQKDLIIASWREIriCLDEVGFDTFKQLFAHHSDIRAYFPAMKKLSS-NDVEMSRKIKEHSTRIMAVLKLFVDNIYDLekiEPSIEDLGRNHS-FRTLLGLFLSE-------RISGQL--AWR--------RCCFNYLNIS-----
>tr|A0A1I3XAR1|A0A1I3XAR1_9PROT Methyl-accepting chemotaxis protein OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_101121 PE=4 SV=1
-----QAAIQRA-EACLTLSadGLVLEA---------NDRFAALL-G---LA----------PAAVADRPHA--ALLTLAERDgatYRRFLDQLAQGR-------------------------------DTVARLWHQGAggagvllELSAAVMAAD--
>tr|A0A1I3XA39|A0A1I3XA39_9PROT Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_10
-----MAAIDMA-QPMMLLGadGVVQDA---------NAPLAALL-G---VS----------ADALAGRPHA--ALLAEAERDsaaFRRFRDAVAAGQ-------------------------------AGHARLRHAGAggntvtlDLMMQPLAAE--
>tr|E3MNQ8|E3MNQ8_CAERE CRE-GLB-30 protein OS=Caenorhabditis remanei GN=Cre-glb-30 PE=3 SV=1
-SHLTPIDREILNKSWAIVskDMQQVAVNIFQMIFEQAPDAKLMFSFM--MKDYKEDKKSNEFIFHAVRFLQVIESTMTHLDDpsqLDAVFLNLGKIHAkheEQLGFSAHYWSVFKECVLFHFRKAMKAHnkFSkhkemsfAEIDSAiilWREVLRFIIDRMKVGYC
>ERR1740129_566420
--QLSSASVETVRQTAALVgsRAQEIVEAFYRGLRARYLELFQFFNR-TNQTSN----------RQSRALAVALTafaSKIDELSEIHGLLEMISVKHC-ALAVRPRHYMLVHENLLAAMEEVLEDQLTPSGYDAWSDAILYLVRLLTEQ--
>ERR1719183_2765469
---------------ADIFmpRLEEIVMRMYNLILEEQHECINIFNT-PSLSPG----------QPLAALAACIRgliEDINVRPRLEHRVEMIAQKHC-AINLQAHNYLGLQGMFMSAAEDVLGADMTPQRFSAWSQALLFICRLVIER--
>tr|A0A0L0FUF5|A0A0L0FUF5_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_07147 PE=4 SV=1
--ICKPEELHtkdlgfivtHTNNPW--GstDEQDFGVDFFRDHADQ----------------------------SGLTSFFSSIVIIACEMYqefepSIPQLQKLGEEAK-HLDIPCHMEDNIVGYVASTLSR-SK-QFDAIEECAIFKLIWRVVLFVLE---
>tr|A0A2E9QYM9|A0A2E9QYM9_9DELT Nitric-oxide synthase OS=Deltaproteobacteria bacterium OX=2026735 GN=CL920_22905 PE=4 SV=1
--ALSS--MKEAKRLWEEGvgLHTAPGSEWVHQLVAERPEWNHFFAS-SDPE------------AFGEALFSTIDSAVHQLDDevsMFSSLREDSELFT-AWDVRACAFSALPDVLVDFVV---E-DHQTVGAQALRTFLRRVCTIVSL---
>tr|A0A0K0EIZ9|A0A0K0EIZ9_STRER Uncharacterized protein OS=Strongyloides stercoralis OX=6248 PE=3 SV=1
-VPLTERQKFLLVKNWKGISrrARDAGTNLFVQLLSEHQELGDYFI-FGNVKakDKYEMLADERIQNHGEAVMRILDSVITSVNDPQemfRILEEQGKQHAIKKNFKPELFREVEDALFYSIKLILDERYTDNMDSIYRIIMKTVLKTLE----
>ERR1719158_1160759
-------NKHLIDETMDRVanaNIAELGVICHKKLFSLSEDVQNYFYK--P---------NTMVAYILEKVLFILSNLSHEPVKIAHEIRALGMRHI-KYNIPPVHFPLFGKSLMYTFSSTLEGFWTDDIEDAWGSVFDFVCRCMTR---
>ERR1719158_1490032
---------------------------------------------------------GGQLSFICRGHSSRIN------------RNALRVRRsrI-TNRSHSNCFSSYT----------RCSISSITCASAWATCLLR---RL-----
>SRR5438270_3151649 
---------------------PQIVDRMYTRLFEVAPRVVKIFEG-KDPT------------KQL-RTVHVLRDSFDDLSALTPELEALGERHA-SWGVQEQDYAIMGPILLEAMAASVDPYWRSEYTTAWAALFQTVEDIMVR---
>tr|I2K200|I2K200_DEKBR Globin, putative OS=Brettanomyces bruxellensis AWRI1499 OX=1124627 GN=AWRI1499_0864 PE=3 SV=1 
--QLTREEIDLLRWSWRLVTvdddSTSLGGNTFnAADFSSYLFCIQFYNNFISMD-EKVVEMIPSIRHQASSFADVLNQAIGTLEDLskmQELLTNLGKLHARILGIERSYFKTMGEALIKTFRDWFGNNetfFPLILEEAWIKLYCFLANSIIQ---
>tr|A0A0R3PZJ2|A0A0R3PZJ2_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1
--PFTDEEKSELLRSWKVIeaQKQAVGCDIYEMIFNQL------EP-FLCVSIKAPKELHNKFRIIVICIVGRYEEELSSVNE------------------------------------------------------------------
>tr|A0A183UUV2|A0A183UUV2_TOXCA Uncharacterized protein OS=Toxocara canis PE=3 SV=1
--RLSPRHRNLIIKSWSKTNKSKIARDTFVELFKTSADIRSKFV-FGDV-PIKRLKQEDRFLAHCERFVAALDSVIAHLDEIGaviENAEALGKYDISAepihaamaKDLRNEHWRLFGDILVERIIENDTKqpSGGSEVHAAWKMLGQLLVFHMRLGY-
>ERR1719367_1435250
--------KTQLRSTWNVImsDMASIGVVMFLKMFETHPETLSSFIR--NVYSIKEIEmdewYQENLKLHAIRVMAIVEQVIHRLDEVgsvIKILMKRGLSHK-RLGVQRSMLEKMGRSFVLSIQSPLEEanKWDATVEQSWLSMFRFIEFWMGLVY-
>ERR1712004_299484
----------ILRESWKHLqsRIESLGVVTFLSLFNASSETLHTYLTPEDIATLKEQDkdkmLIEKLRVHPLRIMSVLEKTVHRLEDHqrcLKMLRQYGRKHQ-RFGVPPFMFATWPGVFYLYSSPYWKNlsNGMRTFHKLGKACFNSLHLEYRE---
>tr|A0A132A213|A0A132A213_SARSC Globin-like protein 2 OS=Sarcoptes scabiei OX=52283 GN=QR98_0035350 PE=3 SV=1
MTEFEREEIEVLREQWDRIVhyhQECFGMKLFQRLLQLHPEYRPLFG-FEE--TVEEIQNTQRLKAHGINVVYMLNMLFDNFDDmdmIDELIFKLVKLHM-MRGIDQIWLDDIIEPFELVLEEF-NAKIQIERIEVLRKAFIFIKNRMQELY-
>SRR4051812_15383594 
--PMTSDTIALIRASFRLAaaDPQALSQVFFRRLLLRSPGVQRMFPA--SL------------VRDPQRLVGLIDQVLRLLDRrdmLVEGLQNLGRLQA-PYAALPMHYPLIAGAFREALALRVGTLWSVDMEESWAELQALVIRIMGA---
>SRR4051795_1885912 
----------------------------------------ApRTAR-RRLQ----------PGQPGRRLAAdRAGrvgrGLRQRPAegprtdsrapavadraqarvaghrprpvrRRaRQPVLGHRRRAR-EGGHTGGRRRV----GRGLLADglCPGQPGARPLQRAWRAA-----GDGVAR--
>SRR5690554_337115 
-------YVKLLETSFQKAvenvGIEELSTRFFSRFFETFPETNSLFKG-TNIDY----FR----KFKMRVIFDFLIDIVKHPNYAEAHIAQEVMRHQ-MYGLqDKEYYFTLAACLLEAVKSALGDAWTDEDESAWNDILLVFKG-------
>ERR1739838_826584
----LFGSVWPLPLSWDIIShkVDQDGESRFLHKFESNQETEDPILQQ-FT-------QIDASIFNGKSAMIIVALTLENLENyqaLWRNLIRLGRDHF-GYGAQPMYLDLIGPHFVITIRQTLGYDWYEALEYHWLALFELIVYVMKFGWH
>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
-------NLGLVRECWDSICeqytTNELGEMVYDHLFKMAPNLTMLFTKPR--------------SYMAVKMGDMLSMLVSFADSsesMKQQISWLGLRHV-KYKIRPHHIPLMGPVFLAVVAEAAGVHWSQDTEKAWSVLFNMVCVNMADA--
>SRR5690606_39733342 
-TEL--YTLSLHDALPIWVAekIGDPTRLVYERLFAEQPEMETLFI--LDTD--------HSARGH------MLTEALNCIFDLlgQRayapvLIQSELTNQD-RKSTRLNSSHVKISX-------------------------------------
>tr|A0A0D2X3G1|A0A0D2X3G1_CAPO3 Uncharacterized protein OS=Capsaspora owczarzaki (strain ATCC 30864) OX=595528 GN=CAOG_004918 PE=3 SV=1
----RHETRDAIQSSWALAIqkhddHdvtpvATFVNILFAKLFEVCPETRLVFGH--DMV------------RQGKSLSSILTgmlEFVVHPKKLQSQVKRLAHMHV-GLGVTPDMFEAFGFSLLYTIRVRIGSAWNQQIERVWVDTYGGVSNILSQH--
>SRR5215208_3780459 
--PLSPEAISVVRATAPVVAahADQITAHFYPRMFAAHPALLRIFNQ-GN----------QATGEQSKALAGSVVAyAVQLIDPeapsFDHVMRRIAY-KH-VSLVSARSSTRSSASTCSPRSVRFSA--------------------------
>SRR5687768_12147577 
------------------------------------------------------------------GLAHARMDsVSLK--PpanphcaiktwvlacgvpartaeWRPMSN-L-SDAP-SPSLLSDQSLSV----VQ-TTATVVAAHADEITAAWSEVYWLVALQLV----
>SRR6476660_4664138 
--M-VVVGVDAHKrtHTCVAVDgsGRKLGEKTVPATT----------------------------VGNASALRWARSTf-GpdltwgiedvrnvsRRLE----------QELV-NAGQR---VVRVPTHLMARTRasartrgksdsidaTAVARAvpREPDLPVAqHDSVS--RELQLL----
>ERR1719193_1089955
-------------------------------------------------------LKRHRRNRHEGIRFQCNYCDYD----AgqkGNIKSHMDRKHP-EIPYDHTEFQEVRVEKSkysreakqqELDLAAmqGADAFNMNPLAGIGNMMPFNAHIL-----
>ERR1719378_1531842
--RFHPgaDGVHRIGGEESQ--aeVRRQRSLSLPKFLDSLSGEKEKFAFNFDSMgnVLPNFHASHAQKIHSMKIMDAIDAVISEIlrDHpIKQRLMDVGYAHY-ELHATSKDIRKLTTAFYKGVKDLIGIDDdNDRHLVAWKDFLNKIEEGFKE---
>ERR1719414_1806212
--DFTLEQIECISTVWANLRqsSADNGLYLLQHFYTLYPEEMQKFDFNLGDRqdFRLNFHRSQLVRDHSMKIMNAFDALISEIvhGRpVKQRMIDIGYEHY-ERDATAQDIRKFTKAIYSGVKDLMDADHdgprraaaghDDRHLAAWKVFLDMLAKGYT----
>ERR1712142_47027
--EFSGEELEYICSVWGNLRmnHPDAGLFLLEKMFLKYPELAKKFDFCRDFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKSHMRQMADGMLEGLKEVIGDAKdSTRKLLAWNKLFDMIVEEFGN---
>ERR550534_2245262
------------------RDlrHPLGLLLALH---------GGFLSFFHGFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKIHMRQMADGMLEGLKEVIGDAKdSTRKL-------------------
>ERR1719192_2788519
-------RREIIGTMWESFRedSVSSGLFILEHFFSTYPDEMDRFTFASGGQtdketPLAFIMKRERMRIHSAQLMNALDRNGHVYGRspgCMDQAPQSHRG-------------NVCRRTGKSSGIA---------VFKWRVA-------------
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514
-TNLTPQDKQIMKEDWLMINEkKTAVNNLLLKFFRSFPQAQAMFPKLAKVP-LSQLPSNVEFIAIVNSIKNGFKFVIDSADDVGLLRQLAGSQDISvftVPGIPVaQQMQETGRVIVEWVQEEMGDRFAERTRVAWIRGLRSISQAFVSGQ-
>tr|A0A0V1CPF8|A0A0V1CPF8_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16047 PE=3 SV=1
-SKFTDEEVELLARTWKKDDfdwLYRIGTDIYTCVFQLAPELKVFFPYVTECeKKNQSWESSKGFRTQALRFVQILGMAVEKTESrmkdddshLHHRLYKLGETHRRfaLKGFTPTHWKGFVIAVRVAMRRAVEAmpNLtpaeCETAIEAWDKLSRYVVHRMEEGY-
>SRR3954453_266974 
--MLTEKSRPVLEATLPVVgeNIGKIAERFYQHMFGEHPELLdGLFNR-GNQAEG------TQQQALAGSVALFASALVSHPNHLPdHLPPRLTTQTP-RPS-------------TWCRGSRT---STPRSAFART---------SIRS--
>SRR6478609_8547471 
--VlvdveevlrvvfgFDLPQTDVVRSvVLGNPgq----I--------IAVHKVDV----------------------AAGGRIGPQGGRVVPHPRDVClV-LRRVHPLR------------------------------------------------------
>SRR3989304_146361 
----------DLEASVQRIldRGKNLADLFYCVFLDRYPELRRHFTAV-DL------------SHQAALLTMALQVIAENHLRpspaAAEYLLVLGHRHH-AWGIERDEFRRLRFCSPPPPQPSHGKGGPAARPRQWRAAIDEAVDTMRAGY-
>HigsolmetaGSP17D_1036251.scaffolds.fasta_scaffold61070_2 # 263 # 457 # -1 # ID=61070_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.672
--VLNSIDEDLTTKSWNIVMsgtPtENFkakkldpcfhystslswfYDIFYKKLFELCPDVESMFEN---V----------SLVHQGKLLATVIGSALASLKKpiiLKKRLIALAQSHN-GKGVKAIHYCNMGLALFWSLEEVLGVsVMNEETRTSWVKMYSFMLNIII----
>SRR5215510_2422438 
-LQMTKEQIEVVQNTFNKVRPmsGTAAQLFYNRLFDVDPSVRETLL--WTLK------------QGlGADFTPEAEVAWGNAYDFLAAVMQQAAKGA-SMX-------------------------------------------------
>Dee2metaT_27_FD_contig_31_2132282_length_204_multi_2_in_0_out_0_1 # 3 # 203 # -1 # ID=1013462_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.592
-----------------------------------------------SAA------------TSNPQF----VAAV----------------------KKAIDYSGL--------LTVAGQGAVQPagiipSVIAGTLPAADALKQDVAG--
>tr|A0A068XSQ8|A0A068XSQ8_HYMMI Neuroglobin OS=Hymenolepis microstoma GN=HmN_000477400 PE=3 SV=1
--YFSEFEKDVLISTWEALLlyTHEHGAFIFRLAAEMCPELKAAYNV--EFNDDDELVISSCALQYSQAYITLIDEAIRSLEDPQEgfydSVLIAGASHATIPQMKPEFFKVLKRATLTTWEGLLGEEFTEDVANSWQTLLDYVVAVMVEGN-
>ERR1719193_549257
--IFTDDELAILKDVWAHLKhhTAGAGLTILDHFFKRQHWALERFEALRDMY-GNihpDYMKIDLMRFLAVDLMEGIDIFVTGFFErdpeVTDLIADVGYAYV-KKIIIESEIEIFVDSMLAAMEELLGEDtWK-KNMAPWKKLMPVVAEHFSRGFK
>tr|A0A0D6L5L7|A0A0D6L5L7_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_14144 PE=3 SV=1
-MLPASEVKKLVKSSLERVAigkepkEVQGAKDFYKYMFTHHPDLRRYFKG-AESFTAEDVQKSERFDKQGQRILLAVYILADTFDDeptFRAYARETVNRHR-QFKMDPELWSAFFTVYVNFLASRGP--LSDDQRKAWAQLGKVFD--------
>ERR1719254_19301
---------------------REIVDDFYPRMFANNPETKALFNPA-NQ------FEEPNRQRMALtnAVL-AYASNIDEPEKLADAVAIISHKHA-GLGIQAAHYPVVHKNSGLHRARHGR-rrdaGGRRGLERG-----------------
>ERR1719394_777503
------------------------------------------------------------------------------------AIRLGDFQHI-CT-TPLPFCRESPQVQALHHSILGPEVVTPEIGQGWSDGVLALAEILYK---
>SRR5262245_29633745 
---------------------------------------------------------------------------LGNHSTrCgRSVESSQSNSTA-DFLNSRRIHDAYSpaiRAAKSKSE-------------------------------
>ERR1719193_348913
--KLEQKDIRAIREGWACItaHpgLEKTGVDWLHLSFELQPGTKHHYKNFTNK-TLEEICQTPYMKILAGKYMSEIGILVEHLEHsnfVLMRLENLGHLHA-KMGVPMETLFT----MNIVMQHYFRELYsrqdvPDDCEGAWSKVT------------
>tr|A0A1Y5FEW2|A0A1Y5FEW2_9PROT Uncharacterized protein OS=Halobacteriovorax marinus OX=97084 GN=A9Q84_13980 PE=3 SV=1
-------------------NIDQFVESFYEHFFSLTPEIFELFKN-SEIG------------KQKNEFKISIHTLLINLsqlDKLDSYFKDLGIRHI-CYNVSERHYKLAKESFLYAIKKTYADHWSKVVETKWEEIIDHVTLKMKEG--
>ERR1712238_458974
---------------------KELIEMTDYPTFDVEGVVLCFL-------------------------------------------EWEHHKHE-NIMTFRD---HAYKALMTG-------TMAPLHHTPWKDALEDTIESYGLA--
>UPI00054DD732 status=active
---------------------------------------------------------------------------------------LTCARDF-FltfVGVERCR-PKLLKQEPQTITSKLGm-A-PMLQSAFWSIRVMRIASS------
>SRR3712207_8863908 
--FFFQ---------AEDGirDIGVTGVQTCALPIYARPDLLdGLFNR-GNQAEG------TQQVALAGSVAAFASALVKTPEQLpEQLLNRIRSEER-R--------------------------VGKECRSRWSPYHX-----------
>SRR6476659_5675031 
--STHRPDQALRGGGRPPHraADNNAKGAATGHRVSGRS---SPAEL-PENSMR------EQQQALAGAVAAFASSLIETPERVpQSLLSRIAHKHA-SLGIRPDQYQVVHDNLMWAIVDVLGDAVTAEVAAAWDEVYWLMGNALINQ--
>tr|M3IW96|M3IW96_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) OX=1245528 GN=G210_5766 PE=3 SV=1
--SLGPVELTQIISSWSKIRnKSQFHQSLYTNLIESNPQIGKIFNN--ND--------KNVISQHALIFGDCFNFVVENIQDnalLDEFLFSFVQENQRFANMATQYLEPMGNSLIRTFRKSLGNNFNSVLELMWIKVYVFIANSILQ---
>ERR1719502_1452556
---LPPEQSALVRRVWQRLVgTPGAAPILVRQLQSVAPEVAALLS-DA--S-STNGRSNinrGGLhavhtdpHGRAAAVLSEVSELTELLDDsaaLRQRLRQLRAR---MPPVGPEVYPSVGKAFLHFVWEGVGSGYDNATAAAFAALWDQVEETMLE---
>tr|A0A1X6PD63|A0A1X6PD63_PORUM Uncharacterized protein OS=Porphyra umbilicalis OX=2786 GN=BU14_0103s0020 PE=3 SV=1 
MGALSDDTVRIVKSTAPVLkvHGGAIVDGFYALLFEQHPAAAAYFNVVPTDGgGGGGGGGRGQSKAQIQRLSMAVllyAESIDQLDTLGPVLERISAKHA-SRGIPAEFYPAVGACLLQSIGRVLGDAATPEIVGAWGEAYGFLADALMA---
>SRR5580704_1734515 
------------APRAELATgvAPDYgSPDDVASRRSQSRACRRTLR--RPTT--------------GAVRGEMLARVIEAILDFIgerryahHLIQCEVVTHE-GYDVPPETFGIFFGVVATTVREQLADAWTDAFDEAWRTLLYDLD--------
>SRR5258708_241677 
------SCGEDPAGSSD-----DHDAD----VVASAGQVEGGVD--LVEH--------------PPALGVPIAAPCQWLVDLEgagacaaNRMAAERVNHE-GVGVPPAALARFFPIVAETCRDLLGEAWTGEIEAAWAGLLTRLA--------
>SRR3954465_11422119 
----PCRSSPTTSGRSPGAs-TRT---------------CStAtRGCW-TGPStgatrpR----------APSRSRWPGPSRsspaHWSRSPSRSpSTCSpgSRTSTTHsasprpppP-PPPPARAERGVVQDNLFWAIVDVLGEAVTPEVAAAWDEVYWLMAYALVN---
>SRR3712207_885952 
------------------------------------------LGR---------------------------GlladGLRAHPPGAgALQR---------PRRAAGDGVAGVggRRGENRERGRREPPPAAGAGTPGVDRAAPPGRCRPGT---
>SRR5215467_2668635 
-----------YLHSFPT-rrSSDLPPSALYRHLFTTRPELLDgTSNR-GNQAD----------GNEQQALAGAVGafatALVNTPDRLpENl-LARIAQKHA-SLRITSRSNRLSGQGPIAPL---TEDQ----------HPX------------
>SRR3954465_6877418 
--AtaaaTAAASSTDIRATRPASleG-------------HDRPHLDTaEAGR-AQLAD----------GEGDIEVGGVDEvvatqHLLRLHERAvGHlgpPTDARRGAGR-LQGVAAEELGTVRLDLDGELVVRLHDL-----VEDLGRRRRVLALVLVD---
>tr|A0A183INM6|A0A183INM6_9BILA Uncharacterized protein OS=Soboliphyme baturini PE=3 SV=1
-VILSNYQKTLLRDSWLRINktgIRNIGTMIFRRLLTKQRSIKQLFQHITVLEGvfSAGLTPIQAYQHHSLLFVELIDNAIKNIDDLsvlIPTWIEHGAKHARfkAYGFEIEYWDMFGSTMTEAAREWEGWRRHRETIRSWTLLISFIVDRLRQGY-
>SRR3954463_14455484 
---AQ--------------------------PRAARPSALRLSRP-GDGAP--------------FLLRAEvACLasGI-----g-----------TF-GPGLRSHPLARLGRS-----RALRGRAVLArCPPKIWSPLD------------
>tr|A0A1I8CQM9|A0A1I8CQM9_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=3 SV=1 
MNKLTEKRCDIIKETWEIYKqdGINNTIKIFFHLFTEHPEYKYIWPQFRGIPDS-SFILSSALRNHAEVYTAGLSIIINNMHNkakMYAHIKKIAYAHV-KWIIHQSHVQNMVPGLMMVLKDKVPH-FDDSIEDAWKTLYGVIGSLLE----
>SRR5258707_573086 
--------------------------XMILKSFKPNAAIGC-K----TIPT----------W-----FVP-LPTFTAGLTLPKLyplSVFGMRRYN--LGGLGEPH--QVEAALLWLVEKQFEGVLTREMRQAWVQFCQWLV--------
>NOAtaT_7_FD_contig_111_1754_length_212_multi_2_in_0_out_0_1 # 1 # 210 # 1 # ID=13324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662
--RLKPKDAEYLQDSWKVFlErsggLEGAGKEFYRLLFEKEPDLKKLFQV----P--E--------MSQAAAFMRAISRYVSLLAQpeqLKTAIEMLAFMHV-NLGISETSIFAFAESLLECVEDQLHDWDpgeVEQVMVLLTDLTTYIGRVIA----
>SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold554780_1 # 1 # 420 # 1 # ID=554780_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.669
--VLTSSIYlttgTVVTDFSVIVlDaegsAIEPGEAPYSLRVYFTPASTGTstatIQL----P--S--------GLISDgMLAVGARRLQEETINprrLAGACEAYGATVTSnvlTVNVrksgTASDPCDSTDAISLLFAGGMATWNslgTSVTSADFtmstnvdsdsvTYRLTFEENVFL----
>SRR4051812_4293204 
--EPLAAEQELLGQTWSDDFefLYELGASIYQHIFNTIPETRQLFPKIPTINNG----RwceSKEFRAQTLRFVQPLSFAVNNRHDierVAEHLFIIGVKHAKlvERGFRA----EYLDCALVSYFLKIFKFkyFIv---FIGFRT--------------
>ERR1719295_1797159
----------NIHVTFDLAltsDPKGFAENFYKGLLKEQPDIGQLFLD-----------KNTTFDTQSARFMAMLMHAIKMLDDtdhFTQSLDSLSEAHV-GYGVEVPMLDAFGKSLIAQVKVmnikyfeeqakggggggdekdeSLdimRvGEWTKKQDDSWKWFWSVVVGVMSAG--
>GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold789473_1 # 1 # 552 # -1 # ID=789473_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.562
--RIPPLKGSSLSAGWRTASSsgLS---------------------------------------------RNPRGTVSR-----ESGNTVFQSETF-AGAASPRGGSLL-C--FT--GENEPMGMINNLKT------------------
>ERR1712012_1094824
--SLTTSDIAAIRQSWILAkDaapFEVHGPAFYKLMFETYPSWRFAFNHMGGHLSIEVQIENTRFVKHTVTVFRFIDKCVNDLDNPtqiLENIKMVAKIHA-LQGIGVKDFIIIKAFICSKSD-KVGAGRSKNSFIFFPRFL------------
>ERR1719232_197721
--SLTTSDIAAIRQSWTLAkDaapFEVHGPAFYKLMFETYPSWRLAFTHIGGHLPIEVQIGNSRFVKHTVTVFRFIDKCVNDLDNPtqlMDNIKLVAKIHA-FQGIGVKDFVIIKDVVLNYFSTALGPALTDAAALGWSnfmDLM------------
>tr|A0A085MKY1|A0A085MKY1_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_01110 PE=3 SV=1
--------ASIIKEQISKIEvNEENGGKLYEVFFTVKPEFHKFFD-LKHAPEGKDVAHNQRFKTLGKLFLEKLKRIVMACEDehqLKEEIKGLKMDHD-PRHVGLTELKGAKPILMKFIEQQVG--MTEEQKHAWTEMFKKF---------
>tr|A0A183IBE5|A0A183IBE5_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
--------KHVLMEHMKRLNlTNKLGGKFYHQLFQSlPEAKSQFA---EHFDKLEDVENMKYYQQLGHSLLSLLKELPEHCDDdhaLKQEIMKIKKKHD-EKHVDAKMFKKSKPAILKFLTDNTQ--MTNEEKEAWDHLITHS---------
>ERR1712025_717817
--TLSPEHVDPITESAPSGKakGMVIANNLYRKLFSRHEMFRAMFPE---QS------------QQSGKMIQALPSALydfavncDNMGQMQSVVARIANRHV-QQGVQGFDGTFQFIPKKVDLsliPAGQCEAKLKVALNARQPGtgvgdrFQLHPSEVC----
>ERR1719495_824226
------QDIENVRKTWEKMIakheLQGVGLVVLTAWMNEHKEIRQVFAK--SFPIIDklekdvldlVQLNDPTLNEHATIMASSFGKMIECLDDteFVQMMIDIGKKHT-GFRVSADSFDTsLNSTLITALMALSEEKEDSPNIKSWKTVVEVMKHYLKQ---
>ERR1719210_734039
--HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VFELKANEDLNRHGMYILGVIKKIVGKNDDteyLEKLFDDLSDLHR-RLGVEASGMDIFGKVFCKVMRPILLEkkKWKPEIKDSWMTFFSSIVKVMKK---
>tr|A0A2T7P177|A0A2T7P177_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_12319 PE=3 SV=1 
-----------ITRSWKCFYekVCSFGVYEFLNLLTDLPEYEEAMRLI-KLTSSYKFLSAMDFNAHFLSMLTIIEKCMARLevDDlplLEDILHKVGTDHI-GRGVNPENFDLVIPPMVAGMKQMLEDKWTEKEDIAWTNFFTLMIHIMQE---
>SRR6476620_7243483 
--MLSDTSLPVIQATLPVVgeHIEEIAKRFYKHMFDARPDLLdGLFNR-GNQADG------RQQQALAGSIAAFAGMLVDKPDEVpDHLLSRVAHKHV-SLGLSPDQYQIVHDHLFWAIVDVLGDAVTPEVAAAWDEVYWLMGNMLINKE-
>tr|B3RTB3|B3RTB3_TRIAD Predicted protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54902 PE=4 SV=1 
-----------------------------------------------------DLIKDPLVRSHGLRFMKAIETMLEIeFDSngCIFLFSAIGNRHC-SYGIEADYLDYVPQAFRFMLTKALGNNYTDKIASVWDEILSHIIKAMQDKV-
>tr|A0A2G9TV92|A0A2G9TV92_TELCI Uncharacterized protein (Fragment) OS=Teladorsagia circumcincta GN=TELCIR_17315 PE=4 SV=1
-------------------------------------------------Q-KNSSSNKQAHRKT-----------------tsdTHQDL-RRTRDQP-CEKCPQSPRYHMLEPVLAVVKE-CNDDIDDETIQAWTTLYLIIAD-LIEIY-
>tr|A0A2R7X9G6|A0A2R7X9G6_ONCFA Uncharacterized protein (Fragment) OS=Oncopeltus fasciatus OX=7536 GN=OFAS_OFAS019380 PE=3 SV=1 
----PPVDINAVQKSWNGIKsslgdkaPEAVGKLVFENLFSNYPYMLEFFKNYGET--KEDILNNKKFMFHAKeRVFKTFDKTVNNLGNeaeLNNIASWLAEVHV-SRGIKPPDF-------------------------------------------
>ERR1712018_1077981
----------------------LIGCQSFQAFFDRSPEILSHFDKFNAIEI-DGVLVSSALKMHSSRVLAIVEDMVENTGNpekIRTILQDLGRNHY-RQVKPILMhFLX-----------------------------------------
>ERR1719199_1665450
--------KPMIRECAAKVvqmDIVELGLRFYVHLFTINPAASAFFTKPKWMI-----------SAIFGGVLRFYVHLFTINPAASAFFTK-----------------------------------------------------------
>tr|B3RTB2|B3RTB2_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54901 PE=3 SV=1 
-------------------------------LIKLSPATKIYFHGV-DFEkRDSYLAKNTFLRNHAARFMEAINVIIGQdMDIfsVESYFRVVGSKHH-SYNLKLEHVQDISDAFLEMARNALKKKFTKSTEAAWRSFFQMVTDAIKN---
>tr|A0A1B6G4Z3|A0A1B6G4Z3_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.45438 PE=3 SV=1
--RLDDNEMELIREGWKCITeSEDN----FRTAFSSKLaqknLAKVHFKHVENVSITDEGFSHEFLMSHSVDVMNTMHLMFNDIRNPeswMPEILRIATLHK-LFGVTLEDLKRFRCCVIEVLQQCLGEdGYTPQIKDVWDRVLECIEI-------
>ERR1719383_1602644
------------------------------------------FGLH-L---------------------QSTMLVGNDLDpvdERG--PDHCQQALW-TASE-GRTLSHRRREPCRSVLEVLGEdVVTPEIGGAWREAVQALAKILID---
>SRR6185437_4905046 
---------------------------------AENPEMEALFVR--DTA--------AL--VRGQMLAVVMEGFLDFVGDqdYsARLMQIERVNHE-GLGVAGRAPRHCGAAGGRSLTHFPGKP-------------------------
>SRR5512135_1032698 
--NMDQETLSTVDASLQRCNRdSRFLDLFYEKLLASSPKVREKFAH-TDFV------------RQKRALRSSLWMMLLVAEdeEkgPARYLRGLTAIHGSsGLDIGAELYDFWLDSLLETVAVCDP-EHDAKVNAAWERVMMVGIHYMCTHYH
>ERR1719336_1989132
------------------------------------------------------------QDRKGGgGTPGKLKVTAKYNDGtefVDefntvifaigrdactakmgleGVGVALNPKNG-KVlhneler-TSVDNIYAIGDvldgkpeltPVAIQAGKLLARrlAGTSEVTTDYVNVCTTVF--------
>ERR1719278_462770
--HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VYELKANEDLNRHGMYILGVIKKIVGKIDDteyLEKLFDDLSDLPL-LLLQQDRPHHLAKNLPKNVHSGSLYAeppvkvaEVVEELLQVLCV-VDLPHNLL-----
>ERR1719210_1454089
-----------------------------------------------------------------------------------rrclgyacf----ASFHKSQ-TIlklshdrdrferqkknPQQSSSFRRCGTsmgqsesslTAANLTQAPTLRpaEWDPNMYQSL----------------
>ERR1719284_537611
--------------------TEEIHSEFQSLLLQHNLELLSVFNI-PRQS--------DDVIDAEteeiasHHLAGVVLAFAAHVGHVQRmrELDQLAAKHC-SHNVHPFHYVVLHEHLLDAMRKALSTMLTPEVQYSWSQSLLFFAKILID---
>SRR6266536_2537548 
-APLSGREREIAMLAAAGLASKDIAERLYLSVRTVNNHLQHAYTKLG-VSGR------AGLAEQEIKFAEKLTEIVramPRLDELLTHTRALGARHV-SYGVRAADYQTLGNALLAALAAVLGGSFDAPTREAWTLAYNLVAETMLDG--
>SRR3954465_13942299 
-HPLTGREREIAMLAAKGILSKDIAARLSLAVRTVDNHLQRAYTKLG-ITGR------DQLADVLAHDTTTHPGPX-----------------------------------------------------------------------
>SRR5699024_12637729 
--TLPKGDHPLV-----LVsaGIGCTPMVAMLHRLVETA--------------------------------RERQVLVLHADHTpEEHAX------------------------------------------------------------
>ERR671932_89059 
--S-PTSCGPARACRSCCCtpTPPRRRSR------------YDgVHEG------------------------LMDLSSFPLPDD--ALFYLCgplpfmravREQLL-DLGVSPRDV--qyeVFGPDLWQADAdeGPGDAPEPgahdllgpEERQGPPPA-WSRPG-------
>SRR3712207_7345787 
--V-LDDVRALPNATVHVWyeSGAASALP------------VDgVHAG------------------------TMDVRSEEHTSELqSRQYLVCrlllekk--KTI------------kyeSTXX-------------------------------------
>KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1083625_1 # 3 # 881 # -1 # ID=1083625_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.686
----------------------------------------MEYEI--------------CLEPSGIRFMADAGQNIVEAAKqhgIpIKHGCASGScgDCK-GTILsgDSEQGPFMPLLLLPTERAA-G-------MAILCKLYP-RSDLRL----
>tr|A0A044RBY2|A0A044RBY2_ONCVO Uncharacterized protein OS=Onchocerca volvulus PE=3 SV=2
--ILSEIQQELIRQSWQTISgklevtEQCFGFFVYRRVFERNASLKQVFHV-EEYDSLESVPNEHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAaesFNEETVRLFCSQVVCTVVDLLETDIDPSCMEAWIDMMRYIGCKLLDGF-
>tr|A0A0R3RKB4|A0A0R3RKB4_9BILA Uncharacterized protein OS=Elaeophora elaphi PE=3 SV=1
--ILSEIQQELIRQSWQTITtklesnKRNFGFFLYQRVFKRNSMLKRAFHV-EEYDLLESVPEKHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAeeyFNEETVRLFCSQMVCTVADHLGGNVDPACMEAWIDMMRYIGCKLLDGF-
>ERR1719384_507171
------------KKCWNELmkDKVNVGERIFDYILTKEISMSKLFMQ-------------TNIEQQSGIFMVMMDKVVGFLDDkesMNDNLIKLGQLHVEKYGVKTKHFKHFRAAFLKAIKKYLP--WNDRREEVGSSFGLELLIKCRC---
>WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1216141_1 # 2 # 73 # -1 # ID=1216141_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.347
---FPDGVCMATIELTVLPvRpled-----DEKFQIILSEAQGGASFNPNDD--------------G----GKDDGvlTIVIKNTLQDpkgLKVLVESFGFQHL-DFDLTVPRVVVFRDSMVELMEAELQDRFTYKAKDG-----------------
>ERR1712214_179591
-------------------------------------------------------------PGHAgRREGRRSARQPGTGKDRqksTKYLLELGKFHR-FSGIPNDYFGVMGTIFVHAVRPYWEEagCASEQTEVVWMMLFAHIARVMTH---
>ERR1719458_2209728
--HLSDEHKTLVIDSWDFVPgfISEAGYKAFTDFVKLCPYYAEAFPFVKKKEEEF-SHLLCEHARKVTGEFGLLAKLISELKTkppeksndqvIHDIMVPLGRRHV-AF--------------------------------------------------
>ERR1711928_171062
---VSATQESHP-------------------------------LDLDSHE-IQQQRRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFET----LCFRWIQHD-----------CQQYGX---
>ERR1711928_123369
-------------------------------------------------------RRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFES----LCFRWIQHD-----------CQQYGX---
>ERR1740128_75568
---VTAQEKTLIRATWDQMMfNSEVAPKFMLRLFSEESQHELGgnFaVEHHLVP-GGadegLLLGSNDGFSNTLDVRVG-----------------------SHlLGNDAi-------DVVHDVFQCFLGGSIGRGDlfnglHHNMGRFVQLVDGX------
>ERR1719219_701605
---VSAAHKSLTRSTWTLMKfNSNVAPKILYKMFTTYPET-QKMyTRLADIP-ASQLMENKQFLALSHSAFAGFNMIVNNMDDPELIKLQLSKVDFPGtFVYPFpgtsLNTSKPPASSWKYSPKN-SAPLSPRKPLPLELPFELRHQGFGK---
>SRR6476646_9453568 
--PMLRTRLQLAEASYHRCAeSGAFYNTFYTHLLASDPRIPPMFAR-TEF------------ERQHRLLKHALGLLIIYAKHAnPAMLERIAQRHQ-EIGVLEDLYPAFVESLVLAVAEH-DPEYTPELADAWREALAPGIAFFIKRH-
>ERR1719347_2568912
--------------------------LPPPTHFLPLPGINRKVRIFQRQFgnQTSEFLTGKALRDHSIRVMDALDSVIVDTlKgkDIHKQMVDIGYSHL-KMGVEPRQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED---
>ERR1712189_147645
----------------------------------------------KPDF---RIPDWKSTPRSQHQSHGSLDSVIVDMlKgkDIHKQMVDIGYSHL-KMGVEPKQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED---
>ERR1719412_2466027
--NLRPLDVTNIKESWHSVEqqLVEVGIRVFISLLENQPNIKRTFRKYRSKR-HSELRINEDLQKLILYLICGLKRVVKYLNDnkaMGKYLRRIVKKHS-PTEIDFTRINpaELSTVFCSAIKDIVdahqaasaklqsvsetsspectspSTCWTIEVEESWTTLFGSLLNATR----
>ERR1711860_392201
------------------------GVHVFLVLFESQPQMKRIFRSYRGKK-HSELRLNEDLQQLVMYLISVLKKIVKYLEEsrtIVKYLRRIAKKYS-SPSIDLARFDphILTPIRVRRRHLFSresivfekRLKWPQK---------------------
>ERR1719266_3067024
--QLAPNDIANIQSSWTLIEpiLLKVEMAWLLLFRHIAGFMRNGYNSVV----TGPL--------------------IRHTTNcatS--TSSRMSNX-------------------------------------------------------
>ERR1719264_357726
--EVGLCDALNIQQVWPRIEqyLLPVGTRMYISILDGRCDKIIFCNKACCRKNasksssakstrsvysksvsrtcPNQVILNEELQKFVLLLMGLIRRAAKHLDNpshSAKVIRKVTKKrFG-KLNIDVTKIAfePIALNFIASVREIMtnTRHWNTETEASYYTLIRNLIAYVQ----
>tr|A0A2G2R4B7|A0A2G2R4B7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_07540 PE=4 SV=1
----------------------SASDKFYNVLQNDLPEFTQLFTN--PE-------------KQHMMFYAALRSIDGLKDNktkLAVYLRSIGVKHK-MLGLTHYHMEIGRNAFEQAIFA-GGKDLTHDQRQFYIDSFSQIEKNM-----
>APLak6261687352_1056175.scaffolds.fasta_scaffold62437_1 # 2 # 238 # 1 # ID=62437_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.447
-VRFPKDVIEEAQQAWMSFtmasTKEAAGEALYSAIFHAAPSLQSLYKIPR--------------PTMALRFMNSINAAVAIAHRpsaLKAQAEALGFQHF-DIDVTPSRGDIFREAILEVLDMELGSRFTTRARMAIGAILNYLIGANI----
>GraSoiStandDraft_15_1057317.scaffolds.fasta_scaffold2262553_1 # 37 # 405 # -1 # ID=2262553_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.610
-LQLSQSELFALGRSFELLlqglgnDRDRVGDAIYGAKTANLVVFKDKFITPR--------------AVLSLALFNGFRVLGHKSADpeeLRLFVETMAFKHL-GLDITLQRVTGVTDSFLELCQQNIKD-MPPGSLLAWRKLMTYTGSCFR----
>Go1ome_3_1110792.scaffolds.fasta_scaffold06098_1 # 3 # 227 # -1 # ID=6098_1;partial=10;start_type=ATG;rbs_motif=AAA;rbs_spacer=15bp;gc_cont=0.524
--VLSAGELAAARAAWDLMKDnVKVAESALVKHFVLHPPVQKLIPALADVP-ISELQGTTCSTPSPTRRC--ASPTTX----------------------------------------------------------------------
>ERR1712142_1087278
INALTETEVKVIIDSWDRIHPDKGAKMLFHQFLTDFPLMKIYFG-YQETESVAEIMESEQIKTRCKVVWDVLTKIVHASGDggkLAELVKEVSVKHL-NFNREKKDI----HCFLHALKVTLTC-FSGHLFRPWNIWCKMV---------
>tr|A0A1I2S201|A0A1I2S201_9CORY Uncharacterized protein OS=Corynebacterium spheniscorum OX=185761 GN=SAMN05660282_00995 PE=4 SV=1
--------------------SGHLEPELQLQLYARHPNAQWLLRAG---------------KAVPAELVELSIHAIAAADAegaldalAEARIRDLGLAQR-RFGFPSELYQDIQEIMVSLLRTTGAD-LPFPVEFAAERTIARVCVLLQE---
>tr|Q8NLZ4|Q8NLZ4_CORGL 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases OS=Corynebacterium glutamicum (strain ATCC 13032 / DSM 20
--------------------AQDFLRAVQAKLLTLAPQARGHFPTA--D------------DATHISIAEMVSALLEGTGEegkvddkTLEFFKEAALDAR-RFGLTPEMHSALGEAVRSELLSLCED-LPFENVLFAERAIAATTAVSVE---
>tr|L1MAU4|L1MAU4_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium durum F0235 OX=1035195 GN=HMPREF9997_02488 PE=4 SV=1
--------------------PDLFRTLAQRYFLDDCPEARFLFPTD--D------------STAHADLAAALIFVFNHSNAdgsltpkLVSILEQLGRDHR-KFQVADNHYERFGNALNRALKIVGAHAptYA---ITAAEKAITATLETMRR---
>tr|W5Y4C7|W5Y4C7_9CORY Putative oxidoreductase OS=Corynebacterium vitaeruminis DSM 20294 OX=1224164 GN=B843_11695 PE=4 SV=1
--------------------REELSAIAFDMFFATQRDARTRIRA-------------------TPAIADALTLLARSCDSegklpldVEKRFLQRATTLC-AHGLRVDDLEPLAESAHRAMLITAGG-QPFELVLPIERALQQLARTVVE---
>tr|A0A1W1UZL1|A0A1W1UZL1_9CORY NAD(P)H-flavin reductase OS=Corynebacterium glucuronolyticum OX=39791 GN=SAMN05660745_01670 PE=4 SV=1
--------------------SPEFHEHVRANFFDKCPETMLVFPLH--K------------ENVHADLGRVLSFVFDRTPVdghltdeMRTLITQLGKDHR-KYNVSPRYFHPFVECLRDSLLTLCSD-LQFKYLNGADTALGEVSTLLAR---
>tr|U3GX34|U3GX34_9CORY Uncharacterized protein OS=Corynebacterium argentoratense DSM 44202 OX=1348662 GN=CARG_08960 PE=4 SV=1
--------------------LSHFGDLAHSALLRRAPGLIS---FF--G------------PNPHTELTTAVLFILTHSTPgpqdsgtqtplspridaaGAGALRALATEHV-AYMpPDPALYLAAADALCEALRDSCAD-QPFQQVLAAEKALREACSLMAT---
>tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae OX=1717 GN=mphP PE=4 SV=1
--------------------VTAHSIQAVADElraHRAEFIQAANQKP-------------------DSPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD-LPFAEVLGAERAITSIANTLT----
>tr|C0E6D0|C0E6D0_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium matruchotii ATCC 33806 OX=566549 GN=CORMATOL_02563 PE=4 SV=1
--------------------GDGFSREVFTTYFRYVPDAQLIVSP-------------------DYPLGDALVGLFHGSDNegnlypeTIEHLRDVTEILA-AHGF--RRYRPLADAISPVLDRYCLD-ISAYDVFIIKRAVRQAAEVMDE---
>tr|A0A0G3GTQ0|A0A0G3GTQ0_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium epidermidicanis OX=1050174 GN=CEPID_01535 PE=4 SV=1
--------------------SPAFRRDVLRDFFSQHPHMRLKFAAN--E------------DHAHTELVFALTYLLENPTD-PELIRTLARDHI-KVSPGQEVVADFFAILHRQIHRYCAD-LPYEEVRQADLKLQEIA--------
>tr|A0A0F6R111|A0A0F6R111_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium kutscheri OX=35755 GN=UL82_09495 PE=4 SV=1
---------------------------MVASHfYADVPLARLSFRL-------------------QPSLVDTLIAGLSHP--lNITAW---AHDLA-HRGVDRSFYVPLSAALQHAVCHICSA-LPLVDVLAVEHRIDQIMKQLLA---
>SRR5580704_16882803 
-------------------------------------------PG--RH------------GCAAPAFLPGAQPYRRCPRgpegPRQPRALSAGTRAR-APKFGERHYEVFRRALIATLQRFAAPRWNETAKHAWETAFNHAATVMIE---
>SRR5690348_1231357 
--------------------------------------------------------------------------------arapevrrPRAPLRG------G-QAGADRHASAVCRAELEP------------DRQARMGDRVQPRRRIMID---
>SRR6476620_5060594 
----------PAQVSFWLLEpvADAAMTYFYAQLFAKATWTDREVY-----------------ISGPDHMIVKTA-RVLRERgapdRLIHYDLD-----------------------------------------------------------
>tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 OX=582737 GN=TSPGSL018_8354 PE=3 SV=1
----SSKIITLIEKSWAFVEsrcdLMEVSNKFFERLFQRAPALQNMFTKPK--------------RVQYVMLAKALDLIVRSAGEtkvMNEDIKAIALRHI-KYDIRQEHLNVFGSVLVETLANSVGPeNWDEDISAAWASIYGNIAAVF-----
>tr|A0A1Q9C6P6|A0A1Q9C6P6_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene41206 PE=4 SV=1
---CVCDLAQCRGRSWAAFFvdi--------QAAYYETSRS--LLFEGP---S-----------QDP----------ALVALQLpahVQALISDGALQGL-GI--PQEHIALLQDCvecsfwtftgqtqqvmatsgsrpgdgladvlFGALFAVILtcLEAKCQQCGLVHQSMSDALGVPDR----
>SRR6476646_8240181 
-----------------------------------------------------------------------------NINLLF-ALNRHTCPNL-I------------HEPASEFfFGLQRPATH--HEHIRVENIHHL----IK---
>SRR5688572_19725352 
-----------------------------KNLFELNPALRPLLPE---STAE-----------QDRLLTRLLNAEAGALAGTRPP----APRSAEGHGNEgTAPCSVAGEALLWTLQEAYGADFTPQARAAWEALYRFVTGTTKSAP-
>ERR1719229_1707680
---------------------QQLGVLLFANLFKKQPLCRNLFAD-SDI------------SKQSLRLLDMFGWLLRSLVKeknqMrLRTLKSLGDRHV-KYGIKIEFFGPMLDSLSDALQDWFGTNYNTQTRVALTTLFQSACNEMMKQ--
>SRR5512139_12076 
------TDLELIEASIEQMlDlETEIIGDTYARLFAHCDGARALFGP--NTYG-------P--RAQ--MVN---ETIIAGLDLLrgepwvHEYMTQHGVRHRHSYEVTDAMYRTYAESLLGAIRERLGDRFTPELEAAWS---------------
>tr|A0A2E3FAX6|A0A2E3FAX6_9RHOB Uncharacterized protein OS=Rhodobacteraceae bacterium OX=1904441 GN=CML69_02715 PE=3 SV=1 
---LPNENLELIRHSFPLIFqhKAEITTKFYEGLFRDAPELRRLFSK--EMNVQ---------KDMLVSVLTTLAKA--SFDEglVESMIARMARVHS-GLGITSGQFRTGEAALLSALDQSVGDLLSETTLDAWKTAVRRVISAMID---
>tr|Q9NAV7|Q9NAV7_9ANNE Dehaloperoxidase B OS=Amphitrite ornata OX=129555 PE=1 SV=1
---------------------RTYAQDIFLAFLNKYPDEKRNFKNYVGKS-DQELKSMAKFGDHTEKVFNLMMEVADRATDcvpLASDASTLVQMKQHS-GLTTGNFEKLFVALVEYMRA-SGQSFD---SQSWDRFG------------
>tr|A0A0G3G1X4|A0A0G3G1X4_9GAMM Uncharacterized protein OS=Thioalkalivibrio versutus OX=106634 GN=TVD_07385 PE=4 SV=1
--------PPNVESSYRRCcADASFLARFRLALRAADGQVSGIFDP-LSA------------RQQEVMLDASIRAALDFSSGdpqGASRVSEMIHVHGRQgrVPVPPALYPVWLESLIQAVRETDP-HWSDALERRWRAQLMPAVDMFVELYL
>ERR1719187_3161387
--ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFINdFIPSQMADKATPNTIKAWEKFMTVFIEHVKE---
>tr|A0A2E0SIT0|A0A2E0SIT0_9PLAN Globin OS=Planctomyces sp. GN=CMJ46_04905 PE=4 SV=1
--PVSMTIVDSVRESYARCrQNPDFFDAFYDHFARKSSEIGPLFSN-TDMQ------------KQNELLSDAIDSLISFSEGdvaARRHLDEIALSHDReHLNIKPEWYPLWMEALRDTIHESDP-GATTQLLADWNTVLQPGVNHIVQQH-
>ERR1719487_109746
----------EIEISHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALAMRHI-KYDAPPEFFGDFESAMLKTLAQSLKSTFTEAVKEAWKAALQFIASTIV----
>ERR1719327_803055
----------EIEITHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALATSTC-ASSGSRLA--PRPSSTATSI---GRSPFRCRX--------------------
>ERR1719356_1095802
-------------------LMRDIPNTIVALFAI-TVAVfeddySSMLDQ----P-------FlliAVLGFVTLTvilLLNLLIAQLNTTYV-RIYQEVFGWALI-TRGNQIVEV----LD-ACPMS-VWKPFLETLGLDERLE---FNEGDIG----
>ERR1719326_1696685
--------------ASSTQikeLFADVDLS---------------IHA----P-------Ifa---------sTLQSTISSLNNPTELLPLLEDLGKKRI-KYGVQEEHVVAASASLIFTLK-SIDDQWSPQVEAAWTEACNVMQNVAS----
>tr|A0A0N8ALQ3|A0A0N8ALQ3_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
---------------------------TKARLN----NCMLLFSE---------K---LAAFLaQASPSWPVWNVVIHPCfs--qelMANQLNVLGGAHQ-PRGATPVMLEQFXXXXSPPSSSSSSRKP-PASRNSSPN--------------
>tr|A0A0P5ANB1|A0A0P5ANB1_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
---GGNDGVETVSDQSNLFVVfAI-FGQGIDGNASEFDEVLLGAGSLLEEL-DEDGGNDGVAVTpDVFPaglniadlVGGQFSLGISQIfgflevlgdASdqsAHTVLPGLSGL-G-VEGAAQRFSKDFLSDVTELLEHDGVSSFNAEARQAWKNGMRALV--------
>tr|A0A0P5ESR8|A0A0P5ESR8_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
---------------------------------------------FLEDA-SELLEHDGGSS----TGFMGTTESVQLVghqllaeqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGNIGELAEHCLVL--GVGLDEA-EEDLGSDISV-L--------
>tr|A0A0P5I7S0|A0A0P5I7S0_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
---------------------------------------------FLEDA-AELLEHDGGSS----TGLMGTTESVQLVghqllagqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGKISEGLEHLLVL--GVVLDE-TEEDLGRHISVLL--------
>ERR1712168_1063860
-----------------------------------------------------------CEKAPPIPDCTSSNTVMMRLFKrdpeVAKLIYDVGVQHQ-TRNINEDEMTKMSKSIYSAVQDINVGPHSDKELAALHNLLEVVSYHFKRG--
>SRR5690349_6204932 
-TILTDEHRHFIRTSWEKINkrheKTTLGILMFEKVFAFLPDLRNVFGL-NDSS-VSETDRNENFRRHTSLVVNLIDLIIRNIFEmeaeMGPVLLMYGRRHFLKHDLVFQENQLVafAQGLCEFFEEEVDHdddnSLASETKAAWNIF-------------
>SRR4051812_9455799 
-GTLTPLRCQLLQKSWEAIIakygMFKPGMIMFQNIFKIQPELMEIFQI-SPEK-LGNFGDlPDEKFRHGRIFTNVLNLSVKNCVEleteVAPVLHLYGRRHVSKHNVDMAHHFLLvfAQGITSFLINEVK---------------------------
>tr|A0A1Y3EGL3|A0A1Y3EGL3_9BILA Globin OS=Trichinella nativa GN=D917_02219 PE=3 SV=1
--FLTKSQRQNVVRSWEKVpNKRALGEEIYIQIFMHKPMLKSLFP-FRTVP-VDQLRNNALFTRQAAIFADFIDCVVGYLaiNNgnlIMELSERVGVNHALMTSVnfDPEWWVLFANSVLDCIRQYCEPKFiclpisrhiTRKIMIAWRILLKEVVDRMSEAF-
>SRR5260370_37911868 
--------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVLTRIVERHSRpDLAVPPALYAPFVDSLIATGEQHDP-AFTPEVEHAWRSTAQTVVAYMTSRSX
>SRR5229473_1098235 
--------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVQTRIAERHSRrDLAVPPALCAPFVDSLIATGEQHDP-AFTRRWNTPGGAPPKRS----SPTX-
>tr|A0A1I8EE37|A0A1I8EE37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1
---LSKSQRITIENSWKRATksnaREQVGIQLFARILTARPEMKHLFG-LQKIP-EGRLKYDPRFRRHAIVFIKSFDYIVKNVAykeKLEQHFQALGERHTIlqGRGFDPGYWDTFNDCMRQTVS-LWGKDKDHRTANTWHTLISFVLQNMKIG--
>ERR1719264_1394560
--------ISVVAANFKTVKSnQVLANTLFEHLFELEPSSKALFES-KDL------------TQLKTKFAGFIGQGLKMLqgKNAKKSSGSLPRCTW-RWE-------------------------------------------------
>ERR1712226_1819570
---------------------------------QYDPSSRQVFEN-SNL------------TEHKQRFIGFIGKGIDTTiEGDREEWKDLVDMHV-DIGVTFKHFLAFEDAFLNTLHDLYADTFSDELLCAWIYVL------------
>ERR1719326_1666808
--------LDIVTKSYETVAAnSTFADILFERFFSYDESAKKLFGN-ADM------------ATHKKKLVGFIGKGLKMAqsSDPDGEMRKMAAFHK-EKKVEISHFIFFEESIIYALRGTLGVAFQDELADAWTLVI------------
>ERR1712071_441310
---IRRQGEDgrqrpvrhrqrtqrnpqtrlLSLESWTQKDrSPERPSQqvvghpkadccSSNRRFSHPPHGRRRPPW---LP-IQDANRLRAFPHQLHHQGRELP-----cRD--pKLsrX--------------------------------------------------------------
>ERR1719432_409132
---LRHQEHRrarrfrqqqerCPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPRIADVA-VSDLMNNRKFLSISYSAFAGFNFILNNMDDPEI--IKLQLSKV----DFPGMfvfpfpgtsqqHQ---dtsr-IVLEVFREELGAAFTAEAASGWTSLLNFVSQALIK---
>ERR1712179_658195
---VSGNSK-nAVRATFDQMRfNSEVAPKiml---KLFTAYPETQKMFHRIADVA-VSDLMNNRKFLHQLLCL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgyFSHR----IGSVQGRAGRRlh----RR--SRLWMDFSAELRQP---
>ERR1712137_151953
---LRHQEHRrarrfrqqqerRPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPPHrrcprlgfdeqPQIP-VHQLLCLRRIQLHPQ--QHG---RSRDHQTPT--------VQG----RLPRHvrlplpwylsaAP---gyfs-HRIGSFREELGAAFTAEAASGWTSLLNFVSQALIK---
>ERR1711946_32375
------------------------------------------------------DEQPQIPVHQLLFL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgtYPPS----HSNHTARERTAfqvlFLPQDT--SRIVLEVFRE-------
>ERR1719222_1795957
---VSAKAKSLIRDSWVQMKfNGEIAPKIYLKTFAAHPKTLAMFPQFAKVP-NRVRPHPYEpLLATAGIDYDVKLWIPSPGSEHNInveELMARNArmleetrDTI----TVPATfmirmlas--------MSNFRR-AGNRSTNDE--------------------
>ERR1719222_245222
-------ARSlgrtqesHPLDLDSHEIqqqRRTQNPLQDVHHLSRDPENVHPFGRYTR------------FSAHGEQTVLGFESLCFRwiqhdcqqYGCSRA--DQVAVVQG----RLPRHfrlslpwhfsaTRANPRIILEVFAEELGSTFTKEAAAAWNSLLNFVTKGLEN---
>ERR1711911_103569
---------------------------------sraDQVAVVQGRLPRHFR---------------------LSLPW----------------------------HfsatranhphhlgsIR--RRTRLHFHQGSRCrleLPfelRHQGFRKQHRRLATHR---SRP---
>tr|A0A0B2VDB7|A0A0B2VDB7_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_13543 PE=3 SV=1
--SMNDDTKGAICEQWHTILalydgdISRVGVAVYQRIFDAEPQLREVFGIPSFV---TDLSEYEPFQRSGKLFMSVVDLCVRNIYALdaemGPVLVMYGRRHYHQqsRGFHLRYMPIFTQCMKEFVSDCLNEKQkTSDSEDGWSLLFDYIAAKIVDG--
>tr|A0A0N0P721|A0A0N0P721_LEPSE Adenylate cyclase-like protein OS=Leptomonas seymouri GN=ABL78_2595 PE=4 SV=1
---------FTVQGTWNILEkegmLERFAQQLYDELLTQNARLRVYFYGV-DL------------DEQSKSLVRMIGTAVHFYEKpqvTVEMFTKAGARHR-GYGVNGEVFEEMRDAFFRVFPKFVGADVFSAAEEEWQKFWKLMLDLLQH---
>tr|S9WKS4|S9WKS4_9TRYP Adenylate cyclase-like protein OS=Angomonas deanei GN=AGDE_06844 PE=4 SV=1
---------NTVLHSWKLLEdggkMDDFGDALYADLLNSNPYIRVFFYGV-QL------------SEQPKALMRMLGTAVYSLNNpnkVDDLFVKTGAKHR-GFGVTTETFQSMETSFFKIFPEFIGEDVYEKTKKEWHDFWKYIIKKLDQ---
>tr|A0A2C9KGE7|A0A2C9KGE7_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 PE=3 SV=1
--LVTDSDIQALRSSWATLTAgpdgrNVFGNNFVLWMLKTIPNMRERFEKFNAHQSDEALKNDNEFVKQVKLIVGGLQSFIDNLENpgqLQATIERLAAIHLKmRPSIGAGYFGPLQNNIHDFIEDTLKVGADDAAPKSWTRLLTAFNDVLNSY--
>tr|A0A2E2XNM9|A0A2E2XNM9_9GAMM Uncharacterized protein OS=Cellvibrionaceae bacterium OX=2026723 GN=CL693_20675 PE=4 SV=1
-------DIDWIESSLELLAphADRLGGLVYPRFFVHFPEAETLFGG-GELG-----------KSTQESMIVPLLMGLKDIADGKtymLTIERWLEDHR-EYGVTLPMYSVMLDSLLLGMREAVGDLWTTEMDGAWQEVLARLLLLVEGVY-
>tr|L7L9M1|L7L9M1_9ACTN Uncharacterized protein OS=Gordonia hirsuta DSM 44140 = NBRC 16056 OX=1121927 GN=GOHSU_25_00750 PE=4 SV=1
-------IRQAVLESLARYEesHGDPTRAIYERFYRVHPEAIEELAF-D--------------TVLENRMMAGILALLADVADGSidpGGAVYWVSDHV-AWEVSETMIMGMFGAVRDTVREGLGPEWTARMDADWAGLLAALAPAMRDAV-
>ERR1712232_1039451
---------------------------------------------------------SEEMRTHATKVMTFVGNGVASIGNPEkcerfrAECIALGKKNQ-ERGISSQDYDIATQPFVDAVEHSwlqagwrqtdaSGSIWPPGAQGAYTKFYGHMAATIKDG--
>tr|A0A0D6M6J3|A0A0D6M6J3_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=ANCCEY_05408 PE=4 SV=1
--------------------------------MPSCVRTAVTLP-----------------YLEIFEPFVVIEGAVMSLDNlpaLDPILDNLGRRHG-KLEVNGKfrtyYWSTFLECSICIFRKTLTN--------------------------
>SRR2546427_1691122 
--------VVLLQTTFLRAAemrigKRNITDFIYEDLFLKRPQLKPMFTN--Q-----------V--LQRHKLGKMLGSIFIHLRDqdwIDEHLRDLGAMHW-RAGATPEVYPWIKDSVLAVLEEGMAPsGWNLRCQREGAGALGVSAQGMLMGY-
>tr|A0A183IHG0|A0A183IHG0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 
--HFSLREKELLSVSMKKLEqlEEDNAVKIFIRLFQENPAYKSLFPKLRFMG-DADIVNSTALVAHTQLILKMIKTFINGFQNestCAVVLKRAETAHR-KFDIKPSQVSTLFPILMEILDIS-----HNETQAAWKKLFETFSI-------
>tr|A0A1B6JRB7|A0A1B6JRB7_9HEMI Uncharacterized protein OS=Homalodisca liturata GN=g.2446 PE=3 SV=1
-ASLTDRDLRLGRATWFKNvDaTPDFGMVIFKELFRQYPDVESYFLHLRGN--AGSIFDSRTFRSHMTeRVVPKLKEVFEALDKpehLNEVMTKLGLYHA-KLGVSGHLVENMLSVILDALKSVMHTKMQPDEETAVRTC-------------
>SRR6185369_2033738 
---------------------------LRRVFI-QVASDRSDVSK-TNF------------KFQKLMLRQSLLEMLCfdrGMSGTREEIERLGLRHKV-LGVTPEMYAMWLDSLCEAIKQHDP-SYTPELEQLWRVAMLKSIKE------
>tr|A0A0P5RQ13|A0A0P5RQ13_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1
-TKLTPHQIRDVQRTWEHLRanRNAMVSSIFVKLFKETPRVQKHFAKFANVA-VDALPENGEFNKQIAPVAARLDTIISAMDDklqLLGNINYMRYPHQPPRAIPRQTFEDFARLPIESLEAS---GVSGDDMDSWKGVLTIFVNGVSMRY-
>SRR3954451_11513015 
--AASPCAQQLRQGCRDRPA---ACQLVLSSGVRDRPGCEIAVQ--GRH------------GEAGPQADGGADGLIDAIDRLDTI--------------------------------------VPAVEAAWTEAYTILATTMKD---
>tr|A0A1S3CW24|A0A1S3CW24_DIACI uncharacterized protein LOC103506299 OS=Diaphorina citri OX=121845 GN=LOC103506299 PE=3 SV=1
--GLTPKMVGLLKCLGVAIKPeaHRHGVNIFKKLFLMDKTVQRMFPKFACD-DMCGLDENPDFHKHVDAVMKSILYMMESSGsvpDMKSTLALQVKIHK-DLCIPDRHFITFGYAINEYLKETLGAKYSEDVECAVAYFWKFVASEMTAKP-
>ERR1719244_808981
------------------------------------------------------------------------------------KAPRTRRPPRAALQRENALFQALSRAFLKAIKVYLP--WSDRREAAWQLLWQRIITQMTL---
>tr|A0A2T5C1R0|A0A2T5C1R0_9BACT Hemoglobin-like flavoprotein OS=Mangrovibacterium marinum OX=1639118 GN=C8N47_108138 PE=4 SV=1 
---MTEADITVIEKSYAQIEAalPRMAKYFFNRANELDSDLDPLFEE--DK------------SKHGEAFVALFGKAVEHLNSPealLPEIKKMEAKLK-YYKFNEEVLNTVGVVFVDTLSFGFGNNFTQDIIDPWVKAYKTYSS-------
>tr|A0A1Z4LAZ9|A0A1Z4LAZ9_NOSLI Nitric oxide synthase oxygenase OS=Nostoc linckia NIES-25 GN=nos PE=4 SV=1
--AVPPELLLKMADSWQVMsqNKQQMGIEFYQMLFEKYPFVLPIFGR-ADMD------------YLSLHLFQALEFLVNCLKTgssdeMLRELRFLGQVHG-SADVPTCAYPAITECMIALMERHVP-DLTPQVRQGWVTLLERVINIVK----
>tr|A0A096P8B0|A0A096P8B0_OSTTA Flavoprotein pyridine nucleotide cytochrome reductase OS=Ostreococcus tauri GN=OT_ostta17g00030 PE=4 SV=1
---------------------------------------------------------------------------------------masvgsgat----DDD-GVDVPVSRCPFAhGTVTVDPYPGYVH-G---KNPRVCPRGCVPRPPSKP----
>ERR1712071_238239
-----ERSFTYWKDSAMMELa--------KWNARLQTPR----------------VYEVKwRRKKRNIPGRVGWRVLGAELWVRSSCRRRIRNRPYQEYFVSyvsiSQQLEETARLIIDALDEELGVRFTSYTRGVWSR-aFHFANSIMAESF-
>tr|A0A2D4BL26|A0A2D4BL26_PYTIN Uncharacterized protein OS=Pythium insidiosum GN=PINS_002968 PE=4 SV=1
---------------------TTLYDVFYAHLEQHSPELKPVFRS--SV------------HIRGKVLVHISVGMRTLIASenFVDKVLPLTKTHR-RFGVKPEHYEPLGRALLHAMQVVAL------ITRDRGRVEEPTSIILI----
>tr|G8YSE7|G8YSE7_PICSO Piso0_001107 protein OS=Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL Y-12695) GN=Piso0
--EITEQDIYRLSSSWNTIHtnsryhNDSFVSRLYANLLAANPKLLPVFSG--EN----------GLQEHSALFGELLSLTMIYLNDmptLKICIAAYARENPLFTEQCCEIVEPMGSALVLTLRQWLGKgVFDNELQELWIKVYVMLANTLL----
>ERR1719431_737524
---LDMSQISDLQRCWSTLQlhmgEQAIAAAFYNDIITNFPSIQKYFKNIWTESTFtRTIGNMNDVRKHASLVVSRLTNYMGNLHHLsevNEDLKELGMIHAARYHITEEVVEQFVSSMATTVADLLTKedLFDPVLCGAWKRFFFMILTFLSEG--
>tr|A0A0G4H5Q5|A0A0G4H5Q5_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) OX=1169540 GN=Vbra_6604 PE=3 SV=1 
---------------------SEIGIVFLHNLFSNAPTLQKLFVR---PS-----------ATYGRIFGQILKMLLAHLDDPAEvwqNNKELALRHI-KHGVRPSHVPLFSKLIVETFASIGGEEWTAEHTAAWQALWEVTGSELT----
>ERR1719431_2380502
--ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFiNDFIPSQMADKATPNTIKAWEKFMTVFIEHVKEG--
>tr|A0A1W2GS79|A0A1W2GS79_9BACT Uncharacterized protein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4044 PE=4 SV=1
MKDLNIRERKNIRDTWKVLAPniHEFAFSFYSNLHSLDSSLVPLFEN--EF----------GIIKQGDKALYVLGFVVASLDNLmvaregiKKALEGVFMEHQ---HIKRADEQKVMKAFLQAMKSTLRGVWTNEIAISWYRLLSLISAVSI----
>tr|U1JU51|U1JU51_9GAMM Uncharacterized protein OS=Pseudoalteromonas citrea DSM 8771 GN=PCIT_01118 PE=4 SV=1
-MSISPYQYQLLTQSFTTLKPNFhcFCVSLH-TQLKNYNLELA-------------LPSSSkYLLNIEHNIQLFLSEGIALLPQQsalVDLIKRHKPHFD-ALKLSEQDIAVLCHTMLETLQLHLGRQFTLALRNAWRKALHMFANIIKS---
>tr|A0A290TM25|A0A290TM25_PSEO7 Uncharacterized protein OS=Pseudoalteromonas piscicida GN=PPIS_a0207 PE=4 SV=1
-MSITPYQYQLLTQTLASIRPNFhgFCTSWY-NQIQHYDLRMQ-------------IPTNVgQLIIWEHQIFDFVQNCVMRIPQQsnlLHYLQKQRGTLL-FMGTSEKDISVLLFTFYSNAKKSSWQAFYHSSKKRLEQSTVTHRKY------
>tr|A0A2G1B531|A0A2G1B531_9GAMM Globin OS=Pseudoalteromonas sp. 3D05 GN=CSC79_14765 PE=4 SV=1
-MGISTLEKQLLLNSLHVVKPNFhcFSYTFQ-MHVKREPLDML-------------CLSNSKINEKTYILYCVLERIVMHLDNLrtvTPFIEHYAKNLS-NMGMSHQDTDILCNSFLATLKIHLKGCYPPKLESIWQHAINIFKSIVTG---
>tr|A0Y309|A0Y309_9GAMM Uncharacterized protein OS=Alteromonadales bacterium TW-7 GN=ATW7_05751 PE=4 SV=1
----MNSHKSVLLKSIGIIKPNFhaFTARFH-KKLVESDISMN-------------TLTAEQFNEKSYILYCTLERIIKNIDNPssvAPFLSHHLQFLK-KLNIQQSDIKPLTDIFYVTLVEHLGRFFNEESHLAWRKVLTYFERYTND---
>tr|A0A0K1PX98|A0A0K1PX98_9DELT Uncharacterized protein OS=Labilithrix luteola OX=1391654 GN=AKJ09_04675 PE=4 SV=1 
---------VVLKESWHLSYrrAPDLAARFYEELSWKYPSARRLLDHVFGAQN--------DI---AVCLSTVAGDLLDNVDDpdaFSAAIVALANAHV-SLDIPPHVVAWMEEVLLDTLEGAAGDDWTPEMRTTWRNAYEDLASRLAR---
>SRR4051794_15895678 
------------------------XmvgitqfyTEFYARLDTLDSSGKfdAIlsahtsgTNK---------------IAAKGEILIRIIKFALSIQGdnpavql----QLYLLGKS-HVQKRIRPWQYSIFVEAMIFTISSRLGTEATHEVMEAWVNIFAFILRSMLPQA-
>SRR6478672_7358577 
--------------------------------------------------------------------------SRmp--CNSstlkRRPSatscTESPTSTSP-WESAPSST-PSSASTYSPRSLRFWATPSPPRSPPRGGEVYWLFALQLV----
>tr|A0A1Z5JZN5|A0A1Z5JZN5_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_19Hh029 PE=3 SV=1
MEDISPDVVSAVQDSWERIKdsspawEDDFGDRFLKSIFTKAPLsYKLLFP-FGTT-SGPAMFESEDFIEAARTASTLMDMSVSLLecemDALFGQLLEIGLEHANFPRIQTSHWSMMRDALLRTLASYssaLSEDCKdlEKVLSAWSLVFDNLSNEMVE---
>ERR1700744_2408068 
-----------------------------------HPEAESLFRR--GPS--------MR--CPTGRP----------RSGTPG----GscwtkliASAlSA-RHKSRRLKSSLPLEEIRADVGFLL--DRVVVAIDavgdervvRNDRVLVRLDRVQS----
>tr|A0A2S3QTP4|A0A2S3QTP4_9PROT Uncharacterized protein OS=Halobacteriovorax sp. DA5 OX=2067553 GN=C0Z22_01530 PE=3 SV=1
-------DKDLIIESFARIEpnLKNFTNAFFDNVVILEPGMQKVFAH-AD-------------REQLKaSFIRALSITINNLKNpeyLKYYLQGLGGNQI-KYEVSETYFPIFEEAFIQTLMLFHMNSWTPKLETAWRDCFYYIAEYIS----
>ERR1719216_352717
----------IIKSSWRIIQnkvIARHGTDFFIEIFDSQF---------KP-P----IGVTPVFQGHGEKMIQVVGKAIETLRDgKspteqesqelWDMLIENGRLYL-GYGALPMYFDVLGTFDCKHSKDNVIVntGNCGKQEM------------------
>tr|A0A2D7G1P9|A0A2D7G1P9_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP96_10880 PE=4 SV=1
-------EQTCIERVLDCAAedQPDFQQRLYDRFYQLAPSAEALMIHIDEE-------------VQGKMLAEVIRLFLsPDVaVTDQQYLLFETKNHAQAYFVEPEMYRALNQALFETLKVGAGRIWSSEVESAVHNRLSKMLHGILEAL-
>tr|A0A2E1GZ77|A0A2E1GZ77_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ03_04085 PE=4 SV=1
-------DQAWIETAFDCAAvdNLNFNVDVYQTFYRAEPSVASLMAHIDEL-------------VQNKMLSEVIRLLLnPNIeSEEAGYLNFEVKTHIQGYGVSPLMFLSFNRAVYEVLQSSAARVWEDDLAVAVTRRFAVLSDALTEAL-
>tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ23_00915 PE=4 SV=1
-------MQSSIHALLEQVAttDIDFDKKCFERFFQISEEGKTLMAHMDRV-------------HRGKMMAEIYRLMMaRDLDDEADYLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY-
>tr|A0A2G2R0S2|A0A2G2R0S2_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_09030 PE=4 SV=1
--IVTPDQAIIIQESFARLStsSDSLIQDILGTIAEGNSDLAVTI-TF----------KSQNLVE---QISTALSHIIDQLhtaDNVAEYVAHFGELLL-AQNVQDENYSSFGEALLSGLENALQNDFTAEVRDAWTSGWAMLSGIMRE---
>SRR5258705_7404034 
----------------------CPTSSSRPVLWAAvrdCAGGQTLVPR--RY------------DGTRLQADGDAGRCGQQSGQSRsrvAGGERSCQASR-RPWREGGYYTPVGAALLWTLEQGFRI--------------------------
>tr|F0W0M6|F0W0M6_9STRA Uncharacterized protein AlNc14C5G666 OS=Albugo laibachii Nc14 OX=890382 GN=AlNc14C5G666 PE=3 SV=1
---------------------------------LNAPELKPVFKT----------------SKHARnVVLQHIVGGLRTMlahDVHIERVRALTRTHL-QFGVKMEYFDLLGQAVIFSMRHCSGSHWSSEIEEAWRRLYGHCSVILL----
>SRR5271163_4883858 
----------RTDSLYAQLGgkttIASIVDRFYEKVL-ADPDLKPFFAK-ANM------------AGIKQRQAQFLTQALGGPIDA--RNHETRPAHA-SLLSDTRHFERAATHLAVTLSEM-----------------------------
>ERR1711911_155006
--DIIRKNCLMLYTNFTATKiaFKWILLCLNCRYFEIKPEAQKLFPAFANVPL-KDLPKNYA-------FLAAVNTCFANVHYLIekagrnprdcPVFSKVV---A-KYD--ARDVKQFGDIMMNSLKSELGSQFTDEIEESWNLALEEIAKMVS----
>tr|A0A286GHZ2|A0A286GHZ2_9BACT Sulfite reductase, alpha subunit (Flavoprotein) OS=Spirosoma fluviale GN=SAMN06269250_4620 PE=4 SV=1
--ALTPDMIRLMRQVGDQLsaDARVIGTDFYHALFQTHPDIIPYFNR-TDID------------SLTEHLMQAVGFLVRSLASgvdITKELRELSQIHT-NFSVPPDAYPKLVEPLLTVMRKH-VPGFSTEQEHAWVILLNRVTNVLRQ---
>ERR550539_353004
--------------------------------------------------------------AMMQHLVKNLHDISRF---dsdIRELLTRLGQQWL-QKRVPLDFAVLLGNEYLEAvlpffHSNV-GATLALKLEVSLAYLYKEAMHFLLL---
>LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3583117_1 # 3 # 191 # -1 # ID=3583117_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
--ALAPEAVTKMRAGAEAMlaHPQEAGVFFYETLFDARPDLVSLFRT-ANMD------------ALSRHLIDTVVFLSRAADDltgLRDDLRNLARVHQ-VNQIPPSEYAHLAAPLLETLSRF-GHPLDAQMIRGWEVLFDRVSRIVAE---
>ERR1719359_219123
------------------IdEepmaEVVSGeDALV----AIA-DLlyQKL-------------------------------------SGdeaMAQFLENVDLT--QlanNLRSLlalvfngsdWPEMHLS--gSLiddgYEDFSSILQETL----qaSPg-DDALL--ESLDKL----
>ERR1719487_376807
------------------EeEgateEVASGeEALV----AIA-DMlyQKL-------------------------------------SGdqaMAEFLENVDLA--QlakNLRTLlaavfegndWPEINLS--aSIidegYEDFSSVLQETL----qtCLg-DNAML--ESLDKL----
>ERR1712100_485805
---SVGHVVLVV---GRCSfEcrniVVVEGlDGSLDRLLALRkvvgiglGLPilQQL-------------------------------------G-VLRHVGNVA-------------------lKVlrchFLQFSNHVLEVRSRLRldefclvgdivievilrDHgggkHeRD---------------
>ERR1719171_2780585
--NLSEEMITEVQKSWSEVLrrvdsKTEIGRIIYDSLFDRLPHLRKMFKT-NRL-------------TVAMRFANSVHSLVGILNNkeqTEEYVYNMALRHV-QYwsgdgSIAQANMSAFLKAVLIVFDNALDDKWTQRMEEAWGALFSYVGEAMVA---
>ERR1719265_1594411
--------VDTIVKDWAGLDLEKLGDTTFGMMVQNNPEIKTIFGG--DVHPG---VAQQGLKSQAATFVGFMSYAMTWLKKkdfivLEQKMVELGQRHV-HYGVNVSHFVSFQEAMFTALREQLGTRFE-DNKYAWTFT-------------
>ERR1740139_1939294
----DSDTIAVVKQTWKAITalPeqqEYVGMRLLHNlhpcyetsltfllvielyylsYLRVVPSARAFFPPTSD-----SLIDDESFRESASNLMMCIDKAINTLENqrhlrFKALLQTYGKKLS-RLHIPPSCYTMAWFALIETLQDVLEDRFTELMLAYWIDIIDPINT-------
>SRR5690606_18427011 
---VSHRN---AHEKHQPCHaKL-------------RPLLRE-----------------PRLLRRLLY--DLSGqLTRR-A--GEVRPERHG-----GAEASAX---------------------------------------------
>SRR5690606_42132731 
---MPMKNTNRVMQSYGRCCaSPGFFDDFYTTFLASSPAVREKSAQ-SDMA------AQKHLLRAGIP--NLVPLARG-M--PDTKLDRKSTRLN-----------------------------------------------------
>ERR1719487_109746
-MIMSAEAVQVVQDSFHRVDscvqiRDALEDVFFPHLFASSTQIKELFAD---V----------DLNMQAPMFANILNSTISSLNNpteLRPLLADFGEKCK-KYGVQGEHIATAGESLIFTMKSI-DDQWDAEVEAAWMAACSAMENAA-----
>tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1 
-----PMEVALVQSTWQRFLesPnlTTEFSAIFQRMFQMVPTAMQAFRYV-NSTDLDSLVANKDLQKVVTMMMSEVNATLQLLDQpqaLISLIRSHGARHA-TYGVTRQWEETMLNAILYAVETKLSPsGFNQSEKNAWRSVLDMLGRNF-----
>tr|A0A0C9M7G1|A0A0C9M7G1_9FUNG Type 11 methyltransferase OS=Mucor ambiguus OX=91626 GN=MAM1_0030c02374 PE=3 SV=1
--PPTQAQIDIVRYTWERVSeihldtddPtvsatHAFGLAFYDALFKLDPSLEPLFSNIFQQAralagMVSYIARSPKVTGPNKpksatSLsegcgmstaklekvptireinarkrketnATTFEELVSSAatskpkaeDDeeqLLYKLRELGARHY-FYNVEPKFLALVGPAALSALKTRLGKDFLPEVAEAWTRAHAYAAYHM-----
>ERR1719365_124985
-SEMSGKQKKIVWRTWNSMLgkqesdYNDFGINFVLWLFDNFPKMRNKFDELYGR-SRNSLIVDQHFIAHTENVVKELDRLIKDLPFprlLSKRISKLADSHLNqEP--------------------------------------------------
>tr|C9CRM3|C9CRM3_9RHOB Uncharacterized protein OS=Silicibacter sp. TrichCH4B OX=644076 GN=SCH4B_0097 PE=4 SV=1 
---ISSRDIDLLQSSCATAFlkKGVLASAFYNKLFEIEPAYVNKFS---NIN------------KQKIMFEAMLAYCISGITSgykVEALTARLRSYHM-HLEISDIDIANARSALMYALGSVLGEDFHSDLKQAWDAAFSSVSEALR----
>SRR5688500_3946624 
---VDSRTIALIKESFTPIAgrTLELADRFFNNLFTRQTSVRGFFPA--DVTEQ---------KRQLPGVIQTILENGDKLENLEPQLREVGREYA-KQGALPTHYGAVARTFVDTVREMSGIGWQARYTRAWTSLFDSLTKAIV----
>GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6338290_1 # 1 # 129 # -1 # ID=6338290_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
-------------------ReagLEQYAGALLRSGFDDLEtllaiedadmkdLGIPaCHVVRlrkklqelqRQRSGTRGDFDASNP---VVAFL-----ENAGLGQya--KLLLQNGFDdmDV-LLDIEDADLKDLGvprghaIKLKKGLRELQLQQYAQEDPMPLHAAA------------
>SRR4051794_36238122 
--------RRTAKASYLRLQgggrERAFFAAFYENLLVSCPDVKPFFVP-ERMA------------HQ----QSMLNRAIQLLLDFdracgCPQLRQLADGHA-GYQLTRWHYDQFVEALIRTIEQS-G-ITNPAELSAWRTTVMPAIEFM-----
>ADurb_Met_03_Slu_FD_contig_21_1037173_length_469_multi_2_in_0_out_0_1 # 1 # 468 # 1 # ID=69395_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.588
--------RRTALASYLRFQspdkVQKFSRGLYEHLFDRHEELERLFKP--DLK------------AQ----YEALNRALQALVDFrpedpdsAKAIETIATRHR-GYSISKAHLVTFLDAVAVGLACA-D-ERDPETHDAWHEVLVAAFKPF-----
>SRR3569833_2455512 
--------MKDVQARFGRCClHPNFLDTFYNAFMATSPEVARLFKN-TDF------------TRQKKMLQMSLNLLIShamGIGIVDGYLHQLAAKHSRhHLNPEPQHTTPPPNSLMKAVNQHDP-KYTPSLDHARRTGHGHGIELI-----
>SRR5439155_1005251 
--------KATtalAKASYDRCCqAPEFLQVFYRNFLAACPEAVPRFAG-TNF------------DQQTRLLRHAIGLLLIfpnQPNKEPNLLARLARGPGPcRRQGCA--CGQ---DRSDRTARTDGAsrqrrcraPCSRRpdarGSRKWVRAAP-----------
>SRR5262245_66279004 
---LEPTDRIRAKQSYLKHcmGKNDFYRKFYERFFQGPEGTmakEMFAD--KDL------------NQQYVKLDQSLHYLLNFGDQdmMEpTVLTTTATIHQ-TKGVAPEQLERFIECLIDTLSKDYQV--SGIEVDAWKNVCGP----------
>ERR1719277_2718232
--VLTDETIAIVKSTAPAMKehAYKISETMYQNMFAEKPEIRKLFTP-EDQ----KVQPGQTQKKQPLNLARAIQAYATHIDDldkKKSRIGRRIDrvrkKEC-SIESKNG---FNGK-RSEIVKEELTELERKNVVLrakmdSMEREvkllkKKFLSDIS-----
>ERR1719209_1562507
-----------------------------------------------GDHsh-AQSYH-----EVHEHLWRSLAFSVLNQVlsrDkRIKQDLFNLGYTHH-ERGLKEDDMLQLEYAVIDGIHDHLV---TDVHERAWRKVFQLIRIHF-----
>ERR1719487_2840864
-----------VRQSWAMIQaiqtS-sagGFGDALFFNISVMSSEIWSLFSV--SKE------------VMAVTFTDAFTLIVSYIADpvgLAEELFGEADGVG-DVGDDQGEGiregdghDLLGHGEQ--TPDLAAHDGDVEEERVAE---------------
>ERR1719171_2815737
----------------------agaendeelrensgvedsfasgsvptTFNEMFLFNLTVMGAGARK------NKA------------ImWMTEVLTSFDTIVANVANskrLQEECDVLGLRIS-KYPLDFVKLPEFKACMLSSLRSLLPRTWSGTHEVAWSWLWENIERML-----
>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 OX=905079 GN=GUITHDRAFT_143733 PE=3 SV=1 
--------SARIASSWTELvkksDYAEIGRRIYGS-VKANDTLEPLFR-FTNQ------------TVQGTKFVDMLSSIVENINNPqtiFEKVNELAPMHH-RKGVKAAHMPIMKGIIVSLLKHVLGDEFTNEDEEAWNWIWQYLTQILD----
>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold759411_1 # 1 # 798 # -1 # ID=759411_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.594
----------IAAQFWEEHiSyksladKLEIGCAIYFGMMVHNKEMKRILKKNlhhHQ-----------SIENSSVKFLDMMGWLLRSLlrSDidLCGSLQQLGAFHR-NMGVNINHFDPMLKSMHETFSYYFPIKYGIQIKYAIDQIFTLAARIMTG---
>ERR1719396_104066
---------FNIIESWELLRfhpslKEDLGTAIFRELFKEHPELREHFGL--PLVGLDALCKNQTFLSLSNQFVDVFARTMDTLGPdeelMDESIRELGEKCV-SIGIETSHLSLLRKPILSAVEKILLEDFDD---ESWKKFYSILATDLAE---
>tr|A0A0P5AEE1|A0A0P5AEE1_9CRUS Di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1
--KLtp--HQIQDVQRSWENI-rngLNALVSS-IFVKLFKETPRIQKFFAKF------ANVAVD------SLAGn----------------AEYEKQI-ALVD--TPTPNVEFPV--------------------------------------
>tr|A0A0P4WPK3|A0A0P4WPK3_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
--KLap--HQIRDVQTSWENIRgdRNSIVPPSSSSSSRRLPAPRSTSSN--SLA-LPSMP--------------------CpKManttnklllGDklqLLCNINYMRYTHQPPRAIPRERFEDFARLLLDVLSSK---GVSADDMDSWRGVLTIFVDGVS----
>ERR1719510_2339612
--SLTDNEVILIKSSWTYLKPhiNTILIESFMSLFAENSDVKEKFYSFKNHAIEdlnkkrgVGLASTNGLQRHIPRVSRAITKVVNSIENldrVSRYLEMLGKIHQ-QIGIEVQELMMLGAFFINSSKRHLPSSMQAdrHYSDSWLHLFTVISTMMRKGF-
>tr|A0A2V3J537|A0A2V3J537_9FLOR Flavohemoprotein OS=Gracilariopsis chorda OX=448386 GN=BWQ96_00611 PE=4 SV=1 
----DPETEALIKNTLPIFtkHSQQIAVQLYANLFEQHPQLKPMFC-LEFLQTPGQCKKSPgtGMSPQAKILSDSIVNFCANLDNIdmmNNAIERICAKHV-SRHVKSDHYPAVAGAFSRAVRQVLKNELSESDLKAWDTAVSALAGVLV----
>tr|A0A2G5SLB2|A0A2G5SLB2_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-17 PE=4 SV=1
-TEMSDEEVSAIREVWIRAKTDNVGKKILQTLIEKRPKFAEYFG-IQSeSLDIRALNQSKEFHLQAHRIQNFLDTAVGSLGFcpissVYDMAHRIGQIHFY-RGVNfgADNWLVFKKVTVDQVTTGATDsSKekdkdetnsngtangkvdteanpipvgiadinnvysgeNCLARLGWNKLMTVIVREMKRGF-
>tr|A0A2P8XQA5|A0A2P8XQA5_BLAGE Uncharacterized protein OS=Blattella germanica OX=6973 GN=C0J52_27026 PE=3 SV=1 
---LAREEKKFITESWHAFmrLPPANSVDAFVKFLQENPKYIKFFKSVDGIP-LEDLRYSFRVPKHVTAVLLYVNSMVHCLDNADAMfflSLQVGLMHS-NMGLTVEDFKLFNGYMVNILEDELG--LNDEGVAVWNKVLEIFM--------
>tr|T1FHE7|T1FHE7_HELRO Uncharacterized protein OS=Helobdella robusta OX=6412 GN=20208246 PE=3 SV=1 
-----------------------------GTLLQSNPLVKNTFEKFRQMDPMSDFTDSSVFSTHAMVVMSAFEDIFDNLDDseIVKDILEQGKSHG-KFseDFAPETFWAIEEPFMSSMKDILGRKMSSQLEKIYKKTIKFILSVLIKGLR
>SRR5580658_3791175 
-------DPALVREAWSFVSdrADQLVMNFYAELFYVFKEAPTMFPS--NMT--------RQRQEFGRAVVQWIIS--DDQEGL-----------------------------------------------------------------
>SRR3990167_4175368 
-TGLTDGEKGMIQQSWNLLSKVEFTKILYKKIFELAPHVRCLFQN--SIES-----------QHENfsIMMDMmINEHINDELDLFAVVLQLAKRHF-HYKVKTDYYSIFRDGFLWSLEQTLSIEtlnktITnestnqpTTIKSIWLKFVNYLISVMV----
>LauGreDrversion2_5_1035112.scaffolds.fasta_scaffold830278_1 # 2 # 232 # -1 # ID=830278_1;partial=10;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.316
-------------------------MAFWN----KHPEPAAQFVA---P----------TQdtltdefepeeeqGISKEQLLSALNAAQT----ALMMIDR----D------FNITYLNqKSVDLLKTHEALFQSIWPNFQATeefllGYCIdlfhanpshqrqmlsnpsNLPYTTTITVKDV-
>SoimicmetaTmtHMA_FD_contig_51_4416696_length_1368_multi_2_in_0_out_0_1 # 1 # 216 # -1 # ID=2511055_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.685
--------VALHTVEFAVADPsaRATI--------------------------------------------------ATHGLtpdDMAMLLSK---RE------------LIGPAFPALLDEFYGKVVEN----------------------
>tr|Q5D2M7|Q5D2M7_9TREM Myoglobin 1 OS=Paragonimus westermani OX=34504 GN=myo1 PE=2 SV=1
MAPLTQAEVDGVVSELNPfLAsdakKVELGLGAYKALLTAKPEYIQLFSKLHGLT-IDNVFQSEGIKYYARTLVEDLVKMLTAAAKddeLQKVLVHSGHQHT-TRKVTKQQFLSGEPIFIDFFNKTLSK---PENKAAMEKFLKHAFPVIANN--
>tr|A0A1S8X4B3|A0A1S8X4B3_9TREM Globin OS=Opisthorchis viverrini OX=6198 GN=X801_02811 PE=3 SV=1
MAPLTQSQIAGIHKELLPiLSndeaKTSFGVGAYKAFLGAHPEYIQYFSKLNGLT-IDNVFESEGIKYYGRTLVDEIVKMLTAGADdekLKQVLHDSGKAHT-ARNIDNATFMvsklfmflkrvsemrlarglygpfpifaqSGLPVFVDYFNKSLTV---PENQTAMEAFLNHVFPNISKD--
>ERR1719167_330163
-IDLTDKERELIQHTWWRFREEpYCRLRIMTHYFSANSSIKKKFQR-KNEENAAngNlmtAMVSWNIRRFSIRLVEFMDKVVRDLETEnyqdiYDISELQGAKHYRlKRMVEPGDMEALGQSIQTTISEHFGEKFNRSHILAWRRLFIVICSRF-----
>tr|A0A0T6BC68|A0A0T6BC68_9SCAR Uncharacterized protein OS=Oryctes borbonicus OX=1629725 GN=AMK59_2266 PE=3 SV=1 
-TGLTSQQKSLIQSTFNVIRPhiLNVGIDLFVRVLEVEPEHHRVLP-FSHIP-IADLHESFEFKFHCLAVVYSCSAIIDHLHDdgiLIPLMKKYASDL--KASIPLDIFQMIHDPLLEALDVHDDVKISEEALEAVRTLLRNLTNFLI----
>ERR1719199_1566639
---------------------------IFQHSGIQRPVFSTSSSSR-R-------------LCRP-CDLSMAFRPSDVLHSstrLKAQVETMGFGHL-HLDVTPARCKLFHGALVDFFVVELGDKLTPLAAEGWKRVLTYVASGLM----
>ERR1719362_342361
--RLSASAVTFLRSSWEHVPKDSFGMEFMKRACSEEPSLSDVFDC-P-V-------------ARPDNLAKVVQMLLDQAEielvprleRLAHGIAALSFKFG---KLRMSHLAPMKRALVRTVVAFAPGNQKAMTNRAWEAFFYAIAAVVA----
>ETNmetMinimDraft_19_1059907.scaffolds.fasta_scaffold284136_1 # 1 # 639 # -1 # ID=284136_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.595
--RLPKACVSLLRQSWKQVPQASFRKEFFDRLYIEDSSLQQIFQH-PMV-------------EVPENAWNVVQLMLDLLNvenvprleRFVHALAGLAFRHG---RFRLAHLAPIKRALVRTVTSHASKQEKKKLSQAWEAFFYALAAVAA----
>SRR5262245_21272653 
------QNVEVFRASLKRCLaAPYFMSRFYDLFMGSSDEVREHFGD-TDFK------VETRVLADSLYLMAVIAQ-GEAEAPAWTEMSRLAKRHSKaELDICPELYDLWLKCLIEAARLHD-AQFSEAVEQAWRATLAPGIEYLSSRRX
>tr|A0A2A4SWC3|A0A2A4SWC3_9GAMM Uncharacterized protein OS=Thiotrichales bacterium GN=COB61_05140 PE=4 SV=1
------MEFQDIRTSMGRAItHGDLFGRFYDIFLASNPKIKSMFVG-TNLE------TQKALLRQGVNLALMFAE-GKAIGK--SAMNRLRDSHSKsHLGIEPSMYRYWLDSFIKALKEFD-PDFDSALEKQWRQALGAAIEHIAAGYS
>tr|A0A1R1LTH4|A0A1R1LTH4_9GAMM Globin OS=Motiliproteus sp. MSK22-1 GN=BGP75_17400 PE=4 SV=1
------DFEHIFDSSYsrvlAVTYnKQGFFETFYQRFVVADEKVSELFKN-TDMA------RQQKLLESSVYFLRDFYT--TSYAD--DVLQKIAILHSKrVLDIPPALYDLWLEVLLSTVSDFD-PLFDENIELAWRLVLSAGITFMKFKHN
>tr|A0A2A2KP63|A0A2A2KP63_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_06989 PE=3 SV=1
-SGLTREEKRIIQVCWFKCNqkqLRKCAEDIFADILHMDDDLLRLFR-L-DHIQSNRLRDAEFFKSHASNFAIVLSLVVTNLQEhVeqaCEALQNLGRQHAA-F--LDKFFQSMyWDTFTDCFERNPPPAFRKgSEREAWSRMILFIIAQMKIGFQ
>tr|A0A1I7TYQ0|A0A1I7TYQ0_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=3 SV=1
-SGLTRDDKRIIETCWFKCSqkqLRKSSCDMFWDILHTDEDILRLFR-L-DHVSPNRLKDNEYFKSHASNLALVLNLVVTNLQDnFeqaQDALQALGYQHLH-L--IDRtHFQSMyWDIFTDCFERNPPPSFRKgAEREVWSRMILFIMGQMKTGYQ
>SRR5215204_501118 
--RVTRRDWQRLLENWERLQpsADRFATVFFDTLFAWEPQARQLFGG-------------ATLETQFLRFAHLLTSLVSAQDHpdeLDRRIDAVIRCFA-GGDPPRKREDAIRVAVAAMLNDVYAAGITPETRASWQSAYIGVITTIRS---
>tr|H3NRG3|H3NRG3_9GAMM Uncharacterized protein OS=gamma proteobacterium HIMB55 GN=OMB55_00005550 PE=4 SV=1
----SQSDIAIISESLTLCgdCLEDITPHVYRRFFELDASAASLMEYS-DEH------------MRGR----MFASVLELFlsddpFESDGFLAWELDNHVSSYSVTKSMYESLFKAFFEVAEETLGEDWSGDFERAWTNRIARIMAEVS----
>tr|A0A2V1ABH2|A0A2V1ABH2_9ASCO Uncharacterized protein OS=[Candida] duobushaemulonis OX=1231522 GN=CXQ87_003270 PE=4 SV=1
--QLSTADRNKVRASWGDAMaakdykTEQVIHEMFSSLIEQSEDARDLFEN--KK----------VRAQQETLFAEIMGFTMMYLHNitvLDECMNEFIREnpHIVRCGV--RYLEPMGAVLIQYLRQTLGPQFHAGLETLWVQTYIYIANCIL----
>ERR1719396_219344
-------------NTAAAVAPkaLDITKTFYGGMLQDYPELLAYFNPAHNVP---------ISENQPMALAGSIVAYASNIRDLSPllvpngPLMAICHRHC-ALCITPPQYNVVHENVMKSIAKVLGASSRRRSRPPGARRSSSSRR-PA----
>ERR1719396_178111
--------------------------------------------------------------------AHGPGRLHRRLREQHPglvpaagaqrPADGDLPPAL-RLVYHPPAVQRGARERDEVHRQGPGGVVTPEIAAAWSEAVLFLSKACI----
>SRR3546814_8055804 
---------------------KDITPFFYDRFFALYPEQRANFYHFES--------------TSGTMVNEMITSVLALASNearSEEHT-----------sELQSLMRISYAVFCLKKKNKT-----------------------------
>SRR3546814_13566968 
---------------------FTIYTTLSLNVVLPFVTHRSNFDHVES--------------TSESMVIEMITLVLALASKeawLTNSFQNFVAALR-SYgDIPPDAYARLLDVLVVTLAQVAGSRWTDEFETAWRWYVSGM---------
>ERR1719171_2136978
---------EAIRITVPMLEeigLENVGQVFYGHLFTESPQIQMHFIK------------------PNRMLAYIVRKAIFMVRDlhpkpkeVMAELKPLALRHI-KYDAPPELFADFLVSFTKTLEENLKEGFTTDCAEGWESATNFLANTITR---
>ERR1719171_2291403
---------PRIcgelwrkqtfklrfnilgkqihspgiPRFFQKMEnvgGLLVSalllaMCFYDPEIvAHEEQIGIHIID------------------RNDAIYYVLEACNACILWllvtnVFGFSvQLSAFKHC-VSQMaeDLAKFGTFAVVFLMAFGCAIhiTMPYDPDFEDMWVTILTLFAI-------
>UPI000297C1C9 status=active
--ELDEYSIGEVRNGWENLERRCGtPKAAA-EEFLHKVSAAIPKTE--HM------------QKRASTVWSKLNGLLASMHDqsmFTGQLEYLALRHM-NQDISAAEIETFKGLLLEFCASKLGGMMTPEFQYGVSRLVDAVGASYQ----
>ERR1719334_589756
-IMLSPAAIQAIKSSWQHV--KNVGFQFFGHLLfsfwlGNQPRALEIYCLHyhGDKR-KGVVELLPRFRRLGEIYAKRIDTWVSHLDDPftlFLILYEHGFNPP-KKavGINEKDFELMVPSLMDAISSAMGSKMTHRLFEQWKSFWKYVLTQIAEG--
>tr|A0A0E9N6V9|A0A0E9N6V9_9BACT Uncharacterized protein OS=Flavihumibacter petaseus NBRC 106054 OX=1220578 GN=FPE01S_06_00290 PE=4 SV=1 
--QMNQQEIQLVCQSWQQAAeePLRLAILFFDRLFEEAPELRQVFRT--PMS------------EKTRQLLVFFGFHINRLASgsIrRPSFEAYVW----EELLTDAQKGFLMETLSDTVAALLKPDWTPALQGAWGSFRK-----------
>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
--------NDLVLSSWDIVRqrteVQELGEKFWKYLNCMSPEQTNLFRR--SL------------SMWGhllHHIVNMLLISITDPEEYYDLMFELTIRHI-RYGVRSEYLNPFGNALFATFEEILSDVWEEKTTKAWKLVWKRATCNMSRG--
>ERR1719242_319529
------EYKNVLQSTWTKLlqKKEEIGKRIYESIvFDTTC-TT----T-GTSLSTSIIFENTNIGQSASRFMDMLDTVICKLDEpdaLVQKLEALSAFHSSNFNVQKRHYIDFEKGFMKAIKWELGAQRTILHDRAWRWFWNFLISKMC----
>KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1947561_2 # 429 # 647 # 1 # ID=1947561_2;partial=01;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.584
-------------------------------LFETNSDIKTMFAKLKDYETVAELRSSKILEDHSMKVICTIDDAIANLDDMeyvNRMLQTIAQAHSTRFpNFDPEFFM------------------------------------------
>SRR4029077_13489679 
-----------VQADVHAISvm--LNLMQPFRALRRRVDQFAKLWL--DPL------------WKTGRKAARIPA--TSTSITGRtgfAGRGRTGKAAC-----------------------------------------------------
>SRR5579859_1650388 
------------------------------------------------------------------NFLQALHTILLKMQRhdpsVFQFVQQLGARHE-KYGVTREHFRLVGGFFLTVLQRYVGVLWTRPMQRTWEALFGVLTDVMLFGY-
>tr|A0A0N4ZKI8|A0A0N4ZKI8_PARTI Uncharacterized protein OS=Parastrongyloides trichosuri PE=3 SV=1
--GLTYYQIQAIQRAWRHMSkagQVSCGRQIITKIYKNNTEIRNIFQTYVTIENLS-INQMepveWGVLKHGEEIVNLLDYVIKNLNNIemvEEKCEEVGRSHRKmkQYGMKEEHWDSLGEALSETIRENYG---------------------------
>ERR1719326_2865515
--NMPPEAIEQVKATWTKLLsmttHIELGSLMYDALFEKLPKIRSMFVS-------------PRL-ATASRGETNIDRIFGSFSKSas--------------YMrdpssMX-----------------------------------------------
>GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold4300996_1 # 1 # 264 # 1 # ID=4300996_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.629
------------------------TQAFYEEYFRLCPDSRDLMKHV-DEH------------VQGRMLASVHELLMLPDPDEQaRFIAFETQTHR-SYGARRYMYDRLFRALRSVVRDVSGDDWNPAWTTPGIAASRPCSRAST----
>ERR1719174_1428107
---------------------------------------------------------------------------VVDCQDqrsTLGYPPSAST----SVRCCVEQVARRaflwrkswfLTTLTIFIAGQ-AiLKYSHLDNLATERLLVFLFRAFI----
>ERR1719284_2194575
----------------------------------------------------------------------------SWREStssMRPCPPSLKL----LGIASL-------------------HSLKLDEKLEFGNGdIGLPGGIQI----
>ERR1719277_1813735
----------------------------------------------------------------------------------CMCAAETRIAHL-IGRASVANMHNLRNAVGSEVCLLSSlAIRFEANHVGWAHVsvadvVAVCSSISL----
>ERR1719310_1375130
--MLPQEQSQQLQQAWALVinmsgNRDALADLIYSAFFYRLGePR-APLRNPA--------------GSRSLPFLHGHQHLRRQLRrPwssaqfrrnveLRSHVLGYHRPSG-EHHSX-----------------------------------------------
>ERR1719310_407492
--ILPLEQSEQLQQAWALVinmsgNRDALADLIYSAFFGASASLEYLFVTPR--------------AVAAFRFFTGINTFV-AFCgDpaqLRRNSQLRSHvpGHY-NSSCEHHPX-------------------------------------------
>MEHZ01.5.fsa_nt_MEHZ011529165.1_2 # 173 # 307 # -1 # ID=206391_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
--YMSIDtgnleaakvmlqdlvtiradrsryyyclddlFKWHPDIVWKLTv--------------DAPELLrtmldGMIWRSRV--------------VvngnrrvnyylkhllvDEHGKFSNAM-SCIVKLQDpEIAIHPILVQ----LGDLVWNDLVYWrflrgklslVCTAGIFMVSQSMl-QYVESAGSFEERVATFICRLVV----
>tr|A0A067CC73|A0A067CC73_SAPPC Uncharacterized protein OS=Saprolegnia parasitica (strain CBS 223.65) GN=SPRG_06598 PE=4 SV=1
--ILNTAYLLDCSKSWKLIVtantdrMRQYgksgivlfYDEFFFRLFQRDFTLEEVFP---DI------------GKRGEVLVKAMTFMLKSSaENpkqIVNKCHYLGHRHRSFGGVRPHHWAQYTSTVIEVIMYWLGEYASPDVGAAWSNIVGFFLMHILESF-
>ERR1712194_94606
-----------VQDTWISATctfeyKECLGTQLLYNLMHIEPSFLDAAPFFDNTVLLGDGFDDESLIQCAIYIVQCITELVTMLDKyHEPKFRILINSHLSrlaKYNIYPSSFAKVAQALLMTLSDVMQEEFTKKVESYWMSVLIILF--------
>tr|A0A2M8U0Y4|A0A2M8U0Y4_9PROT Uncharacterized protein OS=Ferrovibrio sp. OX=1917215 GN=CTR53_17535 PE=4 SV=1 
-SPLSPAHLGLVRATFQILAadRDRLTEMFYARAVALDPHIQRPQ-----LV--------SNMVAQRLQFMLVLTDVVQQLDDLpslAQTAATFARRHG-TYGASDPRFRTARAALAWAVDRILETERNSAIQLAWNAAFDLVEALV-----
>tr|A0A1I8F573|A0A1I8F573_9PLAT Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1
-------------------------------------------------------STNQKPPSDGDRLLYWINVQ------ptAQPQLLRGASEGC-VRLFSPRILTRSCISSNLCVRAGRGRNS----SSTeTTSAEGADAVVAA----
>SRR2546429_8650734 
------DAQYLLTESLAVLRpyADELVAEFADRLATGHPALGAIFEP--RL----------------LTVLLELAATYDRPQGLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTLRDFPGAAWTPAHHGARVRAYAFAAATMM----
>SRR2546423_13669166 
------DDQYLLTESLAVLTpcADELAAEFADRLATGHPALRAIFEP--RL----------------LTVLLELAATYDRPQRLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTPRDFAGAPGAPAPHRAGGRADAVAAAPPK----
>SRR5690348_18181078 
------------------SrrRHTRWTGDWSSDVCSSDLETRALFRT--EGS------------ELVKG--SMLAMTVEAIIDFAgersGkfrMIACEVMSHD-AYGTSRELRSEERRVGKEC--RFGWVAYPX----------------------
>ERR1719323_1074371
--LIPFEQRTLITEVWNVLQestIRYVSNTMFLpLIVRSNKSLQKCFAALDQSLHGMELVECygSkfDRTKHGSLFLSKlLIRVVPNMDQmdrVLPYLAELGALHQ-RHGVAKQHIDLLGLAFCAAIRGVVAgggvkGGHLHETTKAWITLIQAVCTGMKMGY-
>tr|A0A1I8C1X6|A0A1I8C1X6_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=3 SV=1 
--DLSPHQIGLIKRAWKNLlksvNENEIAIKLLLRIFQLDPRNLAYFSL-NEYSPFDeyLIKENNIFINHVKTFESTLINVMTHPGNatkLSKHLQQLGGRHVNYTGVTykCSYWKCFIQSLIDVLTLNKDKNTSEDLHEAILILGEFCVEQMKIGY-
>tr|A0A0N5CQY3|A0A0N5CQY3_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1
--QLNAPQLLLVRKTWAHARSqGalEPAMSIFRNSFFKCSEIRSLIMN------GPKNEGHERLKSHAKAFTEIMDQLICGLETkelIMYELRAAGRSHIFLprdatdnkskgCTFRLAHFEHFASAMIErTLEWGEKKDRNETTQTAWTKIVLFVTEQLREGYQ
>SRR4051812_28599342 
------------------------------------------------------------------------------WVRprsRGGRSPRSRSSRS-SARRWPSGRPRPPSTS--RPDMRSGPSscgmsrarwqsifpapsrtgcasPIGVLGDP-----------------
>SRR6516225_8820395 
---------------YSVHCegKTNFYRLFYKRFFDKPPKWRTFFRK-HKIS----------MARQY----KLLDQAVASLANFHigaepTSLSHVARVHA-NLQLGREQYAMFTDSFLESISEM-GEK-DED---------------------
>SRR3569833_2822653 
----------------------------------APPERHTVLHE--AI------------VTNPVEVAGAIGWVVEHLHRteeVATACGELGPALARLLAGHEQHLDACGRSIIDAIRTGLADRWKPEFDGATSSAWELVAEWLRRG--
>SRR4051812_2284027 
----------------------------------TLPEMRTVLHD--AA------------IADPHALGRAVVWLMDNLTRpfvVTAGCELIGPALGDLLAEHPRDLEAFEPALTDAFRTALGTAWKPDHVTALHQAWDLTVKW------
>tr|L8JU91|L8JU91_9BACT Uncharacterized protein OS=Fulvivirga imtechensis AK7 GN=C900_03083 PE=4 SV=1
--TMEIGKITLVQNSYGRCL---SSGKLLETfyenFLSSSRDVADKFR-------------NTDFEQQRKLLRHGINLMIMYAaGNIagQTGLKRIKESHSRgRMNIEPRFYALWKAALIKAIAEHD-RDFNVEIKAAWNEVLDKGIVLITEGY-
>tr|A0A1Z9IBY6|A0A1Z9IBY6_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium TMED162 GN=CBD22_07770 PE=4 SV=1
MVGVTQTQEQLIEQSLTHYAarHGDPYDAAFQKLYAAAPHYEGLFVL--DTD--EGLR-----RNMMRTTLEMIATYIDDAYAAENLVTGARLVHL-TYEITDD-FDLFFQITRDVIAEGCADIWSDAHAAAWNTMLKDF---------
>ERR1719295_1776256
--YLQPQEIVHIQGSWATVErqLFNLGARVFISLMENQPNIKRTFRQYRNKR-HSELRINEDLQKLIMLLLCGMKRVVKYLNDtkaLTKYLKRMAKRHSPTeidfARINPAEVASVFCAALREIAPAEKDQWTQEVEDSWTSLIGGLLAA------
>ERR1712029_417561
-------------------------------------------------H-GSDWKV-VQVDRIILI-FRTIT--------vIIVRVQSVEKDHI-hT--------RKSF---------TQVLKVETVVEDSWTSLIGGLLAA------
>ERR1712071_338654
---PTAEEIALIRESWPIVKkNKNVFVEFVLEHFRVHPKTQDLLPEFANLAI-ADMPSNKFFVQLTEtYVVMAMQEIIDNLDNagvLTDLLQCLNSNWYVDyVSLDRQN-RETLRIRRVGQEQKSYSRNMESneiQQQRCPQNLRQAVH-------
>ERR1712179_849736
---PSAGV-------------------------------------------------------------------------PVNKLEENEDFQVLAyYSSAVATFivtnLDQEDILTHILVQQTKP--------------EQFVD-------
>tr|A0A077ZE79|A0A077ZE79_TRITR Globin OS=Trichuris trichiura OX=36087 GN=TTRE_0000613901 PE=3 SV=1
-------EWYNFKNFWKTVQrnKDNCAKLMFFKYLEQNPDLLQAYAKLRNMEMNeETAFNNSDFEHLANQYLDVFDEAITTIEsnpgDvssVVEELQNVGKRHRRIscieassfavtttvskDWLSVAILQKLQEGFMEMARQVLQDRFTEKCENSFGKFFDFVAKNLQQGF-
>tr|Q7M422|Q7M422_9DIPT Hemoglobin V OS=Tokunagayusurika akamusi OX=28383 PE=1 SV=1
-VGLSDSEEKLVRDAWAPIHGDlqGTANTVFYNYLKKYPSNQDKFETLKGHP-LDEVKDTANFKLIAGRIFTIFDNCVKNVGNdkgFQKVIADMSGPHV-ARPITHGSYNDLRGVIYDSMH------LDSTHGAAWNKMMDNFF--------
>ERR1719253_2317543
---ILSPAGRVLRLRGPGFLpprcrfgrlspnhccsrvspdriavarrPPPRPRSRPTSSPSPRTSTRGc-WAATRSC----------CSSSTrpttspsprt--SLR--------PSPAPSrptPPTSPTC-LPS-WSPAGPWRPSVTA----------TSPSPSTRCSTSWCTTTSwrpsprswatssrrrsrpagprPSSSSPRP---
>ERR1719253_507459
---LSQSAIDVVVSVAGRDArrARPRAGPRR----------TDp-WRRRRRA----------ARGG-gpgrragevqtraaegASTLGHGLVR------RGRalgHGLVRHGRGHC-HDS-------------------------------------------------
>tr|A0A016TEH5|A0A016TEH5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=Acey_s0110.g162 PE=3 SV=1
----------------------DTAGEYHKQLFTLHPEIAKYYDA-EDID-PDSIPKAQKFIMLGQQELQFFFRLPDVVDNerqWRSALSSFKE-TFGDNNVPMSEFNKVTDAFLAAMQKNAGG-VTPEQKKEWEELLAKAYADMK----
>tr|A0A0B2W4R6|A0A0B2W4R6_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_05310 PE=3 SV=1
----------------------DTAGEFHKQLFKKHPDMAAFYDA-EDLD-PDSIPKSQKFIMHGMSELQFFFKLPQAFSDerkWRSALSSFKD-QYEDVGVPMKEFNKTTDAFLAAMEKNAGG-VTAEQKKDWEELLAKAYADMK----
>ERR1711965_451221
-----------------------------------AGAVR---------P------------RP--------AAVI---GFPFPLFP-LLETADMtsvAVGAHPRLRA-----L-----LRDR-G---AWYLTGPQELASVIGRLERLER
>SRR5882757_2588511 
--SLSSRQQILARRFFDAVEAsdKPLAAMFHERLSEIDDRLDGLLL--EEE---------GCLLREAMVIVRTLSRNVDRLNRMVPIFRAFGRTCA-AQGIASANYEKIAPVLFWIAQECVGSEFSVEMGRALTALYDQLSREMKD---
>SRR5262245_14724532 
--------EDVVKKAYQRHCYrqPEFYRSFYENFFSRVPKARAMFK---DMA-----------RQHE-----MLDFALGQLLNysqqqSEpTTLTQFVERHS-RLGLTADDFKRFGEALIATFDSELRGdCEHHRTMAALEIVI------------
>tr|A0A183IYP9|A0A183IYP9_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 
------------------------------GLFTSSPEIRSLFPTLVDW--GDDIKTCQKFRNQGLKFVHVISLSLTTLHDkehLDTLLKEIGTRHVEfmPGGIKMEYWDIFEKAMVKCILQQIRwtDDFDEAIQskaaIAWRILCAYIVQKI-----
>tr|A0A0C2M2P6|A0A0C2M2P6_THEKT Uncharacterized protein OS=Thelohanellus kitauei OX=669202 GN=RF11_12769 PE=3 SV=1 
--FLTLEERLKLKESWIKIYqkiqdlPdVDITFEIFVRLMERRPEMSKNFE--KDV------YKYSRMKSHSDKMLVILNNMIRNLDDeqkMLKYLSGMVRRHR-NYGIRQGDCKMWEEIFLDIISR------------------------------
>tr|A0A1I7YD88|A0A1I7YD88_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1 
--LLTLRQRKILQRSWNKSQrtgLDNIGAHIFLKIYAKDSSVGYLFN-LGNCP-HSELKYRKFFQDHAMTFTRSLDFVMNHLDDLErvsKFCVELGKTHVKfmRRGFKTSFWDIFAEALTECAIDWEGGLRCRDVLNGWRTLVSFVIEEMRKGF-
>SRR5262245_33555564 
--------------------------TFYEHLFEGAPELRSLFPI--NM------------AAQERKLLLTISVVVKNLDRdeeLKRLALHLRDVHE-GIRIEEGHIEAFLGSLAHAFQQVHGSPFPRH---DWLTLRRAV---------
>SRR3954452_7277257 
------------------------------HLFQANPEIRMLFPI--NM------------AAQARKLLLTISVVVKHLDReteLQRVALHMRDVHS-HIRIDEGHIELFLASLAHAFQQVNGGAFPHQ---DWKNLRRAI---------
>tr|W4XW92|W4XW92_STRPU Uncharacterized protein OS=Strongylocentrotus purpuratus PE=3 SV=1
---------------------------------STHPEDSLHLHQ--GCCSHLASRESCRFVDQAMQVMQTIGNAIQNFDNKelfNTNMKELGLLHC-PVRDDtlavIHNHEVFKDALYNTLRKSLTESLTPEMTFAWKAF-------------
>KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold7330878_1 # 87 # 278 # 1 # ID=7330878_1;partial=01;start_type=ATG;rbs_motif=GGxGG;rbs_spacer=5-10bp;gc_cont=0.391
------------------------------------MASQTQFvygDE--DTVMACLTKESCRFLEHAMSVFQSVGGLVTSFADPpsdRKFNLDLGLKDQ-PKDVQDRHYKVFMKCLLKSVRFHLADSYDLAMHFAWKAF-------------
>SRR3982751_838383 
------GINDQLRESAAMLTsgGteatDAVIRDFYIALFRNAPSLIAIFPG--NPAQGDFG-SDHRGAKQRELLLGALAGLADLYdpgdaermTHLDSVLKRFGRSHAAFtrpdgtvSGATLDEYKAVKDALFSTLVRAAGDRWRAEYTVAWSQAFDYAAASMLL---
>SRR5690606_20444479 
---------DIVKQSFERSkQRKTLATIFYQNLFFLKPKIKNYIKQ-TDF------------AHQEKAIMDEMEFLMAFLDDkdrhARQQILRIAGTHSAkNLNIHPHDYYYWLEALIMTAKEC-DHLWRDDFQYYWRECLSFPLTFIISQYY
>tr|M6F3R8|M6F3R8_9LEPT Uncharacterized protein OS=Leptospira kirschneri serovar Bulgarica str. Nikolaevo OX=1240687 GN=LEP1GSC008_4081 PE=4 SV=1
KMNISENQIRSLNESFDIVNLDriKFAELFFIYLKENHPKYENIFSRI-QL-------------EDVKHFMNSARNISLSsVQYsqLERAIQNFGVECL-KICNQAEEIPILEKAWLFALEKWLGPWYSHEVEKSWQEVFKMIHTSS-----
>tr|V6I1Y8|V6I1Y8_9LEPT Uncharacterized protein OS=Leptospira alexanderi serovar Manhao 3 str. L 60 OX=1049759 GN=LEP1GSC062_2771 PE=4 SV=1
GMNISENQIRNLNESFDIINLDriKFAEIFFVYLKEKNPKFENIFSKI-QL-------------EEAKSFMNSARNIALSgAQNvqLEKAIQDFKMECI-KICNRTEEIPLLEKAWLFALEEWLGPWYSHRVEESWQKIFQMLYSEE-----
>ERR1719272_197188
--SLSATQRASILASWRQLCGEDGGATfcasLLGGAFEAVPETRALAGV-PEAAPEPeAvpeaeaavaapapapakgkagatavpeaaaaveeaaeeaveSAESVALRAAAAHAAVAMEIMAQQLSapeALKESLTELGVKAA-SRGLGcGAPFDRLGEALQTTLQASLGDeAFPEALAEAWRQLYAQASQEIQLQY-
>SRR5262249_23394332 
-------------------------ELFFSRLFAIEPGLRHCFDG--C------------FLGRRRAFEWMIGAAVRGRPDLRSFIQALEFMVAPSDATVHQECERLRDAFISSLSGSLGPRFTVEMMNGWLAVFELLH--------
>SRR5438034_714626 
--SMTEASIIAFNESFERCMaSGRFFDVFYDHFLRSSPEIAAKFQG-TYF------------NRQKRMLNQRPATTVGQpr-------------RSAReSRKTPAAQFVStcqampsaFVSELTKSGSTX-----------------------------
>SRR5258708_7736634 
------------------------------RFTGTSDAIREKFKN-SDF------------AVQHQAMADSLYLMAVSvqggPEN-LARHDMKRLYPKHqRMEITASMYDVWLDCFVATARIH-DPECTPAIESAWRECLTPGIAAMKSGA-
>SRR5690242_5369812 
--LVTEDDLALFLDSFDSCVaNKEFVARFYEIFLSTSPEIRALFAK-TDF------------HHQRRALKASLHVVAACaarrRAD-YSALDELADR--HrELRIEPRHYAVWQESLLAAVSEC-AERWDPDVERVWREGLSEAIAHMAS---
>SRR5512134_285705 
--ALTPTHATLVRESWARLAPGrAAAVhRFRARLEAVSPRTAARFTCL-DH------------EAQRDGLMIELDQAIAAtgsDDDLVPALARIARRFR-ESGPASSEYPMVRDALLEVLAEADRGIAPPELRRAWGSLFGLLAALV-----
>ERR1719232_1195758
-------ETVIIKDTWETIHkqVKAIGMEAFEKLFALNSDMSAYLPQTDDLDQDETRRLSDKVKSHAKLTMETLEQVIAAIPDMTEvynVITKMKKLHP-----QTGLLEVIGPVFCNTTRHFLliQGRWSLDVQRAWLALFGEVSAMIRASY-
>ERR1719189_1497217
-------GRQADEQ----VGreEAGPGHRGHRP----AQDDPAHLRgarDCGQRVRGRARRHGDRGV-QGRGQGEQS-QH--------------HRHQG-----S------HGQ----------lHGRHX-----------------------
>ERR550519_213
-------NIVLLRDTWSVIHrqVNTLGMETFQKLFEINSEVSHYVSpscpDLDPd----CIDSTTQAIKAHATHTITILHNTVSNLCNLgd--lagE------------------MNRLGKLHCDLGIDHGil----------------------------
>ETNmetMinimDraft_22_1059887.scaffolds.fasta_scaffold1682169_1 # 3 # 206 # -1 # ID=1682169_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.363
---------GTVFSQWRRMKIEDFGECMY-RSLVQDASLEKLFRR-------------ERMRTQSLLFAAFIQVALCWLEErdfrkVERDMISLGLRHR-SYGIQPSYVCVFQIALLQTLCQNLNG-LSLQAEISWSVVWSHF---------
>SRR6266567_3650358 
----------------------------------------------------------------------------------RAPSKAWGsgtspmascqstipssersfwkpsatywesaglqrtmmpgrkptkgsarscwkgpthrsqpeqssrqchrydlwererqdkikkgeatldtkqaaqkgfeQQHA-VVIGGSMAGLLAARVLSTHFGQVSVieRDHLPDGA-------------------
>SRR5579885_1989414 
------------------------------------------------------------------------------------------xmsnqqssrsgfgGQHA-VVIGASMAGLLASRVLSEHFEQVTVieRDQLPQEV-------------------
>SRR5579864_4130097 
------LQIELLETSFQAIApcGEAFVTAFYERLFMRFPQTRAFFAS-AE------------RNIKHVLAKPTIVTTLQPTRSascRTTRIT------F-PSSVGTAGVPISRS------TGYAGs---------------------------
>ERR1719414_1806988
----TVAQAEKVVAQWDAADQDAFIVAMYQAMMKTHPEWRALFNK-PTGA---PTPAEAEWKKQFDLTKAVLDRGLRsratDVDALKERMHAMAGRHV-NYGVTQTHFQALKPILTDVLAATVTG----ADMDAWSAVTYFMLDSI-----
>tr|A0A090RS91|A0A090RS91_9VIBR Uncharacterized protein OS=Vibrio sp. C7 OX=1001886 GN=JCM19233_1279 PE=4 SV=1
-----------------------FLTFFLQHFCSTNPRFAERFCGV-DS------------EQQTKMLKASIILVQnaAENPYIRNNVKSLAKRHKEmNLNIKPEELVAWRESLLATVANFD-PLFDDDIDQACAQRWN-----------
>tr|A0A139A347|A0A139A347_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 OX=1344416 GN=M427DRAFT_73171 PE=4 SV=1 
--MLSAEQARLLKKNWKDIGASsvanpmmFVVAQFYRRLLRK-KGYKRIFEGI-DI------------ETQYFKMQGALTACVEfaeNLDKFADTIRRIGARHA-RYNMTPNMMNDVVDSLVPSLKEFsldHGITWNEEIEEAYDEWLEQVTGYF-----
>SRR5262249_57009646 
-------------------------------------------------------FRKTDFPRQTRVAADTLFlmaVAAGARDHavAWRGRDRLPGTPPPpGLHSSPRHHPAQLVCPL-----------------------------------
>tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 GN=TSPGSL018_8354 PE=3 SV=1
-----------------------VGAGFLKLYAQRNPWAVEQFS-FG-LR-----------PQHAEKMGLALELIVNSATRpqvLQHQLRVLALGHV-QMGIKPEMFKSFEEALFAFLGQVLGAhnTFDEETEGAWRWMWGIVNAVFTQ---
>tr|A0A090LKP0|A0A090LKP0_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_2000335800 PE=4 SV=
-EELPKADKDIIISTYNILL--QADPELFSKAWimsaSRSTSIRKAFS----LIDP----NSTHIEVDFTKFSAVIERFFTriiceeKLVNesFEKSCINLGKKHVDfvPIGFHSNYWDIFMNCMIDVIAETVIIAFNEdnkqqqQVQKCWNKFVGRIVFLMQSGF-
>tr|A0A0M3JT43|A0A0M3JT43_ANISI Uncharacterized protein OS=Anisakis simplex OX=6269 PE=3 SV=1
-RSFTTPQLTSVFNAHFSMI--QLNPDVIKDCWiktsKRSSSIKKAFG----MLEH----EEPETNASFMNLPITIQAFFKelifelDCDSvkIRQRCEQLGARHVDfsERGFHSNFWDIFQVCTIEVIAEC--NLGLNedqhrSYELAWIHLLSSVVKSMRNGY-
>tr|A0A0A9Z6R2|A0A0A9Z6R2_LYGHE Neuroglobin OS=Lygus hesperus OX=30085 GN=NGB PE=4 SV=1 
--SLEEDEIERIKKSWVLVKEndfrfiDILRQEMLCDI----MMYELYFNPG-R-KADVCVSELTEFKNHPKNVYSTLDFIVGDLENenvIIEKMIEIGKNHG-RLGISRKHISFMTSTIYQAVECTIGPcMFDRLVDQSWEKFLTSFND-------
>SRR3990167_8699843 
-------------------------RLFYAHLFAKAAHLKPLFG---DSE-----------DTQNFKVIKMFELIIDNVEDLtqvQPICLDMAKRHS-FYGVKNDFYQYIDEAFVWCIQQQLSLSIQDPIIHAWYAATKYISSIMID---
>SRR5690606_19766530 
----VSDQYTDLQQSFGRCLrDKNFIERFYEVFMASNAEVAAMFAR-TDF------------QKQRLALRRGISVAIFHAAGssVvKRSMQQMADVHSRSgrCPVAPHLYPYWIDSLLTVIAETDA-EADEALLARWREAMGVTIGTFIGAYN
>tr|A0A023F5X6|A0A023F5X6_TRIIF Putative globin (Fragment) OS=Triatoma infestans OX=30076 PE=2 SV=1
--ALTADEKEILKESWKNRgiNKSTLAMMWFTKLFKANAEEIVEQNR-GQV--VEELFMDEANFDYVDKLADIFNIVVKNIHKstLcTKLIWEIGMYHC-CLDLRDGYFELMKETLLDTLKENMQPPLTSEQIEAWKKFIGVMFDIVHE---
>tr|A0A0N4YMT1|A0A0N4YMT1_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis PE=4 SV=2
---LSLEVHDLARAHWIQLHkLNRQSnliQNALLYIVENYKHTRPIWQ-FGlGIDEstkdwKTLLFNNFYFRHHSASIQAAITMVMENMDDrdcMKKLLNEIGAHHF-FYDACEPHLELFEQGMIHSLRTTLVGhvKIDESTEQSWTLFLKDLKTFMGEG--
>ERR1719326_703414
--------------------------------------------E-HPM-------------IPITMTEES----VKLVQDsl-SRVDSLVQV-----RDALQDvFFPHLF---------------------------------------
>ERR1719487_2229452
-----------------------AALSL--------P-------T-EQE-------------SPVTMTAEA----VQMVQDsl-RRVDSAVQV-----RDAMEDvFFPHLF---------------------------------------
>ERR1712176_999243
-------------------------------------------------------------------------SY-AHRDTfdqladAPRTI--FYTQK---------QGHPECSEMVEKMKNIVGDE-------------------------
>tr|W8BTT7|W8BTT7_CERCA Uncharacterized protein OS=Ceratitis capitata OX=7213 PE=2 SV=1
-LGLTITERRSLQNGWSIIKqkQRRAALTIYVNLFTEHENLYEVFRSDGV-------LNIEFASQHQKEVLTVFQMIIEQVDNarfVKTMLKELALRHE-AASVTNTQWQLYTNEVRKYFLETLADAISPTFVHALDKLMNFVCN-------
>tr|A0A1A9YF90|A0A1A9YF90_GLOFF Uncharacterized protein OS=Glossina fuscipes fuscipes OX=201502 PE=4 SV=1
-MGFTPLEIVALQNIWRLFKkrFKYHSMQIFLAFFNQNHKLIERFRLpSGK-------FQLNYLCQHSEKMLLLYENVIDkCLDNmanFHGIMADVTVSHR-HSGVTYEDVSLKSEHVRRYILDYFANQSSPTLVSALAKLSEHFND-------
>ERR1719370_117345
---------------------------------------------------------NATRMFPAKAALQESVEVmVDVLERrgmWGSGIRDAGISHH-KLGIKRRDMEKLATSILAAISDLLGDcDLDRKllQLNAWKKLLNAIADEFSA---
>ERR1719234_1549997
-----------------------------------------------------SLWhrssiQLEGASNHNKALMNAIDSVmVEVLERrpmSKSGIRDAGISHH-KFGIKRLDMDKLTTAILAAISDVLGDcDLDRKmlQLNAWKKFLNAIGDEFSV---
>ERR1711972_141202
--SISETEKTYCIKEWVKIcsDRSKTGTLLLSHVYQENPQLLTH-PAWKDLS-QDQLKENQHFKNLAEKTMGSVEQILTHIDNVDkvaSMFEQQGKDYK-SAGKSMSH---IMACLETFLPLDHPSlEVTEEYRGITQEILGIIKQSLMKGYR
>tr|A0A0N5DD39|A0A0N5DD39_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 
--NLTPHQKQLLVQSWPQVQlynRIHGGDAMFARFCEKNSIARETFQKIAVVQSfASNEASESVLKKHEQYLVQLLSEAVENLNNdCEPLLReclDYGAQHVT-LHelLNETVWEQLAEAIIDRIHKVNLVRRHKDLSKAWTMLIILLIDKIREGY-
>JI8StandDraft_2_1071088.scaffolds.fasta_scaffold105816_3 # 981 # 1154 # 1 # ID=105816_3;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.718
-----------------------------RNLFKIHPELKHALNI--EIK-KSGIQH-----VPLASIVFSYAANIDNADKFLVIIRHIVDKYS-SLGITVNDCPIIGSLLLDAIKESLGYAATTHLLAAWAEAFGLFTNALVQ---
>ERR1719199_1194134
-------HAGYIEKSRESVlnlDAAQLGADIHVKFLNVYPAAASLFQK--TLR----------M-LITTKIMGTLMAVISDPTGTLEDVRAVGVRHT-KYGISERYLLPFGAMLWEIVGTMLPGMWSDEHSAAWAFYLDFIASTMTRA--
>SRR5882724_2518483 
-----EEVRRKARKSYRELQDSAFYCNFYAELFRAAPDVRQLFRNI-NM------------DEQYEKLHAAVGKLLNfrPTDDPNP-MSRHAESHE-RLGLQPKHFEGFRDAFLTALSSRK--TADNYAMDAWRAIFDAGIAYMTTK--
>tr|A0A2P8AX05|A0A2P8AX05_9ACTN Terephthalate 1,2-dioxygenase, reductase component 1 OS=Micromonospora sp. MH33 OX=1945509 GN=tphA1I PE=4 SV=1
----------PDPQRLLAALgaPDQAADHFWSYMEDRSVRV---LP-----------------QQFAPMFFSTLAEMVARRGDpaaRRAELALMGRMYL-RFGLYPYHHTVVAAAMVDTVRRFAGASWEPDLAGYWEvgcrRSLRLAE--------
>tr|E5XPI8|E5XPI8_9ACTN Uncharacterized protein OS=Segniliparus rugosus ATCC BAA-974 OX=679197 GN=HMPREF9336_01410 PE=4 SV=1
----------TFVRSFHlELFgaAPELAARFPPGLGEHRGGF---VR-----------------M------AEHILETFAEGADpprLIDLLGQLGRDHR-KHRLDERDYRLAQAAFAKALVATARG---SGDGAFAAraaaLVCQVME--------
>tr|A0A246RU09|A0A246RU09_9ACTN Uncharacterized protein OS=Micromonospora wenchangensis OX=1185415 GN=B5D80_01060 PE=4 SV=1
-------------------------MREADELRSALPDR---LA-----------------AHDAELLIATLRRLATD-PEpaaQAVTLTVLGHAFR-RFALLPHAKLISALAGAD-------------------VPVELLR--------
>tr|A0A085M5J8|A0A085M5J8_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_06691 PE=3 SV=1
-TCLTKRQRRCILKSWRKVqNKAQLGEEIYIQIFMQKPVLKSLFP-FRAT-PVNELHDNVLFTRQAVIFIDFIDNVVAYVGinNgrlLQELCTRVGISHALMtrVNFDPEWWYLFANSVLDGMQKFCLPNFSCEpiatyigsqSMLAWRILLKHVVEMMSDAF-
>tr|A0A2C9LD65|A0A2C9LD65_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106067556 PE=3 SV=1 
--QLSHKDKLFILNSWLNFrNgkrEEDIGMEAALEMYSIYPEIKDIFTIYRDARM-KHLTDKEMIRTHSQQVASVVDKCVMRMDDAHAfamIAVDEGSVHI---KIQERFMRCYVDCYIREIKKYSKLKWSRANQMAWEVFFDTIVVNMKNGW-
>ERR1712086_1089461
-------MG---KEHGDGDSsadaNTAAGLDVMQGKKPEQKESKRWFSlgssaakgkqerS-----------KEEKEEKIADKALEMSAEMYKDPTRIQGETMGLGLRHI-MYNVDPAFFDALVTAYVEEMAVRTT---------------------------
>tr|B3LWC8|B3LWC8_DROAN Uncharacterized protein OS=Drosophila ananassae GN=Dana\GF16358 PE=3 SV=2
--GFTCVEKAALRNAWRLIEPfqRRFGKDNFYNFLTTHQDLIHNFRL--DPRSSDSPINLSKLHGHALAMMKLLARLVQTLDiNLqfRLALDENLPAHL-RRGIDPSYMKMLATALKRYILESsvIQNHNSSTLTSALTQLVSII---------
>tr|B5DW13|B5DW13_DROPS Uncharacterized protein OS=Drosophila pseudoobscura pseudoobscura GN=Dpse\GA26483 PE=3 SV=1
--GFTLCEKVALRQAWNLIRPreRRFGQDVFYTFLNEWYWSISKFKK-------GEDINIALLHAHALTFIRFVGALINESDPImfQVMINENNQTHS-RCRVGADYIAMLGQALTDYILKVLDKVRSPSLEQGLQRIVEKF---------
>ERR1719162_2542559
--------------------RSDIGMCVWNRVFVEDPKAENFFKQ-SN----------Q---RLIYIVTMAIKYSVEFYGDpekTKMAIEALALKHI-MYQVQPRMFMLFVTCYDEEIKARTDD---KLVQSGMHWSISIIASIMA----
>tr|A0A0V1BAT0|A0A0V1BAT0_TRISP Globin-like host-protective antigen OS=Trichinella spiralis OX=6334 GN=T01_2203 PE=3 SV=1
---------------------MENGGQLLANVFKANPELRKFYDV-EDID-PDDTKKSRLIQQAGGNLLNSVTFMVNNYDNErsfKQEIKEQICDLR-EKGMKLEDARKLKTGFVNYVKSKLSQPMTAKEEKEWDMFFQRFFDALKQ---
>SRR6476620_89806 
--------------------RHATRQQRRPDVF----------HER-QRTAGE------D--lnVLRERDVGQVH--ESLARAgvavIDGVVPRIGCEVV-DLSSEMQNG--------FPQGVIL-SAAVGVGDDDG----------------
>tr|A0A2W4R8Q8|A0A2W4R8Q8_9CHLR Uncharacterized protein OS=Chloroflexi bacterium OX=2026724 GN=DIU68_09390 PE=4 SV=1 
--RLSRQQKRIIQRTFSAVAvrHDLVARLTIERLRElsRTPAS-TC---FGNTP------------EDRRRLMHLLALLVQRMDDRGA-LHDACVAQTRQMGCDPFeggSTSLLAEAFIGALQSALAGRFEAKTEAAWREFFQMVERVLR----
>tr|A0A0L0FDI4|A0A0L0FDI4_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12917 PE=4 SV=1 
---KTDSEVELIRSSWRALLaGDGtaaqmpllrFVEQYYKRLFRLFPDSRGVFKT-RDTQ--------------SKSLSLLLSIIINVADEpeLemNAKKKKLEMMYK-EYGMNSLLAVIAGRVLIQSLQAFLEAsnKFQASVKDAWVKCYTSIADQL-----
>ERR1719203_545915
---------LILKDTWAVIveQIHELGLPTFVKLFRLSANLRYYYPKHnRPES--TEV--QENINTHFDQLVAVVDDVVRCLPDLsthIQYLRNLGPVHC-DVEVQPRLLELMGPVFAILSDLYCWskadgvirLKWPGYYYFDILLDScemVTIQLLLDLX--
>ERR1719232_1194111
---------IMLKDTWSGIieQMHELGLTAVVRLFKINYNLRFYNSPNvRYHP-TTHTNvkvlrgttaapatpaavasgstaaataagpsakdqatgksNLEDLSIVFNLLVSIIDHMISSLPNGsspTSHAGRNGksngtkakftlsaATMK-QLQILRQPTDWVGPVFCNTVRPLLLvqGKWSYQVEIAWRLLFRHLVRKNRTFD-
>tr|V6U182|V6U182_GIAIN Flavohemoprotein (Fragment) OS=Giardia intestinalis OX=5741 GN=GSB_151570 PE=3 SV=1
-MPLSEDTIKAVEATADLVAaqGLDFTRAFYERMLTRNEELKDVFNLshQRDLRQPKALLDSL--VAYARS-IRKINELhelqeqglpvpAERLAELqgfFAVAERIAHKHA-SVGIQPAQYQIVGAHLLATIEERVTA--DKAILAAWSKAYDFLAHLFV----
>tr|A0A1R1LGI5|A0A1R1LGI5_9GAMM Uncharacterized protein OS=Motiliproteus sp. MSK22-1 OX=1897630 GN=BGP75_23395 PE=4 SV=1 
--------LDKIYSTLQLLDdekSEKLINETYSIFFNAHPEAVLLWSK--DDPE-----------SRSKMFNGVILTIIDNLTRpdiFKNNLLSDVKDHD-EYGVDKEMYGGFFLSLTEALKKTLGSEFNQEMELAWKHQLAHIRE-------
>ERR1740121_1123239
--------------------------------------------------------------------------------------------------vWIVVGSA----------SVrHR--LrAFGSASGSSSgRRLSGidY---------
>ERR1740121_2035324
------------------FTplt-----Cqwa-----TPHDGPAQHVL-------------------------CEDGHFahFATDKCesAgHgA-RVQCPSDMPEMcaDttcgggqehccrpaggCTGgERPCPT--------TASASgSA--SgsaSGSASSRRLAgIDYE-----------
>ERR1719271_1314470
----------------------------------------------------------------------------------------------ghRqdeqhglQVPwCHQIPAVRGDC--PGLALQpCR--V---------HrREWC-----------
>ERR1719240_2235476
------------YE---DEE-------------------------------------------------------------------------GAqvdvmkgEDALVATADLLYQKMSEDAN---MQT-lLGNIELAELAsKLQKALa---------
>ERR1740122_169377
-----K------GE--ADKSgnAEAAGGgqGDTPETGAAQDTAAGV-------------------------TDEHS--------KaLGIEISS--FDELkvDqkciaaaIDAwKLFISTAESREAAGEAV---YNA-lFEGAPS--LQALFVTPRAE------
>ERR1719243_286169
------------------------------------SHPVNV-------------------------LVSDTMwkGY----t-vRgIRRVNYY--VKYMmlTrdgnvsqALGwFKDAADCKIISH-PVNVLVsDT--MwKGIVRKQFLGgRLWFII---------
>ERR1719158_147189
------------RV--CYLYplvhcNILAVLrelnfdGAAESLCLDAPALLPT-------------------------MLDGLIwrSR----vTeNgQRRVNYY--IKYFivDaeggfskTTEvMTDNGDPTIVCR-PVVSLVtDM--IwGRVAFRTFLYgKAWFLF---------
>ERR1740121_2502219
---------------------KSFALEVFKRLFAMVPHSESFFKQ-----------SNTRLIFIVSRALDMCMNIYKEPTRLVNEITALGIRHI-MWNIPTTYFDPFVQCMLDEAIVRYGAS--QQAIEGLEWSMRIIASIMV----
>SRR5262245_17232684 
---VEEETRALARYSYLQWlDDDEFFSAFYESFFAGATGAKGKFR---NV------------EQQRLKLRDAMTAVLNFYpGNEPTSLHRLIAVHA-ARDVTGTEIEQFERSFLEVLHQRLVERKIaeqlgpdvvAKIEQGWRELLHPVVQYVMGV--
>ERR1712137_24889
---LPRESITVIRDTWAMVErNVDIAPKMLLKMFQLYPMTQNLIPLLRGVS-LEDMPTNKRFLQLAYGSQFAMSAIVDKLHRpdmLEEIIG--GGMHAFVDGLSTS-FQMAaTTAlFNKIMTEELGSAYTAEAQEAFIATGDMMTSIMV----
>SRR5262245_32700325 
--WLNSNQRDLIRRNWDSssK-RYELCRRIYCRVFARRPEIRRIFSIGYDW----------WRLEI-VTFADFVQSIVDNLDDAkrvRQSAFEFGRDHAKwrRFGFRSDFWVQLAESTTREcvyLDAAVH--PPDESLETWTKFVSIVF--------
>SRR5271165_4656598 
------------------------------XMFYKKPDLKPTFIeIGHhidpendggLT----------WEV-EAQRFTNLLTDLIGNLNNLdrfEELSFDWGRNCVQwrEFGFKPEFWLHFSEAMTTEclyMDQAVH--SVGEVIEAW----------------
>SRR2546423_8132340 
----------------------DVADEMFtARLLELEPQWQRVLS---DEP-----------TEWGRRLLRAIRQAVASFTClggFAEALRELGGVPA--AHVGYRDYERQGAAFVGRLEHSLDKPMAGAMRESWQRVFRLLAE-------
>tr|A0A2A3E2S2|A0A2A3E2S2_APICC Globin OS=Apis cerana cerana OX=94128 GN=APICC_08732 PE=3 SV=1
-------------------------------------------------------------EAHCQNTASGCIDALDDVDLMEAILHTIGERHG-RRGQDRQQFIDMKGVIIEVMKDTLKSKFTIEIEAAWDRYP------------
>tr|A0A1W0WMU5|A0A1W0WMU5_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_09357 PE=4 SV=1 
--ALTHVQINLVRESWRWLNFnrplQETAVRFFlDFYFKQNPDCLPMFG-MKTVD-----HYNKAFSIHALTVMHAIKYAVEYIGNpeqFQRLFRTVGQTHL-RFGLTDLHVERFLEQWLAFLRANDAKVFDAATVEAWNLAGRIVVSQI-----
>ERR1711911_15016
--------VDLVRKILDKAKqNGNVAPKVFFKYFKAKPASMKAFPAISGLA-LSDLPRNGAFLSNVYTCFAGLKAYTLETDV-STRCPVFAKA---SGKYKSEDIDLFTSILKGVVAEELGADYDDVAKEAFEQFLDAVALTVT----
>SRR5690554_6373173 
-----------------------LYLSCYDIFMGQSADIGAQLFN-TRMS------------AQHGLLRGGIMWLIMHARGMsDSNIRALGKSHSRdQLYFHPSHYALWLDALMETLYKHVP-EFNLQLELAWRRTLEPSIDKIISMY-
>ERR1711879_742838
-----------------------FFEDFYSIFMTKSPDVLNMFAN-TDME------------AQRALLRSGILWLGMHARGMpDTKIRALGESHSKkKDEHQPHVLFHVAGRSDGNAFPPRP-G----LHSRTGANLAPYPTAHVT---
>ERR1719461_1661620
---------------------IEVGCYTFTQLFSQYPM-MDYLAKFDGLEV-EGVCIGEALRAHADAIGSVVAEIqenAGNPERIRMSLAQAGHRRF-LEGVERAQLDMLGPNMAETViIKDTWevISKQVKSigMESFEKLFSLNSDMSaYLPQ-
>ERR550519_213
---------------------IQVGCDTFTQLFQKYPQVNNYIAEFDDMEV-GGIKVGPALRAHASAVRSVVTEIqenAGNPERIRSSLAAAGHQQL-MAGVERKQLDVLGPVLCHVIRPLVWekGIWSVEVEKSWTHLFDIVACLMKLGY-
>tr|A0A173LPQ6|A0A173LPQ6_9ACTN Phenol hydroxylase P5 protein OS=Dietzia timorensis GN=BJL86_2914 PE=4 SV=1
---------------------PDFRRALEDALNTEAPYLRADLPR--NLD---------GPFA---TFVKLYRFLLTrvedsggdraKVDDVLDLCRELGHDLA-KYNVVEEQYERFGHALNAALARVAGEEWTGELSKVQNQFYVIIARALHK---
>tr|A0A0M3HYR2|A0A0M3HYR2_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=4 SV=1 
-PSLTPSQVQTIRKSWKHINtkgLYTVIRRCFQQLECMCPSVSNAFNSA-NNQLSANISTVRTLVEHTKFMLILIDRIVENDQDSIIELRRIGASHVVlkeSFGFGENELEKFGEMLAEAFLKLDGIRQSKETSRAWRLVIASMIDQLRAGF-
>tr|A0A1I8CNT8|A0A1I8CNT8_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 PE=4 SV=1
-IGLSNYQQKLILQCWPNIYttgnSSTFATNIYPNLCTRNQKAKALLQK-AD---GVAVFSQSeidCTSMHSKLTLEIIDSVVRNFDSnpisLIGYLNEIGHAHRSlkSIGMPSSMWDDLGDSILEGVRRNDLVRKHKELRRAWLAIIAFLTDNLKQGQ-
>tr|A0A0N5AJ93|A0A0N5AJ93_9BILA Uncharacterized protein OS=Syphacia muris OX=451379 PE=4 SV=1
--QLTVAQSVLVRKTWAHARnqgSMEPAMSIFRNSFFKSPDIRALMMA-GS-----KNTGYERLKRHAILFTNVMDKLIAGRvEEidsVIEELKNAGKEHACitreQYACpfRTSLLDQFAAAMIErTLEWGEKKDRTEVTQTAWTKIVLFIMEQMKAGFH
>tr|A0A0H5S8S8|A0A0H5S8S8_BRUMA BMA-GLB-3 OS=Brugia malayi OX=6279 GN=Bma-glb-3 PE=4 SV=1
--QLSSYQIHLLQQSWQRLRcSPNFFINVFRTVISKNTIAKELFRKT-SIIDGFTSYKCYDVKEHADSLIELIDFALREIHSsikvVQDRCMLMGAAHCNTCeNSMSSSWDQFGDSLAESIAKAEAIRGKRKCLKAWNALLSFIVDRIKGGY-
>tr|A0A0N4XUJ2|A0A0N4XUJ2_NIPBR Globin-like protein 9 (inferred by orthology to a C. elegans protein) OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1
-ASLSFSQKQALTTSWRLLRpqAAGFFRKILLELEIVSNTVKQIFYKAQFVDAfNKDEENIATMDAHIKLMVKFFDDILASLDDeteCVERMKRIGSCHAVlvrSCGFSSDIWERLGEISMERICAHEIVQKTREASRAWRVLLACIIDELRCGF-
>tr|A0A2A2LCK8|A0A2A2LCK8_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_21707 PE=3 SV=1
-STLSFSQKQALSLSWRALRpqAAALFRKVFLELEIASVKVKQIFYKASLVDAfNRDEENSATMEVHIKLLIKFFDDLIPLLDDekeAVDLIRRIGSTHAIlakSCSFTSDIWERLGEITMERVCTHETLQKTREASRAWRTLLACVIDELRSGF-
>tr|A0A261C2G6|A0A261C2G6_9PELO Uncharacterized protein (Fragment) OS=Caenorhabditis latens OX=1503980 GN=FL83_09405 PE=3 SV=1
-ASLTFSQKQALNLSWRLLKpqASACFRKIFLELEIASPKVKQIFYKAALVDAfNKDEDNSATMEVHIKLTTKFFDELLSTLDDeneFVAKIRGIGSAHAIlakGSNFSSDIWERLGEIAMERVCSHEVVTKTREASRAWRTLIAILIDELRGGF-
>tr|A0A1Y0I5V1|A0A1Y0I5V1_9GAMM Uncharacterized protein OS=Oleiphilus messinensis GN=OLMES_1782 PE=4 SV=1
-----TQDQRLFWNSFDRCLsspqrDQQFAEDFYQRLYSSDRAIAEIFDR-VSV------------SDQLHAVRQAVYLLQEMTplKQAEITLDKIQAIHH-QheIRLSNAMLDKWLECLLASVELAD-PEFNETVKQAWIDILTPAVHIL-----
>tr|A0A1I7TWD1|A0A1I7TWD1_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=4 SV=1
--RLSKIQKRAIRFTWHRLQtrnggkrVENVFEEVFDKLVKNLPNIRDMFST--RMF-LCAMsrGTTSTLRDHSKSCVKMIEAVIKNFDTeKskrtdtgtENDPRVIGRAHSIlkPYGLAGNYWEKFGEVMIDVVLAQEAVRDLPGAGQAWVIFTACLVDQMRAGFD
>SRR5439155_18881238 
----------------------PVLQGFQQAVSGFFTEVGRQFPK-NR------------FRQTPRKTQTSFLLVMGNIApgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSTQVEAAWRYTMGAGILFLKA---
>SRR5256885_16048310 
-----------------------FFFNDTATTEIYT-LSLHDALP-IY------------FRKQRRMLQTSFYMLVEYIAlgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSRSEERRVGKECRSR----WS---
>tr|A0A1I7ZQR2|A0A1I7ZQR2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1 
-IPLTAAQIHLVRTLWRQIFlskgPTVIGSTIFHKFFFKCPKVKEQFRR---CPLPRNFPNHDSFaKAHCKAMSELVDQVIENLENldtMTADLERVGRLHAEVmnGELSTKIWNDIAETFIDCTLEWgDRRCRTETVRKAWALIIAFMIEKIKLG--
>SRR2546427_190033 
--NMTYAELAHFDDSLTRCTrEPRFLERFCALFFASSDEVLQKFSQ-TDV------------QKQRRVLQASLYIQLSASPIvtnGSLIFCNPSVTWSIiQVQRSPAMRTLRthSSCPLVGYPLKA-GQCGVGHVPX-----------------
>SRR5213596_3505323 
-----------------------FLCVIFGLLRRGPSQVHTD----RLA------------EATEDVTGVVPQILMLEADGkpeGAVHLAPLAALHSQqHLDIPPHLYDLWLDCLIQAVRESD-PQCTPETESVWRRMMANGLAFMKVRYH
>SRR3569833_2178475 
--------------------HPNNHNTNKKTNKTTTHKKTQKNKN-TK------------NTQQKKKLQMSLNLLISHAMGigiVDGYLHQHAEKHSRhHLNVEPHHYTARLNSHMKAVKQHD-PKYSPALEQAWRTGLGHGIELIKS---
>ERR1719347_979638
-PIVTDEEMASINELWSCLRadAMHSSRFIFARFFEAHPEFLEPMPFVKDYYGniSPKYMDTQEMQDYCLKFMSTLDAVMTRVFArdkeALQVMRDIGYSHH-EFGLTSDMTVKFMNKMHDSVLELWGTEASRRDSKALDNIFKTIATEINVG--
>SRR5437762_8994925 
------PAAS--------------SDHHIPSQLAAGTRAKDRKGG-VEY------------PGHVCRGQRRCARDRPHILAspelCIPRACRTKSA------------AFCAVCENRCCETC-RSPPAKKPETARRSAERTG---------
>SRR5690625_2752079 
------SDYSDVQASYGRCVrNRDFIPGFYQRLLSKDKRIAAIFKR-TNW------------SVQNRALRRGISIALTWAGGskiVDRQLEEMADAHS-RKGrvpVDPVLYVFLREALKIGRASCR-ERVGVTVGDGcvpqdESGAATGG---------
>tr|A0A085LV25|A0A085LV25_9BILA Uncharacterized protein (Fragment) OS=Trichuris suis GN=M513_10305 PE=3 SV=1
--EFTAKEFAIAELTWAKLKvrfNNQVGMEIFRQIFASCPKVKNLFGV-QNRE-DQKALCDQRMARHTAIFQDIIELLIVDLSQrsdsLTQSLITLGAQHWFftQRGFRPEFWVIFGNTLVNLIRSLPLSlSQRYLARRTWIKLIVYLLDCVMFGY-
>tr|A0A0N5DS84|A0A0N5DS84_TRIMR Uncharacterized protein OS=Trichuris muris PE=3 SV=1
--EFTPKEFAIAELTWAKLKlrfNNQVGLEIFRQIFASCSQVKGLFGL-QNKE-DHTALGDQRMARHTAIFQDIIELLIVDLSKrsdsLTQSLITLGAQHWFfnQRGFRPEYWVIFGNVLVNLIRSLPLSlSQRYLARRTWVKLIVYLLDCVLFGY-
>tr|A0A183BUR6|A0A183BUR6_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 
-TGLSAHQIQILQKIWERSPeseISDCARNIMSHLLRSNAQMYQFFDLLGH--SDREIANSPIFARQSANFAVLLDFVLANLLEevqkVCLALQHLGAQHARlRWPIETHHWALFCRCFEDNPPKEV--FLNAEGHDLWKTMINFIIVQMRVGYD
>tr|B1KNW6|B1KNW6_SHEWM Uncharacterized protein OS=Shewanella woodyi (strain ATCC 51908 / MS32) OX=392500 GN=Swoo_3305 PE=4 SV=1
-----------FNDSYDFVLrnEELFFSTFYEIFVSSSPQVKAAFKH-TNM------------AKQNEMVRESFGFIICFFVtKiADEQLVKLAIDHKDKFHVDSELYAVFVNSVLAALEKIYP-KYNNECAVAWRITMAPGIEFMKH---
>tr|A0A176H0Y0|A0A176H0Y0_9GAMM Uncharacterized protein OS=Oleiphilus sp. HI0069 OX=1822245 GN=A3741_11335 PE=4 SV=1
-----------FDDSYDFILsnDSNFFDSFYTHFFNSSNLIKNAFAY-IDM------------DKQKQMLRESIKHLVKFYCtNkESEYLKTIARHHADKVRADEYMYKLFVDSFIQAIEDTYP-NFCEEAALVWRCALKPGIDFMNS---
>tr|A0A090LM85|A0A090LM85_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X000017100 PE=4 SV=
--NLTTSQIMSIKKSWKHINtkgLFNVLRRCYQRCQSCCPNVAKVFST-ENIKK-QQNIYSCGVSEHTKYFISLLDRIIDNEPNIEHELRNVGKEHAKlyeEYKLSITDIERLGEIIADVFLKLDGIRQNKETSKSWRILIASIIDEVSVGYE
>tr|A0A183CLY2|A0A183CLY2_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 
--LLTRTQRVLIENSWKRVKkaavEGGMGAKVFHNVLVAQPDMKLLFGL-EKVP-QGRLKYEGQFRRHAGLLNRTLEYVIKNVQytdKLGQHFRALGKKHCQmngGRAFPTNYWDTFLECILQSVLETDGSisgRYhrCREAALAWRNLVGL----------
>tr|A0A0M4CP70|A0A0M4CP70_SPHS1 Uncharacterized protein OS=Sphingopyxis sp. (strain 113P3) OX=292913 GN=LH20_00550 PE=4 SV=1
----ERSDAALMEATLAAVAetGIDIRHTLFERFFSAYPERHPAFLNL-DA-------------ASRRMTDETLQILFGLATDegwVWPLVAELVATHR-NYGmLPTDEYDAFIDLAIDELGRAAGRAWTGAHAAAWRRQGEIL---------
>tr|A0A1Y5Q3I5|A0A1Y5Q3I5_9SPHN Uncharacterized protein OS=uncultured Sphingopyxis sp. OX=310581 GN=SPPYR_3232 PE=4 SV=1
----PARDIAAMEASLAAVAdaGVEIRHALFDRFFDAFPDRRASFMIV-DA-------------SSRRMTDETLAMMLGLAKGegwVWPLVAELVFTHR-AYGpLPIAEYDAFIDMTVEELGTAAGAAWSAPAAAAWQRQAEAL---------
>tr|A0A2N3CVZ2|A0A2N3CVZ2_9PROT Uncharacterized protein OS=Alphaproteobacteria bacterium HGW-Alphaproteobacteria-17 OX=2013663 GN=CVT78_05625 PE=4 SV=1
----SARDAGQMEASLIAVAdaGIDIRHKLFERFFAAYPERRASFISV-DA-------------ASRRMTDETLQMMFGLAKGedwVWPLVAELVFTHR-SYGaLPIAEYDAFIDMTVEELGLAAGAAWSDETAAALQRHAEAL---------
>tr|A0A0D6LRF9|A0A0D6LRF9_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_06233 PE=4 SV=1
--PFFRIDNRLVPDSAVAtDMV-QAQIHSYVYSSLQSTVSREMFQKM---SIVEGFRTNQccDLNMHAKVLCDLFDSIVSDLQQaskiVQARCMDVGGSHV---HMNekccGSLWDQLGECLAEVITKVECVRSKRECTKAWIMLISYVVDGMKCGY-
>tr|A0A1I7RN92|A0A1I7RN92_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1
--GLTDDQCEQLATAFSNIPdKYYAFEQMFLNLfMKEDPQLAVVFGF-EGIR-PEELRRMSPFRTHVCKFQRFMTTVLDMLPKknreeeLIQIIRMVGRQHCNvkLLSFTAQKWLSFKNGMLNALAKG---GESHKYYSSWNILISFMISEMKDAY-
>tr|A0A183BTK8|A0A183BTK8_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=4 SV=1
--QLDDTECEQLSTVFAAMPdKYHLFEACLRPMpMPeVDPQIALTFGM-ANIA-EIELRRKTPFRYSV--------------QKrgreeeLVQIIRMVGRQHCQvkQLSFTAARWLSFKSALTWTFSRG---EQKDKLHVQWSLLISFLICEIKDAY-
>SRR5688572_1577071 
---LARHDWHVLLDRWQRLQpnADRFATAFFDTLFGQQPAFLQIFAS-APL------------DAQFLRFAHLLSEIVSAADDadeLPRCVELVVQRFA-NDDCETDRSRAVRAAINAMLTEVSAAHMTPHMRASWHAAYVAVTAIL-----
>SRR5690348_16468503 
--------------------ADAAMTYFYAELSSAARATWAdrdIYMS----------------GPDHMIVRT--ARALVErg------------------APSRLIHYDLVDPRVTEGQX-------------------------------
>SRR5258708_24656334 
--------------------ADAAMTYFYAQLFAMDTEIRAMFPA--AM------------DVQRRRFFEGSAGSPLPsraRpttIASCLTCRNSGPHHM-IAETAP----------------------------------------------
>SRR6185437_6364830 
--------------------ADAAMTYFYAQLFAMNTEIR-aVFPP--RP------------GPVKRMSRT--SSGACRrtrRs------------AAR-RPRPRPCHTSAGPAR-------------------------------------
>tr|A0A016TZT5|A0A016TZT5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0066.g3721 PE=3 SV=1
----ANKSKKLVIAEWPRLLehEPNLFKIVWSSSAARSTSIKQAFGI-TD---NESPLENESFMKLSPTIQAFFYKLVIsmQLDEdmVRSACEQLGARHVDfiARGFNSNFWDIFLVCMAEAIDATLSSYITDeakraEMILAWQRVFNMIVHHMRTGYN
>tr|A0A0R3Q1W4|A0A0R3Q1W4_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=3 SV=1
----ANRDKKLVIQEWPRLLeqQPHLFQIVWNASSTRSNSIKKAFGI-GD---DESPQENAVFMRLSETIAAFFEKIVItmQLDDdiVRSTCEQLGARHVDfiARGFNSNFWDIFLVCMAETIDETLSSYMTDegkraEMILAWQRVFNMVVHHMRTGYN
>ERR550534_360735
---------ADAKASWANVDTAAFGKAFFKNWMASDPEVKNVFKK-SSFP-----------QGPAQFLVERFDILLGVLDDevaLSQQLMSVAKTHM-DKGVDPEHLVTFQDSFVKTLAGF-DSDWSRERSESWAYVLSHVIT-------
>ERR550539_1411929
---------SLVETSWANVEKEAFGKAFFKNWMAIEPHVDEIFKK-SSFP-----------QGPAQFLVERFDILLDVLEDevaLSNELTVVAKTHM-ERGVEPDDIVTFQDAFLKTLPGF-DSDWTRDRSEAWAYVLSHVIT-------
>ERR1719192_2654783
---------GAQS---APTPPKPVGQTwtkRLSEKLSSEPEVADVFKK-SSFP-----------QGPAQFLVERFDILLDVMDDeasLSKELQVVAKTHM-DKDVSPDDLVTFQDAFLKTLPGF-DSEWTRDRSEAWAYVLSHVIT-------
>ERR1719242_19104
----------------------------------------------------------------------------------------------------------TPLIGMA--AQS-PLSWEQEK-----YVKLgQRWT-------
>tr|A0A0C2FEY2|A0A0C2FEY2_9BILA Uncharacterized protein (Fragment) OS=Ancylostoma duodenale GN=ANCDUO_24724 PE=4 SV=1
--SLMPSQVSVIRKSWRHINTKGLITVLSrvfQRFNA----ID-------GQE--YAKVYDMTIYGIIEF--------------------------------------------------------------------------------
>tr|A0A0C2G6K1|A0A0C2G6K1_9BILA Globin OS=Ancylostoma duodenale GN=ANCDUO_17195 PE=4 SV=1
--CLSYKHRKLLRATFQQMNsSGaflKLMEQVFRRLEAKYPDIRSIFLTTAFVNSLSRERSSPPLvrteHDHCKCLVALFEKIMDNLSDdtQLMVIRQYGEKHAQmkESGMSGGMIESFGEIAVAVIASQYSYWIQKPVDDVTrrkgrDEGLVYLNDYEYIIL-
>tr|E1NZ07|E1NZ07_CAEEL GLoBin related OS=Caenorhabditis elegans OX=6239 GN=glb-29 PE=4 SV=1
--NLSVKQKKLLRQSFNAMNsGGtflKLMEKIFRRLETKCPDMRSIFLTTAFVNSLSRERQTPPLvkteYDHCKCMVGIFERLIENLENIneqLTMIRHYGEKHAQmaESGFTGAMIEQFGEISVFVIGSQDVVKFNHETVKAWRLLLACVTDEMKVGFD
>ERR1719431_1401903
-----------------QLTtnSIRSGFCGRLCETTRyNPDCtsSNTFSMRfRKR--RKNFHSPMINTEISRRILWRRKRLMTRLFKrdpeATKRIYDVGFHHQ-MMSITEHDMTMLSSSIYSAVQDILGKKASDKDLAAWRHLLGLVSYHFKRG--
>tr|A0A1Q9NTV3|A0A1Q9NTV3_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_3 OX=1841598 GN=hmp PE=4 SV=1 
----TSKEADILTQSLKALEekTDDLPKLFYYHFLEPtsNKEIISLFNK-SDM------------TKQYMMFHQSLAIIVSSIKDshlLNQILKDLVKRHK-NYGVKYAHVQIFSSAFYKTIEEIFPK--DEKVKILWIKLINFVLSKFNE---
>ERR1719238_586270
----PKEVIAEVRRCWEAFIkasgsKEAASEHLYAALYDAVPSVQHLFVT--PR------------VVQAMRFMTQLQTFITLLDQPkqsKVTMEAIGFAHM-QRDITVELCVLVRDAILDLLQVELGDNLSSSAAAGFKGLLNWM---------
>tr|A0A2A2L6E6|A0A2A2L6E6_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_22934 PE=3 SV=1
--KLTKLQKKALKFTWSRLQtrnggkrVESVFEDVFDRVVRYLPQTREMFNT--RAF-LCAIsrNETSSLRDHARMTVRMIDVAVRNLEVetrkrsdtgSDMDPLLIGIVN-----WRGSRYS---CRIINRI--------------------------------
>tr|A0A2G5VGS5|A0A2G5VGS5_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-26 PE=4 SV=1
------SERSIKLRKYDYEKddgSK--------KLL---SFYKKVREK-------------FTFKRSGSEMVAVVVSVMQSLDEpdkISKMCQEIGQLHA-KYrrskGMKIDYWDKLGEAITETIREYQGWKIHRESLRAATVLVSYVVDQLRFGY-
>tr|A0A1I8EM37|A0A1I8EM37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1
-PSLTSAQIHLIRNIWRQVYitkgPTVIGSTLLHGIYFKSKKIKDQFFR-CPFP--HRFPNrDSFNKAHAKAVGEMLDKIVDNLENlesMSGYLFSIGATHANliRRQVSKEIWNLMAEAFIDCTLDWGdKKGRTEASRKAWAFIISFAIEKIKRG--
>SRR5690606_37396704 
---FSDTDTYILHTGLKWIEeaPETFAAKLYQRLLRDHPECQASLHAI-GL------------ESFNRNFIHFLKMVKEELLErhtIHVAPREFLALHALpvEKVRHSNYVIKMGRTFLDIFAELAEDAWSPALESTWNKAIEEVK--------
>GraSoiStandDraft_42_1057292.scaffolds.fasta_scaffold716659_1 # 2 # 607 # -1 # ID=716659_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.685
------TEIQILENGLRWIKesQDRFGDKFYHRLLREHPEVNPLLQSI-DP------------WSFNKDFVQSVDAIIGEIRAqgdVISPLKDFWPELSStaMTPLKPSELIKVAETFLDLISELAEDAWSPALEYVWRKAIKTVM--------
>SRR5215207_8455447 
--------------DFDTVVCSSFAERFYSRLFTHEGGehLRALFPDN--I------------QPQHAQFTTMLGDILAYNFRigRSLLGD-TFRKHI-DFNIRESDVDVFRKAFVEEVGSTFLH--LG----------------------
>ERR1711972_144950
---------SQVLQSWEQVKllgLESVGEMLRANTFELDPQVVALFRIPGVVSTGEGMLQRMALRRLFSKVLRFVGSVVAGRYDyqrLVETLSR-----------------------LGATRAAGGATEVHFKI-------------------
>tr|A0A238BIH0|A0A238BIH0_9BILA Globin OS=Onchocerca flexuosa OX=387005 GN=X798_07861 PE=3 SV=1
--------LFTLKNYWKTVRrnERDCAKMMLAKYLKQNPDNKEKYPKLKNIDVntVDVATANSGFETVAANYLKVFDDVITTVEEkpgdvsdACSRLTAVGKMHRTkVNGMDGSEFQLLEEPFLYMISEILQDRYNDKAENLFRKFYQFCLKYILEGFN
>SRR5215467_3799544 
----------QVSESYWRCCtNPLFIEELYQTLFSKCGEIKQLFEQ-KNVS----------MKRQYAMLRYALDIFVDYPHDMTATFPDIARKHT---GLDPRFYETFIEALIETVGKCDPK-WVPSLEHAWRERMT-----------
>OlaalgELextract3_1021956.scaffolds.fasta_scaffold865191_2 # 285 # 404 # 1 # ID=865191_2;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.492
-----RHEWHVLLERWQKLQpnADRFATVFFDTLFAADPELRQFFGG-ASL------------EAQFLRFAHLMTEIVSAAGDpeeldhrVEVVVQRFARDDS-A----TDQSRAMKLAIAAMLEEVAASDMTRQMRADWKAAYAAVGAM------
>ERR1712159_177610
---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNITFFNRA-HFTS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLRRISNKHV-GFGIKPQHYPIFFENLFVAFKEVLGTKATPELISSWKELVSLVQ--------
>ERR1712159_799488
---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNVPFFNRA-HFAS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLEESPTNML-DSESNHNTTRSSS-----------KTCSLPSKKS------------------
>SoimicmetaTmtLAA_FD_contig_31_10253239_length_247_multi_1_in_0_out_0_1 # 3 # 245 # -1 # ID=589621_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.671
--GLSEYERGLVVNSWKALTkpdfspldGTSSLSNFYDAVWTKWLKIDEF---------ANKMFRSRGFKGRVQHLLRIMGVIIKCAEDPlrgLEQLRSIGVQHC-IWGINSQSFASLALSIIHGLDQANGKEINAELKELWLAL-------------
>tr|A0A1V9ZGT6|A0A1V9ZGT6_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_12918 PE=3 SV=1 
-PVLTPTNVDICRRTWDLIQtagtdkMRqygkpgiiLFYDEFFYRIFERDTTIREVFPKV---------------QQRAEVLIKAINFILSTRAGtpasvmeTVNACRFLGHKHRAFAKVRPHHFAVYTNTCIEVIMYWLGEFGSHEVGTAWSHTVGFILRHILEAF-
>tr|A0A1I7XNU2|A0A1I7XNU2_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1
---------NTT------DSglqlEGIVVQNCFIYILSKYKHLRPIWQFGKKIEDneenwTLALYEDFYFRHHCASIQAGLTMIMENKDDpesIKKLLNEIGAHHF-FYDACEPHLELLDQ----------------------------VKGHVSDG--
>tr|A0A2A6BP14|A0A2A6BP14_PRIPA Glb-18 (Fragment) OS=Pristionchus pacificus GN=PRIPAC_48995 PE=4 SV=1
---STPEDKKLMEKTWSEEFdvLLTLGSDIYNYIFKNMSACKRLFPWIIKYEdEGVDWKKTTEFKDQALKFVQVIDTVVWGIIDgdkSEPFLYDVGQRHVQyaSRGFKASYWDVFLDAMQYAQDQRIPKmnnlnaQEKQRAKQIWHDVAAYIIKHMKSGF-
>UPI0002C4E217 status=active
--------------------------DFGTAFFEYCPDLKGQFPS--NYA------------L----VTKMIQKFINNViegKNLERLARHYGRTHW-RYDLEERHFLGFAEALADTINIRIGNFGTIELMKIWREEATMICKMLEDQY-
>SRR5215831_15107384 
----------------------LFFSKFYTNLFGRADDIEDRFKEL-DM------------ERQYRILNLAIHKLLEFRPEqpaTQKQLRDLSLRHA-KLGLTNHAPAWNR-IH-LDLRGIGA--DGRSsGVAAADKALAX----------
>tr|A0A085LU76|A0A085LU76_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_10599 PE=3 SV=1
--NLTTHQKQLLVQSWPKVQtynRIHGGDAIFARFCEKNSIGRIFQETFQKiavvQSFAINEASESVLKKHEQYLLQLLTQAVENLNNdrepLLRECLAYGAQHI-TLQelLNETVWDQLTEAIIERIHMVSFVRRHRNLSKAWTMLITLLVEKIREGY-
>tr|A0A2E0SMS8|A0A2E0SMS8_9PLAN Uncharacterized protein OS=Planctomyces sp. OX=37635 GN=CMJ46_12130 PE=4 SV=1 
MSQISERQYHLIHDSYRRCMlADDFLVMFHRNFMEKSPQIPKFFAD-HTL------------QQQHRILAKSVARLVSFVDGkpqaeqdMRDTMRI---LHDGNLRLTPEHYAFWATALMETICTI-DEACNDEVAVAWEQTISYGTGVLK----
>tr|A0A0B2VQV3|A0A0B2VQV3_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_12261 PE=4 SV=1
--NFNKRERVCLRETFQKLAdPkELIGAIFVDIVNDIAPELKKVFGV--DRAPKAAMLKMPKLGGHVARFTDLIDQLTNMVGyteNVlgaWQLVRKTGRAHT-KQYFletnqsarGTNYFALVANTFILEFTPYLTGekeepnvdekkkvrfasTYTStMISDVWARFFKVITAQLTDAF-
>tr|A0A1I7YWT2|A0A1I7YWT2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=4 SV=1
--SFTKKERICLRETYQRLQdPkEIIGRIFLDIVNDVAPEVKKVFGV--ERVPRPNMLKMPKLGGHVARVNDIFDQTTSMLGyteNVlgaWQLIRKTGRAHT-KQQFllenlnqlEKNYFQVVIDYFQEQFLPYLTGekegqerkkvrfaqNYTTiLIEDVWKRFFSILIAQMTDSF-
>SRR5512138_1182700 
--------HRRVQGSYSTFQatdrADRLYRTFYANLFASVPEARRMFAH-TDWS------------RQYNAINEALKLLLDFDADpqraadAAKQIGSVALKHQ-QYGLGERELRAFEGALLHALRSC-G-ECKPATLEDWRMILAPGFHHMRG---
>SanBayMetagenome_1026888.scaffolds.fasta_scaffold228792_1 # 28 # 387 # -1 # ID=228792_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.353
----EPNQRALAKASYRTWIepDTRFFEDFYRRFFATTAAKrahsVHKFK---DR------------KEQHDKLRNGMAAVLNFYpGNEPTSLRYVIDVHR-RKKVTEPELKQFSATFLELVSERLNRKLtgtgsaarRKEIMDAWTALFDQVLKHFRE---
>tr|A0A0V1CBX7|A0A0V1CBX7_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16916 PE=3 SV=1
--ELNDNDRQAIRQTWQKIGdHTLWAQRLFAKILVACPAFSKATSF-HSL-AGKHLLNDAKFRSFCQRFADFWQNLVQLLCvsdDpadwqqAVDSIRGLGQRHSLNRKVTfeAPIWLMIKNEIVLSITGY-SDICRSKDCLSWNKLLMFTVAEMKSAF-
>SRR5262249_4116633 
---------------------TKFFRSFYEILRE-SPEIHDMFTSP--FS----------VAKQAQKLNNAMEKILNFRTYMnTSSIGREVQRHR-KLNIKPEHYGPFRDDFVKALKKAkIDDGYS---EDAWCAVLDPALDYMRT---
>ERR1719347_1935341
-TGLSQNEVTLIWSHWESLKphKRRLAKRILKVYIKEHPRARELFPNWVDIP-TVELVKLTSFSRKAVDTWEAFSRAWECIDDaplCRKVCYAFGKKHI-ECnarikghgQIDEHHVKNFIRIFLRIILVSAR----EGSEEAWRKATEFFSINFVRG--
>ERR1712142_116161
-THLSQNEITIIWSHWESLKphKLKLAKKILKVYLKEHPKARELFPpHWKGIS-MADLVKLHSFRRKANDTWEAFTRVWECIDDpklCQRVCFTFGKKHV-EWnarlrqtrgQIDEHHLKNFMHCFSKTVLDNSR----AGSSEAWRKATDYFSLHFLRG--
>ERR1719313_2808357
--SLSDATHELLQKTWQAAKPegpg--LGEAWYEELRsdtSYVDDLGVILNF--PV-------------CRPENVSRVVQALLDLLPRecqetpepglmlpvprFTKLLLAAATLAQ-----------------------------------------------------
>tr|A0A0D6M2N5|A0A0D6M2N5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=ANCCEY_04360 PE=4 SV=1
--NITPFEIRYLKYSWEKASsTMDIGCELVARLLNDN---RTRFRALIEshsgdLLgsanfAAEDVKKFRRARSVAHGVVMFFNQVISELDEpnsadfIAVISQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALQVKkttsfacgktisMsDKKAREVWYKVIQFVIQNMKRGF-
>tr|A0A1I7W801|A0A1I7W801_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1
--NISSQEIQYLKYSWERASsASDIGCELVARLLNDN---RTRFRALIEshsghLLgssnfTADDVKKFKRARAVASGVVMFFNQVISKLDEpdaadkISLLSQSLGASHF-RMKvwFQAENWLCVKNCLLDAIMTALRKNggssllcgkrhmHnIKRATDVWYKVIQFVIQNMKRGF-
>tr|A0A0K0DKR1|A0A0K0DKR1_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1
--LLSTLVANNLQIYFSRANnATDVGCELVAGLLNDN---RTRFRALIEshsndWLgsatfTAEDVKKFKRAHSVANGVVMFFNQVISKLDEedaverIALQSQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALMTKpfmvcgksitMnQKKSREIWYKVIQFVIQNMKKGF-
>tr|A0A1I8BDP5|A0A1I8BDP5_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=4 SV=1
----MRYTNYLSKIVLARTLnQVDIGNEIVIHLLNDK---RSLFKNLLEqsspyEKeikniyDKKSLSkYSPRSLEISNGVTKFFKNLSLLLnqkgmEIeekedkLVEICKNNGKMHY-QMKvwFQAENWICLENSVIETIIKGNNLEkenFeSNQTIIVWSKLMQAIIGWMKQGF-
>tr|A0A158P8J3|A0A158P8J3_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1
--NLRKEQVRALRMTWTRLCepprsnckgIVNLVERVWEKLDRKDSSVRNIFYNAAFvetMHDRCERRrskgSIATLRDHTHFFVSLVSQVIQSLDLnpenILNHVDTIgKSNHAylKQYGFRSQHWEKIGEYFVDVVVIQDCVRGFPEACRAWTILVAALVDRLRAAP-
>SRR5262245_41417288 
------------RASYPRCMaSGNLHARIYEAFFAACPEAKPLFDN-TDL------KRQYQLLHQAIVLMLAFH---VSPNrEEPTILSRVAARHS-ELGVhiPPAWFDAFSAAIQQSLEAA-DTQFSDKTREAWAAVLADGIGYMQ----
>ERR1711884_327085
--------------------------------------------------------------------------------------------------SNESFSvIFKHLAFIKYL-HItktglFDELFGQHVCRIRRiLPFKLIIRL-SSNF-
>ERR1719471_2433215
-----------------------------------------------------------------KGIMKVVSKVLCHLNDlsrVEDYLRVVGRLHD-SAGVEIAYLSVTGDAFCTSLKRLgtHADIWNDEVKQTWNAFFRVVVDLMSAGY-
>SRR6266436_7042579 
-----------------------------------------------------------------------------------------------RVFITAqysCRYHSFSATFYVMAGdkerwkVYM-SHQQMSLhARSKDGLYSRRttQGY------
>SRR5437870_11165056 
----------------------------------------------------------------------------------------------------AqysCLNHILSATFYVMAGdkerlkVYM-SHQQMSLhARSKDGLYSRRttQGY------
>SRR4051812_43285676 
-------EVEVARDSYKRILddevkEEKFFRSFYQRFFRKCPDAAKEFAA-KEFPRRVAlsGRggnaREGKWPRQYRLIKQAVVLLltFKLLDDteGLTILTDIADKHE-RYP--QEFYDSFRDALIDTVISLDKDsgsgLQRYELRDAWEKSIQPGIDYIMN---
>SRR5262249_5830581 
-------DVEVARDSYRRILddverQREFFHTFYGLFLRRCPEAAAVFEA-KGYPALAQlgGPrvedSAGRGPQPPNPLKSAIVMLiaFNILGEkeEPTILDNLVDKHK-GFP--KRYYVAFQDALLETVVQFDDPsrcgMPPDELQHAWKQAIQPGGDYLID---
>tr|A0A2T7PRA6|A0A2T7PRA6_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_02930 PE=4 SV=1 
---FEPHDKTIVAESWKLLRsiFPDLIESAFVEMCRRVPRLKLQFGNV-DVDDD--EERHMNFLKHVWDVSFFFDQLLLYLPfksKLEECSFHIGLVHA-SVEVPAWYVDLFLVEFIRAAQETVQLEWTPAMENAWAVFLRYLCYYMKDA--
>tr|A0A2A6C3W4|A0A2A6C3W4_PRIPA Glb-17 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_39254 PE=3 SV=1
-MELTDEEVAAVRNVWIRAKTEDIGKKILQTLIEKRPKFAEYFGILCQSDklDMNSLKESKEFHLQAHRIQNFLDTAVGSLGYcpvtsIYDMAHRIGQIHF-YRGVNfgADNWLVFKRVTVDQVTKGvtstqasqanlLegtkepevveqhpmadvQNPFSGEnclARLGWNKLMTVIVREMKRGF-
>tr|S9VAV3|S9VAV3_9TRYP Uncharacterized protein OS=Angomonas deanei GN=AGDE_12480 PE=4 SV=1
-------------AAWSHLLtspnGGEFCSTLYEKLCQNLTYIPDYIRNLKD---------EE---RVIDHYINVITKTLELYENphvMIDELPKIAARHR-GFGVSSDAFFVMRNIFMELLPEYMDPKVYEQSKKDWLKFWRLVLDLMVSGS-
>ERR1719354_143580
------------------------------------------------------------------AFWDILDHICGHLDRlenLIPQLRDFALQCF-NSGLFSDDYNILGECLVTILSTNFD-PWEETHSDSWAWCLDLVMSTLVT---
>tr|A0A1I3QX19|A0A1I3QX19_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter neptunius OX=588602 GN=SAMN04487991_1987 PE=4 SV=1 
----DEQMIALVKASLKELQphAGAVFATFQSKLAQRAPELAYRYDEV-DP------------ERQGELLFEKLAIAlggVRFLDRLVPALGGVGLDAG-SASLTSCDFARLSEVLIAAFAEVSGNRFDPCIGAAWTTLFEELSWHMFE---
>SRR3954469_11252496 
------------------------------------------------------------------------------------DGGAIRRHHV-RSGIGGPDYGRFGDAIPAVMVDVGGNDLPKPIGGSWGDAFWAVIGRTKQR--
>tr|E0VF27|E0VF27_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236389 PE=3 SV=1 
-----------VLNDWPKIRknYKKIFIDSFINYFAENPNYKLLFPSFSNVS-EDDLPFNHCFRLHCFAVYKAINFLMSNWlGEyeedDSKILPVIGKTHF-DRGITLEMMNLYKHSIVYSCNNHLKPNL--KRKLSWQTVFDHIFDY------
>ERR1719461_240742
-----------AVASWNNIDdKTAFGKAFFSNWLESNPRIKDVFAQ-SSFK-----------QGPAQFLVERFDILLGVIEDeeqLAEELYQVAKTHK-KVGVDQSDLYSFQASFMKLFLPS-TLItaqrsqtlgltpFLtssSLLWSRWQLSLPV----------
>ERR1712165_596852
----------------------------RLFLPSTLTSLQRLETH-----------------GLTPF---------------------------------------------------------------SHVITAP----------
>SRR5580704_4499342 
------------------------LGDFYRRLLQHHPQLAAYFEGV-NI------------DFQVQKLVVVLSTIARDLPDrsvLDRVLFHQGVAHV-ERGIGRGEFNEFIALLANVVSCKTTLVGAAESYAVWYQELSAVATSML----
>tr|A0A0G4HY87|A0A0G4HY87_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_33490 PE=3 SV=1 
------NRIHLLQSSLAACLkmstkEEFVGRLMYDTLMRTLPEPGIIAKR--GR------------TMMSRAFNDtvaALVAFVSEPSHMETYMDWLALRHV-HYKIDTTLFPQFRQAMLVSLEQVMADQWNAEIERAWSEAYEMTSQALQ----
>SRR4051794_14672716 
--------------------SPAFAESFYTHLCR-SDAVRDLFVTAHRKRVPAALnrQESpaIPDETQRRKLVDGLKAVLNFRPGcSPSSIDSVAARHV-DLHLTTDHFDVFEKSFLETLEQHVTRSEdreeMEEITHAWEKLFATVRDEMLD---
>ERR1740139_220892
-------TRAALLKSWEMVQeaGTvPAANLLMKHLRERDAEALRVNTSH-ARP-KTGETEEDAVRKLAVRTVQILGSAATGMSDtvsLVQHLHKVGAGFA-GTGIKEGYFAMVRDASPFALRELLGDRFTADIASACRITGPFLASLIIAGLR
>ERR1712194_173361
-------TRAVLLKSWEVLAevGTaTAANVLTKHMRELDAEALRSYTSQ-AQP-KDGETEDDVVQKLAVRTVQMFGTAvtA---NDtasLIQHLHKVGAGFA-GTGIEEGYFSLVDKASPLALRELMGDRYTADIASACSMTGDFLTSFVREGFR
>ERR1719446_598571
--------------------KKAYGLNAFNRFFCKAATIGNSFQHI-QC-------------ASVCSgnarSPAVSGYLQGAYTlgeCGHLTWPQTHHVQH-FYRLLX----------------------------------------------
>ERR1719240_1501566
------------------------------------------------------------------------------------------------VQHFYRILRLLLEACCEELADWVKD---PAAVEGVEWALTQIAAIMI----
>ERR1719235_1367256
---LPGVTVEFLRSSLARISEDEFGDMFVQKLRETGDmlsegTIEGVLNT--PI-------------VRPTNLRKMIVYAL-----------------------------------------------------------------------
>SRR3989338_2963815 
---------TPLYHLYKENVppqkERELGLLFYKLLFDSNPELLDFFANV-DLD------------HLSDHLVQTIRLFLESRnslVSLVPAMKALGIIHQ-RAMIPSWAFPLVIENMAKLFSILLGDRFTVELASALVLSFDLLTSF------
>SRR3990167_6716616 
---------NPIYStlknIWlETVStpeiKSAVGELFYKNLFQYHPELLEYFNNV-DMD------------SLALHLSQALDFVFQSInkiGDYksqwRTVLEHLGEVHR-AALIPTWGYPIIGQQILKIFPYNEKAGFSTKQL--etaLATLYREIVII------
>SRR5436309_231744 
-------------------------------------EIGQLFEG-RKVT----------MEDQYRKLDRAMFSILSFNRRlKATTLDPQVASHS-EFGLKREYFQFFREAFLAALRETQAS--DDYSREAWSALLNPALAYMSD---
>ERR1719183_3286062
--------AISLRDSWVHIEvlkeeddSGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFSTLVHAMGDpqkFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDSMVRNFR
>ERR1719183_785787
--------AISLRDSWVHIEvlkeeddTGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFAVLVQSMADpakFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDCMVRNFR
>tr|A0A0N4UGY4|A0A0N4UGY4_DRAME Uncharacterized protein OS=Dracunculus medinensis OX=318479 PE=4 SV=1
--RLSDKQKLWIKLGYKKWRsksKMVPGEWVHAYAIKKYPTMKALFKK--HEN---------LARVYTQTITKIIEMAVESVdslDDsLGPLLISYASENgileERgmasiftirndklllfLEGFDRRFWGYVAEALCALSRDFPLKRHKWDTISAWRIIVLFIVKKLEYGF-
>tr|A0A2A6D1B3|A0A2A6D1B3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_35146 PE=4 SV=1
--TLNHQQRKLIKNGYDSWRkksCISSGRWVHSFVSSKDDRLKEIMEG--NEE---------TTRIHEETITHLLDMAVESLeslDDsLGPLLISYTGPQgvfeEK-DGFDRLYWSRVSEGMCQLARNFPSKANKYETVCAWRIVVLFICNKIELGF-
>tr|A0A2A6B4U3|A0A2A6B4U3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_54703 PE=3 SV=1
--GLTKDKTDLMANLWPSHYgtLYDMGIAAWDKLFAHNPGLKKHFGF-AENDPSSSWKNDERIKKMVLSLQQLLTEAVNTLGfgDtealtsFVNNLRELGGLHRAiADGVNPDAFTLLFAILPEVIVDVTSnrskdgplsSENRSELLAIWRAITRFMANQVMTGW-
>SRR5687767_14811217 
--------------------SREFMSRFYRRLFAARPELRSQFKNV---------------TTQHDMLAEAIRDLVLFRpGDQEARFLDYVETHR-RMNITVHDIEAFRLAFVAEVIATSMQngnAQARSHGDAWNAALKLGLGVMAK---
>SRR4029453_11133516 
-------------------------HLIILKLQRIAMQGAflSVIPAtgFSEH----------FITNSCEFLPK---PQSSSREKalgenEPNILSRIAEMHNKnNYNISPESYKAFVSALTATICGSAPEipePFAPqckisvneknLIKNAWQKALKPGIDYMIMRYS
>SRR5262245_37180117 
--------INKVHESLKRCRlQPGFFRDFYQQLVKNDAIQ-AIFTKrgLDVL----------KSDKQQWLLREGLDLLISYADEpkspGLHVLSRVAESHSI-YRVGIEMYDGFLEALLVTVRRHDLEfqdP---skddskVIEAAWRRALKPGLDYLKSQRP
>SRR5262245_45185474 
---------------------PTFLEAFYKLFTA-DEVVGKRF--vkFDDI----------EWKRQHGLLQQALDACFDFASLlsmqnlrelpEPNAMTKYVVRHGPgrgNLGITSTEYDAFVEALITTVCGNPGNgqaPYDPecadaerkdVIEFAWRRLMKLIVEHFKKVAR
>GraSoiStandDraft_39_1057311.scaffolds.fasta_scaffold195098_2 # 276 # 1100 # -1 # ID=195098_2;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.692
----SFDVFEIAKDSFNRCMgadgGALFFKTFYERLLSKLPVP-yaRQLSQkgVGTS----------SSHRQYDMLRQGIFILLQFGQHklyerEPNILSTVAVLHDQhHHNIPPNLYAAFTGALIDTVAGAPPAiptAFDKqcetdmdIITDAWEKALAPGIRYMTEKYF
>tr|M1PA46|M1PA46_9CORY Flavohemoprotein OS=Corynebacterium halotolerans YIM 70093 = DSM 44683 GN=A605_12675 PE=4 SV=1
--------------------SGEFRDEVHRRFYLDVLEARQVFPL--TLR------------ETHVDLASSLAWVLERtssdgtLPDdVLARIRRLGVDHR-RHGFPAEVYPAFLTALRGGLRTVTAEHggVDDPLVDAAGDVFARVCGAMADA--
>tr|A0A097IIH9|A0A097IIH9_9CORY 2-polyprenylphenol hydroxylase OS=Corynebacterium doosanense CAU 212 = DSM 45436 GN=CDOO_12240 PE=4 SV=1
--------------------SEKFRDLVHEQLFSTELQSRQVFPS--SRA------------RSHLDLAPALAWVLERstidarVPDeVMRTARRLGLSHR-RHGFPSEIYTPFADMLVHALREVNFRAdpqLSAGLIIPAETIIRNVCNAMRAS--
>tr|A0A0G3HGP7|A0A0G3HGP7_9CORY Uncharacterized protein OS=Corynebacterium uterequi GN=CUTER_09860 PE=4 SV=1
--------------------PDEFRSRTLTGFFAAEFQARQLFGL--HAT------------QAHDGLPEVIAWALERcgidghVPSeVLDRLQRLALVNR-RFGFAPSAYSSYAEAITTALKDLAYVHfgeVNIlpSQMFAATLALDTCARYMQRA--
>tr|K0YDT0|K0YDT0_9CORY Uncharacterized protein OS=Turicella otitidis ATCC 51513 GN=HMPREF9719_01398 PE=4 SV=1
--------------------RTAFRDATVDYLLRRLPRLRRVAPL--RQR------------HRAEALAERAVGLVARspqgmLRGeDAADLERAGRANR-RLGVPLRVYPVLAQALKAGLRAAFEAAgepYTA-AARDAEALAEAACASLARG--
>SRR6478735_8357209 
-----------------------REIAFLVARGLPsKEIAEQLFLSVR---------------TVQNHLQR----IFTKLG-VTSRGEVAGVLQG-LEGPSSX---------------------------------------------
>ERR1712130_811490
----------------------------------------------EAAlagmKAVEDLGGKFDRTKHGSLFLSVvLTRVVPHLDQrdrVLPYLVELGALHQ-REELQDITLICWVLHIalPSGVWSRVeecVGGYC--TRQPRLGLVWSLPS-------
>SRR5436309_12080688 
------------------------MHRFHAHLEQLNPRLRYHLPP--ALL------------RYVrFELLQAVRQQT--PMEVGSGLRRFGVHLR-AQGFEGPDLDTLGAAWLVALDEVLGDRFDSEAREQWLRFYKVLRSA------
>tr|A0A0N4Y9E2|A0A0N4Y9E2_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1
----------RIQHSFKTASfhltvnqlrsRPTIGDAILKRAISNRPEMRTFLNRLTE----------QQVEHMGKQFYSLIAVSVENIERpeavryfs-RLPFFAMFETYATlcQLGFRPDYFAPLADAAIAECVKLDGGaHKRCETLLAWSQLISAIFTSVRDGY-
>tr|A0A183LHE9|A0A183LHE9_9TREM Uncharacterized protein OS=Schistosoma margrebowiei PE=3 SV=1
--------------------KIKVGKEIFRQLLIKNPHYMKMYKPLQSVT-LPQALNLDYLTKMAICYVDNIMKIVRNFNEeekLQETVKYLAAIHT-NRGLTVAHFVSILPIFTDTIVSYME---------------------------
>tr|A0A183WH41|A0A183WH41_TRIRE Uncharacterized protein OS=Trichobilharzia regenti PE=4 SV=1
-----------------------------------------MYKPIQSVT-LPQALNSDYLTTMAIRYVDSIVDIVENFNDeenLQQKIKYLAGKHT-NCGLTVAHFVVSLQILCICVHIWQT---------------------------
>ERR1700755_1321676 
------------------------------------------------------LN-SKG-HRQRDELLNALVSILSKYDPdrpdsqpmieLEADAMGWGRRHASfaalggrPA--GPDQYRVVRDVLWQLLIDASDGRWDAGHTEALVDAYHWVQTIMMW---
>tr|A0A0V1KYG9|A0A0V1KYG9_9BILA Uncharacterized protein OS=Trichinella nativa GN=T02_16304 PE=4 SV=1
--SLSAGELKLLRWLWKQMKqvhQGLASAKLFQIIFATCPEIKRFFGL-AKDT-IDMIINSLSYDNE----------------QLAQLMIAFGCQHSFytRRNFDPKYWNVFGDAMLHLVDDLPLKAFKrYRAKSIWFRFVYFVISHMQLGY-
>tr|A0A1I7VKJ4|A0A1I7VKJ4_LOALO Uncharacterized protein OS=Loa loa OX=7209 PE=4 SV=1 
---------------------------------------------------------------------NALKKIIESLKNeqiPYEVLQRISVKHA-RHNIQTHHIQKMIKPLVENVRRALGR-QDENAERAWETLFQTIAII------
>SRR4051812_9951159 
MTPLPPEVAQTIRSSCRPLLerQEQFHGDFHASLVDLMPEVPMMREP--A------------GEQVSRWLVECVLWAVNADEPvpmIGATLQGVGLDAH-RLGFPRAGYQAVGHALLRTVRGASQNDWSGTLSSSWIGYHSWLCEYWVSG--
>ERR1711890_22380
-MHLSDTEKSAVVSSWSNVN-SSLLDSVLLQLVQENADMRAAMSR-GDLA-EDSIREQETFKADVTKLTCCITKLVTRLGNTGEVSSCPatCLKNC-P-YLQPKHVPLFISSFCD------KLELTEDAKKGWKFIMEKTAERI-----
>ERR1712018_299478
----------------SDVA-ENHLEDVLLQLVRENSELRSSFSW-GNLP-EDCLRDDDKFKEDVKRLNTCISKVVDILSSSGDApLACPvsSFTSC-P-YLKSVDMPLFIKCFNS------GNKFSENAKSGWTAIFEMAGKKM-----
>SRR5262249_47865225 
---MNHRQVELVRSSYERIRrvRHLFADLFNRRLTLIAPVLERLLPP--ET------------ARRDAAALELVEFVVAGLDRLDVLLPALAVQARVwrLKGVEAADYDVAGMALAWTVEQVLV---------------------------
>SRR5215470_9720857 
----------EAKRSYRQFArDISFYRELSKRLFRKIPGIEKKFRH-RTM------------EEQYKVLRDSLWLLLSYASapdQqEPTILSRIAHTYA-R--FPKEWFDTFREVILDVVAQRDP-----SSVRAWKHAMAPGLEYL-----
>ERR1719487_1476365
-------YKTILDRCYERMTtqldLVAMVTLFQGIFFGRDIRIQSYFSKP-N-------------ATLRYVVLRIINFLVNVYHkpaAITGELRALGVSHV-KWEIPPDLFVPLGEALFITLEICLGG--------------------------
>ERR1719271_344116
-----------------------IRKDIYSTFFTQAPAGQDYFKQS-N----------TYLHVVADKIMVMTLELYQNPVKMVDDISALGLRHV-GYAIPTELFGPFVSACVEVLMTRTSD---EATIESFRWSLGLTSKML-----
>LSQX01.3.fsa_nt_gb|LSQX01333836.1|_8 # 4697 # 5665 # -1 # ID=41498_8;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.475
-----------------------LRQEFFLNFFKLAPSGQDFFKQS-L----------TRLYFIADKIIELCLEIYRQPRAMVEDISGLGLRHV-GYAIPPELFGPFVGSAVEMFSLATTN---ETAIDGFKWAMQLVSKIL-----
>SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold703673_1 # 2 # 517 # 1 # ID=703673_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.653
-----------------------SSSIIVSSFMRDssrPCRRVRTIKQS-N----------TRLHFIAESATNMSLKLLQDPWRMVDDVSALGLRHV-GYGIPTEMFGPFTEAAVDALRGHVDE---TLALEAFNWSLSIISQML-----
>tr|A0A1I7RTA6|A0A1I7RTA6_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1 
-TGMTRHHKMILQKIWMRASeadINECSRNMMSHLLRSNQQLYQMFNLV-GMT-DKEIQQSIPFNRQAANFAMVFDFVITNLTDdlnrVAFALEFLGQHHA-DLGFTIdqPFWALFNRVFEDNPPKLV--FQNPEGHQVWKLMVNFVVRQVKNGY-
>tr|E3MDQ4|E3MDQ4_CAERE CRE-GLB-31 protein OS=Caenorhabditis remanei OX=31234 GN=Cre-glb-31 PE=4 SV=1
-------DVERIRAVWMDhINgNDDYFQEVIHRICKRNDGIRCAMLTQnAQHA-ESAAEEDFVLSNIADRISQFFHQLIEddvllNTVELKKCCYDLGRQHS-AYSkkqFKISFWEEFTLTMMDVLEQNYP-QTTKEEQKAWLHFQRFVNENMLDGY-
>tr|A0A0B2VIR8|A0A0B2VIR8_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_08540 PE=4 SV=1
---QTSTRIALLQSSWTSVQtmtSGQFGARIVYSMLRKDPSLFDVFTTVqydgeetplrqtsgliarfynfGSIPdktppnngEetplrqtsgliarkSFDLLTCPQYYEVGDRIMNFMGELIQMMQDgqseqaIIERIRLVGATHY-ERNVmfSSCVWREFKASTLAIVGESTFEseSIRVETLKAWSSFVSLIIREMKNG--
>tr|A0A1Y5SIU2|A0A1Y5SIU2_9RHOB Uncharacterized protein OS=Roseisalinus antarcticus OX=254357 GN=ROA7023_01630 PE=4 SV=1 
-------QAELVADSLSRVGdkVIWLASDYYEALFDASPQLHGVLPH--QM------------SEQTNMLGHALAHALANLRDpdgAAPMAQDAGLADR-SARMPPRMRRTIVRTLVHALSLWHGPTWTKDHARAWNEGLLGVA--------
>tr|A0A0N5CYF2|A0A0N5CYF2_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1
--ALSTVQRQIVKECMDKA-KDDIAERIYRRIFERRSDFRKFILA---LPD-------KQRWALTDSLHNYLKSAVNQIKDgsaVRKISEDFGAFHVQyrSFGFRPDFFVSTADAVTTEFVLLDAaVHQASDTLCAWSTLTGFMFSSVRDGY-
>tr|A0A0K6SA08|A0A0K6SA08_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_8920.t1.CR2 PE=3 SV=1
---------------------AAMAEKFFELVPKRAPNLRMIFEKRQDI-----------YKHHFGEI---TKRLLAYLDSpeeVWKEDPELAIKHI-EFGVMPCDVPVFANVFLQILAELAGPAWTQRHRDTWDKLFSIVSGALA----
>tr|A0A0G4H7J1|A0A0G4H7J1_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_24983 PE=3 SV=1
---------------------AVFSREFFKRLSTFAPSVHAVFVKSEEK-----------YTRTIKDL---LGRLLAYIDDpsaIWSDDEELAMRHV-IFGVMPTDIPLYNRVMVQTMAGIAGGEWNLQHDAVWTKMMGLATETLS----
>SRR5215468_7630418 
----SPEVMRVIRFSAGLLAelQDMFVRQLHSEVTALIPGLAA------NG------------RIFCERMVRSLLWAATAgqpPHAAAGALRQVGAANR-RDGFPEERYADVARALVLALRNVSGSSWDNSIGSAWISYFRWAEPHLRAG--
>SRR5215469_6664897 
----APAAGRVGCQSAIRLSrnQDAFIRQLYDDFKELDPDSaqtqAP------DL------------LVFCERMVRALLWVALTdqpLRVVADELRQVGAQNW-YES-------------------------------------------------
>SRR4051812_31756681 
----APSVMRLLASCTADLGpqQPELAEALYQRLLELLPEVatlAE------RG------------RPLSDRILHAVLYPTEPgrtPLNVATVVQQVGAQNY-LDGLVGEHYSSVTHAVLHAAREMYRGEWSSALSSAWVEYLLWLRGHLLAG--
>ERR1712232_311801
--------------------RREMSMAIWNRMFKKDPEAERVFKQ-SN----------ERLIFIVEKAFENAAKIYQSPSETREYIQGFLVLMK-LLLMAL--LGRFLSSRAPWL--------------------------------
>tr|A0A0G4HD16|A0A0G4HD16_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6316 PE=4 SV=1 
---LTFEQKeEIVRSAWTTLSstyqLQEIGRVLYETICEEAPGLSSRYTKPGE--------------VMALRFGEMLATLIHlfldFPNDLQQKMEELAIRHV-NYNVDLEYLPVFEISILRTVQELYCeGEFDVEVAT------------------
>tr|A0A2W4YK05|A0A2W4YK05_9SPHN Uncharacterized protein OS=Altererythrobacter marensis OX=543877 GN=DI636_06370 PE=4 SV=1
--------AALIERGLERAAqqLGDITPLVMREFYRRIPEAEASFRHH-APHDPH--------GLEAEMVGNTLHYIMRWHEAPmeiRIDMDTSVPHHRVALDVPPDWYRGMIEAAIDVILSSVPSSA-SDERTAWKQLRDQLVSL------
>tr|A0A1Y6FH01|A0A1Y6FH01_9SPHN Uncharacterized protein OS=Altererythrobacter xiamenensis OX=1316679 GN=SAMN06297468_2444 PE=4 SV=1
--------STLAERSFERLAeqRGDITQDVLERYYRRYPDGRASFEHH-GLGNRA--------ELEGRMVSTTAFLLMQWAQDPggtRIEQGTTIVHHQDTLEIGPRLYLGLIDAVLEVLFETIPDES-AEERAFWLSLRGEIADF------
>tr|A0A2E8LSZ4|A0A2E8LSZ4_9ACTN Uncharacterized protein OS=Actinobacteria bacterium OX=1883427 GN=CL510_01665 PE=4 SV=1
--------SELAQRSLERLSevGGDVTRPVLDAYYARHPDARASFEHH-GLGHTA--------ELEGRMVAESLYLLLTWIEDPataRIDHGTAIVHHNDSLHIPPRWYLGLVDAALDVLLRTVPEDS-PDERALWVALREEFAAF------
>tr|A0A1E4JTP1|A0A1E4JTP1_9SPHN Uncharacterized protein OS=Sphingopyxis sp. SCN 67-31 OX=1660142 GN=ABS88_06340 PE=4 SV=1
--------LELLDRSLTRAAdaIGDITPVVMARYYARHPDAAASFERH-GMGRTS--------ALEHEMVDNCLYCLMYCLERPteiEILLENSVPHHQFTLQVSFDWYRGLVDATIDVIAESVPADA-ADERQVWDEIRSVLGGV------
>tr|A0A2E0VIY1|A0A2E0VIY1_9GAMM Uncharacterized protein OS=Porticoccaceae bacterium OX=2026782 GN=CMK32_09515 PE=4 SV=1
--------NDLILNSFESAAesLGDITPHVYRRFFLQYPEAESLFNIK-GAQFQD--------ELKVQMVRDAIYAYLEYLETPeevEIVFKYTIPQHV-DLDIPIRYFIALLEAVADVVCDSVDDRTQADTKASWSELLQEFRQM------
>ERR1711865_325941
---------------------SQFGLNAFNRLFDTEPRSEDHFKT-SN----------A---RLSMLATKSLELSMQMYKEptrVMNEVTSLGLRYI-FPAHD-----------------------------------------------
>SRR2546421_6426420 
------------------------------XMIRRPPRstlfPYTTLFR-SDF------------ERQNKLLRHAFGLLLIFPNQartEPSVLTRVAERHSRrDLDIPRSEEHTSElqsRSDLVCRLLLEKKK-KNQV--------------------
>tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae GN=mphP PE=4 SV=1
--------------------VTAHSIQAVADELRAHraeFIQAANQ------------------KPD-SPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD--------------------------
>tr|D9QCQ3|D9QCQ3_CORP2 Oxidoreductase OS=Corynebacterium pseudotuberculosis (strain C231) GN=CpC231_1874 PE=4 SV=1
--------------------KDAFHTQVFANF--YHsnPYARATI------------------APS-EQLVPAVISLIGHLENngfisdeVKQKFLEHTKLLD-ARGF--HHYTALASAVRSALQTMCTD--------------------------
>ERR1719474_106261
----STASLELVLDFWRCTVhrlsvhdRAMMGGDLFRGMSRQDAACRALLESL--N------PTSERMDLWGLRFLDTTGWMLRRANaaDLDASLKAMGAEDR-ARGLTVAYYRVLVERLHSELAARFPTKYSETVQAAMEEVIWSFVRR------
>ERR1719499_858439
------------------------GRAIIEGMNHE-------------N------TSPNQMDMRTVRLLDTLGWMIRMSciPtmDLKVLYAAWNGMAA-EVGYSAEYHVSWIQYIEAQLTERFPSEYTDSVRSAVRELLRWSIPN------
>ERR1719410_2598304
-------------------------------------------------------------PSHALKILNVFGYVIRNLIHpsnhlkLFKQLQSLGTVHR-AHSLNNEMYEAMLKSFNYAMEEKFANHYKIRIRFCLSQLYRVIVDIMTG---
>ERR1719216_785110
-------------------------------------------------------------PKHTIKIITTFGYIIKNLIYskehtkIFKQLQSLGEMHQ-CHSMInTDIYMELLNAWHFAMEEKFQNKYKNNTRFCFNQLYRLIVDTLMG---
>tr|E0VF51|E0VF51_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236397 PE=3 SV=1 
--------VKIVTPTWESIKedFDWYCTKIEETFFQNDTTKKELFTL-PKFEeELTDDVVNKRLFKHSSAVLNFMECIVQFMNGneeTKPVLFVLGRNHY-TIGVNEKLFLEMKDAICSVIKYKIG----TENAKAWDTILQYI---------
>tr|A0A0M3IFG8|A0A0M3IFG8_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=3 SV=1
-TGLSMHQKAILTARWRQLPqgiVFDLGKRVFGTLFQKDPNLLVVINL-EHLQGTDAWRDHVNFHMHAQRFTHALSQCMRHLVEpivAADRLQEFGATYAEmedsenfnRSRIPHSYWDRLISAMTSTAKEFHEnpsqksrrnslsvddalvatnerldLQIDSANISAWSALATFVSNQIRFGYE
>ERR1719199_711328
---FKPSHISLIQNQMSALIsefgsIEGAGEFLITQICALDEYVAKLFSG-AAL------------RVQGFKFLGQIARWVTYLADpetVEADLYNLGIRHL-GY-VTQQDFAKFLPaviqCMQKSLKDVLDEQWSALAAESWKMFLGYAGGH------
>ERR1712070_698694
---------------------------------------------------------------LCFIIARVIDIAAQlfvEPDVCIAEVLQLGLRHI-MYKVPADFFGPFAGIIADEIEARCD---------------------------
>sp|O76243|GLBB_CERLA Body wall hemoglobin OS=Cerebratulus lacteus OX=6221 PE=1 SV=3
-----------------------VVDAFYVELFTAHPQYQDRFA-FKGVA-LGDLKGNAAYQTQASKTVDYITAALAGSAD----AAGLASRHV-GRNVGAPEFTHAKACLAKACA-------------------------------
>tr|A0A2C9LKZ0|A0A2C9LKZ0_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106051185 PE=4 SV=1 
--GISLADIKVITNQWEDVLrcSDLFGKLLVLYVLDNCPKVNALHPGLHAR--LTDARD-SVEKQIGLRVIQSISCVIHNLNRapaVESMVRDTFKKLQ-QHGYTKNTILECSEAFLSFMNQYFSKRWLKQHSDAWFKVLKALL--------
>SRR5690606_9602430 
-------------------------RAFYPILYSSVSGAQELFEA--TVG------------TDNRKMLQILAKLFGfisNVNhsSefMkSDAFIERGKYYA-DHGISETMMRGFSSALVLTLRRTLGELFTISHVRAWGIFLDTISHAL-----
>SRR4051812_40179264 
-------------------------RIFFPILYSTVPSSQELIEE--AVG------------TDSIKMLQLLVKIFRiisDINhdPevMkSEAFLERGKFYA-DHNISENMLRGFNSALTLSLRRSLGERFTISHVRAWGAFLEMISHSL-----
>SRR5690242_7041980 
-------------------------RAFYPILFSTVSSSQEIFEE--HIG------------SDQTRMTETLRHVLEffiSVNlnPqiLsSDKVIERAKKYA-DLGISENMLKGFSFSFLKALKQVLGGALSAEAMREMVRLLDNISIQI-----
>tr|A0A0G4HHE4|A0A0G4HHE4_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6802 PE=3 SV=1 
-------------------------DALLGILFEASPTMRSVFVKNGDL--------------YADLIEHLLRRIIAYADDpgaLWTDDQHLALDHI-NFGMSMSDLPLFGASLMNCLAGVLGENWCDEWQRAWEKAWQICCQSL-----