Mercurial > repos > galaxy-australia > alphafold2
view test-data/multimer_output/msas/A/bfd_uniclust_hits.a3m @ 12:7fbec959cf2b draft
planemo upload for repository https://github.com/usegalaxy-au/tools-au commit 6fdbb269efd97b6f5c6ab40db4ab0b23459f884b
author | galaxy-australia |
---|---|
date | Fri, 16 Sep 2022 06:14:06 +0000 |
parents | 3bd420ec162d |
children |
line wrap: on
line source
>chain_A MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR >tr|A0A1K0GGD5|A0A1K0GGD5_RAT Globin d1 OS=Rattus norvegicus GN=Glnd1 PE=3 SV=1 -----------------------MYGLEKEp-R------------ETEGClsrKLPSNLQRSSAPWRLHGFQNLLERSQGA--------QRAKPG------------HGAHSHSSVKMAL--SQTDH------------------rlvL >tr|F6QUQ8|F6QUQ8_XENTR Uncharacterized protein OS=Xenopus tropicalis OX=8364 PE=3 SV=1 -HWTAEEKAAITSVWQKV--NLEQDGHEALTSISLTFISPLdvvwAYFKG----------AAHNK---------IKFCFNIELKQISLSFHARWKNQNPEQKLERLGEVLVIVLASKLGTAFTPQIQGAWEKFVAVLVDALSQGYN >ERR1712144_198951 HESLWKRQVRG---evfLGESRPE-VrRDRRRSSG-qDAGGLPPDQTYFSHWaDLSPDSSQVKKHGGVIMGAVGEAVGKIDDIVGAVSNLSSCMPSSSEWTLPTS------------------------------------------- >tr|A0A096M318|A0A096M318_POEFO Uncharacterized protein OS=Poecilia formosa PE=3 SV=1 VNH-KHDELII---tgvFFTS-------VSECVP-pVRNIYRQTTNSIENIgNFKngetfLTNPPVALYVVNMVEFTSKPLMSL-PLNGFYGILDFLKA--KRKNPNGGKLLADCLTIVIASKMGS-gFTPEIQATFQKFLAVVVSALGKQYH >ERR1719244_1811598 --WSDDETKAIQMIWNSVD--VNELGPAALRRCLLVYPWTQRYFGKFgDIATPTAimqnpGVAQHGITVMNGLKLAGGPGGGPGNQPGGQQELWQRGKQQGQQQLWQQGQHGGKQRGqqQRQGQq-PSPRQSX------------------ >ERR1719167_1707907 VEWTDFERATIQDIFAKMP--YEEVGPAALARGLIVYPWTQRYFGNFgnLYSAStilvNPLIAKHGTTILHGLDRAMKNMDNIKETYAELSVLHSEKLHVDPDNFRLVSDCLTIVVAGKMGKDFTGEVQAAFQKFLAVVVSALGRHHH >tr|A0A146QLZ2|A0A146QLZ2_FUNHE Hemoglobin subunit alpha-2 (Fragment) OS=Fundulus heteroclitus OX=8078 PE=4 SV=1 IILTSNYNYTFNTFFSKFSSNSYSIFSYSLSIILFFYPHTNTYFSHFnYLIPFSSPFNNHLstfiflfsxxxXXVMGGVEDDVEKIENMKEGIIRISEMNELNMRVEKEKLKIMEKKIIVV--------------------------------- >tr|A0A147ASE9|A0A147ASE9_FUNHE Cytoglobin (Fragment) OS=Fundulus heteroclitus PE=3 SV=1 EPLSDSEREIIQDTWGHVYKNCEDVGVSVLIRFFVNFPSAKQYFSQFQdMedpeeMEQSSQLRQHACRVMNAINTVVENLNDPEKVSSvlaLVGKAHAMKHKVEPIYFKILSGVILEVLSEDFPDFFTADVQLVWTKLMGALYWHVTGAY- >tr|L8HUF7|L8HUF7_9CETA Hemoglobin subunit beta (Fragment) OS=Bos mutus OX=72004 GN=M91_21159 PE=3 SV=1 -YLTLEKKATVIDLWSKM--RVAEVGPDTVgrqvFKLLVVYPSTQRFFDYFgDCPLLIygqCFTffvsrhrfllfilvflCFKEDKMMYCFLKQFKKIKK------MIAKRNISK---------YKLRLIWVASHQYFGKEFTPEFQAACQKVVAGVVNALTYKYH >tr|A0A2Y9DG99|A0A2Y9DG99_TRIMA myoglobin OS=Trichechus manatus latirostris OX=127582 GN=LOC101351845 PE=4 SV=1 MALSDGEWQLVLNVWGKVEADIAGHGLEVLISLFKGHPETLEKFDKFkHLKseeemKACEDLKKHGVTVLTALGGILKKKGHHQAEIQPLAQSHATKHKIPVKYLEFISEAIIHVLQSKHPGDFGADAQGAMSKALELFRNAMAANYK >tr|M3YM80|M3YM80_MUSPF Myoglobin OS=Mustela putorius furo OX=9669 GN=MB PE=3 SV=1 MGLSDGEWQLVLNVWGKVEADLAGHGQAVLISLCQGLESRKEEKKRDpAHAcvssrrslFVSQDLLFHSDAFLVSLGHRSFLapvSGENGQSQKTQPAHHAQHHRQPWNTEKFISDAIIQVLQSKHAGDFGAEAQAAMKKALELFRNDIAAKYK >tr|A0A1C4HDU6|A0A1C4HDU6_PROAN Myoglobin (Fragment) OS=Protopterus annectens OX=7888 GN=Mb6b PE=2 SV=1 -------MACPAKFWEEnVVPDAAEHGKNILIRLYKEDPAAQGFFSKYkDTPvselGNNADVKEQGAVVVKALGELLKLKGQHESQLHAMAESHKNTYKIPVEYFPKIFKITDAYLHEKVGAVYA-AIQAAMNVAFDQIADGLKTQYQ >tr|Q9Y0D5|Q9Y0D5_MYXGL Hemoglobin OS=Myxine glutinosa GN=Hb PE=2 SV=1 -RTTEGERAAVRASWAVLMKDYEHAGVQILDKFFKANPAAKPFFTKMkDLHtledlASSADARWHVERIIQAVNFAVINIEDREklsNKFVKLSQDHIEEFHVtDPQYFMILSQTILDEVEKR-NGGLSGEGKSGWHKVMTIICKMLKSKY- >ERR1711977_634702 --WTDAERAAISSVWGKID--VGEIGPQALGRLLIVYPWTQRHFSSFgNLSTpaailGNPKVAAHGKTVMAGLERAVKNMDDIKSAYSDLSRCTPRSCMWIPTTSGSWLNAspcvwlpsldvrPSTLMSRRpGRSSWLwssppwadsTTEGLKTHHNQIICSSFL----- >tr|Q9U6L6|Q9U6L6_MYXGL Hemoglobin OS=Myxine glutinosa OX=7769 GN=Hb PE=2 SV=1 -TLSEGDKKAIRESWPQIYKNFEQNSLAVLLEFLKKFPKAQDSFPKFsakkSHLEQDPAVKLQAEVIINAVNHTIGLMDKEaamKKYLKDLSTKHSTEFQVNPDMFKELSAVFVSTMGGK----------AAYEKLFSIIATLLRSTYD >ERR1719474_978995 ---------------------------------LLQSSWKQ--FRT----------------------------------FASLSGIRQEELGAGCQHQDLP----------QIQHHLWISEPSTFQQL------------- >ERR1719336_830457 -----------------------------------------------------------------------------SINPQSTVDLGAQYISATPLNYKNHQDIYNSLLSNG------VLVPANVSLI------------- >tr|B7QI99|B7QI99_IXOSC Globin, putative OS=Ixodes scapularis OX=6945 GN=8041668 PE=3 SV=1 -GLTTSDKCAIKDTWTMFRRETRTNALSLFVALFSRYPEYQKMFPNFADvalkdMMQCPSLTAHALTVIYALASIIESIDDENtmvELIKKNIRNHV-RRSVTPEHFVNINNLLIEVMQVKLRSRMTASVIVSWKKFFAMHDAVTRQTY- >tr|A0A1W0WKD0|A0A1W0WKD0_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_10224 PE=3 SV=1 -GLTSNHIKAVRANWKLIEKRLPEYGLELFVAYLNKHPDWIGLLPFLKPadmprLQQTPRLKAHGTIVLKKLGELLTMLDSPPkliGELLKQGSTHR-ARGLAPENFQAIQHDLNELFVKICGPE---FDIEGWDAVLTLIMTGIEEGL- >tr|T1KR38|T1KR38_TETUR Uncharacterized protein OS=Tetranychus urticae OX=32264 GN=107366531 PE=3 SV=1 -LLSDDEVKVIQSIWSSVMKDANTHGMNFFLKFFRENPTFQERFASLRNlkteeEMkASKRLKAHAASVFHAITALVDNLDDLEcvsDMLEKIAANHL-RRKVNWPFFDRIALCIVAFLSETLGTqIMDSKATTAWTKVLNVITETVKRVE- >tr|A0A2N8ZEM6|A0A2N8ZEM6_9VIBR Globin OS=Vibrio tapetis subsp. tapetis OX=1671868 GN=VTAP4600_A2359 PE=3 SV=1 --LSEQQIYLVQECYRQVEESPHEFAKHYYGKLFELEPRLQALFRN-DLD-------IQGRKLIAMLEVAVNGVKDMGMLVPMltqLTQLahrHN-DYNVKKSHFSLLNTALHHAFEQHLQQAYTDEHRQAWQTLLDFMVDTMK---- >tr|A0A1I0MYA2|A0A1I0MYA2_9RHOB Hemoglobin-like flavoprotein OS=Cognatiyoonia koreensis OX=364200 GN=SAMN04488515_0317 PE=3 SV=1 --LSQTQVDLIRTSAEVLAEANVAATNVFYANLFRVAPGVRNLFSE-DMF-------EQSEKLWNTIVKVVESARDLTEIEADLHALgarHV-HYGAEPGHYVVVTDVLIQTISSMMEDKWTDETQAAWKTALEAVCATML---- >tr|A0A146Z291|A0A146Z291_FUNHE Hemoglobin subunit epsilon (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 ---SYHYLIIITSIFSNLY--YNYFFPNSLIIFLIFYPFTHIYFSNFFNLYNSYsintnpNIQSHFTNFLHFLYLSFNNIYNINFTYSYFIFLHSYNLHFYPYNFNLLSYFFTIFISSNIFSVIKE---------------------- >tr|H9GUN8|H9GUN8_ANOCA Uncharacterized protein OS=Anolis carolinensis GN=LOC103282340 PE=3 SV=1 -KMTDLDRRHIREIWTAAFENPEENGRLVIIRFFSDYPASKQYFKTVPTDGdlkAHPQVAFHGRRIMVAFSQVIENMENWNQACVLLErlvNNHKNIHQVPSGMFQLLFQAMLCTFDDLLGRTFTPEKRVSWEKFFQVIQEEVEAAY- >tr|C3YSB7|C3YSB7_BRAFL Uncharacterized protein OS=Branchiostoma floridae OX=7739 GN=BRAFLDRAFT_96956 PE=3 SV=1 TGLTANQIQLIRDTWQIVYKNKRENCFAIFRILFTDHPSTKSLFRLMDAVdldvpgefEKNVAARAHMVRFMHSFATFMDTLDEPAELRQLLYDLgknH-AKHQVGPELFDALGPILMKALPIVLDGKFTPEVKTAWLTAYTFMSTHLK---- >UPI000197D711 status=active AGLTPKDIYEAKQCWNKAASlGVNKVGVLLFKNIFTIAPEAAKAFSFGNDPnfMNNKEMEEHGVKVVMAFDHAVRSLDNIHAlqeTADGLRDTHS-FFNLSPEHHVIVKEALLQTLKQGLGDEFTDAQRELWNGIYTAIRNMWVG--- >KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold119418_1 # 1 # 498 # 1 # ID=119418_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510 EQISPLKLRLVQSSWRQAS-ADEQAGITAFKFFFEMEPVAIGMFGLQDIRdlYNSYELKRIAAKIVKAMTHIVNSFDNFEGlrpLIKKLGMMHG-EKGVSPSQYNNFGKAFMQTVEEILGDQFTPETRRAWETFFRILTGALQR--- >SaaInl8_100m_RNA_FD_contig_91_216993_length_256_multi_18_in_0_out_0_1 # 1 # 255 # 1 # ID=160783_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459 NLLPKNTILQVQTSLQKVLQTTKTISPIFYAQLFEIDPSTRPLFSTEND----QQLKQQETKFTLMLSAIVNSLTNLDSlipVLQDLGKKHL-NYKVQKSHYETFGIALLSTFALILADDFTQETKKAWEDTYGLIASIITE--- >tr|A0A091DYW0|A0A091DYW0_FUKDA Cytoglobin OS=Fukomys damarensis GN=H920_02872 PE=3 SV=1 -PPHEGGSCATPLPWGNRDLGPWACVRPDLCRFFVNFPSAKQYFSQFRHmedpleMERSPQLRKHACRVMGALNTVVENLHDPDKvssVLALVGKAHALKHKVEPVYFKTISGVILELIAEECANDFPPEAQRAWAKLRGLIYSHVTAA-- >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1887876_1 # 1 # 366 # -1 # ID=1887876_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459 -PVSDENKDILRESWKRLEEEKTTLCKNVFIRLLQLNPNLQDTFPSFkgvalDELMNSRSLFLHSKRLMEALEIAISSLDDGQDFTEYLTHLGErHtAISITENHFKIMEKALIFALKDMLGESCTEDVANAWREFFQSMAGT------ >tr|A0A2E1AIS1|A0A2E1AIS1_9CHLR Uncharacterized protein OS=Anaerolineaceae bacterium OX=2024896 GN=CL607_22355 PE=3 SV=1 SPVTSRQKLLL--HYTLLHLDADQMGKLFYDHILAAMPEVAPMFTD---------LESQRKHFMKMMIRIVHTIDEPDHLNIVLRELghiHK-RLHLKPRHFSKMGVAFSNSLAEVMGDRYTPEIGEAWRILYNRVAEAMQS--- >APLak6261659701_1056019.scaffolds.fasta_scaffold514158_1 # 3 # 230 # 1 # ID=514158_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561 IELNAKNKALVKEGWKLLIETQFPnevggneralarFFDEFYRKFFEVNPSGKRLFEEGGM-------AVQSKALVKMMSMVVTSLENPSNLDLTIERLggrHE-LYGVSRSDYLAFTNAMCETLETVLGDKCNQEMKESWSLVLNNLSEK------ >SRR6516164_1622129 -SDDPRTEATRAghletggsrrrRGSRHVLPSAVR-----------NRPHHAQAIPRDR--------------------------------------YgraTQ-K----AAA--DVGL--rhRWPGX------------------------------- >SRR6516225_5669596 -VMTPEQKRLAScfrrggppGSWRRPSPPLGIETAQVFRIPCVLPN--AAVHTAGVSD-------HNNSDTYRAALRPAH---R----AASQTASvrnHE-RIQSETAM--REGL--rrvTYARVLRTGS-hRTPYrnVTP------------------ >SRR5215813_13307430 -KSTPPRAsyfratdmaaqrkkllqtleqglgqawtPAVAs-AWSEVYRLLSGIMrnaAERVERLQNVWPAPFDAVIX------------------------------------------------------------------------------------------------ >SRR5262249_1440316 -AMTPEQKRLVEd-TLKQMAASADAAAALFYCRLFEIDPTTRKLLPQTARA-------ATRLGCGIPQLLTDIFAVR----YAAHADFgtfSE-GTHGHSDL--EAGY--hrrlVX---------------------------------- >SRR5260370_32836152 -SDDPRT-EATRaGHLETSGSRRRRGGRHVLPSVVRNRPTTRTLFRATDMV-------AQRKKLLQTLAFAIGGLDNLDALGSKVEDLgrrHA-GYGVTDAQYDSVGAALLWTLEQGLHH-pPWPRRGPKTTDC------------- >SRR3989338_1269240 MDFNDEEIDIIKDTWDAVLYPey---PEEGFNPVLNFSTKFYRRVFEHENckNLFEEVDMTSQGEKLVKILSVLLVAVQTkslnqdHIHVLRKMGERHRG-YGVSDDMYEIIGGCLLRTLSEVCADVWDDDAKVVWAKLFGVVSE------- >SRR6516164_9760095 IVTTPQQVQLVKQSFAKTTPIAEQAAGLFYGRLFETAPQLRPLFKG--------DIKTQGRKLMSTIALAVGSLQKLPELVPIVQDLgrrYV-GYGVKDDQLRYRRRRAAVDARQGaRGRLHTRCEGRVDLGLYDPrrYDEERRSAA- >SRR5690348_1420512 ------------------------------RHRAESAPAVSGRS------------HSAKKEADGDDLHDDRRTERFQKAGPGSQEPrraPC-RLWCDCGGLSIVGEALLWTLEQGLAAEFKPEVRSAWIKLYDMIATTMQAGA- >SRR5437870_6238790 FDVTPIQVDLIRASWAKVEPIQELAASLFYDRLDRKSTRLNSSHVAIS-------YAV---------FCLKKKKKKKEK------------YTHEHINNNKV---------------------------------------- >APAra7269096870_1048528.scaffolds.fasta_scaffold62442_1 # 1 # 438 # 1 # ID=62442_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.454 -IIQPSAVSIIQSSFEQIKPNAGRFTRVFYDRLFERDPSLKKLFIR--------DIREQRKKFFRMLGSIVKNLSNPDELEPKLQDLgsrHD-YYSVKREDYRTFFEAFIYTLAAALGNDFDENTRHAWRDFCDYVGAHMCKE-- >SRR5215472_6010456 -------------------------------------------------------------------------------rMISGPLPDvitAT-ACGKS----TMPASAPLYCGH--LSKGS---------VSISRPMWAMIAV-- >SRR5262245_10239308 -PENARPGNL-RHHYadrgrcsGSLLPEAvqaRSVAGRHVSRRHERAAEE--AAAdA--------DGRRQGARSA----RSGRGGRRGSRPAPRAIRRdrqAL-RHGRHGS---P------LGARGGTRARFTPSVKKAWATVYGLLATTMKNA-- >SRR5688572_4752169 -RMNSQQIALVRQTCTEVAPIADSTAEIFYQKLFQLSPSMRSVFAP-G-------LRERGRHLMETVEAATQIMDHRGTMTSAFAELgsrQM-ALAAGNNRYEAVGAALILAFRQGLGPSFTPEARQAWIALFDYIDETMKAD-- >SRR5689334_18520770 TSMTPDDIALVQESWRKIEPVKEIAAELFYTRLFELDPPLRIVCGD--------DMKDRRKRFTQVVGATVRGLARVDMLLPAVREFgmrHP-LPGEIEQHHANVAGALLWMLEKALRKEFTPEVKAAWIKAYGMLSQTIRQT-- >SRR5215207_7597532 QTMTRDQIRLVQASFRNVLPIRELAAALFYDRLFEIDPGTRGLFVDT-------DLRSQGGKLMAAIGMVVHALDAPESMVEKLKELarrHV-NYRQLQESSPPDFHRLhrfgsgrgsqRHVVSKGPGVAPVGQ----HVVPTHFASRvsrRLRAC-- >ERR1700730_6579985 -RQRLADDGVILRVLQRGLGIELEMEALAREEIGELDPDAarfRPHHAV--------GGGEVGGRHIELLRRHVDQRPpcHAAANGSARISLprgHV-SYGAKPRHYPVVGAALLWTLEKGLGDGWTPEVADAWLTAYSTLSGYMISE-- >SRR5262249_2898310 -ILTADEIERVRNSFDQVWAISARTAELFYGRLSAGNLFAHAPSEA--------ERDDKRQKFMLTLAVVVASLDERADMDSLSERLaqaHT-EAGVRPEPASELREALFWSLEQALGPVWTPAVDAAWRKAYRRLSERMVSI-- >SRR6516165_4200192 -----AQ--------------------------------------S--------DLVDRGRA------YRLLGLADLVDRrnQAaagGLSLFhrrAV----------------------SAGGVAWADRVLDALSlylcgyelrwpQLDHALGRgavhpdacaSLLRE-- >ERR1700733_1486793 -------------SQAHGGDIVDLyRDVRLVYRLFRRLPPAEQDAIpG--------DHRRGRLSRaAGRVAL------------APVRRAarrQ---------DRRREG-DVLELRRDGRGDDRRHVFHRDQElswlSDDV--PR-VVRD-- >SRR5580658_8437352 ---------TGAGKFESVQEYADSVVLLFYGRLFELAPPTRGMFKI--------GIPEQARKLMGTLTSLVDALDRFEELRQWLTDLgrrHV-EYKARALPGAGDGAHVGFRAGAGYRV------RPGDEDCVGAVAERGVCG-- >SRR5215831_4136876 -KHDPPTDLARAEQLQVRCA------DRVKGRRSLLRPSLRDRSRGPA-------A--LPRKIIRAEGKVdgdANEDRQQSSSAQchFASCTptrRaaQ-GLRCLDGSLWGSGCCLLWTLEQGLGSAFTPEVKAAWSEAYRTLAGAMQEG-- >SRR5215469_10861266 ------------------------------------------------------------------LTGAPLTVHPVRDRSPQFSRIgspsgrHA-TARARGQWIRNNSAFRAMTLQQALGSEFTPNVRDAWVAYYQTPAAEMKA--- >tr|A0A1E4AHQ5|A0A1E4AHQ5_9BACT Uncharacterized protein OS=Cytophagaceae bacterium SCN 52-12 GN=ABS46_00305 PE=4 SV=1 -ACTQDQIRIVKKTWSFFRNmSPEFVGDVFYTKLFMDYPDLEKRYPR--------EAQKRYEDLIKMLNMVISRLDRPDELTWALteiANQPH-RIWVTPAHYQKVVSTLIWTLRKGLGNDWTAVVEDAWMSCIKMVESLNAAI-- >SRR5262245_55554356 -CVTPEHRLLAQQAFATIQPLADELGLLFYSRLFELDGALRGLFKH--------DLANQAHSLMAMLQLTIEGLDAPEQFTRARTTWgyaTWTmGFSRTSTRLLRRPCSGRSSMRX------------------------------ >SRR5260221_10622870 -IVNAAQQELVMTKAEGVVLMPGVTGVLLCALLISANPSFRPLFKS--------DMRIQGVKLMTMLAMVVYNLPEPGQVLPAIRDRseeHT-SELQSHSDFVCR--LLLLHX-------------------------------- >SRR5918994_240771 -------------SWKGVAGRRDEIARAFYAVLFDRHPELRSLFAHTD-------MRAQYEKFALMVDEIVQLRTEPRQFVRSAVLLgqrHT-MYGVTRRLVIAPAIRL-DRFAATDSIGFATPSTSALQlllcpRETVRRSGVMS---- >ERR1700730_15638689 -AMTPKQVALVQDSFAKVALTSEAAAVLFYNRLFDIAPQMKAMFPD--------DMVEQRRKLMSMLAGVVKGLANLEQVFAGRQRTgkaAC-QLRCEGG--ALSGGRRRVAVDAGEGsGGWLDAGSGGcVGHRlWHAVRLHDFPS-- >SRR5258706_7695680 -RHDPPPdpadPPVLRPA----RVQGRETRHLDVQAPVPARPRPTPAVQ------------------------------------------------------------------------------------------------- >tr|A0A1W2GRB7|A0A1W2GRB7_9BACT Hemoglobin-like flavoprotein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4043 PE=3 SV=1 -----RELMLVKSCWQTVAPNAIPLAMKFYDDLFEAKPEYRRLFSGD-------M-NKQAEKLMMTLGFLMANVDRVDKIKDAIHKLgalHV-KFKVLPEYYPPVQKALVGAIAQFMDNQWSYEHEDAWNKLISAVGDMMIEGT- >tr|A0A0N0UYC0|A0A0N0UYC0_9BACT Uncharacterized protein OS=bacterium 336/3 OX=1664068 GN=AD998_10010 PE=3 SV=1 -----EQKEIIKSSFPRVLIHTLKNSTIVYEKLFMDIPEAKDLFKNT-------SIDKQGQMLVAAIGKIVKGLDNPDIFEKDLVELatrHV-GYGLKPEYFTHFGNALINMFEVSLVDSWDKDLHDAWVAVYQEVAEIMKSVI- >SRR6185312_354929 --MVR--A-----------RGSAkC--WKCRWR--------------D-------RA--SVSnSLPAPATSSAGSACSNFS--------mngTA---SSkQPefDRVPRGGrgrgrrrKMTpeqVSLVQqsfakvapiseqaavlFYD-RL-FevapavkamfpadmteqrkKLM----------GTLAV-V--- >SRR6201981_618659 -ERHD--T-----------GGGQpRDAELFQDR--------------A-------DCGQGGGdLLRPPVRDRAAGQIVVSIRHGGAPGQadgDA-DRRGrRSyqSSLDPARgerarq--TpcqLRRQGgalsgrrcrvavdAGE-GTWRgldarcrgcmegglrnpVRLHDL----RGLRQ-------- >APLak6261666328_1056055.scaffolds.fasta_scaffold241778_1 # 2 # 196 # 1 # ID=241778_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.415 -AKTAGGL---NLLFL--AIVSS----EPENGFVTISPAAKDLFP-A-------DLTEQRKKLIATLAIVVNRLSNLQSILPAARTLtkrHV-NYGAKPEHYPVVGSAVLH-AGgrPRLGLDARSRLrsdGCVWHAVRLDDgrnleHEFANL--- >SRR5919197_1191720 -VLTRDQADIVQLTWRAVLPVGDTFAELFYGRLFALDPQLRRLFRE--------NLVEQGRNLTAMLSVAAANLARPEKISVALRQLgrrPT-RSSRARCSRSLLRDLLRLPLDARRA--VADGVARVVVafaRAVVAIP-RVIHG-- >tr|A0A1I1PNT6|A0A1I1PNT6_9RHOB Nitric oxide dioxygenase OS=Tropicimonas isoalkanivorans GN=SAMN04488094_11525 PE=3 SV=1 MPPSQQELARVKQSFEDLRPHHEPTSYDFYEELFARAPELRQLFRD--------DLKGQGMRFMNTLGLVLDDMTNPNGTtvdYAELGHLHT-TLGVRQAHFEPMEDALMASLGKKLGNEFTADLEEAWRNAFRAFSKKLIEA-- >SRR5262249_25899110 -MMNTQHIARIRLSFAWIAPSADVFGELFVANLRALDPSLSGLLAA--------EAGPQGWQLISILRSIIGGRDRPDRLFWRLQSFgrrLA-GDGLCAEDYDTIGDALMLTLEQCLGERLTPDVAAAWDATYAALAEVVQL--- >SRR3954451_4172984 -XMTPEHIHTVQSSWSKVLPVGNGQARLLFERLLQSEASLWGLFQL--------DAATWSANLVQMIDVLVTGLSLGDRRAVltrRIGGRNT-ACPAIEHHYDLIGTALLRTLAKPLRAEFPPSVEAECPPFY------------ >SRR5215470_15672373 -LMTPEQIALVQSSFERVGPELPALATRFYQELFGRDPALRPLFTT--------DMTLQKVRFAEKLTEIVRAFSRIPARSAPGTSAtgyGS-LTTR--PsAKHSSPR-SLPFSATASTArparrGAS--PTTWWPRPCSRVRQRLGV--- >SRR6266566_5437046 -DLTPENCDFMTEHHDL----------RILGRLVATE---------------------------------------------------Q-EQPVKDPDHDQIeeatrhrprscPTLFIWPNRRSQPLhrvlmRYMPvpgpRSPPSWCGPPSRSRSHGPR--- >SRR5215203_7560530 RPMTPDQVSLVRDARRAIESRHAEFSAAFHDALHELDVDTCALFRDTV-------TGGRACNVGAMLDLLQQASDDPRALIEVAAELgraHA-HAGVRDVHHHVAGVALHRALHRVLGVEFTPAMYEAWAEAFTLLIAVMERAA- >SRR5580658_533798 -XMHSIMIGHLRDSVSLLPMEDLRPVHEFYRRLFELAPEAQPLFTR--------EAGQQAKKFSDMLAWVIAHLEHADELRKEMRELgarHR-GYGVTADQYASVGSALIWMFQHALGDRFTPEMEEAWLEVFAFISLEAERGA- >tr|A0A1D8RRN7|A0A1D8RRN7_9GAMM Uncharacterized protein OS=Colwellia sp. PAMC 20917 GN=A3Q34_02175 PE=4 SV=1 --MTAKQINLVQQSWQKVLILSPDVGDLFYQQLFVLRPELATLLKN--------DKQdKirANKDFICLLSQEINLLQPIELTEEKV-nTSVT-TNDV-KNYQADVENALLLALTMILDKELKIALKRAWISTIKRLVGSIVIEL- >SRR5262249_21459549 IGQKREPPTVERRHREQVEEAQEDGKIGDD------------------A-------QRLARALLDLFAELVGDLDGPRH---------V-GFLX------------------------------------------------ >APDOM4702015191_1054821.scaffolds.fasta_scaffold152199_1 # 3 # 686 # -1 # ID=152199_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.531 -------------------------------------------MSG--------DFSPEQKRYLEGFTS------GLQ--------IartGR-GLG-KPAASVPSGPD-----AEHLIAQDQT---------------------- >MesohylFT_1024984.scaffolds.fasta_scaffold1796824_1 # 3 # 146 # -1 # ID=1796824_1;partial=10;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.340 EELNFQEIAIVKDTFALVEPHGSKFAQDFYDKFFTMSPEVTSLFAN--------VDRDHSSKMiWNALMLIVYNLENKQQLQNTLFGLgrrHM-NYGVSSHHYLSMGEAIMATLQSYLEanQSWNEEVAAAWERAYNLVSRRMQKG-- >SRR3569623_2148552 --ISYGTVMQVTLSWDKFKQVQnfqERAGELIFERLFELEPQLRAQYKFSeD--ediKSNPAFASHARTMVDMIDMAVSFLGpDLDPLAEDLEDLgkrHI-AYGVNAVHLPVMEKAVVYAFEELLGDNFIKDDRNAWQVMFHFIITNMGKGM- >SRR5450759_1049036 -ALTAEaPYSELKnlCVWSKT--------NAGMGSLYRSQHELVFVF-kN--------GMrphinnvelgrfgrnrtniwnyagassfGstrdselamHPTVKPLSLVADAIlDCSKRGgivldafagsGTTLIAAEKTgrr---GYGTELDPFYADT----------------------ivrrFEDAYGL-KAVHVE--- >SRR5210317_1560035 -----------------XMTSL----KSSMIGFFRNHQNCAKMFGE--------DMRDQAQKLAAILQVAFDNLDHVDSLVPILEDVgakHA-TYAVTPEHYGLVAAALIGTISTELGDAFDERAAESFEAVLGTVANVMISG-- >ERR1719240_1900674 ----------------AVARvLVHGL-ANLHRRALERLDLLLELVDAhRVVVlrllHRLdgrldrlHVLRRHLVLVLE------EG------------LLgavHR-RVGLILH----------LHLRLAIGVRRGE---------------------- >ERR1044072_2403146 -VLTEEHKKALRHSWRLLEPLGETVSDLFYRRLFEIRPDLRILFPP--------DMAAQKRKLLVMLMFIVKAMDWPIedwaaeidpenDLLlvvLALVRRHSHLYQVTSEHYAPVGEALVWTLEQALGQGFEGAPQKTTGPVCVLGCSPWG---- >SRR5437899_2276119 ------------------YPAVQKSGAAVYRPALVAELRDRPY-EF--------DIQVQLCVYLARMA---------LEIVAALN--AA-GWICVPKDPSPEM------LKAAWAYALDEDAAGVWKSMIAA---------- >ERR1700757_2961956 -------------------------------------------------------------RFNRLAGRERRAPARTR-----ARQSr-----QRPGPSRHDPTrLALSD----------VSEAERTDIVVS------------ >SRR4029078_1694892 ------------------RNFnPVVIGDSFYSKLFSLKHSLRRMFPG--------VMHEHYLQLVKLLNLIIAALDQPGQLEEefeILARKHR-HYGLTSSHYELFEEAMLWTIERALGKDCNKPIVSRWKTCYLALVRRTIAA-- >tr|A0A1H4HXI9|A0A1H4HXI9_9BURK Adenylate cyclase, class 3 OS=Variovorax sp. YR216 GN=SAMN05444680_12751 PE=3 SV=1 ---APDSVLLVQSTIGVLLQHQKRFTQDLYRRLFALAPAAEGLFR-GDM-------DSQGQMLSHMMQFLVHAMSRPEIMALGLRDLgrrHD-GYGVAAEYYPAFRQAFLESARGILDERYTAQVEKAWAETIDMIIESMRGP-- >SRR5687768_10564074 -RMTPQQTQLVKRSFWIAEGRRTQLAGCFLAELFARDPALWRLFSS--------DPALRRDKLHHAVAGFVASIDRLHPIVPVLEWLafhGA-RHGIGERQHVAIADAFLAAMETVLGESFTPAHRQAWWLACRSVIDVMVHA-- >UPI0004291969 status=active --KQSDTVFLVQSTLEKVFPQLDEFTNQFFKKFYELDPSVKEIFYEIDA-------KNKKQMVVNMIGFLTQGINRFDVIIPSIKEInerHF-GREVKPKYYLIASKALVNVLEDYLGEDFTPEVKQTWIEFYEQIVNFME---- >tr|A0A2D6RHV2|A0A2D6RHV2_9GAMM Methyl-accepting chemotaxis protein (Fragment) OS=Colwelliaceae bacterium OX=2026726 GN=CL811_09640 PE=4 SV=1 --MTPKQNIAVIESWKKVQPIASQVSQVFYDDLCEKHPSLKALLGE--------ELSSARDQLVAYLNSLVETLVATDEVViEDL-AKH-LRIGLAPEQFSDVGPALLTSLEIGLEKDFTATVKRAWTALNKLIVAAMAQ--- >SRR5215469_12962076 --------------------------------------------------------------SLSARAGRQAGFGl------SG--------LGSAAT--taiPTPSTSLTGSTARTTG--cSAPYSR-----TGT----------- >SRR5205807_5077868 ----------------------RVGHGRVYPRLYIIARHAAGIYAlT--------RPVAKPgRPRPVCLVPIHKDIA--VMRVTTDQLLartPL-GrFGEAAevgqlVHYLVSDAA------RFVS-GATVTIDGAWTAYGGWALR------- >ERR1719223_615602 -MTDKSSSQRVLDSWNAIKSIPnykEVAGVLLFRRIFALAPEAHGLFRFTNGFepnseelFESERLIEHGKGVIATLEAAIDMLGpasDLNPLICFLQELganHQ-RYGVLHDHYPIVGEALIETLSAAMGDKFTDDIKLAWEEIYGIIESNMIDG-- >SRR3954469_4757651 -SMTEVSVQRLAENYQLLAGRMAALTATFYERLFEAMPSVRPLFKI--------DIALQSQHLAAARALIVRNVRHLDALEEPLTELgvhHA-KVGVRPEQSPPLCRVMIETLRDGSGDRWSPQLESDWTPVLEMVSRIMMAG-- >SRR6516165_10653891 -EPSPNQLHQNRPD------RRPGGGTLLWPPLRDGSR-NPGAVLQ--------RRGRTGSEANGRSCNRCEQSRRFRGDRPHRTRS-C-KAPRRPEHYALVGSALLWTLEQGLGDEFTPALRAAWAAAYCALSEVMIA--- >tr|A0A210QIU4|A0A210QIU4_MIZYE Neuroglobin OS=Mizuhopecten yessoensis OX=6573 GN=KP79_PYT10777 PE=3 SV=1 -YLTSEQVRLVKQSWLILGEDMAATGLLVFKKLFESNEGMKKLFYKLmRCDSseqlefDQEKLTRHATIVMQGLGAAVESLEDSVfltNVLIAMGERHA-MYNVKTEMVPHLWPAIRDAFKELMGEDFLPAVESAWLHVFEYIGSKFKMG-- >SRR3954465_7515966 --------------------RGRAVGPSCYAPVSPLHPATSRLCSA--------DLLAAGVRLVDELVSLAVAAGDLATFTDRARAVgmrCC-ACGVVAADYPAFGDALVAAVAEVVGPDWTTAAADAWRRLYTLMSETVLEG-- >SRR5215207_9441599 ----PEQLALVRGTASIIDAVGDSFAERFDDHLFARYPAARRLFPD--------DTTTHRGQLTDEIVFLVAAAADLHALLERARALgapPP-LRRtrrrlparrrgTRRRGRGRRGRSVVGRNG---G-SLA----------------------- >SRR2546430_16462751 -----------------------------------------------------------------------------FLLSVVIA--CS-CWCRHVSSlqhdrad-------HPVGLCPGIVADWSPALSQNVGEGFQQDCSD-dG---- >SRR5271166_2850757 -RWMRPKRNSCARPSPKSRRSPIKAGAMLYEKMFALDPDLRRLFAI--------DIETQGAKLMAVFATAIANLHRLDEILPTVRELgrrHV-AFGVKDRDYDTGGVALVQTLEAGLGDAFTPAVRDAWMACYEAITGEMKA--- >ERR1711915_528574 TAFTEEQEALVKKSWNAMKPNASELGFRFFLRVFEIAPSAKRLFSFLhDSdvpIEKNAKLKAHAITVFKMTCESAVQLREKGTPtfsesnVKDLGKSHF-KYGVVDEHFDVVKFCLLETIKDAVPDIWSLEMKTAWDEAYTQLAEAIKSEM- >ERR1719460_671936 -MVDAVVKGDVQRTWELVIPPDSgddhvfAIGKLFFDRIFEVTPGAEALFSFKGEdRAESAKFRAHAIKVIKTVGVAVAKLDDLETLVPILEDLgkkHV-AYGVVASTTT----SSVWRCCGRSRRGWATNSRPTW---------------- >SRR6266567_6698575 ---------------------LIVFTSTCLWSI----RKPNHSLPKRI-------CVVKLAHCWLHLTTVVAGVLREDNLVPVLQQLgqrHK-SYGVKAEYYPFFRAVLLETFQHYLGPRFTPKMQQAWEEAFEMISTQMLKGA- >SRR5688572_5289639 -TVTPDRQQLIRDSWRALEPNGPRLVELAFLHLLQIAPAARPLMTGH-------SLPCVCRNVASILDQLIAALDEPKQFVPLAIGLgrsNP-GHGINAALYPAMGEALLWALHLQLGEGLTPELQTAWLEYHHLVSAIMRRA-- >SRR5262245_22087501 -LMTPERQRLVHDSWRTLEPNGTRLVELAVLHLVSIAPSVRSRLDGA-------TLPLVCQHIAGMLGRLVETLDEPKQFVPLAISLgreNP-DRGLTAKLYPAMGEALIFALHLQLGDAFTHELQTAWLEFDRLVCAIMQRG-- >ERR1711916_36627 ----LELFKILGILWFLLLMSLRNCSIIDYLR----------------------NI--LKLRLCSLKTCNFKKLNSX----------------------------------------------------------------- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9902871_2 # 1417 # 1767 # -1 # ID=9902871_2;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.538 ----ALDTKLIKDSFELAKPISDKLVKRFYENLYSDYPQSKSLYLDG-------QLPESQLAILKAINFIVDNLHNKEKLGTFLKTLnerYE-LRLNDSVINQSVCSSFLKTLSEAFGSDWTSELAEQWELTYQMVTSFFQDSK- >1185.fasta_scaffold1192548_1 # 3 # 452 # -1 # ID=1192548_1;partial=10;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.684 -TVTPEQIDLVERSVTELTPIMDEVVADFYTGLFAADPAIETLFAGAGAGaggahGQGDGFAVQRAKFAAQLADILTAVRDHERFLATAAdlgARHR-GYGVHAAHYTLVGRALLDALARHLGDRWTPATADAWRLAYNLTAEAMMA--- >tr|A0A1Y5RHX9|A0A1Y5RHX9_9RHOB Flavohemoprotein OS=Palleronia marisminoris GN=hmp PE=3 SV=1 --MPNDDMRLIQPSIARIFVVRRSIGQAFYERLFERQPTFRTMFPT--------DLRTQARTFDDMIALIVKKTGDPEAVTPVllaIGRRYL-TYGLRPQDLRVIGEVLMEVLCAQTPGGLSPDEAAAWERSFSRAAEVVKL--- >DeetaT_11_FD_k123_441726_1 # 2 # 373 # 1 # ID=403715_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.481 -GLTDLQIEMIRSSWEKVTPNKKHHGQLLFHKLFEIAPEMTDLFPFG-DDFTKPQFTTHALNIMNALDHAIQNLDNPDVLIPKLRELgqmHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG--- >AP82_1055514.scaffolds.fasta_scaffold664619_1 # 53 # 358 # 1 # ID=664619_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.458 --MSGFALRLVLTQRQKATrkrpiaqyvieNHSINFAFHYIDRLFEIAPEMTDLFPFG-DDFTKPQFTTHALNIMNALDHAIQNLDNPDVLIPKLRELgqmHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG--- >SRR6187402_963757 -------KLHIQNSWLKLG-YSADMITDFYNQLFLLYPRLRPLFKE--------DIRLQARKFTAHITYLINHINDWNRLQRDLDELgkrHV-HYEIKVEYFEYVKEALFPTMRKHMG--------------------------- >tr|A0A1C4TW82|A0A1C4TW82_9ACTN NAD(P)H-flavin reductase OS=Micromonospora haikouensis OX=686309 GN=GA0070558_10167 PE=4 SV=1 -----AVSADLGPSWAATAAAVDRAAANFLDTVSDRLPGLLP--------------ERDHTVVFAALGRLAGGVDDTAGRAAALAVLaraHR-GVGLLPQHADLLGDALLAAVARENRAHWTAALATGWERGLRRAVTAVRRA-- >tr|D6Z7Y9|D6Z7Y9_SEGRD Oxidoreductase FAD-binding domain protein OS=Segniliparus rotundus (strain ATCC BAA-972 / CDC 1076 / CIP 108378 / DSM 44985 / J -----TDQ-gAAARLLEAVAADPVVFVRSFHVELFRCAPELAERFPS--------GLGGHHAAFVTMTKHILQGFAdgsDPPALIDLLGQLgrdHR-KYQLGEEHYRAAKTALAKALADAARSTRDNE---FCAQAAALVCAVMEQE-- >tr|A0A1Q9NIM3|A0A1Q9NIM3_9ARCH Bacterial hemoglobin OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=vhb_2 PE=4 SV=1 -SLNTKDIQLIKNSWEKLTENKKEVRNTFYTGMFEDDPKLKSLFRE--------SFLSWD-NLPDSFEFMFKHLENLEGEILEMKRLglkHK-TFSVKPKHFPIGRKSLVKTIKQYMGDKYTEELGAAWTKLFDYMSHYMILG-- >ERR1711911_167941 TGLTVRQKRIIAKNWDLVRPNLKEAGAFQVVRDGAT--ERVGRQP-QaAGPrrq---HHVQHDDAGRL-----------AQRCGVSGAAPGHH-RPqspssALETAPFSGEPQFILRR--------------------------------- >ERR1719223_727152 --PSSAQVDAVTASWDKVAALgAETVGVLLFKRIFEIAPALESELSEKPTaiIIGDLTLAREMT----EEEKETIDLEEKEEPeeveekeEPEEVDEqetTE-GRIISTESF------------------------------------------- >ERR1711871_830988 --------FFFFFFW---------RPPFFFFFLLLRVSSFLPLFVASLPPperlfKVGSPLVAYGATVVRALNVAIGLLTDLPTLVPVLKTAlpsL--FPGAQKEHYGIVGQAALNSLAIALGRYWKEPVKNAWLKIWNTVVAVVFS--- >ERR1712232_1508017 -PLDGRDIALVQTTLGMVAKLGlNTVGKVIFLKVLKLNPNAAQLFTWGKMDaalmwKDGSPAVAHSIKVVQTTATAIGLLTDLDTLVPILQTLgvqHNGspmlpdaygGKGVIPKELDVFAGAVLEALAVALGANFTEPVKNAWIKVYTTADGVMKA--- >SRR5882757_3847967 -----------------------TSI--------------WPIIIN--------TaVGirnipQDYRNVARVLRLnqFEF-FTKimVPAAAPYIFTGl-------------RIGIGLSWLAI--------------VAA-------------- >ERR1700737_3002051 -----------------------RDF--------------HHLDLA--------DhHQ---------HRVagTQW-AN--GSMSNAVWTGv-------------RLKDVLDRAGV--------------KSGAI------------ >SRR3954451_23003713 -----------------------LKS------------TTGEVFLE--------G--klv-DE-------PGpdRAI-VFQn-HSLLPWLTVYg-------------NVAIATDKVFGGSGARSKSKAERHDWVMHNLELVQM---A-- >SRR5206468_1650083 -----------------------TNA------------TMGCVLLE--------N--rev-NS-------PGaaRRR-QGVc-ERQDPQRAQRmgdAqpqpradgacqgqA-PG-GDFRRYEAARRHCPRAGHATKSAAARRAVRRAGRADPRAPAGL------ >SRR5258705_633045 -----------------------TSE------------DAGPVALG--------N--qev-KQ-------PRtqPPV-VFLd-PALPPRPPALd-------------HWLLRAARDAGGP------QPQ-------------------- >SRR5690606_21133184 -----------------------INP------------LHGAVRLN--------D--aap-RV-------GDpeVGY-LLAr-DALLPWRTALr-------------NVTLPLEV---RGI----ERREREQSARKVLRDVGL---E-- >SRR5688500_4892119 -----------------------QEP------------SEGEVQTF--------G--sra-QC-------PNphTVT-VQQa-YTCFPWLTALg-------------NVEFGLRV--QGK------RDNAREVATEYLHKVGL---G-- >SRR5699024_2544359 -----------------------LSPSSGKIIVAFSSPTSGKIMMD--------V--ndwtSYKDSEMTALRLkeIGF-IFQe-SHLLPYLKIRe-------------QLEFVGREAGMDK-------KHARKRAKEILDLFGL---D-- >SRR3954447_21976298 -----------------------RAA------------TGGVVRWS--------V--dplvAAG-----GRARhpLSM-VFQk-DTVLPWRTVAq-------------NVGLFYALN---RD----RRAGAEGVVDDLIRLAGL---E-- >ERR1719419_74415 -PFTPEQRTLINETWGNISTKEtgsmGMLAKQVYERLFRSAPGIKRLFKDSD-------MLAISRAFGGMLGVLVSAVNQPLQFQHIVKGLgvrHQ-VYGVKPDHFRIMYTSLVRTFAQILGDKFTSEHKKAWSCLYNWVIDAMQRSMR >sp|Q8T7J9|GLB_YOLEI Globin OS=Yoldia eightsii PE=1 SV=1 MSFSAAQVDTVRSNWCSMTADIDAAGYRIFELLFQRNPDYQSKFKAFkGLAvsalKGNPNAEKHIRIVLGGLGRILGALNTPE-LDVIYKemaSNHK-PRGVMKQQFKDMGQAIVTALSEIQSKSGGSFDRATWEALFESVANGIGQYQ- >sp|P0C227|GLB_NERAL Globin OS=Nerita albicilla PE=1 SV=1 KSLSADQKAAIKSSWAAFAADITGNGSNVLVQFFKDYPGDQSYFKKFdGKKpdelKGDAQLATHASQVFGSLNNMIDSMDDPDKMVGLLCknaSDHI-PRGVRQQQYKELFSTLMNYMQSLPGANVAGDTKAAWDKALNAMANIIDAEQ- >ERR1719238_612722 ------------------------------------------LDGE-------TKPKEDQ-----NLSNPWAATAVTAILIPNLRDLglrHC-RYGCRLEDYELGGKAFMMTIEHFMGDAVTPEVRAAWLWVYGVVQSVMVSM-- >tr|A0A0P6RCU1|A0A0P6RCU1_9RHOB Flavohemoprotein OS=Phaeobacter sp. 11ANDIMAR09 OX=1225647 GN=AN476_12305 PE=3 SV=1 ---ASTCKALVLRSFESERMDLEAFIPLFYSNFFEAYPEARAIFPT--------DTERLEAKLLASLTHIAEALESSERLdgiLSELGQKHR-RMQISDSHFDGFIQSFIRSLATTLGPEWSDQSDEAWSQFLRYVAKRMSFLE- >tr|B3SDK5|B3SDK5_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_62364 PE=3 SV=1 SYLNYQERQAIIDSWNAISTEKQKYGTILFLKLFELEPRVKSLFTIFDFNeplediIQSPHFRSHAMRFMQSLETGVLMGFD-kescDFLFKSLGSRHH-FYDLKSEFLDVIPECILHTIKKGCGNNWSNETADAWKIATKVLCELFREG-- >SRR6266700_8223772 -FFLPFKE-LTEQHFSILGlRKARRAGLVLAQELFEHAPHVGARHSN--------AFGGRHPNAILAVEPFLRRAKNRDQP------DSG-AWSATSFHFGWNGGFX------------------------------------ >ERR1044072_5206314 --MAPPQIAVARSTGPKVSPMQQRLAQVFYERLFELDPTTRAFFGGVD-------LRHHGLKLTETLSAGIEVLGRDGPAPRGS--------GSGMAALRDGGGCVVHGAGVLPGPRVHDRSPGGLVGGVLG---------- >SRR6516162_1975606 -TGVSEQHLLDLGGVDILA----ATDDHVFDPA--GDLQIsavvqdAQVAGT--------YPAVRVDGFGGAFGHVEVAEHGLVAAcADlpg-LAGRHG-LSGDRI----------------------ANGHLDL----------------- >SRR5947209_12860360 --------------------LFSRQPRSAGQRLFTRFPQTRTLFAATDM-------LEQRKKLQQSLALIVEHMQHPEVLGDMLKGWtrgTS-PMVFDHSIIP-----------------WSEQ--------------------- >tr|V3ZYY7|V3ZYY7_LOTGI Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_167450 PE=3 SV=1 ---------------------------------------------MDDNqesLKENYRFRCHVGLFCETIRIAVEEMREIEEVLLFLKDLgrkHR-MYGATPTYIKTAGEGIVYAIDRKLGNEFTRSMKTSWKKFFTILQDSILEG-- >SRR5438045_5489985 -------LITRPTSYYLLSlhdaLPISLLADVFYSKLFVKNTGLRKMFP-A-------DLQLQRQKLMNMLHFIISNLDQPELFnkeIEGLGLRQD-RKSTRLNSSHLGISYAVFCLKK------------------------------ >tr|H2ZPV1|H2ZPV1_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1 MHFTDEELDLIRTSWGQVMKlGTKEVGIQIFTRLLNDAPKLRSHFYSIdiaDDEelslevmREKKKVVSHATRIAVAISKFVDFLDKPEELDSlltKLGESHA-RLQVDPGSFEYVAPVILAVIGGHLNLPSNSSTLQAWVKAYGVMRNGIVA--- >SRR3954451_6295623 -------------XMSTLIKGSPHFSspysptgetDQVPEHLFRLDPSLRALFTRTD-------FVRQRRMLLNMIGVTVRGLDRLDGVVPTLRDLgrrHV-GYGVRPEHLSLSR------LNHWLPrGQADPEVMQGTADfhh-------------- >tr|V9ZVV7|V9ZVV7_AERHY Globin OS=Aeromonas hydrophila 4AK4 GN=AH4AK4_1427 PE=3 SV=1 --MTSEQIELVQRAWGKVTALNNTYVQEVYAELFRLSPELINLFPDPAG--------MPVAKVSDTLNTVITSLEQLDAlsfIIRDLGRRHQ-KFKVQSHQFDLLKQALTLVLARRLGEHFTPALSDAWSQMYDEIAALMLEG-- >SRR5580704_19412242 ----PDIAAFVRFASRFASES-SH-------SQMTIHATIVSQQRQ--------QIEMRTGFX------------------------------------------------------------------------------- >ERR1700732_4531564 ----ASPNGRRNSARASmlISSQPIRRSPRFSATTW-----------------------WHRPRC-SCSLWVRSEVNRMEELgggLCALGERHV-DYGVKRADYNKLASVLIQTLKEFLVDEFTVELQHAWGTVD------------ >SRR5258708_12476517 --------VLWEWLVDVGGARWRWFGGRLLEIFLETSPELRSLFHK--------DIAQETGMLEWMLGSLVKGLNRLLEIeggLRALGRRHR-DYKIDQADHEKVLRALLLTLAEFVGDDFTPQVSRAWKTVYGKIPDTMTDR-- >SRR6266699_2567678 -AItkrrfqAAQAVVQIDDSFnPPDWYpDEHPPMPEIVARFFELAPDAQGLFRG--------DMERQYLKLMNMIAAIVGTLDKREMFksiIGRSGRQHA-QFGAKPLHFAAFGDALIWGLEQQFGAAFTPEMKEAWIKLYDDVQREMMC--- >SRR5690349_3556304 -YLTGQQVLLLKKSFRQMN--PAQIAAQFYGTLFQQHPEVKSMFPA--------DTVELGSKLMSVFELVVFSFDEKEHgrfglqdvLikpLRALGRKHD-DKGVKPEYYEIANSLLLKIMKE--SEYFTTEMYQSWQLALEHLTYAMQDK-- >tr|A0A0S4IWR4|A0A0S4IWR4_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72665 PE=3 SV=1 -LVTVSSNELVQTSWSWVAHDMVGLGDMFYDQLFMIDSEIEHTlfAGT--------DMKRQAVRVMEMIDAAVQGLNTPETIAEVMFTSglrHA-AYGVQRDHYTVVGKALIAALKAFLARRFTPEVAQAWSVFYNGVQRRMLEG-- >SRR4051794_5741567 -SMRPEQMQLDGLTLADATTDRLARGRDFYRRLSVPAPYLRGRCDG--------DVDAESAKLKETRTLALRMLGNMRFMVATLDAMakrDV-ARGLSEQHCRAIAQSLIWALERRLGAGFSRQVCTAWTEFLAVVMTCLHG--- >SRR5436853_3450426 --------VLLKDSFNLVRSEEHTSELQSLRHLVCRLLLEKKKKnkTTTV-----NYIE---KEKLGKLEA-SCPVEQTI----GIGDKQR-DYQ--QMHHPERTEAQ-----KX----------------------------- >tr|C7FFW0|C7FFW0_BRASE Extracellular tetra-domain globin (Fragment) OS=Branchipolynoe seepensis OX=326992 PE=3 SV=1 --VSDAQKAAIKASWAGAD--LQAAGTGFYVHLAAEAPAVYANFNLGADPH-GAKSQEQGLRVMKFVNQCVNSIDNMAIVQAKIDALahrHM-SYNVKKSDFVPAKPCFLGALADALgG-KFNADARAAWAGFYDIIAAGLST--- >ERR1719506_1011120 -PITAREGQIVQDSWKAVKKVGGESGHavikdIFYQ-HLLKDPNVKQLFRNS-------DMKLQATKLWQTLHVAVDGLSTSGPWFLCCRIWarlTS-STGSKRS------TSMPWVRRsSTrspraWGPRsrrssrWRGRKCTAWLLRRX----------- >SRR5579862_1310240 -LMDPLRIRMVQDSLVKLTPREGSIVDLFAAELSGSPHDESETGGD--------NIAYQrERSVLGIMAAAAPFLHAPECILDEVVAEIG-AGRIHPADYDHAANAFLRALKKNLGAEFTADLWEAWLEALWTLCNLLSRT-- >tr|A0A1E3GPU1|A0A1E3GPU1_9GAMM Bacterial hemoglobin OS=Methylophaga muralis GN=vhb PE=3 SV=1 -KLQEQDIALVEQNFAVLMEFSDALAERFYQRLFTEYPEIMPLFKSV-------TIEGQHKKLLASMVLLIQHLRDTEMIEDYLqglGARHQ-QYGVETSHFEMFIENWLSVVAEFADQKWDSKLQQAWRNVLEYVAELMQSPT- >SRR5438034_562795 ------AVETLRNSFERVIERSPNLTRRFYEILFEKYPQTRRMFGL-Q------SGKGKGNGKGAGARQRLRRChcrlhfgkekaTVVPFPlpvPVPLPAFRD-SYX------------------------------------------------- >SRR5688572_434377 -PMDKERAHLVRDTWMVLTPRADEIAAAFYAHLFSLDPDAREMFAHVE-------MTAQGRKFLGMIGTLIRLLDDPADIVIetiPAARRHA-TYGVTGDHLDTGREALMRALERHVARRLHTCRSAGVGRAVRP---------- >SRR5205085_1772709 -LMENRQAHRTSDRLQIELAAAQARIGLLYFAQHDRAPAARAMFST--------DIGVQSRKFSDMLEVLVEGLDDFDQKRPALRAMglrHV-AYGVVPAHYDTLATAFLWALGHMLYPEFSPEVKGAX---------------- >tr|A0A0N9QWL5|A0A0N9QWL5_9ANNE Intracellular single-domain globin (Fragment) OS=Eulagiscinae sp. JPG-2015 PE=2 SV=1 --VSDAQKALIKSSWAGVD--LNAAGVAFLNQMEQKAHDVYAVFKVGGGATSNPKAAALGLKVMTFVDEAVKGIDDMGAVGGKLDelaQRHT-KYGAKKAHFPVAGPCFLDALAEVCGGRFSADARAAWSDFYDVIAQHLSA--- >tr|A0A0P6AJ75|A0A0P6AJ75_9CRUS Globin OS=Daphnia magna PE=3 SV=1 --LKTVNVSAVQNTWAIVNKDLNTHAPHFYVALLTAHPEYQPMFPTIANVpagalLNNAALKTLSVNVLTKLSELIGCMGNPDALNAQLVDLanqHK-GRGTTRAHFDNLSKVLIDFLAAKLGGEFTPEARQAWTATMQGINTVVEA--- >ERR1719347_1330150 FCLSESNIKALKSCHPHLKDRKEEFGHLFYSNLFSNHPDLKSLFDQTEEG-----RQLQAQRLADTVVAFLEKCDDLPSLLPTFKKIgkrHT-TKGVKPEMYQIIIDNLVDTLEEMLGKeVFSAEVKQEVLESISFLSNAFIK--- >ERR1035437_6084348 -SLDQEMIAIVQVSWENVTPDSRLAASMLAMNLCADDRNIASLFEE--------DRIKMSRDVMQAVSCIVADLDQPETLVPYFGSLgqlLR-RHGLHESGQQTFATALFLTLGQLLGPRYGPVEHNAWAIAYSFVVRIMIAE-- >SRR6185369_9977853 ------CGVPDPDHV--------RGGG-------TAQERSRRAFLPTA-------VRDRSR-----VPRAVQGHRHAGagRDADDHADLgrrHI-GYGVQLHHYDAVEQALLEMIRRMIGDAFTLDVRLAWSHIYNELVRIMLAG-- >SRR5215471_14715706 ------VPAGGPALARLLRR--------HLRRV--VSSRLAPLFLRLA-------FNDAISYDPATGSGGANGSIRLPEELARKEVAglaRA-V------------------------ERLRPVKE------------------- >SRR5215813_3453690 -------------------------------------------------------------------IASDSEIQVSPWtrt--GTLaisARRCS-SSRISSGigsdtTFSLYGNCV------------SSSATIAWNTHGD----IQLDS-- >SRR5579859_1863727 ------NISSLQLTILNLLTVEDEFVPRFYNNLFNMYPLARSLFVHTe--------ISLQYNKLRLMLMMIIRTIHDADGLKIQLqqlGQRHK-YYRVEPEHFAILYIVFVQTVVEYLGPKWTAELEAAWAEAYGTIVRMMDME-- >Dee2metaT_7_FD_contig_123_47857_length_200_multi_10_in_2_out_1_1 # 3 # 200 # -1 # ID=100007_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.434 ---------VLRDREG---LGDPELVVLQRRHLAEHGAILQPLalLARQr--------HREDLELVRELLLLECDHRVEHPRahpaGVGVEgelGVGHH-TERIKRSlspsalLGRWIDLVVVGAVRR---------------HHQGGVVDLRLVE-- >SRR5262245_19300173 -LLTPAQKRLIRESFVTLEPAIDLVGQLFFLKLYRLDPSFRARFGG--------NPETQGRKFMAAVKLAIIALKHDDCLAPMLKLLgvrQR-ILGMKVRDYRMIGKAWTWTLERSLEKRFTRPIKDDWTALLALATRVLSG--- >tr|A0A1S3M8L1|A0A1S3M8L1_SALSA cytoglobin-2-like isoform X1 OS=Salmo salar GN=LOC106571144 PE=3 SV=1 -HLTDEHREIIKETWKVIQENIAKVGIIMFVGLFETHPECKDVFFLFrDVedlerLWNNKELQTHGLRIMHFIEKSVARLNQMErldQLILDLGKSHY-RYNSPPKYYMYVGAEFIRAVQPILKDNWTPEVEEAWKTLFLYITSIMKQGYV >SRR5258708_4037766 -------PGAVGPAPGLQPPRNRPGARRGQPALMQSPSAGGPPPGPHrpR-------RTHRTPPRRAALVLLRRSLRDLDEVVPGLRAMgarHV-RYGARPEHYPVVGAVLIDSMAEVAWDAWRPAYGRAWAAAFDVVSGAMLAG-- >OM-RGC.v1.004444255 TARA_034_DCM_0.22-1.6_scaffold509117_1_gene597562 NOG05352 "" -PfLQPTKFELVVNLKTA------------------------KALGL--------EVPPTLLARADEVAGVGGSAKRISHWppr------------------------------------------QSRWAGLPRRPERH------ >SRR5262245_16285966 --------XMVEGTLDAV--SLPALSADFYRRAFDTDPELARMFTA-D-------RRVQEARFATELAAIVRSIRCHDEFVPagrALGPVPR-L-RRDGRPLPRDGRRPAGIagrcprsdvearGGRGMAPRLQPDRRDDAERRPRAGQLGVTSG-- >SRR6266568_4225566 ----------------------------------------------------------------------------------FFFFQaedGI-RDG-TVTGVQTCALPIFDTVRHFGAGTWTADMQAAWETAVASIGSIMRA--- >SRR5260370_35001365 -----------------------------------------PTFPP--------AVGAGRKGVSRAVPGAVWSSDQPERLARGVGELardPG-KFGVPEQPYRLFCDALLATVQAFCAGSWSDQVQAAWERALAAITAAMMaggsgapgE--- >SRR5215475_1743066 ----ISYWPLVKQSFARATSDGVAAAEHFYARLFAVNPGIRALFPT--------SMTVQRERMFADLSRVIWSLDTEPECTALLRQIgreHR-RYGVLAKHCEAFLAARGRLLCrHDAGRLIRCERRARVVDCLDRQSRTAVagggl---- >APLak6261665767_1056052.scaffolds.fasta_scaffold282062_1 # 1 # 210 # 1 # ID=282062_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.505 --------------------GRSNLSLVFMKICLKLIPKLNVYLVKLI-------WRSRVKKLLNSLILLVEGLRTPEALIPVLKDLgarHK-GYGIVTEYYPLVGEILLNTFADYLQEDWTPEVAQAWLEIYTTTSNLMLEGAG >SRR4029079_9820506 -RVDGILVEGLQASLATMQPAAAQIAHGFYTLLFARRPDFRAMFPE--------DMAAQERKLIATLAFVCEHWRKPAAVSvrlADLGALHQ-GLHVKPEHYPIVCDALVTAVMKHRHEALGPHRAR------------------ >tr|A7RHV8|A7RHV8_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g197347 PE=3 SV=1 IPLSVAQKYLVRETWETIEQHSKAVGKKTFLRmfymssidfiysvvmeskgskdirvlglelafddvknsyrtwrFFEMNPDYQKLFPEFaTLDqvelEQANALHGHAKRVMKAVENAVSAMDDAESFAAyleNLGARHK-ARALKPAYLDAMQVAYTDTIQDLLKTQWTDGTAEAWNKLFRFIADTMKHGL- >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold1207366_1 # 2 # 214 # -1 # ID=1207366_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.286 ----YASHQSQAASLAKAAPRPRVAVLGLrlpsgeSPQLARLGRAFAELLGA--------ELAAGERLLVLPAeRVehMKLELGLDEAEAYPLPTLgriHR-NLGPDLVVVGTlapqeprgtlsvtveVKDCLTGAVTATAKVTGPAAELFTLASQvggelrrrlgssalsgneraelraqrpaSPEVAQLYADG-- >ERR1712062_404977 -ILTNQEISVLKSSWELIAKKIEIAGAHTFLPTFDRDPKCPDN------------IERHCQRVMSVVGGSIELINDYKSLWKhliSLGREHF-GKIREWIFASIAGGSTersgcspssINFLSSKINGNITSKK--CFLQ-YKIVIITQX---- >SRR5271166_154013 -VMTRLEIALVHEGFHRMESRLESICMAFCRTLFGLDLSLRPLFPN--------DLQPLAAHLAAGLETAVRSLDDLQPVlvcAPALGLRLA-SHGVVPDDLHTVCAALLATLQSELGDAFTEGVRAAWRRLFWIVAAATIGA-- >ERR1719261_40108 ------TIAVVQGTWQEIKDalgdgVAETAGVILFKHIFRIAPQALALFSFKDCAggnvcdelFENKTLRKHAAKVVGTVDTAVGMLKKTRQADSRPGQSgqeAR-GLwggagalrcgrgGVVGDAVGRVGRRVYDRGPRGLGGGLRHHQNHN-----DRQELRLHGR-- >ERR1719238_2294225 ------------------------------LKVA----SALREFNTLRAEgivseqefLEM------KAKLLAVGKDELG-RSPSGDTLETLVEAthemdssrrRT-RWtrrarraSRSPTTVGVISCQIK--------KSSTRRTTRRW---------------- >SRR5690349_20281755 -IMRPEQAALIRTTWAQVTPLGIAAAALFYERLFALDPELAAKFAHTDM-------ERQGKKLLQALTVVVATADRLHTLGPSLEELglgHL-RYGVMDRHYDTVGVRYWPPSKPPLAQRSRhrsrrhgPWPTPAWPPMC-GPARGGR---- >tr|E9IBK1|E9IBK1_SOLIN Uncharacterized protein (Fragment) OS=Solenopsis invicta OX=13686 GN=SINV_03861 PE=3 SV=1 -GLTEKQKRLVQNTWAIVRKDEVSIGVALVLaiarfvyecntksffySYFKQYPEAQKEFKAFkDVPidelSKNKRFQAHCANIVATIGKVIEQMHDPElmeASVINFTEKHK-NRGQTQKQFENLKQMMLDVFPSVFGKQYTPEVQEAWKKMLDLIYSKIYQTL- >tr|A0A0L7R0Z8|A0A0L7R0Z8_9HYME Globin OS=Habropoda laboriosa OX=597456 GN=WH47_01055 PE=3 SV=1 -GLTGREKRLVRESWSVLRVQSVNTGVAIMTSYFQQYPQYQKVFPAFkDVPldelAASKKFQAHCQNIVSTLSNAIDALNDVDlmeAILHTAGERHG-RRGQGRQEFIDLKGVIIEVMKGALKSRFSTEVEAAWNKTIDVLYLKIFEGI- >tr|W6FSH9|W6FSH9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_a PE=2 SV=1 LDFSDDQKADIKSTWETLYSgNKFQLGVELMANLFKAHPDYQDLFPSLkGIPdvAGSNELRGHAIRVITGINNFVDALDEEeevmREMLHNMARSHK-PRKLTKTHFNEFAPILLETFEKKVD--MSSKARDAWIALYYSIVDNLFAE-- >tr|N1VSG6|N1VSG6_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 GN=LEP1 ----PDPILEIQKSFDHVLEYNPHWIDSYIDKLKNFSMenvTENQREGDNES-------PISSEEFLNSIESIIEKLGNPISVKKEVSKLaniYE-SLGITKKEFPKLLPILLSSLRENLPSEWNPSLESIWTQAITDLTIETIES-- >UPI00001F6528 status=active ----IDGLRDLSESFDTLaadeaatAPAATELKaavegqfsgvfGAEYAKQTGKQPDTASYTLE---------------HSAAALAQYHYIVRNPHPLGQknKLDKVagEA-RYHALHARYHTMLNAYLERFGyydvflidldgdvvysvfkemdyatNLKTGPWRDSgLGRVFRSALESNDtkSTFFDDFA >SRR3569832_1336210 ---PALVRAAPDSAAALRRCRCGGTAEKIAERARADD----------------------------------PESENSRGAGAemkGLGARHK-QYGVQPEDYPAMRAALLEVMAALAGKAWTPAVAMAWEDALYILTDVMQKAYR >SRR3569832_1187104 ---PALVRAAPDSAAALRRCRCGGTAEKIAERARADD----------------------------------PESEKSRGAGAddeRIGRTAQ-AIRCSAGRLSSDACCAVGEQNGNGGX-------------------------- >ERR1719259_112507 -GVTGRQRVAVQASWRLVAPDAKRHGVAIFIRLFKKHPETQLVFKSFkGQQpeslADNKRLAAHATTVMASVATLVDNLDDIDTLLELLHKVaenHK-RRGLPIQYFEMVSNTIFDYLVETLGAALDRSGVEGWSNVFRAINSVIAAEYK >ERR1712107_384356 ------------------------------NRIFTEQPNVQQKYFSHmD--iNELGTLGKHGVGFMKKIDLMVTyvKADEDDNLVALIHEItvsHS-KKGIRNAwEFEIVCEILISYFKEAMESEFTSDAEDAWkkffef------LV-------- >tr|Q53I62|Q53I62_9ANNE Intracellular haemoglobin (Fragment) OS=Alvinella pompejana GN=hb-i PE=2 SV=1 -----------ADNIAAVRGDVSTHAMNIFVEYFKKFPQHQNAFADYkGKDpeslKSLPKFKTHTTKVVSKLLDIVEKASDSGALQSNCTTLakmPQ-HKGLNQQQFADLGAVLVPYLQKALGGACDSA---AWeqayn---------------- >tr|A0A132BSZ5|A0A132BSZ5_9RHOB Flavohemoprotein OS=Rhodobacteraceae bacterium O3.65 GN=hmp_2 PE=4 SV=1 -VLHQIDARLVEGSFGTVFARKAELTDVFYKHLFEEMPAARDMFTH-DF-------SRQKEMFARVLATGVRSHRGDATLAPLIENLllqHR-HLGLTSEHMYMAQRALLMAFRVVLTGHLTAAELSAWNAALRRLCQSMAAGL- >tr|F7RKN3|F7RKN3_9GAMM Globin OS=Shewanella sp. HN-41 GN=SOHN41_01091 PE=3 SV=1 MGLTEIEKEAITSSFSLINHQEQHFATIFYDCLFDMAPLIKPMFKR--------DRKLIEEHFYMIFCAAVDNIHHLDTirtILLELGARHR-NYGVKVLHFPIVKSALILAIQHELKGQSNASIENAWSHYYDVLAAIILEG-- >SRR5579875_3194573 --------------------------------------------------------SRCCSRATPSYGRCSRSRCrgpgrrsATGSPSSSATCRrpgAR-RSCSRRWPGITAGSASvtgtTGRSSRRSGPAWTAELDAAWLAATDWFVSVLAAA-- >tr|A0A0L8P0I1|A0A0L8P0I1_KITAU Flavohemoprotein OS=Kitasatospora aureofaciens GN=ADK78_37645 PE=4 SV=1 ----AADQRVITEYLELVTPFGE-LITHLYETMFRRWPYLRSLFPE--------SMEFQRAHLARAFWYLIENLHRPDDIAEVFGRLgrdHR-KLGVRPVHFQAFEAALCEALRRTAGPRWADAVEQAWVRMLRFAVAAMVSG-- >tr|A0A0G4II14|A0A0G4II14_PLABS Uncharacterized protein OS=Plasmodiophora brassicae OX=37360 GN=PBRA_003666 PE=3 SV=1 -NLTEERIDIVRKTWLTLKSGqgkgerdrlgsnpsvqdaMDLLAVMFFEILFKNAPEVEALFQC--------DLVMQGRRLTTALNNLVDLLGKdaaaISEILTRLAEVHH-PHGIQPEHYDPFGQALLAMVKAGLAEDFTSDVCEAWEHLYSTICSFMIP--- >SRR5262245_46558688 -EMNRIQVNRLRSSFKWFRPCGPAMIAMVFRSLGDRHPGVRALFPE--------DTSTLNKRLFETLRQVVKALARFHSLEERLMELgarAA-RAGANPAHYRIVRDELLATMAALAREDWSEELARDWTLMLDAVSGAMLRGA- >SRR4051794_9566520 --------------KALVEDVAERghrrPMEVFYGARsdhdlydidtmlrmAQSHPWLS-VRPV--------VATGpaggPMNSLSGQLPDAVRQYGPWREYDAYLSGPpgmIR--NGVD----ALVGVGV---PSDRIRHDSVEELVAAGDX-------------- >SRR5258708_3005780 -EPTPTDITIVSDSLAPLTkEQVDNVLAAFYHQLFTRQPSLRQLFKSFRsgDQPDQQAMKLQRNKLAEIIALGLKLWEKPHQLIPALEKLgrqHH-QYGVRDEYYEDVWIALSEVLSEAFGLDRWEDICESWQRFIFLCARHMLNG-- >ERR1719198_2284224 ---------------------------------SDMPSDALDWFTNP-TPe---KRGTPDGGKVVSADVVAVAGQM-----------------------------RELISLPEADVAQGLSQLDP---Q-----DLMVLQ--- >ERR1719223_1791071 ---------------------------------------------------------ANSKAT-D-DEAS-KS-D-----------------------------ATKVAVPAGVAAPEPKEEE----P-----VAVMEP--- >SRR6266542_3322184 -VMTPEQIEAVEATTAVLAPALDDLAADVYARLDRLAPETAELFTG--------GPAAEVRGRARDDRARHPAPRRLpGACl------------PARPPARALRGQA------GALRARRC----------------------- >tr|A0A194VHM2|A0A194VHM2_9PEZI Flavohemoprotein OS=Valsa mali var. pyri GN=VP1G_10414 PE=3 SV=1 MALTHHEAQLVKSTIPFLKEHGESISDTVYRTLIEKHPELNNTLNLIHL-----KDGRLARALTVVILRFASSINHISELIPKLERIcnkHC-SLGIQPEHYEILGGLIIETFDDAMGPLMTPEMKAAWTKAYRILSNMMIG--- >tr|G9MK89|G9MK89_HYPVG Uncharacterized protein OS=Hypocrea virens (strain Gv29-8 / FGSC 10586) GN=TRIVIDRAFT_143449 PE=4 SV=1 -------------------------------------------------------MNPPEKVDIRSTDGASVIYRDVISLNSPQEEIrvlHL-ESG---SGSSLLKCTLHRvSLQSVQAPSYE-ALSYTWGNEndrraVVV-NGYLVD--- >ERR1022692_2453048 -------XMSLPASFTSICNgiLGREE--------NSGCPAAKGQFLP--------DRDAWrRssaLLLFGPLHQASRSTGYVSHLHegaArppgrRispDRRPgrqAG-RSGRLRAGPRAGPPQVRGHRRALRRGRRQPAGDTGAFRGRHLDARVMIEA-- >SRR6266581_3027569 ------DTHRLKDSFAKIAMHGDEVPLFFYSDLFIKHPEVRELFPT--------SMKAQRDHLIVALGQIISQVDRVDELSAFLRGLgrdHR-KFGAVAENYEYVRDSLLETIAHFSGAGWTSRLDSQWRSSRRPGRGCGA---- >tr|A0A1D1W7H5|A0A1D1W7H5_RAMVA Uncharacterized protein OS=Ramazzottius varieornatus GN=RvY_17919 PE=3 SV=1 -GLAVKERMLVQRTWKELMqLGRSNVGIELFHQYFTKYPQYVQHFKAFREvPseklKAHPRLKAHATTVVNAMDVIIDSLDDTETAVAVLDKTgrdHD-RRGLSTSAFADLQTTLMMLLGMFLKDSWTPAVEQAWDKALTVVMNTVM---- >ERR1719487_198517 -NLTNNDIDLVHTSWNMILNDtapeyvklkesgddkhancVAWFYTVFYHRLFDVHPACRHLFTR--------EMMTQGSFLVRMISLTLQEMHDMEnfrDMMRSLAEKHC-AYGVKGIEYGIAGDVLLYSLQTVLGSdVFTSAVHFAWRKVYSAMLNHITP--- >SRR5439155_13306073 -LLD-------GGTLRAVRMSGDTRSEPWLKDLWERGVAVGELRRHLLLPleTPPGLPVPRGRILCNCFDVAESEIDAFLA----------------------T-SNSIAELqarlkCGTNCGSCLPELRRKSLCDIG----------- >JI10StandDraft_1071094.scaffolds.fasta_scaffold6072973_1 # 3 # 245 # -1 # ID=6072973_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.634 -LVESFGAEGSDKKVTGIRLVGETIASDWLKEVMTSGEFTADIRRWALAPlsAPPSGHAGRGKVVCS----------------------------------------------------------------------------- >ERR1719326_289429 --------------------------------------------------------AGQRMNLTKFITTAFSLLGTLPDALEALSQLgmrHI-LYQTKDAYWPVVGANVIKTLKIILPAEDFDKEtEEEWATLYGIMQKTILDA-- >GraSoiStandDraft_1057264.scaffolds.fasta_scaffold343999_2 # 425 # 754 # -1 # ID=343999_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.636 --RRRMDAELLETSLALVDTPDDGLTKRFYALLFERYPAVRPVFPEEM----HRDIARQAKMLRSAIISVVDHLDDPVWLtetLGELGARHA-GWGVLAEMYDAVTECMVAAMAEIGGDDWTPYMTDAWTEALDAVSGLMLLGYP >tr|A0A1I3HEN0|A0A1I3HEN0_9RHOB Nitric oxide dioxygenase OS=Jannaschia pohangensis GN=SAMN04488095_0565 PE=3 SV=1 --VTNTQARLLSRSLRRISENGAPLARSFYAELFSAHPEVRPMFHS-D-------LSTQYAKFEDMLVVLVADVLNPGVILRPLQDLakrHV-EYGVTREMYPIVGDIMMRTLRTLDAAPLTGDELEAWDVLLGRVNAFLMDE-- >SRR5215203_6923026 -PGDSGADRAGRAD---AERDQAGLRRGRG-RLLPPAVRRRPLRggavhhrAG-H-------PtgeADRGAGCGDALDQAPRRVPAPGRH-ArpaAPGLRG-----------------PPAALRHRAG--------------------------- >SRR5579864_8015183 ---KPDPIFLVHTSFVHLRPRMAEFVSNFFRRLLKDSPELAPIFEDAD-------SVRLKTMVAKIFGTTIAGPEQTDQVeadLAELSRRHK-SYGAIPDFLPLVGRAFIATIRESLPDDTTPQTIEAWELLYANTAALMSKGL- >SRR5262249_54331370 -IRLRK-------EIDNEWLLIASgVLSVIFGLILVAQPGTGALA---------------LLYVIGIYAILYGILGPRPCcv---------N-RFGAQTALDRG-----------------TSTYRELWNIS----VARLIG--- >SRR6266536_6175029 -LMTPEQITLVQSSFERLGPQLPAMATRFYQELFTRDPALRPLFTT--------PLPQQEVRFAEALTEIVRAMPRLDELLThtrAPRRPArrlR-GTGCRLPDPRRRPprrargrpgRQVRRPHTRGMGPRLQPcrrdharrrsrgPAHQQLTTTAAPTASQADGG-- >ERR1700754_2066947 ------DPGdrQLARELLAGAAGGDDLDALvehDRGAVLEIAREAVPVaLAQAD-------RDdQLGHLGA-----------------DRlLRGPaerPL-GRGAPLQDVALVvhrddavergqqqRAVALAAGAELVGEIWERQERGSLtARRYGSNRSI------ >SRR5918995_1637126 ------DVQALEKSFDLVAPRGDDLMEVFYTRLFTAAPAVKPLFAATD-------RRRLKRPNQRSPSVsVSEKQWSMKCSQDQladgqstgasrpetprndcsdsppgelaaKRDQgakTL-SRGGCSGGAIMVPDCRTPTPRGRP---------------------------- >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3668839_2 # 105 # 377 # 1 # ID=3668839_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.656 -------SGPLAASLAIFEPRLEAVTARLVDVLAASSPHLLALFPPSSEP-------S-----AALLGRFLTRIVETESLGqPLGDGLgldAY-PIP-TRDQWEHLVESFIWSLSAVAGKAFSPPMARAWRATGERLFSTMFES-- >LULI01.1.fsa_nt_gb|LULI01000097.1|_29 # 27187 # 28320 # 1 # ID=97_29;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.310 ---------------DEIKGRH---HSMFVDEFERQQPQYKD---------------------------FWARL------NrGEYQAGeyrRY-GKG-GKEVWIQA---------------------------------------- >SRR6266851_2503075 ------------------------------------------------------XMRNGSASLPLwPARYGAWTTRRPSPNISAPSRSti----------ANSVCGRAITNWSARRCSPPSVSSAASGWEAAFNRIATIMIQA-- >SRR5215204_2071689 -VVMSNDYQLLKESLALIEPVYDKVTGYFYARLFVENPHLRLMFPL--------TMDLQRDRLFRALVHVVQAVDQPEQVVPMLQQLardHR-KFQVEPAHYDAVGRALIGAIRQYSYGEWSDEIEAAWWRTYSVAARTMIDA-- >SRR5262245_14739337 -PCARARLRPR-------RPAL------Y-AQALPPRRLVPRPVRE--------LAEAQSRKFMAGLKLGIIALNYEDGLTPVIRLVgvrNR-RAGIKVRHHRVMAKALLPTLEQSLETRFTRDTKHAWSSFLTQVTRILSG--- >ERR1719401_2136855 ----------------------rGCMGVTSAPQTLRQVRQCRRLHGGRLArhdrdwsaeegsdeedVWESPALRKLFGKFVNAVGCTVAGLHDMTEIGLP--RRgatKR-MYGSHqR---------------------------------------------- >ERR1700736_6084178 ---------------ARVA--------QALDRVRKAARQRKK------------------EQFTSLLRH-----LNVDTL--------------RTAHYALKRKAAA----------------------------------- >tr|L8LYK6|L8LYK6_9CYAN Hemoglobin-like flavoprotein OS=Xenococcus sp. PCC 7305 GN=Xen7305DRAFT_00009490 PE=4 SV=1 ---MSLQIGLLEQSFNCIRPYGKLFVSSFHENLFQTNPEIKSLFMGVE-------SQIQKNRIWDTLVLIMENIRHPNLLnntLQGLGARLF-THGLLPKHYPLVKKAFLATFKQFLGNEWNSELEQAWKNAYTYFHDLMQEG-- >SRR5919106_2778213 -----------------------A-VDRFYAA-VLGDPELAGYFTdvdidrvkrhqvlllsdvlggpesydG--------PDLGQAHRGlgitdghyDKVVGYLVAVFTDLGADGDTIAAAaevL----ASVK---PQ----I---VEDQAGSRDSHEX-------------------- >SRR5690348_11784222 ------------------RaePGRAGgvprarga--RRLGEPGGgrarpSRRPLADR-AAD--------GPH-ARaPRQRARPAAGGRRHRLRADAGgargPGAAPGaaaHP-GLDVVP--------Vveqdgg--------PGadpcgpleegtlADVVTRY-GAWADRDVLVCGSPAMI-- >SRR5947209_9205436 -------VLSVLRSpssplF---PyttLFRSRltver--DSERDVLMvaggtGIATMRAL--LD--------DLA-QWgENPRVHLFYGGRTDDDLYALDd--LHQLdrkST-RLNSSHANISY---Avfclk------------------------------------- >SRR5438270_814702 ------------------------------------------------------------------------XMTANAVVSPLPSQPprrQP-T----------T-----------GATAMVRLVRESWARI-------EARQ-- >SRR5919202_1970091 -------VQMVPGGqvsstmvrslkvgetV---RlgAPLGQaltlyag--ERHRDLIMvavgtGLAPLRAH--LE--------RIDqEwqSTgRAPRVRLFHGARLPWGLYENRl--LQNLagRP-WFTYTP--------Vvsddp----------typgrkgwvGDAAAVS-GPLHGLLALVCGSPEMV-- >SRR2546430_6350501 --GGRResRVRGGQGGWV----SRAIVAEPQRGDVGRSGPAMGRMKVD--------RG-AGRDVVMVAGGT------GLAPMRAIIDDL-A-QWGENPRvhlfyggrgrggPYH------PPSLVSTAAAqPGVPVVavagaeaglshkeagspagggvrHGALAGRG------------ >SRR6195952_1380156 --DVALAGEAVRAIWFRLADQEADVAHWFGALLFSLAPHLRAQFPA--------QADRAARRLLRASIAAMSAVDRPQEFPAAIGTLareTR-ALGLDASADEPVGVALVGAVREFAGELWAPGADAAWVLAYSLAAEPARR--- >ERR1700709_350262 ----------------------------------------GDLDAD--------AT-AERELLVVAGGRRGGVGpaprGepaGPSGAGGGRPPRparLA-AGVDVRRttvivgartaedLHT------LDRFAVIGEDaPWLAVVgacesdplelglapgpvvegitrAGPWLEHDVVVA-------- >tr|A0A098BFR8|A0A098BFR8_9NOCA Flavohemoprotein OS=Rhodococcus ruber GN=CS378_10080 PE=3 SV=1 --MEAFAVARVQLSFAsivATPGGAERFATAFYTALWSDTVGIRELFPA--------GMETMRQRFATAVGWAVNRLGDPDAVTAFLTQLgrdHR-KYGVRPEHFRSAGRALHTAVRECTPPiLWTDALDRTWARVIDLLVGTMAD--- >SRR3569833_3303276 ------------------------------------------------------------------------------PNNTNHDKH-T-HRKRNPPehqniggkrpedLYV------LDDLRRLTAVsKWLTVTgvteegaipggdrgtlahavaqRGVWEYYDILVS-------- >SRR5215208_6178010 --NGRGRPRPDTAIIRRGVAGQPTIRHLFYDRLFEHDPETRLLFRS--------DLDRQRLRLLTMITAMVGPASDDLS------ATNA-GhAGVPPWRWLSLA-----NARDVADP-------------------------- >tr|A0A0J9XAH5|A0A0J9XAH5_GEOCN Uncharacterized protein OS=Geotrichum candidum OX=1173061 GN=BN980_GECA07s01957g PE=3 SV=1 -SFSSWEIAEIRQSWASMRDDQLevsqekanvgtasaFFCQQFYENLLGEYPELSVLFPS---------IKSQASSMAGILALVISQLDNLPRVrevLISLGKRHSRIIGVEVTHYELVGNALLRTLSDRIQDEFTPELENAWIKFFTYITNLMLQ--- >ERR1044072_9602616 ------LEQSGYTVVGRAADARELmLKVRSYVPDVA--------VVD-V-------RMPP------DL--------TDDGLRAAAEI-rrsHptV-SVlVLSQHREPAYMLELVGDDASGVGYLL-KDRVRDVTQFVDAVQRVAAGG-- >SRR4051794_28399871 ------EHEAGTDLLELTD--------ALVRAGVPCADAAQEAVAG-V-------ELPHGAQLPAER--------LADRLERRRVD------lD------------------------------RLLRFGEDAG-HLVLGA-- >SRR4029453_17830486 ------DLQALETSFDLVASRGDVLMDVFYARLfaaapa------VKPLFAGTD-------PRRQKAMLLGALVRLRGSLRGPPAFVPPLPRPgagPggE-APlrrhrSPAPEGHAARGPraaAWLPARPAGVRSGaatPRGQARRLWRPAGALPGGRRgpdrLHG-- >SRR5688572_12388254 -SMNEEQIKLVETGFQSITGRGERFISRFYENFFAASPKAEKLFAQTEWP-------NQSRKMLLTIMMVVDNLRDAAHIKKMLHEAnlvHQ-KFTLQADDFDALTDAMLRTLREFLTDDWSKEAEDAWRAAFAKINAIMLEA-- >tr|A0A0N7Z8G1|A0A0N7Z8G1_9HEMI Putative hemoglobin-like flavoprotein (Fragment) OS=Rhodnius neglectus PE=2 SV=1 -GVSKEGIAAVRKTWEPVYKDKENSGVFLFQVLFELHPDFEKYFARFkSEGakslFDNPMFLFHVkHKVMDSLNEVIDNLENDERLLKILKSVasnHK-KRNIKKEEFVTLGKVVLETLRRALGTAMNPEVEDAWTKVIDCAMSAIG---- >SRR5712691_10715499 -ALTLEQFRLIQHSWQMVKDGQfnafkaqqliadplGFWGLQLYDTLFELNPALKPMFQNT-F--------TQSQMLTEMVGAALGLlpgiLDQAlgeektavlwylPEYKiviisITYANMSL-SQNIDR---------------------------------------------- >SRR4029450_4347554 ---------------------------------------------SG-V--------TGSSLPKTLVREgvQSLTtpchRKLPlgtektaidpqlLPILVDLAARHV-SYNVKAEHYGTVGLALVTTLERTRGSRVAAPTKAAWVELWSLICTVRIP--- >tr|R7TL54|R7TL54_CAPTE Uncharacterized protein (Fragment) OS=Capitella teleta GN=CAPTEDRAFT_144794 PE=3 SV=1 -KLSAEHKTTIRDTWPLISHSLQDNGIVVFEKIFEVSPSIRTVFAASfGFpaspipDayelSRASNLRDHVTRFMQAVGWSVQHMDDLDTV-ttvfVNLGKRHIHLKSLEPDFFRVFSGALMYVWRSTIGPDlFTAEVRGAWCKLFEFMLQHLAHGY- >tr|A0A1B6EVA8|A0A1B6EVA8_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.22480 PE=3 SV=1 -VITERDKYLAREVWMQVETNYVLISKSLFTNWITEFPEHLNFFKGLlDSSyddfLTSPKFEQHMaNSVLPNVGIMISNLDRPTDFRRHILKLawiHI-RKniALKIDHFNILKGLILRTLKESLGRGIGRDHEVAMFKVITAGFNLFS---- >tr|N1VY19|N1VY19_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 OX=1257 ----KDTILELQRSLELALHLNPNLARDFYVHFLETKPEFQKFFQNTD-------METQAKKLLAMFGRTIERFGNLNQIHNELknlGKMHE-EMGIKVTDLAEIAPSLLYALEKSLGERFQTEWKPIWEEALGSLVRLMS---- >UPI0007D2C88E status=active -GLDHKQIEIICASWAEVKKFGtEAAGCLLFKKFFIVAPETFSMFDEFkDIPnwEDSTQFKHHCKIVMNIIGGAVGLLRDPESLDSTLEYLglkHE-GFAITQHHFDLMQVELINTFRDALGAKVTPDVERAWNIFYAYIVRIIVCG-- >SRR5437870_4959208 --MARVNPRSMAHA--------ATAIAAATTRASEFMPTPQFVRTP--------AMPTQRERLLGAIIALVTHFDRPENLLPALTAMgrrHE-TYGVSLGHYAAVGSALLATLRDFAGLAWSPAYEGAWARAYTFAAG------- >SRR3954447_20457037 -------------------------------------------------------------HKVKVEDIIVRGGGNL---MVEL--MntdAA-GS-----PLDTPVRAVTDG------TESTAAAREPI--------RLNPG--- >SRR6266545_1588040 -------G----CDLEQAVDTCPA----------A---LVIGLRPA--------TMGTL---------CYMGGLASA-------AVCcwrHV-RVVTCSQFF-------------------------------TTASPQSRQ--- >SRR6059036_2276597 -ALFPGTSHWVV---AAGMARP-ESKDHPMLTVAQKTLVQ-----D-T-------FAIITPIADDAAALLYKKLFELDPSLERM---------------------------------------------------------- >SRR5581483_12392512 -PMTPEQIQLVRLTLAQATAGEPSIGRDFYRRLFVLAPDLRARFQG--------DVEAECPKLKDTLKLAFASLSDLPFLIATLEALARrgVARGLSDQHCRAISKSLLWAIEQRVGSAFTPQVCNAWIAFLAVVVSILR---- >SRR4051812_13904716 -GMSPEEVALLRHSLDEMRADGPQAAEAFYAELFRLDPSARELFHL--------PVEQQSVVFFHELDALLSAVSDLPAFverSRRLGRMHA-GRGVRPEHFEAAAAALDAMLLAVYADGASPELRRAWRHAYRMAAQLMQEA-- >tr|A0A0N0S3I7|A0A0N0S3I7_9BACI Uncharacterized protein OS=Lysinibacillus contaminans GN=AEA09_04415 PE=4 SV=1 -MLSLETINEIKKIASAISVNGEIIKKIFIEKLQKNVPELLHIFYQIL-QK----SGRSKISLIDAVYSAAMQIEHIDRFVPAVMQVahkHR-SLGIQPEHYPIVGQHLVDSIQEALGNQATEAGIAALQLAFNRIADVFIQV-- >ERR1719171_419597 MGLSAKTIEIVKATAPVMAEHGYAITSAMYGSMLTADPYIASLFNPSHQKVLPgDTHANQPRSLANAVYAYAANIDNLGALTSAVTRIaekHV-SLQIEASQYDVVGEHLMAAVKKVLGDAATEDVCAAWTEAYGFLASLFIST-- >SRR6187431_1436969 ---------GAAQRRRTVWALARKA--------VRIGPDRANLVQG--------GPRGFEDEAaQHACDDRVGAADRPEifdSVVEDLGRRHA-LFGVTPAQYSAVGEALIWSLGEALGPALTRSRREAWSDFYKVVQLSM----- >SRR5215207_7267255 -----QAV-----------AGEPEVRGSILRKAVRIGPDRANLVQG--------GPRGSEDEAaQHACDDRWSRLSTR-dlrLGCRGFGTTSR-TVRCDAGSVFGGRRSL---nleLGRGARTRADPVQARSVERFLQGGSALHVEG-- >ERR1719491_1400349 -------------------------------------------------------RQRRFTHMGAASGRPRAAVALPGARA----SLhdrPR-PHEAE-ASVASRCEATIKTLRDLLGDDCTPEVENAWAVVYGFMSSIMVESLR >SRR5919197_656730 -LLDDDTIGLLDESLRLIDDRSDVVVNHFYAAQFATPPPRGLLGSR--------ARGC--------LGRGVR--------RDGPGDVgrrSR-GGGGRAGLV--EGRD------------------------------------- >SRR5688572_8260099 ----DQEINIVRQTWNRLAAeHGNSVAEEFYKRLFECCPHLKDVFKN--------DFEVHGKEFIENMDHIIIQLDNPCMirEMQILGIKYA-SYGIRYEDYECMKKALFDALKTKLAEHWTPTVMVSWIWFYSTVSHIMKH--- >SRR4029077_8414069 -DMTPAQLQLIKKTLPEINASDDLFAAEFYRQCFDLWPETRSMMPG--------DLTERGRALVAEFIALASCVSgDMDRVVARaheLGVRHR-GHGALRAHHEVVEQAPAAPLASVLEDGWDEPTAQAWH--------------- >SRR6478736_6664572 --LNAVEIARVRLGFARVVPNCGAFADDFHARLFELAPTTSALFPD--------GVSNRRAKFRQTLVMLMTSLSTPTELKPALAALgnRCRACGVEEADFAAISQALIGTLAAHLGTKLTIADFDAWTALRGRIAGLLTA--- >SRR3546814_7943381 ---------------------------------------vfirlslsliiilvyRFLFFFFSSR-----RR-HTRCVLVTGVQTCALPIS-------TDELIa-----AWAAAYGQ--------------------------------LADLLIA--- >ERR1700737_1149585 -----------------------------------------------------------------------------KQPDGSAEKHfeqAC-ESGRPTGAVSHCRGTPAGCDQGSVGRRRNRRDHFHRGKGYGNLADILMG--- >tr|A0A254VKN7|A0A254VKN7_9BURK Nitric oxide dioxygenase OS=Xenophilus sp. AP218F GN=CEK28_14595 PE=3 SV=1 -MLDDATRAQIRHSAALLHTVGDQLVEHFYQRLLRHHPELGIFFNATHL-----HKRELQAAMSRAAAFYAEHNDQPENLQPMLQHIackHA-SLGVRPEHYPLIGEHMLKSLEEVLGPLASETVLHTWRMAFSELSGKLIA--- >SRR5215470_13616785 -----------------------------------------CMVTL--------CHCSFTqtcscGTRRRGICSRFRWLPSATGWCMRWAGScptSR-TSTPSAGTcRTWGASTASSAPSPSTTPTWTPELAADWKAAYDLVAQVMIG--- >SRR4249920_1577195 ------------------------------------------VWPC--------TATRCRCSSTRTC-----scgtrrRETCSR--SRWPYSAtgsCT-RWP-GSCPTSTTWTTSASTCRTWaaSIASSAPAPAADWKAAYELVAQVMVG--- >SRR5688572_1436081 -RPAPEVIAAVSASCQAVADRPVRLAEAFYEHLFEIAPQARTMFPA--------DMTAQMQRMSDTLVGAIAQLEKFdtAQLeaaLRRLGADHRTRHGVEAEQYRYVGHALTRAVRDVAGLAYSGALSSAWIAVYQYIEAHMSAG-- >ERR1740124_2148144 ---------RTRGAAALLLQgRAQPCGVAQAQEACYVCDEHCRCCSQGSgGPqqacarATGPPAHMPYA----THRCRVCCRIGIRARAPPTQALgkrHV-PYGVLPAHYDVVGQALLATLEGGLGAEWNDQVKASWTAVYGIIAKTMIG--- >ERR1711911_258465 -------------------------------------------------ritHGWEHVVQMHAMNVMNSITSIVDTLDNPESLVDDLKQIglnHR-KRPIEAIHFHVSIYAATEGVQHVLSEMIQSNIDDSAKYLRPVDGSQCDS--- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold8273257_2 # 299 # 427 # 1 # ID=8273257_2;partial=01;start_type=ATG;rbs_motif=TAAA;rbs_spacer=15bp;gc_cont=0.364 --------NELQTNIEDVYSAGDV-C-----ALFDSSaNRYRPtrtwlscafqgEVAALNM-------LGQDKVynegvFFNASHAYRSMYAVLGNFNPAQAD-gfeFF-VCNQDKENYE----RMVLKDNKIAGAMFVGSMKNVWSVKQLIEGQVDVS--- >ERR1711934_740551 ---SEETIRIVKSTAPAMKQHGYRICTTMFETLFAEHPSLASMFRKEDH-----TVQ-pgesyerQPLLVAqavrhsprflflapdshpllilipfsssSRCTRTPSTSIISPRWSPPsrgERERA------------------------------------------------------ >SRR4051812_844822 --TEPDTAFIAQSQLARIEAMGEELVQRFYAHLLA-APEMKQLFLHTE-------MARQHRRFLDQLTSAVRELRSPRNATAHLAALgarHR-GYGVKPEHFSLASSALLHALAVVIGKEFDARAASAWKEIIASLVILMNL--- >SRR5680860_1220841 TQLTAEQKHLIRLSFLRIEPALDLVAQLFFLKLFRLDPSLRKKFSG--------PIDVQARKFAAGAKLAMISLGHEDGLaptLKLLGARHR-QIGIRTRHYRTMSRALVWTLERSLDKAFDRDTKDAWNTLTAQFTKVMAG--- >ERR1719167_531039 MGLEQADIDNIQESWGIAKSKakLREHGVNFFLLLFTTLPEWRsKDFSHLgDGtleeLKTNPKFRAHCVLVMSNLNYWVENLDELDMGGASIQKTavnHA-GRGIMAEQFETVLGVVLKYLQGALAENLTEAMVESWTTLADTIVNIIKELN- >SRR4030088_1427564 ---------------------------------------RRGRDGGQP--------R-RRELRRDGQepdepDASRRGDRGRPCAGPASR--------------R--RGSAAGCRSSPPSPAWPALSYEQWRETCDTLHGhTQVLG-- >ERR1700752_5389668 -----------------------------------VVPQVPAARSRVP-------LR-AASFRRGGLehdpdPKGRVSAKQEPV-FGK----------------D--HGQTIRLSARGQSS---PrRNDAARETTCKEARMtPEQVK-- >SRR6218665_550821 -FLSEEELTAAKSTWVRLQAtrNMQAMGVKIFLRIFELEPATKQAFESFrNLKseelVTNVLFRSHATRFMKAVEVTMNNLDALDVIivpnLKHLGRLHTDFKGFHVEYLKAFEVAMDEVWAEELGTAFSGDCRLAWTKIFSLITTKVMEGYN >SRR5690606_39778542 ---------------------------------------------------------------------HATSVTSSHPCTPPVPcqcarrpALprlLRSsptrrssdlsL-MIKPEHYPIVgENLLASIRE--VLGe-gATDAVINAWA-EAYGFLA---D--- >tr|A0A257MW93|A0A257MW93_9GAMM Uncharacterized protein OS=Methylococcaceae bacterium NSP1-2 GN=CG439_2278 PE=4 SV=1 --VKVKNRLLVKLCIDEISPKIDIVSQLFYQELFHLNIHLKTIFSG--------NVTFLNRKFINMMATfkNVKHLEAIENSVEKMGERHVLHYRVQLKHFPTLKKALLLALKKHLGERFNAELEAAWHEVFDDVAEIMQRA-- >SRR5690554_3276444 ---xmSDADRLQVQASVERIRGQMDGFAGCFFDKLFALQPALRELLAT--------E-EGRRSKLRSMVSTlaNSRDFDKIAPAIRRLGDRHR-DYGVGVQDYVPVQQALLHAVAQVDPQGQSEQVQQAWSGQFQRISALMEPQ-- >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold510383_1 # 42 # 362 # 1 # ID=510383_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.393 ---mTSKDRALLKECVEYIEsESINELCDIFYKKLFDLDPKIKLILSD--------NDVVLRRKFFNMFSTfkSVKYIDKVSEIILQMGARHK-SYGINEKHLELMKEPLFESLHEVLGDEKFNYYKAGWEIGYQEVENLFKEG-- >SRR5436190_9873117 -GITHSDILLVQTTWNAVSEFSMKIVAGFYKHLFAAAPEVKPMFTT-ET-------SEQQKRMGSMINTIVNSADSLDEFRgsiSQLAKKHV-HMGVKKEYFPIVVKAIISSVEDQYGSGFTTAHKKAWYKILNEISNIMIEE-- >tr|A0A1X1R5G7|A0A1X1R5G7_9MYCO Uncharacterized protein OS=Mycobacterium bohemicum OX=56425 GN=AWB93_09655 PE=4 SV=1 ------TTSPVVVSLELYAEHVGDPIPIIYQRFYTAHPDAEAEFAG-DH-------HLEQRMMGGVLQMLIDLT-EGSfapSGCTYWLWDHI-GWGVTEQMVCDMFEAVVATIREGLGERWTPDMTSSWRDLISRLQPVLHAGF- >SRR5699024_11940786 FRRVLFRSEIVKSTAPVLKENSDKIGKRFYEKLFSKAPELYNIFNQTNQER----G-IQQEALAYSVYAAGENIDQLDNLKELISRVtekHA-ALGVKADRKSTRLNSSHVSISYAVFc----------LKKKX------------ >ERR1719310_1734953 ---SASSVKAVQASWAKAENIGlRVVGELFFKELFEASPAAKELFTAqkFgEDAAGQRRFKAHTLNVMQTLSAAVYGLSDLSALARTLPAPtyaIL-SLSFTLISFTSL--------------SLTPLI-------------------- >ERR1712087_347811 ---------------------------------------HEELFTAqkkFgEDAAGKAHFKAHTLNVMQTLAAAVYGLSDLSALARTLPARiyaIL-SLSFTLITFTSLSLTPLIYHTLTLKGARARNSGRaaPWIRRPT----------- >SRR5438874_997478 -----------------------XM------CTMHRHALRFPPAPN--------WAATRTTTPL-TTVTHRTAEVHPGRFAGSLRWLgraHG-KFHAPPAQYDVVRAALMDSLRAFAGEQWLPEYDQAWRDAYDVIARRMIQ--- >SRR6266511_448526 -------RRRRRRAATSSGRASHRLRDsRLEARARDRSRRVLDDASS--------WVEVVRLGDAGEPVVLVSAVAAIAHRDVRRVELareGE-RVRL-------QVLNVDAEEDDLAGEHWSVEYDQAWRDAYDRIARVMIM--- >SRR3954451_10251525 -------TSARRqqWTFPRCGPTspRPQRPGTRARCTSTPTCSCAIPRPA--------RCSRSRWRT-SGTGSSPPSATWLPgsttstRSCPSCSSSggtTG-SSGPSrRTTRPSVPacWPRSSTSTTS-GARNSPRAGRrptTASRAPDVLATVMIE--- >ERR671928_16913 ------------------------------------------------------------ALYFDGIDTGR------LRVHQTKLLVqvtGG-PVEYDGRELAVAHGGLDITLEHFD-PGWTPELARDWTQAYQLVAKVMID--- >tr|B7G0J4|B7G0J4_PHATC Predicted protein OS=Phaeodactylum tricornutum (strain CCAP 1055/1) GN=PHATRDRAFT_46237 PE=3 SV=1 -----HRKKMIQQTWRAVEFgLDVDCTRIFYTELFRKYPSVQPMFQHS-------NMEVQAQKLYEVIRVAVRFLDNVQELIPVLKDLgmrHAKHYGVLREHYDAVTEVFISVLNNYILteldcgnaGIWAMEVADAWHWVLTFIGNTMAD--- >GraSoiStandDraft_52_1057288.scaffolds.fasta_scaffold278261_1 # 2 # 652 # 1 # ID=278261_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.575 -----------------------------------------------------------------------------------------------AFLPAQRRAKLM-TSRLSSEPPWKGPAAEPSWHVLG----TMVG--- >SRR4026207_1965376 -LLRRALCRRA-QSAAAVSRRPDPASGSFRSRHRA----GR---PE--------SGRNGRGRRDPALALLSKTLDEMAPLREPLRDLgaqHV-HWGARPEDYITAREALVAALGA-LSPNWDETLEGDWRRAITAIIVPMIE--- >ERR1719359_2370951 ------------RLIVTPEHlDGCRAGLLALRVVLLHLGEGLGLLGSDSSGvsDCGVALgel-------PLQRLDLLGVLLGPR-----L---gl-L-NAGVRGLELSLLGRLlrvglselfVAEGLLLGL---------------------------- >ERR671911_2215695 ---------------ELEPAcaPDKQLVEHVQRlRVEAGAQVVGR-----E-------EerrsragqcprptsRVDVRGTHDD--------APLECVAEVLVDCgahAR-VACKVDergraaleLLDRVVPDDLVVDLHAVDEVDGGGQTgHVGPGTSSRRVstarakpQAGTLPQ-- >SRR4051812_41451604 -------------------------------QLAAAGPVLGARFAGGD-------RppraaavrprprRVGRRGGPLDRVPPPPRRDAARAAGARLRGRgaaRA-AGAGGRDQPLRVRDARVGAPVAVRGDLGGAAGIAAHYPVVGAVLIASMAD-- >SRR4051812_21433834 -------------------------------QLAAADPVLGAGHAGGG-------TparaaavrapprRVGRRGRLLDRVPPPPRRDAARAALARLRGRgaaRA-AGAGGCHQPLRVRDARVGAPVAVRGDLGGAAGIAAAGAPSGSPWTLTRSK-- >SRR3974377_1684031 -IMAPEHKRLLAESFSKLENRLDDLGSLLFQKMFEISPESRSLFKG--------DIEEQKLKVARFFAEVIRRRTRShhflpvtgkggEVIIPgvgPLGARHEINYGVRAKHYGYMREALLYAISTMLGSEYNEEIGRAWGETFDMLAGAMQK--- >APCry1669189000_1035189.scaffolds.fasta_scaffold267513_1 # 3 # 467 # -1 # ID=267513_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658 -VLSDQHKKVIVRNWTILSTDLSGRGTRIFLLIFGRNPLIKSIFSFGHLegdeLVCDPRFKGHALRFMQAVGAVVDNIDDYNNaVkpiLNDLGRRHTQFKGFKPIYFNEFQDSILQVSENGTCKQngeiriLNPSaagvnfCTPPLGKFSASEMTCIVSsGA- >tr|A0A2W1CGM6|A0A2W1CGM6_HELAM Uncharacterized protein OS=Helicoverpa armigera OX=29058 GN=HaOG211460 PE=4 SV=1 -GMSLRDVYNVQQSWKTIHANPLDNGYLMFFRLFEADPETKTFFKILDNarSeadmKAYVKFKAHILNIMGALNNSVVNLDKPEVvvvWMEKLGTAHQ-KFNIRERHFWVFRDVLVNILQNDLK--LSEPIVKSWGRYVTFIYSHI----- >tr|F2Q9X8|F2Q9X8_BRAFL Globin OS=Branchiostoma floridae OX=7739 GN=lGb13 PE=2 SV=1 -PLDAWQRFYLQKSWKTVARKSDQAARTVFLRMLQDNPGLRQKWPRISlL-teeeiPTSPYIKFLGERIFDCLDYIIDNLGDLDHVISELtklGRQHSDMNVMTPEDVWAIEAAFLAGVQECLEDRFTIKYEEIYSRFIVFVIETMVIGFD >SRR4029079_30121 ---------------------------------------------------------------------------------------------------MHGMH--FWflnnHKNNKMTQKQTELVRSTWSMV-----AAMDH--- >SRR3546814_3749254 -------------------------CLFFFFCFFFSSIRRHTRCA---LVTG--VQTCALPILFNAIAAYASNIENLPALLPAVEKIaqkHT-SFQIKPEQYNIVGTHLLATLDEMFSP--GQGVLDAWGKAYRSEERRV-GK-- >tr|A0A1K0GS94|A0A1K0GS94_9ACTN Globin OS=Couchioplanes caeruleus subsp. caeruleus OX=56427 GN=BG844_22340 PE=4 SV=1 -GMNPaddaelhAVQRLLISSLEQAGGQVEVATR-LRAALAQAGPALFARIPG--------GPLAQVEQLAEGLAWLAQHTDqP-PALVAGFGRLgavLA-ECGIAPQQLQLAGAALAEAMRAgMAANGWRQDYDQAwrstWQHAYQWIAHGMVAA-- >tr|A0A077WN08|A0A077WN08_9FUNG Uncharacterized protein OS=Lichtheimia ramosa OX=688394 GN=LRAMOSA02110 PE=3 SV=1 -PPSQAQLNVIRDSWERVLSTpinnnntdqsstssnstlsttpsaSSAFHHAFFEALFTLDPNLTTWFPN---------VKRQARALTGIVSYVVRapailpvkyktykSLREMhqiqqtldeeeeqwmREQLKALGARHA-VHhQIQIDMLDHVGPALISALYQRLDSEFSPAMRDAWLHALHYVVYYMKQ--- >SRR6267143_1520378 --VTLEQIQMVQASFAKIAPIVGPATDRKLRRCSALVAGFrkeTRLSTG--------VSKNPGRSEVRGTLCGASCCGSLSS---------------------------NWVANIRRGI----------SP-LALAIASI----- >tr|M3IRU3|M3IRU3_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) GN=G210_0056 PE=3 SV=1 QELTPDQLRLITECIPIMEDLNLTLGSKFYRRTTRRHPHLQSYFNETHH-----KLLRQPRAFIFTLIMFAKNIHDLTPLRDVIRRIvskHV-GLQVKPDHYPLLGDVLIETLCDMFPYHmVDDKFKTTWSIVYANLASLLIG--- >ERR1712228_269173 ---SETMKGDVVRSWDMIQELgTNAVGERIYRVFFELAPEAVEKFPAHvRHkyrewtadeSddeadlR--nsAALRKLFAKVLNAVGCVVAGLLGDAFTPEVEN--awNV-VYGf---------ASSIMISGLKQAKEAAQVRALQDS-DCAV----------- >ERR1711918_283694 ------------------------------------------------------GSECSWMCRC---GIARFEQT-------RTTSHksrRA-TYRvqPDRGILAHPGESCDDHFGGAPWGGLHPEVENAWNVVYGFPSSIMISGPR >SRR6516162_1580517 --------RVRRARCSAatesTATNTASVPGCSFAYFFACAYSASA-----------------------------C-ASSCNlnPVMV--SWGAL-GSSLKRSHFDAFGDALIWCLEHQFGAAFPPELREAWITALRRGPNG------ >SRR5262245_22234373 --SADFDREPIREVLTRLAADPEVTMGYLYAWLFTAYPELRSLFPH--------AMTQTRAAVFGKLVSVLAGLDDRLQTEQALARLaidHR-KFGVKEKHYQPFFDALYVTAQHAAGSAWTREMAAALRSALDWFGSIMQA--- >ERR1719495_1281412 -MFKANEVTELRLSWNAwVAGDLANKGFELFCKMFEKNPDTKNVFDFMKGSsvtqmQGSSKVLFHVTRVMKNIDDVVKHADRLDEIVPILRQVggrHGtQGYNVPSGYFPFLGNALRELLRTKYS-GYNTNLDENWKKLWNFIVKEMHAG-- >ERR1712105_94955 -EFKPNEIMDMRVMWNGwVSGDLASKGFEMFCKMFEMHPETKNVFAFMKGSsvaqmQSSAKVLFHVTRVMKYIDEVVKHADKLDEVVPIMRQVggrHGtHGYNIQSGYFPHLGEAQRLLLKDFFKDRYTANMDAIFKKLWVFIVKQMQAG-- >SRR5260370_506041 ----------------VRD---YSSTCSF--------FFFLQAEDG--------IRDSS--VTGVQ---TCALPIYQERTEQVLSRLavdHR-KFGVRDKHYEPFFDAVFATAEHAAGPAWTREMATAWRSALDWFGSVMA---- >SRR5580658_2929351 ----APLRAIV-EEVLRSGGG------------------------------------------------------------------nvAA-GTGVRRNASLFHGAREPPGFYD--MpGLRELSSSYPWFQV---VP-VIS---- >SRR5258708_13478776 ----APLKAII-QGILRA----------------------------------------------------------------------G-GPLLRRETRPLVGAPRGQKALL--PpHPPGSGSVASRPKG---IS-L------ >SRR6266704_2687724 -----IARPPDR-RPRCGD---GVLLR-P--------AVHRQSRPA-------------RAVSLRDDANPRGGLPDADRAGQEP--GrraCD-RAGPRPDRQGPpqirrepeALPAVLR-RAVRDGRAFRRPGPDRRDGRGLA---------- >SRR6266536_777504 ----DGYREALDASFARVASSGEKAVAYFYGRLFAATPRLRGLFPA--------AMDYQRDRLLCALLQITQRLSN-rAALSEYLVQLgrdHR-PPGVPPAV--PGGAACEHPNPTLA-pGVAPllsgvraagqrvarVPHPRRPRRLGQHVPGAVH---- >SRR6202030_4225180 ----YRAN--A-EAGTFP----------------------------------------------------------------------D-STQEPPETGPYRVAPSDARLLRKSLaLLEPQSE-------------------- >SRR5256886_2416282 ------DREADADREADADRDGDAEPEPLTAPALSSPPAV-PLAPP--------RDEAARQHdEPEPAPPPDQVPGAAdpretagppeppeeppP--------DgkgEP-AAG-----PDPAIAAGQEALRAFARE--afTSAAEEAWTQVYLAGSSLMIK--- >SRR5581483_8202477 ----------PDDPVFDGMqgNVGRvaarylphrEGEAYVAGPVGMVRETIRALTRA--------GLPRERIHYDDALLAEDKQASAQgvagatahtsrtpessrPGRTGEAGNAgpdGH-IrrvaesdqAGPAGGTAEPGQSGLRDAAADIAPQ--------ADTAHQDGGPHDDQagA--- >tr|A0A2G8KCQ8|A0A2G8KCQ8_STIJA Globin (Fragment) OS=Stichopus japonicus GN=BSL78_17342 PE=4 SV=1 -GLSTVEKDHIRKSWTALMKNKNENATLLIVNLFKMSEGAQDVFPKFKGknpdeLKKSIGVRSHGLRVLAALNSVVENLDDIECLVDMLQHIaHShHPRGTSRKHFEDLGGVVIATFEEALGKKFTDDAKNAWAKAYGVILGVIKSEY- >ERR1719203_2782565 --------ITSKFGWTSNMQ--------------KIIQSQTHSKTQDMqrDYYLNQK-KTLEI----------------NVRHPLMKELlrrVE-----DNPEDKVAKdMATMMFNTATLRSGFSLKDTVNFAESIELMMRQTLG--- >ERR1719343_1244138 --------LVGV-SWFfSSEKFsGRMQNFWILKALFGTSFPLLfvwvialVIVSIHTGSFIAPLIVX------------------------------------------------------------------------------------ >tr|A0A0P4VK04|A0A0P4VK04_9ANNE Extracellular globin OS=Glossoscolex paulistus GN=HgBp PE=3 SV=1 ---SAEDRRELKFIWNYIWASGftdrkAAIAGAVFKDLFQHYPSAHDLFTRVKVdEPDSGEYRSHLIRVANGLDLLIGLLDDTQVLDHQLNHLadqHILRKGVTQQFFKGIGESFARVFPQVS-SCFNV---DAWNRCFHRLANRISKD-- >tr|A0A0S2MLN3|A0A0S2MLN3_SEEJO Extracellular globin OS=Seepiophila jonesi PE=2 SV=1 ---NSLERIKVKMQWAKAFGYGasrAKFGDALWTNVFNYAPTVRPIFYSVNSkDMKSPKFQAHVARVLGGLDRVISMLDSEPTLNADLAHLksqHDPR-ELDPTAFVVFRQALIATVAGTFGVCFDV---PAWQQCFNVIAMGITGS-- >tr|A0A2W5I8T1|A0A2W5I8T1_9ACTN Uncharacterized protein OS=Lawsonella clevelandensis OX=1528099 GN=DI579_06450 PE=4 SV=1 -----TYYTVLGPAITLLREHPEDFMRHFLAAALTYDFHFHTFFPS--------VNDHHASRYTHALRYILEALDQstndpdcLDDVIDFLSQLgcdQR-KYQLTAEQYQSLAAALRDTFALLLPYQWSTELNDALLTSFEHAINVMQS--- >tr|A0A177JSP9|A0A177JSP9_9ACTN Oxidoreductase OS=Dietzia cinnamea OX=321318 GN=AYJ66_05610 PE=4 SV=1 -----AQAPPLLALRDLLA--DDRFPDLFARALRATDPDFRELFPR--------DATPVLREFVRAMTWAFETTEYahgdrskVEEVVEFARHLgadHR-KLDLAPRHHQRFGEALTHTLRHLAGRGWDDRLETTLATAYRVLSTALQQ--- >tr|A0A173LPQ6|A0A173LPQ6_9ACTN Phenol hydroxylase P5 protein OS=Dietzia timorensis OX=499555 GN=BJL86_2914 PE=4 SV=1 -----DQLPALLALRELTYRessdVAPDFRRALEDALNTEAPYLRADLPR--------NLDGPFATFVKLYRFLLTRVEDsggdrakVDDVLDLCRELghdLA-KYNVVEEQYERFGHALNAALARVAGEEWTGELSKVQNQFYVIIARALHK--- >tr|A0A2N6TBK5|A0A2N6TBK5_9CORY NAD(P)H-flavin reductase OS=Corynebacterium kroppenstedtii OX=161879 GN=CJ202_05310 PE=4 SV=1 -----VHEASLVPVVTVLQTDGSRFVDAVFTHLFARRPSFIRRLPA--------DLSQLKPSFRRALVHVYAKQATgnglDRRTRRFLRHLaedHR-SFGVEAPDYVAMGDAIIDAGREIIAPQVTSEEFELFAMATGQIIGLMEE--- >tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 OX=1581089 GN=HMPREF3121_11375 PE=4 SV=1 -----------MRAAAAFGRQAPTIGPEAFRRLLDAEPRFRHMFGG--------SKTALRDQFMSALSTALVTRADvgrfPAATIRRLEQLareNR-KFGVAPRDYATLAEHLLDVFGERLPAgpdsgAQVDALREILDEAMSLI-AAAAV--- >tr|M3VCE7|M3VCE7_9ACTN Putative oxidoreductase OS=Gordonia malaquae NBRC 108250 OX=1223542 GN=GM1_049_00130 PE=3 SV=1 -------QPVLTVLRDRIAHDPDRFAVGVFNRLFAETPFLRELFPS--------EMSRMRATFTQVVDHVLDAIANdddHAELIEFLAQLgrdHR-KFGVIGDHYWLMYDALMAEFAAMLGPGWSPDAQEATSHAMMLMTGVMRG--- >tr|A0A2D6MQX9|A0A2D6MQX9_9DELT Uncharacterized protein OS=Deltaproteobacteria bacterium OX=2026735 GN=CL908_08110 PE=3 SV=1 ----TEDHELLLQSLDRVMHGEVDLSTRLYERLFSRHPELRELFGP--------NSIPvQEEMITETLISAVDDLEGLpwiEDNMQLLSQKHS-DADVTSEMYDWWAECVIETLAELSAPDWNRRLEELWRKQIARLCELMRAET- >SRR5207245_2384740 -NPQPST-HAVTEQVVTLDV------LPWTSGKLGLGPGKarlsEPLAPG--------DTLE---SL----------LERQRARIpgfeewvYDArerriheHCTLL-VNGQAEYRRHTAEVEI------------------------------------ >SRR5689334_4915957 ------------------------------TASQRVTP----SLRG--------KRVPSGQmgdRKVPD-VPIVDAHVHLWDPTafrmpwlDGNKRLNR-PYGLADYREQTAGLPI------------------------------------ >MudIll2142460700_1097286.scaffolds.fasta_scaffold02451_1 # 3 # 1031 # -1 # ID=2451_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.574 ----------------------------------------miGSRALA--------ALFPHPKTFMDTKRPVADTHIHLWDPGyltypwlETVpaiagph----G-PAELQVQEPETDRFRL------------------------------------ >SaaInlV_200m_DNA_2_1039689.scaffolds.fasta_scaffold02144_7 # 4497 # 5432 # 1 # ID=2144_7;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.499 -----------------------------------------------------------LQCGVATVRSVIDSHVHFWQPQrlrylwlDEVpair----H-PFTPHELNQATQAIDL------------------------------------ >SRR6266704_3508957 --------TITRAEFCAGRSNRgskQAFACECYATLIRLHPEVKPLFTHTS-------MEKQAKKFMASLTLVLHVLGKPDVLTTTLQRLgrrHQ-TMGVRVEHYPMVAEALLATLKSGYAVVLLT----LFVQSYMFL---VRKGA- >SRR6478736_5796684 -------------------------------FMMGV---IASGMVVTGA-----ERRGRPKAVQPGNREWITVIQAINAEGQA--------------------------IP-PFIIGAGQYHLANWYRDSNLPGNWAIA--- >ERR1711935_979896 -------YSEVMNSWQRVRRvkdFDKTLGVLVFSKFFSKHPDATKIFGIEEEgeelVDTSASFVPQATKFVGLCDNFIDMLGPdsdlLKDILAEEGRKH-ARRGVELYHYPAIGEALISGIRAM--DvKFNDDTELCWRKVYCGVTHDLGKAV- >ERR1712137_931585 --------------------------------------MGTSLLGVDCEgeefVKT-DSFVPQAKKFIGLCDSFIDMLGPdaelMAKILEAEGRKH-EKLGIKLEHYSTMGEALISGVKTL--DeKFNDETELCWKLVYCGVTNNLGKAN- >tr|A0A210PV81|A0A210PV81_MIZYE Globin OS=Mizuhopecten yessoensis GN=KP79_PYT16126 PE=3 SV=1 -GLTERELKMIKVSWDVLAEDKKSNGVKFFMTLFTIFPTSKDLFKHFkDVPldqlkydgettKSNKKMVAHAMSVMYALESYVDSLDDAYcleELVKKVAISHK-PRGIGPDKFKLLTPVLHAVIEDLVKDDDSvdlETIKSGWTKLIDTVCDIVEK--- >tr|A0A226E0J1|A0A226E0J1_FOLCA Hemocyanin OS=Folsomia candida GN=Fcan01_14017 PE=3 SV=1 VQLTPDEMIAIKRNWEVIHQDLTGNGMDMYLHWFAAFPHMQKVFKKFaQVPrdqlKTNDAFKAQATVTLHWIDDMIEAIDSPSDMAavmKRLGRMHQ-TRHTNIYDFREMVKRIQEVIGTKVGEGYTPAAESGWTKLFAKLVENIGD--- >SRR5947199_2475351 ----------------------DELARAVR---lQ--gSRRIMEEHAcG--------AEGRQLARLFDERGRLARAP---RAVDEPGLELgarvsdgrcglakigdvverivqaedvdavRR-AGGDELADEVIVS-------------rtRADDEtseqrepayrigprtqCSDAFRRGLERPAGAPVQT-- >SRR6266516_4891354 -----------------------------------------------------------------------------GLGDGGRAEGgnrDS-GRGEQLEHLGCVHDVLLSFSESTVSTlphqaarpapaaegagpAITRRetadrapprrhrvggfLRSAGAARARSSIDRMTET-- >SRR6266508_4596506 ------------SAFVRL-t-DARRVARCLPSAH---pGDETPSTFPs---------ETGDPVNLN--------------LEALETSFDLvapRG-DG-SEATEDDVVGHPGPPA--QVA-PRPRGDRPQAA---------------- >tr|A0A1Q9CVT6|A0A1Q9CVT6_SYMMI Eukaryotic peptide chain release factor GTP-binding subunit OS=Symbiodinium microadriaticum GN=SUP35 PE=3 SV=1 -VPSSGTISTVQQSWMVVKELgVANIGEIMYKHLFKIAPVTKSLFPVSvRKRyrdwscseeevedgfENSPALRNLFAKVVEAVGSAVAGLHNISRLVAELNALgmrHI-NYNMKEEFFEYGGQALVLTLQDGLGTSLTEDVKQAWVAVYEFISACIISGLR >ERR1719433_537024 -ALRISIVGREKRA-NCTVTLgRVEQGELQVGATVLLVPPGAECGVQSvEVDgrevrsaqagefvcmRLLgcQP---SVGHALSSVD---GPLRSATKLKVRSAQAgefV------------------------------------------------------ >ERR1719161_1849694 -ALRVMVLGMTADKVG-AALEgHVEQGTLRAGTRCLAAlsEGQAECNVQIvLLNgvevshagpgehvrlKVTgaAAKGFTAGQVLSCIS---NPVRAIGKFKAKLRLMslpEM-LS----------CSLLVL---------------------------------- >ERR1719271_149007 --VSARERRLIERTWEKAKEDgCDALGANLLQTLLVAEPQVMQLFPFKDEenVYESLRFKAHASKLAVIIDAAVSLLANPVKLEsllISVATSYEYsFKQMLPEHFPLLGEALIRTLTSIVGgTKFTWQAESAWRKVWTIISTVMIGAI- >DEB0MinimDraft_4_1074332.scaffolds.fasta_scaffold429043_1 # 3 # 377 # 1 # ID=429043_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.227 -------LELIQQTWEKVKPHGKEWGPKFYNNMWTKYPEVRAQFFP--E----SKPEIQGPRLYASLNFMIKNATDIETLKqycFNMGDRHK-KYHCAAEHFKVVGDAFIMTLTEFLGDEFTPEIKQQFQLLYDTVAEMTI---- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold5203666_1 # 3 # 269 # -1 # ID=5203666_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.315 ------------------------------------------------------DFESQGRALTRMLAWIIQNMSNVSQLVPVLAQMggrHE-IYGVKDADFGTFATTVANSFRSVLGPEIiDDDAHQAWESCISGIGGLMQL--- >SRR5438477_4839339 -------------------------------------HGIEP-IPH--------RYAAIRRVVSGRE--------------AQARRVgqrHH-AAREDQRR-------LRGL----ERRRG-RPPARHVRL---------AA--- >UPI0003969FE8 status=active -----RPFEAA-----------------DRELLFGRAQDIRAVVEQ--------LRTDPLVLVTGDSGVGKSSLCRAGVLPQIREGAlndVR-RWSVAV---LSPGRWLLDTLGDA----LA----------------------- >OM-RGC.v1.018126893 TARA_122_DCM_0.45-0.8_C18859060_1_gene481717 COG0677 K02474 -----SELW-------RGRPRKTSLPAgssiRTRTAvlvplgrgketapssssanfvlnLTDVPPEAQELRiTA--------EVDDQRIHFQRRVPADVD-----KVVMELPEGSlarKV-R--VEVAAFD---------------------------RR-CS-IAAFRA--- >ERR1719491_698649 ----------------KLRAsedvsiSLIIFFSGSSSRFFKQQPDASSVFG-FDNNneniHKTPKFIDFANHFVEVIDQAVQMLGPdlelLTDFFVDLGDKHSKEYGIKPKFYPILGRVLMEQLEEMLGHNvFTVHTKVCWLQVYEAFARDMTST-- >tr|A0A147B4Z8|A0A147B4Z8_FUNHE Neuroglobin (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 -ELSVKDKELIRGSWESLGKNKVPHGVIMFSRLFELDPALLSLFHYStkcDSKqdcLSSPEFLDHVTKVMLVIDAAVSHLDDLHSleeFLLNLGRKHQ-AVGVSTQSFTEVGESLLYMLQCSLGQAYTAPLRQAWLNMYSIVVAVMSRGW- >ERR1740115_393061 NLLTPETVRVVKETSPRIASMAPALSSSFFKRFLS-HPDLAAYKASRH------NGEAKAAAVAAAVTGIGDSIDNLRSLsgaITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAWDEAIMVLADICVD--- >ERR1719469_1495088 NLLTPETVRVVKETSPRIASMAPALSSSFFKRFLS-HPDLAAYKASRH------NGEAKVPLTHTPP-------------FLSLPHPHS-SLPLPSSPFL-------SL--------------------------------- >sp|Q5KSB7|GLBB1_OLIMA Extracellular giant hemoglobin major globin subunit B1 OS=Oligobrachia mashikoi OX=55676 GN=ghbB1 PE=1 SV=1 ---SRGDAEVVISEWDQVFNAAmagsseSAVGVAIFDAFFASSGVSPSMFP--GGgDSNNPEFLAQVSRVVSGADIAINSLTNRATCDSLLSHLnaqHRAISGVTGAAVTHLSQAISSVVAQVL-PSAHI---DAWEYCMAYIAAGIGAG-- >ERR1719246_379870 ---TEKIKDDVQKSWDRILEVGiLYAGEVLYKKLFEIAPVAEEHLPPHIIAkyqqssfdageedqefVRNATLAKMFSKIFNAVGCAITGLHDLGKLVPMLLSLgarMG-GYWDSckydvaGNPWRYVFARCRASLDDGVRLhIVHHDTGFARGQGSCRVSX------- >SRR5687767_4837246 ----EKQVLLVKHSWSYQAGQLENLGTLFTKKLVALNPGLKAPMKR--------SLAETGSySLMVAMNQIVAALPDLHKAQNHIQVIvteYA-ALGITRSDYENALIAFLLALEKRLGKSWSDEIREAWIFIFSSLYH------- >SRR5215212_6395769 ---------------------------------ASLSPELKPLLKK--------LDQEKRLpHLFITVNDIVASIPDFKRSEKQALALiadYA-DKSISLSVYESALIAFLMALEKKLGKHWSSEMREAWILVFASLRQ------- >ERR1711963_100213 -SLSEGTVEVLKACHPLLKDVRRVIGKAFYNRLFKEYPQVKPLFSQSD-----AARTHQTLALADALIAFTGRQLLEG-F-EAKQRGqeRS-LRLRSLQAGSWQGLWRLPSRDRGERD---QNEGSQIKPQILTIQ---QDI-- >ERR550517_4578 -KFDPDELIALRLSWHAwVAGDLSGKGFDLFAKMFEQRKETKEVFAFAKgtDarqMQNSSKVLFHVSRVMKYIDDTVKHADRLQDVVATLRQIggrHGhNGYDVASAYFPYLGNALRTLIKANYKG-WDSKLEDIWTRLWGFITAQMMH--- >tr|A0A0L0FER9|A0A0L0FER9_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12208 PE=3 SV=1 MSLTPRQCEMIKSSWKEASQGgkptefrALRFVMDFYSHLFDLAPSTKSMFKG--------GMANQGKALVGMLDIVVNHIDSLATikgDVELLGQRHA-KYGVTSNMYVTAGRALVMALAPRIPDDeDKPECASAWMDAYSFLASIMCN--- >tr|A0A1X7UGV4|A0A1X7UGV4_AMPQE Uncharacterized protein OS=Amphimedon queenslandica PE=3 SV=1 MSLTSAQVALIESTWKVVKKDLQGAGNIMFLKLFQIDVSVRDKFPFRDVPyeelEDSESFLKHSLQVMETIDLAITLLlGGEMEkLVEalvDLGMAHA-MQGLKPEDFDHVGEALVHALGVALGKEFNDEAKKAWTLLYSVVTAKMKEGL- >ERR1712080_154454 ----DLQKIIVKHQWARSYNEgmsREYFGQAIWRAFFKLDPGARRFFTRVrGDDISHPKFQAHSLRILGGIDMCLSLIDDVPTFEAQMKHLqgqHI-EREVPSYYFDRLGTVLQEVMRAATGYCYDE---VAWGACYKYISDRIKANY- >SRR5476649_891947 --------------------------------------------------------ATSTRCCS--ATSRKCCRCSIKPTRPTASSsarwptpcWltqEI-SIawNnWARWHRPSStSMCRCKSsgNTIPWSAPrCSRRYVKCWAPRWRPmpsstpgpprtvsWRTCWPV--- >SRR6188768_2515855 -XMDSGQTALLKASFQRLSTVSELGAELFAGRLYLLDPPLWHHLGLG--------GRSAQHALLRMLARVIEDLDRFEELASTLEAVarrCA-SEGMDAAQFDTIAETLFWTLQQVLGDTYQAPIAAAWREAGGLLIGRMKA--- >APLak6261669570_1056073.scaffolds.fasta_scaffold275140_1 # 52 # 198 # 1 # ID=275140_1;partial=01;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.524 --WSTRRVKVVQRSWETFKStqaESTTVGLAVFKRFLRRSPAFLQLFPFRDQPLetlfLNAKVRLHCKLFADTVSRTVGLLGDSVAVKASLRELgarHSDLYKVRSGHYAAMGSALLEVLEHNLGESWDEETKTAWEETWAYITEQMQK--- >tr|A0A0Q5LAI2|A0A0Q5LAI2_9MICO Uncharacterized protein OS=Frigoribacterium sp. Leaf164 OX=1736282 GN=ASF82_14980 PE=4 SV=1 -VITSSHLTALRSTLPLVEARAAAIADDFYARLFADRPDLLrDQFNRGDQ-----AQGRQQRELALTIVTVARDVVgtqvgsgpagsatgpavpvapwsspapspwavrvAARETLSRLAQRHA-AIGVTRDEHDVFERHLRDAFAAALGDDWSGVVVDAWLALWRQTRDELVA--- >ERR1719383_514948 ----------------------------------------------RGRLvegrwRFDSARVKSCVddrqGCVETWQHGRRR--------SNAPQVgnhAR-GLRCAQAHYDVVGQALVTTLASY--CTFTDPVKNAWIKLCGVIKATMVH--- >ERR1719284_1849230 -PLDGRDVALIQHSWKEVGQaPADEVAREIFRNIFAIEPGALELFPFKNESedglwREGGERDFSKYFRHRAWCSGAVSFQKX----------------------------------------------------------------- >SRR5204863_5655766 -IMTPEAIGLIKSSYAGVTAIPRQLAARFYHELFTVAPNLRPLFPG-D-------LTNLQGHFEAALALVVRNLDEVEVLRPALRDLgaqHV-HWGARPEDYETARDALVAAIGALS-ANWDETLARDWRRAVTAIIVPMIEG-- >tr|F2Q9X2|F2Q9X2_BRAFL Globin OS=Branchiostoma floridae GN=lGb7 PE=2 SV=1 MSLSAADKKLVQESWDKVSKpSFADAGERVFLKLFRRNESTKAHFKKFkDIPsdqlAGQAVVRDHGEKVCKVLDDFIKGLDGSgDEAVKKVGRMHK-GLGMSNEQIDQMKGAIIEVLADAgFGD---ANYKGAWGKLWDRFMAVHR---- >SRR5580698_8666230 ---PDLEKMAARSPWLTVTA-----------------------------------------------------------------SLsaePV-SLGHGPRTEHgtvADVLARLGTWREHD--------------AYVCGSSAMVAA-- >SRR5919204_299658 ---------------------------------------------------------------------SDL-RSGPTSRCTHVRC--R-QQRSPPRHHRClRPRSPAPSWSARlsagfrssscrpstnRPARRRGRGRSTILASYTRLASVMLDG-- >SRR5262245_42249746 -AMTPEQIDLVQRNLPAVLSLQNRGP-RFHDHFVAVEPTRQFLFAGAD-------MGRQGAVLIDAIAVAIAASRsrEQ-DLSGALCQFHL-SYGVDAQRFQSAGKALVRMLEEEFGDRYFTQLGDAWIAACERVGQTIL---- >SRR3954454_16888348 VISRSAVIRHVLPTP----aepAAVDHIGQQVADRTSQQDRGERVLLNRT--------aHGLR--ALADGAARLRIAAQS-vadVTRTPLVGVLrqlRS-ALGDVSHRLCGLSDHAEAllgAIKDVLGDAATDEILAAWGEAYWLLADVliar------ >SRR3954471_17335278 VISRSAVIRHVLPTP----aepAAVDQIGQQVADRASDKDGGERVLLNRT--------aHGLR--ALADGAARLRIAIQS-iadVMRTPRVGVLgqlGG-ALGDVPHCLSGLSDDALGccaTCGCYLCR--------SRGGASWSFFCHaalr------ >SRR5215204_1408335 ATGGPTRWATMRGRWPLMS---------MLESIAQSG-SGRPVWYVHGA----RDrrahaMGDHARALAADEHAGK------------HRAVrqrT-------------------------------AG--------------------- >ERR1719446_1443192 ------------------------------------------------------------------------LAQDLSALCPE---Cgfk------VG--TMGVC---QTK------ANDAAIE-----------AKDPPVAT-- >sp|P02214|GLB_BUSCA Globin OS=Busycotypus canaliculatus PE=1 SV=1 -GLDGAQKTALKESWKVLGADGPtmmKNGSLLFGLLFKTYPDTKKHFKHFDDaTfaamDTTGVGKAHGVAVFSGLGSMICSIDDDDcvbGLAKKLSRNHL-ARGVSAADFKLLEAVFKZFLDEATQRKATDAQKDADGALLTMLIK------- >SRR5690242_2028058 -------LALLLQSYGRIGILIPKISENFYRRLFQLRPNLAALFANR----------DADLKVEEMLRRIVAHASDAAAAKAEVQssgRSHA-QWPLLPEDYRVAGECLIQAIIEAEGAATGSVVASIWRQAYVEVANLMIC--- >tr|A0A2T7P4S4|A0A2T7P4S4_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_10993 PE=3 SV=1 -VLTVQQKDMVQRSWATVMRrDLTAVGMLLFKNLFQQEPRIMTLFSLEasDDedLEQNLRLRLHAARFMQAVGAVIDNLQTPndklSALLSDIGERHSHLHSFHHEYFRAFREAFLTTLEHSLGKDrFKGELRAAWDSVIGFMTREMNHGHK >SRR5581483_4049588 ---------------MRIAPHKEEFAATFYQALLEKYPHLSQFFVGVD-------LKRQQTSLIATLRAMLNESERGEalrMMFRKIGQKHA-DQQIRAEHYPAFGQTLLDTLALYD-PQWTDDLRKGWATALEQSVRIMMESYH >SRR5690625_2040278 --------------------DRDGFGARFTEELLSRYTEIREALPD--------EPAWVARAVTAVTDALIDVADDPGALVTVLERLgvdNR-TVGVHSAHYAPIGHALILAARAVGGTAWTPDIERAWVDGFDVAAEVMVT--- >tr|A0A0Q9HRJ4|A0A0Q9HRJ4_9BRAD Uncharacterized protein OS=Bosea sp. Root381 GN=ASE63_23130 PE=4 SV=1 --MGDRAISLALASLETMGSEAEQADIMFNIRLLETYPDVYRVFCM-D-------FAPEERSFLRALAFILAHAGPFGAIGPTVRALapsDK-VCRLISSRYHELEETLMWTLRRRLGVAFTAEVENAWRSVLREAPGVS----- >SRR4051812_34838903 ------------------KPIRNRAIKLFFSRLIESHPSLLTVIGD-D-------YEAKARSLRPAVEMIIGCLGNMEALRPILRSMarsNA-ELGMQEHHYLTAVNTILWTMERCLGSAYSAEVDAAWEDVCWQVCEAM----- >ERR1712110_1394717 -ILSKEETTLISASWDLVATDIPGNGSKFFTFLFDIHPDVRdKYFQPLLQSSTdvQRTLEKHGAKVVNAIGSLVTALNTedDGKLVTIIRQIthnHW-NRAItNSAPYQLVLDALLEFLAVALGSQLSPAGGAAWKKLFDAFVVVV----- >ERR1711953_1620069 -------------TWAIVKLNMDKHGYKFFIRLFLDHPRIQtKHFSSISTSA--QSLTAHGLRFMMGIDSIIRFLELedEEGLRKRIQQIvtvHF-FKGItDPLDFEVLCNCLVDYLSTEVfGDHQL----------------------- >ERR1719210_139600 ---------------------------------FTLL-----DPPGQkRNvaqawSavvqadvaiLVVSANPGEFEAGLAK-------------------------GGQTREHAVLAKSAGVENLVVAVNKMDSVDGEGKWSNLryee------I------ >ERR1719428_2447797 ------------------------------------L-----DAPGLgAYvpavwVaatqadiavLVISAKAGEFEAGISK-------------------------GGVTQEHALLAFSAGVTSIVVAVNKMD--DASVTWGEPrfkt------I------ >ERR1740121_1193106 --LSESERDALQQSWVQVQKVgFDCVGEVFSQKLFELAPSTHARAG------------MEWGPVVKGIGHTVDYLSRLEAVAvryRRLGVLHR-CIGVTERELKEMGDAFILTLRDVLGK-------------------------- >tr|A0A158NI97|A0A158NI97_ATTCE Uncharacterized protein OS=Atta cephalotes GN=105620364 PE=4 SV=1 ------------------------------------------------------------MNIT--NGTIHDILSGGK-NtqkV--FL--FR-HRGRTKEVVEKEEKIRVAGLDtngshradCPKGTDEGREIGDPVTDSLLQMLQKKE---- >SRR5260221_159328 ------ALGLVREGFAAVIARPDVFVSELYQDFFTSNPRYRKYFGSADIGySgsadingtGSpeighaaADITRRNAKTVEAATRIVADLDRPGVLLPYLRKLaleYR-KYGVREAHYRAFAGSVMTALERTIGQAWTYEAAEAWVDELTMVASAMLG--- >ERR1711862_565156 ----------------------------------------KIMFHFPvnmNIetVLKSKIFLQHAKFFVKTLDITIGLLGpdtdIIQDVLLEHSKTYQ-NHGVNSAMYLHMGESILYALEKDLGDvNFTSKDREAWAYFYGTIVGVIVGG-- >OM-RGC.v1.029911412 TARA_036_DCM_0.22-1.6_scaffold294997_1_gene285712 COG0526 K03671 ---------------DRLRARGEPPSGNPYRGAAPYGPGDEALFFG--------RRAE--------LEVLIDRVQkTPFVLVAGDAGVgktS------------LCSAGLLPLVREgalGGPRHWACESIACGEEPLAALAAVLAR--- >tr|A0A2S2QIF8|A0A2S2QIF8_9HEMI Globin OS=Sipha flava OX=143950 GN=GLB PE=3 SV=1 MALSPVQISRIRRSWSALAQDPTELASALVIRMFKENPEYISLFKRLkGLsideLQSNSQFKAHASKVGGALGATIDHLDKPEKLeelLTDIGIKHR-KYGLSPKHFEVIRNVLIAIIAEAIGD-TDPELLDLWKSSLTGVMSII----- >SRR3546814_18929724 -----------------------AITNAVYARLFQNKe--IEASFDRAAQ-----TSGEQTKRPSAENLAYAKNIDKLHNLGSAVSHMvarHM-QTVVRPHQYPHGPTALQHSNSAVPGQQMgTNTDPTP----------------- >tr|T0QF73|T0QF73_9STRA Uncharacterized protein OS=Saprolegnia diclina VS20 GN=SDRG_06019 PE=3 SV=1 --ISKDVQALVLANWAAISSGSTPallkikpaspvvyFYDYFYGMIFEKAPAVKPLFRS--------SIIVQGKALINIIQSITSAVNAPNviEKVCDLAYRHN-KYGVKIEYFNLLGKCLLLAMHDCTGDTFTDELREAWRAAYAYMVMVMT---- >ERR1719402_1510571 -AISSITKSRSMYLWSIllnrkqhLEAFSVDNGWAAFVVFLLGDPHLLEG----------------GEGS---QDGSSNPYGVFPLRwsnDLHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQASFNNLLQFLVGNMKVGL- >ERR1719187_1205752 -VLEDAEVEGVQTLWAEVSGDLAQFGARVFGRLVRDQPTIRKYFPWGrnDKTeeqlVDAPDTQKHAEEVFGALGKIIGAADHLNDYrsfLVYKGMQHI-PRGVKPEHFVYLKAALVDTLKEELGDKVTPAGEEGLNKVYSFVEKAMSKGL- >ERR1719369_91055 -SIKRQFHLHSEAGWAKFAEDVAGNGAATFITLVHDHPEIRSVFPWGgkSY-lsVDDPDIRHHAELVFNGLGVAFNRIGHIHSLdgyYESLGLRHI-ARKVEMSFFDYVGDALSQTFQQILGGGYTADFKSGYSKVYAYVTQHMTAGL- >ERR550519_2895140 --LSKAERKEAENAWRIFEVNLVDNGVDAFLNLVRDHPNRKDAFPWVkpELSeealRNDPEMKKLAKLVFSAVKPAFKSLGDLQSLtnyYLNIGNELS-LMNIPPVMVSYLSDAFKKTCQKLLGSDYTHSLEASIEYVYDFITSRMFEGM- >ERR1719150_2276450 MGLTKAQVAAIQNNWATVSQNMQDVGDALFMRYLTANPGDLSFFPKFqGAGvgpqlHSNEDFQHQTLTVMQFLGQIVAHLGDIPAAEGMLRERvktHH-PRGISMAQFERLLDLVPRLVQEICGA--SGPTADAWRVAVATLMPSMRDEF- >tr|D3DIC1|D3DIC1_HYDTT Bacterial hemoglobin OS=Hydrogenobacter thermophilus (strain DSM 6534 / IAM 12695 / TK-6) GN=hmp PE=3 SV=1 --MSPEARLNIIKSIPFLQSYGERLTSRMYEILFEGNPELKSMFESD-----------DSTKLAGALLAFAQNLERLNVLEPAlnkMALSHV-EAGVKPEHYEKVWDALYKAMTEFG---ISNEIIEAWKEAYYFLAELLIKK-- >SRR3990170_2029843 -----------------------------SPCTTTRSPCWTRPCAS--------WAT-----------APTGSWAtstpPSSSRLPSCAR-csRR-RWTCSATG----CSRRSPAPRHYAEDVWVPELEDAWLRAYAAMSTTMIEG-- >ERR1719356_276690 -SLTEAELELIETVWAKAKAlVAEEFGMRLYRQVFDIAPEALQLFSFRDDSdpYESAEFKRQGQIVIAAFGKAVAVLRDPEALAPALDSLGDalaiSTDKVMLPHDRSVGKALLRTLRLELKDEFTLEAEKAWAKFWRILARTVQ---- >SRR4051812_22538299 --INADTAVLIESGWNAAIDANGDFAANFYQNLFAAAPVVIELFSG-D-------MTEQKGRLTHTLAETVALLHNPEHLLLLLRASgvrHH-HYQVKQAYFGVMRNILIDTIAVRAGELFTAVHRQAWEGFFDNMATIMQGG-- >ERR1740128_83505 -GLSQREKQDIRHVWSLVSQDLESAGMGFFLAYFKAHPEYQSKFKAFaKvpmdELKDNRSFQMHAMNVMNAITLIVDTLENPEELVSGLKEMgvnHR-KRRIEAIHFHTWRRCCWPSCRVPWVRLSLNRPrrvgAKRWVSSSAPSWRR------ >tr|A0A1E4GLJ3|A0A1E4GLJ3_9CAUL Uncharacterized protein OS=Phenylobacterium sp. SCN 70-31 OX=1660129 GN=ABS78_22870 PE=4 SV=1 MATAFARAADIEASLELLAERDIDPTARVYQRMFELHPQMEPYFWR-DTD---GK--IRGE----MLSLAFAAILDFVGErryADhmIGTEMinHE-GYDVPRDVFATFFAIVRDALRDLLGADWTPVFESAWEEMLAEIESYARQ--- >tr|A0A2A5EUW5|A0A2A5EUW5_9RHIZ Globin OS=Rhodobiaceae bacterium OX=2026785 GN=COA62_02605 PE=4 SV=1 TQACTAASDPIVASLELVVDKCGDPTELVYKRLFAQHPDMKPLFLL-DKD---NS--VKGN----MLSQVLECFMDFTGKqhyAAnlIACERvnHE-MIGVPPEVFTTFFTTVVDTFKDILQDDWTPVYDAAWSDLVNDLTVSVDE--- >ERR1711860_359782 ---LFSKSNYVFAS-----------LSRNTFKLFKDERSLYeKHFSSFDVN-DILRIRAHGLKVMKAVNSMVEAVSDENdeSLIDQIHFvahGHH-LRGITpRNEFEVRRKILNLDYHLLFHyllkkGCLSQSX-------------------- >SRR5256885_15743076 ------------------------------------------------------------KARMQPIATSDDALDRPAATVPALHARgtrTG-ANGVVDQHAETVGEALLWTHSKGSGRSPGaqgasPTIQHRDVHAMGVLTPTFRER-- >ERR1719329_2070839 -AMSDETVATVDATAATVAPHALDITKDFYAEMIESFPSVvLALFNPPHWR---RR-cARTPPTsRTCHHCSCLAAPSTPSITDTarsR-SFRHT-TRWCTTTSCGRWQRC---------SDqSWAARCPTPWST-------------- >SRR5256885_864722 -VLTDRQRAIVQSTVPLLETGGEALITHFYQTMLGEYPEVRALFSMAHQQ------sGAQPRALAYSVLMYAKHIDRLEALGDlpaQIDRKST-RLNSSHLVISYAVFCLKKKKRTGSDS--------FTRSE-----RLVV---- >SRR5256885_6575144 ------------------------------------------------------------------XMVMSMRGPALEAAGTtgcRSCSAAV-CCSFF--------FQAEDGIRDYkvtgvqTCAlP---------------ISDILIGA-- >tr|T2IER8|T2IER8_CROWT Uncharacterized protein OS=Crocosphaera watsonii WH 8502 GN=CWATWH8502_4740 PE=4 SV=1 ----------------------------MYEIAFNERPEYRRFFKNTHMK-SPEEGRKQAAKLAASVYAYASHIDELWTLNKKTIvsvNFTL-NI------SPELK--------------------------------------- >SRR5690625_6805322 -------------RSPSHSQtltLSPYTTLFRSRNLLRNHPELKNYFNTANQ-----VNGFQPRALASIILQFAKNINHIyeiVPKLERVCQKHC-SLGVQPRSEEHTSELQ------SRGHTVCRLL-------------------- >tr|A0A244CWV0|A0A244CWV0_9GAMM Diguanylate cyclase OS=Pseudoalteromonas ulvae OX=107327 GN=B1199_05805 PE=4 SV=1 ----------------------------------------------MET-------VNSKAKVLNKLLIA-------TSVVLISFIvslQLA-GVEMGQSSIIAILVFGIASIG---AMAF-------LYKAVEQIADKLNVIEE >tr|A0A0L0EW98|A0A0L0EW98_9GAMM Chemotaxis protein OS=Pseudoalteromonas rubra OX=43658 GN=AC626_03140 PE=4 SV=1 ----------------------------------------------MNS-------QSIQSSLNNKIIIA-------GVILVISIVvgiQLG-ASGAENMQLVAVALPLFGVVV---ALGY-------LKMALSAVSAQLGCVYR >SRR5688500_16794215 -----YDARVLRGSFAQLRPRIAQYSPVFYEHFWRDYPETRPLFGR-NMSKP-----ELDTRINHFMLWVTENADRPHFTIDYiqsVARRHV-GYRIRRRHFAYVDNTNIKTLRELLGDSFTPEVERHWRASFRFLTLLM----- >ERR1719193_2756600 -----------------------------FM--EKKVPSVIV------FlnslsLDDDGALETHALSVMNSVNKVVSRLDQPDRLVQLLHDLgrkHI-SYKANMAFLEPIAKHFILTIKPSVA-EWSPEIEDAWQQAFKVIGHIMQE--- >ERR1712080_794265 ---------ASHVIPGESHGKHQSQRWIVFEKLITDGPEFKAIFGF-PGKRDDPAAQALGSKVLTKVAEAVGCIDDQAKFSSILHaegVRHK-GRKTEAAHFSKLGPAIIYMLGEVG---VAADAQAAWGVAFGLISGEMIKGL- >tr|A0A1E3PUG6|A0A1E3PUG6_LIPST Uncharacterized protein OS=Lipomyces starkeyi NRRL Y-11557 OX=675824 GN=LIPSTDRAFT_199892 PE=3 SV=1 -HLTPEDAIAVKESWKETIGLSpantvatssgspaSLFCNQFYQKLFAVRPDLEFMFPDI---------GRQSAAISGLFQVAlamLESIDALDDILLRMGRRHAFVMGIEPEHFELLGEVFIQTMRDRLGERFTPQIETTWVKIYSYLASKMIA--- >SRR3989338_7687732 --------PLVQATWKQAMDLgdgDKGFGRNFYKNLFTKHPGLLeTLFKGV-------SIANQEKNLPKSITAVLGLLTDMPKAVDALQQLgmrHI-LYGTPDAGYPIVGANVIYTLEMILGSDFTPEAKARWGEIYGVIQTTMIDA-- >tr|A0A1C7N598|A0A1C7N598_9FUNG Uncharacterized protein OS=Choanephora cucurbitarum OX=101091 GN=A0J61_09444 PE=3 SV=1 -PPTQSQIDIVRFTWGHITDTrlpsdkpeispSHAFGLTFYDTIFHIDPDFKKLFPNIiQQakalggmiSylvkspeiisSPSSDdstlhtqvstirqINASKRKRstASTFSELVletaaDdTLghlpdSDVDHFACKLQQLgsrHY-RYGTQIDHFSLFGHAILKSIQARLGKDCLPEVLKAWTRVYSFTMFHMQA--- >SRR6185437_15632065 ----ADDVAIVRDSYGRIGPRGAALTIAFFGLLSDRVPRVRKFFPP--------DDKDKRAVAKDLFDLVVGHLESQLNVRWVLERMgrrGL-LDTITPSDVSAVGGCLLDALAELDE-AWSPATERAWSRVYDWAASAVV---- >SRR5678815_1770797 -------GARVLASYRRIGSRASAAALAFFVAVQRGSPRVRRVFKH--------DDVDQRTLAKEVFDVVVGHLESPRELRSLLERMgrrGL-VDTVSAGDIDAIGATLVGTLRDFDE-GWSSDVEQAWNAVWTVSYTHLT---- >SRR3984885_15745818 ---------------------ASRAtgGGWLPTRSPTGRSARTSR------------TGCRRGRCDGNTRPTV--GG-PAALGGGQCEDsarDG-KLGLSADHADSAGAGRVdlAAVRHPGGAGV------------------------ >SRR5262245_20097952 -EVTPQQIELLEQTLSELRRQSVFAAQLFYCRLFSLRPRLRRLLSGR--------PDFHGTRLLSVMSAAVAGLSDPGHFAGLLSLAarpavRE-AL-LQGDCVRVIGDAVHWMLERHFGGQITVEVREAWRAAHIRITQVIE---- >ERR1700722_6370008 ---------------RGIRPHCPavrqhLPCVLPPH--VRAGSVASHAIPQ-L-------SAPLTATLTAALEALVGALGDLQPVlvrAPALGLRLA-SYGLQPTDISIAASAFLATLDDELDEVSTNAARAAWGCVFWTVAL------- >SRR5581483_4578849 -----LQIALLEESFELIAGQSVELADRTLSRLIELDPQFRLLAARTE-------MAALRSVLFSVLYVLRRSLHNLNTLAPALETLgalRK-DQELSSEHFGTIGIALLDAMAEVGG--------------------------- >tr|Q17156|Q17156_9BIVA Beta chain of the tetrameric hemoglobin (Intracellular) OS=Barbatia lima PE=2 SV=1 ---SEKIKEDLRLTWGILSNELEDTGVTLMLTLFKMEPGSKARFGRFgNIDSgmGRDKLRGHSITLMYALQNFMDSLDNTEKLrcvVDKFAVNHR-IRKISASEFGWIMKPIREVLMERMGQFYDPSFVDAWGKLIGVVQASLARE-- >SRR6266536_694904 ----------------------------------------GTRFAD--------SHRPPRTMERTGPLRDRLALRALRlgvgdvvwEDVPSLKRSmcg-----------AAAAGAAPVVAAVASAAPGDPQKHLKRADQVYAKSILLRMS--- >SRR5262249_10507301 ----------------------------------------------------------------------------NVKYSShhqQHGPQAR-GVRSTNLAFCCVWRRTEMG----------P-ATAVWSGVHCRDAAGMDGA-- >tr|W6FIG9|W6FIG9_9ECHI Hemoglobin OS=Ophiactis simplex OX=533354 GN=Hb_b PE=2 SV=1 MVVSAEQKALIQGAWTPIYAgNRFQLGVDIFAHFFKAHPNYANLFPSLvGVpnPSTSVELRGHAIRVLTGINYFVAALDEKKPvimeMIHNMARSHK-PRKLTREHFAQFAPVLFDTIG------VSGPARDAFLPYYNFIADNLFAE-- >ERR1041384_2362020 ----------------PLAPKANVLGERKvVAVLYSDLRGFGTL-----------SETGHAVDVLERLNDYFD------RMVAAITSHgg------------------------------------------------------- >SRR5574337_1776253 --VGLDDRDALRVLHAAFVApvdgngAANGLTAAIFDRWFGTDPSVRDLFPP--------DLDAQRAAFGQAMSWVYGELiaqraQEPVSFLAQLGRDHR-KYGVTQQHYETLSQVLHATLRHRLADAWTGAVDAAARDSLKLIFGVMSG--- >SRR5271167_3167484 --VGLEDRDALRVLRDAFNQedpgASNELVRQLYAHWFALDTSVRDLFPP--------EMDSQRAAFAHALHWVYGELvaqraQEPVTFLAQLGKDHR-KYGVLPSHYDTLRRALHATLRTQLSDAWTDAVEDTACQSLNLITGVMSG--- >SRR5258707_573086 ---------------------------XMILKSFKPNAAIGC-KTI-P-------TW-----FVP-LPTFTAGLTLPKLYPLSVFGMRRyNLGGLGEPH--QVEAALLWLVEKQFEGVLTREMRQAWVQFCQWLVL------- >tr|K0T9D6|K0T9D6_THAOC Uncharacterized protein OS=Thalassiosira oceanica GN=THAOC_11871 PE=4 SV=1 ---------------MEREDSSgSL--PSFVSETEIEPSDVQPaaasgenNVDKGR------RKTSSSSKRTPSITKRIESFSSFKSLSSSFS---------------SKLDDERNAGEAGQAERVEsttapESVASGETQGNAGGQHTLN---- >SoimicMinimDraft_5_1059733.scaffolds.fasta_scaffold33866_1 # 3 # 488 # 1 # ID=33866_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.741 ---------------------------------FELAPASAGLFPAQvrhkyrewttEEvhasdndVRNSPSMRRLFAKMLTVIGCAVASSQNLAALVPEVKSLgarHA-AYGVSEAHWERAADAVRAEPSRSYGGLEGERRRGPHMtrvtarTLTvIFGTMLLVAT-- >tr|A0A165S3D1|A0A165S3D1_9GAMM Chemotaxis protein OS=Halioglobus sp. HI00S01 GN=A3709_07715 PE=4 SV=1 ----MTAIMMIDRDFTVTYANEAT-----LQLLRDNQATLSSIYPGFN-------PDKLI--------------------------------GSCIDGFHKNPEHQRNILADPANLPWRTDIEVADLKFS-LNVTAIVDAQ- >tr|A0A1I2IR29|A0A1I2IR29_9GAMM Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor (Fragment) OS=Fontimonas thermophila GN=SAMN04488120_104136 ----KGVIQYINRDFIEVS--------------------------GFS-------ESELI----GSPQNIVRHPDmPVEAFadfWAT----------------------------LKDGKPWTGLVKNRCKNGDHywvLANATPLRAN- >CZCB01.1.fsa_nt_gi|955242656|emb|CZCB01016507.1|_3 # 1728 # 2327 # 1 # ID=16507_3;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.493 ----GVSSFEMNQQFSAQSSDSIEKNIAAISELWQKYMATnitdeekvladkfvatrgafvkealLPAVDALR-------ANdYEKAKLFSTKARDLYNVAHPALVeliQYQAGHAKL-EYDTSVESYKLTRNWTIASLFLAVGFLACFAYFImrSIANPLSvifRVLDNIKSN-- >tr|A0A1I5XDG1|A0A1I5XDG1_9PSED Globin OS=Pseudomonas borbori OX=289003 GN=SAMN05216190_1566 PE=3 SV=1 -----DDAALLEETLEMVSSRSEDLTPDVYARFFSRCPAASGLFTvIDpatPPM-------GCGQ----MLFEIISLLRDSAAgkpyvAsyMQQIATEHaA-FDVRDPALYREFMHSLADVQATLLGPDWSPAHAQAWDRQIAALLRHL----- >tr|A0A1B0G6S0|A0A1B0G6S0_GLOMM Hemoglobin-like flavoprotein OS=Glossina morsitans morsitans PE=3 SV=1 STMNSDEVYEIKRTWEIPATTPTESGVAILIRFFTKYPSNLQKFSTFkDMTldelKNNPRFKAHANRIMKVFDDSIKTLDDncshLEEIWTKIAQSHF-NRQIEKQSFNELKEVILEVLVAACN--LNDQQTEIWLKLLDFVYEIIFKT-- >tr|A0A1J1IV29|A0A1J1IV29_9DIPT CLUMA_CG015163, isoform A OS=Clunio marinus GN=putative Globin CTT-Z PE=3 SV=1 HVLTPEEIVLVKDSWKIPSANAVDSAELIFYTFLSRYPEHQKRFVRFkDKPlnelKGSPFFRAHASRIYNVFDSVIDGIGKdpenkeVMSFIAESGIFHA-KKKVTKQAHAELRVVLVEILNDVCK--LDEKGNVAWSKLLDIFYHVMFEC-- >tr|Q7M422|Q7M422_9DIPT Hemoglobin V OS=Tokunagayusurika akamusi PE=1 SV=1 VGLSDSEEKLVRDAWAPIHGDLQGTANTVFYNYLKKYPSNQDKFETLkGHPldevKDTANFKLIAGRIFTIFDNCVKNVGNdkgFQKVIADMSGPHV-ARPITHGSYNDLRGVIYDSM----H--LDSTHGAAWNKMMDNFFYVFYEC-- >tr|A0A0G4EPR9|A0A0G4EPR9_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_12573 PE=3 SV=1 ---SDKERgVLIDKTWGllKERYTLQEIGEELYDNVFKNAPDLRHLFKRPKEL----MALKFGEMISTIC-GLFQ--TDRESLLEtmrDLGIRHV-DYGSRPEYFPLFKACLLDTLENLLEDGeFTAATEASWNDMWDEASEMLISS-- >sp|P15447|GLB4_GLYDI Globin, monomeric component M-IV OS=Glycera dibranchiata PE=1 SV=2 MGLSAAQRQVVASTWKDIAGsdNGAGVGKECFTKFLSAHHDIAAVFGFSGA--SDPGVADLGAKVLAQIGVAVSHLGDEGKMVAEMkavGVRHK-GYGykhIKAEYFEPLGASLLSAMEHRIGGKMTAAAKDAWAAAYADISGALISGL- >GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold789473_1 # 1 # 552 # -1 # ID=789473_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.562 -RIPPLKGSSLSAGWRTASSSGLS---------------------------------------RNPRGTVSR--------ESGNTVFqseTF-AGAASPRGGSLL-C--FT--GENEPMGMINNLKT------------------ >tr|A0A1G7K468|A0A1G7K468_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter baekdonensis OX=875171 GN=SAMN04488117_103319 PE=4 SV=1 -MLAVKQISLVRNDFRRLAPARPEMFKWFYDRLFEIAPHTRDLYSE--------SLTEESSRVNGLLEIAFLSLDHPQAMFATLHTLgrdFS-GFGIWETKLHLVVDLLVEVFAEFGGEDWGSELEKAWHSVLIFIAQGMKEG-- >tr|A0A291GF03|A0A291GF03_9RHOB Uncharacterized protein OS=Celeribacter ethanolicus OX=1758178 GN=CEW89_16165 PE=4 SV=1 -MPSARQIALVRNNFRALSPKRPDIFIPVYDRQVGEDPKAAAQYDG--------SLCQRARVLDGLIELALLSADHPTALFATLHKMgqdYA-HYGSWREKHPFLIGQIIKAFAEATDTHWTDELADAWEQFLYFMAEGMLEG-- >tr|Q86G74|Q86G74_PHAPT Hemoglobin II OS=Phacoides pectinatus OX=244486 PE=2 SV=1 TTLTNPQKAAIRSSWSKFMDNGVSNGQGFYMDLFKAHPETLTPFKSLfgGLTlaqlQDNPKMKAQSLVFCNGMSSFVDHLDDNDMLvvlIQKMAKLHN-NRGIRASDLRTAYDILIHYMEDHNH--MVGGAKDAWEVFVGFICKTLG---- >ERR1719468_599295 -ELNEKQIAVIKESWKVLTNEITEIGMLAFLHLFESTPDAQGSFKEFhSMTkdelKHSEIFRNHASRVTGVIKKVVEKIDEPETYLPHLHILgqkHV-MYEIDVNHIDQMGYMFLSGIKTALENknAWNDNARDAWESLLLMVIAEMKK--- >ERR1719329_2046659 ----------IKTVWAKIMKEVgtLNAGTMLFKNVFMLAPETKQLFPKFRHlkddlLLSNESFKNQAKLSISALSNAIMSFDDPPKLkrmLMDLGRIYE-SKGVSLATLPIVGNALMATIEAALGNDSCIETFNFFALFYNEGSNMLAEGYK >ERR1711915_153481 LGLTKRQRFLLKGSWKGISREMQVTGVRVFIQMFQSRPETFQFFPQFqGLDgpeqqKRSEVFQEHSEKVISRIDEALASAENPEVLTGVLLQTgayHRKIDGFNPQLFLCIEEPFLESLSLTLDERYTPQMDSIYKIITKYIIQTVIDGYN >ERR1719369_313705 TGLTKKQRFLLKSSWKGVSRDLEYTGVKWLVGVFSTQPHTQKYFTNFsSLSldgelQECTEFREMAEKVMERLDNALFHMEEPDTMRSILLETgayHRRIQGFREDMFKDSEAPLLQAIENTLDERYTKQMAEIYTVVVQFFIETIMEGYT >tr|A0A0S8CN91|A0A0S8CN91_9BACT Uncharacterized protein OS=Nitrospira bacterium SG8_3 GN=AMK69_14025 PE=3 SV=1 -GLPPSDISRIQRSFRMVASQGEKMASRFYDLLLERSPELQKFFHPGN-------LSQQHAKFFNGLHSLILHLEHPQALraaLVQLGEQHQ-GDGIEIQHYPPVVDTLLQVLTEFSGEGMDGETYDAWAHFLHLVRAIMLENH- >tr|A0A182IYR6|A0A182IYR6_9DIPT Uncharacterized protein OS=Anopheles atroparvus OX=41427 PE=3 SV=1 -GLTKSQKVALIAAWSIVKKDLVTHGRNIFVIFFEEYPQYLDYFDFSASdAtgdlGENRSLHAHALNVMNFIGTLIDyGLNDPDllkCSLARLVRNHR-RRNVTKEDVGAVGGVIMRYCLKALEQHRSKTLEDAFGAFLGTVAAAFE---- >tr|A0A182QXV6|A0A182QXV6_9DIPT Uncharacterized protein OS=Anopheles farauti OX=69004 PE=3 SV=1 -GLTAQEKITLFSAWGLIRKDLDIHGRNMLLLLFHKYPHYVSYFDFTDDaSaqtlVDNKSLYSQSIHVIKTFGSLIEyGLKDPAlfnETLKKITRIHA-ERNVYGKDILTIGDVLLNYLAQVLGRQVSDALPDAFRKLFVTIAGRFP---- >tr|A0A1Y1I4E0|A0A1Y1I4E0_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_002310190 PE=3 SV=1 -QLSPFEQQLVQKTWKLLQPRLADLGQAVFTHLFQKAPKTRPLYTCPlRLadgdrrTPDGHAIPTHAVEIVSTIGLAACRIGSSSRILAVLErlgQRHV-AYGAAPDMFSVFKEAFLVALKKTLGGeHFTAQVHKAWSKALDSVVAHLKKG-- >SRR5271157_2714777 -SRIVDRLTALRAFFAEMEPQLPVIVARSYERLFDVEPAIALLFKG--------NAREHQLRFLAKLQSIVKLTRSSqlwpasaatgQILipeVLDFGRSHA-KIGVLPVHFSLLNDMIAWTCKEIAPLRFTPLVEEGLAFVFDVLGASLTAK-- >ERR1719323_206356 -KLSEQEKSVLKSSWAVISKNLEVVGSQMFIEMFQANPDTQHQFSNFrgiDQTelSETPQMIQYRTKVVATIGQVIDNVDNTHMLWDlliKFGRDHF-SYGALPMYFDLMGPHFVIAARNNMGNDWYEALEYHWLALFELIIYIMKFGWH >ERR1719461_2449329 -----------------------------FLPSFDHDPECPEKISLH------------CQRVMSVVGGSIEHIEDYQCLWKhliSLGRDHF-GKIYEitlgqkSTFYPKIHSLKIpIFTKfTFLKSNFSQNSRFSNIKFL------VISGX- >SRR5690349_7596073 --------------------------------------------------------XMQMTRFTDL-GLRTLMLL-asaestgrrvttRTIAVGANASEHH-VAK----------------------------AVSRLAELGMVMADTLIE--- >SRR5215510_2422438 -QMTKEQIEVVQNTFNKVRPMSGTAAQLFYNRLFDVDPSVRETLLW--------TLKQGlGADFTPEAEVAWGNAYDFLAAVMQQAAKGA-SMX------------------------------------------------- >tr|A0A158PBC2|A0A158PBC2_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=3 SV=1 -LPNPRERELLRRTWSDEFKFLYELGSSIYIYIFEHNPHCKQLFPSIAKygddYKDSREFRIQALRFVQTISQVVKNIYHMDRLESylyGIGQLHCKyaHRGFKPEYWDDFKDAMEHSLTDHMNSlsDLDAqqrsEAVAIWRKVAHYIISHMRTGY- >tr|A0A2A6CNA4|A0A2A6CNA4_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_32112 PE=3 SV=1 -QCNPRYTALLKSTWSDDFEVLFALGAKMYITAFEgpHGVACKSLFPWVAKyeeagenYADKSEFRLQALRLVQTIVKALDKVDDLQKLEAylyAVGHRHVFylPVWLDPVYWDVFKasratsylgqstmlksaserDAVQVGVNDHLHKlsKLSTddlaRATLIWTDIIEYIFEYVKEGF- >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3481696_1 # 1 # 387 # -1 # ID=3481696_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.584 -VLTSNDIALIRESWAYAKDI-PAIQTETLLEHFRIQPRTQALFPKFaDVPlnklPTNDAFIKQARSCVSFGLNFIVANLDNPSLLkDMLGRVdTyG-KWYVDF--MtkeRQMQTTVdifIQVLSKELGGRLSAAAKAAWTRAMTLVFVEMMS--- >ERR1711894_485352 ---------------ILLYNY-rfLTYVIYYYYRFLAEDPTVASVFSRVNVdDQQSGEWHAHMLRIMGGVDILINMMDDVNVLTEEVKHLraqHVVREGVTHERMKAFLIIMMDELPKVMT-HFNH---DAWKSCLSKKLKRIG---- >SRR4051812_36412483 --------RRRTRGSARITWPGYQMRNLLSPRLFDRASAVRVLLPD-DLT-------RLKHQFARTLHWLIGHLHEPQKVriaLVDLGRRHQ-EYGVKAEYYPAICEALVDSLATISADDWNDELARDWRQTFELMVHHMLRAYR >tr|A0A1Y1IHX6|A0A1Y1IHX6_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_006460015 PE=3 SV=1 -KLSDERILKAQALWDFMEGsafadndrrQFIDRGVKVFENLFELAPQVLTLFPFKDENgrPRRKELEVHVETVMSTTGQVVRQMQDPDSLAPMLTELtalHV-KYGVELIHYDILCSTFLLTFEQLLGPRWNSDYRDVWISIFSFITTFARKAY- >SRR4051795_8230555 -----PAVT-----------------------SPRVpA---------------------------------------------FgSPCPvirQQ-RWTGAI-----IGTRQEGSVP----------SAHSTTSGD------------ >SRR5215203_3322109 -ELSERTIALVKATVPALEAHGLAITRRMYERMFH-NEAIRDLFNQSHH----GETGSQPKALAAAILAYARNIEILAAWGEAYWYLaevLI-ARERLIyqglaaapGGWTGWRDFTV--AEKRCESEVITSFVLRPTDGGPVLRHR------ >SRR3954470_353290 -----ARRS-----------------------------------------------------------------------SPLaEGDPryhVH-QWDRGRQPRRSTRCRVTPPVT----------NIRRYLVGP------------ >SRR6478735_1414904 -----SGSR-----------------------PARLaS---R------------P-SW---------------------NHRPIgEATLvnrYG-RS---A-----AGSDVE--------------RIERDLSGT------------ >SRR3954468_7455402 -----APPD--RA-----------LT----GGGETVpG---V------------R-ASR------P-------------RTIDRsGRTLvsqSE-RS---A-----EGSGVE--------------EIERDLSGT------------ >SRR3954470_12739883 ------------------------------------------------------tsaCSRTRTSATCStsrtmarqapsprrspPPWSPMRAISTTSARSPRVERIaqkHV-GLNILPEHYPAVAESLLGAIKDVLGVTHYSRGLTDDPDWYPYLKKHEWL--- >SRR5215831_13609655 --------KPCNRSKPFFRINAFCSAvslalrlQRLCELPESAHPQRC----ASCL----K-TANPAKNVVPKRFGTFISIHLRDTYIFAVSKIgqkHC-GLNILPEHYHYVAESLLGAIKDVLGEAATEEVLSAWGEAYWFLADVLMA--- >tr|F2UFM9|F2UFM9_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_06664 PE=4 SV=1 -RLDMEQLKIALGSWTAVVELVPTWHEVFFAELFQAHPETeRLLYSSDKSK---SWNERHMARVGKSVGDVIKSLSNYDDVIEHLTTGephEQ-ACCL--------TDG--YVIGTGLGNT----PRSLWLACGS----------- >tr|A0A1Z5KPX1|A0A1Z5KPX1_FISSO Uncharacterized protein OS=Fistulifera solaris GN=FisN_16Lh317 PE=3 SV=1 ---SPACVMKVINRWETARQRngfDEQLDIDTLLALFKMDPQVKPIYGFAvEKEvkaQgmQRMGVLIYGLQVVKMFDVILSALGPDeElfyDVVTEMGEQHC-KHGLTPDHFTLLCGAVMGVLETIMDTEWTKDVRAAWSQVIECVNAEIVK--- >ERR1712000_676789 MSLTPQQSAQIRSSLPVLKSEGETITSLLYASLLHNHPDLHNLFNSVNQAN-----GRQPRALLSSASVKGTARWESHQLS----------------------------MISSRGTCWRPSR-RSWGPSGRLSX-------- >ERR1719328_19047 -GMTPEQKQLIDDSFAVLKKDVKGNTIVFYETFFKMNPELVAHFPGVseaDLVnlGKNEFIIQRGAKFFNMIETTTHLMESKEGCLELVRMLkesVP-EGKVTYDRYKVAKEPFIKMMETALGGNFSAETKAAWRKFFDSLAETTK---- >SRR4051794_16351730 -TLTPFEVGVIRTSFRDLQKRSGPAAQRFFRELFSYDAALRELFAP--------SPWTRQENLMSVLSGVIEQIDSSTTLTTHLDEVvrrFP-AFAVNSYYHLYVGAALFAM--------------------------------- >ERR1719187_1205752 -SLSQGENDALKAGFKAAQGKLGDIGANTFANLIANDDSFRQRFPWANsdITveeiKTYAPAIAHGEKVLQGVNVAVKNLDRLNSFVSyfvDEGVKHV-PRRVTVDDFQAFAEAVHPAFQKELGDLYTDDFKNGLTGLLGFISDNMAKG-- >ERR1719187_2594184 -QFTEAEKTILRDTWKGTIQpHMAQNAANLLITYINENPQDRKLFYWGRndKSgmalRVSPGFVTHSQGVFSGVGVGIDRLDNIASLDKfytQLGEDHI-PRGIHEGVFAPMKDAFLQILGHALQEEFTDEAKAAYGKYYDHIAGKMIEG-- >ERR1719309_658292 -HLSGEEKQLLQDTWSRSIApLKHENGANMFIHFITHNPELRREFFWGRnnKTamalRVDVRFASHIRSIFDAIAHGISRLDNMDSLQGyytELGQDHI-PRGVQRVMFAPLADSFMYAVGLALEDQFTPAVKAAYLKYYMHIP-------- >tr|A0A0G4IVL1|A0A0G4IVL1_PLABS Uncharacterized protein (Fragment) OS=Plasmodiophora brassicae GN=PBRA_001183 PE=3 SV=1 MRLSARITNLVKSSWAEAMTLQgrdgMTLQKAFYNHMFTKAPESRAMFKE-DTS-------KQELMFGQMMTDAVNILDNFEELVNKlvyLGEVHR-YLDLAPEHFRVVGESLIGTLEDILGKkRFNAEVKEAWVMVFDLMATIML---- >tr|S6BNG7|S6BNG7_POLVA Globin OS=Polypedilum vanderplanki GN=PvHb32 PE=2 SV=1 -PLSKEQADEVRHAWDKVKSN----EVEILYEIFKAHPDIQNKFPQFagkNLDsiKNNSDFGTHATRIVSFITEIMSLGGKpdllpaIKTRVNEMGQNHR-NRGVTKEQFNEFRSTLTDYVKHHS--SLDGDTEHAWNQAIDNVFFIIFSNL- >tr|S6B7W8|S6B7W8_POLVA Globin OS=Polypedilum vanderplanki GN=PVHb31 PE=2 SV=1 -TLTADEANLVKSTWSQVKDK----EDEILYDIFKQNPDIQGRFPMFvgkNLDsiKSTEQFKTHADKIVKAIGSYIDLLGNesnsgaIKTILNELGQRHR-DRGASKEQFNEFKTSVLKYVKEHAS-GWNDASGSAWDKAFDDMYKIVFSNL- >SRR5579871_994368 -----ADPMNINESIHDILNRDEIVADLFYDVFLDRHPEVRRFFVGVDI-------RQQAIV-LTMMLSIIEDfYHHsypaTARYLRLVGQRHK-ARAIPKEMYLIFCQCLLETLERFHGQNWSAQLSDEWERAFDKASQVLLEGYQ >SRR5512135_1415087 -------TELIARTWEALGDRQAQFIEAFYDRFFERFPGYRKLFPHE-LR------TAHLEKMVLTLALLADLSDDRTAIAPRLHKLgaaHK-PFDLELRDFNNFKAVFIEVLGPQLGKQWTAAAAKAWNDAFDAVLIP------ >tr|A0A163MXG7|A0A163MXG7_ABSGL Uncharacterized protein OS=Absidia glauca OX=4829 GN=ABSGL_15412.1 scaffold 16614 PE=3 SV=1 ---SQTDIDLVRSSWERVIETqhpsdedgvspAQAFGLVFYAALFHLDPHIRPLFDGTNVMIqakmltfvigclvRAPMVIQRRGPTLKEISTTPTGAEDMEGLAAKIRELgarHH-FYNVEPAHFQLVGPAVDMALRERLKHEYTDAIGQAWLRTHAFVAHHMA---- >SRR5207247_8066543 ------DVQRLQESFARMAMHGDAVPLFFYSDLFLRHPETRDLFPV--------SMAAQRDRLVDALGRIVSDVEHVDADSGDPSGArpeDA-HIQAVRILsnAQQMADNYVADAQEY-----SSQLSTX----------------- >ERR1719193_187210 -VLTENDIKAIKAIWYPVRQTPADIGAAAFEKFFKLYPHQKEKFWFMkNDDLKEKGMRAHGEKVIKSLDEAVLRTVDrarIRSCLQRLDYIHF-QMGITEEDMEELSDAVVKTIKEVVIdtnKKLTHEELDSFKKFMKMVTAE------ >ERR1719193_859649 -------------------------------------------WRMLkKRH------NRDGGKLLH-PLKTILQTCYksrIKNCFQRIGYIHF-RMGVQEEDMEQLGEAIIKTVEAAWGDEFTPEEYAAFRKFMKKFTAA------ >SRR5580704_1734515 -----------APRAELATGVAPDYgSPDDVASRRSQSRACRRTLRR-P---------TTGAVRGEMLARVIEAILDFIgeRRYAhHLiqcEVVtHE-GYDVPPETFGIFFGVVATTVREQLADAWTDAFDEAWRTLLYDLDY------- >SRR5258708_241677 -----SCGEDPAGSSD-------DHDAD----VVASAGQVEGGVDL-V---------EHPPALGVPIAAPCQWLVDLEgaGACAaNRmaaERVnHE-GVGVPPAALARFFPIVAETCRDLLGEAWTGEIEAAWAGLLTRLAV------- >ERR1719296_55987 --MDSDMQVAVQKSWEKVQEIGTlAVAELLMKHTLEIDPEAIQLYICKAKPGEDENVLDVARKLfartLFILGSSAAGMADTAHVVKNLTVAGStlANSGVKESYFNTVGTAFQMTLQEVLGDKFTPEVATAWKVAFDFMTAIMVAGMR >SRR3954451_11513015 -AASPCAQQLRQGCRDRPA-----ACQLVLSSGVRDRPGCEIAVQG--------RHGEAGPQADGGADGLIDAIDRLDTI--------------------------------------VPAVEAAWTEAYTILATTMKD--- >Dee2metaT_27_FD_contig_31_2132282_length_204_multi_2_in_0_out_0_1 # 3 # 203 # -1 # ID=1013462_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.592 ------------------------------------------------------SAATSNPQF-------VAAV-------------------KKAIDYSGL--------LTVAGQGAVQPagiipSVIAGTLPAADALKQDVAG-- >AntAceMinimDraft_18_1070375.scaffolds.fasta_scaffold521461_1 # 3 # 443 # -1 # ID=521461_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.569 -------DD---------------DDDDDDdDRMFHDHPEARALFSRVhGDNTYSPDFEAHAQRVLGGLDSCISLMDDPDTLASELGHLkaqHA-DHTdVTAEHFDVSICFSsTDVTSTYTsthckimdrpnYTVFQT--RGQrnltksaSRRAHspvRDHPRG------ >ERR1719191_324407 --MDDSAMKITQESWAMVEKEVPHWPEIFYDQMFA-DPSVAKLFPFSsGNFKENPKFQEHTQKVKDTMHTAMTSIKEFDKLrpvLYKMGQRHV-AYGTLPEHSTNFKNAFLFTLKAGYGDKWNEDLDDAWNQCVDALL-------- >tr|A0A0P5XAJ2|A0A0P5XAJ2_9CRUS Di-domain hemoglobin OS=Daphnia magna OX=35525 PE=3 SV=1 -LLTANDRRIIRKTWARAKKD-GDVPPQILFRFIKAHPEYQKMFKSFaDVPqaelLGNGNFLAQAYTILAGLNVVIQSLSSQELIANkinALGGAHK-PRGATPIMFEQFVNVAEEVLAEELGSSFNAEARQAWKNGMRALVTGIT---- >ERR1740129_283753 -PLTRREIRTLGLSWSKFHGCRQEFGVELLVQFFQLVPEASDLFRFQRekTISENPGLKNHADRVVRVLSRVIHNILSLEEVVPDLKALgmkHYMDYGVSPTHYCLFGKALLGTVQTF-GG--TPPEQGCLPKLYEWMSRTMTS--- >GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold759510_1 # 2 # 568 # -1 # ID=759510_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.697 ----FVTTQCVVENWERLkySPFFDEFVIAFYQRVFRLCPQAKSLFGSSfCLD-DQAAMT---QEFVRLIDRILDLLGPESqlmvEVLRDLGSHHE-AYGVTVEMYDIMRNAFLLTLEQFEGEKmFTSKVRQAWTTVCSAVADVMTEA-- >tr|F2UFM8|F2UFM8_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) OX=946362 GN=PTSG_06664 PE=3 SV=1 MRLDMEQLKIALGSWTAVVELVPTWHEVFFAELFQAHPETERLLYSSDKSK--SWNERHMARVGKSVGDVIKSLSNYDDViehLTALGTRHA-RYGLHVDQLDLFINAFLWTLGAGLGDSWDHSVKKAWMHVLPFILSPLKS--- >UPI00054DD732 status=active ----------------------------------------------------------------------------------LTCARDF-FltfVGVERCR-PKLLKQEPQTITSKLGm-A-PMLQSAFWSIRVMRIASS---A-- >tr|A0A1E2UUQ1|A0A1E2UUQ1_9GAMM Uncharacterized protein OS=Candidatus Thiodiazotropha endoloripes OX=1818881 GN=A3196_04875 PE=4 SV=1 --ITKTNLKRFQQSLRRIS-LKQGFYDTFYDHFIAQSDEIAAIFHARDM-------DQLKGKLKETLQMVEDALMGKPGvvlYLEMLGRIHT-RLKVDQRHFEMWKYALLSTIERYDD-EYDAEVKMAWEAAIETVVSLMYPES- >SRR5262245_29633745 ----------------------------------------------------------------------LGNHSTR-cgrSVESSQSNSTA-DFLNSRRIHDAYSpaiRAAKSKSE------------------------------- >SRR4051795_1885912 -----------------------------------------ApRTARRRL-----QPGQPGRRLAAdRAGRVGRGlRQRPaegprtdsrapavadraqarvaghrprpvrrraRQPVLGHRRRAR-EGGHTGGRRRV----GRGLLADglCPGQPGARPLQRAWRAA-----GDGVAR-- >ERR1719218_338423 --AEEAGDTVLV---GGAPLgarqRPMATGSKIFRKLFTGDTAVLRLFPFRHQartLFVSAPFKLHAKLFVDTMTELIANLHDLEKVERdvrELGKRHL-TYGVQPAHFDAMGEALIASSTS------------------------------ >SRR6516162_179054 ----SQTVMDIEESLHHILEREKLVADLFYMVFLEKYPEVRRHFINVN-------LRRQAVLLTMALQVVVQYYLKgFptaEAYLKILGEEHN-RRGIEPELYPKFCTALLETLSRFHFHDWSEDLAQQWEEALKLAATEMVEASP >tr|I2G907|I2G907_9HEMI Hemoglobin A OS=Anisops deanei GN=HbA PE=2 SV=1 -SLTDREVEVINQSWNQIKAQELVVGLQMFKTLFQRYPQYERLFTHLHQSgkslYEGDRFQRHVvGNIMSSINKVIETLNSSDNAVKTLQDMgvkHK-KLDVHRKHFESFVPFVVDAMVSVRMSMSQDEVASAWTKMMEGVASNLSKG-- >tr|A0A0P5UVQ8|A0A0P5UVQ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna OX=35525 PE=4 SV=1 -LLTANDRRIIRKTWEPRpR-RTEDVPPQDPLPFHQGPPRVPEdVQVLRlCSPsracEQRKLLGPRPNTILAGLNVVIQSLSTHGAYCQPNQRSrsaNK-PRGVPPIMFEQFGNVAEEVLAEALGSSFNAEARQAWKNGMRALVTGIT---- >RhiMethySRZTD1v2_1073278.scaffolds.fasta_scaffold3173058_1 # 192 # 530 # 1 # ID=3173058_1;partial=01;start_type=GTG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.740 ---------LYSGTNvytgataslLAQADYLSSLIGDTDYPMFDVESVVQLFL----------------------------------------EwehNKHH-DIMGFRN---YPHKSVMTG-------TRAPVHHTPWLQALDDSMECYLNT-- >ERR1719183_2765469 --------------ADIFMPRLEEIVMRMYNLILEEQHECINIFNTPSLS-----PGQPLAALAACIRGLIEDINVRPRLEhrvEMIAQKHC-AINLQAHNYLGLQGMFMSAAEDVLGADMTPQRFSAWSQALLFICRLVIER-- >SRR6478609_8547471 -VlvdveevlrvvfgFDLPQTDVVRSvVLGNPGQ----I--------IAVHKVDV----------------AAGGRIGPQGGRVVPHPRDVcLV-LRRVHPLR------------------------------------------------------ >ERR1719383_1602644 -------------------------------------------FGLHL----------------QSTMLVGNDLDPVDERG--pdhCQQALW-TASE-GRTLSHRRREPCRSVLEVLGEdVVTPEIGGAWREAVQALAKILIDT-- >ERR1740139_1260005 ---TEQMKTDVVSSWGKVLSFGTlTVGRVLCRHTFALSPDMHALFPPHILhkyqeegeTDSNGALSRHFSMILNAVGCVVSSFDQDadLSTITQLGMRHA-SYRVVESHFETIGRALELTLHDILKDDFTPEVRHAWKLVYSFLSLVMIRGI- >tr|H2ZAE8|H2ZAE8_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1 MEMNAQEIQDVRDSWKRLCADGeKTVGLMLMQKLFNTYPESIKVFSRLGITnkaiitiddlSTNSAASRHAESLTSRIGTLVDLMHNTHefkECSTEVGEIHI-KYGVTAEHVDILGNVLLSVICDSQGLSKSSDLYLCWTKTWEGIAKYVK---- >SRR6185437_12825295 ----------------------------------LIAPRLELILPA-DP-------ARRDAAFLELVDMVVQRLDRLDLLLPMLAAQaHSwGKRDVLDGDYVLAGKALAWTVEQVIK---EPAAIAAWRDTFDFLAGVMRR--- >SRR3954465_11422119 ---PCRSSPTTSGRSPGAS--TRT---------------CStAtRGCWTGPStgatrpRA-----PSRSRWPGPSRSSpahwSRSPSRSpSTCSpgSRTSTTHsasprpppP-PPPPARAERGVVQDNLFWAIVDVLGEAVTPEVAAAWDEVYWLMAYALVNQ-- >SRR3712207_885952 -------------------------------------------LGR---------------------GLLadglRAHPPGAgALQR---------PRRAAGDGVAGVggRRGENRERGRREPPPAAGAGTPGVDRAAPPGRCRPGTP-- >SRR3954465_6877418 -AtaaaTAAASSTDIRATRPASLEG-------------HDRPHLDTaEAGRAQLADG-----EGDIEVGGVDEvVAtqhlLRLHERAvGHlgpPTDARRGAGR-LQGVAAEELGTVRLDLDGELVVRLHDL-----VEDLGRRRRVLALVLVDQ-- >SRR3712207_8177874 -VLSDRARPVVEATLAPVADNIGEiarRRSEER---------------------------------------------------------------------RVGKECRSR-----WSPY-----H------------------- >tr|A0A0L0FUF5|A0A0L0FUF5_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_07147 PE=4 SV=1 -ICKPEELHtkdlgfivtHTNNPW--GSTDEQDFGVDFFRDHADQ----------------------SGLTSFFSSIVIIACEMYQEfePSIPQLQKLgeeAK-HLDIPCHMEDNIVGYVASTLSR-SKQ-FDAIEECAIFKLIWRVVLFVLE--- >tr|A0A252E791|A0A252E791_9NOSO Nitric-oxide synthase OS=Nostoc sp. 106C OX=1932667 GN=BV375_01385 PE=4 SV=1 -ALPPQMLHQMADCWEVFSQNKQQMGMEFYQILFEKYPFVLPIFGRADMD-------YLSLHLFQAVEFLVRCLRTGSsdNMLQELRFLgqvHS-FADVPSCAYPAVSDTMFVLFEKYLPN-FTPELRQAWQILFDRVVNVIKL--- >tr|A0A2T1LS65|A0A2T1LS65_9CHRO Nitric-oxide synthase OS=Aphanothece hegewaldii CCALA 016 OX=2107694 GN=C7H19_21845 PE=4 SV=1 -ALPPEMLQQMIASWSVFSQNKQEMGMEFYQILFEKYPFVLPIFGRADMD-------YLSLHLFQALEFLMRCLQSGSseEMLQELRFLgqvHS-FADVPTCAYPAIGDTMFTLFEKYVPD-FSPELRQAWQTILERVINVIKL--- >tr|A0A2E9QYM9|A0A2E9QYM9_9DELT Nitric-oxide synthase OS=Deltaproteobacteria bacterium OX=2026735 GN=CL920_22905 PE=4 SV=1 -ALSS--MKEAKRLWEEGVGLHTAPGSEWVHQLVAERPEWNHFFASSDPE-------AFGEALFSTIDSAVHQLDDEVSMFSSLREDselFT-AWDVRACAFSALPDVLVDFVV---ED-HQTVGAQALRTFLRRVCTIVSL--- >HubBroStandDraft_6_1064221.scaffolds.fasta_scaffold2618798_1 # 2 # 181 # -1 # ID=2618798_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.622 ---SAEDRSIIQEQWKILFKDVdsskikIAVGRKLVLNLIQRQPDAKVLFDKFNVdEPNSPQFSAYALRLFNRIDLIINLLKDPEALDAALEFnaeRYGNIPNIKKAYFQTAAQILAYALPKVLDD-FNA---LSWQSCTRYILTTVASKVS >RhiMetdeSRZDD1v2_1073273.scaffolds.fasta_scaffold2404579_2 # 426 # 629 # 1 # ID=2404579_2;partial=01;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.627 ---SSEDRRIVQKQWNALFGDVrssrvkIALGSKLLLKLAELRPDAKEALKPIHIdDPTSGEFQAHSFRVLNSLDVFINLLTDAEALDAALDHhskEHSGIAHIKKEHFKVFGEILISSLPKVLDD-FDA---FSWRSCYKYIGQRLTAQLH >sp|P02210|GLB_APLLI Globin OS=Aplysia limacina PE=1 SV=4 MSLSAAEADLAGKSWAPVFANKDANGDAFLVALFEKFPDSANFFADFKgKSvadiKASPKLRDVSSRIFTRLNEFVNNAADAGKMSAMLSQFakeHV-GFGVGSAQFENVRSMFPGFVASVAAP--PAGADAAWTKLFGLIIDALKA--- >SRR3981081_215795 -RDDPDQKQLVRAFWKQVVPTAEAAAGLLYRPPFERGPHTPAPARVsrpTAAS-------PARGSLLECWGFQSAAGQAR----------PANGEGGKP----RPPPRRL----------------------------------- >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold136029_1 # 443 # 1567 # -1 # ID=136029_1;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.433 -----HHLQFLQQQISAAEPRAGIAMLVFWKNLFELNPSLRPLLGEK--P------GEEDYLLVQFLAAGLAPLFRQTPNTAPTdQDGACAPVNTDeEQQCSVVGEALLWSLEEAFGADFTPKVRSAWETLYRFITVSNKQSY- >SRR5687768_12147577 -------------------------------------------------------------GLAHARMDSvSLK--PpanphcaiktwvlacgvparTAEWRPMSNlSDAP-SPSLLSDQSLSV----VQ-TTATVVAAHADEITAAWSEVYWLVALQLVA--- >SRR6476660_4664138 -M-VVVGVDAHKrtHTCVAVDGSGRKLGEKTVPATT----------------------VGNASALRWARSTFGpdltwgiedvrnvsRRLE----------QELV-NAGQR---VVRVPTHLMARTRasartrgksdsidaTAVARAvpREPDLPVAqHDSVS--RELQLLI--- >tr|R7TLW3|R7TLW3_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_227018 PE=3 SV=1 -----------EITWAILSENRDGLGTEVFVRMFESYPDLKSAFGPLrHMNKKdagyEDVLRAHGIRVLSIVEQVLSKRHNMEEVLSILHDLgrkHL-TFSAKVEYIDIVSQMFLFAIESALKEKWNNSTEKSWGEIIRFVTYVMKET-- >SRR5918994_1081840 -----------------------------------------------------------------MLAVAIEALLDRGGegrlagLVGIERMNHV-NIGVPPEVFDGFFALLMEVVRDALGPPPKGGGeragGGGWPPAPRPAGAR------ >ERR1712157_679996 -----TTMDCVLSSWEQVRRIpnyRETVGLAILQKLIHRMPEGREVLHMQrNLIknsppgiESDKLLLAHARAIVNGLDTVVELlgplIDDISEILREIGKSQYHDYGDSMALWNpLMRECVLEVIQETLKDDYTHELKVAWTDFLGEVAKDIHS--- >ERR1719360_423992 -PLTQAQKEIIFTSWDAIT-HKENLGVTIMYRIFTGHQEIKHLWKFADdLKteeeiRGSKTTQFHAKKVINGVNSAIKAVEAgkeVESlGLDKLGARHF-KYGAKPADFRHFVESLFWAIKTIVPE-VSAEMAAAWTNFVMQIIKQMTN--- >SRR6476660_7963253 ------------------A-SHSTFFERFSSNFKAANMSLQPFM-----D-------RQQKLLREDLTKLVMCAENAEFa------TRPGAvALNVSPQLSKFWIDALMLTVREFD-EKFTPELERKWRTILQKGLA------- >ERR550517_1828149 -------IYYVSikPPKNRLESHIRKqSRVqsdysQDYIKETAIFSFFIQIFHKLNPNPNSsgikytkdqalkESLHEHGVKVLNGVDEVLSNLDQPSLCFSLIRKTgahHRKLQGFKPKYFKCFEEPFLAMVENSLGQRFTPQMETVYRSVATFFVQTLIEGY- >ERR1719220_3089060 ---------------------------latvnIHLRSAFHASSLLIQIFQKLNPNPNSsgikytkdqalkESLHEHGVKVLCGVDEVLSNLDQPSLCLSLIRKTgafHRKLQGFKPKYFKCFEEPFLAMVQSSMGQSFFIFPGllPKWRSFTSPSPASLSK--- >SRR5919199_1911786 ------------ATLPVVSDHIGDIARRFYDHLFGEHPELLdGTFNRGNQAEGTQKV-ALAGSVAVFASALLKRPETVwRDWR--VAEKTD-E-------TADVVSFRMQRIDDRLVKTSLP---GQYVTVQVQMPD----gvrqprqfsltrA-- >SRR6476659_5675031 -STHRPDQALRGGGRPPHRAADNNAKGAATGHRVSGRS---SPAELPENSMREQQQ-ALAGAVAAFASSLIETPERVpQSLLSRIAHKHA-SLGIRPDQYQVVHDNLMWAIVDVLGDAVTAEVAAAWDEVYWLMGNALINQ-- >tr|A0A1I3XAR1|A0A1I3XAR1_9PROT Methyl-accepting chemotaxis protein OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_101121 PE=4 SV=1 ----QAAIQRA-EACLTLSADGLVLEA---------NDRFAALL-GLA-------PAAVADRPHA--ALLTLAERDGATYrrfLDQLAQGR-------------------------------DTVARLWHQGAggagvllELSAAVMAAD-- >tr|A0A1I3XA39|A0A1I3XA39_9PROT Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_10 ----MAAIDMA-QPMMLLGADGVVQDA---------NAPLAALL-GVS-------ADALAGRPHA--ALLAEAERDSAAFrrfRDAVAAGQ-------------------------------AGHARLRHAGAggntvtlDLMMQPLAAE-- >tr|M2X1G3|M2X1G3_9NOCA Flavohemoprotein OS=Rhodococcus triatomae BKS 15-14 GN=G419_19149 PE=3 SV=1 -ILSATSRPIIEATLPVVGEHLGEISRIFYRHLFDNLPSLEsDLFNRTNQANGEQ-QKALAGAVAAFATLLVTEEAPPvDEVMSRIAAKHA-SLGIVQVHYDLVHTALFTAIVDVLGDAVTPEVAGAWDEVYWLMANSLMAQ-- >tr|A0A0N5C327|A0A0N5C327_STREA Uncharacterized protein OS=Strongyloides papillosus PE=4 SV=1 -NLSNDQQALIRKSWRRVP--KQSIGKVIYQKMCQKCPELKNFLST-D----NNCVERHFKYFGDMIQCTVDSLNDLDTaLYPWLNVIgsgHG-GFAITTTHWDAFGEALISSIKQWILTgKDHKETVRAWMKLSCSLIDTLAAA-- >ERR1719323_2694698 -RLSDKTVQLLKGSAPELKEKGTQIATHLFLSLFERYPVFRDLFPK-DNVK-S---GKMISVLPHALTVFAENADNMIQLDDIITrivKKHV-DKGVQQWHYPLLEECFLDALSSTLQLQKRPDLLQAWEDGFKFLANKLM---- >ERR1712018_308843 -------CSTPQILCSRVKRKRFTRGHTSFTSLFERYPVFRDLFPK-DN---G---GKMIAVLPHALTVFAEKADNMIELDDIITrivKKHV-SSGVQQWHFPLLEECFLDALSSTLKLDKRPELL------------------- >ERR1719230_2183946 -WFTDDRERLLKRSWQQLQLdSCEEAGALLCRNYCSQSPEDAASCG-MDW-----------SAVIKVIGFPIDRMDNLAFVKKRLRCLganHA-KWETKEHQFQSMKYAFLSAPRDVFANEFTSDLELAWDLLYDFVSTEMIAGL- >tr|Q9NG75|Q9NG75_9CRUS Hemoglobin P polymer OS=Parartemia zietziana PE=2 SV=1 -GITDAEKQLVQESWELLKPDLMGLGQKVFGRIFTKNPEYQTLFTRVgfgDTPltqlMANPAYGAHLIKVMRSFDFVIQNLGKPKTLLAYLKNVgadHI-ARNVERRHLQAFSESLIPVMQNELKAKLKPEAVAAWRKGLDRIIGVIDQ--- >tr|A0A0D2WU86|A0A0D2WU86_CAPO3 Uncharacterized protein OS=Capsaspora owczarzaki (strain ATCC 30864) OX=595528 GN=CAOG_006523 PE=3 SV=1 ---RHETRDVIKSTWALAIQKQdeadvtpvATFVNVFFGKLFELCPETRLVFGQ-D-------LSLQGKSLSSVLTGMLEFVVHPKKlttQVKSLAVKHV-GLGITPDMFDAFGAALVYTIKTRIGKVWSPQTERVWVDAYGGVNNIITQQ-- >tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii OX=37682 GN=F775_23753 PE=3 SV=1 -TFSEEQEALVLSAWDAMKGDSAAIALKFFLRGRNN-------FVQLaHVEspkRRIPVVEERKTDL-----------------IFEIRTKTW-KIGQKSTAYRSW--LLLR--QKSLPa----HAPKGHLSElvpldTIDHTHQET----- >tr|A0A2T5C1R0|A0A2T5C1R0_9BACT Hemoglobin-like flavoprotein OS=Mangrovibacterium marinum OX=1639118 GN=C8N47_108138 PE=4 SV=1 --MTEADITVIEKSYAQIEAALPRMAKYFFNRANELDSDLDPLFEE-DKS-------KHGEAFVALFGKAVEHLNSPEALLPEIKKMEAklKYYKFNEEVLNTVGVVFVDTLSFGFGNNFTQDIIDPWVKAYKTYSSL------ >tr|A0A074ZRQ0|A0A074ZRQ0_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_04650 PE=3 SV=1 -SLTDAQINGVQSSWKLLKIHIEKIGVIVFLGLFEEHSDFRDAFARFRQkqlsiLTRDPAFQAHGLRVLNVVDKIISRLRRIDTIqdfLLSLGSKHC-RYVPNIELVPAVGEQLLEAIRPVLEEqgLWDDDTAVGWEAVLAYLNCAMRY--- >SRR3954463_14455484 --AQ----------------------------PRAARPSALRLSRPGDGA-----P----FLLRAEVaCLasGI-----g-----------TF-GPGLRSHPLARLGRS-----RALRGRAVLArCPPKIWSPLD------------ >SRR6476620_12491069 --LSDQSLSVVQATAPVVAAHADEITAHFYPRMFAAHPELLLVFNQGNQA-----TGEQSKALAGSVvAYAVQLIDPkapsFDHVMRRIAFKHV-SLGIRPERTQLSASICSLPSLRLSATPPPPrpprpgarsigCSRSSWSPR-----KHGST--- >ERR1712198_397898 -GLTEEEITEIQSTWKSIISdKTSEHGVNILIRFFKNYPEYKAqYFQNLnTLSedelRESPKLRSHGAGFVLAITQIISDLDNMlivEEVAKKIARNHY-NKGIREPlNYKLMTNTIIDYIKDIGN--LADGTMQNFRKMFDIFIISVRKK-- >SRR5580700_967641 --------------------------------------------------------------------XMNRNIG----LFFPLIRHs---------CTYF--AQEPVLeFLG-GFKSAAAD-DQSVRVERIDHL----IE--- >ERR1719464_2687596 -NLTEEEKKVLRTSWAIISQKVDQDGESRFLHKFESNQENEDPILQQ-FT-QIDASICVNCCNIGSSFSWFdsnlcRNllSPSWSTFWLIIAQLvrsTF-FSSS------------------------------------------VKFGM- >ERR1719375_1958814 -----ETALTVIDSWELLRRKknyAVVVGSGLFKKFFQEEPGAIAIFGFTDEEiesdeepfYQSKRFIDLAKNFVGVIDQAVDMLGPEMEmVGEVFVELSK-QYKIEIQHYMLLGNLLLEELEDVLGaKAFTDHIKSCWVQVFQVLCKDVKKKL- >ERR671932_89059 -S-PTSCGPARACRSCCCTPTPPRRRSR------------YdGVHEG------------------LMDLSSFPLPDD--ALFYLCgplpfmravREQLL-DLGVSPRDV--qyeVFGPDLWQADAdeGPGDAPEPgahdllgpEERQGPPPA-WSRPG------- >SRR3712207_7345787 -V-LDDVRALPNATVHVWYESGAASALP------------VdGVHAG------------------TMDVRSEEHTSELqSRQYLVCrlllekk--KTI------------kyeSTXX------------------------------------- >ERR1712168_1470941 --------------------------------------------------lmLTCCkiqKPRNMLMGFSKPWAPQLIVFDTLGSLagyYTSIGVKHI-PRHLEHAHFGWMKASINEVMMSELGDAYTADFESGWDKVISFILERQEL--- >ERR1035438_5604951 -EQTNDLARIFNDSYERVMHgpgrSSGEFFVAFYDLLTATSDEAASKFGNTDM-------AEQVRTLQSSVPVLLNFFvSsRQDEYLGKLAERHSKrGVDIPPELYDVWLDCLVETVRQFDS-KFNDDVATAWRTVFSKGIEVMTSRYE >SRR5476649_733261 ------------------------------------VTGVQ-TCAL-PIC---GL--VRGQMFQVTMESLLDFLGDRSygANLIQIERVnHQ-GLGVEPEMFDRFYLTVMATFKDILGAGWTQETETVWGRVIAELTG------- >ERR1719284_537611 --------ELLEQTAPLVAMRTEEIHSEFQSLLLQHNLELLSVFNIPR---QSDDVIdAETeeiasHHLAGVVLAFAAHVGHVQRmrELDQLAAKHC-SHNVHPFHYVVLHEHLLDAMRKALSTMLTPEVQYSWSQSLLFFAKILIDR-- >SRR5580704_16882803 --------------------------------------------PG--------RHGCAAPAFLPGAQPYRRCPR-gpEGPRQPRALSAgtrAR-APKFGERHYEVFRRALIATLQRFAAPRWNETAKHAWETAFNHAATVMIE--- >tr|A3VC53|A3VC53_9RHOB Flavohemoprotein-like protein OS=Maritimibacter alkaliphilus HTCC2654 GN=RB2654_17741 PE=3 SV=1 ---------MIRACLSDLYSVRIEFSRRFYDRFFEQVPEARRLFVH-NQ-------DKQALMLYAAVAMTMRGMESgrdLDGELIEFGKRHA-RLGVKQDMFPIFGSTFLETLIEYLPHHDHPKIAKAWWGGFTDMSTPII---- >ERR1711953_6095 ---------------------------------------------QLGPAdTlciadqaD-GSLSQEIQWIQTTIFqVMLHYT-----------ENvpfHIPP---HKMKFQYFSDPFLGLVHNCLGKEYNSEMRKVYQSVADFLIQTLTEGY- >ERR1712106_122433 -GLTNKQLSLLITSWKSIGSEMQAQGVTLFVEIFKNNKEVIHAFPLLNPNmKgndamtMNEAFREHGIKVMSRVNEVLHNLEQLNLCVSLIKQPvpiTGVFKGLSPISSRTFTSPSSRWPRQALARSTPRKRKQSTRPX------------- >tr|A0A0B6ZHC3|A0A0B6ZHC3_9EUPU Uncharacterized protein (Fragment) OS=Arion vulgaris OX=1028688 GN=ORF61548 PE=3 SV=1 -GLSARDRKLIKDTADIIFGQlkLQNKGVVFLIAFFKAYPHHQRYFKMFrGIPPdelkSIPHTENHGRRVMSNVALLVQHIEEPNVIKEQLVDLlikHN-PRSVKPRQMKDMLNMFVDFTSQQLGAKFTSQHETAWRKLTTHILSVLEE--- >ERR1719502_1452556 -VLPPEQSALVRRVWQRLVGT-PGAAPILVRQLQSVAPEVAALLSDAsstNGRSniNRGglhavhtDPHGRAAAVLSEVSELTELLDDSAALRQRLRQLRARMPPVGPEVYPSVGKAFLHFVWEGVGSGYDNATAAAFAALWDQVEETMLE--- >tr|A0A0K8QCZ9|A0A0K8QCZ9_9MICC HTH-type transcriptional repressor NsrR OS=Arthrobacter sp. Hiyo1 GN=AHiyo1_24440 PE=4 SV=1 --------------------------------------------------------------------------------MKINAFADV-SLRAL--------LVLSSAPAGELL--TTQNIADAVGTPYHHVSKAIVR--- >tr|Q6BBK1|Q6BBK1_9BIVA Hemoglobin chain I OS=Calyptogena kaikoi GN=Hb-I PE=2 SV=1 --VSASDIKNVQDTWTKLYDQwEAVHASKFYNKLFKDNEDISEAFVKAGT-GSGIAMKRQALVFGAILQEFVENLSDPTALSLKIKGLcatHK-TRGItNMELFAFALADLVAYMGTTI--SFTAAQKTSWTAVNDVILHQMSSY-- >SRR5258705_7404034 ----------------------SCPTSSSRPVLWAAvrdCAGGQTLVPR--------RYDGTRLQADGDAGRCGQQSGQSRSRVAGGERScqaSR-RPWREGGYYTPVGAALLWTLEQGFRI-------------------------- >tr|U5EPU4|U5EPU4_9DIPT Putative globin 1 (Fragment) OS=Corethrella appendiculata PE=2 SV=1 --LSENEIAIIERSWNVVKPDLTSAGEAVLYRLFEkyphnQQYFAQFKNVPLESLKGSTSFRKHVIRVMTVLKNAVEALRLDsadekiHELFLEVGNNHA-KRNITKESYNELRESIFVTLTAACE--LNSEEQEVWDKFLNCAFDISL---- >SRR6185295_10958302 -------CILLLVA-------CFLTFKLFFYSMFQDYPEYKNLWPKFRHLndealINTGELSNFCSVYMDGWEKVIGELDDNAALareLKIIAKTHL-RKGVERshimvakkealcqiriheyCYLQNMMPKMLSLLKEKNGT-LDAEVEEAWKTVFIINADIIE---- >SRR6185295_987807 --MSETHLELAQESLGRLNA-TPKFCGTFYQFFLESSPVIPPMFAATEFE-------VQCKQLRHGLGLLLAYAKHKnPILLERVALRHSRgDVNATPDLYPLFLESLLKAIAAHDP-SYSPELDQAWRAAVTPGVEYMKSMYD >tr|A0A0K8S6V4|A0A0K8S6V4_LYGHE Uncharacterized protein OS=Lygus hesperus PE=3 SV=1 --ATPEQVAMVKKAFDPLSVDAPGVGKVFFERLFELYPGSQKYFQHLGStdeeLFANPVFQHHCTKVILSVGTMIDNYTQTtaektKSCLRNWQRFTP-NGKFPPSKHLTSS-IHLWTFFTWNHIQPWRKHG------------------- >tr|A0A0G3G1X4|A0A0G3G1X4_9GAMM Uncharacterized protein OS=Thioalkalivibrio versutus OX=106634 GN=TVD_07385 PE=4 SV=1 ------TPPNVESSYRRCCA-DASFLARFRLALRAADGQVSGIFDPLSA-------RQQEVMLDASIRAALDFSSGDPqgaSRVSEMIHVHGRqgRVPVPPALYPVWLESLIQAVRETDP-HWSDALERRWRAQLMPAVDMFVELYL >ERR550517_2232778 ---------------------------------------------------------------------------------------gpDQ-PKAIPHRCLPQkhrhtgsisrhHGARFLQCCPSHLAE--AQDVERRDGGLLDGSFQSDHEHH- >ERR1719309_231760 -TLTEEEIQTVKTMWAGLLENSADSGLFIFQNFFELYPEQVHRFSFIrDSQgnpipnyLKSQAMLQHSAMVMDALDGVITGVFEHDPLLGqmmyNAGYSHH-SKNIAKDDIEKLSNSILEVIKLVASCegSGKATKVEAWRKLLNIVNERFEQGF- >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1 ------NLGLVRECWDSICEQYttNELGEMVYDHLFKMAPNLTMLFTKPR--------SYMAVKMGDMLSMLVSFADSSESMkqqISWLGLRHV-KYKIRPHHIPLMGPVFLAVVAEAAGVHWSQDTEKAWSVLFNMVCVNMADA-- >tr|A0A0W1L270|A0A0W1L270_9GAMM Uncharacterized protein OS=Pseudoalteromonas sp. H105 GN=ATS75_15205 PE=4 SV=1 MGINTFEKQLLLNSLTIIKPNFHCFSYTFQMHVKR-ES--------LDMLcLSSs-KINEKTYILYCVLERIVMHLDDLRTVTPFIKHYanNLSNMGMSYEDTDILCNSFLATLKIHLKGCYSPKLENVWQQAISIFRSIVTG--- >tr|A0A063KVI9|A0A063KVI9_9GAMM Hemoglobin OS=Pseudoalteromonas fuliginea GN=DC53_02740 PE=4 SV=1 ---MNTNQSVLLKSLQIVKPNFHAFTARFHRKLAE-SG--------IVMNyPTAn-QFNEKSYTFYCVLERIIKHLDNPSSVTPFLTHYleHLNKRNIQQTDIKILCDIFYATLEAHLGQHFCLQSQTAWQEFLTFFENCTNS--- >tr|A0A1V9ZUY0|A0A1V9ZUY0_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_00581 PE=3 SV=1 --PTPKDEELMTRSWDNIIGAkiraelerrklktidadDefeAssvvQFYDVFFAKLFTINPATQPVFRG--------SMHVQSKALVNIVGAIRHILHSEdaTSNIAALALRHI-QYGVKLEFFDSLGLAMIETLSAMGDtGRWNKDVRDAWHTVIAYIICILVPPY- >SRR4029077_13489679 ----------VQADVHAISVM--LNLMQPFRALRRRVDQFAKLWLD--------PLWKTGRKAARIPA--TSTSITGRTGFAGRGRT------------------------------------------------------- >tr|A0A016SWG0|A0A016SWG0_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0168.g192 PE=3 SV=1 -QLTSEEMDLLRSSVRIISENATEVGCNTYEMIFEQSPYVKEFFHFTKSdddAYRQKQTVQLAQKYMQVLIAFVEGIEDPSIlepVSAKLIEIHRKvddVQ--MAAHWGVFTECTLYNIRKALEKDehFNDMdrldaAVMLWRMVIRGIVRRLKA--- >ERR550534_835606 ------AKKIVDESMNLLAKcDLDEFGTTFYSTVFSLSVDAQQYFYKP-----NAMMKFIAKKVLTIIAAVLHEPDETAHDIRAMGLRHM-KYGVPPDYFPLFGESLTAALPGVLEGYWDDSVRTSWEGIFEFVKNCMTR--- >ERR1712025_717817 -TLSPEHVDPITESAPSGKAKGMVIANNLYRKLFSRHEMFRAMFPEQS---------QQSGKMIQALPSALydfavncDNMGQMQSVVARIANRHV-QQGVQGFDGTFQFIPKKVDLsliPAGQCEAKLKVALNARQPGtgvgdrFQLHPSEVC---- >SRR4051812_15383594 -PMTSDTIALIRASFRLAAADPQALSQVFFRRLLLRSPGVQRMFPAS--------LVRDPQRLVGLIDQVLRLLDRRDmlvEGLQNLGRLQA-PYAALPMHYPLIAGAFREALALRVGTLWSVDMEESWAELQALVIRIMGA--- >NOAtaT_7_FD_contig_111_1754_length_212_multi_2_in_0_out_0_1 # 1 # 210 # 1 # ID=13324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662 -RLKPKDAEYLQDSWKVFLERsggLEGAGKEFYRLLFEKEPDLKKLFQV--------PEMSQAAAFMRAISRYVSLLAQPEQLktaIEMLAFMHV-NLGISETSIFAFAESLLECVEDQLHDWDpgeVEQVMVLLTDLTTYIGRVIA---- >SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold554780_1 # 1 # 420 # 1 # ID=554780_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.669 -VLTSSIYlttgTVVTDFSVIVLDAegsAIEPGEAPYSLRVYFTPASTGTstatIQL--------PSGLISDgMLAVGARRLQEETINPRRLagaCEAYGATVTSnvlTVNVrksgTASDPCDSTDAISLLFAGGMATWNslgTSVTSADFtmstnvdsdsvTYRLTFEENVFL---- >SRR5690554_337115 ----DEYVKLLETSFQKAVENvgIEELSTRFFSRFFETFPETNSLFKGTNIDYF---RKFKMRVIFDFLIDIVKHPNYAEAHIAQEVMRHQ-MYGLqDKEYYFTLAACLLEAVKSALGDAWTDEDESAWNDILLVFKG------- >SRR5690606_3594538 ----EHHLSVVEQTIQQAIGKsgEEALAAELFRRYFERFPETKeRYFHATNIEYF---GVRKFRIIRDFLIDTLKYPNYAEGNMYNEVMRHQ-VYGLkDKEYYFGLIDALMESVQ------------------------------- >tr|A0A0K1PX98|A0A0K1PX98_9DELT Uncharacterized protein OS=Labilithrix luteola OX=1391654 GN=AKJ09_04675 PE=4 SV=1 --------VVLKESWHLSYRRAPDLAARFYEELSWKYPSARRLLDHVF--------GAQNdiaVCLSTVAGDLLDNVDDPDAFSAaivALANAHV-SLDIPPHVVAWMEEVLLDTLEGAAGDDWTPEMRTTWRNAYEDLASRLAR--- >ERR1719468_1094774 -PLTSNDRKLIVRSWTIVDQQISQVGLSSFLELFRRAPETLSVFPFLkQLGPEdmefYHQLKNHSIRITGVISMLVKQLESEErpadeairDLLLDLGRRHF-SYGAKTSHMELLGRVFAESLQPIFEGDpEAKAIQEAWLVFFSVIVFWLQKGFR >ERR1719183_1674583 LALTTDQIEAIRSSFGMVLaaaPSKEAAADTFYQTLYDASKSIQPYFVT--------PRAVAALRFVQEVSAHLSVLDDPKQLKTLVETRsfnHF-AIPVSVAAVAKVRDAIMDLFAAEIGKKFTEEAKLAWKAYFNYVGGAFI---- >ERR1719458_172070 -NLTEEEKKVLRSSWDIISQKVDQDGESRFLHKFESNQETEDPILQQFT--QIDASIFNGKSAMIIVALTLENLE-------KSHQTrtrSL-W---------IWSTT------DVFRLDWST-FRY------------------ >ERR1719278_416587 ----------------------------------------------------------kNRRRPVA--TFLLKNLKatsesslYLPGLWSTIR------------------TTIPVPVrrRQPLRLSHP----------RDLLRGCKQRPQ- >tr|C9CRM3|C9CRM3_9RHOB Uncharacterized protein OS=Silicibacter sp. TrichCH4B OX=644076 GN=SCH4B_0097 PE=4 SV=1 --ISSRDIDLLQSSCATAFLKKGVLASAFYNKLFEIEPAYVNKFSNI---------NKQKIMFEAMLAYCISGITSgykVEALTARLRSYHM-HLEISDIDIANARSALMYALGSVLGEDFHSDLKQAWDAAFSSVSEALR---- >ERR1719419_503384 -DLSPKEILDIQMSWAEIHQEGlVNPDVLMFKLFFEESESGRLKYSHLlkNVNldnlnwmrdwTKVQKLKDSIDKTGEALGDVIKSLNYHDRVVDKLYSHgvvHA-KFGVTRKEIHTFCECLLMTLKMELGTNLSQEAQASWERLLKMIVEVF----- >ERR1719295_364028 -DLTPEEKRCIQRTIPVILQEAEMIGTKTYLKTFHNYPLSMIYFEPLrDKLvtevkQTDDYLKKHGVLFVKFIGELVAEMDDPDSvdlKLKSLGRFHD-DLGVLKQYLEAIGPLFVQAIRPVLMtqasipsatncgvgvsspnSLWTRDTKPSWIRFFRVIALQMKRAY- >ERR1711860_326342 -ELNSDEKTLIVTCSKQLLEIQKVLGPQMMQQKFQKV-----------------WSKEAGEL-KQLYDMR------------------------------------------------------------------------ >tr|A0A2A6B374|A0A2A6B374_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_54161 PE=3 SV=1 --IPDDeekkLtSQILCDSLSLAIvgngEPPVENGQEFYQFLFTIDPRLQSHFVGADEfmgqdPKEPTKFAKQGQRLLMAIHTMAASFDDSEAFDKTVSdliKRHK-DRHVDPALWNKFFGWFVTFLKSKGE--LTSIEEDAWKQLGIRFN-------- >tr|A0A0B1T604|A0A0B1T604_OESDE Globin OS=Oesophagostomum dentatum OX=61180 GN=OESDEN_07088 PE=3 SV=1 --VSAADvRKLTSASMATVPvsspSDKTKHGNDFYQYFFTHHPEVRKYFKGAENyaaddVAKSERFDKLGNDILLAVHVLTETYENDNVFRGVCRdviNRHV-EGgrHLDPALWKQFCSIWVAWLESKGAK-ISADQKAAWDTLSVTFN-------- >tr|A0A0R3RQ08|A0A0R3RQ08_9BILA Uncharacterized protein OS=Elaeophora elaphi OX=1147741 PE=3 SV=1 --MSHSElKAKCIKVMNeVGRvgtdDEAIQHGKNFYKFMFDHHPDLRIYFKGAENysgtdVQNSDrfNYGFSGQRLLLGVRTLIDIYDDIETFKAYARetvNRHI-KFKMDRTLWLAFFTVLVSSLKEHIT--IDEETEKAFLQIGKEFS-------- >tr|A0A1S0U934|A0A1S0U934_LOALO Globin family protein OS=Loa loa OX=7209 GN=LOAG_01385 PE=3 SV=1 --MSHLEmQAKCMKILNeAGRvgtdEEAIQHGKNFYKLFYVWP-----------------SSGFTGQKILLALRIVINTYNDPETFKAYARemvNRHI-RFKMDRTLWLAFFTVLVNSLKEHTR--IDEETEKAFLQIGKEFS-------- >tr|A0A1Y5FEW2|A0A1Y5FEW2_9PROT Uncharacterized protein OS=Halobacteriovorax marinus OX=97084 GN=A9Q84_13980 PE=3 SV=1 -------------------ENIDQFVESFYEHFFSLTPEIFELFKNSEIG-------KQKNEFKISIHTLLINLSQLDkldSYFKDLGIRHI-CYNVSERHYKLAKESFLYAIKKTYADHWSKVVETKWEEIIDHVTLKMKEG-- >tr|T0SGR6|T0SGR6_9PROT Globin OS=Bacteriovorax sp. Seq25_V OX=1201288 GN=M900_0432 PE=3 SV=1 -------------------VNLKKVIDDFYNLFFNEENDLTRIFRNTELT-------LQKHELQKSLELLLSNILDKEevsKYLRDLGVRHI-TYEVKPYHYEQAKQALLLAIKNNLKESDFIKEEKAITEFVTFICINMMNG-- >tr|A0A2E2XNM9|A0A2E2XNM9_9GAMM Uncharacterized protein OS=Cellvibrionaceae bacterium OX=2026723 GN=CL693_20675 PE=4 SV=1 ------DIDWIESSLELLAPHADRLGGLVYPRFFVHFPEAETLFGG-GELG-----KSTQESMIVPLLMGLKDIADGKtymlTIERWLED-HR-EYGVTLPMYSVMLDSLLLGMREAVGDLWTTEMDGAWQEVLARLLLLVEGVY- >tr|L7L9M1|L7L9M1_9ACTN Uncharacterized protein OS=Gordonia hirsuta DSM 44140 = NBRC 16056 OX=1121927 GN=GOHSU_25_00750 PE=4 SV=1 ------IRQAVLESLARYEESHGDPTRAIYERFYRVHPEAIEELAF-D--------TVLENRMMAGILALLADVADGSidpgGAVYWVSD-HV-AWEVSETMIMGMFGAVRDTVREGLGPEWTARMDADWAGLLAALAPAMRDAV- >ERR1719478_64653 -SLPTAQIEAIRNTLNMVISaapSRDAAADTFYQTIYDASRIIQPYFVS--------PRAVQALKFVQGIANDLAVLDDPPQLKilvETRSFGHL-ALPVSVPLVVKVREAIMDLFNVELGSKFTAVAKTGWTAYLNYVGGAYI---- >tr|E3MNQ8|E3MNQ8_CAERE CRE-GLB-30 protein OS=Caenorhabditis remanei GN=Cre-glb-30 PE=3 SV=1 -HLTPIDREILNKSWAIVSKDMQQVAVNIFQMIFEQAPDAKLMFSFMmkDYkeDKKSNEFIFHAVRFLQVIESTMTHLDDPSQldaVFLNLGKIHAkheEQLGFSAHYWSVFKECVLFHFRKAMKAHnkFSkhkemsfAEIDSAiilWREVLRFIIDRMKVGYC >SRR5690606_31308825 -FMGYANSDIVLQSYGRCC-RDEPFFEHVYNVFRSQSEDIRDMFTHTDMT-------EQRRLLRAGITWMIMHSRGGgRSKLESLGKSHNrHGYNVPPALYRHWLDALVESVAAYDP-HYDATLEQHWRGVMTPGIEIIASAYX >SRR5438046_4862914 -------SNPIERSFELAAERCEDLTPLVYRRLFDAHPEARTMFRTE-GS---EL--VKG----SMLALTIDAVLDFAGertgHFRLIEaevSSHD-AYGTPRELFVAFFGVIAQTLREIVARTGRTTSMRrgGSCSVTSKVSLQGS---- >SRR6266403_3319847 -----------------------DAARL--SPPVSQTPGSQNDVPKR-RQ---PA--GKG----FNVGADHRRHPGFRRraigELRMIScevQSHD-AYGTPRELFGEFFGAIADTLREILGSDWSPEIE-eAWRELLVELDRVVT---- >SRR6266481_9249308 --------------------------------------------------------------------------------------------TNWRSLVQFALEEIVTDIDLLL--DRIVVAVDavgdqrvaRDDRILVELDRIQA---- >ERR1700744_2408068 ------------------------------------HPEAESLFRRG-PS---MR--CPT----GRP----------RSgtpg------gscwtkliaSAlSA-RHKSRRLKSSLPLEEIRADVGFLL--DRVVVAIDavgdervvRNDRVLVRLDRVQS---- >ERR1719178_87025 ------NKHLIDETMERTADaNISDLGSICHRKLFSLSADVQNYFYKP-----NTMVAYILEKVLYILSNLSHEPVAIAHEIRALGMRHI-KYNIPPIYFPLFGKALVFTFGSTLEGFWTDDIENAWGSVFDFVCRCMTR--- >ERR1719158_1490032 ----------------------------------------------------GGQLSFICRGHSSRIN------------RNALRVRRsrI-TNRSHSNCFSSYT----------RCSISSITCASAWATCLLR---RL----- >tr|A0A044TBZ8|A0A044TBZ8_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1 -NFDDAEIQLLRRSWKTIKPEKQT---------VLQCPEVRRFFPFMNSdlkscEKKNKRFVFQALRFIQvdmtIFNEIIISSF-------s----------NDIAILMLVFLECSIHQIRITLLNSkldlWNRKDvdnvIILWWHLNSGICGKIK---- >SRR5215831_5553854 -------VTDLHRSLEIAAERGGDIYPAIYDAYFARCAGSRDLMELTDIC-------MRGRMLDSLFELLMA--DDAASQVAYLhfeTKNHS-SWGVQPQMYDNLLTATRDTVRGACGPDWTPAMAAAWDARIGDVIR------- >tr|X1ZVE5|X1ZVE5_CAPTE Uncharacterized protein OS=Capitella teleta PE=3 SV=1 --LKTEQVALLKSSWQQLCVKrsPYFLGRQIFLRVFELNPEIKKSFQFGEFHgndlINNPMFKIHVKNFVSVIDSSIRSVDSLKTVlAPTLhtlGGTHQSVEGFNKNNLEIFLKAMLLVLRQEFKSALDvddLEVEVAWRKLLEFIVYQIHIGYR >KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1083625_1 # 3 # 881 # -1 # ID=1083625_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.686 -----------------------------------------MEYEI--------CLEPSGIRFMADAGQNIVEAAKQHGIpIKHGCASgscgdCK-GTILsgDSEQGPFMPLLLLPTERAA-G-------MAILCKLYP-RSDLRL---- >tr|H3NRG3|H3NRG3_9GAMM Uncharacterized protein OS=gamma proteobacterium HIMB55 GN=OMB55_00005550 PE=4 SV=1 ---SQSDIAIISESLTLCGDCLEDITPHVYRRFFELDASAASLMEYSDEH-------MRGRMFASVLELFLSdDPFESDGFLAWELDNHVSSYSVTKSMYESLFKAFFEVAEETLGEDWSGDFERAWTNRIARIMAEVS---- >tr|L7MTK4|L7MTK4_SYMRO Neuroglobin OS=Symsagittifera roscoffensis OX=84072 PE=1 SV=1 MQVSEEQQSLIMEDVQVLLPNYDDFVEDVLQQFMEENPETFQIFPWADASKtakemrSHPRFKSHAKSIGKVISDCLVDLNGVKKHepkLSSLGAMHT-KKKVPTELFGKLGGCILTQVVKRVSeAKWSEEKKEAWLKAYGIITV------- >SRR5215204_501118 --VTRRDWQRLLENWERLQPSADRFATVFFDTLFAWEPQARQLFGGA-------TLETQFLRFAHLLTSLVSAQDHPDELDRRIDAViRCFAGgDPPRKREDAIRVAVAAMLNDVYAAGITPETRASWQSAYIGVITTIRS--- >tr|A0A1W2GS79|A0A1W2GS79_9BACT Uncharacterized protein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4044 PE=4 SV=1 -DLNIRERKNIRDTWKVLAPNIHEFAFSFYSNLHSLDSSLVPLFENE------FGIIKQGDKALYVLGFVVASLDNLMvareGIKKALEGVFMEHQHIKRADEQKVMKAFLQAMKSTLRGVWTNEIAISWYRLLSLISAVSI---- >SRR5512143_1477374 ----------EPhDSCVRCF-AVPTFVGRFYARLFSEHPDVGRYFVGIDCA-------RQEQLLRASIPLLVLAPGgsaAARAALERLGRHHGpDGIGVEDVHYERWIACFLATVRD-CDHGWSPAVDSAWRHTLAHGVAVMRRAA- >tr|O97381|O97381_ARTSA Hemoglobin C1 polymer OS=Artemia salina OX=85549 PE=2 SV=1 -GLSGLEKNAILNTWGKVRGNLQEVGKATFGKLFAAHPEYQQMFRFFqGVQlaelVDSPKFAAHTQRVVSALDQTLLALNRPSDFVYMIKELgldHI-NRGTDRSHFENYQVVFVEYLKETLGDSVDEFTVKSFNHVFEVIINFLNEGL- >SRR5512139_12076 -----TDLELIEASIEQMLDLETEIIGDTYARLFAHCDGARALFGPNTYG-------PRAQMVN---ETIIAGLDLLRGepwvheYMTQHGVRHRHSYEVTDAMYRTYAESLLGAIRERLGDRFTPELEAAWS--------------- >ERR1719193_549257 -IFTDDELAILKDVWAHLKHHTAGAGLTILDHFFKRQHWALERFEALrDMYgnihpdyMKIDLMRFLAVDLMEGIDIFVTGFFERD---PEVTDLiadvgyaYV-KKIIIESEIEIFVDSMLAAMEELLGEDtWK-KNMAPWKKLMPVVAEHFSRGFK >SRR3989304_6997408 --XMTTNLDAVTASYHRCRA-SAGFFDTFYECFPARSEEVAEKFRQTDFT-------RQKLMLRESLISMLLFnlgTGSARAELEQLAKRHSRdRSEEHTSELQSRLHLVC----------------------------------- >tr|A0A0P5Q0G6|A0A0P5Q0G6_9CRUS Uncharacterized protein OS=Daphnia magna PE=3 SV=1 -SMKGRGSCFDQGHLESCKKN-GNIAPKAFIRYLKLKPEAQKKFAAFaEVdladLPTNSHFLNQAYTCLAGLNAYSDNLGKNPKSCPYLNSPAF---KdVKPDELKLFGEVMFNVMEKNWTIIFPRQARKAWKDGLTACDVA------ >tr|A0A2D4BL26|A0A2D4BL26_PYTIN Uncharacterized protein OS=Pythium insidiosum GN=PINS_002968 PE=4 SV=1 -------------LEKQQNYKVTTLYDVFYAHLEQHSPELKPVFRS--------SVHIRGKVLVHISVGMRTLIASEnfVDKVLPLTKTHR-RFGVKPEHYEPLGRALLHAMQVVAL------ITRDRGRVEEPTSIILIQ--- >tr|A0A024G680|A0A024G680_9STRA Uncharacterized protein OS=Albugo candida GN=BN9_028420 PE=3 SV=1 -------------LdGMQPAERMELLYDTFHKFLELNAPELKPVFKT--------SKHTRNVVLQHIVGGLRTMLAQNvhIERVRALTKTHL-QFGVKMEYFDLLGQAVIFSMRQCSGTHWTNEIEEAWRRLYGHCSVILLR--- >ERR1719474_2118124 -SLNPTQKCVIVATWHSIFlKHMNFMGKQLFVDLFKVEPNILKYFDAFrDVGlanlLQSRSFQNHGVRIMNLVKFAVENLDNPEKLqdhMHALGRLHV-KKGIDSKYLNIMGPTFCQAIRPMVMaeGQWSIDIEGAWIQLFKILAQMMRVAYE >ERR1719244_357615 -WFVPTEKCIIVATWNTIFfKHMNTMGKHLFMDIFKMEPNVLKYFEAFrDVGlsnvLQSRAFQNHGVRVTNLVKFAVENLDNPEKLkdhMLMLGRLHV-KKGIESRVLDLMGPTFCAAIRPMVMaeGSWSLDIDSAWAKLFRILVQMMIPAYS >tr|A0A1I2S201|A0A1I2S201_9CORY Uncharacterized protein OS=Corynebacterium spheniscorum OX=185761 GN=SAMN05660282_00995 PE=4 SV=1 ----------------LLRQESGHLEPELQLQLYARHPNAQWLLRA--------G-KAVPAELVELSIHAIAAADAEgaldALAEARIRDLglaQR-RFGFPSELYQDIQEIMVSLLRTTGAD-LPFPVEFAAERTIARVCVLLQE--- >tr|A0A2S9Z387|A0A2S9Z387_9CORY Oxidoreductase OS=Corynebacterium sp. 13CS0277 OX=2071994 GN=C1Y63_03975 PE=4 SV=1 ----------------ALTRHPELFRRAVTATFTGLCPAAGVLIA----------QPAAHADLPVACAWVLRNSAE-qvsDYAAAVIRQLgceHR-RSSTDPAHYALFARALRAGLDAVAAEDdLEPADVAHAAHLLEHCCTLMRD--- >tr|W5Y4C7|W5Y4C7_9CORY Putative oxidoreductase OS=Corynebacterium vitaeruminis DSM 20294 OX=1224164 GN=B843_11695 PE=4 SV=1 -------------------RNREELSAIAFDMFFATQRDARTRIRA-------------TPAIADALTLLARSCDSEgklpLDVEKRFLQRattLC-AHGLRVDDLEPLAESAHRAMLITAGG-QPFELVLPIERALQQLARTVVE--- >tr|A0A172QXP0|A0A172QXP0_9CORY 2-polyprenylphenol hydroxylase OS=Corynebacterium crudilactis OX=1652495 GN=ccrud_12565 PE=4 SV=1 ----------------LVEDNAQDFLRAVKAQLLQLAPQSRGHFPT--------DDDLTHISIAETLSALLDGTGKEgevdEGTLAFFQEAaldAR-RFGITPDMLKALGEAVRTELLELCSD-LPFENVLFAERAIAATSAASIQ--- >tr|A0A1W1UZL1|A0A1W1UZL1_9CORY NAD(P)H-flavin reductase OS=Corynebacterium glucuronolyticum OX=39791 GN=SAMN05660745_01670 PE=4 SV=1 ----------------RLRSVSPEFHEHVRANFFDKCPETMLVFPL--------HKENVHADLGRVLSFVFDRTPVDghltDEMRTLITQLgkdHR-KYNVSPRYFHPFVECLRDSLLTLCSD-LQFKYLNGADTALGEVSTLLAR--- >tr|K0YDT0|K0YDT0_9CORY Uncharacterized protein OS=Turicella otitidis ATCC 51513 OX=883169 GN=HMPREF9719_01398 PE=4 SV=1 ----------------ILGAQRTAFRDATVDYLLRRLPRLRRVAPL--------RQRHRAEALAERAVGLVARSPQ-gmlrGEDAADLERAgraNR-RLGVPLRVYPVLAQALKAGLRAAFEAAgePYTAAARDAEALAEAACASLAR--- >tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae OX=1717 GN=mphP PE=4 SV=1 ----------------LRLVTVTAHSIQAVADElraHRAEFIQAANQKP-------------DSPLADAIVQLVDHTDLDghvpESIATSWLQHaaaAE-SLGVSRDYYLTLADASRSALRHICAD-LPFAEVLGAERAITSIANTLT---- >tr|A0A0G3H0V1|A0A0G3H0V1_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium mustelae OX=571915 GN=CMUST_13735 PE=4 SV=1 ----------------LR-ALSEEFSRDVFHSFFRSHPHERLVISP-------------EFPVAAAVSFICHGADANgtlyPETENRLRELaeiIT-AHGF--RSILPFADAITKSIRHYCMR-DDFFGTIAAERAVEQAAEILNH--- >tr|A0A0G3GTQ0|A0A0G3GTQ0_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium epidermidicanis OX=1050174 GN=CEPID_01535 PE=4 SV=1 ----------------TLRAKSPAFRRDVLRDFFSQHPHMRLKFAA--------NEDHAHTELVFALTYLLENPTD----PELIRTLardHI-KVSPGQEVVADFFAILHRQIHRYCAD-LPYEEVRQADLKLQEIA-------- >tr|A0A0F6R111|A0A0F6R111_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium kutscheri OX=35755 GN=UL82_09495 PE=4 SV=1 ----------------------------MVASHfYADVPLARLSFRL-------------QPSLVDTLIAGLSHP----LNITAW---ahdLA-HRGVDRSFYVPLSAALQHAVCHICSA-LPLVDVLAVEHRIDQIMKQLLA--- >tr|A0A2D7G1P9|A0A2D7G1P9_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP96_10880 PE=4 SV=1 ------EQTCIERVLDCAAEDQPDFQQRLYDRFYQLAPSAEALMIHIDEE-------VQGKMLAEVIRLFLSpDVaVTDQQYLLFETKNHAQAYFVEPEMYRALNQALFETLKVGAGRIWSSEVESAVHNRLSKMLHGILEAL- >tr|A0A2E1GZ77|A0A2E1GZ77_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ03_04085 PE=4 SV=1 ------DQAWIETAFDCAAVDNLNFNVDVYQTFYRAEPSVASLMAHIDEL-------VQNKMLSEVIRLLLNpNIeSEEAGYLNFEVKTHIQGYGVSPLMFLSFNRAVYEVLQSSAARVWEDDLAVAVTRRFAVLSDALTEAL- >tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ23_00915 PE=4 SV=1 ------MQSSIHALLEQVATTDIDFDKKCFERFFQISEEGKTLMAHMDRV-------HRGKMMAEIYRLMMArDLDDEADYLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY- >tr|A0A096P8B0|A0A096P8B0_OSTTA Flavoprotein pyridine nucleotide cytochrome reductase OS=Ostreococcus tauri GN=OT_ostta17g00030 PE=4 SV=1 -------------------------------------------------------------------------------------masvgsgat-DDD-GVDVPVSRCPFAhGTVTVDPYPGYVH-G---KNPRVCPRGCVPRPPSKP---- >SRR6266498_4102119 ----------VATQSYR-MHCQgrPAFYSTFYQRFFQHCPEVKTWFS--NM-------HAQYDKFDQALQFLLNYRHGCMEEPTVLSmtaNKHR-AFKLSACQFDEFERALLETLKESAHE--SDRVLKAWETTIR----------- >ERR1719474_730311 ---------NIHVTFDvALTSDPKGFAEKFYRGLLKEQPDIGQLFLDK-----NTTFDTQSARFMAMLMHAIKMLDDTDHFTQSLDSLseaHV-GYGVEIPMLDAFGKSLISQVKqfnieyyqqqqnhkgddqkeETVdilkVGRWTTKQDDSWKWFWSVVVGVMSAG-- >SRR6266536_2537548 -PLSGREREIAMLAAAGLA--SKDIAERLYLSVRTVNNHLQHAYTKLGVS-GRAGLAEQEIKFAEKLTEIVRAMPRLDELLthtRALGARHV-SYGVRAADYQTLGNALLAALAAVLGGSFDAPTREAWTLAYNLVAETMLD--- >SRR3954465_13942299 -PLTGREREIAMLAAKGIL--SKDIAARLSLAVRTVDNHLQRAYTKLGIT-GRDQLADVLAHDTTTHPGPX----------------------------------------------------------------------- >tr|A0A1Q9C6P6|A0A1Q9C6P6_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene41206 PE=4 SV=1 --CVCDLAQCRGRSWAAFFVDI-------QAAYYETSRS--LLFEG--------PSQDP----------ALVALQLPAHVQAlisDGALQGL-GI--PQEHIALLQDCvecsfwtftgqtqqvmatsgsrpgdgladvlFGALFAVILtcLEAKCQQCGLVHQSMSDALGVPDR---- >tr|A0A0E9N6V9|A0A0E9N6V9_9BACT Uncharacterized protein OS=Flavihumibacter petaseus NBRC 106054 OX=1220578 GN=FPE01S_06_00290 PE=4 SV=1 -QMNQQEIQLVCQSWQQAAEEPLRLAILFFDRLFEEAPELRQVFRT-P-------MSEKTRQLLVFFGFHINRLASGSIRRPSFEAYVW-EELLTDAQKGFLMETLSDTVAALLKPDWTPALQGAWGSFRK----------- >tr|A0A2G2R0S2|A0A2G2R0S2_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_09030 PE=4 SV=1 -IVTPDQAIIIQESFARLSTSSDSLIQDILGTIAEGNSDLAVTIT-----FKSQNLVEQIS---TALSHIIDQLhtaDNVAEYVAHFGELLL-AQNVQDENYSSFGEALLSGLENALQNDFTAEVRDAWTSGWAMLSGIMREA-- >SRR3546814_3775940 -----------ERSLEAVMEAGKDITPFFYDRFFALYTEQRANFYHFES--------TSGTMVNEMITSVLALASNEAWLtnsVQNFVAAHR-SYGdIPTDAYARLQDVLVDNLAQDSKSTSLNTsNYCANsl-LYSVX---------- >SRR3546814_13566968 ----------------------FTIYTTLSLNVVLPFVTHRSNFDHVES--------TSESMVIEMITLVLALASKEAWLtnsFQNFVAALR-SYGdIPPDAYARLLDVLVVTLAQVAGSRWTDEFETAWRWYVSG---------- >ERR1719397_23434 -NLTDCQVRLVLVSWPVILEEFQKVGVQCIVHLFEVVPYMKEHFQQLiNNSgkfdpkDGNvmqTVMENHAKLVMNVVHEVVTNIDALDSVTEkliQVGEKHC-KAGVEQRYLDIVGPIFCNAVRPVLLRsgIWNNRTEEAWMEVFTAIASTMRTGY- >SRR6478672_7358577 ---------------------------------------------------------------------SRMp--CNSSTlkrrpSatscTESPTSTSP-WESAPSST-PSSASTYSPRSLRFWATPSPPRSPPRGGEVYWLFALQLVA--- >SRR4029450_1817054 ------------------------------------MARLLRVFNQGNQA-----TGEQSKALPGSgVASAV-QLIDPNApslahVMRRIAYKHM-SLGVCAEQYIVVGHYLSRRWARSSVRRSLPRSRQRGRKFigFLPFS-------- >tr|A0A177B679|A0A177B679_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_02502 PE=3 SV=1 -GLTKTDINMVLGSWESIN--NDEASSIFYRELFNTYPDTKSLFVKFySVdndkLIDNPAALKQLRVTWTAITTLIDYLKkgRIDEANKaidYLIEKHRKIKTFQGPMFNMALEPLLYLVKEKL---TSQAYIDAYKKVFGAIFLTIISKY- >SRR2546427_1691122 -------VVLLQTTFLRAAEMrigKRNITDFIYEDLFLKRPQLKPMFTNQ---------VLQRHKLGKMLGSIFIHLRDQdwiDEHLRDLGAMHW-RAGATPEVYPWIKDSVLAVLEEGMAPsGWNLRCQREGAGALGVSAQGMLMGY- >ERR1719244_673251 -----GQKDLIIASWREIRICLDEVGFDTFKQLFAHHSDIRAYFPAMkKLSSndveMSRKIKEHSTRIMAVLKLFVDNIYDLEKIEPSIedlGRNHS-FRTLLGLFLSE-------RISGQL--AWR--------RCCFNYLNIS----- >ERR1719369_2640530 ---SPSQVDMLRSSWVILVRQLDEIGMKVFAKLFTVHSDIAQYFPQAkRPGS-SVFIKDLSHRVMNLLKLIVDNIEKLEMIRDTIrilGEKHY-QIGVRSEHLDLMGPIFCETIRPILVanNVWTHHVGDTWLST------------- >tr|A0A2G2R4B7|A0A2G2R4B7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_07540 PE=4 SV=1 ----------------------QSASDKFYNVLQNDLPEFTQLFTN-P-------E-KQHMMFYAALRSIDGLKDNktkLAVYLRSIGVKHK-MLGLTHYHMEIGRNAFEQAIFA-GGKDLTHDQRQFYIDSFSQIEKNM----- >tr|A0A2D9F7C7|A0A2D9F7C7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=CMM61_16775 PE=4 SV=1 ----------------------EAVAEAFYAALFREAPDVERLFRD-E-------T-NKTVMFVNALESISGLERGdphFADFMAMLGQRHR-DIGITQQHLKAGWTAFNEALDV-GGGNLTLPRRQFYRDAFKKLVAAM----- >ERR1719378_1531842 --FHPgaDGVHRIGGEESQ--AEVRRQRSLSLPKFLDSLSGEKEKFAFNfDSMgnvlpnfHASHAQKIHSMKIMDAIDAVISEILRDHPIKQRlmdVGYAHY-ELHATSKDIRKLTTAFYKGVKDLIGIDDdNDRHLVAWKDFLNKIEEGFK---- >ERR550534_2245262 -----------------RDLRHPLGLLLALH---------GGFLSFFhGFFgsykadaMQTEFMKNHSIKIMNALDTVIAGITAQQPMREAvreIGRDHY-HKKIDKIHMRQMADGMLEGLKEVIGDAKdSTRKL------------------- >ERR1719192_2788519 ------RREIIGTMWESFREDSVSSGLFILEHFFSTYPDEMDRFTFAsGGQtdketplafiMKRERMRIHSAQLMNALDRNGHVY--GRSpgCMDQapqSHRG-------------NVCRRTGKSSGIA---------VFKWRVA------------- >ERR1719367_1435250 -------KTQLRSTWNVIMSDMASIGVVMFLKMFETHPETLSSFIR-NVYSikeiemdewYQENLKLHAIRVMAIVEQVIHRLDEVGSVIKILMKRglsHK-RLGVQRSMLEKMGRSFVLSIQSPLEEanKWDATVEQSWLSMFRFIEFWMGLVY- >ERR1712004_299484 ---------ILRESWKHLQSRIESLGVVTFLSLFNASSETLHTYLTPeDIATlkeqdkdkmLIEKLRVHPLRIMSVLEKTVHRLEDHQRCLKMLRQYgrkHQ-RFGVPPFMFATWPGVFYLYSSPYWKNlsNGMRTFHKLGKACFNSLHLEYRE--- >tr|S9TQJ9|S9TQJ9_9TRYP Adenylate cyclase OS=Strigomonas culicis OX=28005 GN=STCU_09709 PE=4 SV=1 --------YTVEATWNILEKegMVDRFGQQLYDQLLTKNPRLRVYFYGVDLD-------EQSKTIVRMLGTAVHSYNNPVRTvefITRAGARHR-GYGVTPSVFREMEVAFFKVFPKFVGLDVFEASEEYWKDFWAVVLDLLSR--- >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 OX=582737 GN=TSPGSL018_8354 PE=3 SV=1 ---SSKIITLIEKSWAFVESRCDlmEVSNKFFERLFQRAPALQNMFTKP--------KRVQYVMLAKALDLIVRSAGETKVmneDIKAIALRHI-KYDIRQEHLNVFGSVLVETLANSVGPeNWDEDISAAWASIYGNIAAVF----- >LauGreDrversion2_5_1035112.scaffolds.fasta_scaffold830278_1 # 2 # 232 # -1 # ID=830278_1;partial=10;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.316 --------------------------MAFWN----KHPEPAAQFVAP-------TQdtltdefepeeeqGISKEQLLSALNAAQT-------ALMMIDR-D------FNITYLNqKSVDLLKTHEALFQSIWPNFQATeefllGYCIdlfhanpshqrqmlsnpsNLPYTTTITVKDV- >SoimicmetaTmtHMA_FD_contig_51_4416696_length_1368_multi_2_in_0_out_0_1 # 1 # 216 # -1 # ID=2511055_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.685 ------KVALHTVEFAVADPSARATI--------------------------------------------ATHGLTPDDMAMLLSKRE------------LIGPAFPALLDEFYGKVVEN---------------------- >SRR5262245_66279004 --LEPTDRIRAKQSYLKHCMGKNDFYRKFYERFFQGPEGTmakEMFADK--------DLNQQYVKLDQSLHYLLNFGDQDmmePTVLTTTATIHQ-TKGVAPEQLERFIECLIDTLSKDYQV--SGIEVDAWKNVCGP---------- >JRYH01.1.fsa_nt_gb|JRYH01001677.1|_10 # 8312 # 9718 # 1 # ID=1677_10;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.684 --MPASWVTELQEIWQDFNKrvgSRQAAGEIIYDAVKEAAPRIVIDdFRIP--------RPVWSSRFVDGISSLIAEASDLKMLRKRAEAMgfsHM-SLALSIEKCELLRDVVVSSIEQECgpgKFSAQCIARKALTIVLNYIAGALL---- >sp|Q7M416|GLB1_LIOJA Globin-1 OS=Liolophura japonica OX=13599 PE=1 SV=1 --ISADQAKALKDDIAVVAQNPNGCGKALFIKMFEMNPGWVEKFPAWKgksldEIKASDKITNHGGKVINELANWINNINSASGILKSQGTAHK-GRSIGIEYFENVLPVIDATFAQQMGGAYTAAMKDALKAAWtGVIVPGMKAGY- >tr|A0A090KT29|A0A090KT29_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X0 -KLTENHRKVIKSSFEIFKKNGVPNAHNIFLRMFKEYPDYKNVWSQFkNMSdeelSQTPLLWKHATTFVFGLERVIRTMDDQEMMILMIHStanQHK-SWGLKKEHFFAMVHLITDILMEEKGEpDEKYAIMEAWESFYDVLGTL------ >tr|A0A0P5DF02|A0A0P5DF02_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 --KPANDRRIIRKTWDQAk----------------------------------------------------KDGDVPPQILFRFI----K-AHPEYQKMFKSFADVpqae------LLGNGNFLAQA-YTILAGLNvviqslssqelianQINALG----- >tr|A0A0K0JIN4|A0A0K0JIN4_BRUMA Uncharacterized protein OS=Brugia malayi OX=6279 GN=Bm1_04635 PE=3 SV=2 --LSEIQQELIRQSWQTISAKLEvneqNFGFFVYRRVFEHNPLLKRAFHVEeyDlldSIPREHSIFRQMRLFTNLIALAVRHDNELETeIAPAVFRYGQRHYKFAAEyfnegTVRLFCSQVVCAVADLLEVDIDPACMEAWIDMMRFIGCRLLDGF- >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1216141_1 # 2 # 73 # -1 # ID=1216141_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.347 --FPDGVCMATIELTVLPVRpleD-----DEKFQIILSEAQGGASFNPNDD--------G----GKDDGvlTIVIKNTLQDPKGLKVLVESFgfqHL-DFDLTVPRVVVFRDSMVELMEAELQDRFTYKAKDG----------------- >SRR5690348_18181078 -----------------SRRRHTRWTGDWSSDVCSSDLETRALFRT------------EGSELVkgSMLAMTVEAIIDFAGersgKFRMIAcEvmSHD-AYGTSRELRSEERRVGKEC--RFGWVAYPX---------------------- >ERR1719191_2635985 --LSTKSLAVVGATLPLVAKAGPSFTQHFYTRIFNAHPALFNTFNISN-----QRTGKQSGALFAAIASCATGLLTsgklPSEMLEGVNHKHC-ALNVAPAHYDVVGEHILGTITDLLNP--GQHVLDAWGELYTALANQCIKR-- >tr|A0A0K2U629|A0A0K2U629_LEPSM Cytoglobin1like [Saccoglossus kowalevskii] OS=Lepeophtheirus salmonis OX=72036 PE=3 SV=1 --LTKKETFLIRESWKLVTPEMTKHAVGYYIGMFVSYPKWQDRFfRRIkGIplrdLRNNPILAAHSSQVFSAVSNLLNNLENTEVIVegvKKIARTHW-PLNIRGKELEAGLVLLLDYLEASFPGQISKECGDAWNKMFNAMSGVIVD--- >tr|A0A2B4SAV5|A0A2B4SAV5_STYPI Uncharacterized protein OS=Stylophora pistillata GN=AWC38_SpisGene8312 PE=3 SV=1 ------------DTFGPK-ESRCREESVCKVRLLELNPNLQDAFPSFrGVsldeLMNSRSLFLHSKRLMAVVEEAVSSLDDAKELIEDLtnlGERHL-AMSITEKHLKNLQRAGPATNQDAKHRLLANKGTAQIDRHIARMEDTRLP--- >GraSoi2013_100cm_1033763.scaffolds.fasta_scaffold146077_1 # 2 # 316 # -1 # ID=146077_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.663 -------------------------------------------------IFESFCLAQ----ML----YETVGMAREPKQERIVS--------------------------------------------------------- >SRR5690606_18427011 --VSHRN---AHEKHQPCH-AKL-------------RPLLRE-----------------PRLLRRLLYDLSGqLTRrAGEVRPERHG-----GAEASAX--------------------------------------------- >tr|A0A0N4TEQ4|A0A0N4TEQ4_BRUPA Uncharacterized protein OS=Brugia pahangi PE=3 SV=1 -PLTRKQKFVLIKNWKGIERDVTTAGIEMFLKMLTEHPEYYEFFNFRNIANtakekqaSDERLSAHGAAVMKFIGKAISQIENADAFFMLLEnngRQHAHRGAFRPEMFWASYSFTCYSFSNGFIRNFFSNI--------NLLLTKVEMSY- >tr|A0A2M8U0Y4|A0A2M8U0Y4_9PROT Uncharacterized protein OS=Ferrovibrio sp. OX=1917215 GN=CTR53_17535 PE=4 SV=1 -PLSPAHLGLVRATFQILAADRDRLTEMFYARAVALDPHIQRPQLV-------SNMVAQRLQFMLVLTDVVQQLDDLPSLaqtAATFARRHG-TYGASDPRFRTARAALAWAVDRILETERNSAIQLAWNAAFDLVEALV----- >PlaIllAssembly_1097288.scaffolds.fasta_scaffold05791_3 # 3730 # 3864 # -1 # ID=5791_3;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.556 -VPTAQDKQIIRDNINILKAKKSNWGAKTMLKLLKAHPDSIKLFPKFaNVPlhelANNAEFLAYGNVFSAGLNFMIDNIDDPTAVKHILSGKDAskyFVPGVSIrQQLEETFRVAIEAIGEELGPRFTPKTRAAFTRVLRFLNQVQDDGF- >tr|A0A1I7S4N0|A0A1I7S4N0_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=3 SV=1 ----MADRQILLKSLEFMPltRDGEKQGVEIYKYSFANMPAMMPFYHLADGftadsTITSDRFQKLGCKLALATHILANLADQPETLKAYAREHvlrHI-SRKVSPRMFRGFFDILVDWMATKTT--ISEEARREWAKLGDLFSY------- >ERR550539_353004 ---------------------------------------------------------AMMQHLVKNLHDISRF---DSDIrelLTRLGQQWL-QKRVPLDFAVLLGNEYLEAvlpffHSNV-GATLALKLEVSLAYLYKEAMHFLLL--- >LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3583117_1 # 3 # 191 # -1 # ID=3583117_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561 -ALAPEAVTKMRAGAEAMLAHPQEAGVFFYETLFDARPDLVSLFRTANMD-------ALSRHLIDTVVFLSRAADDLTGLrddLRNLARVHQ-VNQIPPSEYAHLAAPLLETLSRF-GHPLDAQMIRGWEVLFDRVSRIVAE--- >ERR1719199_1665450 --------PMIRECAAKVVQmDIVELGLRFYVHLFTINPAASAFFTKPKW-----MISAIFGGVLRFYVHLF--TINPaaSAFFTK----------------------------------------------------------- >SRR5262249_23394332 -----------------AIPISGVASELFFSRLFAIEPGLRHCFDG--------CFLGRRRAFEWMIGAAVRGRPDLRSFIQALEFMVAPSDATVHQECERLRDAFISSLSGSLGPRFTVEMMNGWLAVFELLH-------- >tr|A0A2V3J537|A0A2V3J537_9FLOR Flavohemoprotein OS=Gracilariopsis chorda OX=448386 GN=BWQ96_00611 PE=4 SV=1 ---DPETEALIKNTLPIFTKHSQQIAVQLYANLFEQHPQLKPMFCLEFLqTPgqckksPGTGMSPQAKILSDSIVNFCANLDNIDMMNNAIERIcakHV-SRHVKSDHYPAVAGAFSRAVRQVLKNELSESDLKAWDTAVSALAGVLVK--- >SRR5688500_3946624 ---DSRTIALIKESFTPIAGRTLELADRFFNNLFTRQTSVRGFFPA--------DVTEQKRQLPGVIQTILENGDKLENLEPQLREVgreYA-KQGALPTHYGAVARTFVDTVREMSGIGWQARYTRAWTSLFDSLTKAI----- >ERR550532_2368357 -------ISMVAANFKTVKS-NQVLANTLFEHLFELEPSSKALFESK-------DLTQHKTKFVGFIGQGLKMLqgKNAKKELRELARMHM-EMGVTTLHFVFFEEAMLLGLRAAHGDKFDGELATAWTYVV------------ >ERR1719264_1394560 -------ISVVAANFKTVKS-NQVLANTLFEHLFELEPSSKALFESK-------DLTQLKTKFAGFIGQGLKMLqgKNAKKSSGSLPRCTW-RWE------------------------------------------------- >tr|I2K200|I2K200_DEKBR Globin, putative OS=Brettanomyces bruxellensis AWRI1499 OX=1124627 GN=AWRI1499_0864 PE=3 SV=1 -QLTREEIDLLRWSWRLVTVDddSTSLGGNTFnAADFSSYLFCIQFYNNFiSMDekvvEMIPSIRHQASSFADVLNQAIGTLEDLSkmqELLTNLGKLHARILGIERSYFKTMGEALIKTFRDWFGNNetFfPLILEEAWIKLYCFLANSIIQ--- >ERR1719396_178111 ---------------------------------------------------------------AHGPGRLHRRLREQHPGLvpaagaqrPadGDLPPAL-RLVYHPPAVQRGARERDEVHRQGPGGVVTPEIAAAWSEAVLFLSKACID--- >tr|A0A2E6CQF7|A0A2E6CQF7_9DELT Globin OS=Sandaracinus sp. GN=CMN31_05165 PE=4 SV=1 --LDHSTLHAVRSSFE-RV-REPAFAAAFYERLLARDPEIRRRFAHTDFE-------RQRELFLHGLFALVDYASGGatgKLAIERLHAMHGpEQLDVPAALFDVWRDVLLETLAEHD-PEWRGELAVAWRAVLGPGIDAVRSP-- >tr|T0T344|T0T344_9PROT Uncharacterized protein OS=Bacteriovorax sp. DB6_IX OX=1353530 GN=M901_0762 PE=4 SV=1 --------TEVRKCYFRSI-ENPHFPKYFYRNLFFLSPKIEDYFKNTD-------WEHQEKALMLGLSHLFHYFDEQdtfhHKQIVRLANVHSHdNLNIHPHMYYYWIEALVMTCKKVDP-QWYEDLQYYLRETVFFPISFMISLYH >ERR1712080_92393 MSLSAGEITAVTASFEAVKADLGTNIGKVLQKLVAEHPDLKPHFPWHavptADLLGNDGFKTHAAQVGRGFAEAAGNLSNLSaceGYYVSLGDRHK-TRGFAAAQVPMVADAFVAALQ------LTGDDASGWTKLITFVGSSIVSG-- >tr|A0A1X6NYK5|A0A1X6NYK5_PORUM Uncharacterized protein (Fragment) OS=Porphyra umbilicalis OX=2786 GN=BU14_0331s0026 PE=3 SV=1 -PPGPKAVRLLCATAPTLRAAGVPLVHRFGHLLVTRYPAVAARFDVSpaGD--WEGAVVAQVARLTAAFLAAAERMGEPACLNPVLDRIaakHA-ARVLPAGLYASVGDCLLEAVGEVLGDDAPQEVLDAWDAAYAWLGGALAA--- >tr|A0A2S3QTP4|A0A2S3QTP4_9PROT Uncharacterized protein OS=Halobacteriovorax sp. DA5 OX=2067553 GN=C0Z22_01530 PE=3 SV=1 ------DKDLIIESFARIEPNLKNFTNAFFDNVVILEPGMQKVFAHADRE-------QLKASFIRALSITINNLKNPEYLKYYLQGLggnQI-KYEVSETYFPIFEEAFIQTLMLFHMNSWTPKLETAWRDCFYYIAEYIS---- >tr|A0A0N4YFT6|A0A0N4YFT6_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis OX=27835 PE=3 SV=1 -RLSEHQRQIIIETFAEMEHHAVKNGLKMLVKLFSEYPNYKQIWPQFRAIPdsslmNAIALRRHASVYMCGLGAIIHSMKHENELALQMtriAKAHI-KWNVHRSHVVHMLDPVLDIVQE-CNPNYNNEMKQAWTTLYHIIADL-IEIY- >ERR1719487_109746 MIMSAEAVQVVQDSFHRVDScvqIRDALEDVFFPHLFASSTQIKELFADVDL-------NMQAPMFANILNSTISSLNNPTELRPLLADFgeKCKKYGVQGEHIATAGESLIFTMKSI-DDQWDAEVEAAWMAACSAMENAA----- >tr|A0A132A213|A0A132A213_SARSC Globin-like protein 2 OS=Sarcoptes scabiei OX=52283 GN=QR98_0035350 PE=3 SV=1 ---EREEIEVLREQWDRIVHyHQECFGMKLFQRLLQLHPEYRPLFGFEeTVeeIQNTQRLKAHGINVVYMLNMLFDNFDDMDmidELIFKLVKLHM-MRGIDQIWLDDIIEPFELVLEEF-NAKIQIERIEVLRKAFIFIKNRMQELY- >tr|A0A1Y3BHE1|A0A1Y3BHE1_EURMA Globin-like protein OS=Euroglyphus maynei OX=6958 GN=BLA29_010084 PE=3 SV=1 ---CEEELQSLRIQWDKIVHyQQECFGLKLFLRLLDLHPEYLCLFGFTwDEfnYHETNQLRAHGINVMYMLNMLFDNLNDMDmfdELIGKLIRLHL-CRGIQKSWFDDLCAPFLTILEDF-SEKLSIEHPESIYKAFMFIKNRIQQLY- >tr|A0A1Q9DB21|A0A1Q9DB21_SYMMI Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Symbiodinium microadriaticum GN=AK812_SmicGene25788 PE=4 SV=1 -KLPSHDVQILRSSWHQLMDavghDREQLGDVLYVGLTGSLAVLKDQFIT--------PRAVMSLRLFNGFRVVVEKADDPAALLNFTETLafkHL-SYEVTQVRAGLVADTFLEVLTQNVTEELPQGAGAVWRQILMYVGSAFR---- >tr|K1PS51|K1PS51_CRAGI Uncharacterized protein OS=Crassostrea gigas OX=29159 GN=CGI_10019581 PE=3 SV=1 ----YRQIFNIRNGWKSVARVMEDTAKETLIRLLEKHPEYREKYPMIaSLNteeelRESLEFETYAMQIFGLFDEVIQNLENVDAALDEIEHTg----KQLTLQLITDLEECFMNSLHLVLDERFTDTLQENYRLLYGFVKSNIPQ--- >tr|A0A085MKY1|A0A085MKY1_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_01110 PE=3 SV=1 -------ASIIKEQISKIEVN-EENGGKLYEVFFTVKPEFHKFFdlKHAPEgkdVAHNQRFKTLGKLFLEKLKRIVMACEDEHQLKEEIKGLkmdHD-PRHVGLTELKGAKPILMKFIEQQVG--MTEEQKHAWTEMFKKF--------- >tr|A0A183IBE5|A0A183IBE5_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 -------KHVLMEHMKRLNLT-NKLGGKFYHQLFQSlPEAKSQFAEHFDKledVENMKYYQQLGHSLLSLLKELPEHCDDDHALKQEIMKIkkkHD-EKHVDAKMFKKSKPAILKFLTDNTQ--MTNEEKEAWDHLITHS--------- >ERR1719334_3108017 -GLTPKQAQAIISSWENLN---SECSSLLFKQLFTIFPELKEYFGFSKreLvdkILNSEEMIAHMDATWNGLDKLVLSTQTGTRFaaiGKGLGYNHF-KFEIDRQDVHKFMDFFKQVLKDDLKSQFHGDLEEAWNIWCKAVEDVFIMGY- >ERR1719347_1061473 ----------------KIM---KSClKSRLEHSGFRFSHELIMNFGFAKseLvdkILNSEQMIAHMDATWKGLDKLVLFTQTGTRFapvGKGLGYNHF-KFEIERQDVHKFMESFKQVLKDDLKSQFHGDLKEAWNIWCKAVEDVFIMGY- >tr|A0A0X3NNN3|A0A0X3NNN3_SCHSO Uncharacterized protein OS=Schistocephalus solidus GN=TR151324 PE=3 SV=1 --FSEFEKDVLLSTWAVLNEEANKHSAAVFTLAGQMFPGLRNLFDIPcaNTekeNCESEAAKRHREAYMKMINGAIECLEYPREdFYDDLLVAgaHYaTIPGMKTEYFKVIKRATLVTWNSLLGEEFTEDVKQSWQSLLDYIITVISEGC- >tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium GN=CMQ23_00915 PE=4 SV=1 --------SSIHALLEQVATTDIDFDKKCFERFFQISEEGKTLMAHMDRV-------HRG-KMMAEIYRLMMArdLD-DEADyLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY- >ERR1719359_219123 -----------------IDEepmAEVVSGeDALV----AIA-DLlyQKL-------------------------------SGDEAMAQFLENVdlt--QlanNLRSLlalvfngsdWPEMHLS--gSLiddgYEDFSSILQETL----qaSPg-DDALL--ESLDKLT--- >ERR1719487_376807 -----------------EEEgatEEVASGeEALV----AIA-DMlyQKL-------------------------------SGDQAMAEFLENVdla--QlakNLRTLlaavfegndWPEINLS--aSIidegYEDFSSVLQETL----qtCLg-DNAML--ESLDKLT--- >ERR1712100_485805 ---VGHVVLVV---GRCSFEcrnIVVVEGlDGSLDRLLALRkvvgiglGLPilQQL-------------------------------G----VLRHVGNVa-----------------lKVlrchFLQFSNHVLEVRSRLRldefclvgdivievilrDHgggkHeRD--------------- >ERR1719487_109746 -------RKEIEISHPELLKiGLDNVGTTFYTNLFQDSPQIQMHFIKPN-----RML---SYIVQKTIEMIGDLHPKPREVMKGLKALamrHI-KYDAPPEFFGDFESAMLKTLAQSLKSTFTEAVKEAWKAALQFIASTIV---- >ERR1719221_1379514 --------------------vLMRDIPRSAVALFGI-TVAIfeddyRDMNHEPALL--CAVL---LFVTFTvilLMNLLIAQLNTTYV-RIYQDTVgwaLI-NRASTIVEV----LA-TVSRT-KWTRFVDGLGLDEKLE---FNEGDVG---- >ERR1719460_1401436 -------REKIDNTMDVLAKhDMDDLCNKFCNKW-INADEVNGYFDKPS-----GIF---KFILLRILYLVSTIYHDPREISKEARALglrHV-KYSPPEALLPL----------------------------------------- >SRR4051794_36238122 ------ARRTAKASYLRLQGggRERAFFAAFYENLLVSCPDVKPFFVPERMA-------HQQ----SMLNRAIQLLLDFDRAcgCPQLRqlaDGHA-GYQLTRWHYDQFVEALIRTIEQS-G-ITNPAELSAWRTTVMPAIEFM----- >tr|E9HGU5|E9HGU5_DAPPU Uncharacterized protein OS=Daphnia pulex OX=6669 GN=DAPPUDRAFT_301206 PE=3 SV=1 -SLSDSDINLIVSSWNFLKKRLSSFAPKVFIGYLEARTDSKKMFPDFAHvniaeLATNVEFRSRACNCVASLNYIIPHLKRSFpvLQCPALKNLKT-KYNQHIDILKSLGIIWVKAMQEELDkKIFTDDVRVVWKKLFSVLKE------- >tr|A7RWR5|A7RWR5_NEMVE Predicted protein OS=Nematostella vectensis OX=45351 GN=v1g203303 PE=3 SV=1 -DMTYEQKYLIRETVDNRECVNekDflawRYVCELAAIFLNMHPGLQTYFSEFKhIKiDNINGSHGHPRRLLMAIDNAVTALGDSDSFsayLVELGRRHHgMNFRPGPTHFNDLRKCFLSVIKEILATasLWDFQVEEAWNRLFDSITAMMLR--- >SRR4051812_28599342 -------------------------------------------------------------------------WVRPRSRGGRSPRSrssRS-SARRWPSGRPRPPSTS--RPDMRSGPSscgmsrarwqsifpapsrtgcasPIGVLGDP----------------- >SRR3569832_2950508 -------------KNNKKN-HHPNNHNTKKKANKTTTPKKTQKNKNTNFT-------RQKKMLQMSLNLLIShamGIDIVDGYLHQLAERHSRhRLNIEPHHYAAWLNSLMKAVRQHDP-K------------------------- >SRR5262249_31239692 --------------WACCA-RGGAS-R-AY-------AKSRERHARDGFA-------WRP----RAASGTLRageGEPEGEAHLRRLAAIHDRdHHDIRPEPYDRSLDCLPQAGRDRDA-EATPEVEEAWRDVLAPGIAVMKAAY- >ERR1712048_439078 ---------NVTTIWDSIKAVpgyEEKFGRMLYEKFYEMEPESFKLFKK-TRQpaaedvFSDPVFVQHSLEFVRLLDFFIQVLGPdIelvEESLVDFGETHQ-DYGVTLDTYSSFGEAMTETVEELLGGngKMDETSRRCWVTAYRYMSMHMTRG-- >tr|L1IAP2|L1IAP2_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_120658 PE=3 SV=1 --------NFIVSSWRKLLRKvsYADLGLSIYESV-RDVDELEPLFRFTNRV-------VQGTKFVDMLSSIVDNIHSPAEIYVKIADLaplHH-RKGVRGSQMPLMQEIVMRVFDSTLGDDMLEEEKKAWLWMWAFLTKALD---- >ERR1719336_1989132 -------------------------------------------------------QDRKGGgGTPGKLKVTAKYNDGTefvDefntvifaigrdactakmgleGVGVALNPKNG-KVlhneler-TSVDNIYAIGDvldgkpeltPVAIQAGKLLArrLAGTSEVTTDYVNVCTTVF-------- >ERR1719278_462770 -HLSTADVAILKGSWSVLEEHVTRVGVDFFIDMMTNHEEIKAVFRQMpNIPvyelKANEDLNRHGMYILGVIKKIVGKIDDTeylEKLFDDLSDLPL-LLLQQDRPHHLAKNLPKNVHSGSLYaePpvkvaEVVEELLQVLCV-VDLPHNLL----- >ERR1719186_958210 -HLSTGDVTALKSTWAEVDSQISKVGVEFFLDMFHNHDDVKQTFREHpELPvfelKANEDMHRHSIFVLGAIKTIIKHIDDTeylESFLADLSDKQR-AVGVDANNMELFGKVFVKVMRPVLLekRKWKPEVKDSWMTFFTSIVKVMKK--- >ERR1700748_142917 -------PALVREAWSFVSDRADQLVANFYAELFFVFKEAPMMFPS-DMT---RQRQEFGRAVVQWII-----SDDQDGLAMHLIQLgadHR-KFDVEPRHYEVAGAAMVNAWKKLAGWKWTPAHEAA----------------- >tr|A0A1B6KXW2|A0A1B6KXW2_9HEMI Uncharacterized protein OS=Graphocephala atropunctata OX=36148 GN=g.8863 PE=3 SV=1 --LNDVEVEMIQEGWKCITESEDFFRTAFSSIDF-----TPVNFRE-DEHtdderFSRDFLKSHSVHVMNTVRTIVEDVKNPNSWMLELlriATLHK-LYGVTLEDLRKFQCSMLETLKQCLGEcNFSPPMQEVWEKVVECVVI------- >tr|A0A1S3CW24|A0A1S3CW24_DIACI uncharacterized protein LOC103506299 OS=Diaphorina citri OX=121845 GN=LOC103506299 PE=3 SV=1 -GLTPKMVGLLKCLGVAIKPEAHRHGVNIFKKLFLMDKTVQRMFPKFacdDMcgLDENPDFHKHVDAVMKSILYMMESSGSVPDmksTLALQVKIHK-DLCIPDRHFITFGYAINEYLKETLGAKYSEDVECAVAYFWKFVASEMTAKP- >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 OX=905079 GN=GUITHDRAFT_143733 PE=3 SV=1 -------SARIASSWTELVKKsdYAEIGRRIYGSV-KANDTLEPLFRFTNQT-------VQGTKFVDMLSSIVENINNPQTIFEKVNELapmHH-RKGVKAAHMPIMKGIIVSLLKHVLGDEFTNEDEEAWNWIWQYLTQILD---- >tr|A0A0R3PZJ2|A0A0R3PZJ2_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1 -PFTDEEKSELLRSWKVIEAQKQAVGCDIYEMIFNQL------EPFLCVSikapkELHNKFRIIVICIVGRYEEELSSVNE------------------------------------------------------------------ >ERR1719192_2137381 ------------TSLNFKHLcvQ-QLLKLPCLPRMFETHPEWRNLWQHMGgkLHiddmLTLPRFVRHTMSNLAYLDKIIRDADDQTKTIAsvqFLAKVHA-VQGIGERDFKQL---------------------------------------- >tr|W2T4S9|W2T4S9_NECAM Globin OS=Necator americanus OX=51031 GN=NECAME_11818 PE=3 SV=1 -----RDFFTLKNYWKAIDRKRQDSAQLFFSRYLNQNSENTKLYPKLkNIDgatvDmtcSDSGFEAMAASYLKVFDDVISIIEekpgDVQaacDKLTSVGKMHKtKGVQVQPKSFQAMEEPFMHMVKEMLQDRFNEKAEGLFRKFFDFCLKYILEGF- >SRR5512134_285705 -ALTPTHATLVRESWARLAPGRAAAVHRFRARLEAVSPRTAARFTCLDH-------EAQRDGLMIELDQAIAATGSDDDLVPALARIARrfRESGPASSEYPMVRDALLEVLAEADRGIAPPELRRAWGSLFGLLAALV----- >tr|A0A1E4RL21|A0A1E4RL21_9ASCO Uncharacterized protein OS=Hyphopichia burtonii NRRL Y-1933 GN=HYPBUDRAFT_5624 PE=4 SV=1 -TLSSSDSQVIKRSWTELQNNnkyhKDEFVSRLFGNLLAANPNLKSVLST-DL-----IIRQQSKMFNDMLGFTIMYLDNEPLLEECMNEFvqeNPSIVALGVQYLEPMGLALIQTFRQWLGSaKFHAGLETLWIKIYVFLANCIL---- >ERR1711973_858157 --VSAAHKSLIRSTWTLMKF-NSNVAPKILYKMFTTYPETQKMFAKIAEVStfdlmENKDFLALSYTFYSQFNLIVNNVDNPEIIKSQVARMISPsFFIDpsasIAQQLERANKIILEIFGEELGSSFTDEAAAAWTSLLKIVYEVVE---- >ERR1711928_171062 --VSATQESHP---------------------------------LDLDSHEiqqqrRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFET----LCFRWIQHD-----------CQQYG---- >ERR1711928_123369 ---------------------------------------------------rRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFES----LCFRWIQHD-----------CQQYG---- >ERR1740128_75568 --VTAQEKTLIRATWDQMMF-NSEVAPKFMLRLFSEESQHELGgnFaVEHHLVPggadeglllGSNDGFSNTLDVRVG-----------------------ShLLGNdai---------DVVHDVFQCFLGGSIGRGDlfnglHHNMGRFVQLVDGX------ >ERR1719219_701605 --VSAAHKSLTRSTWTLMKF-NSNVAPKILYKMFTTYPET-QKMyTRLADIPasqlmENKQFLALSHSAFAGFNMIVNNMDDPELIKLQLSKVDFPgTFVYpfpgTSLNTSKPPASSWKYSPKN-SAPLSPRKPLPLELPFELRHQGFG---- >tr|A0A1V9Y3S0|A0A1V9Y3S0_9ACAR Globin-like OS=Tropilaelaps mercedesae OX=418985 GN=BIW11_00005 PE=3 SV=1 -SLSKEDMELLKGSWQTIRKDSKVIGRSIFVQLFREDPNLIKKFRHLDNIpaeqlPYHPKLLANALSVFYVVTSLIDHADDADtcrELVRKVAATHR-PRNITRQHFETFGVAFLHVVSSMMS----ARALNSWQRGF------------ >ERR1719510_1721190 --LAPNDITNVKSSWTTIETILLQVGIHVFIVLFETQPNMKRTFRQYRGKkhselRINEDLQRTIMYLMSNLKRLVRYINDNRATVKFMRRLakkHS-PLELDLGRIDpnEVATLFCTAIRDAKqickdqngKTSWSTEIEASWANFFGAILGAMR---- >ERR1719264_357726 --VGLCDALNIQQVWPRIEQYLLPVGTRMYISILDGRCDKIIFCNKACCRknasksssakstrsvysksvsrtcpnqvILNEELQKFVLLLMGLIRRAAKHLDNPSHSAKVIRKVtkkrFG-KLNIDVTKIAfePIALNFIASVREIMtnTRHWNTETEASYYTLIRNLIAYVQ---- >ERR1719244_2234371 -DLSTNQKNMIRDAYAVFEKNGEKNGADAFIYLITQHPDLKQVFPWGDVSneelRENQVFKDHVYVVFKGLKVAIDRIDNLKATASyyvHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQTSFNNLLQFLVGNMK---- >ERR1719193_348913 -KLEQKDIRAIREGWACITAHpgLEKTGVDWLHLSFELQPGTKHHYKNFTNKtleeiCQTPYMKILAGKYMSEIGILVEHLEHSNFVlmrLENLGHLHA-KMGVPMETLFTM----NIVMQHYFRELYSrqdvpDDCEGAWSKV------------- >tr|B3RTB2|B3RTB2_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54901 PE=3 SV=1 --------------------------------LIKLSPATKIYFHGVDFEkrdsylAKNTFLRNHAARFMEAINVIIGQDMDIfsvESYFRVVGSKHH-SYNLKLEHVQDISDAFLEMARNALKKKFTKSTEAAWRSFFQMVTDAIKN--- >ERR1719229_1707680 ----------------------QQLGVLLFANLFKKQPLCRNLFADSDI-------SKQSLRLLDMFGWLLRSLVKEKnqmrlRTLKSLGDRHV-KYGIKIEFFGPMLDSLSDALQDWFGTNYNTQTRVALTTLFQSACNEMMKQ-- >SRR5438046_805262 --------------------SRRSTG-GSSRS----ARPLDPCSPRPTSI-------GSTGC-CVTPSACCYFPAQPdgePTILARVADRHSRrDLAIDPALYPLFIDSLIDTVKQY-DHEFTPAVEGAWRTAVATGVEYMQSKYX >SRR5438034_714626 -SMTEASIIAFNESFERCMAS-GRFFDVFYDHFLRSSPEIAAKFQGTYFN-------RQKRMLNQRPATTVGQPR-----------RSAReSRKTPAAQFVStcqampsaFVSELTKSGSTX----------------------------- >SRR5258708_7736634 -------------------------------RFTGTSDAIREKFKNSDFA-------VQHQAMADSLYLMAVSVQGGpenLARHDMKRLYPKHqRMEITASMYDVWLDCFVATARIHD-PECTPAIESAWRECLTPGIAAMKSGA- >tr|A0A0G4HCC2|A0A0G4HCC2_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6317 PE=3 SV=1 --------PLIHTSFDNVLERttTEELGVRFYEIVFETAPHLQKLFKK--------PRRLQGRVFANVAALLISGIENPRFLTQELQRLslrHV-GYDIRPEHIPVFGNSLMRTIKEAaLrpspkdgqPFDFSHAHDEAWGALWGRVST------- >ERR1711965_451221 ------------------------------------AGAVR--------P-------RP--------AAVI---GFPFPLFP-LLETADMtsvAVGAHPRLRA-----L-----LRDR-G---AWYLTGPQELASVIGRLERLER >ERR1712012_1094824 -SLTTSDIAAIRQSWILAKDAApfEVHGPAFYKLMFETYPSWRFAFNHMGGhlSievqIENTRFVKHTVTVFRFIDKCVNDLDNPTQILENIkmvAKIHA-LQGIGVKDFIIIKAFICSKSD-KVGAGRSKNSFIFFPRFL------------ >ERR1719431_737524 --LDMSQISDLQRCWSTLQLHMgeQAIAAAFYNDIITNFPSIQKYFKNIwTEStftrtiGNMNDVRKHASLVVSRLTNYMGNLHHLSEVNEDLKELgmiHAARYHITEEVVEQFVSSMATTVADLLTKedLFDPVLCGAWKRFFFMILTFLSEG-- >SRR5882757_2588511 -SLSSRQQILARRFFDAVEASDKPLAAMFHERLSEIDDRLDGLLLE-EE----GCLLREAMVIVRTLSRNVDRLNRMVPIFRAFGRTCA-AQGIASANYEKIAPVLFWIAQECVGSEFSVEMGRALTALYDQLSREMKD--- >ERR1719199_2454663 ---------LLQAVKYVPARefyatfdeaSKYQLRADVYVKFFADCPVGEGYFKQ--------SNTYLHIIAAKLMDVVVAIYIDPVAVVDmisGVGLRHV-GYAIPIELFPPWVTVWID--------------------RWRSIGAT------ >ERR1719199_1562120 --VPADLAEEAKKAWTMLITaagSKDAVGEALYSAFYE-aAPSLHYLFVT--------PRAVQAMRIFVQVNNFVNLPISPADLKNaveALGFWHM-SMDVTVPRCVVFRDCILDLFVAELGRPIEHSRAPKHWSSAVQFPSPIP---- >ERR1712176_999243 --------------------------------------------------------------------SY-AHRDTFDQLadaprtI--FYTQK---------QGHPECSEMVEKMKNIVGDE------------------------- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514 -VLTAEEVQLVKSSWPIISKD-LKVAENALIKHFILHPPIQKLYTKLaNVPiselKDNDEFHAQAATAVKITHFIVNNLDNDELLTAMLSKVTIPaffvDYMDPIHQLDETTRLFLQAVKEELGNQISERTLAAWKKALDHVMLIMSN--- >tr|A0A1I3QX19|A0A1I3QX19_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter neptunius OX=588602 GN=SAMN04487991_1987 PE=4 SV=1 --MDEQMIALVKASLKELQPHAGAVFATFQSKLAQRAPELAYRYDEVDP-------ERQGELLFEKLAIALGGVRFLDRLVPALGGVglDAGSASLTSCDFARLSEVLIAAFAEVSGNRFDPCIGAAWTTLFEELSWHMFE--- >SRR3954447_25823703 -------EDLVKASYHRYCADKISFYKDLYKRFFKRNPDGQRFFVKTS-------MKRQC----RMLDEAVSLLTNFRtgpepTSLSRTARGHA-GLGIEEKHYRDFNVAFVESLQMA-GE-DDEDTLNAWRCMFARGTE------- >tr|A0A024TW08|A0A024TW08_9STRA Uncharacterized protein OS=Aphanomyces invadans GN=H310_08903 PE=4 SV=1 -VLTKARIETCARSWDKVRTAATdkmkSygkpgivlFYDEFFYRLFQRDSTFRVVFAN---------SKERAEVLIKALMFMLNMRADSPEsvanmqnRCRFLGHKHRSYSLVRPHHFAAYTMTCIEVIMYWMGDEASIMVADAWSNVVGFVLRYLLEPY- >tr|W4G1Q9|W4G1Q9_9STRA Uncharacterized protein OS=Aphanomyces astaci GN=H257_12218 PE=3 SV=1 -LITKPRLQLCLKTWEVVQSASTdkmkQygkpgiilFYDEFFYRLIERDATFSQVFVN---------VKERGEVLIKALSFILSMRADDPAdvtnmqnRCRFLGHKHRTYARVRPHHFAAYTMTCIEVIMYWLGDDASPLVGDAWSNVVGFVLRYLLEPY- >tr|A0A2H1V3P2|A0A2H1V3P2_SPOFR SFRICE_008656 (Fragment) OS=Spodoptera frugiperda GN=SFRICE_008656 PE=4 SV=1 --LFGSqEFKACCsgMGMGKIGKGG--IGPPVtsL--tqrnttqalfhvgflPYLRAAIQwctvqvDNSFDYLgIWTepvafSVDPLLIAWlaykpTVKSEASLPAAVKSLSQTQQIp-------FR-RRSTP----------------------------------------------- >UPI000297C1C9 status=active --LDEYSIGEVRNGWENLERRCGtPKAA--A-EEFLHKVSAAIPKTE--------HMQKRASTVWSKLNGLLASMHDQSMFTGQLEYLalrHM-NQDISAAEIETFKGLLLEFCASKLGGMMTPEFQYGVSRLVDAVGASYQ---- >SRR5262245_14724532 -------EDVVKKAYQRHCYRQPEFYRSFYENFFSRVPKARAMFK--D-------MARQHEM----LDFALGQLLNYSqqqsepTTLTQFVERHS-RLGLTADDFKRFGEALIATFDSELRGdCEHHRTMAALEIVI------------ >ERR1712071_238239 ----ERSFTYWKDSAMMELA---------KWNARLQTPR-----------vYEVKwRRKKRNIPGRVGWRVLGAELWVRSSCRRRIRNRPYQEYFVSyvsiSQQLEETARLIIDALDEELGVRFTSYTRGVWSR-aFHFANSIMAESF- >ERR1719204_2878153 ------------------------------------------------------------------GITMMMAVVRGRPVRPAVQDigrAHY-SLRVDKDDMRQLATAMISAISDSVGTYMSPDALDAFTKLFEHIVEEFGNGY- >tr|A0A183IHG0|A0A183IHG0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 --FSLREKELLSVSMKKLEQLEEDNAVKIFIRLFQENPAYKSLFPKLRFmgdadIVNSTALVAHTQLILKMIKTFINGFQNESTCAVVLKRaetAHR-KFDIKPSQVSTLFPILMEILDIS-----HNETQAAWKKLFETFSIR------ >ERR1712232_1039451 ---------------------------------------------------ESEEMRTHATKVMTFVGNGVASIGNPEKCerfraeCIALGKKNQ-ERGISSQDYDIATQPFVDAVEHSwlqagwrqtdaSGSIWPPGAQGAYTKFYGHMAATIKDG-- >tr|A0A0N5DPZ7|A0A0N5DPZ7_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 -NLSAKELQLIEQSWLDIE-NKDELGKEVFKRVLLSNEKIRTIFDLHtcpdDELDQNETFKRHLKSLSLFIGICATSVAvgseRLVSIARRIGEKHVNFRwvTFDAEYWLLIKGIMVDVIASKQRPKEVEKVRSAWNTLLSFVISEIKHS-- >tr|A0A183UUV2|A0A183UUV2_TOXCA Uncharacterized protein OS=Toxocara canis PE=3 SV=1 -RLSPRHRNLIIKSWSKTN--KSKIARDTFVELFKTSADIRSKFVFGDVPikrlKQEDRFLAHCERFVAALDSVIAHLDEIGAVIEnaeALGKYDIsaepihaAmAKDLRNEHWRLFGDILVERIIENDTkqPSGGSEVHAAWKMLGQLLVFHMRLGY- >tr|A0A1Z9IBY6|A0A1Z9IBY6_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium TMED162 GN=CBD22_07770 PE=4 SV=1 -GVTQTQEQLIEQSLTHYAARHGDPYDAAFQKLYAAAPHYEGLFVL-DTD---E---GLRRNMMRTtLEMIATYIDDAYAAENlvtGARLVHL-TYEITDD-FDLFFQITRDVIAEGCADIWSDAHAAAWNTMLKDF--------- >ERR1712150_396892 -DFPSDQKQLVVKTWHYVEDHFNEVGITAFMDLFKVSPESKMIFDFLKLyHtddgKFYDLVTKHSLRILGMVSNLVKELKCKsseaadesiHDIILPLGRRHV-QYKANVIQMELLGLLLVKSLLKPIPKEeVGdkeyGQISEAYLVFFRVIVYW------ >ERR1712062_817879 ---------------------------TAFMNLFKVSSDLRTTFSFFGYvNvddeKFYKLVTKHSLRIFAMASTLVKELKSRdsdasdrfiHDTLFPLGRKHV-NYRSNLIHMEMLGILILNSLMKTIPRDqLNehryKRMNYAYFQFFRVIVYW------ >ERR1712135_246677 ------------------------------------------TL-------SVILKRTAEITAHKIIIVVTFQLKSKdseeadrfiHDTLFPLGRKHV-NYGSNVVHMEMLGLLIVKSLMKTIPRDeVNehrfERINDAYFQFFKVIVYW------ >ERR1719171_2780585 -NLSEEMITEVQKSWSEVLRRvdsKTEIGRIIYDSLFDRLPHLRKMFKTNRL--------TVAMRFANSVHSLVGILNNKEQTeeyVYNMALRHV-QYwsgdgSIAQANMSAFLKAVLIVFDNALDDKWTQRMEEAWGALFSYVGEAMVA--- >ERR1719203_1566926 ----------------P------SHPICLRSPkrFTRRSSAVTGnCCNsstQHTTFPNR---TTSPRPWPVPWPPTPPTSSTSRPSscpavpVEAICHRHV-ALAIHPMQYVVVHENLMAAIAEVLGDIVTPAIGAAWSEAVLFLAKAFID--- >ERR1719253_2317543 --ILSPAGRVLRLRGPGFLPprcrfgrlspnhccsrvspdriavarRPPPRPRSRPTSSPSPRTSTRGc-WAATRSCCSS---STrpttspsprT--SLR--------PSPAPSRPtppTSPTC-LPS-WSPAGPWRPSVTA----------TSPSPSTRCSTSWCTTTSwrpsprswatssrrrsrpagprPSSSSPRP--- >ERR1719253_507459 --LSQSAIDVVVSVAGRDARRARPRAGPRR----------TDp-WRRRRRAARG---G-gpgrragevqtraaeGASTLGHGLVR------RGRALghgLVRHGRGHC-HDS------------------------------------------------- >ERR1719253_479176 --HHQELLHAGVGQPPGAAA--VLQPGPQR----------PRl-HEPAX---------------------------------------------------------------------------------------------- >tr|A0A183EWZ6|A0A183EWZ6_9BILA Uncharacterized protein OS=Gongylonema pulchrum PE=3 SV=1 --LSKRQRVAIENSWKRATKsDAdKHVGIQIFFRILAARPEIKHIFGLQKIPdgrlKYDLRFRRHAVILTKTFDYIVKNLAYKEklqQHFQALGERHTVlqGRGFFPEYWETFSDCMRQTVLLWNK-EKKREITSTWYQLVSKSnFPVRY---- >LauGreDrversion4_1035100.scaffolds.fasta_scaffold358575_1 # 2 # 736 # 1 # ID=358575_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.683 -KMPKDAVQEAQATWQKWImknTDEETAGLCIFEAVFQSMPALQGLFDTT--------TPAQAGKFMKAFTECLQGALSREELklkIETLGFLHM-NIEVTTANTVLFKNAMITCMDKDLQSAFSVSAREVISKLVLYIGGAF----- >tr|A0A0D6M6J3|A0A0D6M6J3_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=ANCCEY_05408 PE=4 SV=1 ---------------------------------MPSCVRTAVTLP-----------YLEIFEPFVVIEGAVMSLDNLPALDPildNLGRRHG-KLEVNGkfrtYYWSTFLECSICIFRKTLTN-------------------------- >tr|A0A1Q9CXH8|A0A1Q9CXH8_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene31162 PE=4 SV=1 -ILLEAQIVEVNECWQGFLdcyAKPEHAGEAIFAAILDAAPSLQTFFRG--------GTALLAGKFVAGYSQMVHNLRNPDGLMGVVEHLgfqHL-DVDINIPRIAIFREAMCDAVSAELGEKLTDLGAYGLRRLVSYAGGALI---- >ERR1719174_1428107 ----------------------------------------------------------------------VVDCQDQRSTLGYPPSAst---SVRCCVEQVARRaflwrkswfLTTLTIFIAGQ-AiLKYSHLDNLATERLLVFLFRAFI---- >ERR1719277_1813735 --------------------------------------------------------------------------------CMCAAETriaHL-IGRASVANMHNLRNAVGSEVCLLSSlAIRFEANHVGWAHVsvadvVAVCSSISL---- >ERR1719310_1375130 -MLPQEQSQQLQQAWALVInmsGNRDALADLIYSAFFYRLGePR-APLRN--------PAGSRSLPFLHGHQHLRRQLRrPwssaqfrrNVELRSHVLGYhrpSG-EHHSX----------------------------------------------- >ERR1719487_3068354 -ILPQEQAEQWRPSASSLVsthSLQSLAIHLAC-VLLLRPYPSdTCTWTS--------LFPVLTSSVM--PSSICSWLSLAASX-------------------------------------------------------------- >MEHZ01.5.fsa_nt_MEHZ011529165.1_2 # 173 # 307 # -1 # ID=206391_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.393 -YMSIDtgnleaakvmlqdlvtiradrsryyyclddlFKWHPDIVWKLTV---------------DAPELLrtmldGMIWRS--------RVVvngnrrvnyylkhllvDEHGKFSNAM-SCIVKLQDP--EIAIHPILvq---LGDLVWNDLVYWrflrgklslVCTAGIFMVSQSMl-QYVESAGSFEERVATFICRLVV---- >tr|A0A2T3VCJ1|A0A2T3VCJ1_9ACTN Oxidoreductase OS=Micromonospora sp. RP3T OX=2135446 GN=C8054_25080 PE=4 SV=1 ---------DPGELLASALVVLSPAADYFWSFMEDRSVRF---LPQ-----------QLAPMFFSTLGQMVAGRGDPAGRRAALAVMgrmYR-RFDLQPYHDTVIAAAVVDTVRRFAGASWVPEQAGQWEkgcrQALRLS--------- >tr|E5XPI8|E5XPI8_9ACTN Uncharacterized protein OS=Segniliparus rugosus ATCC BAA-974 OX=679197 GN=HMPREF9336_01410 PE=4 SV=1 ---------TFVRSFHlELFGAAPELAARFPPGLGEHRGGF---VRM-----------------AEHILETFAEGADPPRLIDLLGQLgrdHR-KHRLDERDYRLAQAAFAKALVATARG---SGDGAFAAraaaLVCQVM--------- >tr|A0A246RU09|A0A246RU09_9ACTN Uncharacterized protein OS=Micromonospora wenchangensis OX=1185415 GN=B5D80_01060 PE=4 SV=1 --------------------------MREADELRSALPDR---LAA-----------HDAELLIATLRRLATD-PEPAAQAVTLTVLghaFR-RFALLPHAKLISALAGAD-------------------VPVELL--------- >tr|A0A0P5RQ13|A0A0P5RQ13_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1 -KLTPHQIRDVQRTWEHLRANRNAMVSSIFVKLFKETPRVQKHFAKFaNvavdALPENGEFNKQIAPVAARLDTIISAMDDKLQLLGNINYMrypHQPPRAIPRQTFEDFARLPIESLEAS---GVSGDDMDSWKGVLTIFVNGVSMRY- >tr|A0A0L0FDI4|A0A0L0FDI4_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12917 PE=4 SV=1 ---TDSEVELIRSSWRALLAGDGTaaqmpllrFVEQYYKRLFRLFPDSRGVFKTRD---------TQSKSLSLLLSIIINVADEPElemnAKKKKLEMMYK-EYGMNSLLAVIAGRVLIQSLQAFLEAsnKFQASVKDAWVKCYTSIADQLL---- >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1 --------DLVLSSWDIVRQRteVQELGEKFWKYLNCMSPEQTNLFRR--------SLSMWGHLLHHIVNMLLISITDPEEYYDLMFELtirHI-RYGVRSEYLNPFGNALFATFEEILSDVWEEKTTKAWKLVWKRATCNMSRG-- >tr|B3RTB3|B3RTB3_TRIAD Predicted protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54902 PE=4 SV=1 -----------------------------------------------------PLVRSHGLRFMKAIETMLEIEFDSNgciFLFSAIGNRHC-SYGIEADYLDYVPQAFRFMLTKALGNNYTDKIASVWDEILSHIIKAMQDKV- >ERR1719347_2568912 ---------------------------LPPPTHFLPLPGINRKVRIFqRQFgnqtsefLTGKALRDHSIRVMDALDSVIVDTLKGKDIHKqmvDIGYSHL-KMGVEPRQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED--- >ERR1719474_100483 -----EYKNILRSTWSKLLENKEEIGLKIYKSIVfDTTstPtgnglSTSIIF-------ENSDLGQSSSRFIDMLDTVISQLDEPEALTRRLEELskmHSDKYDVRKRHYMDFERGFMKAIKWELGAQRTAQHDRAWRWFWDFMLSKMC---- >ERR1719464_849876 ----------------------VLIGCQTFQAFFDRHPQFLSNFDKFNAieidgVLVSSALKMHTSRVLAVVEDIVEKTGNHPRTLGDVR-------------------SSDMSIRPLvFRSgLWTIELE------------------- >ERR1719232_2219129 ----------------------CRPGCVTFTQLFAQYPMMefLGKFDNME-vegVNIGEALKSHAEAIGSVVAEIQENAGNPERIRMSLAGAghrRY-QEGVARQQLDMLGPILAHVIRPLvWEKcLWSVELEKAWTHLFDIVACLMKLGY- >SRR3990167_8699843 ------------------ANQLEDLCRLFYAHLFAKAAHLKPLFGDSE--------DTQNFKVIKMFELIIDNVEDLTQVQPiclDMAKRHS-FYGVKNDFYQYIDEAFVWCIQQQLSLSIQDPIIHAWYAATKYISSIMID--- >SRR5690554_7960028 --------NV--QFVSRGC-GGTRFCSLGFPH----PPSATLFPYTTLF-------RSQRHLlrngVMQIILVAR-GMSD--RKLRDLGESHNRsNYNIKPEWYDLRSEEHTSELQSRPH-LVC-------RLLLEKKKKNLNITY- >ETNmetMinimDraft_19_1059907.scaffolds.fasta_scaffold284136_1 # 1 # 639 # -1 # ID=284136_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.595 --LPKACVSLLRQSWKQVP--QASFRKEFFDRLYIEDSSLQQIFQHPM--------VEVPENAWNVVQLMLDLLNVenvprLERFVHALAGLAFRHGRFRLAHLAPIKRALVRTVTSHASKQEKKKLSQAWEAFFYALAAVAA---- >SRR5438477_815846 ---------------------------------------------------------------------------RPHLSAHECGRLgpaVGQTLGAARNHLDAFGLALIEALNAATSLD-SVPTATEWSDAWDLTVRWTRP--- >SRR3569833_2822653 -----------------------------------APPERHTVLHE--------AIVTNPVEVAGAIGWVVEHLHRTEEVATACGELgpaLARLLAGHEQHLDACGRSIIDAIRTGLADRWKPEFDGATSSAWELVAEWLRR--- >SRR3954471_21372458 ---------------------------------------------------------------------------------------gpaIA-ALGIAPDKLEPMSLFLVEALLAALSPMVPadRTAGGGGRGAGGGAAAGAAQ--- >tr|A0A2T7P177|A0A2T7P177_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_12319 PE=3 SV=1 ---------VITRSWKCFYEKVCSFGVYEFLNLLTDLPEYEEAMRLIKLTSsykflSAMDFNAHFLSMLTIIEKCMARLevDDLpllEDILHKVGTDHI-GRGVNPENFDLVIPPMVAGMKQMLEDKWTEKEDIAWTNFFTLMIHIMQE--- >ERR1712198_190235 --ISSEEK-hVLIDNLKMTKG-NKKFGANmll---KMFLAHPKTQSLFPNFaKLPvsslSNNAEFVAFGKMMVSGIEIFVN-cWVTNpSanislPTSLWINNLRKPPAWX------------------------------------------------ >ERR1712179_658195 --VSGNSK-nAVRATFDQMRF-NSEVAPKiml---KLFTAYPETQKMFHRIaDVAvsdlMNNRKFLHQLLCL-RRIQLHPQQhgrsrDHQTpTvqgrlP-----RHVRLPLPWYLsAapg-YFSHR----IGSVQGRAGRRlh----RR--SRLWMDFSAELRQP--- >ERR1719419_2176015 ----------------------------------------------------------------------------------------------FvfLgssQKTILARsiftkkIVLLTEHTLKISVAVSPLSAADFTILKD--NLKMIN--- >ERR1711946_32375 --------------------------------------------------dEQPQIPVHQLLFL-RRIQLHPQQhgrsrDHQTpTvqgrlP-----RHVRLPLPWYLsAapg-TYPPS----HSNHTARERTAfqvlFLPQDT--SRIVLEVFRE------- >ERR1719222_1795957 --VSAKAKSLIRDSWVQMKF-NGEIAPKIYLKTFAAHPKTLAMFPQFaKVPnrvrPHPYEpLLATAGIDYDVKLWIPSPGSEHNinveeLMARNArmleetrDTITVPATFMIrMlas----------MSNFRR-AGNRSTNDE-------------------- >ERR1719222_245222 ------ARSlgrtqesHPLDLDSHEIqqQ-RRTQNPLQDVHHLSRDPENVHPFGRYtR-------FSAHGEQTVLGFESLCFRwiqhdcqqYGCSRa-DQVAVVQGRLPRHFRLslPwhfSATRANPRIILEVFAEELGSTFTKEAAAAWNSLLNFVTKGLEN--- >ERR1711911_103569 ----------------------------------sraDQVAVVQGRLPRHfR----------------LSLPW----------------------HFSAtranhPhhlGSIR--RRTRLHFHQGSRCrleLPfelRHQGFRKQHRRLATHR---SRP--- >SRR6476620_89806 ---------------------RHATRQQRRPDVF----------HERQRTAGE-D--lnVLRERDVGQ---VHESLARagvavIDGVVPRIGCEVV-DLSSEMQNG--------FPQGVIL-SAAVGVGDDDG---------------- >tr|A0A0K0D079|A0A0K0D079_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1 ---------------------TRDTAGEYHKQLFTLHPELAKYYDAEDIDPdsvlkvcnaddmrylayssaiQAQKFIMLGQQELQCFFRLPTVVNDERSWRSALSDFkeTFGENnNMPMKEFNKVYDAFFAAMQKHAGG-VTAEQKKEWMALFDKAYEDMKK--- >tr|A0A0P5EFU8|A0A0P5EFU8_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1 -NRPPPDP-RCPEELGKHRNGRNALVSSIFVKLFKETPRIQKFFAKFaNVavdsLAGNAEYEKQIALVADRLDTMISAMGDKLQLLGNINYMrytHT-ERGIPRAPWEDFSRLLLDVLGSK---GVSTDDLDSWKGVMAVFVNGV----- >tr|L8DEE0|L8DEE0_9GAMM Uncharacterized protein OS=Pseudoalteromonas luteoviolacea B = ATCC 29581 OX=1268239 GN=PALB_34720 PE=4 SV=1 MSISPYQYRILTQSLAVVRPNFHCFCVSLRTQVS-HFQLNN------ALITKTEYAYQQEDGLFRFIHQCVGLTLDHPALVHFISAQakLLKSIEISERDICVICNCFLSTMQLHLGKQYTLAMRNAWRRLLHIIANILNHE-- >tr|A0A290TM25|A0A290TM25_PSEO7 Uncharacterized protein OS=Pseudoalteromonas piscicida OX=43662 GN=PPIS_a0207 PE=4 SV=1 MSITPYQYQLLTQTLASIRPNFHGFCTSWYNQIQ-HYDLRM------QIPTNVGQLIIWEHQIFDFVQNCVMRIPQQSNLLHYLQKQrgTLLFMGTSEKDISVLLFTFYSNAKKSSWQAFYHSSKKRLEQSTVTHRKY------ >ERR1719262_376372 -DVGEKVINEVIKSWQLLIKRVeskTEIGKIDFDSLFDRLPHLRKLFKTNRL--------TVAMRFANSVHTLVGALTSKEqteEFTYNLALRHV-QYWagdasIAQANMSAFLKAVLIVFDNALDEKWTQTMEEAWGALFSYVGEAMVS--- >ERR1719440_1320932 ---------------LPSLSLPsLLLPSLLLPSLLFSSLLLPSMFVSPR-------L-STAMRFAMSLHSLITSLESTEKteeFTYNLSLRHV-KYWqgdasIAQENMSAFLGAILLVLENALDERCTQAAT------------------- >tr|Q9NAV7|Q9NAV7_9ANNE Dehaloperoxidase B OS=Amphitrite ornata OX=129555 PE=1 SV=1 -----------------LRGDLRTYAQDIFLAFLNKYPDEKRNFKNYvGKSDqelkSMAKFGDHTEKVFNLMMEVADRATDCVPLASdasTLVQMKQHS-GLTTGNFEKLFVALVEYMRA-SGQSFD---SQSWDRFG------------ >tr|A0A0M4CP70|A0A0M4CP70_SPHS1 Uncharacterized protein OS=Sphingopyxis sp. (strain 113P3) OX=292913 GN=LH20_00550 PE=4 SV=1 --KERSDAALMEATLAAVAETGIDIRHTLFERFFSAYPERHPAFLNLDAA--SRRMTDETLQILFGLA---TDEGWVWPLVAELVATHR-NYGmLPTDEYDAFIDLAIDELGRAAGRAWTGAHAAAWRRQGEIL--------- >tr|T1HWR1|T1HWR1_RHOPR Uncharacterized protein OS=Rhodnius prolixus PE=3 SV=1 -SLTQNEKELLKDSWKKRGINKSTLAMMWFTKLFKANPEELlkhnhgqileELFM--DQT--N---LDYMDKLAEIFSIVVQNIDKSTlctKLIWELAMYHR-CLDLTESYFQLLKKTLLDTLIENFHPSLTPEQIEAWKKFIGIMFDIIY---- >ERR1719171_2291403 -------IPRIcgelwrkqtfklrfnilgkqihspgiPRFFQKMENVGgLLVSalllaMCFYDPEIvAHEEQIGIHIIDR------------NDAIYYVLEACNACILWLlvTNVFGfsvQLSAFkHC-VSQMaeDLAKFGTFAVVFLMAFGCAIhiTMPYDPDFEDMWVTILTLFAI------- >ERR1700760_4852051 ----------------------------------AGSPSSPAR----------------------------RPRPA-IATEHdcrtrAPANR-APiTYGSPVD------------------ALACRRAL-NDWFRVPGVP-------- >SRR5690348_16468503 -------------SFWLLEPVADAAMTYFYAELSSAARATWAdrdIYMS----------GPDHMIVRT--ARALVerg-------------------APSRLIHYDLVDPRVTEGQX------------------------------- >tr|A0A0B2UXI9|A0A0B2UXI9_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_18450 PE=3 SV=1 -LLTAHQRILLQKSWNKSQKtGLENIGAHVFLKIYHREPSVKTLFGIEDVPhaelKYNKIFQNHAMTFTRSLDFILANLNKLDIVanfCRQLGRRHTQyiTRGFRPEYWDAFAEALTECAIDWEGGLRCREALNGWRTLVGFLIEEMRIGF- >tr|A0A2W4R8Q8|A0A2W4R8Q8_9CHLR Uncharacterized protein OS=Chloroflexi bacterium OX=2026724 GN=DIU68_09390 PE=4 SV=1 -RLSRQQKRIIQRTFSAVAVRHDLVARLTIERLRELSRTpASTCFGNT---------PEDRRRLMHLLALLVQRMDDRGALHDACVAQT-RQMGCDPFeggSTSLLAEAFIGALQSALAGRFEAKTEAAWREFFQMVERVLR---- >ERR1711911_155006 -DIIRKNCLMLYTNFTATKIAFKWILLCLNCRYFEIKPEAQKLFPAFaNVPL--KDLP-KNYAFLAAVNTCFANVHYLIekagrnpRDCPVFSKVVA-KYDA--RDVKQFGDIMMNSLKSELGSQFTDEIEESWNLALEEIAKMVS---- >SRR6478735_8357209 -----------------------EREIAFLVARGLPsKEIAEQLFLS---------VRTVQNHLQR----IFTKLG----VtsrGEVAGVLQG-LEGPSSX--------------------------------------------- >ERR1719487_2840864 ----------VRQSWAMIQAIqtssAGGFGDALFFNISVMSSEIWSLFSVS-K-------EVMAVTFTDAFTLIVSYIADPVGLAEELfgeADGVG-DVGDDQGEGiregdghDLLGHGEQ--TPDLAAHDGDVEEERVAE--------------- >ERR1719171_2815737 ---------------------agaendeelrensgvedsfasgsvPTTFNEMFLFNLTVMGAGARK----N-K-------AImWMTEVLTSFDTIVANVANSKRLQEECdvlGLRIS-KYPLDFVKLPEFKACMLSSLRSLLPRTWSGTHEVAWSWLWENIERML----- >SRR5262245_17232684 --VEEETRALARYSYLQW-LDDDEFFSAFYESFFAGATGAKGKFRN---------VEQQRLKLRDAMTAVLNFYPGnEPTSLHRLIAVHA-ARDVTGTEIEQFERSFLEVLHQRLVERKIAeqlgpdvvaKIEQGWRELLHPVVQYVMG--- >ERR1711962_392431 -KFTAEELEAVKKVWDSLLQNGQNSGLFFFEHFFKIYPDQRAKFSFIhDQYghiepeyMETIAMRNHTMKFMNILGDLLNQVLSrDKRVKQDLSNLgytHH-ERGLKEDDVLQLEYAVIDGIHDHL---VTDVHERAWRKVFQLIRIH------ >ERR1719510_2339612 -SLTDNEVILIKSSWTYLKPHINTILIESFMSLFAENSDVKEKFYSFkNHAiedlnKkrgvglaSTNGLQRHIPRVSRAITKVVNSIENLDRVsryLEMLGKIHQ-QIGIEVQELMMLGAFFINSSKRHLPSSMQADrhYSDSWLHLFTVISTMMRKGF- >tr|W4GBS3|W4GBS3_9STRA Uncharacterized protein OS=Aphanomyces astaci OX=112090 GN=H257_08997 PE=4 SV=1 -VLTRRHVRLIEANWTLISRGTSSaydetrhgNPDKffhrtYYSLLFAVMPSCRSIFRS--------SMHLQGKSLFAILRAMTSILhcPDIVDRMQALAGRHL-TYGCEKTDYTTAGVTLLKTLEIVSGDQWNYDVKEAYLTAFCLLMYLM----- >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold759411_1 # 1 # 798 # -1 # ID=759411_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.594 -----------AQFWEEHISykslaDKLEIGCAIYFGMMVHNKEMKRILKKNlhhhQ------SIENSSVKFLDMMGWLLRSLLRsdidLCGSLQQLGAFHR-NMGVNINHFDPMLKSMHETFSYYFPIKYGIQIKYAIDQIFTLAARIMTG--- >ERR1712214_179591 --------------------------------------------------------PGHAgRREGRRSARQPGTGKDRQKStkyLLELGKFHR-FSGIPNDYFGVMGTIFVHAVRPYWEEagCASEQTEVVWMMLFAHIARVMTH--- >tr|A0A1Y0I5V1|A0A1Y0I5V1_9GAMM Uncharacterized protein OS=Oleiphilus messinensis GN=OLMES_1782 PE=4 SV=1 ------DQRLFWNSFDRCLsspQRDQQFAEDFYQRLYSSDRAIAEIFDRVSVS-------DQLHAVRQAVYLLQEMtpLKQAEITLDKIQAIHHqHEIRLSNAMLDKWLECLLASVELADP-EFNETVKQAWIDILTPA--------- >tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1 ---TTQQMELVKTSWTDIV-------------LFQKEAPIASLFSFVESAksdadnlLLNTAMQTHVKKFKAAMTSVVDLLPNLDAagqMMQSVGSRHA-NYGVKQMYIMTMSNAIIYALDLSLSArgKFDQATREAWTVFLGAMSRKFTEGL- >tr|A0A2B4SF50|A0A2B4SF50_STYPI Gelation factor OS=Stylophora pistillata OX=50429 GN=abpC PE=3 SV=1 -QMSREHMTLVQDSWHLLKGNLEGMGVDFYISLYKENTDLLCQFPYMSeQStehvmNMDDRVKRKGLVTVQHVKEAVTALRNPGSCVH-----HQKASGFCPRNLQSVGGALLYSLDKSLGQSFTSKEKDAWCTVYGIDVATIG---- >ERR1719199_1566639 ----------------------------IFQHSGIQRPVFSTSSSSR--------RLCRP-CDLSMAFRPSDVLHSSTRLKAQVETMgfgHL-HLDVTPARCKLFHGALVDFFVVELGDKLTPLAAEGWKRVLTYVASGL----- >tr|A0A0B2VDB7|A0A0B2VDB7_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_13543 PE=3 SV=1 -SMNDDTKGAICEQWHTILALydgdISRVGVAVYQRIFDAEPQLREVFGIPsFVtdLSEYEPFQRSGKLFMSVVDLCVRNIYALDAEmgpvLVMYGRRHyhQQSRGFHLRYMPIFTQCMKEFVSDCLNEKQkTSDSEDGWSLLFDYIAAKIVDG-- >tr|A0A2C9KGE7|A0A2C9KGE7_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 PE=3 SV=1 -LVTDSDIQALRSSWATLTAGPdgrNVFGNNFVLWMLKTIPNMRERFEKFNAHqsdealKNDNEFVKQVKLIVGGLQSFIDNLENPGQLQATIERLaaiHLKmRPSIGAGYFGPLQNNIHDFIEDTLKVGADDAAPKSWTRLLTAFNDVLNSY-- >SRR3982751_838383 -----GINDQLRESAAMLTSGGteatDAVIRDFYIALFRNAPSLIAIFPG-NPAQGdfgsDHRGAKQRELLLGALAGLADLydpgdaerMTHLDSVLKRFGRSHAAfTrpdgtvSGATLDEYKAVKDALFSTLVRAAGDRWRAEYTVAWSQAFDYAAASMLL--- >GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6338290_1 # 1 # 129 # -1 # ID=6338290_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.636 ------------------REAGlEQYAGALLRSGFDDLEtllaiedadmkdLGIPaCHVVRlRKKlqelqrqrsgtrgdFDASNP---VVAFL-----ENAGLGQYA---KLLLQNgfddmDV-LLDIEDADLKDLGvprghaIKLKKGLRELQLQQYAQEDPMPLHAAA------------ >LauGreDrversion4_2_1035121.scaffolds.fasta_scaffold1378443_1 # 2 # 412 # -1 # ID=1378443_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.550 ------------------------AVRELLSEAVRCVSRGKEHFASIDME-------RQCQ----ILNDAIHMLLDFQAergnaPLRDLAARHK-PFGLTRRHYDIFLTGLLEAIAES-G--IDAAHLAAWQKTLTPAVDFI----- >tr|A0A0P5AEE1|A0A0P5AEE1_9CRUS Di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1 -KLtp--HQIQDVQRSWENI-rNGLNALVSS-IFVKLFKETPRIQKFFAKf--aNVAVD------SLAGn-------------------AEYEkqi-ALVD--TPTPNVEFPV-------------------------------------- >tr|A0A164VL64|A0A164VL64_9CRUS Hemoglobin OS=Daphnia magna GN=APZ42_022506 PE=3 SV=1 --------------FAKF-gS-----------AAVDSLPGNAEYEKQVaLVadrlDTIISAMDDKLQLLGn-------------------INYMryt-HIERGIQRGTWEVR---------------------------------------- >tr|A0A1Y1Q0V7|A0A1Y1Q0V7_9GAMM Uncharacterized protein OS=Thiotrichaceae bacterium IS1 OX=1934244 GN=BWK78_10305 PE=3 SV=1 --------ELIGQSWDKLAPRQTEFIDAVYELLFQQHPHYKPLFSE--------SIQREMAKMVETVAMVARvsGESEIsHPRLIKLGERHS-PLQLNRGDLENFKTAYLTVLKQFCP-EWTTECELSWEEDQSLIPG------- >tr|A0A1B6JRB7|A0A1B6JRB7_9HEMI Uncharacterized protein OS=Homalodisca liturata GN=g.2446 PE=3 SV=1 -SLTDRDLRLGRATWFKNVDATPDFGMVIFKELFRQYPDVESYFLHLRGnAgsiFDSRTFRSHMtERVVPKLKEVFEALDKPEHLnevMTKLGLYHA-KLGVSGHLVENMLSVILDALKSVMHTKMQPDEETAVRTCL------------ >ERR1719323_1074371 --IPFEQRTLITEVWNVLQESTiRYVSNtMFLPLIVRSNKSLQKCFAALDQSlhgmelvecYGSkFDRTKHGSLFLSkLLIRVVPNMDQMDRVLPYLAELgalHQ-RHGVAKQHIDLLGLAFCAAIRGVvagGGvkGGHLHETTKAWITLIQAVCTGMKMGYT >tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1 ----PMEVALVQSTWQRFLesPNLTTEFSAIFQRMFQMVPTAMQAFRYVnstDLDslVANKDLQKVVTMMMSEVNATLQLLDQPQALISLIRshgARHA-TYGVTRQWEETMLNAILYAVETKLSPsGFNQSEKNAWRSVLDMLGRN------ >ERR1719495_824226 -----QDIENVRKTWEKMIAKheLQGVGLVVLTAWMNEHKEIRQVFAKSfpiiDKlekdvldlvQLNDPTLNEHATIMASSFGKMIECLDDTEfvQMMIDIGKKHT-GFRVSADSFDTsLNSTLITALMALSEEKEDSPNIKSWKTVVEVMKHYLK---- >ERR1719272_197188 -SLSATQRASILASWRQLCGEDggATFCASLLGGAFEAVPETRALAGV-PEAApepeavpeaeaavaapapapakgkagatavpeaaaaveeaaeeavesaESVALRAAAAHAAVAMEIMAQQLSAPEALKESLTELGVkaasRGLGC-GAPFDRLGEALQTTLQASLGDeAFPEALAEAWRQLYAQASQEIQLQY- >tr|A0A0N8ALQ3|A0A0N8ALQ3_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 ----------------------------TKARLN----NCMLLFSE-----k--LAAFLaQASPSWPVWNVVIHPCfs--qelMANQLNVLGGAHQ-PRGATPVMLEQFXXXXSPPSSSSSSRKP-PASRNSSPN-------------- >tr|A0A0P5ANB1|A0A0P5ANB1_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 --GGNDGVETVSDQSNLFVVF-AI-FGQGIDGNASEFDEVLLGAGSLlEELDedggNDGVAVTpDVFPaglniadlVGGQFSLGISQIfgflevlgdASdqsAHTVLPGLSGL-G-VEGAAQRFSKDFLSDVTELLEHDGVSSFNAEARQAWKNGMRAL--------- >tr|A0A0P5ESR8|A0A0P5ESR8_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 ----------------------------------------------FlEDASelleHDGGSS----TGFMGTTESVQLVghqllaeqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGNIGELAEHCLVL--GVGLDEA-EEDLGSDISV---------- >tr|A0A0P5I7S0|A0A0P5I7S0_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 ----------------------------------------------FlEDAAelleHDGGSS----TGLMGTTESVQLVghqllagqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGKISEGLEHLLVL--GVVLDE-TEEDLGRHISVL--------- >SRR2546423_8132340 --------------------LADVADEMFTARLLELEPQWQRVLSD--------EPTEWGRRLLRAIRQAVASFTCLGGFAEALRELGgVPAAHVGYRDYERQGAAFVGRLEHSLDKPMAGAMRESWQRVFRLLAEM------ >SRR5260221_7941029 -----------------------IAEAMFTARLLELEPQWQAVLSD--------ERRQPTQRLLHALRQAVAGFTRLSGFEAALKELGaIPVKGCSHGDYESLGAAFIARLERSRLGPRAHQMRERGETGFSPLSX------- >SRR5262245_33028046 --------EHADHNYDSNLRNNANFFHSFYSRLFESSDEIAKLFEQRNV-----TMAEQYRKLDHAMVSILAFNPRLRaTTLDPQIESHA-NFGLSAAHFGLFREAFLHALRETQGA--DEYSQEAWRAILNPALTYMRDK-- >SRR5436309_12080688 ------------ASFAKLLAVWEPLMHRFHAHLEQLNPRLRYHLPPA--------LL---RYVRFELLQAVRQQT-PMEVGSGLRRFgvHLRAQGFEGPDLDTLGAAWLVALDEVLGDRFDSEAREQWLRFYKVLRSAF----- >tr|A0A139A347|A0A139A347_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 OX=1344416 GN=M427DRAFT_73171 PE=4 SV=1 -MLSAEQARLLKKNWKDIGASSVanpmmFVVAQFYRRLLRK-KGYKRIFEGIDIE-------TQYFKMQGALTACVEfaeNLDKFADTIRRIGARHA-RYNMTPNMMNDVVDSLVPSLKEFsldHGITWNEEIEEAYDEWLEQVTGYF----- >ERR1740139_1939294 ---DSDTIAVVKQTWKAITALPeqqEYVGMRLLHNlhpcyetsltfllvielyylsYLRVVPSARAFFPPTsDSLIDDESFRESASNLMMCIDKAINTLENQRhlrfkALLQTYGKKLS-RLHIPPSCYTMAWFALIETLQDVLEDRFTELMLAYWIDIIDPINT------- >ERR1712129_538146 -------------------------HGDISSInhpvyytftllnkfthdsYLRVVPSARYFIPVIsDDDI-----TEKGIYLIACIDRVVRLLERQEkrrlqVLLRSYGRILL-RYDINPSNYTTAWLALIDTLQDILKNSFTELMLAYWIDIMEPTNL------- >tr|A0A1Y6FH01|A0A1Y6FH01_9SPHN Uncharacterized protein OS=Altererythrobacter xiamenensis OX=1316679 GN=SAMN06297468_2444 PE=4 SV=1 -------STLAERSFERLAEQRGDITQDVLERYYRRYPDGRASFEHHGL--GN-RAELEGRMVSTTAFLLMQWAQDPGGTRIEQGTTivhHQDTLEIGPRLYLGLIDAVLEVLFETIPDE-SAEERAFWLSLRGEIADFLE---- >ERR1711879_742838 ---------KVFQSYGRSC-NNMVFFEDFYSIFMTKSPDVLNMFANTDME-------AQRALLRSGILWLGMHARGMpDTKIRALGESHSKkKDEHQPHVLFHVAGRSDGNAFPPRP-G----LHSRTGANLAPYPTAHVT--- >ERR1712080_808083 ---TAGDVQVILRNWESVWGaqfsgRRVAIGQAVFANFLDRVPDAKDLFKRVKVdQPDSPEFKAHIIRIVNGIDNVLNPLVLILVSnscLVSML----SEMASRLPCSRS----WVPLSTMFFP--------------------------- >tr|M6F3R8|M6F3R8_9LEPT Uncharacterized protein OS=Leptospira kirschneri serovar Bulgarica str. Nikolaevo OX=1240687 GN=LEP1GSC008_4081 PE=4 SV=1 MNISENQIRSLNESFDIVNLDRIKFAELFFIYLKENHPKYENIFSRIQL--------EDVKHFMNSARNISLSSVQYSQLERAIQNFgvECLKICNQAEEIPILEKAWLFALEKWLGPWYSHEVEKSWQEVFKMIHTS------ >SRR6478735_3884488 ---------------------------------------VRRTTLY--------MPRP-DGRGGTMKPVVAAGSL----AIMAFVTVgaqAP-APTPQDRMYAAVRSDDT----AAVSALLQGGA-------------------- >tr|Q25689|Q25689_PSEDC Hemoglobin OS=Pseudoterranova decipiens OX=6271 GN=hemoglobin PE=2 SV=1 --------------------HQKQNGIDLYKHMFEHYPHMRKAFKGReNFtkedVQKDAFFVNKDTRFCWPFVCCDSSYDDEPtfdYFVDALMDRHI-KDDIhlPQEQWHEFWKLFAEYLNEKSHQHLTEAEKHAWSTIGE----------- >tr|A0A2P8XQA5|A0A2P8XQA5_BLAGE Uncharacterized protein OS=Blattella germanica OX=6973 GN=C0J52_27026 PE=3 SV=1 --LAREEKKFITESWHAFMRLPPANSVDAFVKFLQENPKYIKFFKSVDGIPledlrYSFRVPKHVTAVLLYVNSMVHCLDNADAMFflsLQVGLMHS-NMGLTVEDFKLFNGYMVNILEDELG--LNDEGVAVWNKVLEIFM-------- >ERR1740121_2035324 -----------------FTPLt-----Cqwa-----TPHDGPAQHVL-------------------CEDGHFahFATDKCesAgHG--ArvQCPSDMPEMcaDttcgggqehccrpaggCTGgERPCPT--------TASASgSA--SgsaSGSASSRRLAgIDYE----------- >ERR1719240_2235476 -----------YE---DEE---------------------------------------------------------------------GAqvdvmkgEDALVATADLLYQKMSEDAN---MQT-lLGNIELAELAsKLQKALa--------- >ERR1740122_169377 ----K------GE--ADKSG-nAEAAGGgqGDTPETGAAQDTAAGV-------------------TDEHS--------KA--LgieISS--FDELkvDqkciaaaIDAwKLFISTAESREAAGEAV---YNA-lFEGAPS--LQALFVTPRAE------ >ERR1719243_286169 -------------------------------------SHPVNV-------------------LVSDTMwkGY----t-vRG--IrrvNYY--VKYMmlTrdgnvsqALGwFKDAADCKIISH-PVNVLVsDT--MwKGIVRKQFLGgRLWFII---S----- >ERR1719158_147189 -----------RV--CYLYPLvhcNILAVLrelnfdGAAESLCLDAPALLPT-------------------MLDGLIwrSR----vTeNG--QrrvNYY--IKYFivDaeggfskTTEvMTDNGDPTIVCR-PVVSLVtDM--IwGRVAFRTFLYgKAWFLF---T----- >ERR1712071_338654 --PTAEEIALIRESWPIVKKNKN-VFVEFVLEHFRVHPKTQDLLPEFAnLAiadmPSNKfFVQLTETYVVMAMQEIIDNLDNAGVLTDLLQCLNS-NWYVdyvslDRQN-RETLRIRRVGQEQKSYSRNMESneiQQQRCPQNLRQAVH------- >ERR1711988_652294 --PSAGEIELIRESWPVIKKNKN-VLAEFVLEHFRVHPKTQELLPELAgIAladlPNNAyFVQLSETYVVLATNEIVDNLDNAGVLVNKLGENED-FQVLayyssAVATFivtnLDQEDILTHILVQQTKP--------------EQFVD------- >ERR1711911_417752 ----------------------------------------AisyPVFPSTSsLKy---------------------------DSLKKYLlDAFIf--NYCT---------LIFFL-------------fIKGNWQLgdgGIgrRIRYS------- >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 GN=TSPGSL018_8354 PE=3 SV=1 ------------------------VGAGFLKLYAQRNPWAVEQFSF-GLR------PQHAEKMGLALELIVNSATRPQVLQHQLRVLalgHV-QMGIKPEMFKSFEEALFAFLGQVLGAhnTFDEETEGAWRWMWGIVNAVFTQ--- >ERR1719232_1195758 ------ETVIIKDTWETIHKQVKAIGMEAFEKLFALNSDMSAYLPQTDDldqdetRRLSDKVKSHAKLTMETLEQVIAAIPDMTEVYNVITKMKK--LHPQTGLLEVIGPVFCNTTRHFllIQGRWSLDVQRAWLALFGEVSAMIRASY- >ERR1719189_1497217 ------GRQADEQ----VGREEAGPGHRGHRP----AQDDPAHLRgarDCGQrvrgraRRHGDRGV-QGRGQGEQS-QH-----------------HR--HQGS------HGQ---------LHGRHX----------------------- >ERR550519_213 ------NIVLLRDTWSVIHRQVNTLGMETFQKLFEINSEVSHYVSpscpDLDPdciDSTTQAIKAHATHTITILHNTVSNLCNLgd--LAGE---------------MNRLGKLHCDLGIDHgiL---------------------------- >ERR1712051_111803 -------------------------------------------------------------------SNF--HASDGHlmdgAFDPnISQIFSF-FYLFQNCEMLVFGPHFVASAMYYLPSPLrEKSTQESWLKLFSVITEIMMS--- >SRR3990167_4175368 -GLTDGEKGMIQQSWNLLS--KVEFTKILYKKIFELAPHVRCLFQNS--------IESQHENFsimMDMmINEHINDELDLFAVVLQLAKRHF-HYKVKTDYYSIFRDGFLWSLEQTLSIEtlnktITNestnqptTIKSIWLKFVNYLISVMV---- >ERR1712212_288737 -LLTDDELFSVGNLWTNLRESSADSGLYIFQHWFDMFPEVVESFDFAkDQYgnillnlMQTKKMRNHAIGVMNKLDAMMMRLFKRDPevakLIYDVGVHHQ-TRNINEDEMTKMSKSIYSAVQDINVGPHSDKELAALHNLLEVVSYHFKR--- >ERR1719167_330163 -DLTDKERELIQHTWWRFREE-PYCRLRIMTHYFSANSSIKKKFQRKNEENaangnlmtamVSWNIRRFSIRLVEFMDKVVRDLETENyQDIYDISELqgakHYRlKRMVEPGDMEALGQSIQTTISEHFGEKFNRSHILAWRRLFIVICSRF----- >ERR1719378_576485 -DLTDKERELIQHTWWRFREE-PYCRLRIMTPYFSANSSIKKKFQRKNEENaangnlmtamVSWNIRRFSIRLVEFMDKVVRDLETENyQDIYDISELqgakHYRlKRWWNRETWKLSANRSRQQFR------------------------------- >tr|A0A1I8F573|A0A1I8F573_9PLAT Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1 --------------------------------------------------STNQKPPSDGDRLLYWINVQ-------PTAQPQllrGASEGC-VRLFSPRILTRSCISSNLCVRAGRGRNS----SSTeTTSAEGADAVVAA---- >ERR1719265_1594411 -------VDTIVKDWAGLD--LEKLGDTTFGMMVQNNPEIKTIFGGDVhPGVAQQGLKSQAATFVGFMSYAMTWLKKkdfivLEQKMVELGQRHV-HYGVNVSHFVSFQEAMFTALREQLGTRFE-DNKYAWTF-------------- >tr|A0A0N5AG16|A0A0N5AG16_9BILA Uncharacterized protein OS=Syphacia muris PE=3 SV=1 --PSRRQCCILHKSWHRAQQCgLD-IGSRIVMQVTKNEPTVWRTVGLTNATGadikYDKNIQYQAALFTKALTTIMSKIDDPEAVseyCRELGRRHVRhvKKGFQTRWWDTFAESLTECVIEWEGttvdltslvfhatkicGQRCKEALNGWRKLVIFIISEMRAGF- >SRR3989338_2963815 ----PHQMTPLYHLYKENVPpqKERELGLLFYKLLFDSNPELLDFFANVDLD-------HLSDHLVQTIRLFLESrnsLVSLVPAMKALGIIHQ-RAMIPSWAFPLVIENMAKLFSILLGDRFTVELASALVLSFDLLTSFV----- >SRR3990167_6716616 -----EYENPIYStlknIWlETVSTpeIKSAVGELFYKNLFQYHPELLEYFNNVDMD-------SLALHLSQALDFVFQSinkIGDYksqwRTVLEHLGEVHR-AALIPTWGYPIIGQQILKIFPYNEKAGFSTKQL--etaLATLYREIVIIM----- >tr|A0A0Q4Y6B0|A0A0Q4Y6B0_9BURK Uncharacterized protein OS=Pseudorhodoferax sp. Leaf267 GN=ASF43_05025 PE=4 SV=1 ------HRVLAKYAYRQwVEPLGMQFSQAFYTRFFQDDKASRAIFERALGPRAAgLilVDDAHHNKLVGSLGKVLNYRRGsPPSSIDDLVPSHR-DKGITIEHLRHFREAFLKTLEAQIDAsdPEKRAVVDAWRQLFEPVLDAMAS--- >tr|A0A1Y5SIU2|A0A1Y5SIU2_9RHOB Uncharacterized protein OS=Roseisalinus antarcticus OX=254357 GN=ROA7023_01630 PE=4 SV=1 -----PQAELVADSLSRVGDKVIWLASDYYEALFDASPQLHGVLPH--------QMSEQTNMLGHALAHALANLRDPDGAAPMAQDAglADRSARMPPRMRRTIVRTLVHALSLWHGPTWTKDHARAWNEGLLGVAPL------ >SRR5690606_37396704 --FSDTDTYILHTGLKWIEEAPETFAAKLYQRLLRDHPECQASLHAIGL-------ESFNRNFIHFLKMVKEELLERHTIHVAPREFlalHALpvEKVRHSNYVIKMGRTFLDIFAELAEDAWSPALESTWNKAIEEVKIALW---- >SRR4029453_11903763 -PMTDAELALFHDSLTRCTSQ-PPFLERFYTLFLAASDEVRHKFRQTD-------FQKQRRLLQASFYMVMLQADGKpEGavHFERIADLHSQrHLDIPPHLYDLWLDCLMQAVREYDP-EWMPGTGGLFWGRVGTCIVFFYMISV >tr|A0A1R1LGI5|A0A1R1LGI5_9GAMM Uncharacterized protein OS=Motiliproteus sp. MSK22-1 OX=1897630 GN=BGP75_23395 PE=4 SV=1 ------QLDKIYSTLQLLDdEKSEKLINETYSIFFNAHPEAVLLWSKDDPE-------SRSKMFNGVILTIIDNLTRPDIFKnNLLSDVkdHD-EYGVDKEMYGGFFLSLTEALKKTLGSEFNQEMELAWKHQLAHIRE------- >tr|A0A1H1BYI0|A0A1H1BYI0_9ACTN Group 1 truncated hemoglobin OS=Thermostaphylospora chromogena OX=35622 GN=SAMN04489764_1195 PE=3 SV=1 -------------LYEKIGGgpAVREVVDAFYTDVL-GDTDLKPYFDGIDMA----RLKRHMVVLLC---SVLGGPEGY--RGRELGEAHK-NLGISDEHYAKVGDKLVTALRDH----------------------------- >tr|A0A1R2BTD0|A0A1R2BTD0_9CILI Uncharacterized protein OS=Stentor coeruleus OX=5963 GN=SteCoe_19762 PE=4 SV=1 -------------IYDRYGGqpFWERILDVFYTKNL-AEPTLQGFFIGKDVE----RAKAMNRSLLA---AALRPEGEH--FPVSIKRTHR-NMDISDAQFGKFAENLISTLGEN----------------------------- >tr|A0A218QUH5|A0A218QUH5_9CYAN Group 1 truncated hemoglobin OS=Tolypothrix sp. NIES-4075 OX=2005459 GN=NIES4075_64370 PE=3 SV=1 -------------LYDKLGGkpTLDKVVQDFHKRIL-ADNTLQPFFANTDME----KQRQHQVAFFA---QIFEGPNEY--KGRAMEA-tHA-GMNLQQPHFDAIVSHLKESMASV----------------------------- >tr|A0A1Z4FY87|A0A1Z4FY87_9CYAN Group 1 truncated hemoglobin OS=Calothrix sp. NIES-2098 OX=1954171 GN=NIES2098_33650 PE=3 SV=1 -------------LYEKIGGqaTLDKVVADLHKRIQ-ADSSVNTFFAKTDMA----KQRSHFVAFVA---QLLEGPKQY--AGRPMDK-tHT-GMNIQPQHFDTIAKHLSDAMAAN----------------------------- >tr|A0A0T6BC68|A0A0T6BC68_9SCAR Uncharacterized protein OS=Oryctes borbonicus OX=1629725 GN=AMK59_2266 PE=3 SV=1 -GLTSQQKSLIQSTFNVIRPHILNVGIDLFVRVLEVEPEHHRVLPfsHIPIadLHESFEFKFHCLAVVYSCSAIIDHLHDDGILIPLMKKYASdLKASIPLDIFQMIHDPLLEALDVHDDVKISEEALEAVRTLLRNLTNFLID--- >SRR5689334_189301 -------LDALETSLDLVSPHG----SELMDAFFAERP-----FPAGD-------AGAQRAATLRLMGLLRLCLRDVHSVVALVRDLGA-RHGAQREQ-------------------------------------------- >SoimicmetaTmtLPA_FD_contig_71_176585_length_314_multi_3_in_0_out_0_1 # 2 # 220 # -1 # ID=1957230_1;partial=10;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.685 -----------------------RGGaveevQGPESALLESPPSLDRVATDRS--------AMIPLG-ATGLHGIMTSM--taPSMLqdlVLSLASQHL-DVVLSPPRAIVLRDAILDLFQQELGDGFDSKARSGLSLILNYVCGSFL---- >ERR1712159_177610 ---STSSLNAVKNSIPLIQQHGNAIAENFYVQ--QIQPTNITFFNRAHFTS-----GQQAQTLSQFLVLLAQRSDNLELMnthLRRISNKHV-GFGIKPQHYPIFFENLFVAFKEVLGTKATPELISSWKELVSLVQEG------ >ERR1712159_799488 ---STSSLNAVKNSIPLIQQHGNAIAENFYVQ--QIQPTNVPFFNRAHFAS-----GQQAQTLSQFLVLLAQRSDNLELMnthLEESPTNML-DSESNHNTTRSSS-----------KTCSLPSKKS------------------ >ERR1719323_2894579 ---KVHRQTYDICD----------LILQHIQIITVHCILIQDIDQCCHL-----KTDKQVAAVVNILYQYAMNCDNLNVLENEIAdiiGLAV-NLNMEAWQYPLIAQSLVE---------------------------------- >ERR1711868_248053 ---------MIKGTAKTIKEKGSSIITRMHQNLVNKHKEFKTIFPEEIL-----KDAIHMQKAVGLLHGYASNCDNMPVIEADISelvGILI-NVGVENDHYPLVAEALVEAIGTCLGSDTNAETVDAWKQALDFMVVHF----- >tr|G5ZYB7|G5ZYB7_9PROT Truncated hemoglobin OS=SAR116 cluster alpha proteobacterium HIMB100 OX=909943 GN=HIMB100_00010220 PE=4 SV=1 ----------------------SKLVSELYEELS-QNEITAPYFENSNMT----SLMDHQVKFLSQAL---GGPEQY--TGQAMNAAHT-GLKITEAAFTEVAKTIQFILEDN----------------------------- >SRR5688500_9373349 -------LPYTTLFRSALGDDAVGMAAELMDRLIADHPHDAHAFMNPEAA--RERMTRETLEAM--LGVA-AREPWGETTIANFVDLHH-NYAsFGADDYAARFAMTMAVMERGAGARGPGGASSAWRRQAA----------- >ERR1719365_124985 -EMSGKQKKIVWRTWNSMLGkqesDYNDFGINFVLWLFDNFPKMRNKFDELYGRsrnslIVDQHFIAHTENVVKELDRLIKDLPFPRLLSKRISKLadsHLNqEP-------------------------------------------------- >ERR1719199_1194134 -----THAGYIEKSRESVLNlDAAQLGADIHVKFLNVYPAAASLFQKT-L-----RM-LITTKIMGTLMAVIS---DPTGTledVRAVGVRHT-KYGISERYLLPFGAMLWEIVGTMLPGMWSDEHSAAWAFYLDFIASTMTRA-- >ERR1719359_1737517 -----------------------SFGEAFRFNLGMMAPEFMAMFKTLTAE-------QFTDQFTVMVGQIVNYIDDPPKLLEDlyiLSVRHL-HYNTKPGNSLSLGKQ-------------------SWLLCEASFHRIGIG--- >ERR1719487_2229452 -----------------------SAALSL--------P-------T-EQE-------SPVTMTAEA----VQMVQDSL--RRVdsaVQV-----RDAMEDvFFPHLF--------------------------------------- >tr|A0A2E0SMS8|A0A2E0SMS8_9PLAN Uncharacterized protein OS=Planctomyces sp. OX=37635 GN=CMJ46_12130 PE=4 SV=1 --ISERQYHLIHDSYRRCM-LADDFLVMFHRNFMEKSPQIPKFFAD--H-----TLQQQHRILAKSVARLVSFVDGKPQaeqdMRDTMRILHDGNLRLTPEHYAFWATALMETICTI-DEACNDEVAVAWEQTISYGTGVLK---- >SRR5690349_6204932 -ILTDEHRHFIRTSWEKINKRHekTTLGILMFEKVFAFLPDLRNVFGLNDSSvsetDRNENFRRHTSLVVNLIDLIIRNIFEMEAemgpVLLMYGRRHFLKHDLVFQE------NQLVAFAQGLCEFfeeevdhdddnsLASETKAAWNIF------------- >ERR550537_1224553 ----------------------NVVGRVVFMNIFKAAPEAKALFPGAREEnmwGPGSKMEQHVIKVVQTLAVAIGGLKDLGPIVPVLEvGLgvgIL-RNRHILSTIHLFRTFWllcIPMIQRIVGHPsscQTQRWSSRCRVVLI----------- >tr|B5DW13|B5DW13_DROPS Uncharacterized protein OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN=Dpse\GA26483 PE=3 SV=1 -GFTLCEKVALRQAWNLIRPRERRFGQDVFYTFLNEWYWSISKFKKG-EDINIALLHAHALTFIRFVGALINESDPI-MFQVMINENnqtHS-RCRVGADYIAMLGQALTDYILKVLDKVRSPSLEQGLQRIVEKF--------- >tr|A0A1I8CTR5|A0A1I8CTR5_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=3 SV=1 -KMTASQKSVLISSWKFIKPNANFIMRKIFTELESVSPKVKQIFAKAailDCfskesSDaKACTVDEHVRLLSRFIDDVISNIDKEKEVrniLRKVGQSHAGlsnGSLFTSSLWEFLGEIAVAKICQVDYVQKSREAAKAWRLLIAFMTDELRNAF- >SRR5258705_2725614 -----SSFPPGPGELRNCCAHRRRRRRALLPAPLRARPVARAHVLR-R-------HAL-RDHFEAALALIIRNLDEMEALAESLLESEW----------------------------------------------------- >tr|A0A1Z5JZN5|A0A1Z5JZN5_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_19Hh029 PE=3 SV=1 --ISPDVVSAVQDSWERIKDSspawEDDFGDRFLKSIFTKAPLsYKLLFPFGTTSgpamFESEDFIEAARTASTLMDMSVSLLECeMDALFGQlleIGLEHANFPRIQTSHWSMMRDALLRTLASYssaLSEDCKdlEKVLSAWSLVFDNLSNEMVET-- >ERR1719329_2064399 ------------SLFVRLGGDvaVDAAVERFYERIL-QDPLLAQIFSRVNL-------AGLKNMQRKFLTMAFGGPDLYDG--LSLRDAHQ-GKGITEAHFAAVAGHLSATLREmAVPDRQHDEVMAIAASTQGNIV-------- >tr|A0A1I8MDY2|A0A1I8MDY2_MUSDO Uncharacterized protein OS=Musca domestica OX=7370 GN=101890360 PE=4 SV=1 NGFTATEIASLRNGWRHFKRRFGYHSKQIFMKFYQEHEQMLEKFRNRMGKFNMQQLHRHPQELLQVYGNLIEqGLDNMtymHVLMTAISQRHR-MFGVTGYEIKLQTDhitlYILALLEKII----SPTFVSGLEKLSRLIN-------- >tr|A0A1Q9NTV3|A0A1Q9NTV3_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_3 OX=1841598 GN=hmp PE=4 SV=1 --FTSKEADILTQSLKALEEKTDDLPKLFYYHFLEPtsNKEIISLFNKS-------DMTKQYMMFHQSLAIIVSSIKDSHllnQILKDLVKRHK-NYGVKYAHVQIFSSAFYKTIEEIFPKD--EKVKILWIKLINFVLSKFN---- >SRR3990167_8190046 --MDNAQKLhIVDTILERASELAGDITDSVMAEFYRGDPEAKDLFTHHCPV---DTIRIEAGTVEQALYCFMRWFQSPGEIRILLLGSvphHVETLKVPVNYYHRFLQAMATVIRKTIPAE-SREEIDVWNEICGDLGEIVDA--- >SRR5690625_7611079 ------------------------CALCFYLCFcTDTPPTRTYILSLHDAL---PICQLEGEMVENSLYCLMSWFESPGEIEMLLAGSvphHEETLRVPPHWYEELLEATRSEEHTSELQS-RGHLVC----------CLLLE--- >tr|A0A0K6SA08|A0A0K6SA08_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_8920.t1.CR2 PE=3 SV=1 --------------------VSAAMAEKFFELVPKRAPNLRMIFEKRqDIY------KH---HFGEITKRLLAYLDSPEEVWKedpELAIKHI-EFGVMPCDVPVFANVFLQILAELAGPAWTQRHRDTWDKLFSIVSGALAE--- >SRR5690606_8675308 -----IDRDLIEASFEHAAETLGDITPFAYQHFFARYPQAEELFLCKG-VQFKNDL--QNQMVRDAIYAFLEYLDTPDEVDIVFKytiPQHL-DLNIPMLYFNGLLEAVAEVVCGATPEAGKAATEASWKVLLESIE-------- >tr|A0A1W0WMU5|A0A1W0WMU5_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_09357 PE=4 SV=1 -ALTHVQINLVRESWRWLNfnrPLQETAVRFFLDFYFKQNPDCLPMFGMKTVDHYNKAFSIHALTVMHAIKYAVEYIGNPEQfqrLFRTVGQTHL-RFGLTDLHVERFLEQWLAFLRANDAKVFDAATVEAWNLAGRIVVSQI----- >ERR1719354_143580 -------------------------------------------------------------AFWDILDHICGHLDRLENLIPQLRDFalQCFNSGLFSDDYNILGECLVTILSTNFDP-WEETHSDSWAWCLDLVMSTLVT--- >SRR5215207_8455447 -------------DFDTVV--CSSFAERFYSRLFTHEGGehLRALFPDN--------IQPQHAQFTTMLGDILAYNFRIGrsLLGD-TFRKHI-DFNIRESDVDVFRKAFVEEVGSTFLH--LG---------------------- >SRR5271170_3229012 -----------------------------------GRECRRDNrLLLLDAPPATPLgtSqyLDARHRTVSCTSantgvctgpYQPDQLKD---------RKT--VLGGGLR-------------LAQPGSRLSQPLPGRFGESAGX---------- >SRR5216684_1000550 -----------------------------------LHQGRHRPRVHLGL-------------------------------------------RGGSPAHPPRDPRPRHKRGAIHRA--drhVPPPrPPRQSGAQAdfSDHSRL----- >SRR5271154_4753691 -----------------------------------LHQCKHRC-LHWSLPARSA-qrSQDGP----RRRVtlqpPPVRNRGR---------GVSAlsllrsswpniRFYRVETVSCPRDRLCIDLDPISTVKRNLA------GVsDVYLL-RS------- >SRR4051812_37657562 -------------------------------------------------------------------------------------------------XMSSGVRFTRWRCESIRARLRapsdhcvTVPVkPSRRSDSAVsaRKAEQ------ >DeeseametaMP0200_FD_k123_38240_1 # 1 # 450 # -1 # ID=33738_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658 -----------------------------------PHACLSTChAANPP----VAI----RARRSSAEGYAR---------------SD--DARGGTA-------------SPPPGRELSSPASAIDPFSRGAISFVSF---- >tr|H3FA75|H3FA75_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=WBGene00108645 PE=4 SV=1 -GLTAYQQKLLIQCWPNIYSTGpgGQFASAIYNRLQNSCPKAKQLLAKANGVavFANSDvdcTAMHSRVTIELLDTAIRNLDAdHAKLTAYlieVGRSHRplRQEGLAIAVWDDLADSLMECVCRYDAVKKHKELRRAWLALIAYIVDNLKNL-- >SRR5437879_6948005 -----------------------------------------------------------------------PPSTcsWTtslsagrgvRPISSVSASPTA-STaaTLPPHLYDFWLDCLLHAAKECD-QQWSPEVAAAWRYMMGSCSSRLAT--- >tr|A0A0V1B190|A0A0V1B190_TRISP Uncharacterized protein OS=Trichinella spiralis OX=6334 GN=T01_13586 PE=3 SV=1 --LNPKEVILTRNVWAALKEKhQHLVGMEIFRQIFNRRPDLKSLFGVSALdtemALNSTRLHRHTMIFQDVIDILMVNISNVDVniadSLIDLGAQHWvlTKRGFDPAYWLIFGDVLFDLVENVTRKLpSRKRSTNAWRKTIAFMLDCMQIGY- >SRR5437762_8994925 ------AAS----------------SDHHIPSQLAAGTRAKDRKGGVE-------YPGHVCRGQRRCARDRPHILAsPELCIPRAcrtksA--------------AFCAVCENRCCETCR-SPPAKKPETARRSAERTG--------- >ERR1719204_228700 -QLSPSTVKAVQTSWNNIRSGGpGYFGHLLFSYWLAEHPRALGVYSMYyhdDKkHrvSLLPRFHRLGEVYAKRIDYWVTNLEEPVKLFLMLyehGFNHA-KRGVNLRDFPNMTPSLMDALATALGRQMTLKLYDQWKDFWKFIFMQIAEG-- >tr|A0A2A6CS87|A0A2A6CS87_PRIPA Glb-5 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_35904 PE=4 SV=1 -----DETHLARAHWILLHKMnkQGTVIQSTFEHLMTEFKHTRPIWQFGrniDENvkdwnkelHEDFYFRHHCASVQAAITMIMENKDDIVSLTRVLnevGAHHF-FYDAYEPHLILFEDAMITAMKKVLKGveELDEETERSWRVLLQLTRKHLIEG-- >tr|A0A090LKP0|A0A090LKP0_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_2000335800 PE=4 SV= -ELPKADKDIIISTYNILLQADPELFSKAWIMSASRSTSIRKAFSLIDPnsTHIEVDFTKFSAVIERFFTRIICEekLVNesFEKSCINLGKKHVDfvPIGFHSNYWDIFMNCMIDVIAETVIIAFNednkqqQQVQKCWNKFVGRIVFLMQSGFK >tr|A0A2G9URY2|A0A2G9URY2_TELCI Uncharacterized protein OS=Teladorsagia circumcincta OX=45464 GN=TELCIR_05034 PE=4 SV=1 -PIANKTKKLVIQEWPRMLEHQPNLFGIVWISSATRSNSIKKTFGIGANenPEDNEAFMKIWPTVQQFFHKL---------------------------------VCMAETVDQTLCEYYTddlkrAEMILAWQRVFNTIVHHMRTGYI >tr|A0A0M3JT43|A0A0M3JT43_ANISI Uncharacterized protein OS=Anisakis simplex OX=6269 PE=3 SV=1 -SFTTPQLTSVFNAHFSMIQLNPDVIKDCWIKTSKRSSSIKKAFGMLEHeePETNASFMNLPITIQAFFKELIFEldCDSvkIRQRCEQLGARHVDfsERGFHSNFWDIFQVCTIEVIAEC--NLGLnedqhRSYELAWIHLLSSVVKSMRNGYT >tr|A0A077Z0R2|A0A077Z0R2_TRITR Globin OS=Trichuris trichiura OX=36087 GN=TTRE_0000042901 PE=3 SV=1 --FTAKEFAIAELTWAKLKVRfNNQVGMEIFRQIFGSCPEVKDLFGLQNKedqkALCDQRMARHTAIFQDIIELLIVDLSQRsDSLtqsLITLGAQHWffTQRGFRPEFWVIFGNTLVNLIRSLPLSlSQRYLARRTWIKLIVYLLDCVMLGY- >tr|A0A0N5DS84|A0A0N5DS84_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 --FTPKEFAIAELTWAKLKLRfNNQVGLEIFRQIFASCSQVKGLFGLQNKedhtALGDQRMARHTAIFQDIIELLIVDLSKRsDSLtqsLITLGAQHWffNQRGFRPEYWVIFGNVLVNLIRSLPLSlSQRYLARRTWVKLIVYLLDCVLFGY- >tr|A0A016V5D5|A0A016V5D5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0017.g3216 PE=3 SV=1 --LNRMQRRALRFTWHRLQTRnggkrVENVFEEVFDRLVRALPCVRDMFTTRMFlcamArNETASLRDHAKVTVKMFDVVLKNMDTDPskrtdtgfPLDpKIIGRAHGplRPYGLTGQYWEKLGETIIDVVLGQEAVRDLPGAGQAWVIFTACLVDQMRAGF- >ERR1719187_3161387 -ELTDDEINEVQQSWDLLTRSeggLREAGLTLNQQLLTAQPHHIRSFEKFRkykdfdDILKSPEFKTHSYSTVREISLVITNLKHPGVFtqlTQSIGFAHR-RANTPPNQMVDFKSVFINdFIPSQMADKATPNTIKAWEKFMTVFIEHVKEGL- >ERR1719481_246497 -ELTDDEINEEQQSWDMMTRTegg-lREAGMTLNRQLLTAQHHHIRTFEQFKkykdfdDILKSPEFKAHSYSTVREISLVITNLKHAGTFtqlTQSIGFAHR-RAKVPPNQLVDFRSVFINdFIPSQMADKATPNTIKAWDKFMTVFINHVKEGL- >ERR1719347_979638 --VTDEEMASINELWSCLRADAMHSSRFIFARFFEAHPEFLEPMPFVkDYygniSpkyMDTQEMQDYCLKFMSTLDAVMTRVFARdkEalQVMRDIGYSHH-EFGLTSDMTVKFMNKMHDSVLELWGTEASRRDSKALDNIFKTIATEINVG-- >tr|A0A1I7TYQ0|A0A1I7TYQ0_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=3 SV=1 -GLTRDDKRIIETCWFKCSQKqLRKSSCDMFWDILHTDEDILRLFRLDHVSpnrlKDNEYFKSHASNLALVLNLVVTNLQDNfEQaqdALQALGYQHLhlIDRtHFQSMYWDIFTDCFE----RNPPPSFRkGAEREVWSRMILFIMGQMKTGYQ >ERR1719396_104066 ---------NIIESWELLRFhpsLKEDLGTAIFRELFKEHPELREHFGLPlvGLdaLCKNQTFLSLSNQFVDVFARTMDTLGPDEELmdesIRELGEKCV-SIGIETSHLSLLRKPILSAVEKILLEDFDD---ESWKKFYSILATDL----- >ERR1719396_219220 ---------NIIESWELLRFhpsLKEDLGTAIFRELFKEHPELREHFGLPlvGLdaLCKNQTFLSLSNQFVDVFARTMDTLGPDEELmdesIRELGKKCF-WKTLMMNHGKN----STPYWEQIWQREFQQ---DKRDKLYSYSNNNN----- >SRR5215467_3799544 --------QQVSESYWRCCT-NPLFIEELYQTLFSKCGEIKQLFEQKNV-----SMKRQYAMLRYALDIFVDYPHDMTATFPDIARKHT---GLDPRFYETFIEALIETVGKCDPK-WVPSLEHAWRERMT----------- >tr|A0A1I7VXG1|A0A1I7VXG1_LOALO Uncharacterized protein OS=Loa loa OX=7209 GN=LOAG_10963 PE=4 SV=1 -QLSSYQIHLLQQSWQRIRS-SPNFFINVFRTVIAKNTIAKELFRKTSIIdgftsYKCYDVKEHADSLIELIDFALQEIHSSTKVVQhrcmLMGATHCNTcENSMSSSWDQFGDSLAESIAKAEAIRGKRKCLQAWNTLLSFIVDRIKGGY- >SRR3954451_1828621 --MDPADDALLRQTQGLLRESldfaggAVAVADRLRQALRAARPEVVAALPG--------DAATQTAKLAAGLVWLVDHLDQPPLLVGgsaRLGAALA-ACGVPPRGLQFVGAALAEALRAGSPaGEWRQEFELAWRSTWQHVYEWMQVT-- >SRR5262249_5830581 ------DVEVARDSYRRILDDVerqREFFHTFYGLFLRRCPEAAAVFEAKGYPalaqlggPRvedsAGRGPQPPNPLKSAIVMLiaFNILGEKEepTILDNLVDKHK-GFP--KRYYVAFQDALLETVVQFDDPsrcgMPPDELQHAWKQAIQPGGDYLID--- >tr|A0A2A6CAG8|A0A2A6CAG8_PRIPA Glb-32 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_40555 PE=4 SV=1 -GLTPEQKRILETSWVKATPKqIRKATEDVFASIINHDRSLAVMFRLDDVPinriRENQAFKKHAANFALVLDLVIKNIPDNvDSCcqaLQALGGQHVslRDRGFDSIYWDVFTDCFENNPPATFK---TDIDREAWSAMILFILAQMKLGFR >tr|A0A0N4XT53|A0A0N4XT53_NIPBR Globin-like protein 26 (inferred by orthology to a C. elegans protein) OS=Nippostrongylus brasiliensis PE=3 SV=1 --ALQALKVILRTTWRHMSKSGqGNCGSTIMRRLFIRNDRVKNVFHHNIMigglLepnaQETHNLQQHYSDIVQFLQFAISNLDHPSRITekcHEIGLKHR-KYktmGMKkkidkkylqAEHWDLLGEAITETIREYQGWKRHRESLRAANILVSFLVDRIRT--- >SRR5215831_15107384 ----------------------KLFFSKFYTNLFGRADDIEDRFKELD-------MERQYRILNLAIHKLLEFRPEQPAtqkQLRDLSLRHA-KLGLTNHAPAWNR-IH-LDLRGIGA--DGRSsGVAAADKALAX---------- >ERR1719234_1549997 --------------------------------------------------slwhrssIQLEGASNHNKALMNAIDSVMvEVLERRPMSksgIRDAGISHH-KFGIKRLDMDKLTTAILAAISDVLGDCdLDRKmlQLNAWKKFLNAIGDEFSVG-- >SRR5262245_32700325 --LNSNQRDLIRRNWDSssK---RYELCRRIYCRVFARRPEIRRIFSIGYDW----WRLEI-VTFADFVQSIVDNLDDAKRVrqsAFEFGRDHAkwRRFGFRSDFWVQLAESTTREcvyLDAAV--HPPDESLETWTKFVSIVF-------- >tr|A0A2A6C3W4|A0A2A6C3W4_PRIPA Glb-17 OS=Pristionchus pacificus GN=PRIPAC_39254 PE=3 SV=1 -ELTDEEVAAVRNVWIRAK--TEDIGKKILQTLIEKRPKFAEYFGILCqsDKldmnslKESKEFHLQAHRIQNFLDTAVGSLGYCpvtsiYDMAHRIGQIHF-YRGVnfGADNWLVFKRVTVDQVTKGVTSTqasqanllegtkepevveqhpmadvqnpFsgeNCLARLGWNKLMTVIVREMKRGF- >tr|A0A2G5SLB2|A0A2G5SLB2_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-17 PE=4 SV=1 -EMSDEEVSAIREVWIRAK--TDNVGKKILQTLIEKRPKFAEYFGIQSESldiralNQSKEFHLQAHRIQNFLDTAVGSLGFCpissvYDMAHRIGQIHF-YRGVnfGADNWLVFKKVTVDQVTTGATDSskekdkdetnsngtangkvdteanpipvgiadinnvYsgeNCLARLGWNKLMTVIVREMKRGF- >tr|A0A0N4ZE39|A0A0N4ZE39_PARTI Uncharacterized protein OS=Parastrongyloides trichosuri PE=3 SV=1 -DLTAEEIEAIRDIWLRAK--NESVGRKILLALIEKKPKFAEYFGIGSENvdpkelLGKREFQLQAHRIQGFLDTAVGSLGYCpmssiYDMAHRIGQIHF-YKGVnfGADNWLVFKKVTVDQVSRVNVEGkdrksnvslgkrnnsgdaedstaetprkesahsfndmYevsNCLARLGWNKFMTVIVREMKRGF- >SRR5512138_1182700 --------RRVQGSYSTFQAtdRADRLYRTFYANLFASVPEARRMFAHTDWS-------RQYNAINEALKLLLDFDADPQRaadAAKQIGsvaLKHQ-QYGLGERELRAFEGALLHALRSC-G-ECKPATLEDWRMILAPGFHHMRGA-- >SRR5687768_15481058 -ELSDRTRDLLVQSLPLMEHRKDALIEGLARYLIGSTGD-----ANQ-------DSELVAIVLTELLIGQASHLVRSSALpdLDDIRLEHS-RLGVQGSHYSRFGDALTPVIRDVLGPKLPREVAGAWGDVFWTVINVI----- >SRR5687767_13070119 --ISDRTRDLLAQSLPLMEQRKDALIDRLGAYLGG-AGD-----ADE-------DSELVAIMLTELLISQVGNLLRSGDLqdVGDVGHEHR-MLRIQGRHYSRYGDALSPVIRGVLGPQVPGEVAGAWGDAFWAVIRAV----- >tr|A0A0R3PFZ5|A0A0R3PFZ5_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1 -KFTQYVGNIVVLAFLNcfatitktvsdtsitvhvdqiqihcdihtsfqcsrekgtsfeqgldfdkTF---IKRLLGLFRLLCFKSALSREMFQKMSIVegfrtNQCCDLNMHAK---------------------arcmDIGGSHV---QMneecCGALWDQLGECLAEVITKVDCVRSKRECTKAWIMLISYVVGGMSLGN- >ERR1719414_1806988 ---TVAQAEKVVAQWDAAD--QDAFIVAMYQAMMKTHPEWRALFNKPTGAptPAEAEWKKQFDLTKAVLDRglrsRATDVDALKERMHAMAGRHV-NYGVTQTHFQALKPILTDVLAATVT----GADMDAWSAVTYFMLDS------ >SRR4051794_33648798 ---------------------DHGSTN-ASTRALAARPTMSAKFGRAT--------AARARHLTRAIQDLVEFREDDgASRFRlHHVPAHA-GMGITREDAEAIRREFVAEVIATFERsggNvSPQMHGDAWNAVSRRRVERCVE--- >tr|A5L2R3|A5L2R3_VIBBS Uncharacterized protein OS=Vibrionales bacterium (strain SWAT-3) GN=VSWAT3_02206 PE=4 SV=1 ----------------------QAFLESFLADFCQHNPRFSERFEKVG-------LEQQTKMLKASIILIYNSAGLPsvRNSVKRLGKQHK-DLGmdISEQELNEWFKSLLNTVKKYD-PHYNDQVEQAWTETLDVGLKIMKQ--- >APIni6443716594_1056825.scaffolds.fasta_scaffold2871162_1 # 2 # 304 # 1 # ID=2871162_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.617 ----------------------QEFLETFLADFCEHNPRFSERFESIG-------LEQQTKMLKASIILIYNSSGLSsvRNSVKRLGKRHK-DLGmdISEQELNEWFNSLLNTVKKYD-PHYNEQVEQAWAEMLDAGLKIMKQ--- >tr|A0A0N5DFM9|A0A0N5DFM9_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=4 SV=1 -LLSPAQIKLIRNHWNGLYItiGPTAIGNYLFNRIVFKNPQSRKMLLSLlvDHLSPGYFSKRHARAIGVILNFVMKNLEYPENIsliLKMVGHCHAKlvTVGLDSSIWNVFAEALLECSLEWGeKSRRVDEVRKAWAIIIAFITEKLKAGFN >tr|A0A183IST0|A0A183IST0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 -QLNDKDITLIAESWRKIED-RSLWAQRLFAKLFVYRPQLASIMSYQDVSgkklLSNPKFQNFCQRFADFWQDVVSGLCDRgtdddwKqvvALIRELGARHSRipKITFEASIWLHMKSEIVQSIT-GFKDIYRDELCYSWNKLLMFVVTEMKDAF- >UPI0002C4E217 status=active -------------------------HEDFGTAFFEYCPDLKGQFPSN--------YALVTKMIQKFINNVIEG-KNLERLARHYGRTHW-RYDLEERHFLGFAEALADTINIRIGNFGTIELMKIWREEATMICKMLEDQY- >SRR5262245_41417288 ----------------------GNLHARIYEAFFAACPEAKPLFDNTD-------LKRQYQLLHQAIVLMLAFHVSPNreepTILSRVAARHS-ELGvhIPPAWFDAFSAAIQQSLEAA-DTQFSDKTREAWAAVLADGIGYMQ---- >tr|A0A0K0EPG4|A0A0K0EPG4_STRER Uncharacterized protein OS=Strongyloides stercoralis PE=4 SV=1 -GLSFYQQKLILQCWPNIYTtgVGSNFASNIYPTLCCKNSKAKALLQQADGVavFSNSgvdCTTMHSKLTLEIMDSIIKNLDSnPQPIISYLQDTgysHKnlKIQGMNMSMWDDLGDSILEGVRKNELVRKHKELRRAWLAIIAFLIDNLKQG-- >ERR1719183_3286062 -------AISLRDSWVHIEVlkeedDSGGFGDALIFQLSVVA---QEIFGLVVT-----ERNALGKIFNRMFSTLVHAMGDPQKFTEeffVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDSMVRNF- >tr|A0A0V1BAT0|A0A0V1BAT0_TRISP Globin-like host-protective antigen OS=Trichinella spiralis OX=6334 GN=T01_2203 PE=3 SV=1 -----------------------ENGGQLLANVFKANPELRKFYDVEDIDpddtKKSRLIQQAGGNLLNSVTFMVNNYDNERSFKQEIKEQicdLR-EKGMKLEDARKLKTGFVNYVKSKLSQPMTAKEEKEWDMFFQRFFDALK---- >tr|A0A2E3CX61|A0A2E3CX61_9GAMM Uncharacterized protein OS=Pseudomonadales bacterium GN=CMK89_07570 PE=4 SV=1 -------SDLLNLSLEQIASAIGDPTEPVFTLLYQRHPELAAF-SREDTS-------WQHYMIQEILQNLMEMAENPDTALAIIRDMtlhHQ-MIGLEADTFKGMYRTLHDVVVQHLSGPHREDMTALWEDSVQRICRSVD---- >tr|A0A2G6L250|A0A2G6L250_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium GN=CSA49_02275 PE=4 SV=1 -------TELINLSLEQTVETLGDPVEKIYERMYQRFPDLVSY-KEENED-------WENYMFEEIITNFMSFGDDPETALLTIREMvvhHE-LIGVPREAFKGMYDTLYEVITATFHGPQESEMKAVWQEIVAKIYDCIE---- >SoimicmetaTmtLAA_FD_contig_31_10253239_length_247_multi_1_in_0_out_0_1 # 3 # 245 # -1 # ID=589621_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.671 -GLSEYERGLVVNSWKALTKPdfspldGTSSLSNFYDAVWTKWlkidEFANKMFRSR-------GFKGRVQHLLRIMGVIIKCAEDPLRGLeqlRSIGVQHC-IWGINSQSFASLALSIIHGLDQANGKEINAELKELWLAL------------- >14BtaG_2_1085337.scaffolds.fasta_scaffold158720_1 # 2 # 106 # 1 # ID=158720_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.467 -GLNDAEIESIKASWKTITNTastngGDTMIVKFYDTVWNRWtkldEVANQMFQSR-------GFKGRAQHLMRIIAILIKFLDDPS-TLtqiKNLGVQHC-VWKINTESFSALAV-------------------------------------- >tr|A0A2T7PRA6|A0A2T7PRA6_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_02930 PE=4 SV=1 ---EPHDKTIVAESWKLLRSIFPDLIESAFVEMCRRVPRLKLQFGNVDVDDDEerhMNFLKHVWDVSFFFDQLLLYLPfksKLEECSFHIGLVHA-SVEVPAWYVDLFLVEFIRAAQETVQLEWTPAMENAWAVFLRYLCYYMKDA-- >tr|A0A183IYP9|A0A183IYP9_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 -----------------------------TLGLFTSSPEIRSLFPTLvDWgddIKTCQKFRNQGLKFVHVISLSLTTLHDKehlDTLLKEIGTRHVEfmPGGIKMEYWDIFEKAMVKCILQQI--RWTDDfdeaiqskAAIAWRILCAYIVQKIKIGF- >ERR1719183_316154 ---EPEVSAATKRGWRAWVAdmfaRGIPAGEALYQTIMDDAPSLKHLFTKP--------KPVQAMRFRTVLSSLVQTCDDPERLRvqtETLGYQHL-NLEITVDRAELFRDTIYDFIQMDFGNR------------------------- >SRR5947209_7523480 ------------------------IAKAFVDQLAHVFPPICAMLPMAT--------KTARYQTACAIAAACKHAHDLGAIAPMIAATgadLS-RHGFTAEHLPAARAAFLNALRKCAGEDWTTVVEKDWNEVISEFAGH------ >ERR1051326_6499376 ------------------------IAKAFVDQLAHVFPPVKGMLPMAT--------KTARYQTACAIAAVCKHASNLNDIAPMIAATgadLS-RRGFTSEHLPAARAAFLNALRKCAGEDWTSVVDTDWNAVISEFAGH------ >tr|A0A1A0K7B8|A0A1A0K7B8_9CORY Uncharacterized protein OS=Corynebacterium sp. EPI-003-04-2554_SCH2473622 GN=A5774_01015 PE=4 SV=1 ---------DLASLATHLRAHPATFRDAVHRHFFAALPDARQSFPMD--------ASQAHRGLAESFAAAFDAP-DLDEYFADLGRSHR-RHGFPPDTYPIFATATRQALAEID---LADNVLQQAGALVDDIVAFMSTA-- >tr|A0A127NUX4|A0A127NUX4_9CORY Oxidoreductase FAD-binding domain protein OS=Corynebacterium simulans GN=WM42_1693 PE=4 SV=1 ----------MKELGEHIRRHADDYRDAVHQHFFATVAESRQIFALS--------MRDTHPALAPAVAWILDAADdagflpeETIERVRELGKEHR-RHGFPTEIYPKFEASLNEGFIALG---LTQHQLVVAKRAVHTVCTTMAQA-- >tr|A0A0F6QY96|A0A0F6QY96_9CORY Oxidoreductase FAD-binding domain OS=Corynebacterium camporealensis GN=UL81_10405 PE=4 SV=1 ----------MKELADHLRRHANEYRDAVHQHFFNTVLESRQIFSLQ--------MRHTHVELAPALAWAFDRAQrdgtltpELEEQLTQLGRDHR-RHGFPPEIYTDFANSLIAGFDALG---LTPYQRQVASHAVTEIANVMANA-- >tr|U3GX34|U3GX34_9CORY Uncharacterized protein OS=Corynebacterium argentoratense DSM 44202 GN=CARG_08960 PE=4 SV=1 ---------TLADTLRAEPKRLSHFGDLAHSALLRRAP---GLISFF--------GPNPHTELTTAVLFILTHSTpgpqdsgtqtPLspridaagAGALRALATEHV-AYMPPdPALYLAAADALCEALRDSCA-DQPFQQVLAAEKALREACSLMATH-- >tr|T1FHE7|T1FHE7_HELRO Uncharacterized protein OS=Helobdella robusta OX=6412 GN=20208246 PE=3 SV=1 ------------------------------GTLLQSNPLVKNTFEKFRQmDpmsdfTDSSVFSTHAMVVMSAFEDIFDNLDDSEIVKDILEQgkSHG-KFseDFAPETFWAIEEPFMSSMKDILGRKMSSQLEKIYKKTIKFILSVLIKGLR >tr|A0A0N4WD13|A0A0N4WD13_HAEPC Uncharacterized protein OS=Haemonchus placei PE=3 SV=1 -CLTPAQILLIRRTWTHARNQGaLEPAISIFREFWKNLNFLQ-FQKLKKSRKCSESFQRHAQIFTTIMDELIANLDNPTATSPSLREsgeKHVFqtrdQYGCpfRATLLDQFASAMIErTLEWGEKKDRTEVTQTGWTKIVLFVVEQIKEGFH >tr|A0A2R8AKY2|A0A2R8AKY2_9RHOB Uncharacterized protein OS=Aliiroseovarius pelagivivens OX=1639690 GN=ALP8811_01706 PE=4 SV=1 ------------HSLDLLVGQEDAFAHAFFPLLFARAPELRVLFGDNiDD------PTQQVRVLYRMMMAFA---GNDVTLIaglRLIGFRLA-MRGLGADQAELMANTLIGTLKRQLGNSWQSDFAFAWRIE------------- >tr|E1NZ07|E1NZ07_CAEEL GLoBin related OS=Caenorhabditis elegans OX=6239 GN=glb-29 PE=4 SV=1 -NLSVKQKKLLRQSFNAMNSGGtfLKLMEKIFRRLETKCPDMRSIFLTTAFvnslSreRQTPplvkTEYDHCKCMVGIFERLIENLENINEQLTMirhYGEKHAQmaESGFTGAMIEQFGEISVFVIGSQDVVKFNHETVKAWRLLLACVTDEMKVGFD >tr|A0A0C2G6K1|A0A0C2G6K1_9BILA Globin OS=Ancylostoma duodenale GN=ANCDUO_17195 PE=4 SV=1 --LSYKHRKLLRATFQQMNSSGafLKLMEQVFRRLEAKYPDIRSIFLTTAFvnslSreRSSPPlvrtEHDHCKCLVALFEKIMDNLSDDTQLmvIRQYGEKHAQmkESGMSGGMIESFGEIAVAVIASQYSYWIQKPVDDVTrrkgrDEGLVYLNDYEYIIL- >tr|A0A0G4HY87|A0A0G4HY87_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_33490 PE=3 SV=1 ----SNRIHLLQSSLAACLKMstkEEFVGRLMYDTLMRTLPEPGIIAKRGR--------TMMSRAFNDTVAALVAFVSEPshmETYMDWLALRHV-HYKIDTTLFPQFRQAMLVSLEQVMADQWNAEIERAWSEAYEMTSQAL----- >SRR5262245_61346593 ----DCLRRGLESDFKALV--DESFAASFYKRLFQSRPLLEGRFHN---------LQTQERMLAENLRDLVEFH--PEESagrFLDHVNRHK-PRGITAEDILAFRAAFVAEIVQQGskllAQKIPpGARADAWNA-------------- >SRR4051794_17889687 ----DSLRDAIIDSFSLVS--DERFGLRFYESLQS--HHVGGRFKD---------INEQHRKFIKELRSFVDSE--PPAGlaLRIIAGRHR-PYKLS----------------------------------------------- >tr|A0A0K0FHQ3|A0A0K0FHQ3_9BILA Uncharacterized protein OS=Strongyloides venezuelensis OX=75913 PE=4 SV=1 -NLTASQIMSIKRSWKHINTKGlFNVLRRCYQRCECCSLAVSMIFSAEQMKkqqhAYSCGVSEHSKYFISLLDRIIDNEPNIEQELRNVGKEHVKlyeEYKLGTADIERLGEIIADVFLKLDGIRQNKETSKSWRILIASIIDEVSVGY- >SRR5699024_10156350 ----------------------PRFPALFARALRAADPDFRGMFPRD--------PAPVLAEFVRAMTFVLETTeaaaAATartDevvELARPLGADHR-ERDLPPSNRVPTGDARAATLPPLAGSGWTEAPETTLSTAYRVVSTALQ---- >tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 GN=HMPREF3121_11375 PE=4 SV=1 ----------------------PTIGPEAFRRLLDAEPRFRHMFGGS--------KTALRDQFMSALSTALVTRadvgRFPaa-tiRRLEQLARENR-KFGVAPRDYATLAEHLLDVFGERLPAGPDSGAQVDALREILDEA-MSL---- >tr|A0A0C2M2P6|A0A0C2M2P6_THEKT Uncharacterized protein OS=Thelohanellus kitauei OX=669202 GN=RF11_12769 PE=3 SV=1 --LTLEERLKLKESWIKIYQKIqdlpdVDITFEIFVRLMERRPEMSKNFEKD-VY-KYSRMKSHSDKMLVILNNMIRNLDDEQKMLKYLSgmvRRHR-NYGIRQGDCKMWEEIFLDIISRY----------------------------- >tr|Q5D2M7|Q5D2M7_9TREM Myoglobin 1 OS=Paragonimus westermani OX=34504 GN=myo1 PE=2 SV=1 -PLTQAEVDGVVSELNPFLasdAKKVELGLGAYKALLTAKPEYIQLFSKLHgLTidnvFQSEGIKYYARTLVEDLVKMLTAAAKDDELQKVlvhSGHQHT-TRKVTKQQFLSGEPIFIDFFNKTLSK---PENKAAMEKFLKHAFP------- >tr|A0A1S8X4B3|A0A1S8X4B3_9TREM Globin OS=Opisthorchis viverrini OX=6198 GN=X801_02811 PE=3 SV=1 -PLTQSQIAGIHKELLPILsndEAKTSFGVGAYKAFLGAHPEYIQYFSKLNgLTidnvFESEGIKYYGRTLVDEIVKMLTAGADDEKLKQVlhdSGKAHT-ARNIDNATFMvsklfmflkrvsemrlarglygpfpifaqSGLPVFVDYFNKSLTV---PENQTAMEAFLNHVFP------- >tr|A0A1I8C1X6|A0A1I8C1X6_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=3 SV=1 -DLSPHQIGLIKRAWKNLLKSvnENEIAIKLLLRIFQLDPRNLAYFSLNEYspfdeylIKENNIFINHVKTFESTLINVMTHPGNATKLskhLQQLGGRHV-NYtGVTykCSYWKCFIQSLIDVLTLNKDKNTSEDLHEAILILGEFCVEQMKIGYK >ERR550539_1089662 ---------AATASWNNIDD-KPAFGKAFFKNWLSSNPAIEEEFAKSSFK------QGPAQFLVERFDILLGVIEDEDSLAEELYqvaKTHK-KVGVDQSDLYSFQASFMKTLPSFD-ADFSAETGNAWAYVLSHVI-------- >ERR1719210_3079978 ---------QPKRVGRTLT--KQLSEKLFFQNWLDSEPDVAEIFKKSSFP------QGPAQFLVERFDILLDVIDDEVALSKELYvvaKTHM-DRGVSPDDLVTFQDSFLKTLPSFD-SEWTRDRSESWAYVLSHVI-------- >tr|A0A1G0FYS6|A0A1G0FYS6_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium RBG_16_51_14 OX=1798265 GN=A2W28_07810 PE=4 SV=1 ---------LFNNSFQRAIiPDSNSFYKRFYEIFVGSDPRIAELFEKTF-------MNLQREMLKQSMTYMMSFSatLEPSDEMKELAEMHGRgKLNIPANLYEIWLESMIKTVEEFD-PKFDENIEIAWRVMMAPGVAYMQS--- >SRR3989338_9975634 -TIDHRSVQLIKQSAGAIKGQAQAINRLVYEQLRRDHPAAYSLLQQAGL-------P----PLASIVANYAAGIDNLEVFLghaPKIALTHQ-RIDLQEVHFESVASSLFLAFRQALDPDaLSDEALLAWRRAYDH---------- >tr|A0A2A6RLC4|A0A2A6RLC4_9CHLR Globin-coupled histidine kinase OS=Chloroflexi bacterium Kir15-3F OX=2024553 GN=CJ255_07345 PE=4 SV=1 MGLRAEDGATLKALAPKAEAYGPTLTKTFYDRLFA-HANTAEYLQGVD-------MQRLHSMVQTWFMGMFAGVYDRDYArqRLHIGEVHV-KVGLPVRYPLAMIDVVMSFGDQIANESSePAVALAAFQKVLSLDIAIFNQAY- >tr|A0A085M5J8|A0A085M5J8_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_06691 PE=3 SV=1 -CLTKRQRRCILKSWRKVQ-NKAQLGEEIYIQIFMQKPVLKSLFPFRATPvnelHDNVLFTRQAVIFIDFIDNVVAYVGINngrllQELCTRVGISHAlmTRVNFDPEWWYLFANSVLDGMQKFCLPNFSCepiatyigsQSMLAWRILLKHVVEMMSDAF- >SRR5215470_9720857 ---------EAKRSYRQFAR-DISFYRELSKRLFRKIPGIEKKFRHR-------TMEEQYKVLRDSLWLLLSYASAPDqqepTILSRIAHTYA---RFPKEWFDTFREVILDVVAQRDP-----SSVRAWKHAMAPGLE------- >SRR4051812_31756681 ---APSVMRLLASCTADLGPQQPELAEALYQRLLELLPEVatlAE------------RGRPLSDRILHAVLYPTEPGrt--PLNVatvVQQVGAQNY-LDGLVGEHYSSVTHAVLHAAREMYRGEWSSALSSAWVEYLLWLRGHLLAG-- >ETNmetMinimDraft_22_1059887.scaffolds.fasta_scaffold1682169_1 # 3 # 206 # -1 # ID=1682169_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.363 ---------TVFSQWRRMK--IEDFGECMYRSL-VQDASLEKLFRRE-------RMRTQSLLFAAFIQVALCWLEERDfrkveRDMISLGLRHR-SYGIQPSYVCVFQIALLQTLCQNLNG-LSLQAEISWSVVWSH---------- >tr|A0A0A9Z6R2|A0A0A9Z6R2_LYGHE Neuroglobin OS=Lygus hesperus OX=30085 GN=NGB PE=4 SV=1 --LEEDEIERIKKSWVLVKENDFRFIDILRQEMLCDIMMYELYFNPGrkaDVcVSELTEFKNHPKNVYSTLDFIVGDLENENVIIEkmiEIGKNHG-RLGISRKHISFMTSTIYQAVECTIGPCmFDRLVDQSWEKFLTSFN-------- >tr|A0A0N5AZ47|A0A0N5AZ47_9BILA Uncharacterized protein OS=Syphacia muris OX=451379 PE=4 SV=1 -PISYKNRQLVQSCFRNP---HELLGKRILKKTRDKKPDFDLFLSKLDGK----QRDELEESIKVLLKKVVANIDFIDEVqrlGEEFGANHVqfRKEGFKPEFFGIYADAAVTEctfLDSA--VHPPHQTLDAFSSFISWIFSFVRDGYY >tr|A0A158N7T9|A0A158N7T9_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=3 SV=1 -GLTAQQKAILATMWRQLPRGvIFDLGKRVFEIIFERDPKLLMIINLEHLQntnqwQEHVNFRMHAQRFTHALSQSMRNLTEPIIAADRLqefGASYVNqenitygslNVVIPHSYWDRLSAAITTTAQEFLNKqqlktskqtltvdnvlllenerrnsrnlfSQVSANINAWSILAQFIANQIRFGYE >tr|A0A1I7RRX1|A0A1I7RRX1_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1 -GLTSTQKKLVQAKWMEMDGVgILDMGRNVFETLFRREPACLKAIGLGHLThgrnlewRYHVNYRQHVKRFCEAFNEVIRSFEHPRTSIDQLqelGALHANtylkaseERKVPSNYWDGLVFAINYAAKDLQVEsssrgsespsnvifdrrfllpsddlgsstppsptqfsslcvtpqrrsgSVCPRVAEAWNLLAIYAVSQMKFGYE >tr|A0A0B2V954|A0A0B2V954_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_09629 PE=3 SV=1 -GLSMHQKMIVTAKWRQLPQGfVFDLGKRIFETVFERDPYLLSIISLEHLQgsdewRDHANFHLHAQRFSHVLSQCMRHLSEPIVAADRLqefGAAYAEvedsenfvRSRIPHSYWDRLITAITSTAKELHEDqpqqvrknslsvddallakkdrlalETDSTNACAWNALATFVSNQIRFGYE >tr|A0A1I7RN92|A0A1I7RN92_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1 -GLTDDQCEQLATAFSNIPDKYYAFEQMFLNLFMKEDPQLAVVFGFEGIRpeelRRMSPFRTHVCKFQRFMTTVLDMLPKknrEEELiqiIRMVGRQHCNvkLLSFTAQKWLSFKNGMLNALAKG---GESHKYYSSWNILISFMISEMKDAY- >tr|A0A183BTK8|A0A183BTK8_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=4 SV=1 -QLDDTECEQLSTVFAAMPDKYHLFEACLRPMPMPeVDPQIALTFGMANIAeielRRKTPFRYSV--------------QKrgrEEELvqiIRMVGRQHCQvkQLSFTAARWLSFKSALTWTFSRG---EQKDKLHVQWSLLISFLICEIKDAY- >tr|A0A1I7ZF06|A0A1I7ZF06_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=4 SV=1 -QLDEEQIDTIVDAFAKVSDKYGAFERVFVQLFVYEDKEIAEQFGLASVPeeviKRNQVFRTHVGKFQRFMTTVVELLPKvgrEDELieiLRIVGRQHCNvkQMNFTAAKWLSFKNVLLSVLCKN---DHHDKVYMCWNQLLSFLIYEIKDAY- >tr|A0A2V7AV10|A0A2V7AV10_9BACT Uncharacterized protein OS=Candidatus Rokubacteria bacterium OX=2053607 GN=DMD92_03445 PE=4 SV=1 -GLGEADVAVIRRTAPIVLTCEAAVTDALYAHFL-QFPATAQFFLGEDGEPDAARLARRKHTLGRWLRETAAVATTHEFSyyLLAVGLShsHRAhgPGGAVPPHFVVGamslaQTALARLFGAELGDpQAALEASLAWNKLLHVHLAVLLLGY- >tr|A0A2E9LM24|A0A2E9LM24_9CHLR Uncharacterized protein OS=Dehalococcoidia bacterium OX=2026734 GN=CL902_07715 PE=4 SV=1 -GLGQNELDIIESTRELVLSKGEEITAEVYDHFL-RFQETRRFFLNEEKAVDDDRLERRKHSLLRWLRGSLDFKIDEDYPvrLLATGIVhsHPPshraHMGSVPGRFMIGsmsylQTLLAEIFHSEIEDrEEAHRASVAWNKMLMVQLDILQAGY- >tr|W4MD58|W4MD58_9BACT Uncharacterized protein OS=Candidatus Entotheonella gemina OX=1429439 GN=ETSY2_07185 PE=4 SV=1 -GLSDDERQLIKDSGPIVLGHVRKLTEGIYDQLL-AYPESAQFFTTENGQRDEKRIEDNIQTMISWFRAAVTAPTNQGFIryLVGISQMhaNIPvhrsNNTPVAPRYVIGtisyyQTNLDDILHQHMADpDLARRTCVAWNKWLLVILELMLANY- >tr|A0A0N5DD39|A0A0N5DD39_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 -NLTPHQKQLLVQSWPQVQLYnRIHGGDAMFARFCEKNSIARETFQKIAVvqSfasneASESVLKKHEQYLVQLLSEAVENLNNDcEPLLREcldYGAQHV-TLheLLNETVWEQLAEAIIDRIHKVNLVRRHKDLSKAWTMLIILLIDKIREGY- >tr|W8BTT7|W8BTT7_CERCA Uncharacterized protein OS=Ceratitis capitata PE=2 SV=1 -GLTITERRSLQNGWSIIKQKQRRAALTIYVNLFTEHENLYEVFRSDGVL-NIEFASQHQKEVLTVFQMIIEQVDNARfvkTMLKELALRHE-AASVTNTQWQLYtnevRKYFLETLADAIS----PTFVHALDKLMNFVCN------- >tr|A0A0A1X397|A0A0A1X397_ZEUCU Globin, monomeric component M-IV OS=Zeugodacus cucurbitae GN=GLB4_1 PE=3 SV=1 -GLTSTERKSLQNGWTIIKQKQRRAALNIYVNFFTGHEDLYEIFRFNGTL-DIGFASQHQKDVLTVFQMIFEQLDNARfvkTMMKELALRHQ-ASAVTNTMWQLYanevKHYFLKTLNDALS----PTFVHALETLINYICD------- >SRR5438046_775397 --VSRETTALARASFERCSA-NGEVPQAFYRNFFARCPPAPALFAPGL-------AAGLAArLLSApaaaeqIFLFTLVAGGTPRTrl-LPP----MSrGX--------------------------------------------------- >AACY02.8.fsa_nt_gi|132068355|gb|AACY021643300.1|_1 # 2 # 748 # -1 # ID=15695_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.288 ---------------------------------------------------------------LKG--------SFHFHLlgeLENLDFEFK-FLASWFSEVDIFRDALIDLFEMEMNDqSLTPQGRHVMALLINYVG-------- >ERR1719424_2066333 -------------------------------------P-------------------------------KASTWLRPCTVhllVQSTRQQHL-VSAI-------------SCTTSRRV--------------------------- >tr|A0A0G4HHE4|A0A0G4HHE4_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6802 PE=3 SV=1 --------------------------DALLGILFEASPTMRSVFVKNGD--------LYADLIEHLLRRIIAYADDPGALWTddqHLALDHI-NFGMSMSDLPLFGASLMNCLAGVLGENWCDEWQRAWEKAWQICCQSL----- >tr|A0A2C9LD65|A0A2C9LD65_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106067556 PE=3 SV=1 --LSHKDKLFILNSWLNFRNgkREEDIGMEAALEMYSIYPEIKDIFTIYrDARmkhlTDKEMIRTHSQQVASVVDKCVMRMDDAHAFAMiavDEGSVHI---KIQERFMRCYVDCYIREIKKYSKLKWSRANQMAWEVFFDTIVVNMKNGW- >SRR4030095_5973293 -----DHFEIAKDSYARCISggdSGNSFFKTFYHELTRISPEAAVKFKGKgiGET----ETNRQYGILREAIFILLMFGENklgenEPNILSRIAEMHNKnHYNISPESYKSFVSALTATICGSAPDipePFDPqckisvneknLIKIAWQKALKPGIDYMIMRYP >SRR6478736_3613867 -----DSFEIAKDSYNRCISgedSGDIFFKTFYNRLVKKLPKD-vaAQLKGKgiGRS----KGHRQYAILREAVFILLQFGQNrlgenEPNILSRIAQMHNKaNYNISPQLYTVFVDALIDTISGLPPDipkPFDSqcsisvyereIIRNAWSEALSPGITYMKDKYX >SRR5262245_45185474 ----------------------PTFLEAFYKLFTA-DEVVGKRF--VkfDDI----EWKRQHGLLQQALDACFDFASLlsmqnlrelpEPNAMTKYVVRHGPgrgNLGITSTEYDAFVEALITTVCGNPGNgqaPYDPecadaerkdVIEFAWRRLMKLIVEHFKKVAR >ERR1712142_1087278 -ALTETEVKVIIDSWDRIHPDK--GAKMLFHQFLTDFPLMKIYFGYQETesvaeIMESEQIKTRCKVVWDVLTKIVHASGDGGKLaelVKEVSVKHL-NFNREKKDIHCFLHALKVTLTC-----FSGHLFRPWNIWCKMVEDLF----- >ERR1719263_534529 ---------------------KRTYGLNAFNRFFAKQKKAEDHFNTSN--------ARLSVLAMQGLNLCQDIYKEPTRLvnvVTSLGLKHI-MYNISTEYFDAFVEAMCEELSDWHPGN--QAAVEGVEWALTQIAAIMI---- >ERR1719446_598571 ---------------------KKAYGLNAFNRFFCKAATIGNSFQHIQ--------CASVCSgnarSPAVSGYLQGAYTLGECGhltWPQTHHVQH-FYRLLX---------------------------------------------- >ERR1719446_1691251 ---------------------KKAYGLNAFNRFFAKQKKAEDHFNTSN--------ARLSVLAMQGLQLCQDIYKEPTRLvnvVTSLGLKHI-MFNISTEYFDAFVEAQCEELAEWHPGN--QSAIEGVEWALTQIAAIMI---- >tr|A0A182EAA6|A0A182EAA6_ONCOC Uncharacterized protein OS=Onchocerca ochengi PE=4 SV=1 ------------------mgsgssvpnhgqprnvaggggndgggggnagvengdqqkvdprlpypnfrelftlknywktvRRNERDCAKMMLAKNYLKNYGYSLGII------------------------------------------------------------------------------------------------- >tr|S9VAV3|S9VAV3_9TRYP Uncharacterized protein OS=Angomonas deanei GN=AGDE_12480 PE=4 SV=1 -------------AWSHLLtsPNGGEFCSTLYEKLCQNLTYIPDYIRNLK------DEERVIDHYINVITKTLELYENPHVMIdelPKIAARHR-GFGVSSDAFFVMRNIFMELLPEYMDPKVYEQSKKDWLKFWRLVLDLMVSG-- >tr|A0A2K6VLK5|A0A2K6VLK5_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1 -NLTTTQLLLVRKTWNHAKNQGaLEPALGIFRNSFYKCGEIRSLIMGGPKNVGYERLKKHAKSFTNIMDSLITGLDAKESVIEELRKagrAHATllrdtsnkfgnksntqliGCPFRLAHFDHFASAMIERtLEWGEKKDRNKTTQTGWTKIVLFIVEQLREGYQ >SRR4051812_9951159 -PLPPEVAQTIRSSCRPLLERQEQFHGDFHASLVDLMPEVPMMREPA--------GEQVSRWLVECVLWAVNADEPVPMIGATLqgvGLDAH-RLGFPRAGYQAVGHALLRTVRGASQNDWSGTLSSSWIGYHSWLCEYWVS--- >SRR5690242_179091 -PLPPEVAQVIRSSCRPLLERQEQFHGEFHASMVDLMPEVPMMREPA--------GEQVSRWLVECVLWAVNADEPLPMIGATLqgvGLDAH-RLGFPRSGYQAVGHALLRTVRGAYQSDWSGTLSSSWIGYHTWLCEYWVS--- >ERR1711972_144950 --------SQVLQSWEQVKLLgLESVGEMLRANTFELDPQVVALFRIPGVVSTGEGMLqrmalrRLFSKVLRFVGSVVAG----------------------RYDYQRLVETLsrLGATRAAGGATEVHFKI------------------- >tr|A0A1I8CIB1|A0A1I8CIB1_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=4 SV=1 ---------------------------LLLVRTFELDPKQKHNFNLDKVDiedlRIHPIFVDYVKSFQPLLLNVFKYTNRATIMskyLQQMGGKLMRytKVSYKSSYWKVFEQALIDVVS---GGNAGDETIEALTILANFCSEQMRIGFR >tr|K7H1D4|K7H1D4_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1 ----MDGEYLLFANCPAPGIgDGNDFLYHNGVGLESNCPIVSQCFQSATYSlstnpNQVRTVADHAKYLLQLLDKIIEGDVDAEY-LREIGANHVslkHENGFSNTEWDRFQEIMVEVILKQDGVKQSKETSRAWRLLICSFIELIRDGF- >tr|A0A0D8XGR1|A0A0D8XGR1_DICVI Uncharacterized protein OS=Dictyocaulus viviparus OX=29172 GN=DICVIV_11062 PE=4 SV=1 ---------RIQHCFKAA---RPTIGEAILKRASNNRCEMRILMSRLTD----QQIELMGKQFYMLIAYSVENIERVEMIQQharTLGETyaaLC-RLGFRPDYFTSLADAAIAECVKLDGGtHKstyffnRCETLLAWSQLIGTIFTSVRDGY- >tr|A0A0N4UGY4|A0A0N4UGY4_DRAME Uncharacterized protein OS=Dracunculus medinensis OX=318479 PE=4 SV=1 MRLSDKQKLWIKLGYKKWRSKsKMVPGEWVHAYAIKKYPTMKALFKKHEN-----LARVYTQTITKIIEMAVESVDSLDDsLGPLLisyasengilEERgmasiftirndkllLF-LEGFDRRFWGYVAEALCALSRDFPLKRHKWDTISAWRIIVLFIVKKLEYGF- >SRR2546421_6426420 -------------------------------XMIRRPPRstlfPYTTLFRSD-------FERQNKLLRHAFGLLLIFPNQArtePSVLTRVAERHSRrDLDIPRSEEHTSElqsRSDLVCRLLLEKKK-KNQV-------------------- >LakMenE18May11ns_1017337.scaffolds.fasta_scaffold18991_1 # 3 # 107 # -1 # ID=18991_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.400 --------SLALASYNRCRDCHQEFIREFYDAFIEGLPEPYkEHFQNR---------QRQNTMLDSAIYLLFD-LEAPEnqKLLRSIftGSKTAGkpnpHPAYPIEWYERFLDTLVGQVSHMDRKNWNAEVEASWRNLRENALHLIR---- >ERR1719262_958340 ------QKEILDICYAKMTGelDLPAMVTMFQGIFFSRDLRIQSYFSKPNG--------TLRYIVLRIIEFLCNVFHKPAAItkeLRTLGVSHV-KWEIPPDLFVPLGEALF----------------------------------- >SRR5512142_1307926 -GLTESDIETIKQSKPIIEKHIPEIVTKFYAHLLR-YPPTRRVFLKKDGSVDQPYVELRMRHLTNFWLRTATGVyDdDYARYIDYVGRAHT-SHGADPHIYIAeryvigqvgfVQHAITDALSRELRhtdEEFEVRAVEAWDKLMMVLLEMLSRAY- >ETNmetMinimDraft_9_1059917.scaffolds.fasta_scaffold1595668_1 # 1 # 216 # 1 # ID=1595668_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.366 -GFTRADAEIIAQACPIIEEHLPNIVADFYDQLLR-YPPTRKVLLKPDGTIDQEHVEKRMLFQINFWLRSASGVyDdDYASYIDYVGRAHT-SHGADLNIYIAeryvigmvgfMQRAIDQALDSELHdadHTMEDRAEAAWGRLLMVILEMLSRAY- >ERR1712137_619303 --LPRESITVIRDTWAMVER-NVDIAPKMLLKMFQLYPVTQNLIPLLrGVSledmPTNKRFLQLAYGSQFAMSAIVDKLHRPDMLEEIIGGGmHAFVDGLSTSFQmAATTALFNKIMTEELGSAYTAEAQEAFIATGDMMTSIMVK--- >SRR5580704_4499342 ------------------------TLGDFYRRLLQHHPQLAAYFEGVN-------IDFQVQKLVVVLSTIARDLPDRSVLdrvLFHQGVAHV-ERGIGRGEFNEFIALLANVVSCKTTLVGAAESYAVWYQELSAVATSML---- >ETNmetMinimDraft_24_1059892.scaffolds.fasta_scaffold323471_1 # 1 # 354 # -1 # ID=323471_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.472 ---------------------RKKVCTDLYFRLFDVVPASQDYFKQSNT-----RLHFIAELV---INMTLDMYQKPTKMMsqiSALGLRHV-ALNVPTDIFPAFIDVYITVVKEYTN--------------------------- >tr|A0A1Q9F3K1|A0A1Q9F3K1_SYMMI Copper-exporting P-type ATPase A OS=Symbiodinium microadriaticum GN=copA PE=4 SV=1 --LDEFTIKEVQNGWATTEKrlgGPKAAGEHVFGKLKKEVPRTEGMLKRSS--------TV------WHLFTElLQAIDQPKLVqkrLEYIALRHM-NADITTADIEVFRNILFEVCASKLGGLmtpefqYQAQYSFGMGQIIVAVGTS------ >tr|A0A183BUR6|A0A183BUR6_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 -GLSAHQIQILQKIWERSPESeISDCARNIMSHLLRSNAQMYQFFDLLGHsdreIANSPIFARQSANFAVLLDFVLANLLEeVQKVclaLQHLGAQHARlRWPIETHHWALFCRCFEDNPPKEV--FLNAEGHDLWKTMINFIIVQMRVGYD >SRR5690348_61285 ----------------------------------------------------------------RATHWLLDHFDHPGEIVSVLVRYvpalDA-LTGPHSRQLELFGEQITQQVDDEA---------------------------- >JI7StandDraft_1071085.scaffolds.fasta_scaffold2802978_1 # 2 # 235 # -1 # ID=2802978_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.607 --LTPTTIRLLLATSDIVG--SKETADKFYNRLFLHSPELKELFVGGETTtTTSMGIGDQALKFSQMMQWTTRALQQmhlqqkqkqqpsrssggggggdacsngtaPTrrstsAVfrsMTNLGRRHV-RYGVQLKHFHPVKQALLDTIAEL----------------------------- >ERR1740139_220892 ------TRAALLKSWEMVQEAGTVPAAnLLMKHLRERDAEALRVNTSHARPktgeTEEDAVRKLAVRTVQILGSAATGMSDTVSLVQHLHKVgagFA-GTGIKEGYFAMVRDASPFALRELLGDRFTADIASACRITGPFLASLIIAGLR >ERR1740139_941170 ------TRAALLKSWEMVQEAGTISAAnLLMKHMREKDAEALRLNTSQARPktgeTEEDAVRKLAVRTVQILGSAATGMSDTVSLVQHLHKVgagFA-GTGIKEGYFAMVRDASPFVLRELLGDRFTADIESACRITGPFLASLIIAGFR >tr|E0VF27|E0VF27_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236389 PE=3 SV=1 ---------VVLNDWPKIRKNYKKIFIDSFINYFAENPNYKLLFPSFsNVSeddlPFNHCFRLHCFAVYKAINFLMSNWlGeyeeDDSKILPVIGKTHF-DRGITLEMMNLYKHSIVYSCNNHLKPNL--KRKLSWQTVFDHIFD------- >SRR4030067_646800 -AFTQADADAINESRFIIEKDIPEIVSKFYTQLLR-YPPTRKHFMRQDGTLDQEYLQLRMHHLTNFWRRTAYGeFDdNYARYVDYVGRART-SHAGDHRPGCgppagsrglrAGPGNAHLGREPRRGG-CESGGDRRWRKEDRP---------- >ERR1719347_1935341 -GLSQNEVTLIWSHWESLKPHKRRLAKRILKVYIKEHPRARELFPNWvDIPtvelVKLTSFSRKAVDTWEAFSRAWECIDDAPLcrkVCYAFGKKHIEcnarikgHGQIDEHHVKNFIRIFLRIILVSAR----EGSEEAWRKATEFFSINFVRG-- >SRR5690625_6901273 -------------------TPPETYTPSLHDAL----------PISA--------RASRHVDLTVAIAWALENPAPkVDALVAQLGRDHR-RLGFPPEVYDTFAQDRKSTR-----------LNSShVAISYAVFCLKKKT--- >SRR5580692_4143848 --SDSGIWPVIRQSAARLSRDEDAFIQELHYEITRLISDPAGAPAP--------DMWVFCERMVRSFLWVAL-TDQPlGVVADtlrKVGVHYW-VEGFPDTLYGEVTHAMVQTVHYLCAHDWSASMGSAWITYFMWIKPHLLAG-- >SRR6266704_2516069 --SDSGYD---APPAGALARDQGAFIRQLHYDVTSRIPESAVPPAF--------DMWGFCDRMAQTLLWVAL-TDQQpSLVTDtlrQLGAQNW-YEGFPDS--------------------------------------------- >SRR5438132_1665678 -------RSRVLASYSRVQSgdRARTLYQAFYQQLFRAVPDVEPLFARID-------MVRQYDALNKAIKLLLDYDPQSREstdDIRAVAVIVA-APVIVAVHLNVAApVTVIDKRKGCGS--FGTTVV----AVMGPGVGWGD---- >SRR4051795_10036070 -------RDQLFISYSHR---DESWLEEFATMLAPVQKSgslnIWSDKEiraGED-------WSAKiQEAMSRARIALLLVSPAFLAsdFIQKTELPKI-LSDHTCRGMHVywvlleqslTEWSPLSQLQAAHP--IKISlseisnvgerrnVIANICRQIANELGQYS---- >ERR1712051_620824 ------------EGWATMQDHILNYLSntMMLPFVMRCNKSILKYFVTYESNvsllkfEGSqglAslEKTKHGCWfLTEVLTKVIPNLECLDTCieyLKDLGQKHQ-TQGVRREHLDLLALVYVSAVKEVMA--------------------------- >tr|A0A2C9LKZ0|A0A2C9LKZ0_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106051185 PE=4 SV=1 --ISLADIKVITNQWEDVLRCSDLFGKLLVLYVLDNCPKVNALHPGLHArlTdARDSVEKQIGLRVIQSISCVIHNLNRAPAVESMVRDTfkkLQ-QHGYTKNTILECSEAFLSFMNQYFSKRWLKQHSDAWFKVLKALL-------- >tr|A8WLI5|A8WLI5_CAEBR Protein CBG24801 OS=Caenorhabditis briggsae GN=CBG24801 PE=4 SV=1 -------------WIFSFQLEG-SKSRTQIERILKKFKNKKKS--------------------------------------------------------------------------------------------------- >tr|A0A1I7RWJ6|A0A1I7RWJ6_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=4 SV=1 -KLSKLQKRALRFTWHRLQTRnggkrVDNVFEDVYDRLMRLVPVMKEMFTTRAFlsamSkHEVATPRDHARFTVKMIDSVIKNLDTDEKkrtdtlseFDPVlIGRAHAvlRPYGFVASIWEKLGETIIDVVLVQDAVRDLPGAGQAWVVLTACLVDQLRAGF- >tr|A0A2A2L6J3|A0A2A2L6J3_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_22934 PE=4 SV=1 -KLTKLQKKALKFTWSRLQTRnggkrVESVFEDVFDRVVRYLPQTREMFNTR--------------------------------a---FlCAISrneTSslRDHARVIFFLhsfadlcKLHDKCLLL----------IPSA--FTLCFSLCTIYELRGS-- >tr|A0A1I7XY15|A0A1I7XY15_9BILA Uncharacterized protein OS=Steinernema glaseri PE=3 SV=1 ----TSSLALLTSTWPDHFGNLFDMGLNALDATFKKHPDLMAYFAFNDRVnwKKEDKVRKVVLALEQTLVHAVSVFGEvhsgdekeeaiqgFEVLLEEIGGLHRAiVPNFVPEHFIKFLAVLPTAIVTTICdkreeimpESDREMLLELWKKISAFMGFHLDAG-- >KNS7NT10metaT_FD_contig_41_844412_length_214_multi_3_in_0_out_0_1 # 3 # 212 # -1 # ID=205324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.619 --IPPKLAVLIREKWQAFLEKfptREQAGEAIYDSFMEEAPSLRPLFKTP--------RSVFGLRFIASLTNLMAVRPAGVTEEagGNHGF-----------------------PAPRLGG-------------------------- >tr|E5SHC1|E5SHC1_TRISP Uncharacterized protein OS=Trichinella spiralis GN=Tsp_03845 PE=3 SV=1 -SLSAGELKLLRWLWKQMKQVHQgLASAKLFQIIFATCPEIKRFFGLAKVS----------------DEKALIDerMRKhmlilqASKLIILFQIISSa----------------------------------------------------- >SRR5690606_9602430 -------------------------YRAFYPILYSSVSGAQELFEATVG-TDNRKMLQILAKLFG----FISNVNhSSEFMKsdAFIerGKYYA-DHGISETMMRGFSSALVLTLRRTLGELFTISHVRAWGIFLDTISHAL----- >SRR5581483_1589235 ---------DIKESFHRILEQKQAVTHLFFTVALGSGHEARLLIWETEG-----------------AGCSVESTDPPQWLC------------PPFTIYAQFTNDLLQALREFHGADWNQELMEQWRMTIERVGQIIFSACR >SRR5262249_34977875 ------------------LEQKQAVTHLFFTVALSGCHEARLIFWGTEG-----------------AGHSGEFFSSPQMLC------------APLAMYAQFTNDLLRALREFHGADWNPELTEQWRMAIERVGQAIFATYR >SRR3954454_18132641 ----------------------WRDADRPAWAALNADPEVREFFDR--------PLTrpeADASldrfrsdLAARGWGWWAIELTATGE---------------------LIGMAGLDPTE--DDIP-VAGVEMGWRlarAHWGHGYATEA---- >SRR3954470_12875293 ----------------------WRADDLDAWAAINADPQVRAFLGG--------VLDrgqAAESirrfrtaLAARGWGWWAVELTATGE---------------------LIGIAGLDPVD--EGLP-FDGVEIGWRlarWAWGRGYATEA---- >tr|A0A1Q9EV88|A0A1Q9EV88_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene4882 PE=4 SV=1 ----------------------SAFKMEVFETFFATCEQSQEYLKASNA-----KLQFIAGRILDI---MTDMFRTPQSAVkdiSALGLLHA-GYGVREELIQPFVTAFMTAVKNAC---------------------------- >tr|S9TGR2|S9TGR2_9TRYP Uncharacterized protein OS=Strigomonas culicis OX=28005 GN=STCU_11951 PE=4 SV=1 ---------TLEGCWQLLELrpqGLEEIAQAMYFYLLSHNRQLQSYFYGI-------DMEEQGRALVRMLCSTVHTYGRTqtecdpvawsnfEGYLVEMGARHR-SYGVGDNVFHEMRDAFFQQFPHFVDAnSWRI-TCREWHTLWDTIIRLLQQG-- >tr|A0A0A2NAV4|A0A0A2NAV4_ALCFA Uncharacterized protein OS=Alcaligenes faecalis OX=511 GN=JT27_01100 PE=4 SV=1 --VTDAQRDIIKTAAPLLASGDKALTTYFYELILRDSPPMSPLASQ-------------------------------------IANNHL-ALQIQPEHDPMMGTCQLQAVREELIVRMTgNKLIDGWVAAYQQLSNLLIEA-- >SRR3954463_13473713 -RVTPDDLKHVQRSWAKLCDRRESLLAELT-VTFQSNPALQ--C----------DACCRAEWLLCAGEELVELLPAPSTLASRARVLgDRWPDPLTAPSFEIDGRAWMAAATRCSS-MWSDTIEMAWRQAWLLLSDVLA---- >ERR1711890_22380 MHLSDTEKSAVVSSWSNVNS---SLLDSVLLQLVQENADMRAAMSRGDLAedsiREQETFKADVTKLTCCITKLVTRLGNTGEVSScpATCLKNC-P-YLQPKHVPLFISSFCD------KLELTEDAKKGWKFIMEKTAERI----- >tr|A0A0B2VKC9|A0A0B2VKC9_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_09473 PE=4 SV=1 --ISPQGRDIIVNCFENS---HADIGNRICMRVFERRSDYQRFILALGKE----KWSWVTNTLRDFIEEVVLRIDDLAKideLSRKYGEDHVelKPFGFKPDFWVSLADAMIVeCVVLDMASHQPTDTVAAWSQLVSLMFSSIRDGY- >ERR1700761_7028990 -PLDEEALRIVRHSAGRLTYVTDDFIDWLHREGVALSPEVGHSVAG--------EGWPFCERMAQALLWV-ALTDQPAGvaagVLRRVGADNW-RDGFPDAEYVSVVQALVRVLRGLSGAAQIPAMASAWISCFQWMQPYLLIG-- >tr|A0A2A6D1B3|A0A2A6D1B3_PRIPA Uncharacterized protein OS=Pristionchus pacificus GN=PRIPAC_35146 PE=4 SV=1 -TLNHQQRKLIKNGYDSWRKKsCISSGRWVHSFVSSKDDRLKEIMEGNEE-----TTRIHEETITHLLDMAVESLESLDDsLGPLLISytgpqgvFEE-KDGFDRLYWSRVSEGMCQLARNFPSKANKYETVCAWRIVVLFICNKIELGF- >tr|A0A0N5AH18|A0A0N5AH18_9BILA Uncharacterized protein OS=Syphacia muris PE=4 SV=1 -SLTEKQKQLIKIGYKKWSEStTVTVGEWVYQYIFHKFPSVKGKFAKDEK-----SLAENQRRITDIIEMAVESVDSLDDsLGSFLVSyssengfLGE-SEGFDRGYWEIVSEALCQLSRHFPVKSHKSDTVLAWRIVILFVINKIEYGF- >tr|A0A0G4IA00|A0A0G4IA00_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_12404 PE=4 SV=1 ------------------------LAGKVFQKIITKAPSFRKLFVRPDE--------AYTKHFSVFLEQCLDYAQRPRCFWQehnDLAVKHI-IFGVGHNDITMMGRMIVEALQDIGGEGWAEDYAETWQKFWTEISRSL----- >ERR1719384_273858 -----------LLGTTLTT-KLLSEKLSSRAGWA--QTQTSKMFSLLSFK------QGPAQFLVERFDILLNVIDDEDQLAEQLYqvaKTHK-KVGVDQSDLYSFQASFMKTLPSF-DSDFTAEVGNAWAYTLSH----------