view Roary/t/data/expected_combined_proteome_with_filtering.fa @ 0:c47a5f61bc9f draft

Uploaded
author dereeper
date Fri, 14 May 2021 20:27:06 +0000
parents
children
line wrap: on
line source

>1234#10_00001 nudix hydrolase
MSKHGFFQITQKLFLRKGDELLILRDRKSGLGDLPGGRMNENEFFEDWSLSMQREIEEEL
GSQVQIRVSTKPLFIHKHKVNEGNFPCIIIAYHADYLGGDIILSDEHDYISWEKVQTYEP
SPLFTEYMLDAVNLYLKEYAPLVH*
>1234#10_00002 intramembrane serine protease
MIIKLKIILNSYLIYYFLRGQNTLIRTLLFEFPLTTFFVFLMVATFFIVNVFLPEHLIRQ
YFLNHPGQIQPLSWIGAVFYHGNLIHLFGNMFYLFFLGRAVEYKAGKGRWLLFFFMAALI
SSLLDSFIRGVILHDPTPVVGASGAISGIAAVAALLSPFSLRFNQRNIPFPVFLVAWIMV
YSDITNVFTEDGVARWAHLGGFISVIFAAYFLKPTERKQLHSGFILNLIFIILTLILAFF
YSNRS*
>1234#10_00003 hypothetical protein
VKTTLSGEIEKLRYEVAVKIVNLQGEVLDLRAEMKINFSEVNSKILKLQFEFEMAKIRKE
LKTEIADLRAETKTDFLELQKSIVDIYKTISTQTRWILGVATLFAAIGKVIN*
>1234#10_00005 imidazoleglycerol-phosphate dehydratase or histidinol-phosphatase
LTDKLIGFYDPVRMKAERKTSETEIKLEMNLRGTGQYQFDTEIPFFEHMLSHISKHGLID
LNLWLRGDIEIDCHHSVEDTAILMGATIHKQLGDKAGIFRYGHFTLTMDEVLTTVAVDLG
GRYFFKYTGPELTGKFGIYDAELSLEFLQKLALNAKMNLHVVVHYGDNKHHVHESIFKAL
GKALRMAIAQDSAAAGAIPSTKGVLE*
>1234#10_00006 imidazole glycerol phosphate synthase subunit HisH
MIAILDYGMGNIHSCLKAVSLYTKDFVFTKDHSTIENSKALILPGDGHFDKAMENLNSTG
LRKTIDKHVTSGKPLFGICIGFQILFESSEEIAQGSKKEQIEGLGYIKGKIKKFHGKDFK
VPHIGWNRLQIRRKDKSVLLKGIGDQSFFYFIHSYRPTDAEGNAITGLCDYYQEKFPAVV
EKNNIFGTQFHPEKSHTHGLKLLENFIRFI*
>1234#10_00007 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase
MIVIPAIDLFDNCAVRLFKGNYEEKKIYSSEPWKLAESFAKNGATLLHLVDLNGARNQLG
VNEDSILKIRETTSLKVQLGGGIRDKEKLAYYDKIGINRFILGTAAVTNPDLLKYALDNY
GKERVVVAVDARDGIVKIAGWEKDSGIHYRDLLERLVKAGIEHIVFTDIAQDGTLAGPNL
EAYREILNSYPFQVIASGGIASLKDLMDLSSLKTKISLYGVITGKALYEGKLDLAKAISS
I*
>1234_00186 GGDEF family protein
MNFENEYDLEKLVNNSLDLLTIVDLSGNVLLVNPAFERTLGWKKEDLVGKDPFHLLHPED
KESTYKEFEKLNQGLLTLSFQNRYICADGQYRYFSWTASPDLVSGLVYVTGRDITDVIES
NRKISQLAVKLKETNDRLFEQASTDPLTKLKNRRMFNEELNNLIHACDKESHPLSLLMID
ADHFKDYNDKFGHIAGDKVLVELASILTKTFRKKDVLARYGGEEFIAALPNTSEPEANQI
AERLVQTVREFSWEKRSVTISVGITTYNFNPTSKSINSEYLLNLIEQADKALYCSKVSGR
XXXXXXXXXXXXXXXXXXXYYI*
>1234_00325 only a few unknowns so keep
MAFDPSVPQQQAQAPAGTLLFPEGSSANTLNVLHSGTVRYLTEVPGGRKLELFKLNGANL
TPGSVALFTSGRYPFHLQAEEACVISTYAMNRDTISKSVGSRVSLGLMVARTLLREITEL
FKKSNQIRKITSEIEKVNDNLSILYYQFNPSVFPDIKPGSPIPEVSADVVDPVMRLCREN
LKLFFDNGGILPDRPSPQFLEEEHESQLTRLYPEEIDFQDGEFNFIRKLVMQDPKILNVL
FTADPSMLAYVCSKLANVLDQISGILKTCLTDLDEAFRIFFIGENSLVEKFYLILDITSS
GYGTAPAEFVIPVLGAFAGKIEKYKNGHQALFGVPVANISPNTQAFQSKAVTLAKKMEET
APKVQAPVTSSATAGVDVDAIRKELDNSASVIIQFSGLGAEQIKEFSALMVKVKSLKNPL
DPEGDNRKVRRTLGRHYWDMYQECFTKYMNSNRNVPKPVELMLKYGYFDETLVDDSQIAF
MYTQKDPANFTSNVPISLGTEWLEKVFKREVPTSLDEMGQNFFEKVKLENRNIVIKKESD
IPPELDNPDTRLKFEFASLYEANVRLTSGSPATHFPILTKFHSQMAIDKSYVSKKILEEV
VHELMAVDYSIFHREVIYNNNELGITKEFIQKCVIPDFILVPSIGTKVMMWQDLSIHRGA
GSKESPGRIVLPIFAQGDLKTMVADALAAFRWELTKSILGAEWNNVGNPSITADYTDYIQ
FFKKNKDLSMEIKEKLASDFKRFRNDRDIFANDYQLWMKYEADGVQRLNKVVRGIFYRHI
PFSKQVRDKVAXXXXXXXXXXXXXLLPLQRSITDLLILEIENILK*
>1234_00377 Uncharacterized protein conserved in bacteria
MTLNEFAKNVLFGSGLEDKLFSPPVHPVDIRSFDFLNVPSLPAREKKIQISEQKSKIPRL
EQLFNEENRIITLHHFANHELMAIELFAWAILKFQDAPSSIRFGLYRTLLEEQTHLKMYL
SEMKKGGMELGDRPLNFIFWKQVPKMQTLEKFXXXXXXXXXXXXXXXXXF*
>1234#10_00016 hypothetical protein
MKVTHSCLEFDSIEGLIDFAREFETGSMIRFLSPIEDNSGNVLVKEEVQVKESTLARLKD
IKGQYTPKFEVKLNKELLEQIQNILAIKIVNQLKVTDMKFLKFMYENTNYNFKGIIRNSL
LSKKTTLTLLKVYNQNLNFFKYISELGLLSLGIVMIPDTMRFRLLRRYAFTAGILMDVPR
IGVDKFTKLPSDDNEKVRVAHKCSDILQKLDLIEFTYGSISNHMPLGMIEDSTSSDKAAP
GENIDETFLDDIISNDGESDSKVDGSREDAIPEKSYDIFQALLTDALKLARYIANVSHNA
VDKDYVMEELVYYIAYNTSKKYFDELLANPLVATFKEFEVNVKRLRKIAEVEMKCVYPPS
AWAYPKPKSSQVLCKNKVWDCPNIVMGWDIHVITAQEAFGWVGTSLPVDNYPKCRLEEEL
DEIMVEPEKPKKK*
>1234#10_00017 LipL45-like lipoprotein
MKRYLSIVILCTFAMLLLVCSTNKSSGSDQVKTESNATSARIVWLLGDVKILSDSGEKKA
ELGASLSSTDRVVTGPNGGAEIMVADSGIIKMSKNSDIEISSLMNPNGSDTNVQVNYGKI
VTMVKKGQKTTEFTVSTPTALAGVRGTSFLTSVESPEGSKINCAKANCTVRFAVIEGTIA
VSKKGESSEVILSKNRELRIEKNQKLTDKLIRSLQNDSLSEMKELIVLHKNETFEYGKLV
EELKSSSEELKILSQSGSVEEVKAAFQKREADRNNADEITKTAKAVNETKYVQQDVQKEK
LKLNPKETF*
>1234#10_00018 putative lipoprotein
MDFSFNLVDNNGTVMRSTFSPIRFFTFSFLWILIDCSSVQKIENFNSVLQEPTFKSLKEE
EAILGGSSDSDYKIRKTGNTIPVFVLSPIQTPEGMDSKLAAFLSDEVRLIWAKVKGKQVR
IQEMSWKNPSQLSQELKRLNVDAVIKTDIREVSGKWVVNQKITDPVKEIVYGSVDGSFQS
PKIEDELQANQAYYLKHGSGVLALDAKSSLVPIWEKSLSSGEIDSILKKSIQGYLSFSAS
SADTEVLFQGEKIGIASFRNYPLPEGLQQIQITRPGQKDISKSLQIRSGQTISIYQEWKE
DRTLGGVRILSFPEALQVALDGLKMGETPFYRSNLTPGAMQLELVRETENGPLVYYEGQL
IVDADKITEIALPYKTDNLISEPEFWKLSGEKGFQAFSGKTLDFQNVSSLPPGWYGVFSA
PFVPENMELEGIIPITAESDSGIVAISFHTSKKTISLEYEKERLSVYSFPSNGNNVGTYK
FKKEDKEDGRPFRIITDVKEGTIRLYLGYSKVLEDRLDVSGVWRISILTRGENFSKRSPL
RNLKIEYKGYK*