view Mafft/testsequences/ex2_protein.fasta @ 0:e4d26cd8be10 draft default tip

Uploaded
author basfplant
date Tue, 05 Mar 2013 04:01:17 -0500
parents
children
line wrap: on
line source

>A.thaliana 67208.m00002
MDTRFPFSPAEVSKVRVVQFGILSPDEIRQMSVIHVEHSETTEKGKPKVGGLSDTRLGTI
DRKVKCETCMANMAECPGHFGYLELAKPMYHVGFMKTVLSIMRCVCFNCSKILADEVCRS
LFRQAMKIKNPKNRLKKILDACKNKTKCDGGDDIDDVQSHSTDEPVKKSRGGCGAQQPKL
TIEGMKMIAEYKIQRKKNDEPDQLPEPAERKQTLGADRVLSVLKRISDADCQLLGFNPKF
ARPDWMILEVLPIPPPPVRPSVMMDATSRSEDDLTHQLAMIIRHNENLKRQEKNGAPAHI
ISEFTQLLQFHIATYFDNELPGQPRATQKSGRPIKSICSRLKAKEGRIRGNLMGKRVDFS
ARTVITPDPTINIDELGVPWSIALNLTYPETVTPYNIERLKELVDYGPHPPPGKTGAKYI
IRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFVLFNRQPSLHKMSIMGHRIRIMPY
STFRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMMVPKCIVSPQANRPVMGIVQ
DTLLGCRKITKRDTFIEKDVFMNTLMWWEDFDGKVPAPAILKPRPLWTGKQVFNLIIPKQ
INLLRYSAWHADTETGFITPGDTQVRIERGELLAGTLCKKTLGTSNGSLVHVIWEEVGPD
AARKFLGHTQWLVNYWLLQNGFTIGIGDTIADSSTMEKINETISNAKTAVKDLIRQFQGK
ELDPEPGRTMRDTFENRVNQVLNKARDDAGSSAQKSLAETNNLKAMVTAGSKGSFINISQ
MTACVGQQNVEGKRIPFGFDGRTLPHFTKDDYGPESRGFVENSYLRGLTPQEFFFHAMGG
REGLIDTAVKTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDAVWIES
QKLDSLKMKKSEFDRTFKYEIDDENWNPTYLSDEHLEDLKGIRELRDVFDAEYSKLETDR
FQLGTEIATNGDSTWPLPVNIKRHIWNAQKTFKIDLRKISDMHPVEIVDAVDKLQERLLV
VPGDDALSVEAQKNATLFFNILLRSTLASKRVLEEYKLSREAFEWVIGEIESRFLQSLVA
PGEMIGCVAAQSIGEPATQMTLNTFHYAGVSAKNVTLGVPRLREIINVAKRIKTPSLSVY
LTPEASKSKEGAKTVQCALEYTTLRSVTQATEVWYDPDPMSTIIEEDFEFVRSYYEMPDE
DVSPDKISPWLLRIELNREMMVDKKLSMADIAEKINLEFDDDLTCIFNDDNAQKLILRIR
IMNDEGPKGELQDESAEDDVFLKKIESNMLTEMALRGIPDINKVFIKQVRKSRFDEEGGF
KTSEEWMLDTEGVNLLAVMCHEDVDPKRTTSNHLIEIIEVLGIEAVRRALLDELRVVISF
DGSYVNYRHLAILCDTMTYRGHLMAITRHGINRNDTGPLMRCSFEETVDILLDAAAYAET
DCLRGVTENIMLGQLAPIGTGDCELYLNDEMLKNAIELQLPSYMDGLEFGMTPARSPVSG
TPYHEGMMSPNYLLSPNMRLSPMSDAQFSPYVGGMAFSPSSSPGYSPSSPGYSPTSPGYS
PTSPGYSPTSPGYSPTSPTYSPSSPGYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSP
SYSPTSPSYSPTSPSYSPTSPAYSPTSPAYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSP
TSPSYSPTSPSYSPTSPAYSPTSPGYSPTSPSYSPTSPSYGPTSPSYNPQSAKYSPSIAY
SPSNARLSPASPYSPTSPNYSPTSPSYSPTSPSYSPSSPTYSPSSPYSSGASPDYSPSAG
YSPTLPGYSPSSTGQYTPHEGDKKDKTGKKDASKDDKGNP
>SPBC28F2 S.pombe chromosome II cosmid c28F2
MSGIQFSPSSVPLRRVEEVQFGILSPEEIRSMSVAKIEFPETMDESGQRPRVGGLLDPRL
GTIDRQFKCQTCGETMADCPGHFGHIELAKPVFHIGFLSKIKKILECVCWNCGKLKIDSS
NPKFNDTQRYRDPKNRLNAVWNVCKTKMVCDTGLSAGSDNFDLSNPSANMGHGGCGAAQP
TIRKDGLRLWGSWKRGKDESDLPEKRLLSPLEVHTIFTHISSEDLAHLGLNEQYARPDWM
IITVLPVPPPSVRPSISVDGTSRGEDDLTHKLSDIIKANANVRRCEQEGAPAHIVSEYEQ
LLQFHVATYMDNEIAGQPQALQKSGRPLKSIRARLKGKEGRLRGNLMGKRVDFSARTVIT
GDPNLSLDELGVPRSIAKTLTYPETVTPYNIYQLQELVRNGPDEHPGAKYIIRDTGERID
LRYHKRAGDIPLRYGWRVERHIRDGDVVIFNRQPSLHKMSMMGHRIRVMPYSTFRLNLSV
TSPYNADFDGDEMNMHVPQSEETRAEIQEITMVPKQIVSPQSNKPVMGIVQDTLAGVRKF
SLRDNFLTRNAVMNIMLWVPDWDGILPPPVILKPKVLWTGKQILSLIIPKGINLIRDDDK
QSLSNPTDSGMLIENGEIIYGVVDKKTVGASQGGLVHTIWKEKGPEICKGFFNGIQRVVN
YWLLHNGFSIGIGDTIADADTMKEVTRTVKEARRQVAECIQDAQHNRLKPEPGMTLRESF
EAKVSRILNQARDNAGRSAEHSLKDSNNVKQMVAAGSKGSFINISQMSACVGQQIVEGKR
IPFGFKYRTLPHFPKDDDSPESRGFIENSYLRGLTPQEFFFHAMAGREGLIDTAVKTAET
GYIQRRLVKAMEDVMVRYDGTVRNAMGDIIQFAYGEDGLDATLVEYQVFDSLRLSTKQFE
KKYRIDLMEDRSLSLYMENSIENDSSVQDLLDEEYTQLVADRELLCKFIFPKGDARWPLP
VNVQRIIQNALQIFHLEAKKPTDLLPSDIINGLNELIAKLTIFRGSDRITRDVQNNATLL
FQILLRSKFAVKRVIMEYRLNKVAFEWIMGEVEARFQQAVVSPGEMVGTLAAQSIGEPAT
QMTLNTFHYAGVSSKNVTLGVPRLKEILNVAKNIKTPSLTIYLMPWIAANMDLAKNVQTQ
IEHTTLSTVTSATEIHYDPDPQDTVIEEDKDFVEAFFAIPDEEVEENLYKQSPWLLRLEL
DRAKMLDKKLSMSDVAGKIAESFERDLFTIWSEDNADKLIIRCRIIRDDDRKAEDDDNMI
EEDVFLKTIEGHMLESISLRGVPNITRVYMMEHKIVRQIEDGTFERADEWVLETDGINLT
EAMTVEGVDATRTYSNSFVEILQILGIEATRSALLKELRNVIEFDGSYVNYRHLALLCDV
MTSRGHLMAITRHGINRAETGALMRCSFEETVEILMDAAASGEKDDCKGISENIMLGQLA
PMGTGAFDIYLDQDMLMNYSLGTAVPTLAGSGMGTSQLPEGAGTPYERSPMVDSGFVGSP
DAAAFSPLVQGGSEGREGFGDYGLLGAASPYKGVQSPGYTSPFSSAMSPGYGLTSPSYSP
SSPGYSTSPAYMPSSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSATSPSYSPTSPSY
SPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS
PSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYS
PTSPSYSPTSPS
>CE07183
MALVGVDFQAPLRIVSRVQFGILGPEEIKRMSVAHVEFPEVYENGKPKLGGLMDPRQGVI
DRRGRCMTCAGNLTDCPGHFGHLELAKPVFHIGFLTKTLKILRCVCFYCGRLLIDKSAPR
VLEILKKTGTNSKKRLTMIYDLCKAKSVCEGAAEKEEGMPDDPDDPMNDGKKVAGGCGRY
QPSYRRVGIDINAEWKKNVNEDTQERKIMLTAERVLEVFQQITDEDILVIGMDPQFARPE
WMICTVLPVPPLAVRPAVVTFGSAKNQDDLTHKLSDIIKTNQQLQRNEANGAAAHVLTDD
VRLLQFHVATLVDNCIPGLPTATQKGGRPLKSIKQRLKGKEGRIRGNLMGKRVDFSARTV
ITADPNLPIDTVGVPRTIAQNLTFPEIVTPFNVDKLQELVNRGDTQYPGAKENGARVDLR
YHPRAADLHLQPGYRVERHMKDGDIIVFNRQPTLHKMSMMGHRVKILPWSTFRMNLSVTS
PYNADFDGDEMNLHLPQSLETRAEIEEIAMVPRQLITPQANKPVMGIVQDTLCAVRMMTK
RDVFIDWPFMMDLLMYLPTWDGKVPQPAILKPKPLWTGKQVFSLIIPGNVNVLRTHSTHP
DSEDSGPYKWISPGDTKVIIEHGELLSGIVCSKTVGKSAGNLLHVVTLELGYEIAANFYS
HIQTVINAWLIREGHTIGIGDTIADQATYLDIQNTIRKAKQDVVDVIEKAHNDDLEPTPG
NTLRQTFENKVNQILNDARDRTGSSAQKSLSEFNNFKSMVVSGSKGSKINISQVIACVGQ
QNVEGKRIPFGFRHRTLPHFIKDDYGPESKGFVENSYLAGLTPSEFFFHAMGGREGLIDT
AVKTAETGYIQRRLIKAMESVMVNYDGTVRNSLAQMVQLRYGEDGLDGMWVENQNMPTMK
PNNAVFERDFRVSVAQNAIKLMDLTDNKFLRKNYSEDVVREIQESEDGISLVESEWSQLE
EDRRLLRKIFPRGDAKIVLPCNLQRLIWNAQKIFKVDLRKPVNLSPLHVISGVRELSKKL
IIVSGNDEISKQAQYNATLLMNILLRSTLCTKNMCTKSKLNSEAFDWLLGEIESRFQQAI
AQPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKEIINVSKTLKTPSLT
VFLTGAAAKDPEKAKDVLCKLEHTTLKKVTCNTAIYYDPDPKNTVIAEDEEWVSIFYEMP
DHDLSRTSPWLLRIELDRKRMVDKKLTMEMIADRIHGGFGNDVHTIYTDDNAEKLVFRLR
IAGEDKGEAQEEQVDKMEDDVFLRCIEANMLSDLTLQGIPAISKVYMNQPNTDDKKRIII
TPEGGFKSVADWILETDGTALLRVLSERQIDPVRTTSNDICEIFEVLGIEAVRKAIEREM
DNVISFDGSYVNYRHLALLCDVMTAKGHLMAITRHGINRQEVGALMRCSFEETVDILMEA
AVHAEEDPVKGVSENIMLGQLARCGTGCFDLVLDVEKCKYGMEIPQNVVMGGGFYGSFAG
SPSNREFSPAHSPWNSGVTPTYAGAAWSPTTGGMSPGAGFSPAGNTDGGASPFNEGGWSP
ASPGDPLGALSPRTPSYGGMSPGVYSPSSPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSP
SYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPSSPSYSPSSPSYSP
SSPRYSPTSPTYSPTSPTYSPTSPTYSPTSPTYSPTSPSYESGGGYSPSSPKYSPSSPTY
SPTSPSYSPTSPQYSPTSPQYSPSSPTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSP
SYTPSSPQYSPTSPTYTPSPSEQPGTSAQYSPTSPTYSPSSPTYSPASPSYSPSSPTYDP
NS
>gi|7292659|gb|AAF48057.1| RpII215 gene product [Drosophila melanog
MSTPTDSKAPLRQVKRVQFGILSPDEIRRMSVTEGGVQFAETMEGGRPKLGGLMDPRQGV
IDRTSRCQTCAGNMTECPGHFGHIDLAKPVFHIGFITKTIKILRCVCFYCSKMLVSPHNP
KIKEIVMKSRGQPRKRLAYVYDLCKGKTICEGGEDMDLTKENQQPDPNKKPGHGGCGHYQ
PSIRRTGLDLTAEWKHQNEDSQEKKIVVSAERVWEILKHITDEECFILGMDPKYARPDWM
IVTVLPVPPLAVRPAVVMFGAAKNQDDLTHKLSDIIKANNELRKNEASGAAAHVIQENIK
MLQFHVATLVDNDMPGMPRAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSARTVIT
PDPNLRIDQVGVPRSIAQNLTFPELVTPFNIDRMQELVRRGNSQYPGAKYIVRDNGERID
LRFHPKSSDLHLQCGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFRMNLSC
TSPYNADFDGDEMNLHVPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKM
TKRDVFITREQVMNLLMFLPTWDAKMPQPCILKPRPLWTGKQIFSLIIPGNVNMIRTHST
HPDEEDEGPYKWISPGDTKVMVEHGELIMGILCKKSLGTSAGSLLHICFLELGHDIAGRF
YGNIQTVINNWLLFEGHSIGIGDTIADPQTYNEIQQAIKKAKDDVINVIQKAHNMELEPT
PGNTLRQTFENKVNRILNDARDKTGGSAKKSLTEYNNLKAMVVSGSKGSNINISQVIACV
GQQNVEGKRIPYGFRKRTLPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLI
DTAVKTAETGYIQRRLIKAMESVMVNYDGTVRNSVGQLIQLRYGEDGLCGELVEFQNMPT
VKLSNKSFEKRFKFDWSNERLMKKVFTDDVIKEMTDSSEAIQELEAEWDRLVSDRDSLRQ
IFPNGESKVVLPCNLQRMIWNVQKIFHINKRLPTDLSPIRVIKGVKTLLERCVIVTGNDR
ISKQANENATLLFQCLIRSTLCTKYVSEEFRLSTEAFEWLVGEIETRFQQAQANPGEMVG
ALAAQSLGEPATQMTLNTFHFAGVSSKNVTLGVPRLKEIINISKKPKAPSLTVFLTGGAA
RDAEKAKNVLCRLEHTTLRKVTANTAIYYDPDPQRTVISEDQEFVNVYYEMPDFDPTRIS
PWLLRIELDRKRMTDKKLTMEQIAEKINVGFGEDLNCIFNDDNADKLVLRIRIMNNEENK
FQDEDEAVDKMEDDMFLRCIEANMLSDMTLQGIEAIGKVYMHLPQTDSKKRIVITETGEF
KAIGEWLLETDGTSMMKVLSERDVDPIRTSSNDICEIFQVLGIEAVRKSVEKEMNAVLQF
YGLYVNYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFEETVDVLMDAAAHAET
DPMRGVSENIIMGQLPKMGTGCFDLLLDAEKCRFGIEIPNTLGNSMLGGAAMFIGGGSTP
SMTPPMTPWANCNTPRYFSPPGHVSAMTPGGPSFSPSAASDASGMSPSWSPAHPGSSPSS
PGPSMSPYFPASPSVSPSYSPTSPNYTASSPGGASPNYSPSSPNYSPTSPLYASPRYAST
TPNFNPQSTGYSPSSSGYSPTSPVYSPTVQFQSSPSFAGSGSNIYSPGNAYSPSSSNYSP
NSPSYSPTSPSYSPSSPSYSPTSPCYSPTSPSYSPTSPNYTPVTPSYSPTSPNYSASPQY
SPASPAYSQTGVKYSPTSPTYSPPSPSYDGSPGSPQYTPGSPQYSPASPKYSPTSPLYSP
SSPQHSPSNQYSPTGSTYSATSPRYSPNMSIYSPSSTKYSPTSPTYTPTARNYSPTSPMY
SPTAPSHYSPTSPAYSPSSPTFEESED
>HSRPIILS    X63564   9202 H.sapiens mRNA for RNA polymerase
MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYPETTEGGRPKLGGLMDP
RQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKLLVD
SNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEMDNKFGVEQPEGDEDLTKEKG
HGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPERVHEIFKRISDEECFVLGMEP
RYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDLTHKLADIVKINNQLRRNEQNGAAA
HVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVD
FSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYII
RDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPWS
TFRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQD
TLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHI
NCIRTHSTHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEM
GHDITRLFYSNIQTVINNWLLIEGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKA
HNNELEPTPGNTLRQTFENQVNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKIN
ISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHA
MGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGES
VEFQNLATLKPSNKAFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMR
EDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKL
VIVNGDDPLSRQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAI
AHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLT
VFLLGQSARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMP
DFDVARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIR
IMNSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKI
IITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKALER
ELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETVDVLM
EAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLGAAGPTG
MFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAASDASGFSPG
YSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSP
SYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSP
TSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS
YSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPTSPSYSPT
SPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPTSPKYTPTSPSYSPSSPEYTPTSPKY
SPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTSPKYSPTSPTYSPTS
PKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAISPDDSDEEN
>AL590443_Chr.3_Encephalitozoon_cuniculi_GBM1
MFEAKVKKQIKSIQFGLFSPDEVRNGSVALIVHPEVMEGGVPKTGGLIDLRMGTTDRMYL
CQSCGGDNFSCPGHFGHIELTKPMFHVGYISKIKKVLECVCFYCSKIKIPRKGIKSTLSN
VWGMSKGRSVCEGEVLDNGRSGCGNKQPVIKREGLTLVAFMKGEESNEGKVMLNGERVYS
IFKKISDEDSVYMGFDLKYSRPEWMILTVLLVPPPAVRPSIVMEGSLRGEDDLTHKLADI
IKSNGYLKKYEQEGAPGHIVRDYEQLLQFHVATFIDNDIGGLPQALQKSGRPLKSLSARL
KGKEGRIRGNLMGKRVDFSARTVITPDPNISLEEVGVPLEIAKIHTFPEKVTSFNIDRLE
KLVRAGPNEHPGANYVLRSDGQKIDLNFNRSDIRLEEGYVVERHMQSGDVVLFNRQPSLH
KMSMMAHYARVMGNKTFRLNLSVTSPYNADFDGDEMNLHMPQSYTSKAELEELALVSRQI
ISPQSNKPVMGIVQDTLTGLRLFTLRDTFLNEREVMSLLYAVNLEFCDIPLGDAVQTGLR
KGKDYDIMKILRKPAIAKPMRLWTGKQVLSFVLPNLNYIGLSSEHDDDDKENIGDTRVII
QDGYIHSGVIDKKAAGATQGGLVHIIFNDFGPKRAAQFFDGVQRMINAFMTGIHTFSMGI
GDTIADPKTVKVVESAIRKAKEEVSALIENARQNRLERLPGMTMKESFESHLNLVLNRAR
DVSGTSAQRSLSENNNMKTMVLAGSKGSFINISQVTACVGQQNVEGKRIPFGFSHRTLPH
FVKDDYTGKSRGFVENSYLTGLDPEEFFFHAMGGREGLIDTAIKTAETGYIQRRLVKALE
DAIVRQDESVRSGNGLVYQIKYGEDGFDATFLESQKVDVKNFTKRYYIDMFGTEELEIKH
GQVSEEVYGMLSSDVDLQKLLDQEYEWLVGEIFEGPPILSVGEVDIERDYKVRDIYQSAV
MSPCNFTRILATAKRTFHLSTGDVSPYYILEAHKHLTTSNRILNVLIRTNLSVKRVLLEH
RLNTEAFNWVVEVIDAKILKAKITPNEMVGTLAAQSVGEPATQMTLNTFHLAGVASTVTM
GVPRLKEIFNVTKNLKTPSMKIYLDREHGKSIEAAKTIQNEIECLTVKDLCLFSEIYYDP
EITGTEISDDKDFVEAYFEFPDEDVDFSCLSPFLMRLVVDRAKLVGRGINLEYVAMFIRK
ELGGGAHVICSDENAVNMVVRVRTTKSEDESLNFYTTALNSLLRLQLGGYKNVKKVYISE
DKDRKEWYLQTDGICLSQILGNPAVNSRLTISNDLVEIAETLGIEAARESILRELTIVID
GNGSYVNYRHMSLLADVMTMRGYLCGITRHGVNKVGAGALKRSSFEETVEILLDAALVSE
KNICRGITENIMMGQLAPMGTGNIEIMLDMKKLDKAIPLSNPVFKPNEPATPVISTPSSD
SFSISSGNWSPTHLEMAYSRDLGERLSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS
YSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPT
SPSYSPTSPSYSVSMSSFSNKNKSKNQDGDKKRRNDGSF
>Yeast ORFP:YDL140C RPO21, Chr IV from 205361210562, reverse compl
MVGQQYSSAPLRTVKEVQFGLFSPEEVRAISVAKIRFPETMDETQTRAKIGGLNDPRLGS
IDRNLKCQTCQEGMNECPGHFGHIDLAKPVFHVGFIAKIKKVCECVCMHCGKLLLDEHNE
LMRQALAIKDSKKRFAAIWTLCKTKMVCETDVPSEDDPTQLVSRGGCGNTQPTIRKDGLK
LVGSWKKDRATGDADEPELRVLSTEEILNIFKHISVKDFTSLGFNEVFSRPEWMILTCLP
VPPPPVRPSISFNESQRGEDDLTFKLADILKANISLETLEHNGAPHHAIEEAESLLQFHV
ATYMDNDIAGQPQALQKSGRPVKSIRARLKGKEGRIRGNLMGKRVDFSARTVISGDPNLE
LDQVGVPKSIAKTLTYPEVVTPYNIDRLTQLVRNGPNEHPGAKYVIRDSGDRIDLRYSKR
AGDIQLQYGWKVERHIMDNDPVLFNRQPSLHKMSMMAHRVKVIPYSTFRLNLSVTSPYNA
DFDGDEMNLHVPQSEETRAELSQLCAVPLQIVSPQSNKPCMGIVQDTLCGIRKLTLRDTF
IELDQVLNMLYWVPDWDGVIPTPAIIKPKPLWSGKQILSVAIPNGIHLQRFDEGTTLLSP
KDNGMLIIDGQIIFGVVEKKTVGSSNGGLIHVVTREKGPQVCAKLFGNIQKVVNFWLLHN
GFSTGIGDTIADGPTMREITETIAEAKKKVLDVTKEAQANLLTAKHGMTLRESFEDNVVR
FLNEARDKAGRLAEVNLKDLNNVKQMVMAGSKGSFINIAQMSACVGQQSVEGKRIAFGFV
DRTLPHFSKDDYSPESKGFVENSYLRGLTPQEFFFHAMGGREGLIDTAVKTAETGYIQRR
LVKALEDIMVHYDNTTRNSLGNVIQFIYGEDGMDAAHIEKQSLDTIGGSDAAFEKRYRVD
LLNTDHTLDPSLLESGSEILGDLKLQVLLDEEYKQLVKDRKFLREVFVDGEANWPLPVNI
RRIIQNAQQTFHIDHTKPSDLTIKDIVLGVKDLQENLLVLRGKNEIIQNAQRDAVTLFCC
LLRSRLATRRVLQEYRLTKQAFDWVLSNIEAQFLRSVVHPGEMVGVLAAQSIGEPATQMT
LNTFHFAGVASKKVTSGVPRLKEILNVAKNMKTPSLTVYLEPGHAADQEQAKLIRSAIEH
TTLKSVTIASEIYYDPDPRSTVIPEDEEIIQLHFSLLDEEAEQSFDQQSPWLLRLELDRA
AMNDKDLTMGQVGERIKQTFKNDLFVIWSEDNDEKLIIRCRVVRPKSLDAETEAEEDHML
KKIENTMLENITLRGVENIERVVMMKYDRKVPSPTGEYVKEPEWVLETDGVNLSEVMTVP
GIDPTRIYTNSFIDIMEVLGIEAGRAALYKEVYNVIASDGSYVNYRHMALLVDVMTTQGG
LTSVTRHGFNRSNTGALMRCSFEETVEILFEAGASAELDDCRGVSENVILGQMAPIGTGA
FDVMIDEESLVKYMPEQKITEIEDGQDGGVTPYSNESGLVNADLDVKDELMFSPLVDSGS
NDAMAGGFTAYGGADYGEATSPFGAYGEAPTSPGFGVSSPGFSPTSPTYSPTSPAYSPTS
PSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYS
PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPAYSPTSPSYSPTSPSYSPTSP
SYSPTSPSYSPTSPNYSPTSPSYSPTSPGYSPGSPAYSPKQDEQKHNENENSR
>Drosophila #corrected# CG17209|CT38140|155068361551320
MPKEQFRASALNKKISHVQFGISGADEIQQEALVRIISKNLYQAQRQPVPYGVLDRRMGI
STKDAMCETCGQGLNECIGHFGYLDLALPVFHIGHFRSTINILQMICKVCAHVMLKPEDR
QLYEKKLHNPNFSYLGKKALHVQMLAKAKKVTKCPHCGSPNGGVKKGPGLLKILHDPYKG
RKMDSLFTSNMNEMLRSTQTNRDLNSTLGNYSTAEELTPLMVLDLFEQIPQRDVALLGMC
SHDAHPKHLIVTRVFVPPACIRPSVLSEVKAGTTEDDLTMKQSEILLINDVIQRHMATGG
KIELIHEDWDFLQLHVALYFHSEISGIPINMAPKKTTRGIVQRLKGKQGRFRCNLSGKRV
DFSGRTVISPDPNLMINQVGVPVRVAKILTYPERVNPANIRHMRELVRNGPSMHPGANYV
QQRGSSFKKYLAYGNREKVAQELKCGDVVERHLRDGDIVLFNRQPSLHKMSIMCHRAKVQ
PQRTFRFNECACTPYNADFDGDEMNLHLPQTEEARAEALILMGNQSNLVTPKNGEILIAA
TQDFITGGYLLTQKEVFLTKEEAMQLAACFLANEDSTMHIKLPPPALLKPRRLWTGKQMF
SLLMRPNDDSQVRLNMVNKGRNYTRNKDLCSNDSWIHIRNSELMCGVMDKATMGSGTKQC
IFYLLLRDFGESHATKAMWRLARIASYFLQNRGFSFGISDVTPSKKLLQHKELLLNNGYA
KCNEYIELLKAGTLQCQPGCTPEETLESVMLRELSAIREQAAKTCFAELHPTNSALIMAL
SGSKGSNINISQMIACVGQQAISGKRVPNGFENRALPHFERHSAIPAARGFVQNSFYSGL
TPTEFFFHTMAGREGLVDTAVKTAETGYLQRRLVKCLEDLVVHYDGTVRNAVNEMVDTIY
GGDGLDPVSMETRNKPVDLVHQYDNLRAQHPQGKDRPLNAEEMSEALETLLRTPEFAEAR
DDFKLDVRNHINTVSKRIGQLQKRYEKCIDLCHQIECLTTEQLLQFVRRINDRYNRAVTE
PGTAVGAIAAQSIGEPGTQMTLKTFHFAGVASMNITQGVPRIVEIINATKTISTPIITAE
LENCHSMEFARQVKARIEKTTLAELSSYVEVVCGPYSCYLAIGVDMARIKLLGLHIDLDT
IVFSILKSRMRVKPTQVEVVASQSRIVVRVEATRTSTINAELARLALSLQNVVVAGLPNI
NRAVIAVDDARQPPTYKLCIEGYGLRDVIATYGVVGKRTRSNNICEIYQTLGIEAARTII
MSEITEVMEGHGMSVDWRHIMLLASQMTARGEVLGITRHGLAKMRESVFNLASFEKTADH
LFDAAYYGQTDAINGVSERIILGMPACIGTGIFKLLQQHEDKQVPPIEPTICSSLNLLPS
KTT
>AF021351 9711 Homo sapiens RNA polymerase III la
MVKEQFRETDVAKKTSHICFGMKSPEEMRQQAHIQVVSKNLYSQDNQHAPLLYGVLDHRM
GTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFRAVIGILQMICKTCCHIMLSQE
EKKQFLDYLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGAFNGTVKKCGLLKIIHEKYK
TNKKVVDPIVSNFLQSFETAIEHNKEVEPLLGRAQENLNPLVVLNLFKRIPAEDVPLLLM
NPEAGKPSDLILTRLLVPPLCFRPSVVSDLKSGTNEDDLTMKLTEIIFLNDVIKKHRISG
AKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKR
VDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPEVHPGANF
IQQRHTQMKRFLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARV
KPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIA
AIQDFLTGAYLLTLKDTFFDRAKACQIIASILVGKDEKIKVRLPPPTILKPVTLWTGKQI
FSVILRPSDDNPVRANLRTKGKQYCGKGEDLCANDSYVTIQNSELMSGSMDKGTLGSGSK
NNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAG
YKKCDEYIEALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTM
ALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLPHFEKHSKLPAAKGFVANSFYS
GLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVRSSTGDIIQF
IYGGDGLDPAAMEGKDEPLEFKRVLDNIKAVFPCPSEPALSKNELILTTESIMKKSEFLC
CQDSFLQEIKKFIKGVSEKIKKTRDKYGINDNGTTEPRVLYQLDRITPTQVEKFLETCRD
KYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFGGVASMNITLGVPRIKEIINASKAI
STPIITAQLDKDDDADYARLVKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLL
RLEVNAETVRYSICTSKLRVKPGDVAVHGEAVVCVTPRENSKSSMYYVLQFLKEDLPKVV
VQGIPEVSRAVIHIDEQSGKEKYKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGI
EAARTTIINEIQYTMVVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLA
SFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLLHKADRDPNPPKRPLIF
DTNEFHIPLVT
>CE04196
MGKEQFREADKALKVVGVKFAPGDCEFMRQTAHVPIFNNKLYEDVEGKLQPANYGPLDPK
MGVSTKTGTCSTCGLGLTDCVGHFGYFDLDVPVFHIGFFKLTIQLLQCICKNCSSILLTP
EQHRVFSRQVMNPNLDYLHRKALHKRIVAACKKGSTCSHCGLKNGTVKKAVGAVLKIAFA
SPVSVDELGKFATMFSSNQEVGDHVRKMKFTLLNPLFVQKLFSNIKEGDIPVLMVRSGEE
KHPNDLLLTRMPVPPVCIRPSVVSEVKAGTTEDDVTMKLMEIMLTNDVLKKHKRDGAPSK
TLFETWEHLQIQCALYINSEMSGLPPDMQPKRAMRAFTQRLKGKQGRFRGNLCGKRVDFS
GRTVISPDPNLRIDQVGVPIHVAMTLTFPEIVNASNIEKMRQLVINGSDVHPGANYLVEK
KTGNKKLLKYGKRDELAKNLRMGDTIERHLDDNDVVLFNRQPSLHKISIMSHRAKVMPGR
TFRFNECACTPYNADFDGDEMNLHLPQTYEAKAEASELMNVKNNLITPRSGEPLVAAIQD
FITGGYLLTHKDTFLPRAEVYRFAAALIDASAKKQTKIRIPPPAIRRPVELWTGKQLIEL
IIRPDKGSDVSLNLTAKNKSYTGNLELCSKDSYVIIRNSVLLAGCLDKSLLGSSSKVNIF
YMLMRDYGEDAAVDAMWRLARMAPVFLSNRGFSIGIGDVRPSEQLLREKGQLVDNGYSKC
SQYIKELEEGKLKAQPGCTEEETLESIILRELSTIRDHAGQVCLRNLSKYNAPLTMAVCG
SKGSFINISQMIACVGQQAISGHRPPDGFEERSLPHFERKKKTPEAKGFVANSFYSGLTP
TEFFFHTMGGREGLVDTAVKTAETGYMQRRLVKCLEDLCASYDGTVRSSVGDVIEFVFGE
DGLDPAMMEAKDGSVVDFTHVLEHAKNIQTTKEQPIPVDKLDEVLKAEIQKKFKGKYVHF
ADKLSEYITQTEIKKSKKWQNGKAHCGNHETADLKTKESCKICKNLEAYKTSLLANSCLT
KTQLCSFIELCYYKVARAITEPGEPSTQMTLKTFHFAGVASMNITQGVPRIKEIINAVKT
ISTPIITAALLDPYDESLARRVKARIEKTTLGEICDYIEEVYLPDDYFLLVKLNSKRIRL
LQLEVCMESISYAIATSKVCPQMRGCKIVAHGKTMMAIRPPSTSKLSKTMTMQILKYSLA
NVVVKGIPSVNRCVIHADEKKGDFYSLLVEGTDFRSVLSSVGVDPRKTNFNNALVVADVL
GIEAARSCIINEIIATMDAHGIGLDRRHVMLLADVMTYRGEVLGITRNGLVKMKDSVLLL
ASFEKTMDHLFEAAFFSQRDVIHGVSECIIMGTPMTVGTGTFKLMQKYDKKAVLKQNSPI
FERLNVTL
>SPBC651 S. pombe chromosome 2    1397aa     8865   13055
IPKRIKHLQFGINGPEEFVKDGTVEVSRRDLYTMTDRSPAEHGALDLHMGTSNKQINCAT
CGESMADCMGHFGYVKLALPVFHIGYFKATLTILQNICKDCSSVLLSDQEKRQFLKDLRR
PGIDNLRRSQICKRINDHCKKMRRCSKCDAMQGVVKKAGPLKIIHERFRYVRKSQDDEEN
FRHSFDEALKTIPELKMHLSKAHDDLNPLKVLNLFKQITPVDCELLGMDPEHGRPENLLW
RYVPAPPVCIRPSVAQEGATTEDDLTVKITEIIWTSSLIRAALSKGTPISNLMEQWEFMQ
LSIAMYINSEMPGLRPSDMPSKPIRGFCQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPN
LRIDQVAVPYRIAKILTFPERVTTQNKKHLQDCIRNGPDVHPGANYVIDRESGFKRFLRF
GNRNRIADDLKIGDIVERHLHDNDVVLFNRQPSLHKLSIMAHLVKVRPWRTLRFNECVCG
PYNADFDGDEMNLHVPQTEEAKTEALELMGIKNNLVSPRNGEPIIAATQDFITAAYLLSL
KDTFLDRKSISNICCYMMDASTHIDLPPPAIIKPRCLWTGKQVFTVLMKPNRFSKVLVNL
DAKTRSFSRIKSKTPEMCPKDGYLMIRNSEIIAGVVDKSVVGDGKKDSLFYVILRDYGAL
EAAEAITRLSKMCARFLGNRGFSIGIEDVQPGKSLSSQKEILVNKAYATSDDFIMQYAKG
ILECQPGMDQEATLEAKISSTLSKVRDDVGEICMDELGPANSPLIMATCGSKGSKINVSQ
MVACVGQQIISGKRVPDGFQDRSLPHFHKNSKHPLAKGFVSNSFYSGLTPTEFLFHAISG
REGLVDTAVKTAETGYMSRRLMKSLEDLSSAYDGTVRSSNSDVVQFVYGDDGLDPTYMEG
DGQAVEFKRTWIHSVNLNYDRHDSAMLPYEIIDYVNRALDDPKFLTNCNRDFIETIRTFV
IENIAKYLASVRERRDLAPMLEEPDMDDLDDMEGDEFAPVAKRKSVENIIRVTEKQLRSF
VDRCWEKYMRAKVEPGTAVGAIGAQSIGEPGTQMTLKTFHFAGVAAQTTLGVPRIKEIIN
AAKTISTPIITGQLINDRDERSARVVKGRIEKTYLKDVTSYIEEVYGPVTTYLSIQVNFD
TISKLQLDITLADIAAAIWNTPKLKIPSQQVTVNNTLQQIHVHTSSDGKSSETEVYYRLQ
TYKRVLPDVVVAGIPTINRSVINQESGKIELFMEGTGLQAVMNTEGIVGTKTSTNHVMEM
KDVLGIEAARYSIISEIGYTMAKHGLTVDPRHIMLLGDVMTCKGEVLGITRFGVAKMKDS
VLALASFEKTTDHLFNAAARFAKDSIEGISECIVLGKLAPIGTNVFQLIRRTEEEEEQKP
KELLFDTPSLHQLEITA
>Yeast ORFP:YOR116C RPO31, Chr XV from 539761544143, reverse compl
MKEVVVSETPKRIKGLEFSALSAADIVAQSEVEVSTRDLFDLEKDRAPKANGALDPKMGV
SSSSLECATCHGNLASCHGHFGHLKLALPVFHIGYFKATIQILQGICKNCSAILLSETDK
RQFLHELRRPGVDNLRRMGILKKILDQCKKQRRCLHCGALNGVVKKAAAGAGSAALKIIH
DTFRWVGKKSAPEKDIWVGEWKEVLAHNPELERYVKRCMDDLNPLKTLNLFKQIKSADCE
LLGIDATVPSGRPETYIWRYLPAPPVCIRPSVMMQDSPASNEDDLTVKLTEIVWTSSLIK
AGLDKGISINNMMEHWDYLQLTVAMYINSDSVNPAMLPGSSNGGGKVKPIRGFCQRLKGK
QGRFRGNLSGKRVDFSGRTVISPDPNLSIDEVAVPDRVAKVLTYPEKVTRYNRHKLQELI
VNGPNVHPGANYLLKRNEDARRNLRYGDRMKLAKNLQIGDVVERHLEDGDVVLFNRQPSL
HRLSILSHYAKIRPWRTFRLNECVCTPYNADFDGDEMNLHVPQTEEARAEAINLMGVKNN
LLTPKSGEPIIAATQDFITGSYLISHKDSFYDRATLTQLLSMMSDGIEHFDIPPPAIMKP
YYLWTGKQVFSLLIKPNHNSPVVINLDAKNKVFVPPKSKSLPNEMSQNDGFVIIRGSQIL
SGVMDKSVLGDGKKHSVFYTILRDYGPQEAANAMNRMAKLCARFLGNRGFSIGINDVTPA
DDLKQKKEELVEIAYHKCDELITLFNKGELETQPGCNEEQTLEAKIGGLLSKVREEVGDV
CINELDNWNAPLIMATCGSKGSTLNVSQMVAVVGQQIISGNRVPDGFQDRSLPHFPKNSK
TPQSKGFVRNSFFSGLSPPEFLFHAISGREGLVDTAVKTAETGYMSRRLMKSLEDLSCQY
DNTVRTSANGIVQFTYGGDGLDPLEMEGNAQPVNFNRSWDHAYNITFNNQDKGLLPYAIM
ETANEILGPLEERLVRYDNSGCLVKREDLNKAEYVDQYDAERDFYHSLREYINGKATALA
NLRKSRGMLGLLEPPAKELQGIDPDETVPDNVKTSVSQLYRISEKSVRKFLEIALFKYRK
ARLEPGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINASKVISTPI
INAVLVNDNDERAARVVKGRVEKTLLSDVAFYVQDVYKDNLSFIQVRIDLGTIDKLQLEL
TIEDIAVAITRASKLKIQASDVNIIGKDRIAINVFPEGYKAKSISTSAKEPSENDVFYRM
QQLRRALPDVVVKGLPDISRAVINIRDDGKRELLVEGYGLRDVMCTDGVIGSRTTTNHVL
EVFSVLGIEAARYSIIREINYTMSNHGMSVDPRHIQLLGDVMTYKGEVLGITRFGLSKMR
DSVLQLASFEKTTDHLFDAAFYMKKDAVEGVSECIILGQTMSIGTGSFKVVKGTNISEKD
LVPKRCLFESLSNEAALKAN
>AB019231    AB019231 0012 Arabidopsis thaliana genomic DNA,
METKMEIEFTKKPYIEDVGPLKIKSINFSVLSDLEVMKAAEVQVWNIGLYDHSFKPYENG
LLDPRMGPPNKKSICTTCEGNFQNCPGHYGYLKLDLPVYNVGYFNFILDILKCICKRCSN
MLLDEKLYEDHLRKMRNPRMEPLKKTELAKAVVKKCSTMASQRIITCKKCGYLNGMVKKI
AAQFGIGISHDRSKIHGGEIDECKSAISHTKQSTAAINPLTYVLDPNLVLGLFKRMSDKD
CELLYIAYRPENLIITCMLVPPLSIRPSVMIGGIQSNENDLTARLKQIILGNASLHKILS
QPTSSPKNMQVWDTVQIEVARYINSEVRGCQNQPEEHPLSGILQRLKGKGGRFRANLSGK
RVEFTGRTVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEKLRQCVRNGPNKYPGAR
NVRYPDGSSRTLVGDYRKRIADELAIGCIVDRHLQEGDVVLFNRQPSLHRMSIMCHRARI
MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGEILVA
STQDFLTSSFLITRKDTFYDRAAFSLICSYMGDGMDSIDLPTPTILKPIELWTGKQIFSV
LLRPNASIRVYVTLNVKEKNFKKGEHGFDETMCINDGWVYFRNSELISGQLGKATLGNGN
KDGLYSILLRDYNSHAAAVCMNRLAKLSARWIGIHGFSIGIDDVQPGEELSKERKDSIQF
GYDQCHRKIEEFNRGNLQLKAGLDGAKSLEAEITGILNTIREATGKACMSGLHWRNSPLI
MSQCGSKGSPINISQMVACVGQQTVNGHRAPDGFIDRSLPHFPRMSKSPAAKGFVANSFY
SGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMKALEDLLVHYDNTVRNASGCILQ
FTYGDDGMDPALMEGKDGAPLNFNRLFLKVQATCPPRSHHTYLSSEELSQKFEEELVRHD
KSRVCTDAFVKSLREFVSLLGVKSASPPQVLYKASGQKYEILPLIYINMQVFVKICVFRY
REKKIEAGTAIGTIGAQSIGEPGTQMTLKTFHFAGVASMNITQGVPRINEIINASKNIST
PVISAELENPLELTSARWVKGRIEKTTLGQVAESIEVLMTSTSASVRIILDNKIIEEACL
SITPWSVKNSILKTPRIKLNDNDIRVLDTGLDITPVVDKSRAHFNLHNLKNVLPNIIVNG
IKTVERVVVAEDMDKSKQIDGKTKWKLFVEGTNLLAVMGTPGINGRTTTSNNVVEVSKTL
GIEAARTTIIDEIGTVMGNHGMSIDIRHMMLLADVMTYRGEVLGIQRTGIQKMDKSVLMQ
ASFERTGDHLFSAAASGKVDNIEGVTECVIMGIPMKLGTGILKVLQRTDDLPKLKYGPDP
IIS
>AL590447_Chr.7_Encephalitozoon_cuniculi_GBM1
MFSTMGLSSLPILCLNQIFQNSETDFLQPMIKKVEEDEIDIRIRRISFSLLSPQEISDLS
VHEISSKDLYDISTRAPLPNGPLDLRLGVGNKKDKCATCGEGLATCIGHFGEVRLVLPVF
NVGLIKNTISTLNCLCKSCGSILLNEKKKIYFKKKLRESSGGNDTKLILRRIVAECKKVN
VCFVCSFRNGQIKKTFGFRIVHEVEDLEKRKKDRGSKGSDGVHSSCCEEINPLVALNVFN
MMKEDDYELLGFLESPSRLIIQNVIVPPSCIRPSVSMDDEGTNEDDITIKISEIVHTNKV
LREGIEKGNPLNLINEDWDHLQLQCALLINSELPQIGIPGQPVRGIVQRLKGKNGRFRCN
LSGKRVDFSGRTVISPDPNLSIEEVGIPERMAKILTISERVTRLNRRKLQALVLNGPENY
PGANYVVGEKFKRFLMYGKRDIELKYGEVVERHLMDGDMVLFNRQPSLHRMSIMSHKVRV
HRNKTLRFNECVCTPYNADFDGDEMNVHVPQTEKARAEASVLMSVSNNIVTPRHGEPIVA
ATQDFITGLYLITGKDTFFDRERFGQLVSYFSEARVGVKPAIRKPVELFTGKQLIEVLIK
DSFTASIERNAVTGNMAVSLVGKNRSFKTHDDPNDGSVAILDNSYYFGRLDKSIVGGENK
RDSLIYAIMKVSSMAAVRAMNSITKLCSRYLGETGFSIGLDDVQPGPILRQKKEMVVRRG
YAECESKILEYSKKPEANEEMLEMEISSILNRIREECGSICIKELGIRNSPNIMQACGSK
GSKINVSQMVACVGQQIVSGTRIPNGMDERCLPHFKRGSRTPESKGFVLNSFFDGLTSPE
FFFHAVSGREGLVDTAVKTAETGYMQRRLMKALEDLSIQYDGSVRNSNMEVVQFAYGEDQ
IDPAMSEGDESINLEQVFLRAKSSFLHSLGPSPDLYSSWLDEHGTIDILRRLTPKGYPFE
HIANRRFVGSLLRFMESKHEDKFCFGGTFYRYFLSKDFIETFFSMVSQKMMNLIVEPGTT
VGAIAGQSIGEPGTQMTLKTFHFAGVASMNITLGVPRLKEIINAVCNISTPIINAELDDP
HDLFRTEVIKGRLDRISLKDVCSSLTEVISKDEIFLDIRVDLESLSRLKLNLDIHKIEKL
LNNHDQVRVVNENTLRVSIKKITDQSYFNLQRIKKKLLNTKICGAPQVNRVIINSSRGLY
SLVIEGRGLLNVINIDGVKAINTTSNSITEVEEVLGIEAARAQIIHEIEYTISNHGIKID
PRHIMLLADTMTYRGEVFGITRFGISKMSRSTLMLASFEQTSDYLFEAAVQSKSDEICGV
SESIILGIPICIGTGSMDLYWNSGT
>0750 AL590444_Chr.4_Encephalitozoon_cuniculi_GBM1
MVMKFHPKNISFGSYSDEEIERNSCMEVVVPGCFDKFGHPIRGGLYDLRMGPLDLSSNCK
TCNLSFLNCPGHFGHIKLSRIVLNPMFFDSLFSIVRCYCFQCKHFRITNYDRLILFCKFS
LLLNGEEAEDLDRLYMVSEEEELYKIVSERIENAKKNRSEPLMESHQEVVHRFLQNAGSR
RKCVRCGHSNPKIVKGAKMKVLKDLRKGNEKDEDGKDLEFLSPDAVRELMDDLFSNEADL
IESIFFRRDPGMFFISNLPVIPNRFRPMIVLDGKKAENFQTLYLNDILRVSVLVTKDLSY
WPELQAAILSSFDNKNISKWSRTRMVPPGHKQILERKDGLFRRNIMGKRVNFAARTVISP
DPNLETREIGIPLVFAEALTFPERAASFNVDRLKKAVVNGPTYPGSLYLQDGDVMLSLAH
MPDEKRYALANQLLDGKKVVWRHLVDGDVVLMNRQPTLHRPSIMAHKCRVLRKERTLRMH
YANCKSYNADFDGDEMNVHFPQNYASEAESRHIVMNDSNYLVPTNGKPIRGLVQDHIVIV
TVLTMKDSFFSEEDYFTLVNAGLSDRRIVLDRPCIVKPARLYSGRQVISTILKNLGLVVN
IEIETNVPKNAWREHSEERILRIREGNIVTGILDKNSLGPTFKSLIHACGEIKGFATSND
LLTYIGWVGNRYLLMYGFTIGIDDLLLDKEADKKRREVVRVKDQEAKELQRRYVEANPDF
YLYADKKAYLDSVMRTEMNSVTSEIVKVSVPSGLQKSFPENNMELIIATGSKGSIVNLSQ
ISGALGQQELEGQRVPIMASGKTLPCFAALDPSPSSGGYIYHRFLTGIDLPQYFFHCMAG
REGLIDTAIKTANSGYLQRCLIKHMEEAKVEYDMSVRIGKRVIQFMYGEDGLDCTKSSYL
DDAEFFKRNVLLFGKSASIVSKLEDRYKDFLSLKFRKQIDLLDDELRRFLADRYICSLAD
PGESVGVIAAQSVGEPSTQMTLNTFHLAGVGAKNVTLGVPRLREILITASKSIRTPLITV
PIKRRVSFDITECLRRVTLKDCVKRFGVTEEIVMVSGVFQKKVKIMFELDNFVDLAGEVL
DRKFLKLLGNKMKGLAKAKYTLGMSEAPKDDCKEVDENKENEGEDESDSDKESDPEGDNV
CLEATTGENGDKDSEDHSTSYLEATDETDQTDDSGNTEEEEDFMNFAKKSKNVLTFEILY
PSGFNEMISSVVESILPTIVVREVKGMEKASVSGSQLFVKSSSIYSLTRMIEVSPGVYED
LLDILDIYNAESNDIYDVYLTLGVEAARHAIINEVVKVFDVYGISIDIRHLLLIADYMTR
KGSYSPFSRHGLGADDSPIQRISFESCYSNFKTAATFHLEDKLSNPSASLTVGNPVRCGT
GCFDLIHNIDFPNKV
>gi|7303170|gb|AAF58234.1| RpI1 gene product [Drosophila melanogast
MGSKRAMDVHMFPSDLEFAVFTDQEIRKLSVVKVITGITFDALGHAIPGGLYDIRMGSYG
RCMDPCGTCLKLQDCPGHMGHIELGTPVYNPFFIKFVQRLLCIFCLHCYKLQMKDHECEI
IMLQLRLIDAGYIIEAQELELFKSEIVCQNTENLVAIKNGDMVHPHIAAMYKLLEKNEKN
SSNSTKTSCSLRTAITHSALQRLGKKCRHCNKSMRFVRYMHRRLVFYVTLADIKERVGTG
AETGGQNKVIFADECRRYLRQIYANYPELLKLLVPVLGLSNTDLTQGDRSPVDLFFMDTL
PVTPPRARPLNMVGDMLKGNPQTDIYINIIENNHVLNVVLKYMKGGQEKLTEEAKAAYQT
LKGETAHEKLYTAWLALQMSVDVLLDVNMSREMKSGEGLKQIIEKKSGLIRSHMMGKRVN
YAARTVITPDPNINVDEIGIPDIFAKKLSYPVPVTEWNVTELRKMVMNGPDVHPGANYIQ
DKNGFTTYIPADNASKRESLAKLLLSNPKDGIKIVHRHVLNGDVLLLNRQPSLHKPSIMG
HKARILHGEKTFRLHYSNCKAYNADFDGDEMNAHYPQSEVARAEAYNLVNVASNYLVPKD
GTPLGGLIQDHVISGVKLSIRGRFFNREDYQQLVFQGLSQLKKDIKLLPPTILKPAVLWS
GKQILSTIIINIIPEGYERINLDSFAKIAGKNWNVSRPRPPICGTNPEGNDLSESQVQIR
NGELLVGVLDKQQYGATTYGLIHCMYELYGGDVSTLLLTAFTKVFTFFLQLEGFTLGVKD
ILVTDVADRKRRKIIRECRNVGNSAVAAALELEDEPPHDELVEKMEAAYVKDSKFRVLLD
RKYKSLLDGYTNDINSTCLPRGLITKFPSNNLQLMVLSGAKGSMVNTMQISCLLGQIELE
GKRPPLMISGKSLPSFTSFETSPKSGGFIDGRFMTGIQPQDFFFHCMAGREGLIDTAVKT
SRSGYLQRCLIKHLEGLSVHYDLTVRDSDNSVVQFLYGEDGLDILKSKFFNDKFCADFLT
QNATAILRPAQLQLMKDEEQLAKVQRHEKHIRSWEKKKPAKLRAAFTHFSEELREEVEVK
RPNEINSKTGRRRFDEGLLKLWKKADAEDKALYRKKYARCPDPTVAVYKQDLYYGSVSER
TRKLITDYAKRKPALKETIADIMRVKTIKSLAAPGEPVGLIAAQSIGEPSTQMTLNTFHF
AGRGEMNVTLGIPRLREILMLASSNIKTPSMDIPIKPGQQHQAEKLRINLNSVTLANLLE
YVHVSTGLTLDPERSYEYDMRFQFLPREVYKEDYGVRPKHIIKYMHQTFFKQLIRAILKV
SNASRTTKIVVIDDKKDADKEDDNDLDNGDEVGRSKAKANDDDSSDDNDDDDATGVKLKQ
RKTDEKDYDDPDDVEELHDANDDDDEAEDEDDEEKGQDGNDNDGDDKAVERLLSNDMVKA
YTYDKENHLWCQVKLNLSVRYQKPDLTSIIRELAGKSVVHQVQHIKRAIIYKGTDDDQLL
KTDGINIGEMFQHNKILDLNRLYSNDIHAIARTYGIEAASQVIVKEVSNVFKVYGITVDR
RHLSLIADYMTFDGTFQPLSRKGMEHSSSPLQQMSFESSLQFLKSAAGFGRADELSSPSS
RLMVGLPVRNGTGAFELLTKIC
>CE22157
MDFFVRNGEEPYMQFSNFKLRSYFPHEIDKLSVLKITQTKTFDEVGHPIAGGLYDPILGP
DNTFDMCMTCNQYERHCPGHMGHIQLAVPVFNPLLFQFTYNLLKGSCVHCHRLTCKGDGV
NARMLLAQLRCFELGVEHVAFDLESILRDKIANADLFGDDDSKTFNDVDACIAQLANKPL
SELTAFKSMPTKNSVQLKKDIITEFLRSHLFKRLQKCPLCKNRNGVLRNDGARSILIDFT
SGARGGGKSKKAIIIIGDNGYNEESSTDEDEEKKGGGGAEVTGDVVDLDEGSLEMQMKNV
RTGECDKLAWRGAEVREHFRMLFKNDGKLLLKLFPMLVDELNGEDMICPLDGLFLERILV
PPKKFRPIRMFKGAQYEDPQTLNLRKVLEATETISAISLIMKGDTSAQLKELIANRVRGK
TINAQMHDAYLQLQLRANAIFDQDLNKGDRDSIAGIKQILEKKQGLFRMHMMGKRVNFAC
RSVITPDPYLDIDEIGIPDIFAKKLTFTEPVNAFNVNEMKGLLRQGPHQHPGANFFVEPS
GKKTMLGDKPEEKKRRMQMAKTLNAATTENLRQTPKVLRHMKNGDMIMMNRQPSLHKPSI
LGHRARVLTGQRALRMNYAPCKAYNADFDGDEMNGHLVQSHIAQTEVREIANVGSNFLVP
KDATPLLGLIQDHVVSGVLLSVRDRFLNKEDFMHLVLASFAQYSKRIEIPPPTILYPKRL
WTGKQVITTIVKNCIPDGKPLINLDGKAKTPLSCWIVPGFDAPQFDMSESHVVFRQGELL
VGVLDKAHFGATQFGLAHCAFELYGHRCGVQLLSCFSRLFTTYLQFHGFTLGVADILVVK
DADGKRKEAVMESRTIGNQVVKTAFGLPDTATPAEIKRTLAATYCNPRGQGTDVKMLDFG
MKQGIAKYNDAITKSCVPTGLLRLFPQNALQLMIQSGAKGSAVNAIQISGCLGQIELEGK
RMAVTIAGRTLPSFRCFDPSPRAGGYIDQRFLTGMNPQELFFHTMAGREGLIDTAVKTSR
SGYLQRCIIKHLEGIRVHYDSTVRDHDGSVIQFRYGEDGMDTTKATFLNKKTMPFLEDNL
EAVTLASKPEGVTDADFGIKETEKRYKKIVKWKKKAEKSGKSGTKKSYFSAFTNFSAEHI
GMEKKRILAMWFELSLEEREQYARGIPKKCPEAVDERFNPTCKLGALPEKMLDEIEGFCT
RRVKKVDDEEPPKEVLKRTLYWKGMRSLADPGENVGLLAAQSIGEPSTQMTLNTFHFAGR
GEMNVTLGIPRLREILMTASKSIATPSASIAVIAGTSRDRIDSIKRELDRVYLKQLLKNF
SLEEKITLTQNQSCRRYHLRIDILAAEKRELGARHLKRSQIMEEIEKRFILRVAQAIKKK
YHEITDYQQMSHRTMRQGNMAAGIETGGTKNRGLQGPDNGDSSDEEADGGREADAAEARL
HRRHRDEGADYEGEDEERVEVREEEEPMDSDSEDVKKEGLDGEDQTTEPLLVNSSRIQSV
QRLSENISSYTYDVKSNKWCEVVFELPLRNKTKMDVSSIVEKEVELFIVHQTPGIERCVE
TTEQKNGKEMTILQTQGVNLAAFFKHADVLDVNSVYSNDLNLILENYGVEACSKAITTEM
NNVFAVYGIEVSKRHLSLTADYMTFTGQIQPFNRGAMSSSSSPLQKMTFETTMAFLREAL
LHGEEDNVNSPSARLVMGALPRGGTGSFDLLLDTKMQSEREEHEAARARKRNAKQKF
>HSU33460    U33460   9903 Human DNAdirected RNA polymerase
MLISKNMPWRRLQGISFGMYSAEELKKLSVKSITNPRYLDSLGNPSANGLYDLALGPADS
KEVCSTCVQDFSNCSGHLGHIELPLTVYNPLLFDKLYLLLRGSCLNCHMLTCPRAVIHLL
LCQLRVLEVGALQAVYELERILSRFLEENADPSASEIREELEQYTTEIVQNNLLGSQGAH
VKNVCESKSKLIALFWKAHMNAKRCPHCKTGRSVVRKEHNSKLTITFPAMVHRTAGQKDS
EPLGIEEAQIGKRGYLTPTSAREHLSALWKNEGFFLNYLFSGMDDDGMESRFNPSVFFLD
FLVVPPSRSRPVSRLGDQMFTNGQTVNLQAVMKDVVLIRKLLALMAQEQKLPEEVATPTT
DEEKDSLIAIDRSFLSTLPGQSLIDKLYNIWIRLQSHVNIVFDSEMDKLMRDKYPGIRQI
LEKKEGLFRKHMMGKRVDSTARSVICPDMYINTNEIGIPMVFATKLTYPQPVTPWNVQEL
RQAVINGPNVHPGASMVINEDGSRTALSAVDMTQREAVAKQLLTPATGAPKPQGTKIVCR
HVKNGDILLLNRQPTLHRPSIQAHRARILPEEKVLRLHYANCKAYNADFDGDEMNAHFPQ
SELGRAEAYVLACTDQQYLVPKDGQPLAGLIQDHMVSGASMTTRGCFFTREHYMELVYRG
LTDKVGRVKLLSPSILKPFPLWTGKQVVSTLLINIIPEDHIPLNLSGKAKITGKAWVKET
PRSVPGFNPDSMCESQVIIREGELLCGVLDKAHYGSSAYGLVHCCYEIYGGETSGKVLTC
LARLFTAYLQLYRGFTLGVEDILVKPKRDVKRQRIIEESTHCGPQAVRAALNLPEAASYD
EVRGKWQDAHLGKDQRDFNMIDLKFKEEVNHYSNEINKACMPFGLHRQFPENTLQLMVQS
GAKGSTVNTMQISCLLGQIELEGRSTPLMASGKSLPCFEPYEFTPRAGGFVTGRFLTGIK
PPEFFFHCMAGREGLVDTAVKTSRSGYLQRCIIKHLEGLVVQYDLTVRDSDGSVVQFLYG
EDGLDIPKTQFLQPKQFPFLASNYEVIMKSQHLHEVLSRADPKKALHHFRAIKKWQSKHP
NTLLRRGAFLSYSQKIQEAVKALKLESENRNGRRPWDSGRMLRMWYELDEESRRKYQKKA
AACPDPSLSVWRPDIYFASVSETFETKVDDYSQEWAAQTEKSYEKSELSLDRLRTLLQLK
WQRSLCEPGEAVGLLAAQSIGEPSTQMTLNTFHFAGRGEMNVTLGIPRLREILMVASANI
KTPMMSVPVLNTKKALKRVKSLKKQLTRVCLGEVLQKIDVQESFCMEEKQNKFQVYQLRF
QFLPHAYYQQEKCLRPEDILRFMETRFFKLLMESIKKKNNKASAFRNVNTRRATQRDLDN
AGELGRSRGEQEGDEEEEGHIVDAEAEEGDADASDAKRKEKQEEEVDYESEEEEEREGEE
NDDEDMQEERNPHREGARKTQEQDEEVGLGGPVPSHPPDAAPETHPQPGAPGAEAMERRV
QAVREIHPFIDDYQYDTEESLWCQVTVKLPLMKINFDMSSLVVSLAHGAVIYATKGITRC
LLNETTNNKNEKELVLNTEGINLPELFKYAEVLDLRRLYSNDIHAIANTYGIEALRVIEK
EIKDVFAVYGIAVDPRHLSLVADYMCFEGVYKPLNRFGIRSNSSPLQQMTFETSFQFLKQ
ATMLGSHDELRSPSACLVVGKVVRGGTGLFELKQPLR
>ORFP:YOR341W RPA190, Chr XV from 960978965972
MDISKPVGSEITSVDFGILTAKEIRNLSAKQITNPTVLDNLGHPVSGGLYDLALGAFLRN
LCSTCGLDEKFCPGHQGHIELPVPCYNPLFFNQLYIYLRASCLFCHHFRLKSVEVHRYAC
KLRLLQYGLIDESYKLDEITLGSLNSSMYTDDEAIEDNEDEMDGEGSKQSKDISSTLLNE
LKSKRSEYVDMAIAKALSDGRTTERGSFTATVNDERKKLVHEFHKKLLSRGKCDNCGMFS
PKFRKDGFTKIFETALNEKQITNNRVKGFIRQDMIKKQKQAKKLDGSNEASANDEESFDV
GRNPTTRPKTGSTYILSTEVKNILDTVFRKEQCVLQYVFHSRPNLSRKLVKADSFFMDVL
VVPPTRFRLPSKLGEEVHENSQNQLLSKVLTTSLLIRDLNDDLSKLQKDKVSLEDRRVIF
SRLMNAFVTIQNDVNAFIDSTKAQGRTSGKVPIPGVKQALEKKEGLFRKHMMGKRVNYAA
RSVISPDPNIETNEIGVPPVFAVKLTYPEPVTAYNIAELRQAVINGPDKWPGATQIQNED
GSLVSLIGMSVEQRKALANQLLTPSSNVSTHTLNKKVYRHIKNRDVVLMNRQPTLHKASM
MGHKVRVLPNEKTLRLHYANTGAYNADFDGDEMNMHFPQNENARAEALNLANTDSQYLTP
TSGSPVRGLIQDHISAGVWLTSKDSFFTREQYQQYIYGCIRPEDGHTTRSKIVTLPPTIF
KPYPLWTGKQIITTVLLNVTPPDMPGINLISKNKIKNEYWGKGSLENEVLFKDGALLCGI
LDKSQYGASKYGIVHSLHEVYGPEVAAKVLSVLGRLFTNYITATAFTCGMDDLRLTAEGN
KWRTDILKTSVDTGREAAAEVTNLDKDTPADDPELLKRLQEILRDNNKSGILDAVTSSKV
NAITSQVVSKCVPDGTMKKFPCNSMQAMALSGAKGSNVNVSQIMCLLGQQALEGRRVPVM
VSGKTLPSFKPYETDAMAGGYVKGRFYSGIKPQEYYFHCMAGREGLIDTAVKTSRSGYLQ
RCLTKQLEGVHVSYDNSIRDADGTLVQFMYGGDAIDITKESHMTQFEFCLDNYYALLKKY
NPSALIEHLDVESALKYSKKTLKYRKKHSKEPHYKQSVKYDPVLAKYNPAKYLGSVSENF
QDKLESFLDKNSKLFKSSDGVNEKKFRALMQLKYMRSLINPGEAVGIIASQSVGEPSTQM
TLNTFHFAGHGAANVTLGIPRLREIVMTASAAIKTPQMTLPIWNDVSDEQADTFCKSISK
VLLSEVIDKVIVTETTGTSNTAGGNAARSYVIHMRFFDNNEYSEEYDVSKEELQNVISNQ
FIHLLEAAIVKEIKKQKRTTGPDIGVAVPRLQTDVANSSSNSKRLEEDNDEEQSHKKTKQ
AVSYDEPDEDEIETMREAEKSSDEEGIDSDKESDSDSEDEDVDMNEQINKSIVEANNNMN
KVQRDRQSAIISHHRFITKYNFDDESGKWCEFKLELAADTEKLLMVNIVEEICRKSIIRQ
IPHIDRCVHPEPENGKRVLVTEGVNFQAMWDQEAFIDVDGITSNDVAAVLKTYGVEAARN
TIVNEINNVFSRYAISVSFRHLDLIADMMTRQGTYLAFNRQGMETSTSSFMKMSYETTCQ
FLTKAVLDNEREQLDSPSARIVVGKLNNVGTGSFDVLAKVPNAA
>SPBC4C3 S. pombe chromosome 2    1706aa    12995   18112
VCGRIACAGCIQKQNSTMNIAQPVSSEIKSVKFGIYDVDDVEKISVKQIVNPVLLDNLNH
PTNGGLYDLALGPYLKNSVCATCHLDERYCPGHFGHIVLPIPAYHPLFFSQMYNLLRSTC
LYCHHFKLSKVKVHLFFCRLKLLDYGLLNESEMVENVSLTEAIIKNSNGTPLEDGSDSED
SGLGHDDIAKDAATLMRIRDEFVAKSIADSRQNAHIDAQLTTLLLHERKKVVRAFYHAIS
SRKQCDNCQSFSPNFRKEGFAKIFEIPLSGKNLQFMEQTGKIRSDVLRDTSKKHHEDEGY
DGDSDSSNESEVEGIDLFEEDPNPLKNKSKSPIAHGAKYMTSTEVRNHLRRLFVKENVVL
SRLYAHKRGKPASADMFFLQNIAVPPTRFRPASKMGDEVHENIQNELLTRILQSSIQIAS
LSKDSTVEVNPDEKEGLERRSRAFELLINAFVQLQHDVNSLIDSNRNPSSGGQSRTVPPG
IKQILEKKEGLFRKHMMGKRVNYAARSVISPDPNIETNEIGVPPVFATKLTYPEPVTLYN
FNEMRNAVINGPHKWPGASHIQNEDGTLISLMPLTIEQRTALANQLLTPQSNLISSPYSY
SRLINTNKKVYRHVRNGDMLILNRQPTLHKPSMMAHKARILPGEKTIRMHYANCNSYNAD
FDGDEMNMHFPQSTNARSEAQFIANTDSQYLVPTSGDPLRGLIQDHVVMGVWLTCKDTFY
TRDEYQQLLFQALKPDETGMYGRIKTLPPAIQRPGIYWTGKQIISSVLLNLKPSDRPGLN
LKSKAKVPGKYWSPDSEEGSVLFDDGELLCGILDKSSFGASAFGLVHSVHELYGPDIAGR
LLSVLSRLFTAYAQMRGFTCRMDDLRLDEQGDNWRRQLLENGKSFGLEAASEYVGLSTDS
PIALLNANLEEVYRDDEKLQGLDAAMKGKMNGLTSSIINKCIPDGLLTKFPYNHMQTMTV
SGAKGSNVNVSQISCLLGQQELEGRRVPLMVSGKSLPSFVPYETSAKSGGFIASRFLTGI
APQEYYFHCMAGREGLIDTAVKTSRSGYLQRCLMKHLEGLCVQYDHTVRDSDGSIVQFHY
GEDSLDVTKQKHLTQFEFSAKNYKSLIQKYKVKSVLSAVDSETASSYAKKALKKPYKYDP
VLDKYPPSRYLGSVSEKFQRAVDEYTQKNPDKLIASKKESKLDDSLLNESKFKALMQLRY
QQSLVDPGESVGVLASQSIGEPSTQMTLNTFHFAGFGAKNVTLGIPRLREIIMTASANIQ
TPTMTLRLNDGVSDKRASAFCKEVNKLVLSEVVRQVRVTEKISGQGSDEQSKTYAIRLDL
YSRDEYQDEYGVLQEEIESTFSNRFLKILNRIIKSYLAKSKQRKSGGKDDTVPEVGQALK
PLEDIDEAPIEGRAQEALEDEDNDATNEKMVSRSKQHASYEGPDEADKVALRQLKGSNKV
EDVNMDEEEDEGFKSDESVSDFKERKLLEKQNTVSISERRELQLKTAKEILSNCKHLDFD
YVNGEWATVELVFPINTEKLLMVSLVEKACSETVIHEIPGITRCFSKPPDSALDTVPKVI
TEGVNLKAIWEFYNEISMNDIYTNDIAAILRIYGVEAARNAIVHEVSSVFGVYGIAVDPR
HLSLIADYMTFEGGYKAFNRMGIEYNTSPFAKMSFETTCHFLTEAALRGDVDDLSNPSSR
LVVGRVGNFGTGSFDIFTPVVDSPAN
>A.thaliana 67303.m00015#F15B8_150 chr.3 DNAdirected RNA polymeras
MAHAQTTEVCLSFHRSLLFPMGASQVVESVRFSFMTEQDVRKHSFLKVTSPILHDNVGNP
FPGGLYDLKLGPKDDKQACNSCGQLKLACPGHCGHIELVFPIYHPLLFNLLFNFLQRACF
FCHHFMAKPEDVERAVSQLKLIIKGDIVSAKQLESNTPTKSKSSDESCESVVTTDSSEEC
EDSDVEDQRWTSLQFAEVTAVLKNFMRLSSKSCSRCKGINPKLEKPMFGWVRMRAMKDSD
VGANVIRGLKLKKSTSSVENPDGFDDSGIDALSEVEDGDKETREKSTEVAAEFEEHNSKR
DLLPSEVRNILKHLWQNEHEFCSFIGDLWQSGSEKIDYSMFFLESVLVPPTKFRPPTTGG
DSVMEHPQTVGLNKVIESNNILGNACTNKLDQSKVIFRWRNLQESVNVLFDSKTATVQSQ
RDSSGICQLLEKKEGLFRQKMMGKRVNHACRSVISPDPYIAVNDIGIPPCFALKLTYPER
VTPWNVEKLREAIINGPDIHPGATHYSDKSSTMKLPSTEKARRAIARKLLSSRGATTELG
KTCDINFEGKTVHRHMRDGDIVLVNRQPTLHKPSLMAHKVRVLKGEKTLRLHYANCSTYN
ADFDGDEMNVHFPQDEISRAEAYNIVNANNQYARPSNGEPLRALIQDHIVSSVLLTKRDT
FLDKDHFNQLLFSSGVTDMVLSTFSGRSGKKVMVSASDAELLTVTPAILKPVPLWTGKQV
ITAVLNQITKGHPPFTVEKATKLPVDFFKCRSREVKPNSGDLTKKKEIDESWKQNLNEDK
LHIRKNEFVCGVIDKAQFADYGLVHTVHELYGSNAAGNLLSVFSRLFTVFLQTHGFTCGV
DDLIILKDMDEERTKQLQECENVGERVLRKTFGIDVDVQIDPQDMRSRIERILYEDGESA
LASLDRSIVNYLNQCSSKGVMNDLLSDGLLKTPGRNCISLMTISGAKGSKVNFQQISSHL
GQQDLEGKRVPRMVSGKTLPCFHPWDWSPRAGGFISDRFLSGLRPQEYYFHCMAGREGLV
DTAVKTSRSGYLQRCLMKNLESLKVNYDCTVRDADGSIIQFQYGEDGVDVHRSSFIEKFK
ELTINQDMVLQKCSEDMLSGASSYISDLPISLKKGAEKFVEAMPMNERIASKFVRQEELL
KLVKSKFFASLAQPGEPVGVLAAQSVGEPSTQMTLNTFHLAGRGEMNVTLGIPRLQEILM
TAAANIKTPIMTCPLLKGKTKEDANDITDRLRKITVADIIKSMELSVVPYTVYENEVCSI
HKLKINLYKPEHYPKHTDITEEDWEETMRAVFLRKLEDAIETHMKMLHRIRGIHNDVTGP
IAGNETDNDDSVSGKQNEDDGDDDGEGTEVDDLGSDAQKQKKQETDEMDYEENSEDETNE
PSSISGVEDPEMDSENEDTEVSKEDTPEPQEESMEPQKEVKGVKNVKEQSKKKRRKFVRA
KSDRHIFVKGEGEKFEVHFKFATDDPHILLAQIAQQTAQKVYIQNSGKIERCTVANCGDP
QVIYHGDNPKERREISNDEKKASPALHASGVDFPALWEFQDKLDVRYLYSNSIHDMLNIF
GVEAARETIIREINHVFKSYGISVSIRHLNLIADYMTFSGGYRPMSRMGGIAESTSPFCR
MTFETATKFIVQAATYGEKDTLETPSARICLGLPALSGTGCFDLMQRVEL
>MJ10423 DNAdirected RNA polymerase, subunit A' (rpoA1) {Methanoc
MERYEIPKEIGEIMFGLLSPDYIRQMSVAKIVTPDTYDEDGYPIDGGLMDTRLGVIDPGL
VCKTCGGRIGECPGHFGHIELAKPVIHIGFAKTIYKILKAVCPHCGRVAISETKRKEILE
KMEKLERDGGNKWEVCEEVYKEASKVTICPHCGEIKYDIKFEKPTTYYRIDGNEEKTLTP
SDVREILEKIPDEDCILLGLNPEVARPEWMVLTVLPVPPVTVRPSITLETGERSEDDLTH
KLVDIIRINNRLEENIEGGAPNLIIEDLWNLLQYHVNTYFDNEAPGIPPAKHRSGRPLKT
LAQRLKGKEGRFRYNLAGKRVNFSSRTVISPDPCLSINEVGVPEVVAKELTVPEKVTKYN
IERIRQLLRNGSEKHPGVNYVIRKMIGRDGTEQEYKVKITESNKDFWAENIREGDIVERH
LMDGDIVLYNRQPSLHRMSIMAHRVRVLPYRTFRHNLCVCVDGDTTVLLDGKLIKIKDLE
DKWKDVKVLTSDDLNPKLTSLSKYWKLNADEYGKKIYKIKTELGREIIATEDHPFYTTNG
RKRCGELKVGDEVIIYPNDFPMFEDDNRVIVDEEKIKKVINNIGGTYKNKIINELKDRKL
IPLTYNDQKASILARIVGHVMGDGSLIINNKNSRVVFRGDIEDLKTIKEDLKELGYDGEE
IKLHEGETEITDYNGKKRIIKGKGYSFEVRKKSLCILLKALGCVGGDKTKKMYGIPNWIK
TAPKYIKKEFLSAYFGSELTTPKIRNHGTSFKELSFKIAKIEEIFDEDRFIKDIKEMLKE
FGIELKVRVEEGNLRKDGYKTKVYVASIYNHKEFFGRIGYTYANKKETLARYAYEYLLTK
EKYLKDRNIKKLENNTKFITFDKFIKEKCLKNGFVKEKIVSIEETKVDYVYDITTISETH
NFIANGFLTGNCPPYNADFDGDEMNLHVPQSEEARAEAEALMLVEKHILSPRFGGPIIGA
IHDFISGAYLLTSNYFTKDEATLILRSGGIKDELWEPDKVENGVPLYSGKKIFSKALPKG
LNLRYKAKICRKCDVCKKEECEYDAYVVIKDGELIKGVIDKNGYGAEAGLILHTIVKEFG
PEAGRKFLDSATKMAIRAVMLRGFTTGIDDEDLPEEALKEIEKVLDEAEEKVKEIIEKYE
RGELELLPGLNLEESREAYISNVLREARDKAGAIAERYLGLDNHAVIMAVTGARGNILNL
TQMAACLGQQSVRGKRIFRGYRGRVLPHFEKGDLGARSHGFVRSSYKKGLSPTEFFFHAM
GGREGLVDQAVRTAQSGYMQRRLINALQDLKTEFDGTVRDSRGIMIQFKYGEDGIDPMLA
DRGKAVNIDRIIDKVKMKYNQXXXXXMDMEALKQKIEGLDIPQSLKDELFEKLSKEKDLT
EEMVDEIIDEVVNAYRKALVEPYEAVGIVAAQSIGEPGTQMSLPYEEKIIIKEGEFIKPV
EIGKLVDEMIERFGFEKIGNSEVCDLPIDIYALSLDQDEKVHWKRIISCIRHKHNGKLIK
IKTKSGREITATPYHSFVIRKDNKIIPVKGSELKIGDRIPVVKHIPANCVEAINISDYVS
GNYVVDNINNKIAPKINGKSIPNNIKLDYDFGYFIGIYLAEGSVTKYFVSISNVDELILN
KIRAFADKLGLNYGEYDNNNGFAESHDIRIYSSTLAEFLSNFGTSSNTKKIAEFVFGANK
EFVRGLIRGYFDGDGNVNADRKVIRVTSNSKELIDGIAILLARFNIFSIKTKTKNQFVLI
IPHRYAKKFHEEINFSVEKKKSELERLVSSLNDDKTYDSIDMIPSIGDALTKLGEKVDYP
KVILKKFERKQKIGRATLQRHLRRIEELAVKKGVNILALKEYWLLKKAVESDVIWDEIVK
IEEISCDKKYVYDISVEGLETFTTFDGVLTHNTMRTFHYAGVAEINVTLGLPRMIEIVDA
RKEPSTPIMTIYLKEEYKDNREKAEEIAKEIESLTLGSIAESISIDLWTQSIKVELDENR
LADRGLTIDDVIEAIKKKLKVKIDVDGTTLYLKIKTPSIKALRKRIPKIKNIQLKGIPGI
ERVLVKKEGGEYVLYTQGSNLREVFKIDGVDTTRTITNNIIEIQEVLGIEAARNAIINEM
RNTLEQQGLEVDIRHLMLVADIMTADGEVKPIGRHGVAGEKGSVLARAAFEETVKHLYAA
AERGDVDKLKGVIENVIVGKPIYLGTGCVELTIDREYEEGKNMEE
>AF18889 DNAdirected RNA polymerase, subunit A' (rpoA1) {Thermoco
MVPKRISAIKFEVLSPQEIRRMSVVKIITPETYDDDGFPIDGGLMDTRLGVIDPGLRCKT
CGGKAGECPGHFGHVELAAPVIHVGYAKMIARLLNGTCSECGRILLKDERRDKFIAEIER
RKELNQSYEEVVKEVFRLTKAAKKCPHCGAEQLEVKFDKPTFFYLGEHRLTPKDVREWLE
KIPDDDLPAFGINPKATRPEWFVLTVLPVPPVTVRPSIILETGQRSEDDLTHKLVDIIRI
NQRFMENKEAGAPQQILEDLWELLQYHVTTYIDNEVSGIPPARHRSGRPLKTLAQRLKGK
EGRFRGSLSGKRVNFSARTVISPDPCLSINEVGVPREIAEELTVPIYVTPQNIDMAREFV
LRSEHPKANYIVRPDGRRIRVIESNKEELAEKLEPGWIVERQLMDGDIVLFNRQPSLHRM
SIMAHYVRVLPYKTFRLNPAVCPPYNADFDGDEMNLHVPQSLEAQAEAKILMAVQEHILS
PRFGGPIIGGIHDHISGLYLLTRGEKKFTREEAMELLRSLDISEIPVKDEYTGKEIFSFI
LPDITLEFKAEICQGCEECKGAECEYDAYVIIENGKLLKGTIDEKAVGAFKGIIIDEIAR
KYGKEMAKKFIDDMTQLAIRIISKLGFTVGIDDEDIPPEAAEQIEEVLKEAENEVNRLIE
AYRRGDLEPMPGRSIEETLEMRIMQVLGRARDRAGKIAQRHLGMDNAAVIMAVSGARGSM
LNLTQMAACIGQQSVRGERISRGYTYTNRTLSHFKPGDLGAEARGFVRSSYKKGLSPVEF
FFHAAGGREGLVDTAVRTSQSGYLQRRMINALQDLKVEYDGTVREQTSGALVQFRYGEDG
VDPMRSFRGKPVDVRRIIREVIGEVKKXXXXXMSKVDYDSILKDFPFPPAVKEEIKAELE
KHGLKKTEAKKVVEKCFQAYLANLMEPGEAAGIVAAQSIGEPGTQMTMRTFHYAGVAEIN
VTLGLPRLIEILDVRKNPSTPMMTIRLLPEYAKDREKAREVANRIEATYVKDVADIEVDI
RRFTIIVKPDEKALERKGLTVEDLKSKIGKALKTEVEETEQGLAVQITEPSYKALMAAFD
KLKDTVIMGLKEIKRVIIRKEEDEYVLYTEGSNLKKIMKVKGVDFTRTTTNNIYEIYEVL
GIEAARNAIIREALDTLEEQGLEVDVRHIMLVADVMTADGELRQIGRHGVAGEKQSILAR
AAFEMTVNNLLDAAVRGEEDHLRGITENIIVGQPIKLGTGDVELVLKMGGKK
>MTH10512 [prot="DNAdependent RNA polymerase, subunit A' "] [gene
MRGILKKISQINFGLMSPEDIRKMSVTQIVTPDTYDEDGYPIENGLMDPRLGVIDPSLRC
RTCGAKGGECPGHFGSINLARPVIHVGFADTIHKILRSTCRKCGRVLLTETEIEEYRQRI
LDAMEKEESLTPIIKEIYAEARRDKCPHCEEEQEEIKLDKPVSIVEGDYKLTPSEVRERL
ERISDDDALILGVNPEVARPEWMVLTVLPVPPVTVRPSITLETGERSEDDLTHKLVDILR
INQRLKENMEAGAPQLIVEDLWELLQYHVTTYFDNEASGVPPARHRSGRPLKTLAQRLKG
KEGRFRSNLSGKRVNFSARTVISPDPNISINEVGVPEIIAREVTVPVYVTEWNIDRMREY
IENGPDVHPGANYVIRPDGRKIRIYNETKEVVLENLKPGYIVERHLKDGDIVLFNRQPSL
HRMSMMAHQVRVLPYKTFRLNLCVCPPYNADFDGDEMNMHVFQTEESRAEAKTLMRVQEH
ILSPRFGGPIIGGIHDHISGAYLLTRKSAVFSEEKVFQILKKAGLPLPDSRGRDWTGKEI
FSMVLPDDLNMVYRAEVCRKCEECLEMECENDAYVVIENGQLISGVIDEKAYGAFAGKIL
DHIVKEYGTDAAREFLDSATKLAIAGIMHAGFTTSTNDEEIPEEARERIEAHLRNAEARV
DQLIEAYENGELEPLPGRSLEETLEMKIMQVLGEARDKSGEIAESYFDMDENHAVIMALT
GARGSMLNLTQITACVGQQSVRGGRISRGYDNRTLPHFKKGELGAKSRGFVHSSYKEGLD
PIEFFFHAMGGREGLVDTAIRTAQSGYMQRRLVNALQDLTVDENGRVVDNRGVIIQNRFG
EDGVDPAKSDYGKIVDLDKLVEEIRLKSKGXXXXXMQDIIGKIEDYSSKNGILLPDPVVE
YVARIADEEKLKEPELQEMVRLFSRIAERNQGLDDDELLDAVEDDYQRILKVQELVKRKR
ARFPPKLIEDIAEVMKKHELSDDELDELIRRVRRAYDRARVEAGEAVGTVAAQSVGEPGT
QMTMRTFHYAGVAELNVTLGLPRLIEIVDARKKISTPTMSIYFEGDLRYDEEFVRRKANK
IGKSTLNDVLKNFSIQYADMSVVAELDEEKIQEKHLEYDEILAKVEKTFKKVEIDNNILR
FEPPKPTIRELRLLADKVRKLQISGVKNIGKVVIRKEDDEWVIHTEGSNLGAVLKEEGVD
KVRTTTNDIHEIETVLGIEAARNAIIHEAKRTMEEQGLTVDIRHIMLVADMMTADGSVKS
IGRHGISGEKASVLARASFEETGKHLLRASIRGEVDHLTGIIENIIIGQPIPLGTGSVSV
VMKERK
>Thermoplasma volcanium complete genome
MMGISKRISSIKFALLSPDEIRKLSQVKVITADTYDDDGYPIEHGLMDLHMGVIEPGLRC
ATCGGKVDECPGHFGHIELAMPVVHVGFVKEIKMFLDATCKSCGRIKLTDDEIKTYLPEV
QKVDFETGDPEDIELMTKRFVDLASQRMVCPHCGAQQSKIILDKPTTFREEGTNVKITPK
EIRERLERIPDDDLVFFGFNPKTARPEWMILTVLPVPPINVRPSITLETGERSEDDLTHK
LVDIIRISQRLRESRDNGSPQLIIEDLWDLLQFHVTTYFDNQTPGIPPARHRSGRALKTL
VQRLKGKEGRFRSNLSGKRVNFSSRTVISPEPYLSVNEVGVPEKAARELTVPVIVNQFNI
DEMREFIKRGRNPRDKYGKYMAGVNYVIRPDGRRIKITDQNAEENANRIEIGWTVERQLM
EGDIVLFNRQPSLHRMSMMGHTVRVLPGQTFRFNLAVCTPYNADFDGDEMNLHVIQKEEA
RAEARIIMKVQEQIMSPRFGRPIIGGIHDHVTAMFLLTHNNPLFTQEEMIHIMCYVDPDL
IPDATIVNGKKYYSGRNIFSTILPKGLNLRFRSKLCSGSSETCEYEKEKEDTYVTIVDGK
LIHGTIDEAAISPFSGAIIDKIFRKFGPNEAAKFIDRMTRLAVGFITYYGFSTGISDYDI
PYSATARIEELVNQAEDRINKLIETYKRGELQPAPGRSVEDTLEIEILSEAGVVRDESGK
IASSYLGLSVPSVIMARSGARATMLNISEVAGIVGQQSVRGGRLNRGYYNRTLPHFKRGD
IGADARGFVKSSYMTGLTPTEYFFHSIGGREGLVDTAVRTSRSGYMQRRLINAFEDLKVD
EERQVKDTVGSLIQIKYGEDGIDPTRSEKGRAIDINYILFDENAGRXXXXXVILTEVIIW
KDTAKNMSLLSKSVPAKYAVDFEVPKGITEGYVTSDKKRFTYHVSISSVAPYSNESDVIQ
KKKSGLKSIMEIEKIQKIEPISIMEFRSSGKRIDELLTYAIAERETAEIREKYEYEKKVS
SQVLDVIAEAKKLGYNIPESVAEEILRRKEEWGEKKYREILKRIGEEIQDELIDPYEAVG
IIAAQSIGEPGTQMTMRTFHFAGVREMNVTLGLPRLIEIVDARRIPSTPSMTIYLKPEFE
TNDEVVMDVVKRLENTSVSDVADIITDIGELTITVRPDPNKMNDRLINQDDLVNAIYKVK
GVTVMEESGQIIVKPQQESFKKLYLLQEQIKALPIKGISGIKRAIARVEGKEHRWVIYTQ
GSNLKDVLEVDEVDPTRTYTNDIVEIATVLGIEAARNAILNEAQRTLQEQGLNVDVRHLM
LVADMMTFSGSVRAVGRTGISGRKSSVLARAAFEITTKHLLRAGIMGEVDKLAGVAENII
VGQPITLGTGAVDIIYKGYPKTKK
>TACID2      AL445064 0010 Thermoplasma acidophilum comp
MMGISKRISSIKFALLSPDEIRKLSQVKVITADTYDDDGYPIEHGLMDLHMGVIEPGLRC
ATCGGKVDECPGHFGHIELAMPVVHVGFVKEIKMFLDATCRSCGRIKLTDDEIRTYLPEI
QKMDFETGDPEDIEILTKKYVDLASQRMVCPHCGAQQSKIILDKPTTFREEGTNVKITPK
EIRERLERIPDDDLIFFGFNPKTARPEWMVLTVLPVPPINVRPSITLETGERSEDDLTHK
LVDIIRISQRLRESRDNGSPQLIIEDLWDLLQFHVTTYFDNQTPGIPPARHRSGRALKTL
VQRLKGKEGRFRSNLSGKRVSFSSRTVISPEPYLSVNEVGVPERAARELTVPVIVNQFNI
DEMRELIKRGRNPRDQFGRYVTGVNYVIRPDGRRIKITDQNAAENADRIDIGWTVERQLM
EGDIVLFNRQPSLHRMSMMGHTVRILPGQTFRFNLAVCTPYNADFDGDEMNLHVIQKEEA
RAEARIIMKVQEQIMSPRFGGPIIGGIHDHVTALFLLTHNNPRYTHEEMVHIMAYLEPDL
LPEARIENGEKYYYGRDIFSTILPKGLNVRFRSKLCSGSSERCEFEDDPSDTYVEIVDGK
MIHGTIDEAAVSPFSGAIIDKIFRKFGSQEAARFIDRMTRLAVGFITYRGFSTGISDYDI
PESAVARIEELVAQAEDRINKLIETFRRGELQPAPGRSVEDTLEMEILSEAGVVRDESGK
IASSYLGLKVPSVIMARSGARATMLNISEVAGIVGQQSVRGGRLNRGYYNRTLPHFKRGD
IGADARGFVRSSYMTGLSPTEYFFHSIGGREGLVDTAVRTSRSGYMQRRLINAFEDLKVD
DSREVKDTVGSLIQIRYGEDGIDPTRSARGKAVDMNYILFDEERRXXXXXMKSMASLLWR
DTSKNIAAILEKLPADYAVDYDVPNNVEDGYITINKKNFTYHVVISGVRKYSPDVEAIVK
KKSGLKSIITIEKVEKIEPLSFMEFRVGGKTLEAMGSFEVAERQVTEIKEKYGENLSEDV
QKVLDDARAMGFTLPESVAEEIARRRTEWGEKAYKNILKRIGEEIGNELIDPYEAVGIIA
AQSIGEPGTQMTMRTFHFAGVREMNVTLGLPRLIEIVDARRIPSTPSMTIYLRPEYETND
EVVMDVVKRLENTSISDVADIITDIGELTVTVRPDPRKTKDRLIEMEDIMNAISKIKGIT
VMEDSGQIIIKPQQESFKKLYLLQEQIKGLTIKGISGIKRAIARVEGKEHRWVIYTQGSN
LKDVLEVDEVDPTRTYTNDIVEIANVLGIEAARNAILNEALRTLQEQGLNVDVRHLMLVA
DMMTFSGSVRAVGRTGISGRKSSVLARAAFEITTKHLLRAGIMGEVDKLAGVAENIIVGQ
PITLGTGAVDIIYKGYPKTKK
>PAB04245 (rpoA12) DE:DNAdirected RNA polymerase, subunit A' (rp
MQSVKKVIGSIEFGILSPQEIRKMSAVEVTVPDTYDDDGYPIEGGVMDKRMGVIDPGLRC
ETCGAKAGECPGHFGHIELARPVIHVGFAKTIHRILESTCRECGRIKLTDEEIEEYMKKL
ELAKNRRSEVNKILKEIHKKARERMVCPHCGAPQYPIKFEKPTIYWELRKDEQGNEYKHR
MMPTEIRDRLEKIPDKDLPLLGLHPEKSRPEWMVLTVLPVPPVTARPSITLETGIRAEDD
LTHKLVDIIRINNRLKQNIEAGAPQLIIEDLWDLLQYHVTTYINNETSGVPPAKHKSGRP
LKTLAQRLKGKEGRFRGNLSGKRVNFSARTVISPDPMISINEVGVPIQIAMELTVPEKVT
EFNIEKLRKMVLNGPDKYPGANYVIDPEGRRIRIMESNRENLARMIDIGWTVERHLLDGD
IVLFNRQPSLHRMSIMAHRVRVMPYRTFRLNLAVCPPYNADFDGDEMNLHVPQTEEAQAE
ARILMEVQNHIISPRYGGPIIGGIQDHISGGYLLTREGAYFTREEVEQMLMFAGVDIKEL
PEPDKYENGKPLWSGKTIFSLLLPDDLTVWYRNKLCDEPERCEALEKLIEEKLMPDPEEV
RKLAYDGFVYIQNGKLLSGAIDKKAYGREDGIILDLIVREYGVERARQFLDQVTKLTIWV
ITHKGFTTGIDDEDLPEEARDRIKEIIREAEERVQKLIEAYKRGELEPLPGKTLEETLES
KIMAVLAEARDNAGSIAEKYLGMDNHTVIMAKTGARGKILNITQMAALLGQQSIRGKRLY
RGFRGRVLSHFKPGDLGARAKGFVVNSYKSGLSPQEYFFHAMGGREGLVDTAVRTAQSGY
MQRRLINALQDLKVDYDGTVRDPTGVIVQFRYGEDGVDPMKSWGGKTVDVDRVIMRTLIK
MRANNKGXXXXXMVSSSTIKSLIEKKGKDLPESVKQELYEKLIKYNEKYKLTKAEVETII
DEVVKEYERALVEPGEAVGTVAAQSIGEPSTQMTLNTFHYAGVAEINVTLGLPRIIEIVD
ARKNPSTPMMTVYLDEEHRYDREKAEEVARRIEGTTLENLARTTTLDLINMEFIVEIDPE
RLEKSGLTMEKVLKKLQSSFKSAEFEMEGYTLIVRPKKFEKISDLRRLAEKVKKHRLKGL
SGVGKTIVRKEGDEYVIYTEGSNFKQVLKVPGVDPTRTKTNNIHEIAEVLGIEAARNAII
EEIMNTMREQGLEVDIRHIMLVADIMTLDGVVRPIGRHGVVGEKASVLARAAFEITVQHL
FEAAERGEVDNLSGVIENVLIGQPVPVGTGMVKLTMKLPLRPQKEKEEV
>PH15454|PHCB0201    DNAdirected RNA polymerase subunit A', puta
MHSVKKVIGSIEFGILSPQEIRKMSAVEVTVPDTYDDDGYPIEGGVMDKRMGVIDPGLRC
ETCGAKAGECPGHFGHIELARPVIHVGFAKTIHRILESTCRECGRIKLTDEEIEEYMKKL
ELAKNRRSEVNKILKEIHKKARERMVCPHCGAPQYPIKFEKPTIYWELRKDEQGNEYKHR
MMPTEIRDRLEKIPDKDLPLLGLHPEKSRPEWMVLTVLPVPPVTVRPSITLETGIRAEDD
LTHKLVDIIRINNRLRQNIEAGAPQLIIEDLWDLLQYHVTTYINNETSGVPPAKHKSGRP
LKTLAQRLKGKEGRFRGNLSGKRVNFSARTVISPDPMISINEVGVPIQIAMELTVPEKVT
EFNIEKLRKMVLNGPDKYPGANYVIDPEGRRIRIMESNKENLAKMIDIGWTVERHLVDGD
IVLFNRQPSLHRMSIMAHRVRVMPYKTFRLNLAVCPPYNADFDGDEMNLHVPQTEEAQAE
AKILMEVQNHIISPRYGGPIIGGIQDHISGGYLLTREGAYFTREEVEQMLMFAGVDIKEL
PEPDKYENGKPLWSGKTIFSLLLPDDLTVWYRNKLCDEEEKCEALEKLIEEKLIPDPEEV
RKLAYDGFVYIQNGKLLSGAIDKKAYGREDGIILDLIVREYGVERARQFLDQVTKLTIWV
ITHKGFTTGIDDEDLPEEARDRIREIIREAEERVQRLIEAYKRGELEPLPGKSLEETLES
KIMAVLAEARDNAGSVAEKYLGMNNHAVIMAKTGARGKILNITQMAALLGQQSIRGRRLY
RGFKGRVLSHFKPGDLGARAKGFVVNSYKSGLSPQEYFFHAMGGREGLVDTAVRTAQSGY
MQRRLINALQDLKVDYDGTVRDPTGVIVQFRYGEDGVDPMKSWGGKTVDVDRVIVRTLIK
MRSNGKKXXXXXMVSSSTIKSLIEKKGKDLPESVKQELYEKLIKYNEKYKLTKVEVETII
DEVIKEYEKALIEPGEAVGTVAAQSIGEPSTQMTLNTFHYAGVAEINVTLGLPRIIEIVD
ARKNPSTPMMTVYLDEEHRYDREKAEEVARRIEGTTLENLARTTTLDLINMEFIVEVDPE
RLEKSGLTMEKILKKLQSSFKSAEFEADGYTLIVRPKKIEKISDLRRLSEKVKKHRLKGL
SGVGKTIIRKEGDEYVIYTEGSNFKQVLKVPGVDPTRTRTNNIHEIAEVLGIEAARNAII
EEIINTMHEQGLEVDIRHIMLVADIMTLDGVVRPIGRHGVVGEKASVLARAAFEITVQHL
FEAAERGEVDNLSGVIENVLIGQPVPVGTGMVKLTMKLPLKPQKEKEEV
>AE005138    AE005138 0010 Halobacterium sp.
MSAGQAPKEIGEISFGLMDPEEYRDMSATKVITADTYDDDGFPIDMXXXXXGLMDPRLGV
IDPGLECKTCGQRSGGCNGHFGHIELAAPVIHVGFSKLIRRLLRGTCRECASLLLTEEEK
DEYRENLDRTRSLRQDVSDVMTAAIREARKKDHCPHCGEVQYDVKHEKPTTYYEVQQVLA
SDYSERIAASMQPDEDEDDAGVSPQELAEQTDIDISRINEILSGEFRPRREDREAIETAI
GADLTTEDMNKLMPSDIRDWFEDIPGEDLEALGVNSDRSRPEWMILTVLPVPPVTARPSI
TLDNGQRSEDDLTHKLVDIIRINQRFMENREAGAPQLIIEDLWELLQYHVTTFMDNEISG
TPPARHRSGRPLKTLSQRLKGKEGRFRGSLSGKRVNFSARTVISPDPTLSLNEVGVPDRV
ATEMTQTMVVNEQNLERARRYVRNGPEGHPGANYVTRPDGRRVRVTEKVCEELAERVEPG
WEVQRHLIDGDIIIFNRQPSLHRMSIMAHEVVVMPYKTFRLNTVVCPPYNADFDGDEMNM
HALQNEEARAEARVLMRVQEQILSPRFGENIIGAIQDHISGTYLLTNDNPRFNETQASDL
LRQTRIDELPAAAGTDEDGDQYWTGHQIFSELLPDDLSLEFTGTTGDTVVIEDGQLLEGT
IADDEVGEYGSEIVDTITKVHGNTRARIFINEVASLAMRSIMHFGFSIGIDDETVSTEAR
ERIDEAIQSAYDRVQELIETYENGDLESLPGRTVDETLEMKIMQTLGKARDSAGDVAEEN
FDEDNPAVVMANSGARGSMLNLTQMAGCVGQQAVRGERINRGYEDRTLSHFAPNDLSSEA
HGFVENSYTSGLTPKEFFFHAMGGREGLVDTAVRTSKSGYLQRRLINALSELETQYDGTV
RDTSDTIVQFEFGEDGTSPVQVSSNEEVDIDVEHVADRILNSEFDSDTQKAEFLEVEEPP
TNLSEHGAAWEVESDDXXXXXMTEFDVTDGVHDVIEDTELPRRLKDEVLATAEERGVTKS
QANEIATAVEAQYLDTRVDPLDPVGTVSAQSIGEPGTQMTMNTFHYAGVAEMDVTQGLPR
LIELVDARKTPDTPVMEVYLEDEYAEERERAHEVVWKIEATKILALGDISTNVADMVVRI
DLNEDTLQERWPRVDSTTQIAGEVAETIEGNLGVTVTQDGTILEFGPSEPSYRELLQLVE
QLRDIVFKGIEEVSRVVIRREETERGEEFVLYTEGSAFKKALKIEGVDATRTSCNNIHEV
HKTLGIEAAREAIINETMTTLEEQGLDDVNIRHLMLVADIMTNDGTIESIGRHGISGNKN
SVLARAAFEVTVNHLLDAAIHGESDDLDGVIENVIVGKPVRLGTGDVDLRMGATQTSD
>PAG5_06667 362121 364775 DNAdirected RNA polymerase subunit A' (
MSLREEFDAIPKKVIKSIKFGVLSPEMIRKYSVMEVTTSEVYDEGGLPVRGGIADRRLGV
AEPGSRCETCGQTHDVCPGHFGHIELVKPVIHVGFARIIYDVLRATCPNCGRIMLRDEEI
ERYRERLNKLKARWRLLALNLHEKIRKKATERMTCPHCGYKRNKVRFERPYYFYEETKSG
ALVKLDPEVLRERLSKVPSEDLELLGINPSVARPEWAILKVLPVPPPHVRPSIQLETGIR
SEDDLTHKLVDIIRMNEKLKIAIETGAPTNVVDNLWELLQYHVATYFDNELPGIPVAKHR
GGRPLKGIAQRLKGKEGRFRGSLSGKRVNFSARTVISPDPHISINEVGVPYDIAKVLTVP
ERVTAWNVDVLREYVIRGPESWPGANYVVTPEGRRIDLRYVKDRKALAERLAPGWIVERH
LIDGDIVLFNRQPSLHRVSMMGHLVKVLPGRTFRLHLAVCPPYNADFDGDEMNLHVPQTE
EARAEARALMLVEKHIITPRYGGAIIGARQDYIIGAYLLSHKSTFLTKKEVAFLLGAGKS
QEDPPEPAILHPVELWTGKQIISHFLPKDLNWVQPTAFKSKCQNAYTCPTDEWIIVINGN
LVKGVLDKKSIGAEQVDSLWHRIARDYSPEVARRWLDSSLRLFLRFLDLRGFTFGMDSIY
IPPEAYREIDNIIKASLEKVDKLIDDFRSGHLEAMPGFTVEETFENKVTEILSKVREDAA
VVVEKYIDKNSEGYLMAKTGARGSIVNIVQMVATLGQQTIRGERIRRGFRTRTLPHFPVG
DIGAFSGGFVRNCFRCGLTPVEYFFHAAAGRDGLIDTAVRTAQSGYMQRRLINALQDVYV
AYDGTVRFGGAMLLQYLYGEDGVDVSRSDHGKVVDIKSLKMWIRXXXXXMISREELLSKL
SQVLPQPLYKEVEEAVRDLDDEKALRLVYRVLKLYVTSLIDPGEAIGIVTAQSIGEPGTQ
MILRSFHYAGLREFSMARGLPRLIEVVDARRTPSTPLMYVYLKPPYNKSREAAESVAKKI
QQVTLETLAKEVDVDYVAGTVTITLDQEQLKYRGLTLKDVEKIVAKAKGKDVAISMRGYT
ITASLTTPDILKIRKIKDKILQIKISGIKGVRKVVLQYDSKNDEWYIVTEGTNLEAVLQL
EEVDPTRTYSNDLHEVEEVLGIEATRALVAQEIKRVLEEQGLDVDIRHMYLVADAMTWSG
RLRPIGRHGVVGSKESPLARAAFEVTVKTLIEASVRGEDELFKGVVESIIAGKYVPIGTG
IVRLLMQF
>AE006659    AE006659 0104 Sulfolobus solfataricus secti
MSEKNIKGIKFGILSPDEIRKMSVTAIITPDVYDEDGTPIEGSVMDPRLGVIEPGQKCPT
CGNTLGNCPGHFGHIELVRPVIHVGLVKHIYEFLKATCRRCGRVKISEDEIEKYSRIYNA
IKKRWPSAARRLTEYVKKTAMKAQVCPHCNEKQYKIKLEKPYNFYEERKEGVAKLTPSDI
RERLEKIPDSDVEILGYDPTTSRPEWMILTVLPVPPITIRPSIMIESGIRAEDDLTHKLV
DIVRINERLKESIDAGAPQLIIEDLWDLLQYHVATYFDNEIPGLPPSKHRSGRPLRTLAQ
RLKGKEGRFRGNLSGKRVDFSSRTVISPDPNISIDEVGVPEIIAKTLTVPERITPWNIEK
LRQFVINGPDKWPGANYVIRPDGRRIDLRYVKDRKELASTLAPGYIIERHLIDGDIVLFN
RQPSLHRISMMAHRVRVLKGLTFRLNLLVCPPYNADFDGDEMNLHVPQSEEAIAEAKEIM
LVHKNIITPRYGGPIIGAAQDYISGAYLLTVKTTLLTKEEAQQILGVADVKIDLGEPAIL
APREYYTGKQVISAFLPKDFNFHGQANVSSGPRLCKNEDCPHDSYVVIKNGILLEGVFDK
KAIGNQQPESILHWLIKEYSDEYGKWLMDNLFRVFIRFVELQGFTMRLEDVSLGDDVKKE
IYNEIDRAKVEVDNLIQKYKNGELEPIPGRTLEESLENYILDTLDKLRSTAGDIASKYLD
PFNFAYVMARTGARGSVLNITQMAAMLGQQSVRGERIKRGYMTRTLPHFKPYDISPEARG
FIYSSFRTGLKPTELFFHAAGGREGLVDTAVRTSQSGYMQRRLINALSDLRAEYDGTVRS
LYGEVVQVAYGDDGVFPMYSAHGKTVDVNRIFERVVGWKAXXXXXMEGMIDEKDKPYLEE
KVKQASNILPQKIVDDLKNLILNKEIIVTRDEIDKIFDLAIKEYSEGLIAPGEAIGIVAA
QSVGEPGTQMTLRTFHFAGIRELNVTLGLPRLIEIVDAKKVPSTPMMTIYLTDEYKRDRD
KALEVARKLEYTKIENVVSSTSIDIASMSIILQLDNEMLKDKGVTVDDVKKAIGRLKLGD
FMIEESEDSTLNINFANIDSIAALFKLRDKILNTKIKGIKGIKRAIVQKKGDEYIILTDG
SNLSGVLSVKGVDVAKVETNNIREIEEVFGIEAAREIIIREISKVLAEQGLDVDIRHILL
IADVMTRTGIVRQIGRHGVTGEKNSVLARAAFEVTVKHLLDAAARGDVEEFKGVVENIII
GHPIKLGTGMVELTMRPILR
>APE18532  DNAdirected RNA polymerase subunit A', putative
MLGAMSLRLSEFRETNLLDKILFGVLSPHEIRQLAKVTVVRGDLYEADGTPVSGGLRDPH
FGAIEPGERCPVCGNSREECPGHFGKIELARPVLIPHYTDYVYKILQATCRVCGRITLPE
EKIKYYRLIFRRLRGKWPQLAKTFATMVVKEAAQATQCPHCGKPQYKVVYIRPFHFYEKK
PEGDVKLTPSEIRERLEKTGSEIEILGVHPERSRPEWMVATVLIVPPLAVRPSITLETGL
RSEDDLTHALAEIVRQNEKLRNVIVTNAPETLIEENWMVLQEMVAAYIDNELPGARRLTH
RRKRYLKTLAQRLKGKDGRIRGNLSGKRVNYSARTVISPDPYISINEVGVPEEIAKTLTI
AMRVTPYNLEEARRYVLNGPDKWPGALFVYKASERKKYDLRFFKDYEKLAESLEPGDVVE
RHLINGDIVLFNRQPSLHRMSIMGHIVRVMPGKTFRLNLLVCPPYNADFDGDEMNLHVPR
LEEAQAEAREIMLVEKHILTPRYGGPIIGGRQDYVSGAYLLTVKTRLFTKEEVERLLSVA
GYKGDLPEPAILKPEPLWTGKQLASIFLPEDLSFKGKSKTNAGDLACSDELCLHDSYIVI
TNGKLLEGVLDKKAIGAEEPNNLLHIIALEYGNSKARQLLDSFYRMFIRMLELRGLTISL
HDIDLPDRAKQEIEELIREYKGRVYSIIEEYRKGTLEPIPGRSLEESLEIKIMEVLDELR
KRVQEVASNNLDPFNDVFVMARTGARGSDVNISQMAAMLGQQAVRGRRIRRGLRNRVLAH
FKENDIEPEAWGFVRSSFRKGLRPTEVFFHAAAGREGLVDTAVKTSQSGYMQRRLINALQ
DIVVTYDGTVRDLYGNLIQLKYGEDGVDPMKTYHGKPVDVKRIIQRVKASKAETVXXXXX
MEGVKEFKTLEESLEAARYILPESLYKELVETVEKEDGLSEEDKISVVKETIRTYLRSLA
QPGEAVGTVAAQSIGEPGTQMTLRTFHYAGIMEFDVTLGLPRLIEIVDAKQTPSQPLMYI
YLKDEYAKDLEKAKEAARKIEYTTLEKIIDNIEWDLGDRVVAIVINAEYMEDKGVTVDMV
LEALDKSKLGKVVEDGVREVSEGGVKKVIVYFEISDKQLPDEELFNSNAYHKVLEKLKNT
YIKGIKGIRKVTVRREEGEDSYEYMLIVEGSNLREVLMLPEVDHRRSISNDIQEIAQVLG
IEAARTAIIEEIKRVLEDSGLDVDIRHLMLIADLMTWPGYVRQIGRLGVVGEKPSPLARA
AFEVTVKQLYEAAVWGEEEEFAGVTENIIAGLPPRVGTGSVLLRMGAARK
>MPU00089    U00089 9611 Mycoplasma pneumoniae complete ge
MTKRNKKNNKLYKNIKAIKLSIASNDTILNWSEGEVTKAETINYKSLKPEPGGLFDEAIF
GPVKDYECACGKFKKIKYRGVRCDRCGVWVTESIVRRERMGHIALVSPVAHIWMSKELPS
PSKISLVLNISYKEVEQVLYFVNYIVLDTGKIKDPKIMPFKFKEVLDLAGKGSLTTRQKM
RRVIGYIFRNLIKNRSSEDYRKGKIFYESLKNSSLPFSLNDAFNYIKKYTGFRVGIGAEA
ILELLNKIDLNYEFSKLNDALRKAKKDSVEDAKVKKILRQLETISWFRNSKLHPKNMILH
TVPVIPPDIRPIIQLDGAKFTTSDINNFYRRVIIRNDRLRRILEDGTVPAIVVNNEKRLL
QESVDALFDNSSRHKPALSKDKRSLKSLTDRLKGKQGLFRHNLLGKRVDYSGRSVIVVGP
ELKMYEVGIPALMILKLFKPFIIHGLINKFDSNGNEIRPIASSIRQAEDMIKNQDDLIWG
IVYDVIKDRPVLLNRAPTLHRLGIQAFEPRIVDGKAIRLHPLVTTAFNADFDGDQMAVHV
PLSENAVNEARAILLASKHILGLKDGRPIVTPTQDMVLGNYYLTTERKGQTGEGIIFGTV
HEARAAYEAGKVHLHAIVGISTKAFPNKHFEAQGTLITTVGKIIFNDVLGDNIPYINEGE
FDEHACPQKFIVPPSGDVRAAIAAHQVLPAFGKKVISKLIDLLYTVVEFKDLPRILENIK
ALGFKYSTHSSTTVSVFDIPKYSNKQQYFDEADQQVLKYKQFYNKGLLTDDERYKRVVKL
WNGVKEKVSSEIQDLIKREEYRDNSIVVMADSGARGNISNFTQLFGMRGLMSKSFNYERN
NQSKIIKDTIEVPIKHSFLEGLTINEYFNSSYGARKGMTDTAMKTAKSGYMTRKLVDATH
ELIINHDDCGTRKGIVVEAIVETKTRSLVESLFDRIVNRYTIGPILDPETKAEIVPANSL
ITQELAKQICATSIKQVLVRSVIYCERENGVCQYCFGVDLSTGKLVELGTAVGVIAAQSI
GEPGTQLTMRTFHTGGVSTENNLAQGFERLKQIFEVVAPKDYERCVISEVKGVVKSITTT
QNAQEVLIESSVDERTYSIPFSAQLRVKVGDAVELGSKITEGSIDIRQLLRVAGIQRVRQ
YMIVEIQKVYRIQGIEIADKYVEIIIRQLTSLLQVTDAGSSNLFVGQLVHSHHLNELNKS
LLLSGKMPVIAINQVFGIDEAASKSNSFLSAASFQDTKKILTDAAVKTQVDYLLGLKENV
IIGGKIPAGTGFLTDEELAYLGAKTVQEEY
>MG340 DNAdirected RNA polymerase, subunit beta' (rpoC)
MTTTRRNKRNNKLYKNIKAIKLSIASNDTILNWSEGEVTKAETINYKSLKPEPGGLFDEA
IFGPVKDYECACGKFKKIKYRGVRCDRCGVWVTESIVRRERMGHIALVSPVAHIWMSKEL
PSPSKISLVLNISYKEVEQVLYFVNYIVLDTGKIKDDKIMPFKFKEVLDLTGKGSLSTRQ
KMRRVIGYIFRNLIKSKSSEDYRKGKIFYESLKNSSLPFSLNDAFNYIKKYTGFRVGIGA
EAILELLNKIDLNLEFSRLNDALRKAKKDSVEDAKVKKILRQLETISWFRNSKLHPKNMI
LHTVPVIPPDIRPIIQLDGAKFTTSDINNFYRRVIIRNDRLRRILEDGTVPSIVVNNEKR
LLQESVDALFDNSSRHKPSLSKDKRSLKSLTDRLKGKQGLFRHNLLGKRVDYSGRSVIVV
GPELKMYEVGIPALMILKLFKPFIIHGLINKFDENGNEIRPIAASIRQAEDMIKNQDDLI
WGIVYDVIKDRPVLLNRAPTLHRLGIQAFEPRIVDGKAIRLHPLVTTAFNADFDGDQMAV
HVPLSENAVNEARAVLLASKHILGLKDGRPIVTPTQDMVLGNYYLTTERKGQLGEGIIFS
TVYEARAAYESQKVHLHAIVGISTKAFPNKKFACQGTLITTVGKIIFNDVLGNNVPYIND
GEFDENACPEKFIVKQGEDVRQSILKHQIIPAFSKKVISKLIDLLYLLLEFKDLPKTLDN
IKALGFKYSTFSSTTVSVFDIPKYTNKQNYFDSADQQVLKYKQFYNKGLLTDDERYKRVV
KLWNNVKEKVSDEIQNLIKQEQYRDNSIVVMADSGARGNISNFTQLFGMRGLMSKSFNYE
RNNQSKIIKDTIEVPIKHSFFEGLTINEYFNSSYGARKGMTDTAMKTAKSGYMTRKLVDA
THELIINHDDCGTRKGIVVEAIVETKTKSLIESLFDRIVNRYSITPIVDPETQKTIVEAN
SLITTQLAKQICATSIKEVLVRSVIYCERENGICQYCFGIDLSTGKLVELGTAVGVIAAQ
SIGEPGTQLTMRTFHTGGVSTENNLAQGFERLKQIFEVVTPKDFEKAVISEVKGTVKSIT
TVQNAQEVVIKSNVDERIYTIPFSAQIRVHVGDQVSPGSKITEGSVDIKQLLRIAGIQRV
RQYMIVEIQKVYRIQGIDIADKYVEIIIRQLTNLLQVTDAGNSNLFVGQLVHSHYLNELN
KSLLLAGKMPVIAINQVFGIDEAASKSNSFLSAASFQDTKKILTDAAVKNQVDYLLGLKE
NVIIGGKIPAGTGFLTDEELTFLGSKTVAEEY
>AE002118    AE002118 0002 Ureaplasma urealyticum section
MSQKGIKSLTISIASPEQILNWSKGEITKPETINYKSLKPEPNGLFDESIFGPSKDYECY
CGKYRKVKHKGKICERCHVEITESIVRRERMGHIELAAPVAHIWFTKELPSPSKISLLLD
ITYKEVDQVVYFVNYIVLDEGNNVYDGKSIFNKKEVLDLTSPKNSIRSRNKLRRTLRNIQ
ERIEDELNHEREALIQDFDYRLAVTYDQMLKDSNIPFSVKDVMAFIEKHTGVRFGIGAEA
IRELLEKLNLEEEHEKIKQAIQNSPNAYDQKTKRLLRRLECVRWIKDSGSKPEWMVMTRI
PVTPSETRPIISLDGGRFTTSDTNNFYRKIIIRNERLKQMQATDAPEILLDNEKRLLQEA
VDSLFDNNSRKKPVVGKDKRPLKSLSNHLKGKQGLFRQNLLGKRVDYSGRSVIVVGPELK
MYEVGIPALMILKLFRPYIISELIRKRDELGNEIQPICANIKLAEQKILAQDNEIWPVVE
KVIKQRPVILNRAPTLHRLGIQAFEPKMVDGKAIRLHPLVTTAFNADFDGDQMAVHIPLS
KEAVAEARSILLASWHILGPKDGKPIITPTQDMILGIYYLTKEKFPQVIEEMMAKDPTQA
RVEFINNFHIFSTQDEAIRAYKLKTIRINDVIGITTKAFNNKTFSKEGILVTTVGKIIFN
QAFPVNFPYINDVKNLYGENQFEIIGMHESILDYLKAYNLKEPLTKKTLSTVIDYLYKVS
EIEVVPQTMDKIKALGFKYSMISATSISAFDIPSYDQKYEYFKETDELVSKLREFYLDGK
LTDDERYTKVVQAWSQTKDKVTHDIEKLINSNEYKDNPIVIMAKSGARGNTSNFTQLAGM
RGLMSKSYNYDQKNNNGVIKDTIEIPIKHSFIEGLSVSEYFNSSFGARKGMTDTAMKTAK
SGYMTRKLVDSTQAVVIKDHDCGTKEGIIVREIRNTKDNTSIESLKDRIVGRYSINTIYD
TKNKLIIESDKLITSEIANIIQNSGIREVEVRSPLHCASLYGVCQKCFGLDLSTNKLIET
GTAIGVIAAQSIGEPGTQLTMRTFHTGGVAGDTNITQGFERIKQLFDCIQPQENEKAVIS
QVKGTVERIEKDSNTNGYNVVIKYNKDNYVNYPTRSNAVLRVKTGDEIIAGQKITEGSID
VNDLLKYAGIENVRHYIIKEVQKVYRMQGIEISDKYIEVIISQLTNKITITNPGDSGLFV
GETISINEFTEVAQNMLVNKKKPPSAINQVFGLDHAPSKSGSFLSAASFQDTKKILTDAA
ARSQKDMLIGLKENVILGNLIPAGTGLKDVEEVIAYGEEMYKKQY
>MPULM03     AL445565 0105 Mycoplasma pulmonis (strain UAB
MPKTRKYSTVDEEKILKVSLSLATKEDVLEWSHGEVTKPETINYKSYKPERHGLFDELIF
GPATDYKCPICGKKYKKSNEGLTCNNTPQCEIEKPEILPKISRRSRMGHIALQTPVVHFW
FFKIDNSIISKLLVLRVGESNEYVSKNDLENIIYYKSHIVLDNGGLKSLPKNKIININNA
AQIYKDALIELRELNLNDADALEIIDGTINHLNDIVGSKVGNDYGVDFYELNEVIEEYSS
AKIQTGSKAIEFLLENIDLEEEQRKIKSKIKEINNLEKTSSSRKQDLSKLYKRLQVVESF
INSGQKPTSMLIYNLPVIPAELRPLVQLDGGRHSTSDINELYRRIIIRNNRLRKWIELNA
PTLITQNELRMIQEAVDALIDNSKKKPKPVTSKDNRNLKSISDALTGKKGRFRQNLLGKR
VDYSGRSVIVVGPELKMNQVGIPREMAAKLFEPWIIKELIDQEITLSVKSARKLIDNLNP
IIWPHVAKVIQGRPVLLNRAPTLHRLSIQAFEPVLIRGKAIKLHPLVTTAFNADFDGDQM
AVHVPISDEAVREAKELLFANRNILGPKDGEPIINPSQDMILGIYYLTIEIAGAKGEAKV
FQDVNSMLRAYEEGSVSLHARVAIPFKKLQKTFNLKGDKGYIFSTVGKFIFNQAFPENFP
FIFDSSVSSISDAQEYTKKYYIPYGLNIKETIQNTPINDALSKKDLSKIIRTIFDKYVPV
LTKEDVASVINDVNHTNYKDTSTKFANLVTTNKTALEYIHAESLSKFTTKHFVDVNKKLS
LKTPGNPNQPIWEVDQYVELLENVWFDYVNIVASVLDEIKDLGFKFSTKSGTSISIHDIE
VSDNKKERIKEGDDYTSELKSMYREGLLTDDERYSLTINKWSEVKDNIQNDLKKIVKNNP
LNPIFIMMNSGARSNMANYVQLAGIRGLMTNNTKILKSDAENERVVRSTVEIPVKSSFLD
GLTAYEFYSSTHGARKGLTDTALNTAKSGYLTRRLVDVAQGIVVTEKDCATQNGFVVKDI
VDNKTKTVIVPIRERIEGRFTIEDVKDKDGNVIVEKDTLIDAKMAEEIVEVHDVKEVNIR
SILGCEAKNGVCQKCFGKDLATSRIVSIGEAVGITASQSIGEPGTQLTMRTFHSGGVAGV
EDITGGFGRLTELIDAYRSPWGRPAIISKVDGIITEIKTPKDKNTNLVYITYLDQDDASQ
TEVVSVPKNRTLRVKVGDKIVKGQKIIDGPIILEELLEYGGPRKVQSYLLKEIQKIYRMQ
GIAINDKYIEIIISQMLSKIEISEPGDSDFIIGSLVNNLDFYNTNNELLEKGLEPAKGKV
VIHGAKRIPLLSNSFLAAASYQESAKILVNSSISSQQDFLVGVKENIILGKKIPAGTNSQ
YESKSKFDIRDPKEYFKDKSPQRHYKIEMDNEVSDMFNEFRISQNK
>TM0459 DNAdirected RNA polymerase, beta' subunit (rpoC) {Thermoto
MPMSSFKRKIKAIQIKIASPEVIRSWSGGEVKKPETINYRTFKPERDGLFCERIFGPVKD
YECACGKYKGKKYEGTVCERCGVRVESREARRKRMGHIELAAPAVHIWYLESIPSVLGTL
LNMSTSDLENIIYYGSRRVIERAFIVTDPKDTPFSQGDVIYETEYRIYRKKWDFDVEQAF
VVKNPKSPVLSDIDGEVTLKTEKSITGREITWIIVKNITRATHTVLPGMILVVKDGQEVE
KGQDLTKEMTIDPVYAPFDGHVEIDELSNTITLKPLTTSKDQPVVFTIPYGAKILVSNGQ
KVKKGDQITTSTSLPAVKASISGTVRFGSNLNIRALEDGNFEVLSTGEVYVEQVIEERKY
PVFEGALVYVNNGDQVKKGDHLADRFLFEEEYLSATEYKIFESHYPTMFDVEERTENDRP
IVVITDIDPEVSKETGLKVGDIVTENEYEAYLQIYPEKIVADAGAQAIKKLLQNLDLEAL
QAEIEAELKKLPSSSSKAIKLRRRLKMVKDFLKSGNKPEWMVLEVVPVIPPDLRPMIQIE
GGRFATTDLNELYRRLINRNNRLKKLLELGAPEIILRNEKRMLQEAVDALIHNGSDSEGK
RSRRAVLKDRNGRPLKSLTDLLKGKKGRFRRNLLGKRVDYSGRAVIVVGPNLKIHQCGIP
KKMAMELFKPFVLAKLLGEGSSSKTMRKVKKAIIEKEMPEAWEVLEEVIKGSVVLLNRAP
TLHRMSIQAFEPKLVEGNAIQLHPVVCPPFNADFDGDQMAVHVPLSAAAQAEARFLMLSR
YNIISPAHGKPISLPTQDIIIGSYYLTTVGKEFDSLKEEDVKWKFSSPEEAMLAYHLGFI
KLHTPILIKVAVNGEEKRIKTTLGRVIFNGILPEDLRDYNRIFDKKQINALVYETFKRHG
IDRAADLLDDIKDIGFHYATVSGLTLSLKDLKIPPERDEILRKTWEKVRIIEENYEKGFL
TEEQRKSEIIRLWMSVTEEITKLTSKTLAEDPFNPIYMMVNSGARGNIDQVKQLAGIRGL
MIKAYDPRSREIKSKIFKGQAIHEALTFDYPVDKNLREGVDILQFFISTYGARKGQVDTA
MNTSFAGYLTRRLVDVAQSVTVAEPDCGTHEGIRAMDLIKEGTVVEKMNEFLFGRVLARD
VLDPETKEVLKNPETGKEYTRNTMLTDDDANFLASYKKMVDVVRYEEIDITELSLPNMYA
EIAEPVGEYEEGTELTWDVIKAAKNEGKYRIKVKVYPVVGTVYAEEEPLYDKKGERQLLV
YQEVINEIVAKMLEENGIEKVSVRPDIIVRSPLTCESEYGVCAACYGMDLSNHKIVNVGE
AVGVVAAQSIGEPGTQLTMRTFHVGGVMGASDIVSGLTTVEKTFEPYAFLREEKSGGKKE
IRKYYGSEAILCEVDGFVKDIATDESGRTVIYIEDYAGNIHAYKVPKRAKVRVEKGQKVL
RGETLTSGAIVWWKLLELESEKGVMTAMNLLKIIKNAYVQQGVSIHDKHFEIIFKQMLSM
ATIVDPGDSDYLPDQLVPLVDIKRFNREILEGNAKVEENRKWVIGKTLAKRIITETEEGE
LVELAQKGDEVTEELLKKIIEAGIKEIDVFEKDKVVTYQILPKEPIKYKRRLLSLKKAAL
NYPGWLSAAAFEETAWVLTAAAIEGKVDPLIGLKENVIVGQLIPAGTGLDVFAGIQVEET
PRAAVEEELA
>AP001507    AP001507 0008 Bacillus halodurans genomic DNA
MIDVNNFEYMKIGLASPNKIRSWSRGEVKKPETINYRTLKPEKDGLFCERIFGPQKDWEC
HCGKYKRVRYKGVVCDRCGVEVTRAKVRRERMGHIELAAPVSHIWYFKGIPSRMGLVLDM
SPRSLEEVIYFASYVVTDPGDTPLEKKQLLSEKEFRAYLDKYGRSFTAQMGAEAIRKLLM
DIDLDKEVDGLKEELQTAQGQRRTRAIKRLEVLEAFRNSGNEPSWMILDVLPVIPPELRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKRLLDLGAPSIIVQNEKRMLQEAVDALIDNGR
RGRPVTGPGNRPLKSLSHMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPNLKMYQCGLPKE
MALELFKPFVMKELVSKGLAHNIKSAKRKVERVQPEVWDVLEEVIKEHPVLLNRAPTLHR
LGIQAFEPTLVEGRAIKLHPLVCTAYNADFDGDQMAVHVPLSAEAQAEARILMLAAQNIL
NPKDGKPVVTPSQDMVLGNYYLTMEREGAKGEGSVFKDTNEALIAYQNGYVHLHTRIAIP
VASLGKTTFKEEQNSQLLLTTVGKLIFNEILPESFPYVNEPTAHNLEVETPSKYMVPTST
NVKELFQERDVVAPFKKGFLGNIIAEVFKKFKITETSKMLDRMKDLGFKYSTKAGITVGV
ADIVVLPEKKEILAEAEKKVDRVLKQFRRGLITEEERYDRVISIWSEAKDVIQDKLMGSL
DKRNPIFMMSDSGARGNASNFTQLAGMRGLMANPSGRIIELPIKSSFREGLTVLEYFIST
HGARKGLADTALKTADSGYLTRRLVDVAQDVIVREDDCGTDRGLEVEAIKEGNEIIEGLY
DRLVGRVAFKTVRHPETGEPIVKKNELIHEDLAKQIVEAGVEQVTIRSVFTCDTRHGVCK
KCYGRNLATGSDVEVGEAVGIIAAQSIGEPGTQLTMRTFHTGGVAGDDITQGLPRIQELF
EARNPKGQAVITEIEGEVTNINEADKREITVKGEMETKTYSIPYGARIKVELGEQVVPGQ
SLTEGSIDPKELLKVQGMTGVQEYLLREVQKVYRMQGVEIGDKHVEVMVRQMLRKIRVID
AGDTEVLPGSLIEIQHFNDENKKVLLSGKRPATGRPVLLGITKASLETDSFLSAASFQET
TRVLTDAAIKGKRDELVGLKENVIIGKLVPAGTGMNRYRNLDIVSDYDQQAVGTEEAVME
EAVTTE
>BSUB0001    Z99104 9711 Bacillus subtilis complete genome
MLDVNNFEYMNIGLASPDKIRSWSFGEVKKPETINYRTLKPEKDGLFCERIFGPTKDWEC
HCGKYKRVRYKGVVCDRCGVEVTRAKVRRERMGHIELAAPVSHIWYFKGIPSRMGLVLDM
SPRALEEVIYFASYVVTDPANTPLEKKQLLSEKEYRAYLDKYGNKFQASMGAEAIHKLLQ
DIDLVKEVDMLKEELKTSQGQRRTRAIKRLEVLEAFRNSGNKPSWMILDVLPVIPPELRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKRLLDLGAPSIIVQNEKRMLQEAVDALIDNGR
RGRPVTGPGNRPLKSLSHMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPHLKMYQCGLPKE
MALELFKPFVMKELVEKGLAHNIKSAKRKIERVQPEVWDVLESVIKEHPVLLNRAPTLHR
LGIQAFEPTLVEGRAIRLHPLVCTAYNADFDGDQMAVHVPLSAEAQAEARILMLAAQNIL
NPKDGKPVVTPSQDMVLGNYYLTLERAGAVGEGMVFKNTDEALLAYQNGYVHLHTRVAVA
ANSLKNVTFTEEQRSKLLITTVGKLVFNEILPESFPYMNEPTKSNIEEKTPDRFFLEKGA
DVKAVIAQQPINAPFKKGILGKIIAEIFKRFHITETSKMLDRMKNLGFKYSTKAGITVGV
SDIVVLDDKQEILEEAQSKVDNVMKQFRRGLITEEERYERVISIWSAAKDVIQGKLMKSL
DELNPIYMMSDSGARGNASNFTQLAGMRGLMANPAGRIIELPIKSSFREGLTVLEYFIST
PGARKGLADTALKTADSGYLTRRLVDVAQDVIIRETDCGTDRGILAKPLKEGTETIERLE
ERLIGRFARKQVKHPETGEVLVNENELIDEDKALEIVEAGIEEVWIRSAFTCNTPHGVCK
RCYGRNLATGSDVEVGEAVGIIAAQSIGEPGTQLTMRTFHTGGVAGDDITQGLPRIQELF
EARNPKGQATITEIDGTVVEINEVRDKQQEIVVQGAVETRSYTAPYNSRLKVAEGDKITR
GQVLTGGSIDPKELLKVTDLTTVQEYLLHEVQKVYRMQGVEIGDKHVEVMVRQMLRKVRV
IDAGDTDVLPGTLLDIHQFTEANKKVLLEGNRPATGRPVLLGITKASLETDSFLSAASFQ
ETTRVLTDAAIKGKRDELLGLKENVIIGKLVPGGTAMMKYRKVKPVSNVQPTDDMVPVE
>AP003130    AP003130 0104 Staphylococcus aureus genomic D
MIDVNNFHYMKIGLASPEKIRSWSFGEVKKPETINYRTLKPEKDGLFCERIFGPTKDWEC
SCGKYKRVRYKGMVCDRCGVEVTKSKVRRERMGHIELAAPVSHIWYFKGIPSRMGLLLDM
SPRALEEVIYFASYVVVDPGPTGLEKKTLLSEAEFRDYYDKYPGQFVAKMGAEGIKDLLE
EIDLDEELKLLRDELESATGQRLTRAIKRLEVVESFRNSGNKPSWMILDVLPIIPPEIRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKRLLDLGAPGIIVQNEKRMLQEAVDALIDNGR
RGRPVTGPGNRPLKSLSHMLKGKQGRFRQNLLGKRVDYSGRSVIAVGPSLKMYQCGLPKE
MALELFKPFVMKELVQREIATNIKNAKSKIERMDDEVWDVLEEVIREHPVLLNRAPTLHR
LGIQAFEPTLVEGRAIRLHPLVTTAYNADFDGDQMAVHVPLSKEAQAEARMLMLAAQNIL
NPKDGKPVVTPSQDMVLGNYYLTLERKDAVNTGAIFNNTNEVLKAYANGFVHLHTRIGVH
ASSFNNPTFTEEQNKKILATSVGKIIFNEIIPDSFAYINEPTQENLERKTPNRYFIDPTT
LGEGGLKEYFENEELIEPFNKKFLGNIIAEVFNRFSITDTSMMLDRMKDLGFKFSSKAGI
TVGVADIVVLPDKQQILDEHEKLVDRITKQFNRGLITEEERYNAVVEIWTDAKDQIQGEL
MQSLDKTNPIFMMSDSGARGNASNFTQLAGMRGLMAAPSGKIIELPITSSFREGLTVLEY
FISTHGARKGLADTALKTADSGYLTRRLVDVAQDVIVREEDCGTDRGLLVSDIKEGTEMI
EPFIERIEGRYSKETIRHPETDEIIIRPDELITPEIAKKITDAGIEQMYIRSAFTCNARH
GVCEKCYGKNLATGEKVEVGEAVGTIAAQSIGEPGTQLTMRTFHTGGVAGSDITQGLPRI
QEIFEARNPKGQAVITEIEGVVEDIKLAKDRQQEIVVKGANETRSYLASGTSRIIVEIGQ
PVQRGEVLTEGSIEPKNYLSVAGLNATESYLLKEVQKVYRMQGVEIDDKHVEVMVRQMLR
KVRIIEAGDTKLLPGSLVDIHNFTDANREAFKHRKRPATAKPVLLGITKASLETESFLSA
ASFQETTRVLTDAAIKGKRDDLLGLKENVIIGKLIPAGTGMRRYSDVKYEKTAKPVAEVE
SQTEVTE
>gi|13874857|dbj|BAB46103.1| RNA polymerase betaprime chain [Staph
MIDVNNFHYMKIGLASPEKIRSWSFGEVKKPETINYRTLKPEKDGLFCERIFGPTKDWEC
SCGKYKRVRYKGMVCDRCGVEVTKSKVRRERMGHIELAAPVSHIWYFKGIPSRMGLLLDM
SPRALEEVIYFASYVVVDPGPTGLEKKTLLSEAEFRDYYDKYPGQFVAKMGAEGIKDLLE
EIDLDEELKLLRDELESATGQRLTRAIKRLEVVESFRNSGNKPSWMILDVLPIIPPEIRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKRLLDLGAPGIIVQNEKRMLQEAVDALIDNGR
RGRPVTGPGNRPLKSLSHMLKGKQGRFRQNLLGKRVDYSGRSVIAVGPSLKMYQCGLPKE
MALELFKPFVMKELVQREIATNIKNAKSKIERMDDEVWDVLEEVIREHPVLLNRAPTLHR
LGIQAFEPTLVEGRAIRLHPLVTTAYNADFDGDQMAVHVPLSKEAQAEARMLMLAAQNIL
NPKDGKPVVTPSQDMVLGNYYLTLERKDAVNTGAIFNNTNEVLKAYANGFVHLHTRIGVH
ASSFNNPTFTEEQNKKILATSVGKIIFNEIIPDSFAYINEPTQENLERKTPNRYFIDPTT
LGEGGLKEYFENEELIEPFNKKFLGNIIAEVFNRFSITDTSMMLDRMKDLGFKFSSKAGI
TVGVADIVVLPDKQQILDEHEKLVDRITKQFNRGLITEEERYNAVVEIWTDAKDQIQGEL
MQSLDKTNPIFMMSDSGARGNASNFTQLAGMRGLMAAPSGKIIELPITSSFREGLTVLEY
FISTHGARKGLADTALKTADSGYLTRRLVDVAQDVIVREEDCGTDRGLLVSDIKEGTEMI
EPFIERIEGRYSKETIRHPETDEIIIRPDELITPEIAKKITDAGIEQMYIRSAFTCNARH
GVCEKCYGKNLATGEKVEVGEAVGTIAAQSIGEPGTQLTMRTFHTGGVAGSDITQGLPRI
QEIFEARNPKGQAVITEIEGVVEDIKLAKDRQQEIVVKGANETRSYLASGTSRIIVEIGQ
PVQRGEVLTEGSIEPKNYLSVAGLNATESYLLKEVQKVYRMQGVEIDDKHVEVMVRQMLR
KVRIIEAGDTKLLPGSLVDIHNFTDANREAFKHRKRPATAKPVLLGITKASLETESFLSA
ASFQETTRVLTDAAIKGKRDDLLGLKENVIIGKLIPAGTGMRRYSDVKYEKTAKPVAEVE
SQTEVTE
>gi|12724824|gb|AAK05897.1|AE006409_7 DNAdirected RNA polymerase b
MRIGIASPQKIRYWSFGEVKKPETINYRTQKPEREGLFDERIFGPQKDWECACGKLKGVF
YKNQVCELCGVQVTTAKSRRERMGHIELAAPISHIWYFKGIPSRMGLALDMSPRALEEVI
YFASYVVIDPKETDLEKKQLLTEREYREQLLKNGFGSFVAKMGAEAIQDLLNDVDIDKEV
SELKEELKTVTGQRRVKIIRRLDVLSAFRKSGNALSWMVLNVLPVIPPDLRPMVQLDGGR
FATSDLNDLYRRVINRNNRLKRLMELNAPNIIVQNEKRMLQEAVDTLIDNGRRGRPITGA
GNRPLKSLSHMLKGKQGRFRQNLLGKRVDYSGRSVIAVGPTLKMYQCGVPREMAIELFKP
FVMAQLVKKELAANIRAAKRKVERQDSDVWDVLETVVKEHPVLLNRAPTLHRLGIQAFEP
VLIDGKAIRLHPLACEAYNADFDGDQMAIHLPLSEEAQAEARLLMLAAEHILNPKDGKPV
VTPSQDMVLGNYYLTMEEKGREGEGMIFATPEEVEIAMRNGYVHLHTRIGIATKSLNKPW
TENQKDKILVTTVGKVIFNSIIPEGMPYLNEPTDVNLTTSTDDRFFMDAGQDIKEVLAGI
DTVRPFKKGYLGNIIAEVFKRYRTTATSEYLDRLKNLGYHQSTLAGLTVGIADIPVVEDK
HKIIDAAHKRVEQITKQFRRGLITDDERYNAVTGVWRDAKEALEKRLIDEQDLTNPIVMM
MDSGARGNISNFSQLAGMRGLMAAPNGKIMELPIISNFREGLSVLEMFFSTHGARKGMTD
TALKTADSGYLTRRLVDVAQDVIIREDDCGTDRGLVISDIATGKEMVEPLFERLVGRYTR
KSVLHPETGEMIIADDTLISEDVARKIIDAGVKEVTIRSVFTCKTPHGVCKHCYGINLAT
GDAVEVGEAVGTIAAQSIGEPGTQLTMRTFHTGGVASSSDITQGLPRVQEIFEARNPKGE
AIITEVTGTVESIVEDPATRTREITVKGKTDTRSYTVGMADVLMVEEGEFIHRGAPLIQG
SIEPKHLLQVRDALSVETYLLGEVQKTYRSQGVEIGDKHIEVMVRQMLRKVRVMDNGSTD
ILPGTLMDISDFEALNETALLNGEMPATGRPVLMGITKASLETNSFLSAASFQETTRVLT
DAAIRGKEDHLLGLKENVIIGKIIPAGTGMFRYRNIEPLADLTNAPEVEEVETETVEN
>AE006480    AE006480 0104 Streptococcus pyogenes strain S
MVDVNRFKSMQITLASPSKVRSWSYGEVKKPETINYRTLKPEREGLFDEVIFGPTKDWEC
ACGKYKRIRYKGIVCDRCGVEVTRAKVRRERMGHIELKAPVSHIWYFKGIPSRMGLTLDM
SPRALEEVIYFAAYVVIDPKDTPLEPKSLLTEREYREKLQEYGHGSFVAKMGAEAIQDLL
KRVDLAAEIAELKEELKSASGQKRIKAVRRLDVLDAFNKSGNKPEWMVLNILPVIPPDLR
PMVQLDGGRFAASDLNDLYRRVINRNNRLARLLELNAPGIIVQNEKRMLQEAVDALIDNG
RRGRPITGPGSRPLKSLSHMLKGKQGRFRQNLLGKRVDFSGRSVIAVGPTLKMYQCGVPR
EMAIELFKPFVMREIVAKEYAGNVKAAKRMVERGDERIWDILEEVIKEHPVLLNRAPTLH
RLGIQAFEPVLIDGKALRLHPLVCEAYNADFDGDQMAIHVPLSEEAQAEARLLMLAAEHI
LNPKDGKPVVTPSQDMVLGNYYLTMEDAGREGEGMIFKDKDEAVMAYRNGYAHLHSRVGI
AVDSMPNKPWKDNQRHKIMVTTVGKILFNDIMPEDLPYLQEPNNANLTEGTPDKYFLEPG
QDIQEVIDRLDINVPFKKKNLGNIIAETFKRFRTTETSAFLDRLKDLGYYHSTLAGLTVG
IADIPVIDNKAEIIDAAHHRVEEINKAFRRGLMTDDDRYVAVTTTWREAKEALEKRLIET
QDPKNPIVMMMDSGARGNISNFSQLAGMRGLMAAPNGRIMELPILSNFREGLSVLEMFFS
THGARKGMTDTALKTADSGYLTRRLVDVAQDVIIREDDCGTDRGLLIRAITDGKEVTETL
EERLQGRYTRKSVKHPETGEVLIGADQLITEDMARKIVDAGVEEVTIRSVFTCATRHGVC
RHCYGINLATGDAVEVGEAVGTIAAQSIGEPGTQLTMRTFHTGGVASNTDITQGLPRIQE
IFEARNPKGEAVITEVKGNVVEIEEDASTRTKKVYVQGKTGMGEYVIPFTARMKVEVGDE
VNRGAALTEGSIQPKRLLEVRDTLSVETYLLAEVQKVYRSQGVEIGDKHVEVMVRQMLRK
VRVMDPGDTDLLPGTLMDISDFTDANKDIVISGGIPATSRPVLMGITKASLETNSFLSAA
SFQETTRVLTDAAIRGKKDHLLGLKENVIIGKIIPAGTGMARYRNIEPQAMNEIEVIDHT
EVSAEAVFTAEAE
>AE007809    AE007809 0107 Clostridium acetobutylicum ATCC
MELNKFDALQIGLASPEKIREWSRGEVKKPETINYRTLKPEKDGLFCERIFGPIKDWECH
CGKYKRVRYKGIVCDRCGVEVTKSKVRRERMGHIELAAPVSHIWYFKGIPSRMGLILDMS
PRALEKVLYFASYLVLDPKETPLLKKQLLNEKEYREAADKYGEESFEAGMGAESIKKLLQ
EIDLNQLSEELKENLKTSTGQKKVRIIRRLEVVESFRKSTNKPEWMIMDVIPVIPPDLRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKKLLDLGAPDIIVRNEKRMLQEAVDALIDNGR
RGRPVTGPGNRPLKSLSDMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPDLKMYQCGLPKE
MALELFKPFVMKKLVETGAAHNIKSAKRMVERVQNQVWDVLEEVISDHPVMLNRAPTLHR
LGIQAFQPILVEGRAIKLHPLACTAYNADFDGDQMAVHVPLSVEAQAESRFLMLAAHNIL
KPSDGKPVCVPTQDMVLGSYYLTIDKDGVKGEGKAFTNVDEALMAYQLGEIDIHAKIKVR
LEKEIDGKMVSGIIETTIGKLIFNESIPQDLGFIDRSIAGNELLLEINFLVGKKNLGGII
DKCYRKHGPTKTSIMLDKIKAKGYHYSTISAITVSTSDMTVPPNKGELMSEAETAVEKIE
KMYRRGFISDDERYERVISTWTKTTEKVADALMDNLDRFNPIFMMADSGARGSKSQIKQL
AGMRGLMANPSGKIIELPIKASFREGLDVLEYFISTHGARKGNADTALKTADSGYLTRRL
VDVSQDVIVRNEDCGATEGFEVSEIKEGNEVIESLSERLIGRYTSEDIIDPTSKEVLVKQ
NEYIDEDKAIRIEKVGVKKVKIRSVFTCNCKYGVCAKCYGMDMATAEKISMGEAVGIVAA
QSIGEPGTQLTMRTFHTGGVAGSDITQGLPRVEELFEARKPKGLAIVSEVSGTVRIEETK
KKRIVFIATESGEEVSYDIPFGSSLKVKNGETIGAGDEITEGSVNPHDIIRIKGVNAVKN
YLLSEVQKVYRLQGVDINDKHLEVVIRQMTRKVKVEDSGDTELLPGTMIDIFDFRDENKK
VEENGGRPAQARVSLLGITKAALATDSFLSAASFQETTRVLTDAAIKGKSDPLVGLKENV
TIGKLIPAGTGMNRYKNIEIDPLVTETNADDENFIAEDKIEG
>Rv0668, TB.seq  763368:767315 MW:146740
VLDVNFFDELRIGLATAEDIRQWSYGEVKKPETINYRTLKPEKDGLFCEKIFGPTRDWEC
YCGKYKRVRFKGIICERCGVEVTRAKVRRERMGHIELAAPVTHIWYFKGVPSRLGYLLDL
APKDLEKIIYFAAYVITSVDEEMRHNELSTLEAEMAVERKAVEDQRDGELEARAQKLEAD
LAELEAEGAKADARRKVRDGGEREMRQIRDRAQRELDRLEDIWSTFTKLAPKQLIVDENL
YRELVDRYGEYFTGAMGAESIQKLIENFDIDAEAESLRDVIRNGKGQKKLRALKRLKVVA
AFQQSGNSPMGMVLDAVPVIPPELRPMVQLDGGRFATSDLNDLYRRVINRNNRLKRLIDL
GAPEIIVNNEKRMLQESVDALFDNGRRGRPVTGPGNRPLKSLSDLLKGKQGRFRQNLLGK
RVDYSGRSVIVVGPQLKLHQCGLPKLMALELFKPFVMKRLVDLNHAQNIKSAKRMVERQR
PQVWDVLEEVIAEHPVLLNRAPTLHRLGIQAFEPMLVEGKAIQLHPLVCEAFNADFDGDQ
MAVHLPLSAEAQAEARILMLSSNNILSPASGRPLAMPRLDMVTGLYYLTTEVPGDTGEYQ
PASGDHPETGVYSSPAEAIMAADRGVLSVRAKIKVRLTQLRPPVEIEAELFGHSGWQPGD
AWMAETTLGRVMFNELLPLGYPFVNKQMHKKVQAAIINDLAERYPMIVVAQTVDKLKDAG
FYWATRSGVTVSMADVLVPPRKKEILDHYEERADKVEKQFQRGALNHDERNEALVEIWKE
ATDEVGQALREHYPDDNPIITIVDSGATGNFTQTRTLAGMKGLVTNPKGEFIPRPVKSSF
REGLTVLEYFINTHGARKGLADTALRTADSGYLTRRLVDVSQDVIVREHDCQTERGIVVE
LAERAPDGTLIRDPYIETSAYARTLGTDAVDEAGNVIVERGQDLGDPEIDALLAAGITQV
KVRSVLTCATSTGVCATCYGRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGV
GEDITGGLPRVQELFEARVPRGKAPIADVTGRVRLEDGERFYKITIVPDDGGEEVVYDKI
SKRQRLRVFKHEDGSERVLSDGDHVEVGQQLMEGSADPHEVLRVQGPREVQIHLVREVQE
VYRAQGVSIHDKHIEVIVRQMLRRVTIIDSGSTEFLPGSLIDRAEFEAENRRVVAEGGEP
AAGRPVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKENVIIGKLIPA
GTGINRYRNIAVQPTEEARAAAYTIPSYEDQYYSPDFGAATGAAVPLDDYGYSDYR
>MT0696 DNAdirected RNA polymerase, betaprime subunit (rpoC)
MLDVNFFDELRIGLATAEDIRQWSYGEVKKPETINYRTLKPEKDGLFCEKIFGPTRDWEC
YCGKYKRVRFKGIICERCGVEVTRAKVRRERMGHIELAAPVTHIWYFKGVPSRLGYLLDL
APKDLEKIIYFAAYVITSVDEEMRHNELSTLEAEMAVERKAVEDQRDGELEARAQKLEAD
LAELEAEGAKADARRKVRDGGEREMRQIRDRAQRELDRLEDIWSTFTKLAPKQLIVDENL
YRELVDRYGEYFTGAMGAESIQKLIENFDIDAEAESLRDVIRNGKGQKKLRALKRLKVVA
AFQQSGNSPMGMVLDAVPVIPPELRPMVQLDGGRFATSDLNDLYRRVINRNNRLKRLIDL
GAPEIIVNNEKRMLQESVDALFDNGRRGRPVTGPGNRPLKSLSDLLKGKQGRFRQNLLGK
RVDYSGRSVIVVGPQLKLHQCGLPKLMALELFKPFVMKRLVDLNHAQNIKSAKRMVERQR
PQVWDVLEEVIAEHPVLLNRAPTLHRLGIQAFEPMLVEGKAIQLHPLVCEAFNADFDGDQ
MAVHLPLSAEAQAEARILMLSSNNILSPASGRPLAMPRLDMVTGLYYLTTEVPEDTGEYQ
PASGDHPETGVYSSPAEAIMAADRGVLSVRAKIKVRLTQLRPPVEIEAELFGHSGWQPGD
AWMAETTLGRVMFNELLPLGYPFVNKQMHKKVQAAIINDLAERYPMIVVAQTVDKLKDAG
FYWATRSGVTVSMADVLVPPRKKEILDHYEERADKVEKQFQRGALNHDERNEALVEIWKE
ATDEVGQALREHYPDDNPIITIVDSGATGNFTQTRTLAGMKGLVTNPKGEFIPRPVKSSF
REGLTVLEYFINTHGARKGLADTALRTADSGYLTRRLVDVSQDVIVREHDCQTERGIVVE
LAERAPDGTLIRDPYIETSAYARTLGTDAVDEAGNVIVERGQDLGDPEIDALLAAGITQV
KVRSVLTCATSTGVCATCYGRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGV
GEDITGGLPRVQELFEARVPRGKAPIADVTGRVRLEDGERFYKITIVPDDGGEEVVYDKI
SKRQRLRVFKHEDGSERVLSDGDHVEVGQQLMEGSADPHEVLRVQGPREVQIHLVREVQE
VYRAQGVSIHDKHIEVIVRQMLRRVTIIDSGSTEFLPGSLIDRAEFEAENRRVVAEGGEP
AAGRPVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKENVIIGKLIPA
GTGINRYRNIAVQPTEEARAAAYTIPSYEDQYYSPDFGAATGAAVPLDDYGYSDYR
>ML1890, rpoC, [beta]' subunit of RNA polymerase 2269270:2273220 re
VLDVNFFDELRIGLATAEDIRQWSYGEVKKPETINYRTLKPEKDGLFCEKIFGPTRDWEC
YCGKYKRVRFKGIICERCGVEVTRAKVRRERMGHIELAAPVTHIWYFKGVPSRLGYLLDL
APKDLEKIIYFAAYVITTVDEEMRHNELSTLEAEMMVERKSVEDQRDADLEARAQKLEAD
LAALEAEGAKADARRKFRDGGEREMRQLRERAQRELDRLEDIWSTFTKLAPKQLIVDENL
YRELVDRYGEYFTGAMGAESIQKLMQDFDIEAEAESLREVIRNGKGQKKLRALKRLKVVA
AFQQSGNSPMGMVLDAVPVIPPELRPMVQLDGGRFATSDLNDLYRRVINRNNRLKRLIDL
GAPDIIVNNEKRMLQESVDALFDNGRRGRPVTGPGNRPLKSLSDLLKGKQGRFRQNLLGK
RVDYSGRSVIVVGPQLKLHQCGLPKLMALELFKPFVMKRLVDLNHAQNIKSAKRMVERQR
PQVWDVLEEVIAEHPVLLNRAPTLHRLGIQAFEPMLVEGKAIQLHPLVCEAFNADFDGDQ
MAVHLPLSAEAQAEARILMLSSNNILSPASGRPLAMPRLDMVTGLYYLTTAVDGDTGAYR
PAAEDRPESGVYSSPAEAIMAADRGVLSVRAKIKVQLTQVRPPADIEARWFGANGWRPGD
PWIADTTLGRVMFNELLPLGYPFVNKQMHKKVQAAIINDLAERYPMIVVAQTVDKLKDAG
FYWATRSGVTVSMADVLVPPRKKEILDHYEERADKVEKQFQRGALNHDERNEALVEIWKE
ATDEVGQALRDHYPVDNPIITIVDSGATGNFTQTRALAGMKGLVTNPKGEFIPRPVKSSF
REGLTVLEYFINTHGARKGLADTALRTADSGYLTRRLVDVSQDVIVREHDCQTERGIVVE
LAVRVPDGSLIRELYIETSAYARTLGANAVDEAGNVIVARGEDLGDPEIDALLAAGITQV
KVRSVLTCTTGTGVCATCYGRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGV
GEDITGGLPRVQELFEARVPRGKAPIADVTGRVRLEDGERFYKITIVPDDGGEEVVYDKL
SKRQRLRVFKHADGSERVLSDGDYVEVGQQLMEGSADPHEVLRVQGPREVQIHLVREVQE
VYRAQGVSIHDKHIEVIVRQMLRRVTIIDSGSTEFLPGSLIDRAEFESENRRVVAESGEP
AAGRPVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKENVIIGKLIPA
GTGINRYRNIQVQPTEEARASAYTIPSYEDQYYSPDFGQATGAAVPLDDYGYSDYR
>ORF02754 DNAdirected RNA polymerase, beta` subunit (rpoC)
MKDFNKVRIAIASPEKIREWSFGEVEKPETINYRTLKPEREGLFDERIFGPIKDYECACG
KYKRQRYEGKVCERCGVEVTSSKVRRYRMGHIDLATPAAHIWYVKDSPSKIGTLLDLSAG
QLEKVLYFSSFIVTKPFNAQKDGRPLKRGELLSDDEYRELRFGRQETYTIPNSVEDVEIR
DGEYVTRGQILGGNVVSKMDGLAQYRFPRRAVIAYSEGVEASLPLPADTLVEQETFRAGE
ILAELEQDVQITAPVAGTVFMHDLGEDSVMIELREGVEANDSDEEEAADPIRGEVLARVY
VPHGMNVQVAEGEVIEAGALLADASEGARLRVSRDSNLSGVTFPKKKGDVTVTAHWTRRV
EYPIDPTMHVLVGDGSEVTKGQRVIGAIDKEEEVIAEADGVITLHQPASILVSKAKVYAY
DDEPLVVNGDRVEPGDELADDGNLRSEISGRIELDLVRKQVRVIESYDFEAKMGAEAVKE
LLDELNLDELETELNEQMKDNSRHKRAKARKRLEVTRSFKASGNNPSWMILGTVPVMPPD
LRPMVQVDGGRFATSDLNDLYRRLINRNNRLKKLMSQGAPDMIIRNEKRMLQEAVDALID
NGRRGSPVTNPGSDRSLRSLTDLLGGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCG
VPKRMALELFKPFLFKVLEEKGEVTNIKQARKMLERYRDTRDSVWDALEEVIEDKVVLLN
RAPTLHRLGIQAFEPVLVEGQSIQLHPLVCEAFNADFDGDQMAIHVPLSAQAQAEARIQM
LASHNLLSPANGEPNVKPSRDIILGIFTLTQLRRDNLGAGTEYASEADALAAFDEGKLSL
NSPVMVNGVETSPGRLRYTFSNPDEALHAVEQGEIDHQDHVRIRLNGQVYETSAGRVQFR
RMVQEALGAQGGLIDTLVDLETTYEKDALKDMVMACFKHLGIEATAGLLDGLKEGGFKLS
TTSGITIGIDDIVLPPNKGELLAEADQMLAEIEQNFEFGFMTEEERYKQVVQLWNNTTDA
VKDAVFENFSKNYPFNPLWIMSQSGARGNPQQIRQLAGMRGLMARPDGSTIEVPIRASFR
EGLTVLEYFISTHGARKGGADTALRTADSGYLTRKLVDVAHEVVVRDVDCGSTDSTVMPL
GATDERTGEWRSRKGSEIETSIYGRTLTADVEFSDGRVIPEGEMLSMEDVKAIAKDAKHI
GEVFVRTPLNCRVKAGVCQKCYGYDLSQAKPVSMGEAVGVVAAESIGEPGTQLTMRTFHT
GGIAGGGDITMGLPRVIELFEARKPKTQAVVADRDGVIRIEEEEERYLVRIEADDEQYSS
KTATKVPRVLRMTVKDGERVEAGQPITRGAVNPHDLLMYKDTDAAQRYLVEEVQRVYRSQ
GVKVHDKHIEVIVRQMLRWVEVTDGGDTTLLEGQTVEHWEVDQANEALAEGQTPASWKPV
LLGITKSSLTTKSWLSAASFQHTTHVLTEASMRGQVDDLIGLKENVILGKLIPAGTGLLT
VREMQVADDRTLEKYGEGSTSSDAVTGGQRYDDTRPGSSINPGYGD
>D90913  +  D90905   Synechocystis sp. PCC6803 complete g
MKAQSEPRFDYVKIAIASPERIRQWGERTLPNGTVVGEVTKPETINYRTLKPEMDGLFCE
KIFGPSKDWECWCGKYKRVRHRGIVCERCGVEVTESRVRRHRMGYIKLAAPVTHVWYLKG
IPSYLSILLDMALRDVEQIVYFNAYVVLNPGNASNLQYKQLLTEDQWVEIEDQIYAEDSE
LEGIEVGIGAEAVQRLLAELQLEEVAEKLREEILASKGQKRAKLIKRLRVIDNFIATHSQ
AEWMTLDVIPVIPPDLRPMVQLDGGRFATSDLNDLYRRVINRNNRLARLQEILAPEIIVR
NEKRMLQEAVDALIDNGRRGRTVVGANNRALKSLSDIIEGKQGRFRQNLLGKRVDYSGRS
VIVVGPNLKIYQCGLPREMAIELFQPFVIHRLIKLGIVNNIKAAKKLILKGDPQIWSVLE
EVITGHPVMLNRAPTLHRLGIQAFEPILVEGRAIQLHPLVCPAFNADFDGDQMAVHVPLS
LEAQCEARLLMLACHNVLSPATGKPIVAPSQDMVLGCYYLTAENPNAQKGAGRYFAGIED
ALRAYDHGQVDLHSQIWIRHLDEDVVTEKPDTEVIKTEDLGDGTVMKYYRERKIREGVDG
EIITQYIQTTPGRIIYNKTIAEALVFXXXXXXMTFYNYTIDKGRLKKLIALAYRRYGSAR
CSQLADELKELGFRFATKAGVSISVDDLTIPPEKKQMLEAAEKEIRTTEERYARGEITEV
ERFQKVIDTWNGTSEELKDQVVVNFRKTDPLNSVYMMAFSGARGNMSQVRQLVGMRGLMA
DPQGEIIDLPIKTNFREGLTVTEYVISSYGARKGLVDTALRTADSGYLTRRLVDVSQDVI
VREQDCGTERSLRVTAMTDGDQVKISLADRLFGRLLAKDVVGPDGEIIAKRNDEIDEALA
NRIAAVTDEVYVRSPLTCEAARSVCQNCYGWSLAHGHKVDLGEAVGIIAAQSIGEPGTQL
TMRTFHTGGVFTGEVARQEKAPEDGTVKWGKGLSTRKVRTRHGEDAEQVEIAGDLIWKGE
GKKAATQTYSLTPGSLLFVQDGQTVTAGQLMTEISLSKTQRSTERATKDVAGDLAGEVLF
DRLVPEEKTDRQGNTTRIAQRGGLVWILSGEVYNLPPGAEPVVKNDEQVEVGSIMAETKL
VTNDGGVVRLVSNREIEIITASVLLDQAQVKLESSGGREQYVIYTADKQRFLLKAAPGTK
VQNHSIVAELIDDRYRTTTGGMIRYAGVEVAKGGRKQGYEVTKGGTLLWIPEETHEINKD
ISLLIVEDGQYVEAGTEVVKDIFCQSSGIVEVVQKNDILREIIIKPGDFYQDVDPGSVKI
ESGQLLQPGQDVFPGVTVSTLSQAEWIESPEGNGLLLRPVEEYKVFDEPAAPSQGSQNEE
GGRQIELRSVQRLFYKDGDRVKSVEGAPLLSTQLVLEIYGSGNEGISHLSADIELQDDEE
EDCQRLQLVILESLVLRRDQESDPLGGASKTRLLVQDGDQIPPGAVVARTEIQCKEAGTV
RGIKEGQESIRRVLLERAADRLVVDLPSAPEVKPGQLLVAGQELVPGVKLEESGKVLEIN
GKGDNYQLVLRRARPYRVSPGAVLHIEDGDLVQRGDNLVLLVFERAKTGDIVQGLPRIEE
LLEARKPKEACVLARAPGVCQVEYLEDESVDIKVVEDDGTVSEYPLLPGQNAMVTDGQRI
DVGHALTDGYNNPHEILDVFFSYYVDKDGCYQAALRGLQAAQKFLVNEVQTVYQSQGVDI
SDKHIEVIVRQMTAKVRIDDGGDTTMLPGELVELRQVEQVNEAMGITGSAPARYTPVLLG
ITKASLNTDSFISAASFQETTRVLTEAAIEGKSDWLRGLKENVIIGRLIPAGTGFSSHEE
VLGLIETQDDIQGYMIEPIELPTTKKKASATKVKTKKVEADDDLLDDTRARAYAGTQLSQ
DDEEFEETYDTDEDDFDMDDDDDFGDDED
>AE001273    AE001273 9912 Chlamydia trachomatis complete
MFREGSRDDAALVKEGLFDKLEIGIASDVTIRDKWSCGEIKKPETINYRTFKPEKGGLFC
EKIFGPTKDWECYCGKYKKIKHKGIVCDRCGVEVTLSKVRRERMAHIELAVPIVHIWFFK
TTPSRIGNVLGMTASDLERVIYYEEYVVIDPGNTDLVKKQLLNDAKYREVVEKWGKDAFV
AKMGGEAVYDLLKSEDLESLLGELKERLRKTKSQQARMKLAKRLKIVEGFVSSSNRPEWM
VLKNIPVVPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKAILRLKTPEVIVRNEKR
MLQEAVDALFDNGRHGHPVMGAGNRPLKSLSEMLKGKNGRFRQNLLGKRVDYSGRSVIIV
GPELKFNQCGLPKEMALELFEPFIIKRLKDQGSVYTIRSAKKMIQRGAPEVWDVLEEIIK
GHPVLLNRAPTLHRLGIQAFEPVLIEGKAIRVHPLVCAAFNADFDGDQMAVHVPLSIEAQ
LEAKVLMMAPDNIFLPSSGKPVATPSKDMTLGIYYLMADPTYFPEEHGGKTKAFKDEVEV
LRALNAGGFILKDEICGSRRDETGRGIHIHEKIKVRIDGQIIETTPGRVFFNTIVPKELG
FQNYSMPSKRISELILQCYKKVGLEATVRFLDDLKELGFVQSTKAAISMGLKDVKIPEIK
KEILKDAYDKVAVVKKQYEDGIITDGERHSKTISIWTEVSDLLSNALYSEIKKQTNSKHN
PLFLMIDSGARGNKSQLKQLGALRGLMAKPNGAIIESPITSNFREGLTVLEYSISSHGAR
KGLADTALKTADSGYLTRRLVDVAQDVIITERDCGTLNHIEVSTIRQGSEELLPLKDRVY
GRTVSENIYQPGDKSNVLAYAGDVLTSAQAEAIDDAGIESVKIRSTLTCESRRGVCAKCY
GLNLANGRLIGLGEAVGIIAAQSIGEPGTQLTMRTFHLGGIAATSSTPEIVAECDGILVY
LDLRVVVDQEGNNLVLNKMGALHLVQDEGRSLSEYKKLLSTKSIESLATFPVELGAKILV
NDGAAVAAGQRIAEVELHNIPIICDKPGFVHYEDLVEGVSTEKVTNKNTGLVELIVKQHR
GELHPQIAIYADANMKELVGTYAIPSGAIISVEEGQRIAPGMLLARLPRGAIKTKDITGG
LPRVAELVEARKPEDAADIAKIDGVVDFKGIQKNKRILVVRDEITGMEEEHLISLTKHLI
VQRGDSVIKGQQLTDGLVVPHEILEICGVRELQKYLVNEVQEVYRLQGVDINDKHVEIIV
RQMLQKVRITDPGDTTLLFGEDVDKKEFYEENRRTEEDGGKPAQAVPVLLGITKASLGTE
SFISAASFQDTTRVLTDAACSSKTDYLLGFKENVIMGHMIPGGTGFDTHKRIKQHLEKEQ
EDLVFDFDSEFESVAG
>TC0588 DNAdirected RNA polymerase, beta` subunit (rpoC)
MFKEGSRDDAALAKEGLFDKLEIGIASDVTIRDKWSCGEIKKPETINYRTFKPEKGGLFC
EKIFGPTKDWECYCGKYKKIKHKGIVCDRCGVEVTLSKVRRERMAHIELAVPIVHIWFFK
TTPSRIGNVLGMTASDLERVIYYEEYVVIDPGNTDLVKKQLLNDAKYREVVEKWGKDAFV
AKMGGEAVYDLLKSEDLESLLGELKDRLRKTKSQQARMKLAKRLKIVEGFVSSSNRPEWM
VLKNIPVVPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKAILRLKTPEVIVRNEKR
MLQEAVDALFDNGRHGHPVMGAGNRPLKSLSEMLKGKNGRFRQNLLGKRVDYSGRSVIIV
GPELKFNQCGLPKEMALELFEPFIIKRLKDQGSVYTIRSAKKMIQRGAPEVWDVLEEIIK
GHPVLLNRAPTLHRLGIQAFEPVLIEGKAIRVHPLVCAAFNADFDGDQMAVHVPLSIEAQ
LEAKVLMMAPDNIFLPSSGKPVATPSKDMTLGIYYLMADPTYFPEEHGGKTKVFKDEVEV
LRALNAGGFILKDEICGSRRDETGRGIHIHEAIKVRIDGQIIETTPGRVFFNTIVPKELG
FQNYSMPSKRISELILQCYKKVGLEATVRFLDDLKELGFVQSTKAAISMGLKDVRIPEIK
KEILKDAYDKVAVVKKQYEDGIITDGERHSKTISIWTEVSDLLSNALYAEIKKQTNSKHN
PLFLMIDSGARGNKSQLKQLGALRGLMAKPNGAIIESPITSNFREGLTVLEYSISSHGAR
KGLADTALKTADSGYLTRRLVDVAQDVIITEKDCGTLNHIEVSTIRQGSEELLPLKDRIY
GRTVSENVYQPGDKSNVLAYAGDVLTSSQAEAIDDAGIDSVKIRSTLTCESRRGVCAKCY
GLNLANGRLIGLGEAVGIIAAQSIGEPGTQLTMRTFHLGGIAATSSTPEIVAECDGILVY
LDLRFVVDQEGNNLVLNKMGALHLVRDEGRSLSEYKKLLSTKSIESLATFPVELGAKILV
DDGAAVTAGQRIAEVELHNIPIICDKPGFVHYEDLVEGVSTEKVTNKNTGLVELIVKQHR
GELHPQIAIYADANMQELVGTYAIPSGAIISVEEGQRIAPGMLLARLPRGAIKTKDITGG
LPRVAELVEARKPEDAADIAKIDGVVDFKGIQKNKRILVVRDEVTGMEEEHLISLTKHLI
VQRGDSVIKGQQLTDGLVVPHEILEICGVRELQKYLVNEVQEVYRLQGVDINDKHIEIIV
RQMLQKVRITDPGDTTLLFGEDVDKKEFYEENRRTEEDGGKPAQAVPVLLGITKASLGTE
SFISAASFQDTTRVLTDAACSSKTDYLLGFKENVIMGHMIPGGTGFDTHKRIKQHLEKEQ
EDLVFDFDSEFESVAG
>CP0693 DNAdirected RNA polymerase, beta` subunit (rpoC)
MEKIMFGENSRDIGVLSKEGLFDKLEIGIASDITIRDKWSCGEIKKPETINYRTFKPEKG
GLFCEKIFGPTKDWECCCGKYKKIKHKGIVCDRCGVEVTLSKVRRERMAHIELAVPIVHI
WFFKTTPSRIGNVLGMTASDLERVIYYEEYVVIDPGKTDLTKKQLLNDAQYREVVEKWGK
DAFVAKMGGEAIYDLLKSEDLQSLLKDLKERLRKTKSQQARMKLAKRLKIIEGFVSSSNH
PEWMVLKNIPVVPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKAILRLKTPEVIVR
NEKRMLQEAVDALFDNGRHGHPVMGAGNRPLKSLSEMLKGKNGRFRQNLLGKRVDYSGRS
VIIVGPELKFNQCGLPKEMALELFEPFIIKRLKDQGSVYTIRSAKKMIQRGAPEVWDVLE
EIIKGHPVLLNRAPTLHRLGIQAFEPVLIEGKAIRIHPLVCAAFNADFDGDQMAVHVPLS
VEAQLEAKVLMMAPDNIFLPSSGKPVAIPSKDMTLGLYYLMADPTYFPEEHGGKTKIFKD
EIEVLRALNNGGFIDDVFGDRRDETGRGIHIHEKIKVRIDGQIIETTPGRVLFNRIVPKE
LGFQNYSMPSKRISELILQCYKKVGLEATVRFLDDLKDLGFIQATKAAISMGLKDVRIPD
IKSHILKDAYDKVAIVKKQYDDGIITEGERHSKTISIWTEVSEQLSDALYVEISKQTRSK
HNPLFLMIDSGARGNKSQLKQLGALRGLMAKPNGAIIESPITSNFREGLTVLEYSISSHG
ARKGLADTALKTADSGYLTRRLVDVAQDVIITEKDCGTLNHIEISAIGQGSEELLPLKDR
IYGRTVAEDVYQPGDKSRLLAQSGDVLNSVQAEAIDDAGIETIKIRSTLTCESPRGVCAK
CYGLNLANGRLIGMGEAVGIIAAQSIGEPGTQLTMRTFHLGGIAATSSTPEIITNSDGIL
VYMDLRVVLGQEGHNLVLNKKGALHVVGDEGRTLNEYKKLLSTKSIESLEVFPVELGVKI
LVADGTPVSQGQRIAEVELHNIPIICDKPGFIKYEDLVEGISTEKVVNKNTGLVELIVKQ
HRGELHPQIAIYDDADLSELVGTYAIPSGAIISVEEGQRVDPGMLLARLPRGAIKTKDIT
GGLPRVAELVEARKPEDAADIAKIDGVVDFKGIQKNKRILVVCDEMTGMEEEHLIPLTKH
LIVQRGDSVIKGQQLTDGLVVPHEILEICGVRELQKYLVNEVQEVYRLQGVDINDKHIEI
IVRQMLQKVRITDPGDTTLLFGEDVNKKEFYEENRRTEEDGGKPAQAVPVLLGITKASLG
TESFISAASFQDTTRVLTDAACCSKTDYLLGFKENVIMGHMIPGGTGFETHKRIKQYLEK
EQEDLVFDFVSETECVC
>AE001363    AE001363 9903 Chlamydia pneumoniae complete g
MFGENSRDIGVLSKEGLFDKLEIGIASDITIRDKWSCGEIKKPETINYRTFKPEKGGLFC
EKIFGPTKDWECCCGKYKKIKHKGIVCDRCGVEVTLSKVRRERMAHIELAVPIVHIWFFK
TTPSRIGNVLGMTASDLERVIYYEEYVVIDPGKTDLTKKQLLNDAQYREVVEKWGKDAFV
AKMGGEAIYDLLKSEDLQSLLKDLKERLRKTKSQQARMKLAKRLKIIEGFVSSSNHPEWM
VLKNIPVVPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKAILRLKTPEVIVRNEKR
MLQEAVDALFDNGRHGHPVMGAGNRPLKSLSEMLKGKNGRFRQNLLGKRVDYSGRSVIIV
GPELKFNQCGLPKEMALELFEPFIIKRLKDQGSVYTIRSAKKMIQRGAPEVWDVLEEIIK
GHPVLLNRAPTLHRLGIQAFEPVLIEGKAIRIHPLVCAAFNADFDGDQMAVHVPLSVEAQ
LEAKVLMMAPDNIFLPSSGKPVAIPSKDMTLGLYYLMADPTYFPEEHGGKTKIFKDEIEV
LRALNNGGFIDDVFGDRRDETGRGIHIHEKIKVRIDGQIIETTPGRVLFNRIVPKELGFQ
NYSMPSKRISELILQCYKKVGLEATVRFLDDLKDLGFIQATKAAISMGLKDVRIPDIKSH
ILKDAYDKVAIVKKQYDDGIITEGERHSKTISIWTEVSEQLSDALYVEISKQTRSKHNPL
FLMIDSGARGNKSQLKQLGALRGLMAKPNGAIIESPITSNFREGLTVLEYSISSHGARKG
LADTALKTADSGYLTRRLVDVAQDVIITEKDCGTLNHIEISAIGQGSEELLPLKDRIYGR
TVAEDVYQPGDKSRLLAQSGDVLNSVQAEAIDDAGIETIKIRSTLTCESPRGVCAKCYGL
NLANGRLIGMGEAVGIIAAQSIGEPGTQLTMRTFHLGGIAATSSTPEIITNSDGILVYMD
LRVVLGQEGHNLVLNKKGALHVVGDEGRTLNEYKKLLSTKSIESLEVFPVELGVKILVAD
GTPVSQGQRIGEVELHNIPIICDKPGFIKYEDLVEGISTEKVVNKNTGLVELIVKQHRGE
LHPQIAIYDDADLSELVGTYAIPSGAIISVEEGQRVDPGMLLARLPRGAIKTKDITGGLP
RVAELVEARKPEDAADIAKIDGVVDFKGIQKNKRILVVCDEMTGMEEEHLIPLTKHLIVQ
RGDSVIKGQQLTDGLVVPHEILEICGVRELQKYLVNEVQEVYRLQGVDINDKHIEIIVRQ
MLQKVRITDPGDTTLLFGEDVNKKEFYEENRRTEEDGGKPAQAVPVLLGITKASLGTESF
ISAASFQDTTRVLTDAACCSKTDYLLGFKENVIMGHMIPGGTGFETHKRIKQYLEKEQED
LVFDFVSETECVC
>CJ11168X2   AL139075 0007 Campylobacter jejuni NCTC11168
MSKFKVIEIKEDARPRDFEAFQLRLASPEKIKSWSYGEVKKPETINYRTLKPERDGLFCA
KIFGPIRDYECLCGKYKKMRFKGVKCEKCGVEVANSKVRRSRMGHIELVTPVAHIWYVNS
LPSRIGTLLGVKMKDLERVLYYEAYIVENPGDAFYDNESTKKVEYCDVLNEEQYQNLMQR
YENSGFKARMGGEVVRDLLANLDLVALLNQLKEEMGATNSEAKKKTIIKRLKVVENFLNS
NLNANTDSDEAVPNRPEWMMITNLPVLPPDLRPLVALDGGKFAVSDVNDLYRRVINRNTR
LKKLMELDAPEIIIRNEKRMLQEAVDALFDNGRRANAVKGANKRPLKSLSEIIKGKQGRF
RQNLLGKRVDFSGRSVIVVGPKLRMDQCGLPKKMALELFKPHLLAKLEEKGYATTVKQAK
KMIENKTNEVWECLEEVVKGHPVMLNRAPTLHKLSIQAFHPVLVEGKAIQLHPLVCAAFN
ADFDGDQMAVHVPLSQEAIAECKVLMLSSMNILLPASGKSVTVPSQDMVLGIYYLSLEKA
GAKGSHKICTGIDEVMMALESKCLDIHASIQTMVDGRKITTTAGRLIVKSILPDFVPENS
WNKVLKKKDIAALVDYVYKQGGLEITASFLDRLKNLGFEYATKAGISISIADIIVPNDKQ
KAIDEAKKQVREIQNSYNLGLITSGERYNKIIDIWKSTNNVLSKEMMKLVEKDKEGFNSI
YMMADSGARGSAAQISQLAAMRGLMTKPDGSIIETPIISNFREGLNVLEYFISTHGARKG
LADTALKTANAGYLTRKLIDVAQNVKITIEDCGTHEGVEINEITADSSIIETLEERILGR
VLAEDVIDPITNSVLFAEGTLMDEEKAKILGESGIKSVNIRTPITCKAKKGICAKCYGIN
LGEGKLVKPGEAVGIISAQSIGEPGTQLTLRTFHSGGTASTDLQDRQVSAQKEGFIRFYN
LKTYKNKEGKNIVANRRNAAVLLVEPKIKTPFKGVINIENIHEDVIVSIKDKKQEVKYIL
RKYDLAKPNELAGVSGSIDGKLYLPYQSGMQVEENESIVEVIKEGWNVPNRIPFASEILV
EDGEPVVQNIKAGEKGTLKFYILKGDGLDRVKNVKKGDIVKEKGFFVVIADENDREAKRH
YIPRESKIEFNDSEKIDDANTIIASAPKKERKVIAEWDAYNNTIIAEIDGVVSFEDIEAG
YSADEQIDEATGKRSLVINEYLPSGVRPTLVIAGKGDKAVRYHLEPKTVIFVHDGDKIAQ
ADILAKTPKAAAKSKDITGGLPRVSELFEARKPKNAAVIAEIDGVVRFDKPLRSKERIII
QAEDGTSAEYLIDKSKHIQVRDGEFIHAGEKLTDGVVSSHDVLKILGEKALHYYLISEIQ
QVYRGQGVVISDKHIEVIVSQMLRQVKVVDSGHTKFIEGDLVSRRKFREENERIIRMGGE
PAIAEPVLLGVTRAAIGSDSVISAASFQETTKVLTEASIAGKFDYLEDLKENVILGRMIP
VGTGLYGEQNLKLKEQE
>AE001540    AE001540 9901 Helicobacter pylori, strain J99
MSKKIPLKNRLRADFTKTPTDLEVPNLLLLQRDSYDSFLYSKDGKESGIEKVFKSIFPIQ
DEHNRITLEYAGCEFGKSKYTVREAMERGITYSIPLKIKVRLILWEKDTKSGEKNGIKDI
KEQSIFIREIPLMTERTSFIINGVERVVVNQLYRSPGVIFKEEESSTSSNKLIYTGQIIP
DRGSWLYFEYDSKDVLYARINKRRKVPVTILFRAMDYQKQDIIKMFYPLVKVRYENDKYL
IPFASLDANQRMEFDLKDPQGKIILLAGKKLTSRKIKELKENHLEWVEYPMDILLNRHLA
EPVMVGKEVLLDMLTQLDKNRLEKIHDLGVQEFVIINDLALGHDASIIHSFSADHESLKL
LKQTEKIDDENALAAIRIHKVMKPGDPVTTEVAKQFVKKLFFDPERYDLTMVGRMKMNHK
LGLHVPDYITTLTHEDIITTVKYLMKIKNNQGKIDDRDHLGNRRIRAVGELLANELHSGL
VKMQKTIKDKLTTMSGAFDSLMPHDLVNSKMITSTIMEFFMGGQLSQFMDQTNPLSEVTH
KRRLSALGEGGLVKDRVGFEARDVHPTYYGQFGPIETPEGQNIGLINTLSTFTRVNDLGF
IEAPYKKVVDGKVAGETIYLTAIQEDSHIIAPASTPIDEEGNILGDLIETRVEGEIVLNE
KSKVTLMDLSSSMLVGVAASLIPFLEHDDANRALMGTNMQRQAVPLLRSDAPIVGTGIEK
IIARDSWGAIKANRAGVVEKIDSKNIYILGEGKEEAYIDAYSLQKNLRTNQNTSFNQVPI
VKVGDKVEAGQIIADGPSMDRGELALGKNVRVAFMPWNGYNFEDAIVVSERITKDDVFTS
THIYEKEVDARELKHGVEEFTADIPDVKEEALAHLDESGIVKVGTYVSAGMILVGKTSPK
GEIKSTPEERLLRAIFGDKAGHVVNKSLYCPPSLEGTVIDVKVFTKKGYEKDARVLSAYE
EEKAKLDMEHFDRLTMLNREELLRVSSLLSQAILEEPFSHNGKDYKEGDQIPKEEIASIN
RFTLASLVKKYSKEVQNHYEITKNNFLEQKKVLGEEHEEKLSILEKDDILPNGVIKKVKL
YIATKRKLKVGDKMAGRHGNKGIVSNIVPVADMPYTADGEPVDIVLNPLGVPSRMNIGQI
LEMHLGLVGKEFGKQIASMLEDKTKDFAKELRAKMLEIANAINEKDPLTIHVLENCSDEE
LLEYAKDWSKGVKMAIPVFEGISQEKFYKLFELAKIAMDGKMDLYDGRTGEKMRERVNVG
YMYMIKLHHLVDEKVHARSTGPYSLVTHQPVGGKALFGGQRFGEMEVWALEAYGAAHTLK
EMLTIKSDDIRGRENAYRAIAKGEQVGESEIPETFYVLTKELQSLALDINIFGDDVDEDG
APRPIMIKEDDRPKDFSSFQLTLASPEKIHSWSYGEVKKPETINYRTLKPERDGLFCMKI
FGPTKDYECLCGKYKKPRFKDIGTCEKCGVAITHSKVRRFRMGHIELATPVAHIWYVNSL
PSRIGTLLGVKMKDLERVLYYEAYIVKEPGEAAYDNEGTKLVMKYDILNEEQYQNISRRY
EDRGFVAQMGGEAIKDLLEEIDLITLLQSLKEEVKDTNSDAKKKKLIKRLKVVESFLNSG
NRPEWMMLTVLPVLPPDLRPLVALDGGKFAVSDVNELYRRVINRNQRLKRLMELGAPEII
VRNEKRMLQEAVDVLFDNGRSTNAVKGANKRPLKSLSEIIKGKQGRFRQNLLGKRVDFSG
RSVIVVGPNLKMDECGLPKNMALELFKPHLLSKLEERGYATTLKQAKRMIEQKSNEVWEC
LQEITEGYPVLLNRAPTLHKQSIQAFHPKLIDGKAIQLHPLVCSAFNADFDGDQMAVHVP
LSQEAIAECKVLMLSSMNILLPASGKAVAIPSQDMVLGLYYLSLEKSGVKGEHKLFSSVN
EIITAIDTKELDIHAKIRVLDQGNIIATSAGRMIIKSILPDFIPTDLWNRPMKKKDIGVL
VDYVHKVGGIGITATFLDHLKTLGFRYATKAGISISMEDIITPKDKQKMVEKAKVEVKKI
QQQYDQGLLTDQERYNKIIDTWTEVNDRMSKEMMSAIAKDKEGFNSIYMMADSGARGSAA
QIRQLSAMRGLMTKPDGSIIETPIISNFKEGLNVLEYFNSTHGARKGLADTALKTANAGY
LTRKLIDVSQNVKVVSDDCGTHEGIEITDIAVGSELIEPLEERIFGRVLLEDVIDPITNE
ILLYADTLIDEEGAKKVVEAGIKSITIRTPVTCKAPKGVCAKCYGLNLGEGKMSYPGEAV
GVVAAQSIGEPGTQLTLRTFHVGGTASRSQDEREIVASKEGFVRFYNLRTYTNKEGKNII
ANRRNASILVVEPKIKAPFDGELRIETVYEEVVVSVKNGDQEAKFVLRRSDIVKPSELAG
VGGKIEGKVYLPYASGHKVHKGGSIADIIQEGWNVPNRIPYASELLVKDNDPIAQDVYAK
EKGVIKYYVLEANHLERTHGVKKGDIVSEKGLFAVVADDNGREAARHYIARGSEILIDDN
SEVSANSVISKPTTNTFKTIATWDPYNTPIIADFKGKVNFVDVIAGVTVAEKEDENTGIT
SLVVNDYIPSGYKPSLFLEGANGEEMRYFLEPKTSIAISDGSSVEQAEVLAKIPKATVKS
RDITGGLPRVSELFEARKPKPKDVAILSEVDGIVSFGKPIRNKEHIIVTSKDGRLTDYFV
DKGKQILVHADEFVHAGEAMTDGVVSSHDILRISGEKELYKYIVSEVQQVYRRQGVSIAD
KHIEIIVSQMLRQVRILDSGDSKFIEGDLVSKKLFKEENTRVIALKGEPAIAEPVLLGIT
RAAIGSDSIISAASFQETTKVLTEASIAMKKDFLEDLKENVVLGRMIPVGTGMYKNKKIV
LRALEDNSKF
>HP1198 DNAdirected RNA polymerase, beta subunit (rpoB)
MSKKIPLKNRLRADFTKTPTDLEVPNLLLLQRDSYDSFLYSKEGKESGIEKVFKSIFPIQ
DEHNRITLEYAGCEFGKSKYTVREAMERGITYSIPLKIKVRLILWEKDTKSGEKNGIKDI
KEQSIFIREIPLMTERTSFIINGVERVVVNQLHRSPGVIFKEEESSTSLNKLIYTGQIIP
DRGSWLYFEYDSKDVLYARINKRRKVPVTILFRAMDYQKQDIIKMFYPLVKVRYENDKYL
IPFASLDANQRMEFDLKDPQGKVILLAGKKLTSRKIKELKENHLEWVEYPMDILLNRHLA
EPVMVGKEVLLDMLTQLDKNKLEKIHDLGVQEFVIINDLALGHDASIIQSFSADSESLKL
LKQTEKIDDENALAAIRIHKVMKPGDPVTTEVAKQFVKKLFFDPERYDLTMVGRMKMNHK
LGLHVPDYITTLTHEDIITTVKYLMKIKNNQGKIDDRDHLGNRRIRAVGELLANELHSGL
VKMQKTIKDKLTTMSGAFDSLMPHDLVNSKMITSTIMEFFMGGQLSQFMDQTNPLSEVTH
KRRLSALGEGGLVKDRVGFEARDVHPTHYGRICPIETPEGQNIGLINTLSTFTRVNDLGF
IEAPYKKVVDGKVVGETIYLTAIQEDSHIIAPASTPIDEEGNILGDLIETRVEGEIVLNE
KSKVTLMDLSSSMLVGVAASLIPFLEHDDANRALMGTNMQRQAVPLLRSDAPIVGTGIEK
IIARDSWGAIKANRAGVVEKIDSKNIYILGESKEEAYIDAYSLQKNLRTNQNTSFNQVPI
VKVGDKVGAGQIIADGPSMDRGELALGKNVRVAFMPWNGYNFEDAIVVSECITKDDIFTS
THIYEKEVDARELKHGVEEFTADIPDVKEEALAHLDESGIVKVGTYVSAGMILVGKTSPK
GEIKSTPEERLLRAIFGDKAGHVVNKSLYCPPSLEGTVIDVKVFTKKGYEKDARVLSAYE
EEKAKLDMEHFDRLTMLNREELLRVSSLLSQAILEEPFSHNGKDYKEGDQIPKEEIASIN
RFTLASLVKKYSKEVQNHYEITKNNFLEQKKVLGEEHEEKLSILEKDDILPNGVIKKVKL
YIATKRKLKVGDKMAGRHGNKGIVSNIVPVADMPYTADGEPVDIVLNPLGVPSRMNIGQI
LEMHLGLVGKEFGKQIARMLEDKTKDFAKELRAKMLEIANAINEKDPLTIHALENCSDEE
LLEYAKDWSKGVKMAIPVFEGISQEKFYKLFELAKIAMDGKMDLYDGRTGEKMRERVNVG
YMYMIKLHHLVDEKVHARSTGPYSLVTHQPVGGKALFGGQRFGEMEVWALEAYGAAHTLK
EMLTIKSDDIRGRENAYRAIAKGEQVGESEIPETFYVLTKELQSLALDINIFGDDVDEDG
APKPIVIKEDDRPKDFSSFQLTLASPEKIHSWSYGEVKKPETINYRTLKPERDGLFCMKI
FGPTKDYECLCGKYKKPRFKDIGTCEKCGVAITHSKVRRFRMGHIELATPVAHIWYVNSL
PSRIGTLLGVKMKDLERVLYYEAYIVKEPGEAAYDNEGTKLVMKYDILNEEQYQNISRRY
EDRGFVAQMGGEAIKDLLEEIDLITLLQSLKEEVKDTNSDAKKKKLIKRLKVVESFLNSG
NRPEWMMLTVLPVLPPDLRPLVALDGGKFAVSDVNELYRRVINRNQRLKRLMELGAPEII
VRNEKRMLQEAVDVLFDNGRSTNAVKGANKRPLKSLSEIIKGKQGRFRQNLLGKRVDFSG
RSVIVVGPNLKMDECGLPKNMALELFKPHLLSKLEERGYATTLKQAKRMIEQKSNEVWEC
LQEITEGYPVLLNRAPTLHKQSIQAFHPKLIDGKAIQLHPLVCSAFNADFDGDQMAVHVP
LSQEAIAECKVLMLSSMNILLPASGKAVAIPSQDMVLGLYYLSLEKSGVKGEHKLFSSVN
EIITAIDTKELDIHAKIRVLDQGNIIATSAGRMIIKSILPDFIPTDLWNRPMKKKDIGVL
VDYVHKVGGIGITATFLDNLKTLGFRYATKAGISISMEDIITPKDKQKMVEKAKVEVKKI
QQQYDQGLLTDQERYNKIIDTWTEVNDKMSKEMMTAIAQDKEGFNSIYMMADSGARGSAA
QIRQLSAMRGLMTKPDGSIIETPIISNFKEGLNVLEYFNSTHGARKGLADTALKTANAGY
LTRKLIDVSQNVKVVSDDCGTHEGIEITDIAVGSELIEPLEERIFGRVLLEDVIDPITNE
ILLYADTLIDEEGAKKVVEAGIKSITIRTPVTCKAPKGVCAKCYGLNLGEGKMSYPGEAV
GVVAAQSIGEPGTQLTLRTFHVGGTASRSQDEREIVASKEGFVRFYNLRTYTNKEGKNII
ANRRNASILVVEPKIKAPFDGELRIETVYEEVVVSVKNGDQEAKFVLRRSDIVKPSELAG
VGGKIEGKVYLPYASGHKVHKGGSIADIIQEGWNVPNRIPYASELLVKDNDPIAQDVYAK
EKGVIKYYVLEANHLERTHGIKKGDMVSEKGLFAVIADDNGREAARHYIARGSEILIDDN
SEVSTNSVISKPTTNTFKTIATWDPYNTPIIADFKGKVGFVDVIAGVTVAEKEDENTGIT
SLVVNDYIPSGYKPSLFLEGANGEEMRYFLEPKTSIAISDGSSVEQAEVLAKIPKATVKS
RDITGGLPRVSELFEARKPKPKDVAILSEVDGIVSFGKPIRNKEHIIVTSKDGRSMDYFV
DKGKQILVHADEFVHAGEAMTDGVISSHDILRISGEKELYKYIVSEVQQVYRRQGVSIAD
KHIEIIVSQMLRQVRILDSGDSKFIEGDLVSKKLFKEENARVIALKGEPAIAEPVLLGIT
RAAIGSDSIISAASFQETTKVLTEASIAMKKDFLEDLKENVVLGRMIPVGTGMYKNKKIV
LRALEDNSKF
>AE000764    AE000764 9803 Aquifex aeolicus section 96 of
MSEARRGIFPFSKIKLMLASPEDIRSWSHGEVKRPETLNYRTLKPEKDGLFCAKIFGPIK
DYECLCGKYRGKRYEGKICEKCGVEVTTSYVRRQRFGHIELAAPVVHIWFLKSTPSKIGT
LLNLTSRDVERVAYFESYLVIEYPNEEEEEKFEKDEHTIPLNDGISTKWVKLHVVNEEEF
EEKYAFTIDEKYEHGMGAEILKEVLSKLDLDAYSRKLKEIVKPYSIGFEDLGKEIEQKYK
NLYQKLIKVIADDFRAYGVEIKGLEDHGLSLEQAIHRILNEELYLNVETGEISLEDCGDS
CLTGRDALKEYYERVREHKKDIPIFEKIKEDIRSTVLREISEARIRKALRTLQLVEGFKK
SGNRPEWMILEVLPVLPPELRPLVALDGGRFATSDLNDFYRRVINRNNRLKRLIELNAPD
IIIRNEKRMLQEAVDALIDNGKRGNPVKQNGRPLKSLADYLKGKQGRFRQNLLGKRVDYS
GRSVIVVGPELQMHQCGLPKIMALELFKPFVYRRLEEKGYATSIKHAKRLVEQKTPEVWE
CLEEVVKEHPVLLNRAPTLHRPSIQAFEPVLVEGKAIQLHPLVCPPFNADFDGDQMAVHV
PLGIEAQLESYILMLSTQNVLSPAHGKPLTMPSQDMVLGTYYITHDPIPGRKGEGKAFGT
FEEVLKALELGHVDIHAKIKVKVGNEWIETTPGRVLFNSIMPEGQPFVNKTLDKKGLSKL
ITELYIRVGNEETVKFLDRVKELGFLRSTLAGISIGVEDLQVPKAKKKIIEEALKKTEEI
WNQYVQGIITNKERYNRIIDVWSEATNLVSKAMFEEIEKSKRIENGKEYPGTFNPIYMMA
ISGARGNRDQIRQLAGMRGLMAKHSGEFIETPIISNFREGLSVLEYFISTYGARKGLADT
ALKTAFAGYLTRRLVDVAQDITITERDCGTVKGFEMEPIVEAGEERVPLKDRIFGRVLAE
DVKDPYTGEIIARRNEVIDEKLAEKITKAGIEKVRVRSPLTCEAKHGVCAMCYGWDLSQR
KIVSVGEAVGIIAAQSIGEPGTQLTMRTFHIGGAATAQKVQSFVKAESDGKVKFYNVKLI
VNRKGEKINISKDAAIGIVDEEGRLLERHTIPYGARILVEEGQEVKAETKLADWDPFNTY
IIAEVGGKVELRDIILDVTVREERDPITGKTASVISFMRPRDAMLHTPRIAVITEDGKEY
IYDLPVNAILNIPPEKISLEWRVCPTCSESEETTIQHQYYVVKDLEVQPGDILARIPKET
AKVRDIVGGLPRVEELFEARKPKNPAILSEIDGYVKIYEDADEVIIFNPRTGETAKYSIK
KDELILVRHGQFVKKGQKITETKVAEIDGQVRIKGRGFKVIVYNPETGLQREYFVPKGKF
LLVKEGDFVKAGDQLTDGTPVPEEILRIKGIEELEKFLLKEVQMVYKLQGVDINDKHFEI
IIKQMLKKVRIIDPGDSRFLVGEEVDKEELEEEIQRIKLEGGKLPKAEPVLVGITRAALS
TRSWISAASFQETTRVLTDASVEGKIDELRGLKENVIIGNIIPAGTGVDEYREVDVIPAE
EKVLEEKKEPKEGS
>BB0388 DNAdirected RNA polymerase (rpoC) {Escherichia coli}
MKEIKDFERIKIKIASPDQIRNWSYGEVKKSETINYRTLRPEKDGLFCERIFGTTKEWEC
YCGKFKSVRYKGIICDRCNVEVTHFKVRRERMGHIELAAPVAHIWYYKYIPSRIGLLLDI
TASSLNSILYYEKYVVIEPGDTDLKKMQLLNEDEYIEARERYGMSFNASMGAEAIKTLLE
NLDLDELSSKLRIQMIDKDDKTDKKLLRRLEIIENFKISGNKPEWMIMEVLPVIPPEIRP
MVQLDGGRFATSDLNDLYRRVINRNNRLRKLLLLNAPEIIVRNEKRMLQESVDSLFDNSH
KRKVVKGSSSRPLKSLSDALKGKQGRFRQNLLGKRVDYSGRSVIVVGPELKLHQCGLPAK
MALELFKPFVIRRLIESEAVFNIKRAKNLIEQEVDEVWQILDLVIKEHPILLNRAPTLHR
LGIQAFEPVLVEGKAIKLHPLVCHAYNADFDGDQMAVHVPLTPAAQAESWALMLSTNNLL
NPANGHPIVFPSQDIVLGLYYLTMEKKNVVGEGKKFLNFNNVILAINNRSLDYNASIYVK
IHGEYKKTTAGRVIFNEALPKGIEFVNKTLSDLELQILISKVYVVHGSSIVIEMLDIIKE
LGFRYATKFGCTISMSDIIVPDEKRTYVERANKEIAKIQNDYAKGVITGEERYNNVVSVW
LKTNEELTNKMMEILKKDRDGFNVIYMMADSGARGSRNQIRQLAGMRGLMAKTSGDIIEL
PIISNFKEGLSVIEFFISTNGARKGLADTALKTADAGYLTRRLVDIAQDVVVRIEDCGTI
NGIKVETVKNGEEILESLKEKAVGSYSIERIKNPITGEIVLDANEEISEAKIELLEKIGI
EKLVIRSVLTCEAEHGVCQKCYGRDFSKNKPVNIGEAVGIIAAQSIGQPGTQLTMRTFHI
GGVAQAGSEDDKISLKNAFILNGIEGFNVRVDNGILFTRKGTLKIINVFYEEKIKNIKEI
KVLDSQRVIKGIPLFIDKKGSEILSSYIGYVKLRDDNFFIVSEEQEVSLKAGTKLEIEVG
DYVESGKVIGTFDPFAEPIIAEVKGKIKFKDIILGTTLKEEINTETGNVEKRITDNVFES
LDPRIFIIDSSGMEVASYVLPGDAYLQVEDGQSINIGDIIAKLSKGSEKTQDITGGLPRV
NDLFETRIPKNLTEMAKVSGIVQFKSIQKGKRLINILDEYGVEHKHYIPAGKHLLVRDGD
VVKAGDMLCDGRINPHDVLEILGGISLQEFLLAEIQDVYRKQGVSINDKHIGVIIKQMMK
KVKIVAVGDTNFVYGQKVDKHTFYEQNRKVIEQGGEPAIASPILIGVTKTSLNIDSFISA
ASFQETTKVLTDASIAGKIDDLRGLKENVVIGHLIPTGTGMGLYKKIKVSENIDSEV
>TP0242 DNAdirected RNA polymerase, beta' subunit {Borrelia burgdo
MKDIRDFDSLQIKLASPDTIRAWSYGEVKKPETINYRTLRPEREGLFCERIFGTTKEWEC
FCGKFKSIRYRGVICDRCGVEVTHFKVRRERMGHIELATPVSHIWYYRCVPSRMGLLLDL
QVIALRSVLYYEKYIVIEPGDTDLKKNQLLTETEYNDAQERYGGGFTAGMGAEAIRTLLQ
NLDLDALVAQLREKMMEKGAKSDKRLLRRIEIVENFRVSGNKPEWMILSVIPVIPPDLRP
MVQLDGGRFATSDLNDLYRRVIHRNSRLIRLMELKAPDIIIRNEKRMLQEAVDALFDNSK
RKPAIKGASNRPLKSISDMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPELKLWQCGLPTK
MALELFKPFIMKKLVEKEIVSNIKKAKMLVEQESPKVFSVLDEVVKEHPVMLNRAPTLHR
LGIQAFEPVLVEGKAIRLHPLVCKPFNADFDGDQMAVHVPLTQAAQMECWTLMLSNRNLL
DPANGRTIVYPSQDMVLGLYYLTKERSLPEGARPRRFSSVEEVMMAAEKGVIGWQDQIQV
RYHKCDGQLVVTTAGRLVLNEEVPAEIPFVNETLDDKRIRKLIERVFKRQDSWLAVQMLD
ALKTIGYTYATFFGATLSMDDIIVPEQKVQMLEKANKEVLAIASQYRGGHITQEERYNRV
VEVWSKTSEELTSLMMETLERDKDGFNTIYMMATSGARGSRNQIRQLAGMRGLMAKPSGD
IIELPIRSNFKEGLNVIEFFISTNGARKGLADTALKTADAGYLTRRLVDIAQDVVVNEED
CGTINGIEYRAVKSGDEIIESLAERIVGKYTLERVEHPITHELLLDVNEYIDDERAEKVE
EAGVESVKLRTVLTCESKRGVCVCCYGRNLARNKIVEIGEAVGIVAAQSIGQPGTQLTMR
TFHVGGTASSTTEENRITFKYPILVKSIEGVHVKMEDGSQLFTRRGTLFFHKTLAEYQLQ
EGDSVQVRDRARVLKDEVLYHTTDGQTVYASVSGFARIIDRTVYLVGPEQKTEIRNGSNV
VIKADEYVPPGKTVATFDPFTEPILAEQDGFVRYEDIILGSTLIEEVNTETGMVERRITT
LKTGIQLQPRVFISDESGNALGSYYLPEEARLMVEEGAQVKAGTVIVKLAKAIQKTSDIT
GGLPRVSELFEARRPKNAAVLAQISGVVSFKGLFKGKRIVVVRDHYGKEYKHLVSMSRQL
LVRDGDTVEAGERLCDGCFDPHDILAILGENALQNYLMNEIRDVYRVQGVSINDQHIGLV
VRQMLRKTEVVSVGDTRFIYGQQVDKYRFHEENRRVEAEGGQPAVARPMFQGITKAALNI
DSFISAASFQETNKVLTNAAIAGSVDDLCGLKENVIIGHLIPAGTGMRRYRQVKLFDKNK
RDLDVQMEEVIRRRKLEEEALAQAVAGMEGEPEGEA
>Caulobacter crescentus|gi|13421684|gb|AAK22490.1| DNAdirected RNA
MNQEVLNIFNPVQAAPTFDQIRISLASPEKIRSWSFGEIKKPETINYRTFKPERDGLFCA
RIFGPTKDYECLCGKYKRMKYKGIICEKCGVEVTLARVRRERMGHIELASPVAHIWFLKS
LPSRIAMMLDMPLKDIERVLYFEYYIVTEPGLTPLKQHQLLSEDDYMRAQEEYGDDSFTA
EIGAEAIQNLLKAIDLEKEAERLREELSGTVSDMKQKKFSKRLKILEAFQESGNRPEWMV
LTVVPVIPPELRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLIELRAPDIIIRNEKRM
LQESVDALFDNGRRGRVITGANKRPLKSLADMLKGKQGRFRQNLLGKRVDYSGRSVIVVG
PELKLHECGLPKKMALELFKPFIYARLDAKGLSGTVKQSKRMVEREQPQVWDILEEVIRE
HPVLLNRAPTLHRLGIQAFEPKLIEGKAIQLHPLVCAAFNADFDGDQMAVHVPLSLEAQL
EARVLMMSTNNILSPANGRPIIVPSQDIVLGLYYLSVARDGEPGEGKIFADLGEIEAAMD
AGVVSLHAKIKARHTEMTPEGVLLRKVIDTTPGRMKIAALLPHHPQIGHRLIEKALTKKE
IGNLIDIVYRHCGQKATVIFADKVMGLGFKEAAKAGISFGKDDIIIPVRKTAIVEETRKL
AEEYEQQYADGLITKGEKYNKVVDAWAKATDRVADEMMAELQMKHKDENGREKEINAIYM
MAHSGARGSQAQMKQLGGMRGLMAKPSGEIIETPIVSNFKEGLTVQEYFNSTHGARKGLA
DTALKTANSGYLTRRLVDVAQDCIIVEEDCGTTKGITLRAVVEGGDVLVSLGSRVLGRFT
AEDVKDPGTGELVVPADTYIDENIADAIEAAVVQSVKVRSVLTCEAKIGVCGACYGRDLA
RGTPVNIGEAVGVIAAQSIGEPGTQLTMRTFHIGGTAQVAEQSFFEASNEGTVRVIGPTV
VGSDGALVIMSRNTSVSVLVDGKERETYKPPYGARLRVKDGDLVKRGQRLGDWDPYTTPI
ITEVAGKIRAEDLVDGLSIREEVDEATGIAQRVVADWRTSARGSDLRPAMGVLSEDGSYK
RLSNGGEARYLLSAGAILSVADGDEVKPGEVIARIPTEGAKTRDITGGLPRVAELFEARR
PKDCAVIAEMDGRVEFGKDYKNKRRIKITPDVDADGNQPEAVEFLIPKGKHIAVHDGDYI
TKGEYIIDGNPDPHDILRILGVEALANFLVDEIQEVYRLQGVPINDKHIETIVRQMLQKV
EILEPGDTGLIKGDHLDKPEFDKEQEKAIARGGRPAVTQPVLLGITKASLQTKSFISAAS
FQETTRVLTEASVHGKTDTLEGLKENVIVGRLIPAGTGSYLRSLQRVAAKRDEQLAQQRE
DAMEPLPAEIALSDAE
>mlr0277
MNQEVMNLFNPQAPAQVFDSIRISLASPEKILSWSFGEIKKPETINYRTFKPERDGLFCA
RIFGPIKDYECLCGKYKRMKYKGVICEKCGVEVTLSRVRRERMGHIELAAPVAHIWFLKS
LPSRIGTLLDMTLKDIERVLYFENYIVTEPGLTALKEHQLLSEEEYMIAVDEYGEDSFTA
MIGAEAIHDLLAGMDLEKIAGDLRSELASTTSELKQKKYLKRLKVVENFMESGNRPEWMI
MKVVPVIPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLIELRAPGIIVRNEKRM
LQEAVDALFDNGRRGRVITGANKRPLKSLSDMLKGKQGRFRQNLLGKRVDYSGRSVIVTG
PELKLHQCGLPKKMALELFKPFIYARLDAKGYSSTVKQAKKLVEKERPEVWDILDEVIRE
HPVLLNRAPTLHRLGIQAFEPILIEGKAIQLHPLVCTAFNADFDGDQMAVHVPLSLEAQL
EARVLMMSTNNILHPASGAPIIVPSQDMVLGLYYLSIVNQNEPGEGMVFADMGELQHALE
TKAVTLHAKIKGRFRTVDAEGKVVSKIHDTTPGRMIIGELLPKNVNVPYETANQEMTKKN
ISKMIDTVYRHCGQKETVIFCDRIMALGFAHACRAGISFGKDDMLIPDSKIKLVSDTEAL
AKEYEQQYNDGLITQGEKYNKVVDAWAKCSEKVADEMMARIKAVEFEDNGRQKPMNSIYM
MSHSGARGSPTQMRQLAGMRGLMAKPSGEIIETPIISNFKEGLTVLEYFNSTHGARKGLA
DTALKTANSGYLTRRLVDVAQDCIVNSVDCGTDKGLTMQPIVDAGQVVASVGQRVLGRTA
LDDINHPVTGDLLVKAGTLMDERDVEQIEKAGVQSVRIRSALTCEVRVGVCAVCYGRDLA
RGTPVNQGEAVGVIAAQSIGEPGTQLTMRTFHMGGTAQVVDSSFLEASYEGKVEIRNRNV
VRNSDGQQMVMGRNMAVLILDEAGKERATHRVTYGSRIFVDDGDKVKRGQRIAEWDPYTR
PILTEIEGRVAFEDLVDGISVQETADESTGITKREVIDWRSTPRGNDLKPAIVVQDAKGK
VGKLSKGGDARFLLSVEAILSVEPGAQVRPGDVLARIPMESAKTKDITGGLPRVAELFEA
RRPKDHAIIAEIDGTIRFGRDYKNKRRIIIEPHDSTLEPVEYLIPKGKPFHLQDGDVIEK
GDYILDGNPAPHDILAIKGVEALASYLVNEIQEVYRLQGVSINDKHIEVIVRQMLQKVEI
TTQGDSTYIPGDHVDVIELEEVNERLIEDGKKPAEGQPVLLGITKASLQTPSFISAASFQ
ETTRVLTEAAVAGKTDMLQGLKENVIVGRLIPAGTGGTMSQIRRIATSRDELIIDERRKA
SGVEVAEPMLADMTTAAQ
>RPXX01      AJ235270 9811 Rickettsia prowazekii strain Ma
MSVVNFYGQLSNTQQFDQIRINIASPDQVRSWSFGEVTKPETINYRTFKPEKDGLFCARI
FGPVKDYECLCGKYKRMKNRGITCEKCGVEVTVSRVRRERMGHIELAAPVAHIWFLKSLP
SRISTLLDMTMRDVEKILYFENYVVVDPGLSILQKGELLTEEELQKAKDKYGEDAFTASI
GAEVIQQMLKELDFSKLKQELYDELHITSSEVKKKKLVKRLKLVEDFLESENKPEWMIMD
VLPVIPPEIRPLVMLDGGRFATSDLNELYRRVINRNNRLKKLIESKAPDIIVRNEKRMLQ
EAVDALFDNGRRGRAAKNANKRPFKSLSDMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPE
LKLHQCGLPKKMALELFKPFIYSKLELYGIATTIKAAKRMVEAEKPEVWDVLEEVIREHP
VLLNRAPTLHRLGIQAFEPLLIEGKAIQLHPLVCAAFNADFDGDQMAVHIPLSIEAQLEA
RVFMMSTNNILSPANGRPIIVPDKDIVLGLYYLTIAFDNEVGEGMMFSDLAEMEHALYNK
FITIHTKIKYRRDQLNAEGKMVPVIIDTTYGRLMVGELLPSNPNIEFKFINKQLTKKDIS
LVIDLVYRHCGQKATVIFADQLMKLGFKYACSSGISFGMDDMVVPESKSTHINETQLEIK
EFEQQYSNGLITYGEKYNKVVDAWSRCTDRVANDMMKEIATPPVNDYPNHQKINAIYMMA
ISGARGSFQQIKQLGGMRGLMTKSNGQIIQTPIISNFKEGLTEFECFNSANGMRKGQIDT
ALKTASSGYLTRKLVDVAQDCIITEKDCGTDKGIEVKSVIEGGEIIVPLAEKILGRTAAI
DIFHPVTNDLILNKGELINESKLEQIESAGLDRIMIKSVLTCESSTGICSICYGRDLATG
TLVSEGEAIGVIAAQSIGEPGTQLTMRTFHIGGAATKGAEVSSVEASYDAKVKIISRNVV
INSEERKIVMSRNCELLLLDNNGNEKARHKIPYGARLLVDDGDMVIKTQKLAEWDPYTIP
IITEKSGKVLFKDMVEGISIRDVTDEATGIPSKVIIESKQYSRGAELRPRIQLLDSKGEV
ITLSNGLEARYYLPVGAVLSVEDGIQISVGDIIARIPKESTTTKDITGGLPRVAELVEAR
RPKDHAVIAEVDGRVEFGKDYKSKRRIIIHPIDGTMSIEYMVPKGKHVVVNEGDFVKKGD
LLIDGNPVLQDILKVMGVEVLANYIVKEVQAVYRLQGVKIDDKHIEVIIRQMLQKVEVTD
SGGTTLLVGEKIDRHEFDEINAKAMKNGLKPAEAQLILQGITKASLQTRSFISAASFQET
TRVLTEAAIAGKVDKLRGLKENVIVGRLVPAGTGYFMDKMRKAAVKLDEENV
>AE004969 Neisseria gonorrhoeae FA1090 complete sequence 2153894 bp
MNLLNLFNPLQTAGMEEEFDAIKIGIASPETIRSWSYGEVKKPETINYRTFKPERDGLFC
AKIFGPVKDYECLCGKYKRLKFKGVTCEKCGVEVTLSKVRRERMGHIELAAPVAHIWFLK
SLPSRLGMVLNMTLRDIERVLYFEAFVVTDPGMTPLQRRQLLTEDDYYNKLDEYGDDFDA
KMGAEGIRELLRTLDVAGEIEILRQELESTGSDTKIKKIAKRLKVLEAFHRSGMKLEWMI
MDVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLELHAPDIIVRNEKRM
LQEAVDSLLDNGRRGKAMTGANKRPLKSLADMIKGKGGRFRQNLLGKRVDYSGRSVITVG
PYLRLHQCGLPKKMALELFKPFIFHKLEKQGLASTVKAAKKLVEQEVPEVWDILEEVIRE
HPIMLNRAPTLHRLGIQAFEPILIEGKAIQLHPLVCAAFNADFDGDQMAVHVPLSLEAQM
EARTLMLASNNVLSPANGEPIIVPSQDIVLGLYYMTRDRINAKGEGSLFADVKEVHRAYH
TKQVELGTKITVRLREWVKNEAGEFEPVVNRYETTVGRALLSEILPKGLPFEYVNKALKK
KEISKLINASFRLCGLRDTVIFADHLMYTGFGFAAKGGISIAVDDMEIPKEKAALLAEAN
AEVKEIEDQYRQGLVTNGERYNKVVDIWGRAGDKIAKAMMDNLSKQKVIDRDGNEVDQES
FNSIYMMADSGARGSAAQIKQLSGMRGLMAKPDGSIIETPITSNFREGLTVLQYFIATHG
ARKGLADTALKTANSGYLTRRLVDVTQDLVVVEDDCGTSDGFVMKAVVQGGDVIEALRDR
ILGRVTASDVVDPSSGETLVEAGTLLTEKLVDMIDQSGVDEVKVRTPITCKTRHGLCAHC
YGRDLARGKLVNAGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAASQVEAKSNGTA
RFSSQMRYVANNKGELVVIGRSCEVVIHDDIGRERERHKVPYGAILLVQDGMAIKAGQTL
ATWDPHTRPMITEHAGMVKFENMEEGVTVAKQTDDVTGLSTLVVIDGKRRSSSASKLLRP
TVKLLDENGVEICIPGTSTPVSMAFPVGAVITVREGQEIGKGDVLARIPQASSKTRDITG
GLPRVAELFEARVPKDAGMLAEITGTVSFGKETKGKQRLIITDVDGVAYETLISKEKQIL
VHDGQVVNRGETIVDGAVDPHDILRLQGIEALARYIVQEVQEVYRLQGVKISDKHIEVII
RQMLRRVNIADAGETGFITGEQVERGDVMAANEKALEEGKEPARYENILLGITKASLSTD
SFISAASFQETTRVLTEAAIMGKQDELRGLKENVIVGRLIPAGTGLTYHRSRHQQWQGVE
QETAETQVTDE
>NMA0141, rpoC, DNAdirected RNA polymerase beta' chain 122773:1269
MNLLNLFNPLQTAGMEEEFDAIKIGIASPETIRSWSYGEVKKPETINYRTFKPERDGLFC
AKIFGPVKDYECLCGKYKRLKFKGVTCEKCGVEVTLSKVRRERMGHIELAAPVAHIWFLK
SLPSRLGMVLDMTLRDIERVLYFEAFVVTDPGMTPLQRRQLLTEDDYYNKLDEYGDDFDA
KMGAEGIRELLRTLNVAGEIEILRQELESTGSDTKIKKIAKRLKVLEAFHRSGMKLEWMI
MDVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLELHAPDIIVRNEKRM
LQEAVDSLLDNGRRGKAMTGANKRPLKSLADMIKGKGGRFRQNLLGKRVDYSGRSVITVG
PYLRLHQCGLPKKMALELFKPFIFHKLEKQGLASTVKAAKKLVEQEVPEVWDILEEVIRE
HPIMLNRAPTLHRLGIQAFEPILIEGKAIQLHPLVCAAFNADFDGDQMAVHVPLSLEAQM
EARTLMLASNNVLSPANGEPIIVPSQDIVLGLYYMTRDRINAKGEGSLFADVKEVHRAYH
TKQVELGTKITVRLREWVKNEAGEFEPVVNRYETTVGRALLSEILPKGLPFEYVNKALKK
KEISKLINASFRLCGLRDTVIFADHLMYTGFGFAAKGGISIAVDDMEIPKEKAALLAEAN
AEVKEIEDQYRQGLVTNGERYNKVVDIWGRAGDKIAKAMMDNLSKQKVIDRDGNEVDQES
FNSIYMMADSGARGSAAQIKQLSGMRGLMAKPDGSIIETPITSNFREGLTVLQYFIATHG
ARKGLADTALKTANSGYLTRRLVDVTQDLVVVEDDCGTSDGFVMKAVVQGGDVIEALRDR
ILGRVTASDVVDPSSGETLVEAGTLLTEKLVDMIDQSGVDEVKVRTPITCKTRHGLCAHC
YGRDLARGKLVNAGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAASQVEAKSNGTA
RFSSQMRYVANNKGELVVIGRSCEVVIHDDIGRERERHKVPYGAILLVQDGMAIKAGQTL
ATWDPHTRPMITEHAGMVKFENVEEGVTVAKQTDDVTGLSTLVVIDGKRRSSSASKLLRP
TVKLLDENGVEICIPGTSTPVSMAFPVGAVITVREGQEIGKGDVLARIPQASSKTRDITG
GLPRVAELFEARVPKDAGMLAEITGTVSFGKETKGKQRLIVTDVDGVAYETLISKEKQIL
VHDGQVVNRGETIVDGAVDPHDILRLQGIEALARYIVQEVQEVYRLQGVKISDKHIEVII
RQMLRRVNIADAGETGFITGEQVERGDVMAANEKALEEGKEPARYENVLLGITKASLSTD
SFISAASFQETTRVLTEAAIMGKQDELRGLKENVIVGRLIPAGTGLTYHRSRHQQWQGVE
QETAETQVTDE
>NMB0133 DNAdirected RNA polymerase, beta' subunit (rpoC)
MNLLNLFNPLQTAGMEEEFDAIKIGIASPETIRSWSYGEVKKPETINYRTFKPERDGLFC
AKIFGPVKDYECLCGKYKRLKFKGVTCEKCGVEVTLSKVRRERMGHIELAAPVAHIWFLK
SLPSRLGMVLDMTLRDIERVLYFEAFVVTDPGMTPLQRRQLLTEDDYYNKLDEYGDDFDA
KMGAEGIRELLRTLNVAGEIEILRQELESTGSDTKIKKIAKRLKVLEAFHRSGMKLEWMI
MDVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLELHAPDIIVRNEKRM
LQEAVDSLLDNGRRGKAMTGANKRPLKSLADMIKGKGGRFRQNLLGKRVDYSGRSVITVG
PYLRLHQCGLPKKMALELFKPFIFHKLEKQGLASTVKAAKKLVEQEVPEVWDILEEVIRE
HPIMLNRAPTLHRLGIQAFEPILIEGKAIQLHPLVCAAFNADFDGDQMAVHVPLSLEAQM
EARTLMLASNNVLSPANGEPIIVPSQDIVLGLYYMTRDRINAKGEGSLFADVKEVHRAYH
TKQVELGTKITVRLREWVKNEAGEFEPVVNRYETTVGRALLSEILPKGLPFEYVNKALKK
KEISKLINASFRLCGLRDTVIFADHLMYTGFGFAAKGGISIAVDDMEIPKEKAALLAEAN
AEVKEIEDQYRQGLVTNGERYNKVVDIWGRAGDKIAKAMMDNLSKQKVIDRAGNEVDQES
FNSIYMMADSGARGSAAQIKQLSGMRGLMAKPDGSIIETPITSNFREGLTVLQYFIATHG
ARKGLADTALKTANSGYLTRRLVDVTQDLVVVEDDCGTSDGFVMKAVVQGGDVIEALRDR
ILGRVTASDVVDPSSGETLVEAGTLLTEKLVDMIDQSGVDEVKVRTPITCKTRHGLCAHC
YGRDLARGKLVNAGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAASQVEAKSNGTA
RFSSQMRYVANNKGELVVIGRSCEVVIHDDIGRERERHKVPYGAILLVQDGMAIKAGQTL
ATWDPHTRPMITEHAGMVKFENVEEGVTVAKQTDDVTGLSTLVVIDGKRRSSSASKLLRP
TVKLLDENGVEICIPGTSTPVSMAFPVGAVITVREGQEIGKGDVLARIPQASSKTRDITG
GLPRVAELFEARVPKDAGMLAEITGTVSFGKETKGKQRLIVTDVDGVAYETLISKEKQIL
VHDGQVVNRGETIVDGAVDPHDILRLQGIEALARYIVQEVQEVYRLQGVKISDKHIEVII
RQMLRRVNIADAGETGFITGEQVERGDVMAANEKALEEGKEPARYENVLLGITKASLSTD
SFISAASFQETTRVLTEAAIMGKQDELRGLKENVIVGRLIPAGTGLTYHRSRHQQWQEVE
QETAETQVTDE
>XF2632 (XF07H03EGL03)
MEAQSSIVLCFQLVPMPEFRRRSMKDLLNLFNQQRQTLDFDAIKIGLASPALIRSWSFGE
VKKPETINYRTFKPERDGLFCAAIFGPIKDYECLCGKYKRMKHRGVVCEKCGTEVTLAKV
RRERMGSIELASPVAHIWFLKSLPSRIGLMLDMTLRDIERVLYFEAYVVTEPGLTPLERR
QLLTEEQYLQARQEHADDFDATMGAEAVYELLRMIDLQSEMARLREEIVVTGSETKLKRL
TKRIKLIEAFIESGNRPEWMILTVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNN
RLCRLLELSAPDIIVRNEKRMLQESVDALLDNGRRGRAITGTNKRPLKSLADMIKGKQGR
FRQNLLGKRVDYSARSVIIVGPNLRLHQCGLPKKMALELFKPFVFAKLQRRGLATTIKAA
KKLVEREEAEVWDILEEVISEHPVVLNRAPTLHRQGIQAFEPVLIEGKAIQLHPLVCTAF
NADFDGDQMAVHVPLSLEAQLEARALMMSTNNILSPANGEPIIVPSQDVVLGLYYMSRAL
ENKKGEGMVFANTSELKRAYDNSVVELHAKVKVRITEIETDDRGLRSKASLIVDTTVGRA
LLSEILPEGLPFVLVNTEMTKKNISRLINSSYRMLGLKDTVVFADKLMYTGYAYATRAGV
SICIDDMLIPIEKKEILGEAEQEVLEIQEQYQSGLVTAGERYNKVVDIWSRTNERIAKAM
MDTIGTEKVVNTDGEIVDQKSMNSLYIMADSGARGSPQQIRQLAAMRGLMVRPDGSIIET
PIKANFREGLSVQEYFNSTHGARKGLADTALKTANSGYLTRRLVDVTQDLCVVQLDCGTA
GGLTMTPIVEGGDVVEPLKDRVLGRVVAEDVFLPGNDDEPIVTRSTLLDEQWVAKLEEAG
VQSVKVRSPITCESPFGVCALCYGRDLARGHLVNMGEAVGVIAAQSIGEPGTQLTMRTFH
IGGTALSAAAVDNITVKTSGSVKFTNLKYVEHVNGTLVAVSRSGEISVLDTHGRERERYK
LPYGATINVKDMAEVKSGQILANWDPHNHPIVSEVAGFVRFIDFVDGVTVIEKTDDLTGL
SSREIADLKRRGSQGKDLRPLVRIVDKKGNDLTIPGTDLSAQYLLPPRSIVNLQDGAPVG
IGDVVAKIPQEASKTRDITGGLPRVADLFEARRPKDPAILAERSGVISFGKDTKGKQRLI
IKDADGSEHEELIPKYRQIIVFEGEHVTKGETIVDGEPSPQDILRLLGIEPLAAYLVKEI
QDVYRLQGVKINDKHIEVITRQMLRKVEIVDQGNSKFLNGEQVERQRVIDENAKLIARNE
LPAKYNPVLLGITKASLATESFISAASFQETTRVLTEAAVRGTRDNLRGLKENVIVGRLI
PAGTGQRYHSQRRYSSVGLTQSEMETLVGRSTSSGTEVTSPSKDAIPLGG
>Pseudomonas aeruginosa AE004842_6 DNAdirected RNA polymerase beta
MKDLLNLLKNQGQIEEFDAIRIGLASPEMIRSWSFGEVKKPETINYRTFKPERDGLFCAK
IFGPVKDYECLCGKYKRLKHRGVICEKCGVEVALAKVRRERMGHIELASPVAHIWFLKSL
PSRIGLLLDMTLRDIERVLYFESYVVIDPGMTTLEKGQLLNDEQYFEALEEFGDDFDARM
GAEAVHELLNAIDLEHEIGRLREEIPQTNSETKIKKLSKRLKLMEAFQGSGNKPEWMVLT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQ
EAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPT
LRLHQCGLPKKMALELFKPFIFGKLEGRGMATTIKAAKKMVERELPEVWDVLAEVIREHP
VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVMGLYYMTREAINAKGEGMAFADLQEVDRAYRSG
QASLHARVKVRINEKIKGEDGQLTANTRIVDTTVGRALLFQVVPAGLPFDVVNQSMKKKA
ISKLINHCYRVVGLKDTVIFADQLMYTGFAYSTISGVSIGVNDFVIPDEKARIINAATDE
VKEIESQYASGLVTQGEKYNKVIDLWSKANDEVSKAMMANLSKEKVVDREGKEVDQESFN
SMYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGAR
KGLADTALKTANSGYLTRRLVDVAQDLVVTEIDCGTEHGLLMSPHIEGGDVVEPLGERVL
GRVIARDVFKPGSDEVIVPAGTLIDEKWVDFLEVMSVDEVVVRSPITCETRHGICAMCYG
RDLARGHRVNIGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRTSAADNVQVKNGGTIRL
HNLKHVVRADGALVAVSRSGELAVADDFGRERERYKLPYGAVISVKEGDKVDPGAIVAKW
DPHTHPIVTEVDGTVAFVGMEEGITVKRQTDELTGLTNIEVMDPKDRPAAGKDIRPAVKL
IDAAGKDLLLPGTDVPAQYFLPANALVNLTDGAKVSIGDVVARIPQETSKTRDITGGLPR
VADLFEARRPKEPSILAEISGTISFGKETKGKRRLVITPNDGSDPYEELIPKWRHLNVFE
GEQVNRGEVISDGPSNPHDILRLLGVSSLAKYIVNEIQDVYRLQGVKINDKHIETILRQM
LRKVEVSESGDSSFIKGDQVELTQVLEENEQLGTEDKFPAKYERVLLGITKASLSTESFI
SAASFQETTRVLTEAAVTGKRDFLRGLKENVVVGRLIPAGTGLAYHSERKRQRDLGKPQR
VSASEAEAALTEALNSSGN
>VC0329 DNAdirected RNA polymerase, beta' subunit (rpoC)
MKDLLNFLKAQHKTEEFDAIKIGLASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCAR
IFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRDRMGHIELASPVAHIWFLKSL
PSRIGLLMDMPLRDIERVLYFEMYVVTEPGMTDLERGQMLTEEEYLDRLEEWGDEFTAKM
GAEAIKDLLASMDLPAEAEQMREELDTTNSETKRKKLTKRLKLVEAFVASGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLELAAPDIIVRNEKRMLQ
ESVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPY
LRLHQCGLPKKMALELFKPFIYSKLETRGLATTIKAAKKMVEREEAVVWDILDEVIREHP
VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RTLMMSTNNILSPASGDPIIVPSQDVVLGLYYMTREKINAKGEGMYLTGPAEAEKAYRTK
TAELHARVKVRITETIKHENGKLTTETKMIDTTVGRAMLWQIVPKGLPYSLVNQKLGKKQ
ISNLLNEAYRKLGLKDTVIFADQIMYTGFAYAALSGVSVGIDDMVVPAAKYTEIAEAEEE
VREIQEQFQSGLVTAGERYNKVIDIWASTNDRVAKAMMENLSSEQVINRQGEQEKQESFN
SIYMMADSGARGSAAQIRQLAGMRGLMARPDGSIIETPITANFKEGLNVLQYFISTHGAR
KGLADTALKTANSGYLTRRLVDVAQDVVVTEHDCGTLEGVVMTPHIEGGDVKVALTELAL
GRVVSEDILKPGTDEVLIPRNTLLDEKWCKVINDNSVDQIKVRSVVTCDSDFGCCAQCYG
RDLARGHLVNQGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASTAAAENSIQAKNNGSVKL
HNAKFVTNKDGKLVITSRASELTIIDEFGRTKEKHKLPYGSMLSKADGDAVAAGETVANW
EAHTMPIITEVAGRVQFVDMIDGVTVSRQTDDLTGLSSSEVTEAAARPAAGKDMRPAIKL
VDANGKDVLIPGTDMPAQYFLPGKAIVNLDDGAEVNVGDTLARIPQKSGGNKDITGGLPR
VADLFEARKPKEPAILAEHSGTVSFGKETKGKRRLIITRDSGDTYEEMIPKHRQLNVFEG
ERIERGDVIADGPESPHDILRLRGIHAVTTYIANEVQEVYRLQGVKINDKHIETIVRQML
RKCTITFAGDSEFLPGETVEYSQVKIANRKLVEEGKEPARFERELLGITKASLATESFIS
AASFQETTRVLTEAAVSGKRDDLRGLKENVIVGRLIPAGTGFAYHQDRQAKRAQEQQGPS
AEQATDNLAALLNAGFSSDDE
>HI0514 DNAdirected RNA polymerase, beta' chain (rpoC)
MKDLVKFLKAQSKTSEDFDVIKIGLASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCA
RIFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELASPVAHIWFLKS
LPSRIGLLLDMPLRDIERVLYFEMYIVTEPGMTDLERGQLLTEEQYLDAEDRWQDEFEAK
MGAEAIQDLLKGMDLEAECEKLREELQETNSETKRKKITKRLKLLEAFVQSGNKPEWMVM
TVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLIAPDIIVRNEKRML
QESVDALLDNGRRGRAITGSNRRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGP
YLHLHQCGLPKKMALELFRPFIYAKLESRGYATTIKAAKKMVEREDAIVWDILAEVIREH
PILLNRAPTLHRLGIQAFEPILIEGKAIQLHPLVCAAFNADFDGDQMAVHVPLTLEAQLE
ARALMMSTNNVLSPANGDPIIVPSQDVVLGLYYMTREKVNGKGEGMLLQDPREAEKAYRT
GEAELHSRVKVRITEYVKNEAGEFDAKTTLTDTTIGRAILWMIAPKGMPYSLFNQTLGKK
AISKLINEAYRRLGLKEAVMFADQIMYTGFAYAARSGSSVGIDDMEIPAKKYEIISAAEE
EVAEIQEQFQSGLVTAGERYNKVIDIWAAANERVAKAMMENLSQEEVINREGNPEKQASF
NSIFMMADSGARGSAAQIRQLAGMRGLMARPDGSIIETPITANFREGLNVLQYFISTHGA
RKGLADTALKTANSGYLTRRLVDVAQDLVIVEDDCGTHEGLVMTPLIEGGDEKVPLRELV
LGRVAAEDILKPGTEEVLIPRNTLLDEKLCDVLDANSVDSVKVRSVVTCDTDFGVCAKCY
GRDLARGHLINQGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASAAAKESSVQVKNTGTVH
LMNAKFVTNDESKLVLTSRNTELTITDAFGRTKEHYKVPYGAVLSKGDGQEVTAGETIAN
WDPHTMPVVSEVSGFVKFVDIIDGLTVTRQTDELTGLSSIVVQDVGERATAGKDLRPTIK
LVDANGNDIFLPETDVLAQYFLPGKAIVSLDDGTAVKVGEPLARIPQESVGTKDITGGLP
RVADLFEARKPKEPAILAEISGIVSFGKETKGKRRLLITPAEGETYEEMIPKWRQLNVFE
GEMVQRGDVISDGAETPHDILRLRGVRAVTEYIVNEVQDVYRLQGVKINDKHIEVIVRQM
LRKAVITKAYDSEFLEGEQVEVARVKIVNRQREAEGKPPVEFERELLGITKASLATESFI
SAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGFAYHQNRHKHRLVDDVVA
KLSEEDEAAIADEFVITADDATQNLATLLNSEIED
>gi|12722150|gb|AAK03820.1| RpoC [Pasteurella multocida]
MKDLVKFLKAQSKTSEDFDVIKIGLASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCA
RIFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELASPVAHIWFLKS
LPSRIGLLLDMPLRDIERVLYFESYIVIEPGMTDLDKGQLLTEEQYIDAEDRWGDEFDAK
MGAEAIQALLRDMDLPQECENLREELQETNSETKRKKITKRLKLLEAFIQSGNKPEWMVM
TVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLIAPDIIVRNEKRML
QESVDALLDNGRRGRAITGSNRRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGP
YLHLHQCGLPKKMALELFRPFIYAKLESRGFASTIKAAKKMVEREDAIVWDILADVIREH
PILLNRAPTLHRLGIQAFEPLLIEGKAIQLHPLVCAAFNADFDGDQMAVHVPLTLEAQLE
ARALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDKVNGKGEGMLLQDPREAEKAYRT
GQVELHSRVKVRITEYVKNAVGEFEPQTNLVDTTIGRAILWMIAPKGMPFSLFNQTLGKK
AISKLINESYRRLGMKPSVLFADQIMYTGFAYAARSGSSVGIDDMVIPAKKYEIISAAED
EVAEIQEQFQSGLVTAGERYNKVIDIWAAANERVAKAMMENLSTEEVINREGQPEKQASF
NSIFMMADSGARGSAAQIRQLAGMRGLMARPDGSIIETPITANFREGLNVLQYFISTHGA
RKGLADTALKTANSGYLTRRLVDVAQDLVIIEDDCGTHEGIVMTPLIEGGDVKEALRDRV
LGRVVAEDVLKPGTEEVLIARNTLLDEKLCDVIDSNSVDSIKVRSVVTCNTDFGVCAKCY
GRDLARGHLINQGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASAAAKESSIQVKNTGTLR
LANVKFVTNNEGKLVLTSRNTELTIIDAFGRTKEHYKVPYGAILSKGDGQEVTAGETVAN
WDPHTMPVVSEVSGFVKFIDLIDGLTVTRQTDELTGLSSIVVQDVGERATAGKDLRPAIK
LVDAKGNDILIPGTDVVAQYFLPGKAIVTLDDNAEVHIGEPLARIPQESVGTKDITGGLP
RVADLFEARKPKEPAILAEISGIVSFGKETKGKRRLLITPAEGETYEEMIPKWRQLNVFE
GEMVERGDLISDGAETPHDILRLRGVHAVTEYIVNEVQEVYRLQGVKINDKHIEVIVRQM
LRKGIVTKAYDSEFLEGEQVEVARVKIVNRKREAEGKPLVEFERELLGITKASLATESFI
SAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGFAYHQNRQKKVVMSDEMP
VKLSAADEEEIAAEFTVTAEDATASLAEMLNMADDAE
>rpoC;BU033;DNAdirected RNA polymerase beta' chain|36321..40544(co
MKDLLKFLKSQTKNEDFDAIKISLASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCAR
IFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQSKVRRERMGHIELSSPTAHIWFLKSL
PSRIGLLLDMPLRDIERVLYFESYVVIETGMTNLEKRQILTEEQYLDSLEEFGDEFHATM
GAEAIQFLLKDINLVQECNVLRIELNETNSETKRKKLTKRIKLLESFIQSHNKPEWMILN
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQ
EAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPY
LHLHQCGLPKKMALELFKPFIYGKLEVRGLATTIKAAKKMVEREEAIVWDILDEVIREHP
VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLESQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTREKINGKGEGMLLNGSNEAEKVYRLG
IAELHSLVKVRIIEYKKNEDKSFTAIKKIIPTTIGRAILWMIIPKGLPFSIVNQTLGKKD
ISKMLNTCYRILGLKSTVFFADQIMYTGFAYAARSGASVGIDDMVIPEKKANIINEAEIE
VAEIQEQFQSGLVTAGERYNKVIDIWAAANERVAKAMMQNLSTESVINKKGYKQKQISFN
SIFMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGAR
KGLADTALKTANSGYLTRRLVDVAQDLVVTQDDCRTHEGILMTPLIEGGDVKEPLRERVL
GRVTAENIIIPNTKNILIKRNTLLNEKWCDLLEHNSIDNVKVRSVVNCDTDFGVCAYCYG
RDLARGNLVNKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNQGIINL
NNAKFVINSAGKTVITSRNVELNIIDNFGRTKESYKVPYGAIMAKGDGEKVHSGETVAKW
DPHTMPVITEVNGLVRFVDMIDGQSITRQADELTGLSSIVILDTAERMSSGKDLRPALKI
IDCNGNDVLISGTDMPAQYFLPGKAIVQLDDGVQISSGDTLARVPQESGGTKDITGGLPR
VADLFEARRPKELAILAEISGIISFGKETKGKRRLVITPVDGSDSYEEMIPKWRQLNVFE
GERVDRGDVISDGPESPHDILRLRGVQAVTRYIVNEVQEVYRLQGVKINDKHIEVIIRQM
LRKATVVKSRDSDFLEGEQVEFSHIKISNRMLDKKKKMPATFSRDLLGITKASLATESFI
SAASFQETTRVLTESAVAGKRDELRGLKENVIVGRLIPAGTGYAYHKERLNRRQKKHNNP
TVSSSQISAEEASASLSELLNSALIEK
>Yersinia_pestis_strain_CO92
VPIRSNSDRSQSVKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRT
FKPERDGLFCARIFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELA
SPTAHIWFLKSLPSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERRQILTEEQYLDA
LEEFGDEFDAKMGAEAIQALLKNMDLEAECEILREELNETNSETKRKKLTKRIKLLEAFV
QSGNKPEWMILTVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAP
DIIVRNEKRMLQEAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVD
YSGRSVITVGPYLRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKMVEREEAVV
WDILDEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAV
HVPLTLEAQLEARALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLT
GPKEAERIYRAGLASLHARVKVRITEEIRNTEGESITRTSIIDTTVGRAILWMIVPQGLP
YSIVNQPLGKKAISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVGIDDMVIPE
AKAGIIEEAETEVAEIQEQFQSGLVTAGERYNKVIDIWAAANERVAKAMMDNLSVEDVVN
RDGVVEQQVSFNSIFMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLN
VLQYFISTHGARKGLADTALKTANSGYLTRRLVDVAQDLVVTEDDCGTHNGIVMTPVIEG
GDVKEPLRDRVLGRVTAEEVIKPGSADILVPRNTLLDEKWCDLLEENSVDSVKVRSVVSC
ETDFGVCANCYGRDLARGHIINKGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAES
SIQVKNKGSLKLSNVKFVTNAAGKLVITSRNTELKLIDEFGRTKESYKVPYGAVMAKGDG
AEVQGGETVANWDPHIMPVVTEVSGFIRFADMVDGQTITRQTDELTGLSSLVVLDSAERT
GSGKDLRPALKIVDAKGNDVLIPGTDMPAQYFLPGKAIVQLEDGIQIGAGDTLARIPQES
SGTKDITGGLPRVADLFEARRPKEPAILAEISGIISFGKETKGKRRLVISPLDGSDAYEE
MIPKWRQLNVFEGEVVERGDVVSDGPESPHDILRLRGVHAVTRYITNEVQEVYRLQGVKI
NDKHIEVIVRQMLRKGTIVDAGSTDFLEGEQAEMSRVKIANRKLAAEGKIEATFTRDLLG
ITKASLATESFISAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGYAYHQD
RMRRKAQGEAPVVPQVSADEATANLAELLNAGFGNNKG
>AP002567    AP002567 0103 Escherichia coli O157:H7 DNA, c
MKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCAR
IFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSL
PSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQILTEEQYLDALEEFGDEFDAKM
GAEAIQALLKSMDLEQECEQLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQ
EAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPY
LRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKMVEREEAVVWDILDEVIREHP
VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERLYRSG
LASLHARVKVRITEYEKDANGELVAKTSLKDTTVGRAILWMIVPKGLPYSIVNQALGKKA
ISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVGIDDMVIPEKKHEIISEAEAE
VAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEKQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGAR
KGLADTALKTANSGYLTRRLVDVAQDLVVTEDDCGTHEGIMMTPVIEGGDVKEPLRDRVL
GRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEENSVDAVKVRSVVSCDTDFGVCAHCYG
RDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGETVANW
DPHTMPVITEVSGFVRFTDMIDGQTITRQTDELTGLSSLVVLDSAERTAGGKDLRPALKI
VDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDTLARIPQESGGTKDITGGLPR
VADLFEARRPKEPAILAEISGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQM
LRKATIVNAGSSDFLEGEQVEYSRVKIANRELEANGKVGATYSRDLLGITKASLATESFI
SAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGYAYHQDRMRRRAAGEAPA
APQVTAEDASASLAELLNAGLGGSDNE
>AE005630    AE005630 0101 Escherichia coli O157:H7 genome
MKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCAR
IFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSL
PSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQILTEEQYLDALEEFGDEFDAKM
GAEAIQALLKSMDLEQECEQLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQ
EAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPY
LRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKMVEREEAVVWDILDEVIREHP
VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERLYRSG
LASLHARVKVRITEYEKDANGELVAKTSLKDTTVGRAILWMIVPKGLPYSIVNQALGKKA
ISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVGIDDMVIPEKKHEIISEAEAE
VAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEKQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGAR
KGLADTALKTANSGYLTRRLVDVAQDLVVTEDDCGTHEGIMMTPVIEGGDVKEPLRDRVL
GRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEENSVDAVKVRSVVSCDTDFGVCAHCYG
RDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGETVANW
DPHTMPVITEVSGFVRFTDMIDGQTITRQTDELTGLSSLVVLDSAERTAGGKDLRPALKI
VDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDTLARIPQESGGTKDITGGLPR
VADLFEARRPKEPAILAEISGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQM
LRKATIVNAGSSDFLEGEQVEYSRVKIANRELEANGKVGATYSRDLLGITKASLATESFI
SAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGYAYHQDRMRRRAAGEAPA
APQVTAEDASASLAELLNAGLGGSDNE
>ECOLI       U00096 9709 Escherichia coli K12, complete
MKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCAR
IFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSL
PSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQILTEEQYLDALEEFGDEFDAKM
GAEAIQALLKSMDLEQECEQLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQ
EAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPY
LRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKMVEREEAVVWDILDEVIREHP
VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERLYRSG
LASLHARVKVRITEYEKDANGELVAKTSLKDTTVGRAILWMIVPKGLPYSIVNQALGKKA
ISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVGIDDMVIPEKKHEIISEAEAE
VAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEKQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGAR
KGLADTALKTANSGYLTRRLVDVAQDLVVTEDDCGTHEGIMMTPVIEGGDVKEPLRDRVL
GRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEENSVDAVKVRSVVSCDTDFGVCAHCYG
RDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGETVANW
DPHTMPVITEVSGFVRFTDMIDGQTITRQTDELTGLSSLVVLDSAERTAGGKDLRPALKI
VDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDTLARIPQESGGTKDITGGLPR
VADLFEARRPKEPAILAEISGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQM
LRKATIVNAGSSDFLEGEQVEYSRVKIANRELEANGKVGATYSRDLLGITKASLATESFI
SAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGYAYHQDRMRRRAAGEAPA
APQVTAEDASASLAELLNAGLGGSDNE
>Salmonella typhi CT18 chromosome
VKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCAR
IFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSL
PSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQILTEEQYLDALEEFGDEFDAKM
GAEAIQALLKSMDLEQECETLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQ
EAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPY
LRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKMVEREEAVVWDILDEVIREHP
VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERIYRAG
LASLHARVKVRITEYEKDENGEFVAHTSLKDTTVGRAILWMIVPKGLPFSIVNQALGKKA
ISKMLNTCYRILGLKPTVIFADQTMYTGFAYAARSGASVGIDDMVIPEKKHEIISEAEAE
VAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEQQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGAR
KGLADTALKTANSGYLTRRLVDVAQDLVVTEDDCGTHEGILMTPVIEGGDVKEPLRDRVL
GRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEANSVDAVKVRSVVSCDTDFGVCAHCYG
RDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVMAKGDGEQVAGGETVANW
DPHTMPVITEVSGFIRFTDMIDGQTITRQTDELTGLSSLVVLDSAERTTGGKDLRPALKI
VDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDTLARIPQESGGTKDITGGLPR
VADLFEARRPKEPAILAEIAGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQM
LRKATIESAGSSDFLEGEQVEYSRVKIANRELEANGKVGATFSRDLLGITKASLATESFI
SAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGYAYHQDRMRRRAAGEQPA
TPQVTAEDASASLAELLNAGLGGSDNE