Mercurial > repos > rnateam > splitfasta
view test-data/part1.fasta @ 5:733ca84b21ee draft default tip
"planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/splitfasta commit 31945d5d8c5ebee64ebf29c6ea022fb831f47274"
author | rnateam |
---|---|
date | Mon, 21 Sep 2020 15:40:14 +0000 |
parents | |
children |
line wrap: on
line source
>NP_001007355.1 gi|55925472|ref|NP_001007355.1| eukaryotic translation initiation factor 4E-binding protein 3 [Danio rerio] MSNSEASSTCPIPSRSIHEKSWSPLPDSYSQTPGGTVFSTTPGGTRIIYDRKFLLECRNS PIARTPPCCLPDIPGVTRPSLQIIEQEEDSKDLSIDDSQFVIDI >NP_956692.1 gi|41055339|ref|NP_956692.1| transmembrane protein 218 [Danio rerio] MADVVLGVGTGVFIITLIWILTLALTIILSRATGPTKLGIIPVVLLALIITLVLVFFPRA AEVPAPQRAAQIVDMFFIGRYVLLSLVSLVFLAALFMLLPLHFLEPIYAKPLRTH >NP_001003767.1 gi|57524633|ref|NP_001003767.1| transmembrane protein 179 [Danio rerio] MAVDNFLFGQCILYFLAFLFGFIAVVPLSENGDDFQGKCLLFTEGIWQNENMTMGKQRFI VEEWGPESSCRFITFVGIVSLILSAVQAWRTFFFLCKGHDDSLFHSFLNLLLSLLVLFVV FVAGTISSVGFSIWCDSVTENGAMPSSCEDLQDTDLELGVENSSFYDQFAIAQFGLWSAW LCWLGLTVLAFLKVYHNHRQQELLESLVQEKELLLGHPLQRSSYVYNRNAMI >NP_001002700.1 gi|50540464|ref|NP_001002700.1| fatty-acid amide hydrolase 2-A [Danio rerio] MALTRFERFLGRLLRAVVWILFAAFKLFAPQQRHGVSRLPPITNPLLLLSAMQLARKIRR KEVTSVEVVQAYIDRIQEVNPLINAMVKDRFSAALQEAAQVDKLIEEETGGEDVLEDRLP LLGVPITVKEAFALQGMPNSTGLLTRRDLVSGADAPSVALLKRAGAIPLGVTNCSELCMW LESHNHLYGITNNPYDFERIVGGSSGGEGSILGAGSSVIGIGSDIGGSIRIPCFFNGIFG HKPSVGIVNNEGQYPPASGQQMGFLCTGPMCRYAEDLIPMLSIMGGPNAEKLSLFTEVDL KKLRFFSVPHNGGSHLVSPVEPQLLHAQKMVVKRLEADLGVKVQELLIPQLKYSFQIWGT MMASPGKDGKPPTTFAELMSEGGKKVWPAWELFKWFLGFSSHTLAAIGLALVELFQSSHP SPFIMQQKESLQQELEELLGTDGVLLYPSHPLIAQKHHHPIFTPFNFSYTGIFNILGLPV TQCPLGLSAEGLPLGVQIVAGKLQDRLSLATALYLEKAFGGWREPGKTTIKP >NP_001003555.1 gi|57525887|ref|NP_001003555.1| centromere protein P [Danio rerio] MEQKYEEDIQKLQQEIEMLEAEQEETLRSIFVQHGDRLQQGVKSACEERGGGGAQQHTLS KLITEVRELEKDLRRQTEINGITLNECFVKTLHKSERKLIQQLRLAGHCGLLLFQVEFAV TEIQEDNVLHRRVTELNIVVDGVEFKDFSAFVSRVEDTKDLLLFFRTLRTFSERCEDRRQ TFQHFQEKYPDVVNLPEGCRSEIMIIRSPQLPGISMTLFWKIHVSKEGVVKPLLDLLLKM PDQALELDTKKVMEKASDYFQSLLQLLGVEASIEGLIRTVCS >NP_997599.1 gi|47058959|ref|NP_997599.1| protein dispatched homolog 2 [Danio rerio] MESGSISRQREDAEMPDSSTTEGPSLEAPQSEIPEVSLCPPDSDSTESQMCPVEIEENQT KSSSPFNSHSSTQLERQVSQGSAYHSPPHKKCPCCGHQQPSQSDVCPGQMNALHQADCAA SPVKTLYSCSPSRLPSCHTKMQCHWLHGSHDGSNHKPVQHHMVTVRNDGLHRIPRSYSQV IVEYPMTVLISCTLVLFACSLAGILTGPLPDFSDPLLGFEPRGTDISVRLATWTRLKQNT GPGKPLSPVPWQLTEKTTTGKDTIKSEPQFRERSRRMLHRDNAEHNFFCNAPGERYAQLV FRSGNSASLWSLKAIYSMCQMEQTQIRSGPQFDKLCQVKSEFYGSMVKNECCPSWSLGNY LAVLNNISSCFSLTSQQVSESLGLLRFCAPYYHDGSLIASCTERSKFGRCASVPHRCKLS SIFQILHYLVDKDFLGPQTVEYKVPSLKYSIVFLPVEKSDSLMNIYLDHLEGHKLTYNNT TITGMDLGIKQKLFKYYLARDSIYPVLAALALLITIGLYLKSLFIAAMSLVAVILSLSTS YFFYKVAFRLTFFPLLNLAAVFVLLGSCLNQALTFVDFWKLQLSHNPPAVPEKRMNRVLQ EMGYLIIVSGLTSSVTFYSGYISSITAVRCYAVYLGSASLINTLFALVWLPCTLILQERY AVLSSNTVGKVAWKPCCSKNAGGFWETSSRKRCLFTFRQKLRTLGRGFSDTSNLLFLKIL PCGVVKFRYIWICWFAVLAAGGTYISCVDPGMKLPTSDSRTTQLFRSSHPFERYDAEYRH QFMFERMKDGEDEPMMLTLIWGIVPSDNGDHFDPKSNGSLSVDPGFNMSSLQAQIWLRDL CGKIQNQTFYSPLSAEQDTAEDNVCFVEHLIHWVSIRRCSESEDAFSFCCNNIPFPYPPR VFEQCLSMMVAEQHAEGRLPSAGGLRFDSEGRIAALVVIFKTVQLYSFNYNRMSQFYQEI LSWFNREISKAPAGLQRGWFVSQLGLYDLQQCLSSETLEVAGFSVALTFALLLLTTWNIP LSVYVSIAVAGSVFATVGLLVLLEWQLNGVEALFISAAAGLSVDFVANYCISYSLAPHSD RLGRVAHSIKRMGCPVATGAGAYFCVGIIMLPATALLFRKLGIFLLLVKCVACGFATFFF QSLCCFFGPQNNCGRITLPCVTQQSTENILSSCSATEPGTNNPAANGAFGCGKGSRVRRS FNKENEGFLCPNQQHHRKRQPVGGREPEQNELQPLACQLSDSFENSTCTSKLSNRPSVLS DDIQFCGLSPKQDYDRVSIEADSTEMCSRHLKGCNPPPALQTSSPYKENMLRLPQDACKE KVLCKKCRGQSRGGLQLWNVSLSSSSSMDEIMITQTTDTVNERSLSMDDHIHKRLLSCQS QSSIEGLEESNDTCLTEVEAAIPQAGKIEDEFQPGHLNGKRDTLRLSLKETVYDLASPGS GRVRTAQSDVPVILPNSKPDMPDVWIKREGKGEGGS >NP_001013313.1 gi|61651744|ref|NP_001013313.1| coiled-coil domain-containing protein 115 [Danio rerio] MRVDENLRLDEQLLLFMEQLEALEEKRQRLNSLIEEGWFSIAKARYSMGNKQVSALQYAS EMQPLAHVETSLLEGGTAEFKCERSENKAEEQKTKTIEDIGAKETGLRRRVHTKQKEVKE GEQDTDEVKTKTDSPTPEHRNPLKWFGILVPQNLKQAQSAFKEVITLSVEIASLQSTILA TRKEMQVQMKEKQERTEKAQLEVKEE >NP_991238.1 gi|45387769|ref|NP_991238.1| pituitary homeobox 3 [Danio rerio] MDFNLLTDSEARSPALSLSDSGTPQHDPGCKGQDNSDTEKSHQNHTDESNPEDGSLKKKQ RRQRTHFTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRER NQQAELCKNGFGAQFNGLMQPYDDMYSGYSYNNWATKSLASSPLSAKSFPFFNSMNVSPL SSQPMFSPPSSIPSMNMASSMVPSAVAGVPGSGLNNLGNLNNLNSPTLNSAAVSAAACPY ATTAGPYMYRDTCNSSLASLRLKAKQHANFAYPAVQNPVSNLSPCQYAVDRPV >NP_001244093.1 gi|380503827|ref|NP_001244093.1| blood vessel epicardial substance isoform 2 [Danio rerio] MSNTTSALPSSVPAVSLDPNATLCQDWEQSHHLLFHLANLSLGLGFLIPTTLALHMIFLR LLLMTGCSLFIAWATLYRCTLDVMVWNVVFLLVNFMHFFFLLYKRRPIKIDRELKSVYKR MFEPLHVREALFQRLTGQFCTIQTLKKGQVYAAEDKTSVDERLSILLKGKMKVSYRGHFL HNIYTNAFIDSPEFRSTQMNRGERFQVTIAAEENCKLLCWSRERLTYFLESESFLNEVFR YLIGKDITNKLYSLNDPTLSDKAVKKMDRQPSLCSQLSMMQMRNSMASTSDTDDVLNQIL RGGSTGSSLQKNPLTKTSTTMKPIEEGLEDDVFESESPTTSQNVSKTTKKDI >NP_001013309.2 gi|157042782|ref|NP_001013309.2| tRNA 2'-phosphotransferase 1 [Danio rerio] MDCETRGRGRRGRGNRNEESRDVRLSKSLSYVLRHGASKMGLQMNSDGFVFVEELLAHQQ FRSFSVDDVERVVASNDKQRFKLCKHPEDDRLQIRANQGHSVQVTDLELREISQDDQDYP REAVHGSYMKHWPSIRSQGLSRMNRTHIHLAPGLPGEGRVISGMRQSCDLAVYIDVTKAM SDGIKFFWSENGVLLTPGDAAGILAPCYFSRAQRLKPLPCDIELH >NP_001001847.2 gi|380503821|ref|NP_001001847.2| blood vessel epicardial substance isoform 1 [Danio rerio] MSNTTSALPSSVPAVSLDPNATLCQDWEQSHHLLFHLANLSLGLGFLIPTTLALHMIFLR LLLMTGCSLFIAWATLYRCTLDVMVWNVVFLLVNFMHFFFLLYKRRPIKIDRELKSVYKR MFEPLHVREALFQRLTGQFCTIQTLKKGQVYAAEDKTSVDERLSILLKGKMKVSYRGHFL HNIYTNAFIDSPEFRSTQMNRGERFQVTIAAEENCKLLCWSRERLTYFLESESFLNEVFR YLIGKDITNKLYSLNDPTLSDKAVKKMDRQPSLCSQLSMMQMRNSMASTSDTDDVLNQIL RGGSTGSSLPVTSDRA >NP_001015061.1 gi|62632729|ref|NP_001015061.1| putative all-trans-retinol 13,14-reductase precursor [Danio rerio] MWFAVVAIFLALVAFLYRYVVGSGPNPFAIDTREPLKPMVFDRKLKNKVLKQGFLASRVP EDLDAVVVGSGIGGLAIAVLLAKVGKKVLVLEQHDRAGGCCHTFKEQGFEFDVGIHYIGE LSNHKPLRCIIDQMTNGQLQWDPLENPFDNVVIGPPENRRIYQIYSGRKRYMDELKKCFP GEEKAIDEYVRLCKEVGQGVWVMVLLKFLPTPIANFLVRTGLANRLTSFSRYASRSLTDV VNELTQNKDLRAVLSYIFGTYGKIPKEASFSMHSLIVNHYMNGAWYPKGGATEIAYHMIP IIEKAGGAVLVRAPVNRILLNDAKEAIGVSVLKGQEEVHVRAPIVISDAGIFNTYEYLLP KDVQTMPAIQKQLSMLQHGDSGLSIFIGLDGTKEELGLKADNYFIYPENNIDELLEDYRS GNREESAKKNPLIFVASPSAKDSTWPERTPGKSTLTVVSFANYEWFEEWKDDKVKNRSTD YKQLKELFINYILEAVTEIYPKIKDRIEYVDAGTPITNQHYIAAPRGEIYGADHGIPRFS AELNATIRAQTPIKNLYLTGQDLMLCGFAGALTGALTCGSVILNRNLHLEAFSLAKRVQN GNNKKKT >NP_001003580.1 gi|57525791|ref|NP_001003580.1| kelch-like protein 15 [Danio rerio] MSGDVEVYLSQVHDGSVSSGFRALYEERLLLDVTLLIEEHHFQAHKALLATQSDYFRVMF TADMRERDQDKIHMKGLTAAGFGHVLRFMYYGSLELSMLTVQEILQAAMYVQLTEAVEFC CSFLLAKICLENCAEVMRLLEDFSVGVEGVQEQLDAFLLENFVPLMARPDFLSYLSLEKL MAYLDSDQLSRYPEIELYEAVQAWLRHDRRRWRHTDAVVQNLRFCLMTPANIFEKVKTSE FYRYSRQLRLEVDQALSYFHQVNEQPLAETKSNRIRSVRPQTAVFRGMIGHSMVNSKILL LHRPKVWWELEGPQVPLRPDCLAIVNNFAFLLGGEELGPDGEFHASSKVYRYDPRQNSWL RMADMSVPRSEFAVGVIGKYIYAVAGRTRDETFYSTERYDIVEDKWEFVDPYPVNKYGHE GTVLNGKLYITGGITSSSTSKQVCVFDPGREGSSEHRTRRTPILTNCWENKSKMNYARCF HKMISHNGKLYVFGGVCVILRASFESQGCPSTEVYDPETDEWTILASMPIGRSGHGVAVL DKQIMVLGGLCYNGHYSDSILTFDPEENKWKEDEYPRMPCKLDGLQVCSLHFPEYVLEHV RRCS >XP_006779743.1 gi|583968567|ref|XP_006779743.1| PREDICTED: CCAAT/enhancer-binding protein alpha-like [Neolamprologus brichardi] MELSNLYEVAPRPLMNNLNQQPPSGYRDPADLGGEIGDNETSIDLSAYIDPSAFNDDFLA DLFHHSSRQDKLKMMNGEYDPVSCGPGPQQLYMSNYMESKMEPLYEHNPPRLRPVAIKQE PRDDEDMNPGMPPTYHHPHPHPHPQQYSQQQQMPHLQYQIAHCAQTTMHLQPGHPTPPPT PVPSPHQHQHSHPHSHQGGMKLLEQQRGCGKTKKHVDKNSPEYRLRRERNNVAVRKSRDK AKMRNMETQHKVVELTADNDRLRRRVEHLTRELDTLRGIFRQLPDGSFKPMGS >XP_006779744.1 gi|583968570|ref|XP_006779744.1| PREDICTED: ras-related protein Rab-8B-like, partial [Neolamprologus brichardi] SLSGIDFKIRTIELDGKKIKLQIWDTAGQERFRTITTAYYRGAMGIMLVYDITNEKSFDN IKNWIRNIEEHASADVEKMVLGNKCDMNDKRQVSKERGEKLAIDYGIKFLETSAKSSINV EEGFYTLARDIMARLNRKMNDNNPSGGGGPVKITEPRSKKSLFRCSLL >XP_006779746.1 gi|583968574|ref|XP_006779746.1| PREDICTED: calcium and integrin-binding family member 2-like [Neolamprologus brichardi] MGNKQTTFTEEQLEAYQDCTFFTRKEILRLHARYRELAPHLVPLDYTNNPDIKVPMTLIV TMPELKVQFYRYRIVQVLWQLSTESSRWGSGPDFNRDNFICKEDLEKTLNKLTKGELMPE EVTLVCDKAIEEADLDGDHKLSFADFENMISKAPDFLSNFHIRI >XP_006779747.1 gi|583968576|ref|XP_006779747.1| PREDICTED: corticosteroid 11-beta-dehydrogenase isozyme 2-like [Neolamprologus brichardi] MEDYTLPFWIYLVIVTVFIGGAMKKILASHLNTTSTVVAWLGATVLVERLWAFCLPAMLL LVLFGITFCIYYATKTSQPRAMLPAHGKAVIITGCDSGFGNATAKHLDSLGFEVFATVLD LNGDGAKELQRTCSHRLTLLQVDITQPQQVQQALLDTKAKLGLKGLWALVNNAGVCVNFG EVELSLMSNYRGCMEVNFFGTLSITKAFLPLLRQTKGRIVTISSPAGDQPFPCLAAYGAS KAALNLITETLRHELEPWGVQVSTILPSSYRTAQSTNSAYWEKQHKHLLQNLSPALLEDY GEEYMTETKDLFQTFAKHTTTNLQPVVDTIVQALLAPQPQPRYFAGAGLSLMYFLYAYFP YSMSNNFLKKKFLKKNVIPRALRKQSAFDLNLSLHNNNNEEKLQQM >XP_006779748.1 gi|583968578|ref|XP_006779748.1| PREDICTED: transient receptor potential cation channel subfamily M member 1-like [Neolamprologus brichardi] MYIRVSFDSKPDSLLHLMVKDWQLELPTLLISVHGGLQNFDLPPKLKQVFGKGLIKAAVT TGAWIFTGGVSTGVIRHVGDALKDHSSKSRGKVCAIGIAPWGIVENKEDLIGRDVTRPYQ TMSNPLSKLSVLNSSHSHYILADNGTCGKYGAEVRLRRQLEKHISLQKINTRLGQGVPVV CLIVEGGPNVISITLESLKEEPPVPVVVCDGSGRASDILSFAHRYCEEDG >XP_006779749.1 gi|583968580|ref|XP_006779749.1| PREDICTED: chymotrypsin B-like [Neolamprologus brichardi] MAFLWIVSCLAFVGAAYGCGTPAIPPRVTGYARIVNGEEAVPHSWPWQVSLQQTNGFHFC GGSLISEQWVVTAAHCNVRTYHNVIVGEHNKGYGSTENIQVLKPAKVFTHPSWNPQTINN DITLIKLASPARLGTNVSPVCLADTTDSFAAGMKCVTTGWGLTRYNAPSTPNNLQQAALP LLSNEECKKHWGSNISDVMICAGGAGATSCMGDSGGPLVCQKDNVWTLVGIVSWGSSRCS TSTPAVYARVTKLRGWVDQILASN >XP_006779750.1 gi|583968582|ref|XP_006779750.1| PREDICTED: agouti-related protein-like [Neolamprologus brichardi] MFGTVLLCCWSFGLLPLASSLVHGNLPLDEGPVAGRRTETFLSEIERSQVPDRMHEPALL PVDSVEDHFLMDTGSYDEDTSAALQLQGRAMRSPRRCIPHQQSCLGYPLPCCDPCDTCYC RFFNAICYCRRVGHVCPPRRT >XP_006779751.1 gi|583968584|ref|XP_006779751.1| PREDICTED: EMILIN-1-like [Neolamprologus brichardi] MAALPLLLLLVLWTCGNAKGAFPLRQSYNLYTNGHAHGARAASRHRNWCAFVVTKTVSCV VEDGVETYVKPDYHPCSWGSGQCSRVVVYRTYMRPRYKVAYKMVTEMDWKCCHGYSGADC NIGPVGGGGTQISTTRPQPGQGGGTTSGQGGGGHSYGGGSSGSGQSGGNADNEKMRQLEE KIRSLTKNLQDLQSTMSTMNERLQEEGGRNGFGERSSGGRNPADAAQPEIKETIHSIQTK LDQLDNRTQAHDKTLVSINNHLVNGKGNELEGGASGGSLSEGRLNSLKEEILSKLERRVS LSCSSCQAGVEDLRKQQQQDRERIRALEKQMNAMDVQYRQSLDGLRRDVVRSQGCCDIIS DLQDRVTDAERKISTASENFDILQNRLDREISGQGGTSENTGSRGQGLPVGGETGGHGRD AMITEEHLNNRLKDLERRVNSTMQKTEESCSYLENHVKDYFHRELDELRSVFLERFDDQA DRITDVELDVEQVKDSISDHDKRLSKLENTTSQMSWRLEKCGCVASEQGGGGEGRGRGDG GYGGGSWGAGGGGSTGEGKDGGNRGDGGGTWGAGGGGGGSTGGGGRWGGTGGGLPGTGGE KDNSTKKSLEWRVVANEDQIRHFNTQLKDLSMSGDSLYDKVLDLTDDVGKIKALTGDHGE HFNRIVTVVEMLGEDCELCGKVEKELQKMRNYSQNALSNIQNHINRIQNRMDSEGDSCFQ MCSVLQSEVSVLRDDVRRCTNQCKSNPDMTTGVDHARPGGTDDNSGPLDPAKPLDGHSVI EGINNNHLKTLQGELSNVILTFSSINDTLKGLEHTVQKHDSVITDLGNTKDKIISEIDKV QQELTEHIEDNRNRLDKMDRDIRRFESTVLEMGDCKRSGDGLEKRLSKLEGVCGRLDGVS DSILKIKEGLNKHVSSLWTCVSGLNDTVIRHGGLLDFIQDGQDDIHSRVKNLNSSLNQVS RDLQSFSEHDLTGPPGPQGPQGHPGERGFNGPPGLPGPPGFPGPRGEIGPHGPKGETGLP GADAQIPKLSFSAALTAPMDRAGTIVFDKVFVNEGNFYNPRTGIFTAPVDGNYYFSAVLT GHRNEKIEAVLSKSNYGMARVDSGGYQPEGLENNPVAEAKVNPGSLAVFSIILPLQTQDT VCIDLVMGKLAHSVEPLTVFNGMLLYENK >XP_006779752.1 gi|583968586|ref|XP_006779752.1| PREDICTED: zinc finger protein 507-like isoform X1 [Neolamprologus brichardi] MEEITNVITHSSAASSSSSTSGSHTRQTKEKQPSQGFQQKTADDSLIQVIKKLSKIVEKR PQRRCASGGQKRALQVGERGAEQGGGSICKKIKRNLKDEVGVERSTDDSSLPSPWSGDDN NNVTTAVAEVAANPNSSDLKRTVTCYQCSLCPHLSQTLPLLKEHLKQHNEQHSDLILMCS ECHFTSRDHEQLEAHVRMHFDNGDNQKRKYPVSEAKEEVLKNQDVDLTGDNCSAGTEVKK SSVSNAKELPQKKKWYSYEEYGLYRCLICSYVCSQQRMLKTHAWKHAGLVDCSYPIFEDE DGGSAKREVQAAPNNASAREEIVVLQDKSLQKLPTGFKLQLCMPVAVEDKQEVVNLQGSH LSESPKTEEEDEYPIKDMTSEEPAVEVQVTTEAETEVELGGHHESTSATDSLLSSAQKII NRSPNSAGHINVIVERLPSAEDSVMASNPLLLSPDVDGDKSLLEKKAEEQEHVEGVKDEV VLCYSPGNANKSQHLGADIKPSIAKSNDLPRDENVPPAGRKRTHSESLRLHSLAAEVLVA MPMRTPELPNSGAKVALKTVAAQAQSPQAGQKPTEGAAAGQKASDVGTAAAMLNCNEGRE ETLGSLGLGKGDDDGPAANGGISLSLLTVIERLRERSDQNTSDEDILKELQDNAQFQSGA GVVAANGAGSYVCSSVPGMDGLVGSPDSGLVDYIPGSDRPYRCRLCRYSSGNKGYIKQHL RVHRQREPYQCPICEHIASDSKDLENHMIHHCKSRMYQCKQCPDAFHYKSQLRNHEREHH SFSGDVEMLTPVAETAAAMEETERVTYEEGSPQKMFKCDVCNYTSSTYVGVRNHRRIHNS DKPYRCCSCDFATTNMNSLKSHMRRHPQEHQAVQLLEQYRCSLCGYVCSHPPSLKSHMWK HAGDQNYNYEQVNKAINEAISQSSR