Mercurial > repos > iuc > virannot_blast2tsv
view test-data/rps_s1_out.tab @ 4:bb29ae8708b5 draft default tip
planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/virAnnot commit 7036ce0e06b6dc64332b1a5642fc58928523c5c6
author | iuc |
---|---|
date | Tue, 13 May 2025 11:52:17 +0000 |
parents | f8ebd1e802d7 |
children |
line wrap: on
line source
#query_id query_length cdd_id hit_id evalue startQ endQ frame description superkingdom ds2020-267_2 2436 pfam02123 gnl|CDD|280316 2.04111e-21 184 1476 1 pfam02123, RdRP_4, Viral RNA-directed RNA-polymerase. This family includes RNA-dependent RNA polymerase proteins (RdRPs) from Luteovirus, Totivirus and Rotavirus. Viruses(1);Riboviria(1);Orthornavirae(1);Duplornaviricota(1) ds2020-267_4 2297 pfam00680 gnl|CDD|279070 3.12197e-05 995 1873 -2 pfam00680, RdRP_1, RNA dependent RNA polymerase. Viruses(1);Riboviria(1);Orthornavirae(1);Pisuviricota(1) ds2020-267_5 2029 pfam00680 gnl|CDD|279070 8.86955e-06 840 1706 3 pfam00680, RdRP_1, RNA dependent RNA polymerase. Viruses(1);Riboviria(1);Orthornavirae(1);Pisuviricota(1) ds2020-267_6 1860 pfam02123 gnl|CDD|280316 1.27376e-17 1147 1764 -1 pfam02123, RdRP_4, Viral RNA-directed RNA-polymerase. This family includes RNA-dependent RNA polymerase proteins (RdRPs) from Luteovirus, Totivirus and Rotavirus. Viruses(1);Riboviria(1);Orthornavirae(1);Duplornaviricota(1) ds2020-267_8 1703 pfam00680 gnl|CDD|279070 3.19349e-12 685 1458 -3 pfam00680, RdRP_1, RNA dependent RNA polymerase. Viruses(1);Riboviria(1);Orthornavirae(1);Pisuviricota(1) ds2020-267_75 425 pfam00005 gnl|CDD|306511 3.70622e-07 129 275 -1 pfam00005, ABC_tran, ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporters are composed of two copies of this domain and two copies of a transmembrane domain pfam00664. These four domains may belong to a single polypeptide as in CFTR, or belong in different polypeptide chains. Bacteria(2);cellular organisms(1);Terrabacteria group(1) ds2020-267_94 386 pfam01347 gnl|CDD|279663 0.000262768 129 275 -1 pfam01347, Vitellogenin_N, Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitellin, that contains two structural domains. cellular organisms(1);Eukaryota(1);Opisthokonta(1);Metazoa(1) ds2020-267_97 380 pfam04879 gnl|CDD|282703 2.77416e-08 125 274 -2 pfam04879, Molybdop_Fe4S4, Molybdopterin oxidoreductase Fe4S4 domain. This domain is found in formate dehydrogenase H for which the structure is known. This first domain (residues 1 to 60) of Structure 1aa6 is an Fe4S4 cluster just below the protein surface. Bacteria(2);cellular organisms(1);Pseudomonadota(1) ds2020-267_98 379 pfam16203 gnl|CDD|318443 8.05104e-30 131 280 -1 pfam16203, ERCC3_RAD25_C, ERCC3/RAD25/XPB C-terminal helicase. This is the C-terminal helicase domain of ERCC3, RAD25 and XPB helicases. cellular organisms(2);Bacteria(1);Terrabacteria group(1) ds2020-267_100 376 pfam00401 gnl|CDD|306831 6.62013e-05 81 215 -3 pfam00401, ATP-synt_DE, ATP synthase, Delta/Epsilon chain, long alpha-helix domain. Part of the ATP synthase CF(1). These subunits are part of the head unit of the ATP synthase. This subunit is called epsilon in bacteria and delta in mitochondria. In bacteria the delta (D) subunit is equivalent to the mitochondrial Oligomycin sensitive subunit, OSCP (pfam00213). cellular organisms(2);Eukaryota(1);Viridiplantae(1) ds2020-267_114 347 pfam00471 gnl|CDD|306877 8.86568e-13 132 302 3 pfam00471, Ribosomal_L33, Ribosomal protein L33. cellular organisms(2);Bacteria(1);Eukaryota(1) ds2020-267_117 344 pfam00252 gnl|CDD|306711 1.17482e-22 107 295 2 pfam00252, Ribosomal_L16, Ribosomal protein L16p/L10e. cellular organisms(2);Eukaryota(1);Viridiplantae(1) ds2020-267_118 343 pfam00421 gnl|CDD|306845 7.93928e-41 92 337 -1 pfam00421, PSII, Photosystem II protein. cellular organisms(1);Eukaryota(1);Viridiplantae(1);Streptophyta(1) ds2020-267_120 339 pfam01333 gnl|CDD|307480 0.000362606 197 325 -3 pfam01333, Apocytochr_F_C, Apocytochrome F, C-terminal. This is a sub-family of cytochrome C. See pfam00034. cellular organisms(1);Eukaryota(1);Viridiplantae(1);Streptophyta(1) ds2020-267_130 330 pfam00680 gnl|CDD|279070 4.51414e-05 124 282 1 pfam00680, RdRP_1, RNA dependent RNA polymerase. Viruses(1);Riboviria(1);Orthornavirae(1);Pisuviricota(1) ds2020-267_139 320 pfam05860 gnl|CDD|310447 1.29746e-13 167 298 2 pfam05860, Haemagg_act, haemagglutination activity domain. This domain is suggested to be a carbohydrate- dependent haemagglutination activity site. It is found in a range of haemagglutinins and haemolysins. Bacteria(2);cellular organisms(1);Pseudomonadota(1) ds2020-267_312 252 pfam00585 gnl|CDD|278982 1.42752e-05 29 166 2 pfam00585, Thr_dehydrat_C, C-terminal regulatory domain of Threonine dehydratase. Threonine dehydratases pfam00291 all contain a carboxy terminal region. This region may have a regulatory role. Some members contain two copies of this region. This family is homologous to the pfam01842 domain. Bacteria(2);cellular organisms(1);Pseudomonadota(1) ds2020-267_315 251 pfam13188 gnl|CDD|315779 0.000739897 32 241 2 pfam13188, PAS_8, PAS domain. Bacteria(2);cellular organisms(1);Pseudomonadota(1) ds2020-267_316 251 pfam02123 gnl|CDD|280316 3.2928e-08 28 228 -3 pfam02123, RdRP_4, Viral RNA-directed RNA-polymerase. This family includes RNA-dependent RNA polymerase proteins (RdRPs) from Luteovirus, Totivirus and Rotavirus. Viruses(1);Riboviria(1);Orthornavirae(1);Duplornaviricota(1) ds2020-267_318 251 pfam00252 gnl|CDD|306711 7.50297e-12 78 206 -1 pfam00252, Ribosomal_L16, Ribosomal protein L16p/L10e. cellular organisms(2);Eukaryota(1);Viridiplantae(1) ds2020-267_323 250 pfam00227 gnl|CDD|306690 4.91252e-09 10 150 -2 pfam00227, Proteasome, Proteasome subunit. The proteasome is a multisubunit structure that degrades proteins. Protein degradation is an essential component of regulation because proteins can become misfolded, damaged, or unnecessary. Proteasomes and their homologs vary greatly in complexity: from HslV (heat shock locus v), which is encoded by 1 gene in bacteria, to the eukaryotic 20S proteasome, which is encoded by more than 14 genes. Recently evidence of two novel groups of bacterial proteasomes was proposed. The first is Anbu, which is sparsely distributed among cyanobacteria and proteobacteria. The second is call beta-proteobacteria proteasome homolog (BPH). cellular organisms(2);Eukaryota(1);Opisthokonta(1) ds2020-267_329 249 pfam13173 gnl|CDD|315764 2.6724e-08 106 249 1 pfam13173, AAA_14, AAA domain. This family of domains contain a P-loop motif that is characteristic of the AAA superfamily. Bacteria(2);cellular organisms(1);FCB group(1) ds2020-267_336 248 pfam00113 gnl|CDD|278539 3.9331e-13 15 116 -1 pfam00113, Enolase_C, Enolase, C-terminal TIM barrel domain. cellular organisms(2);Bacteria(2) ds2020-267_352 245 pfam00946 gnl|CDD|307203 3.13472e-05 1 141 1 pfam00946, Mononeg_RNA_pol, Mononegavirales RNA dependent RNA polymerase. Members of the Mononegavirales including the Paramyxoviridae, like other non-segmented negative strand RNA viruses, have an RNA-dependent RNA polymerase composed of two subunits, a large protein L and a phosphoprotein P. This is a protein family of the L protein. The L protein confers the RNA polymerase activity on the complex. The P protein acts as a transcription factor. Viruses(1);Riboviria(1);Orthornavirae(1);Negarnaviricota(1) ds2020-267_363 243 pfam00416 gnl|CDD|306841 5.30772e-05 15 134 -2 pfam00416, Ribosomal_S13, Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes. cellular organisms(2);Bacteria(2) ds2020-267_364 243 pfam00216 gnl|CDD|306682 1.89202e-10 134 241 -3 pfam00216, Bac_DNA_binding, Bacterial DNA-binding protein. Bacteria(2);cellular organisms(1);Pseudomonadota(1) ds2020-267_365 243 pfam13041 gnl|CDD|315669 0.000344884 134 241 -3 pfam13041, PPR_2, PPR repeat family. This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR. cellular organisms(1);Eukaryota(1);Viridiplantae(1);Streptophyta(1) ds2020-267_369 243 pfam12137 gnl|CDD|314930 3.71293e-05 137 217 -3 pfam12137, RapA_C, RNA polymerase recycling family C-terminal. This domain is found in bacteria. This domain is about 360 amino acids in length. This domain is found associated with pfam00271, pfam00176. The function of this domain is not known, but structurally it forms an alpha-beta fold in nature with a central beta-sheet flanked by helices and loops, the beta-sheet being mainly antiparallel and flanked by four alpha helices, among which the two longer helices exhibit a coiled-coil arrangement. cellular organisms(1);Bacteria(1);Pseudomonadota(1);Gammaproteobacteria(1) ds2020-267_370 242 pfam00146 gnl|CDD|306623 2.12078e-10 22 111 1 pfam00146, NADHdh, NADH dehydrogenase. cellular organisms(1);Eukaryota(1);Opisthokonta(1);Metazoa(1) ds2020-267_374 242 pfam00124 gnl|CDD|306604 4.44151e-07 21 125 3 pfam00124, Photo_RC, Photosynthetic reaction centre protein. cellular organisms(1);Eukaryota(1);Viridiplantae(1);Streptophyta(1) ds2020-267_388 241 pfam02123 gnl|CDD|280316 5.78854e-08 35 214 -1 pfam02123, RdRP_4, Viral RNA-directed RNA-polymerase. This family includes RNA-dependent RNA polymerase proteins (RdRPs) from Luteovirus, Totivirus and Rotavirus. Viruses(1);Riboviria(1);Orthornavirae(1);Duplornaviricota(1) ds2020-267_402 239 pfam06122 gnl|CDD|310603 1.30391e-05 29 172 2 pfam06122, TraH, Conjugative relaxosome accessory transposon protein. The TraH protein is thought to be a relaxosome accessory component, also necessary for transfer but not for H-pilus synthesis within the conjugative transposon. cellular organisms(1);Bacteria(1);Pseudomonadota(1);Gammaproteobacteria(1) ds2020-267_404 239 pfam00361 gnl|CDD|306795 3.63199e-05 70 219 1 pfam00361, Proton_antipo_M, Proton-conducting membrane transporter. This is a family of membrane transporters that inlcudes some 7 of potentially 14-16 TM regions. In many instances the family forms part of complex I that catalyzes the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane, and in this context is a combination predominantly of subunits 2, 4, 5, 14, L, M and N. In many bacterial species these proteins are probable stand-alone transporters not coupled with oxidoreduction. The family in total represents homologs across the phyla. cellular organisms(1);Eukaryota(1);Opisthokonta(1);Metazoa(1) ds2020-267_407 239 pfam00177 gnl|CDD|306646 1.05327e-06 28 126 1 pfam00177, Ribosomal_S7, Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes. cellular organisms(2);Eukaryota(1);Viridiplantae(1) ds2020-267_427 235 pfam03154 gnl|CDD|308660 0.000842762 28 126 1 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity. Eukaryota(1);cellular organisms(1);Opisthokonta(1);Metazoa(1) ds2020-267_428 235 pfam00164 gnl|CDD|278589 1.83229e-23 3 182 3 pfam00164, Ribosom_S12_S23, Ribosomal protein S12/S23. This protein is known as S12 in bacteria and archaea and S23 in eukaryotes. cellular organisms(2);Eukaryota(1);Viridiplantae(1) ds2020-267_436 234 pfam00155 gnl|CDD|306629 0.000251531 3 182 3 pfam00155, Aminotran_1_2, Aminotransferase class I and II. Bacteria(2);cellular organisms(1);Pseudomonadota(1) ds2020-267_444 233 pfam00680 gnl|CDD|279070 0.000703744 3 182 3 pfam00680, RdRP_1, RNA dependent RNA polymerase. Viruses(1);Riboviria(1);Orthornavirae(1);Pisuviricota(1) ds2020-267_457 231 pfam00481 gnl|CDD|306885 0.00063843 3 182 3 pfam00481, PP2C, Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase. Eukaryota(2);cellular organisms(1);Viridiplantae(1) ds2020-267_466 230 pfam00072 gnl|CDD|306560 5.30837e-08 50 208 2 pfam00072, Response_reg, Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain. Bacteria(2);cellular organisms(1);Pseudomonadota(1) ds2020-267_471 230 pfam00201 gnl|CDD|278624 2.93544e-07 46 210 1 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase. cellular organisms(1);Eukaryota(1);Viridiplantae(1);Streptophytina(1) ds2020-267_486 228 pfam17035 gnl|CDD|319097 3.87403e-09 108 203 3 pfam17035, BET, Bromodomain extra-terminal - transcription regulation. The BET, or bromodomain extra-terminal domain, is found on bromodomain proteins that play key roles in development, cancer progression and virus-host pathogenesis. It interacts with NSD3, JMJD6, CHD4, GLTSCR1, and ATAD5 all of which are shown to impart a pTEFb-independent transcriptional activation function on the bromodomain proteins. cellular organisms(1);Eukaryota(1);Opisthokonta(1);Metazoa(1) ds2020-267_837 207 pfam04061 gnl|CDD|309259 7.30581e-19 1 159 1 pfam04061, ORMDL, ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER. Orm proteins have been identified as negative regulators of sphingolipid synthesis that form a conserved complex with serine palmitoyltransferase, the first and rate-limiting enzyme in sphingolipid production. This novel and conserved protein complex, has been termed the SPOTS complex (serine palmitoyltransferase, Orm1/2, Tsc3, and Sac1). cellular organisms(1);Eukaryota(1);Opisthokonta(1);Metazoa(1) ds2020-267_883 206 pfam10775 gnl|CDD|313884 0.00091969 1 159 1 pfam10775, ATP_sub_h, ATP synthase complex subunit h. Subunit h is a component of the yeast mitochondrial F1-F0 ATP synthase. It is essential for the correct assembly and functioning of this enzyme. Subunit h occupies a central place in the peripheral stalk between the F1 sector and the membrane. cellular organisms(1);Eukaryota(1);Opisthokonta(1);Fungi(1) ds2020-267_1259 1481 pfam02123 gnl|CDD|280316 2.17343e-21 184 1476 1 pfam02123, RdRP_4, Viral RNA-directed RNA-polymerase. This family includes RNA-dependent RNA polymerase proteins (RdRPs) from Luteovirus, Totivirus and Rotavirus. Eukaryota(1) ;Viruses(1);