Mercurial > repos > matteoc > agame_custom_tools
view pfam_search/clans.txt @ 0:68a3648c7d91 draft default tip
Uploaded
author | matteoc |
---|---|
date | Thu, 22 Dec 2016 04:45:31 -0500 |
parents | |
children |
line wrap: on
line source
1 CL0001 EGF \N EGF superfamily Finn RD, Bateman A anon Members of this clan all belong to the EGF superfamily. This particular superfamily is characterised as having least 6 cysteines residues.\ \ \ \ These cysteine form disulphide bonds, in the order 1-3, 2-4, 5-6, which are essential for the stability of the EGF fold. These disulphide bonds are stacked in a ladder-like arrangement. The Laminin EGF family is distinguished by having an an additional disulphide bond. The function of the domains within this family remains unclear, but they are though to largely perform a structural role. More often than not, there domains are arranged a tandem repeats in extracellular proteins. 2008-09-03 15:50:29 2004-03-17 16:02:08 26 325 6259 696 88541 1 3 CL0003 SAM \N Sterile Alpha Motif (SAM) domain Finn RD anon SAM domains are found in a diverse set of proteins, which include scaffolding proteins, transcription regulators, translational regulators tyrosine kinases and serine/threonine kinases [1-3]. SAM domains are found in all eukaryotes and some bacteria [3] . Structures of SAM domains reveal a common five helical structure. The SAM domain is involved in a variety of functions. The most widespread function is in domain-domain interactions. The SAM domain performs domain-domain interactions using multifarious arrangements of the SAM domain. More recently, the SAM domain within the Smaug protein has been demonstrated to bind to the Nanos 3' UTR translation control element (Rfam:RF00161) [3]. This clan currently only represents the diverse SAM domain family and does not contain the more divergent SAM/Pointed family (Pfam:PF02198). 2008-09-03 15:50:29 2004-03-17 16:21:50 20 126 742 467 11010 1 4 CL0004 Concanavalin \N Concanavalin-like lectin/glucanase superfamily Bateman A anon This superfamily includes a diverse range of carbohydrate binding domains and glycosyl hydrolase enzymes that share a common structure. 2008-09-03 15:50:29 2004-03-17 16:44:11 19 1631 2750 3131 34755 1 5 CL0005 Kazal \N Kazal like domain Finn RD anon Kazal domains are found in both serine protease inhibitors and extracellular regions of agrins. The structure of the Kazal domain is a small alpha/beta fold. Typically the Kazal domain consists of 2 short-helices and a 3-stranded anti-parallel sheet. The fold is contains several disulphide bonds. 2008-09-03 15:50:29 2004-03-17 17:00:11 26 106 337 450 6552 1 6 CL0006 C1 \N Protein kinase C, C1 domain Finn RD anon The members of this clan are all variations of the protein kinase C1 domain that is characterised by a rich cysteine and histidine content. The C1 domain is the N-terminal region of conservation found in protein kinase C domains. This domain is involved in binding many ligands, which include diacylglycerol, phorbol esters and zinc [1]. 2008-09-03 15:50:29 2004-03-17 17:47:56 19 30 728 396 10495 1 7 CL0007 KH \N K-Homology (KH) domain Superfamily Finn RD anon The KH domain is thought to be the second most prevalent RNA binding motif in proteins. The motif is characterised by a conserved GXXXGXXG in the middle of the domain. Structures of KH reveal that the KH domain is arranged as either a beta-alpha-alpha-beta-beta (mini-KH domain) or beta-alpha-alpha-beta-beta-alpha (maxi-KH domain). The secondary elements are separated by at least four loop segments. The second loop is located between beta-1 and al The KH domain can be found either as single or multiple copies. The KH domain usually binds RNA as a multimer. 2008-09-03 15:50:29 2004-03-17 17:58:30 17 312 491 5344 38636 1 9 CL0009 ENTH_VHS \N ENTH/ANTH/VHS superfamily Bateman A, McMahon H anon This clan includes the related ENTH and ANTH domains as well as the VHS domain. The ENTH domain is approximately 150 residues in length and is a solenoid of alpha-helices. The various ENTH domains have various lipid specificities but the key feature that distinguishes it functionally from ANTH domains is its ability to bend membranes. It does this by folding an additional N-terminal helix on lipid binding. The ANTH domain is approximately 300 residues in length and is a PtdIns(4,5)P2 binding domain. It has no membrane bending properties. The VHS (Vps-27, Hrs and STAM) domain is a 140 residue long domain present in the very NH2-terminus of at least 60 proteins. Based on their functional characteristics and on recent data on the involvement of VHS in cargo recognition in trans-Golgi, VHS domains are considered to have a general membrane targeting/cargo recognition role in vesicular trafficking [5]. 2008-09-03 15:50:29 2004-03-18 10:53:33 19 75 85 345 4028 1 10 CL0010 SH3 \N Src homology-3 domain Finn RD anon Src homology-3 (SH3) domains are comprised of about 60 amino acids, performing either an assembly or regulatory role.\ For example, SH3 domains in the Grb2 adaptor protein are essential for protein-protein interactions and\ \ signal transduction in the p21 Ras-dependent growth factor signaling pathway. Alternatively, SH3 performs a regulatory role in the Src family of tyrosine kinases. SH3 domains bind a variety of peptide ligands, many of which contain a PxxP motif. This PxxP motif is flanked by different specificity elements [1]. Structures of SH3 domains, both free and ligand complexed, have provided insights into the mechanism of ligand recognition. The SH3 fold consists of two anti-parallel beta sheets that lie at right angles to each other. Within the fold, there are two variable loops, referred to as RT and n-Src loops. When SH3 binds to its ligand, the proline rich ligand adopts a PPII helix conformation, with the PPII helix structure recognised by a pair of grooves on the surface of the SH3 domain that bind turns of the helix. The SH3 grooves are formed by a series of nearly parallel, well-conserved aromatic residues [1]. 2008-09-03 15:50:29 2004-03-18 11:12:55 20 564 2030 4044 40209 1 11 CL0011 Ig \N Immunoglobulin superfamily Bateman A, Finn RD anon Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involved in protein-protein and protein-ligand interactions. The superfamily can be divided into discrete structural sets, by the presence or absence of beta-strands in the structure and the length of the domains [1]. Proteins containing domains of the C1 and V-sets are mostly molecules of the vertebrate immune system. Proteins of the C2-set are mainly lymphocyte antigens, this differs from the composition of the C2-set as originally proposed [1]. The I-set is intermediate in structure between the C1 and V-sets and is found widely in cell surface proteins as well as intracellular muscle proteins. 2008-09-03 15:50:29 2004-03-18 16:23:40 25 3370 4292 3474 136715 1 12 CL0012 Histone \N Histone superfamily Bateman A anon Members of this clan all possess a histone fold. Generally proteins in this clan are DNA binding. 2008-09-03 15:50:29 2004-04-19 14:28:04 17 742 154 9904 23976 1 13 CL0013 Beta-lactamase \N Serine beta-lactamase-like superfamily Finn RD, Bateman A anon This superfamily contains proteins that have a beta-lactamase fold. This includes beta-lactamases as well as Dala-Dala carboxypeptidases and glutaminases. 2008-09-03 15:50:29 2004-04-19 15:42:28 17 961 303 5474 58501 1 14 CL0014 Glutaminase_I \N Class-I Glutamine amidotransferase superfamily Bateman A anon Most members of this clan are glutaminase enzymes. This superfamily is shown to be related in [1]. The clan also contains the DJ-1/PfpI family that includes the peptidase PfpI that has a catalytic Cys-His-Glu triad that differs from the class I GAT Cys-His-Glu triad. 2008-09-03 15:50:29 2004-04-28 09:27:01 21 334 418 7216 67224 1 15 CL0015 MFS \N Major Facilitator Superfamily Bateman A anon The major facilitator superfamily (MFS) is one of the two largest families of membrane transporters found on Earth [1]. It is present ubiquitously in bacteria, archaea, and eukarya and includes members that can function by solute uniport, solute/cation symport, solute/cation antiport and/or solute/solute antiport with inwardly and/or outwardly directed polarity [1]. All permeases of the MFS possess either 12 or 14 transmembrane helices [1]. 2008-09-03 15:50:29 2004-04-30 16:48:27 19 22 846 6356 249360 1 16 CL0016 PKinase \N Protein kinase superfamily Studholme DJ anon This superfamily includes the Serine/Threonine- and Tyrosine- protein kinases as well as related kinases that act on non-protein substrates. 2008-09-03 15:50:29 2004-06-11 14:28:37 21 3279 6514 9586 173964 1 18 CL0018 bZIP \N bZIP-like leucine zipper Studholme DJ anon This family of eukaryotic transcription factors contain a basic region adjacent to a leucine zipper. 2008-09-03 15:50:29 2004-06-16 18:30:26 14 321 111 611 8901 1 20 CL0020 TPR Tetratrico peptide repeat superfamily Studholme DJ anon Tetratricopeptide-like repeats are found in a numerous and diverse proteins involved in such functions as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. 2008-09-03 15:50:29 2004-06-21 18:12:39 24 947 20914 6771 404043 1 21 CL0021 OB \N OB fold Studholme DJ, Bateman A anon The OB (oligonucleotide/oligosaccharide binding) was defined by Murzin [1]. The common part of the OB-fold, has a five-stranded beta-sheet coiled to form a closed beta-barrel. This barrel is capped by an alpha-helix located between the third and fourth strands [1]. 2008-09-03 15:50:29 2004-06-22 18:31:10 17 1592 988 7656 210543 1 22 CL0022 LRR Leucine Rich Repeat Studholme DJ anon Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains. This Pfam entry contains Leucine Rich Repeats not recognised by the Pfam:PF00560 model. 2008-09-03 15:50:29 2004-06-23 16:13:12 31 250 9511 3145 175606 1 23 CL0023 P-loop_NTPase AAA; P-loop containing nucleoside triphosphate hydrolase superfamily Studholme DJ anon AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes [2]. 2008-09-03 15:50:29 2004-06-23 17:05:20 33 5523 12211 50680 1511292 1 25 CL0025 His_Kinase_A His Kinase A (phospho-acceptor) domain Studholme DJ anon This is the dimerisation and phospho-acceptor domain of a sub-family of histidine kinases. It shares sequence similarity with Pfam:PF00512 and Pfam:PF07536. It is usually found adjacent to a C-terminal ATPase domain (Pfam:PF02518). This domain is found in a wide range of Bacteria and also several Archaea. It comprises one of the fundamental units of the two-component signal transduction system [2-7]. 2008-09-03 15:50:29 2004-06-29 14:19:46 13 497 5781 7648 242300 1 26 CL0026 CU_oxidase \N Multicopper oxidase-like domain Studholme DJ, Finn RD anon Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain. 2008-09-03 15:50:29 2004-06-29 16:37:59 19 1015 245 19953 63536 1 27 CL0027 RdRP \N RNA dependent RNA polymerase Bateman A anon This clan represents the replicative RNA dependent RNA polymerase. from a variety of RNA viruses [1]. 2008-09-03 15:50:29 2004-08-26 14:33:23 14 852 1801 12549 220781 1 28 CL0028 AB_hydrolase Alpha/Beta hydrolase fold Bateman A anon This catalytic domain is found in a very wide range of enzymes. 2008-09-03 15:50:29 2004-08-29 17:32:06 21 1989 2396 7428 180167 1 29 CL0029 Cupin Cupin fold Bateman A anon This clan represents the conserved barrel domain of the 'cupin' superfamily [1] ('cupa' is the Latin term for a small barrel). The cupin fold is found in a wide variety of enzymes, but notably contains the non-enzymatic seed storage proteins also. 2008-09-03 15:50:29 2004-09-06 15:03:53 19 945 1162 6529 112082 1 30 CL0030 Ion_channel \N Ion channel (VIC) superfamily Bateman A anon This superfamily contains a diverse range of ion channels that share a pair of transmembrane helices in common. This clan is classified as the VIC (Voltage-gated Ion Channel) superfamily in TCDB. 2008-09-03 15:50:29 2004-09-08 16:21:26 15 718 809 5250 44250 1 31 CL0031 Phosphatase \N Phosphatase superfamily Bateman A anon This family includes tyrosine and dual specificity phosphatase enzymes. 2008-09-03 15:50:29 2004-10-26 13:53:48 12 480 558 2966 20482 1 32 CL0032 Dim_A_B_barrel \N Dimeric alpha/beta barrel superfamily Bateman A anon This superfamily of proteins possess a Ferredoxin-like fold. Pairs of these assemble into a beta barrel. The function of this barrel is quite varied and includes Muconolactone isomerase as well as monooxygenases. 2008-09-03 15:50:29 2004-10-26 16:31:17 14 510 186 4601 39476 1 33 CL0033 POZ \N POZ domain superfamily Bateman A anon The POZ domain is found in a variety of transcription factors. POZ domains are also found in the tetramerisation domain of voltage gated K+ channels. In general these domains mediate homo-oligomerisation. 2008-09-03 15:50:29 2004-10-27 13:52:11 13 223 1167 1178 26677 1 34 CL0034 Amidohydrolase \N Amidohydrolase superfamily Bateman A anon This family includes a large family of metal dependent amidohydrolase enzymes [1]. 2008-09-03 15:50:29 2004-10-27 17:19:50 14 704 479 5687 79783 1 35 CL0035 Peptidase_MH \N Peptidase clan MH/MC/MF Bateman A anon This clan contains peptidases belonging to MEROPS clan MH, MC and MF. We also include Nicastrin that is part of the gamma secretase complex and not known to be a peptidase. 2008-09-03 15:50:29 2004-10-28 13:48:22 15 680 539 5405 63766 1 36 CL0036 TIM_barrel Common phosphate binding-site TIM barrel superfamily Bateman A anon This large superfamily of TIM barrel enzymes all contain a common phosphate binding site. The phosphate is found in a variety of cofactors and ligands such as FMN [1,2]. 2008-09-03 15:50:29 2004-10-28 15:12:01 23 3964 973 10099 253453 1 37 CL0037 Lysozyme \N Lysozyme-like superfamily Bateman A anon Barley chitinase, bacterial chitosanase, and lysozymes from phage and animals all hydrolyse related polysaccharides. The proteins little amino-acid similarity, but have a structurally invariant core consisting of two helices and a three-stranded beta-sheet which form the substrate-binding and catalytic cleft [1]. 2008-09-03 15:50:29 2004-10-28 15:30:21 13 1502 527 5612 33680 1 39 CL0039 HUP PP-loop; PP-ATPase; HUP - HIGH-signature proteins, UspA, and PP-ATPase. Bateman A, Anantharaman V anon The HUP class contains the HIGH-signature proteins, UspA superfamily and the PP-ATPase superfamily [1]. The HIGH superfamily has the HIGH Nucleotidyl transferases and the class I tRNA synthetases both of which have the HIGH and the KMSKS motif [1],[2]. The PP-loop ATPase named after the ATP PyroPhosphatase domain, was initially identified as a conserved amino acid sequence motif in four distinct groups of enzymes that catalyse the hydrolysis of the alpha-beta phosphate bond of ATP, namely GMP synthetases, argininosuccinate synthetases, asparagine synthetases, and ATP sulfurylases [3]. The USPA superfamily contains USPA, ETFP and Photolyases [1] 2008-09-03 15:50:29 2004-10-29 14:36:02 11 1108 838 6582 177746 1 40 CL0040 tRNA_synt_II \N Class II aminoacyl-tRNA and Biotin synthetases Finn RD anon Aminoacyl-tRNA synthetases are key components of the protein translation machinery that catalyse two basic reactions. First, the activation of amino acids via the formation of aminoacyl adenylates and second, linking the activated amino acid to the cognate tRNAs. The aminoacyl-tRNA synthetases generate AMP as the second end product of this reaction, which differentiates them from the majority of ATP-dependent enzymes that produce ADP. In addition, there is a specific aminoacyl-tRNA synthetases for each of the 20 amino acids and there are two structurally distinct classes of aminoacyl-tRNA synthetases, each\ \ encompassing 10 different specificities. The two classes have alternative modes of aminoacylation: class I aminoacylate the 2'OH of the cognate tRNA; class II aminoacylate 3'OH (with the exception of PheRS). Each class contain a conserved core domain that is involved in ATP binding and hydrolysis and combines with additional domains that determine the specificity of interactions with\ the cognate amino acid and tRNA. The class II core domain consist of a mixed-beta sheet, similar to that found in the biotin synthetases, hence why this family has also been included in this clan. The core domain contains three modestly conserved motifs that are responsible for ATP binding. The class II aminoacyl-tRNA synthetases can contain additional nested domains, found inserted in the loops of the core domain [1] (and reference therein). 2008-09-03 15:50:29 2004-11-08 11:24:57 16 514 340 5927 72316 1 41 CL0041 Death \N Death Domain Superfamily Finn RD anon The death domain superfamily is composed of three families: the death domain (DD); the death effector domain (DED) and the caspase recruitment domain (CARD). All of the members perform a pivotal role in signalling events that regulate apoptosis. Protein-protein interactions are mediated by self-self associations, in which CARD-CARD, DD-DD and DED-DED contacts are formed exclusively The three families possess remarkably similar structures, each comprising an antiparallel six helical bundle in the Greek Key topology. Structurally, the DD and CARD families are the most dissimilar. The former is comprised of two perpendicular three-helix bundles, whereas the latter CARD domain contains six helices that are almost parallel with each other. Interestingly, the interactions in CARD or DD containing heterodimers are quite different [1]. 2008-09-03 15:50:29 2004-11-11 10:28:31 12 135 742 249 6544 1 42 CL0042 Flavoprotein \N Flavoprotein Finn RD anon Members of this clan are FMN or FAD-binding redox proteins. Flavoproteins act in various electron-transport systems as functional analogues of ferredoxin. They are characterised by an open twisted alpha/beta structure consisting of five parallel beta-sheets connected by alpha-helices which surround the sheet. 2008-09-03 15:50:29 2004-11-12 15:13:12 11 497 221 5019 35666 1 43 CL0043 Chelatase \N Chelatase Superfamily Finn RD anon Metallated tetrapyrroles are used as prosthetic groups in proteins involved in biologically important processes such as photosynthesis, oxygen transport, drug metabolism and nitric oxide synthesis. In living organisms, metallation is catalysed by a group of enzymes called chelatases. This clan contains ferrochelatase (heme) and cobalt chelatase [1]. 2008-09-03 15:50:29 2004-11-12 16:49:49 11 77 45 4178 7625 1 44 CL0044 Ferritin Ferritin-like Superfamily Finn RD anon The members of this clan all share a distinctive four helical bundle. The four helices are arranged antiparallel with a left-handed twist. This helical bundle is distinguished from others by the long connection between the second and third helices. Some of the members contain a Fe or Mn dimer at the centre of the helical bundle. The ferritin fold was first described by Murzin AG and Chothia C, Cur Opin Struc Biol 1992, 2:895-903. 2008-09-03 15:50:29 2004-11-12 17:32:27 13 2058 121 6054 34310 1 45 CL0045 Rubredoxin \N Rubredoxin-like Finn RD anon The Rubredoxin clan is comprised of three families:Rubredoxin, COX5B and desulforedoxin.Rubredoxin domains are small domains (5-6 kDa) and bind one iron atom tetrahedrally bound by four cysteine residues.Similar, desulforedoxin domains are small (4 kDa), but usually form homodimers. Each monomer binds one iron atom, but in a distorted tetrahedral arrangement. COX5B domains are membrane-anchored rubredoxin-like domains. The domain in the Rubredoxin clan are usually comprised of 2 alpha helixes and 2-3 beta strands. 2008-09-03 15:50:29 2004-11-15 11:37:59 11 172 52 2672 4680 1 46 CL0046 Thiolase \N Thiolase-like Superfamily Finn RD anon Thiolases are ubiquitous and form a large superfamily. Thiolases can function either degradatively, in the beta-oxidation pathway of fatty acids, or biosynthetically. Biosynthetic thiolases catalyse the formation of acetoacetyl-CoA from two molecules of acetyl-CoA . This is one of the fundamental categories of carbon skeletal assembly patterns in biological systems and is the first step in a wide range of biosynthetic pathways [1]. Thiolase are usually dimeric or tetrameric enzymes. Within each monomer there are two similar domains related by pseudo dyad. The N-terminal of these two domains contains a large insertion of about 100 amino acids. 2008-09-03 15:50:29 2004-11-15 12:47:24 15 575 1810 7516 102610 1 47 CL0047 CuAO_N2_N3 \N Copper amine oxidase, domains 1 and 2 Finn RD anon Copper amine oxidase (CuAO) are comprised of three of four domains. In the case of the four domain CuAO, the N-terminal domain (termed N1, and is not present in the three domain CuAO) and the C-terminal catalytic domain sandwich two repeated domains (termed N2 and N3). The function of these two homologous domains is uncertain. N2 and N3 both have a cystatin-like fold [1]. 2008-09-03 15:50:29 2004-11-15 13:04:24 11 186 31 504 1887 1 48 CL0048 LolA_LolB \N Lipoprotein localisation factors LolA/B Finn RD anon Gram-negative bacteria lipoproteins are anchored to the periplasmic surface of the inner or outer membrane depending on the sorting signal, which is the residue at position 2 of the polypeptide. Five Lol proteins are involved in the sorting and membrane localisation of lipoprotein. An ATP-binding cassette (ABC) transporter, LolCDE, releases outer membrane-specific lipoproteins from the inner membrane, causing the formation of a complex between the released lipoproteins and the periplasmic molecular chaperone LolA. When this complex interacts with outer membrane receptor LolB, the lipoproteins are transferred from LolA to LolB and then localised to the outer membrane. The structures of LolA and LolB are remarkably similar to each other. Both have a hydrophobic cavity consisting of an unclosed beta-barrel and an alpha-helical lid [1,2]. 2008-09-03 15:50:29 2004-11-15 13:13:26 11 10 5 2158 3976 1 49 CL0049 Tudor \N Tudor domain 'Royal family' Finn RD, Bateman A anon This clan covers the Tudor domain 'royal family' [1]. This includes chromo, MBT, PWWP and tudor domains. The chromo domain is a comprised of approximately 50 amino acid residues. There are usually one to three Chromo domains found in a single protein. In some chromo domain containing proteins, a second related chromo domain has been found and is referred to as the Chromo-shadow domain. The structure of the Chromo and Chromo-shadow domains reveal an OB-fold, a fold found in a variety of prokaryotic and eukaryotic nucleic acid binding proteins.\ More specifically,the chromo-domain structure reveals a three beta strands that are packed against an alpha helix. Interestingly, a similar structure is found in the archaeal chromatin proteins (7kDa DNA-binding domain). These are sequence neutral DNA binding proteins.\ The DNA binding in these archaeal proteins is mediated through the triple stranded beta sheet. These archaeal domains are though to represent an ancestral chromo domain. Homologs of the chromo domain have been found in fission yeast, ciliated protozoa and all animal species, but appear to be absent in eubacteria, budding yeast and plants [2]. The precise function of the chromo domain is unclear, but the chromo domain is thought to act as a targeting module for chromosomal proteins, although the chromosomal contexts and functional contexts being targeted vary. In all cases studies, the chromo domains are found in proteins that are involved in transcription regulation, positive and negative [2]. 2008-09-03 15:50:29 2004-11-16 16:02:09 14 325 682 504 17304 1 50 CL0050 HotDog \N HotDog superfamily Bateman A anon The HotDog fold was first observed in the structure of Escherichia coli beta-hydroxydecanoyl thiol ester dehydratase (FabA), where Leesong et al. noticed that each subunit of this dimeric enzyme contained a mixed alpha + beta 'hot dog' fold. They described the seven-stranded antiparallel beta-sheet as the 'bun', which wraps around a five-turn alpha-helical 'sausage', This superfamily contains a diverse range of enzymes. Membership includes numerous prokaryotic, archaeal and eukaryotic proteins involved in several related, but distinct, catalytic activities, from metabolic roles such as thioester hydrolysis in fatty acid metabolism, to degradation of phenylacetic acid and the environmental pollutant 4-chlorobenzoate. The superfamily also includes FapR, a non-catalytic bacterial homologue that is involved in transcriptional regulation of fatty acid biosynthesis [1]. 2008-09-03 15:50:29 2004-11-19 13:30:09 11 673 1411 5349 52323 1 51 CL0051 NTF2 NTF2-like superfamily Bateman A anon This superfamily contains a variety of enzymes such as Scytalone dehydratase, Delta-5-3-ketosteroid isomerase, Limonene-1,2-epoxide hydrolase among others. The family also includes presumed non-enzymatic homologues such as NTF2. 2008-09-03 15:50:29 2004-11-19 15:35:00 13 603 332 4350 23892 1 52 CL0052 NTN \N NTN hydrolase superfamily Bateman A anon In the N-terminal nucleophile aminohydrolases (Ntn hydrolases) the N-terminal residue provides two catalytic groups, nucleophile and proton donor. These enzymes use the side chain of the amino-terminal residue, incorporated in a beta-sheet, as the nucleophile in the catalytic attack at the carbonyl carbon. The nucleophile is cysteine in GAT, serine in penicillin acylase, and threonine in the proteasome. All the enzymes share an unusual fold in which the nucleophile and other catalytic groups occupy equivalent sites. This fold provides both the capacity for nucleophilic attack and the possibility of autocatalytic processing [1]. 2008-09-03 15:50:29 2004-11-19 17:25:18 17 2263 255 5468 47927 1 53 CL0053 4H_Cytokine 4-helical cytokine superfamily Bateman A anon Cytokines are regulatory peptides that can be produced by various cells for communicating and orchestrating the large multicellular system. Cytokines are key mediators of hematopoiesis, immunity, allergy, inflammation, tissue remodeling, angiogenesis, and embryonic development [2]. This superfamily includes both the long and short chain helical cytokines. 2008-09-03 15:50:29 2004-11-21 12:08:45 14 272 33 865 5160 1 54 CL0054 Knottin_1 \N Scorpion toxin-like knottin superfamily Bateman A anon This clan includes a number of toxin families that share the knottin structure. These families come from scorpions, plants and arthropods. 2008-09-03 15:50:29 2004-11-22 17:39:28 12 164 12 369 2025 1 55 CL0055 Viral_ssRNA_CP \N Positive stranded ssRNA viruses coat protein Finn RD anon The clan contains a set of viral coat protein families and peptidase A6. The only known peptidase activity is an autolytic cleavage releasing a 44-residue C-terminal fragment. The reaction is very slow and only occurs within the assembled virion. There is debate whether this is actually a true peptidase. The virion with these coat or capsid\ proteins are icosahedral viruses containing sixty triangular coat protein units, each unit consisting of three proteins. The coat protein consists of two subdomains, an eight-stranded beta-barrel on the surface and a three-helix bundle on the inner face. 2008-09-03 15:50:29 2004-11-24 13:16:32 12 790 144 9475 53300 1 56 CL0056 C_Lectin \N C-type lectin-like superfamily Bateman A anon This clan contains domains that have a C-type lectin fold. Many of these are known or expected to mediate interactions with sugars. 2008-09-03 15:50:29 2004-11-24 13:34:58 11 581 825 748 15749 1 57 CL0057 Met_repress MetJ/Arc repressor superfamily Bateman A anon This superfamily contains the MetJ and Arc repressors that feature a ribbon-helix-helix DNA-binding motif with the beta-ribbon located in and recognising the major groove of operator DNA [1]. 2008-09-03 15:50:29 2004-11-24 15:10:49 13 185 88 3932 18127 1 58 CL0058 Glyco_hydro_tim Tim barrel glycosyl hydrolase superfamily Bateman A anon This large superfamily contains a range of glycosyl hydrolase enzymes that possess a TIM barrel fold. This CLAN merges clans GH-A, GH-D, GH-H and GH-K from CAZy. 2008-09-03 15:50:29 2004-11-25 11:37:58 15 2459 2481 7032 114169 1 59 CL0059 6_Hairpin Six-hairpin glycosidase superfamily Bateman A anon This Clan includes CAZy clans GH-L, GH-M and GH-G. The members of this clan share a common structure composed of 6 helical hairpins. Most members of this superfamily are glycosyl hydrolase enzymes. 2008-09-03 15:50:29 2004-11-26 15:47:32 14 530 867 4822 41246 1 60 CL0060 DNA_clamp \N DNA clamp superfamily Bateman A anon Sliding DNA clamps are ring-shaped proteins that allow DNA polymerase to achieve high processivity during chromosome replication by tethering the polymerase catalytic subunit to DNA. All of the structures share a 12-fold symmetry around the ring consisting of a simple structural repeat, though there is structural divergence in some of the repeats. Bacterial beta-clamps contain six repeats per subunit with two subunits per ring while the eukaryotic and bacteriophage clamps contain four repeats per subunit with three subunits per ring. Pairs of these repeats form a domain, which has been termed the 'processivity fold'; thus the ring of the sliding clamp contains six domains and therefore is often described as having 6-fold symmetry. A structural representative of a fourth family of processivity fold proteins, namely the herpes simplex virus UL42 protein, is also available. UL42 does not form a ring-shaped clamp, however, but rather functions as a monomer and interacts with DNA quite differently than do sliding clamps; it has been suggested that UL42 resembles a primitive ancestor of sliding clamps [2]. 2008-09-03 15:50:29 2004-11-26 17:13:41 11 241 64 5243 18116 1 61 CL0061 PLP_aminotran \N PLP dependent aminotransferase superfamily Bateman A anon This superfamily contains a variety of PLP-dependent enzymes. 2008-09-03 15:50:29 2004-11-29 11:27:35 12 1560 574 7712 152775 1 62 CL0062 APC APC superfamily Bateman A anon This large superfamily contains a variety of transporters including amino acid permeases that according to TCDB belong to the APC (Amino acid-Polyamine-organoCation) superfamily. 2008-09-03 15:50:29 2004-11-29 18:13:15 12 36 417 5587 126928 1 63 CL0063 NADP_Rossmann FAD/NAD(P)-binding Rossmann fold Superfamily Finn RD anon A class of redox enzymes are two domain proteins. One domain, termed the catalytic domain, confers substrate specificity and the precise reaction of the enzyme. The other domain, which is common to this class of redox enzymes, is a Rossmann-fold domain. The Rossmann domain binds nicotinamide adenine dinucleotide (NAD+) and it is this cofactor that reversibly accepts a hydride ion, which is lost or gained by the substrate in the redox reaction. Rossmann domains have an alpha/beta fold, which has a central beta sheet, with approximately five alpha helices found surrounding the beta sheet.The strands forming the beta sheet are found in the following characteristic order 654123. The inter sheet crossover of the stands in the sheet form the NAD+ binding site [1]. In some more distantly relate Rossmann domains the NAD+ cofactor is replaced by the functionally similar cofactor FAD. 2008-09-03 15:50:29 2004-11-30 13:45:28 24 8681 8330 18998 984446 1 64 CL0064 CPA_AT \N CPA/AT transporter superfamily Bateman A anon This Clan contains transporter proteins that belong to the CPA superfamily and AT superfamily according to TCDB [1]. 2008-09-03 15:50:29 2004-12-02 10:12:55 11 10 163 5252 48799 1 65 CL0065 Cyclin \N Cyclin-like superfamily Bateman A anon This Clan contains cyclins, Transcription factor IIB (TFIIB), and the Retinoblastoma tumour suppressor proteins. These were predicted to be related by sequence [1]. 2008-09-03 15:50:29 2004-12-02 13:15:02 14 236 144 804 14080 1 66 CL0066 Trefoil \N Beta-trefoil superfamily Bateman A anon This family corresponds to a large set of related beta-trefoil proteins [1]. The beta-trefoil is formed by six two-stranded hairpins [2]. Three of these form a barrel structure and the other three are in a triangular array that caps the barrel. The arrangement of the secondary structures gives the molecules a pseudo 3-fold axis. 2008-09-03 15:50:29 2004-12-06 15:34:20 13 659 720 1705 12929 1 67 CL0067 SIS \N SIS domain fold Bateman A anon This catalytic domain catalyses isomerisation reactions of a variety of sugars [1]. 2008-09-03 15:50:29 2004-12-06 16:11:44 12 275 84 6149 42474 1 68 CL0068 RIIa \N RIIa-like fold Bateman A anon This clan includes both the RIIa dimerisation motif as well as the Dpy-30-like motif [1]. 2008-09-03 15:50:29 2004-12-07 15:58:47 11 37 67 303 1405 1 69 CL0069 GFP \N GFP-like superfamily Bateman A anon This superfamily has an unusual fold of an 11 stranded beta barrel enclosing an alpha-helix. This superfamily includes green fluorescent protein as well as a domain from nidogen. 2008-09-03 15:50:29 2004-12-08 17:25:21 11 623 171 194 602 1 70 CL0070 ACT \N ACT-like domain Bateman A anon These domains are involved in binding to amino-acids and causing allosteric regulation of linked enzyme domains [1]. The relationship between these two families was first noticed in [2]. 2008-09-03 15:50:29 2004-12-08 17:26:00 12 241 221 4906 47783 1 71 CL0071 His_phosphatase PGM; Histidine phosphatase superfamily Finn RD, Rigden DJ anon The histidine phosphatase superfamily is so named because catalysis centres on a conserved His residue that is transiently phosphorylated during the catalytic cycle. Other conserved residues contribute to a 'phosphate pocket' and interact with the phospho group of substrate before, during and after its transfer to the His residue.\ Structure and sequence analyses show that different families contribute different additional residues to the 'phosphate pocket' and, more surprisingly, differ in the position, in sequence and in three dimensions, of a catalytically essential acidic residue. The superfamily may be divided into two main branches [1]. 2008-09-03 15:50:29 2004-12-09 14:54:21 11 292 168 4773 26470 1 72 CL0072 Ubiquitin \N Ubiquitin superfamily Bateman A anon This family includes proteins that share the ubiquitin fold. It currently unites four SCOP superfamilies. 2008-09-03 15:50:29 2004-12-09 18:04:44 19 1053 1522 6169 107825 1 73 CL0073 P53-like \N Beta-sandwich DNA-binding domain Bateman A anon This clan contains a variety of DNA-binding domains that contain an immunoglobulin-like fold. It includes the DNA-binding domains of NF-kappaB, NFAT, p53, STAT-1, the T-domain and the Runt domain [1]. 2008-09-03 15:50:29 2004-12-10 09:44:45 12 340 136 553 5385 1 74 CL0074 Matrix \N Retroviral matrix superfamily Bateman A anon This clan brings together matrix proteins from a variety of retroviruses. 2008-09-03 15:50:29 2004-12-10 10:07:15 12 41 88 280 38433 1 75 CL0075 Defensin \N Defensin/myotoxin-like superfamily Bateman A anon This clan includes diverse defensins as well as myotoxins. 2008-09-03 15:50:29 2004-12-10 13:40:46 12 146 13 137 1375 1 76 CL0076 FAD_Lum_binding \N Riboflavin synthase/Ferredoxin reductase FAD binding domain Finn RD anon Riboflavin nucleotide coenzymes and flavin adenine dinucleotide (FAD) are essential cofactors for a large number of flavoproteins involved in a diverse set of redox reactions. There are thought to be four different FAD-binding folds [1].The FAD-binding fold of this clan is a cylindrical beta-fold. More specifically, the domain forms a flattened six-stranded antiparallel beta-barrel organised into two orthogonal sheets (1-2-5 and 4-3-6) separated by one alpha-helix. The cylinder is open between strands strand 4 and 5. This opening of the cylinder makes space for the isoalloxazine and ribityl moieties of the FAD, to which hydrogen bonds are formed from the open edges of the strands. The other end of the cylinder is covered by the only helix of the domain, which is essential for the binding of the pyrophosphate groups of the FAD [1].The structural differences in the FAD-binding domain are manifested mainly as loops of different length and extra extending structural elements, which may be important for interactions with their redox partners [1]. The structural core of all clan members is highly conserved. 2008-09-03 15:50:29 2004-12-10 14:19:14 11 202 346 5130 31559 1 77 CL0077 FAD_PCMH \N PCMH-like FAD binding Finn RD anon The FAD-binding domains contained in this family fall within the PCMH (p-cresol methyl-hydroxylase) family of FAD binding proteins as defined in [1]. In this family, the structure of the FAD binding domain is comprised of two subdomains. Both of these subdomains have an alpha-beta fold. The first subdomain is comprised of three parallel beta strands, surrounded by alpha helices. The second subdomain contains five antiparallel beta strands, also surrounded by alpha helices. The junction between these two subdomains forms the FAD bind pocket, where the ligand is bound by hydrogen and van der Waals bonds [1]. 2008-09-03 15:50:29 2004-12-10 16:27:25 11 247 229 5425 25538 1 78 CL0078 DNA_ligase \N DNA/RNA ligase superfamily Bateman A anon This superfamily contains both ATP-dependent and NAD dependent DNA ligase enzymes. The family also includes mRNA capping enzymes. The members of this clan were shown to be related by sequence in [1]. 2008-09-03 15:50:29 2004-12-10 18:04:20 12 72 170 5269 10191 1 79 CL0079 Cystine-knot \N Cystine-knot cytokine superfamily Bateman A anon The cytokine families in this clan have the cystine-knot fold. In this 6 cysteines form three disulphide bridges that are interlinked. 2008-09-03 15:50:29 2004-12-14 17:03:09 12 253 73 2754 10562 1 80 CL0080 Mss4-like \N Mss4-like superfamily Bateman A anon This clan contains TCTP, Mss4 and SelR families [1]. 2008-09-03 15:50:29 2004-12-14 17:24:43 11 51 31 4242 6196 1 81 CL0081 MBD-like \N MBD-like DNA-binding domain Bateman A anon This clan contains proteins with a distinctive three stranded DNA-binding domain [1]. 2008-09-03 15:50:29 2004-12-14 17:44:54 12 24 149 2355 10578 1 82 CL0082 MIF \N Tautomerase/MIF superfamily Bateman A anon This clan groups 5-(carboxymethyl)-2-hydroxymuconate isomerase (CHMI) and 4-oxalocrotonate tautomerase (4-OT) with macrophage inhibitory factor (MIF). Interestingly they all share an amino-terminal proline. Members of this clan for homotrimers [1]. 2008-09-03 15:50:29 2004-12-15 11:44:36 11 347 29 3218 5290 1 83 CL0083 Omega_toxin Omega toxin-like Finn RD anon This clan contains a set of related small protein toxins and what appears to be the functionally distinct Albumin I domain. All members of this clan have a knottin-like fold. Additional information about this clan may be found from [1]. 2008-09-03 15:50:29 2004-12-16 13:53:46 14 83 23 290 1853 1 84 CL0084 ADP-ribosyl \N ADP-ribosylation Superfamily Finn RD anon The members of this clan all represent ADP-ribosylating catalytic domains. The structurally conserved regions are located at the NAD binding region [1]. According to SCOP, the ADP-ribosylation domain is thought to have an "unusual fold". 2008-09-03 15:50:29 2004-12-16 14:31:35 12 208 320 842 3271 1 85 CL0085 FAD_DHS \N DHS-like NAD/FAD-binding domain Finn RD anon The members of this family adopt a Rossmann fold, similar to CLAN:CL0063. However, the members of this family are distinguished in that the FAD/NAD cofactor is bound in the opposite direction. In this arrangement, the adenosine moiety is found bound at the second half of the fold.\ In addition, the conserved GxGxxG motif found in classical NADP binding Rossmann folds is absent. Finally, another distinguishing characteristic is the formation of an internal hydrogen bond in the FAD molecule [1]. 2008-09-03 15:50:29 2004-12-16 15:32:59 13 399 174 5116 30822 1 86 CL0086 FAD_oxidored \N FAD-linked oxidoreductase Finn RD anon The members of this clan adopt a TIM barrel fold, which is reminiscent of flavin mononucleotide binding proteins, rather than one similar to other flavin adenine dinucleotide binding domains. However, the way the FAD cofactor binds in quite different compared to the binding of FMN in the TIM-barrel structures [1]. 2008-09-03 15:50:29 2004-12-16 16:43:38 11 60 30 4179 7837 1 87 CL0087 Acyl-CoA_dh \N Acyl-CoA dehydrogenase, C-terminal domain-like Finn RD anon The Acyl-CoA dehydrogenase FAD binding domain forms an mostly alpha helical domain, comprised of four helices\ arranged in up-and-down bundle. In Acyl-CoA oxidase II this domain appears to have been duplicated. 2008-09-03 15:50:29 2004-12-17 10:15:37 12 254 148 3538 31652 1 88 CL0088 Alk_phosphatase \N Alkaline phosphatase-like Finn RD anon The members of this clan all share a common structure of their catalytic domains, which contain conserved metal binding residues [1]. 2008-09-03 15:50:29 2004-12-17 11:25:45 15 217 321 5028 38421 1 89 CL0089 GlnB-like \N GlnB-like superfamily Finn RD anon The members of this clan are characterised by the fact the domains, each comprised of four beta-strand and two alpha helices, tend to form tetrameric structures [1]. 2008-09-03 15:50:29 2004-12-17 12:10:20 12 264 47 4170 11982 1 90 CL0090 Globin \N Globin-like Finn RD anon The globin fold is an evolutionary conserved six helical fold that is found in bacteria and eukaryotes. 2008-09-03 15:50:29 2004-12-17 12:17:06 11 2247 121 4808 13282 1 91 CL0091 NAD_Ferredoxin \N Ferredoxin / Ferric reductase-like NAD binding Finn RD anon The Ferredoxin / Ferric reductase-like NAD binding domain is adopts a Rossmann like fold. However, these families have been excluded from the classical NAD(P) binding Rossmann clan (CLAN:CL0063), due to a divergence of the GxGxxG motif. In this clan, the motif phosphate binding motif is G-T/S-G-A/I-P. The changes in the motif are a reflection of the different way that the NAD(P)H is bound by this fold and the classical Rossmann fold [1,2]. 2008-09-03 15:50:29 2004-12-17 14:54:00 11 192 335 4749 22088 1 92 CL0092 ADF \N Actin depolymerizing Factor Finn RD anon For motile cells such as Amoeba to move, there must be the rapid recycling of their actin cytoskeleton to enable a dynamic change in their shape. Gelsolin (PFAM:PF00626) and Cofilin (PFAM:PF00241) are two key domain families in this process. Both of these domain are structural and functional similar [1,2]. In particular, the beta sheet found at the core of the domain is structurally well conserved, with the helices that surround this sheet less conserved[2]. 2008-09-03 15:50:29 2004-12-17 17:36:22 11 154 175 529 7818 1 93 CL0093 Peptidase_CD \N Peptidase clan CD Finn RD anon The members of this clan are all endopeptidase that have the catalytic dyad histidine followed by cysteine. The catalytic histidine is preceded by a block of hydrophobic residues and a glycine, where as the cysteine is preceded by a block of hydrophobic residues and a glutamine and an alanine. The members with a know structure adopt an alpha/beta fold [1]. 2008-09-03 15:50:29 2004-12-21 13:25:00 13 474 1163 1706 9782 1 94 CL0094 Peptidase_ME \N LuxS/MPP-like metallohydrolase Finn RD anon All members of this clan are characterised by a HXXEH motif, which is is involved in zinc binding. Furthermore all members adopt an alpha and beta fold. More specifically, there us a four to six stranded antiparallel beta sheet surrounded by five helices. However, LuxS (PFAM:PF02664) is not a peptidase, although its hydrolytic mechanism of catalysis appears to be conserved [1]. 2008-09-03 15:50:29 2004-12-21 13:57:38 11 224 73 4739 30361 1 95 CL0095 Peptidase_ML \N Peptidase Clan ML Finn RD anon This clan contains HybD-like domains. HybD is a nickel binding endopeptidase. Structural and sequences analyses have highlighted the presence of two highly conserved motifs that are shared with germination proteases and HybD [1]. Members of this clan adopt an alpha/beta fold, comprised of a central beta sheet, surrounded by alpha helices. 2008-09-03 15:50:29 2004-12-21 15:54:20 12 12 11 1706 3673 1 96 CL0096 Pept_Inhib_IE \N Peptidase Inhibitor Clan IE Finn RD anon The members of this clan are all cystine rich domains, which form a knottin scaffold. This clan should also contain alpha-amylase but currently this family is a singleton and can not be put into Pfam. Also see [1]. 2008-09-03 15:50:29 2004-12-22 12:16:32 11 34 2 22 52 1 97 CL0097 TypeIII_Chap \N Type III secretory system chaperone Finn RD anon The translocation of pathogenic proteins into a host cell is mediated by the type III secretory system. A component of this system is a chaperone, which binds to the protein which is going to be secreted in the bacterial cytosol and is involved in translocation of the secreted protein, although the chaperone is not translocated itself. An individual chaperone associates with one or two specific proteins [1]. There are a large number of type III secretory system chaperones, which are small acidic proteins and exhibit significant sequence divergence. This clan groups type III secretory system chaperones. Members with a known structure form small compact globular domains with an alpha-beta(3)- alpha-beta(2)-alpha like organisation [1]. 2008-09-03 15:50:29 2004-12-22 12:58:31 11 45 14 1697 3343 1 98 CL0098 SPOUT AB_Knot; SPOUT Methyltransferase Superfamily Finn RD anon A distinct class of methylases that includes the SpoU and TrmD superfamilies and two superfamilies of predicted methylases defined by the YbeA and MJ0421 proteins in bacteria and archaea, respectively [1] (PFAM:PF00588 PFAM:PF01746). SPOUT is structurally distinct compared to more classical methyltransferases [1]. More specifically, the members of this clan form alpha/beta knots. Knots are extremely rare in protein structures as they pose a\ folding problem. The mechanism that allow a domain to be folded as a knot are unclear, but are discussed in [2] and reference therein. All members with known structure form homodimers. 2008-09-03 15:50:29 2004-12-22 15:38:51 13 112 85 5028 32567 1 99 CL0099 ALDH-like \N ALDH-like superfamily Finn RD anon The aldehyde dehydrogenases (ALDHs) are a superfamily of multimeric enzymes which catalyse the oxidation of a broad range of aldehydes into their corresponding carboxylic acids with the reduction of their cofactor, NAD(P) into NAD(P)H. The way that the NAD is bound is distinct from other NAD(P)-dependent oxidoreductases. The domain represented by this clan consists of two similar subdomains. 2008-09-03 15:50:29 2004-12-22 15:45:47 12 614 141 5455 48240 1 100 CL0100 C1q_TNF \N C1q and TNF superfamily Finn RD anon The members of the C1q and TNF superfamily are involved in a diverse set of functions, which include: defense, inflammation, apoptosis, autoimmunity differentiation, organogenesis, hibernation and insulin-resistant\ obesity [1]. Both C1q and TNF domains form a compact jelly-roll beta- sandwich. The core of these structures are conserved between the two families and corresponds to the detectable sequence similarity. Proteins containing both of these domains, form trimers before they are active. However, the surfaces of the domains are quite different and this difference is thought to give rise to the function difference between the clan members[1]. 2008-09-03 15:50:29 2004-12-22 15:46:56 12 294 48 443 3534 1 101 CL0101 PELOTA RNA_ribose_bind; Pelota - RNA ribose binding superfamily Finn RD anon The members of this clan are all involved in binding to ribose sugar of RNA[1]. Indeed, the key RNA binding residues are conserved across the different families [1]. Members of this clan form mixed alpha-helical and beta-sheet structures [1][2]. 2012-10-06 18:35:16 2005-01-04 15:00:27 11 212 56 5002 13986 1 103 CL0103 Gal_mutarotase \N Galactose Mutarotase-like superfamily Bateman A anon This clan is composed of a beta-sandwich that was first observed in domain 5 of beta-galactosidase, then as the central domain of copper amine oxidase, the C-terminal domain of chondroitinase, the C-terminal domain of hyaluronate lyase, the N-terminal domain of maltose phosphorylase and in Galactose Mutarotase [1]. All these enzymes act on a sugar substrate. 2008-09-03 15:50:29 2005-01-28 16:18:34 11 539 313 4717 24927 1 104 CL0104 Glyoxalase \N VOC superfamily Bateman A anon This clan contains the VOC metalloenzyme superfamily [1]. The known types of reactions that are catalysed include isomerizations (glyoxalase I), epimerizations (methylmalonyl-CoA epimerase), oxidative cleavage of C-C bonds (extradiol dioxygenase), and nucleophilic substitutions (fosfomycin resistance proteins) [1]. 2008-09-03 15:50:29 2005-01-28 18:11:34 12 363 157 4720 43346 1 105 CL0105 Hybrid \N Barrel sandwich hybrid superfamily Bateman A anon This superfamily contains proteins with a hybrid motif [1]. This motif is embedded in structurally diverse proteins. 2008-09-03 15:50:29 2005-01-31 16:15:18 12 394 669 6221 139176 1 106 CL0106 6PGD_C \N 6-phosphogluconate dehydrogenase C-terminal-like superfamily Bateman A anon This helical domain is found associated with Rossmann domains. 2008-09-03 15:50:29 2005-01-31 18:34:00 12 232 123 5955 39015 1 107 CL0107 KOW \N KOW domain Bateman A anon This superfamily includes proteins involved in translation that have a KOW like SH3-fold. 2008-09-03 15:50:29 2005-03-18 14:03:54 11 584 79 6487 23716 1 108 CL0108 Actin_ATPase \N Actin-like ATPase Superfamily Finn RD anon The actin-like ATPase domain forms an alpha/beta canonical fold. The domain can be subdivided into 1A, 1B, 2A and 2B subdomains. Subdomains 1A and 1B share the same RNAseH-like fold (a five-stranded beta-sheet decorated by a number of alpha-helices). Domains 1A and 2A are conserved in all members of this superfamily, whereas domain 1B and 2B have a variable structure and are even missing from some homologues [1]. Within the actin-like ATPase domain the ATP-binding site is highly conserved. The phosphate part of the ATP is bound in a cleft between subdomains 1A and 2A, whereas the adenosine moiety is bound to residues from domains 2A and 2B[1]. 2008-09-03 15:50:29 2005-03-22 09:34:28 15 1022 508 11461 141414 1 109 CL0109 CDA Cytidine deaminase-like (CDA) superfamily Finn RD, Coin L, Iyer LM, Zhang D, Aravind L anon This clan contains both free nucleotide and nucleic acid deaminases that act on adenosine, cytosine, guanine and cytidine, and are collectively known as the deaminase superfamily. The conserved fold consists of a three-layered alpha/beta/alpha structure with 3 helices and 4 strands in the 2134 order [1,2].This superfamily is further divided into two major divisions based on the presence of a helix (helix-4) that renders the terminal strands (strands 4 and 5) either parallel to each other in its presence, or anti-parallel in its absence [2]. Structurally, the deaminase-like fold is present in four other superfamilies including the JAB-like metalloproteins, the C-terminal AICAR transformylase-catalyzing domains of PurH, Tm1506 and the formate dehydrogenase accessory subunit FdhD. The active site of the deaminases is composed of three residues that coordinate a zinc ion between conserved helices 2 and 3. The residues are typically found as [HCD]xE and CxxC motifs at the beginning of helices 2 and 3. The zinc ion activates a water molecule, which forms a tetrahderal intermediate with the carbon atom that is linked to the amine group. This is followed by deamination of the base. 2008-09-03 15:50:29 2005-03-22 09:57:40 11 208 1010 5319 27014 1 110 CL0110 GT-A Glycosyl transferase clan GT-A Bateman A anon This is the GT-A clan that contains diverse glycosyltransferases that possess a Rossmann like fold [1]. 2008-09-03 15:50:29 2005-03-22 10:54:55 11 713 1496 7680 127750 1 111 CL0111 GT-C Glycosyl transferase GT-C superfamily Bateman A anon This is the GT-C clan that contains diverse glycosyltransferases that possess 8-13 predicted transmembrane segments [1]. 2008-09-03 15:50:29 2005-03-22 13:52:31 10 19 303 4345 19864 1 112 CL0112 Yip1 Yip1/YIF1-like Finn RD, Mistry J anon Yip1 and YIF1 are members of an integral membrane complex which bind to Ras-like GTPases and are required for membrane fusion of ER derived vesicles with the Golgi [1]. 2008-09-03 15:50:29 2005-03-22 15:32:52 12 4 63 2706 5788 1 113 CL0113 GT-B \N Glycosyl transferase clan GT-B Bateman A anon This is the GT-B clan that contains diverse glycosyltransferases that possess a Rossmann like fold [1]. 2008-09-03 15:50:29 2005-03-22 17:19:24 12 539 1601 8102 141567 1 114 CL0114 HMG-box \N HMG-box like superfamily Bateman A anon This clan includes the DNA-binding HMG-box proteins as well as the YABBY-like transcription factors. 2008-09-03 15:50:29 2005-03-23 13:44:43 11 62 241 1493 10953 1 115 CL0115 Steroid_dh \N Steroid oxidoreductase superfamily Bateman A anon This clan includes several enzymes, including steroid dehydrogenases and isoprenylcysteine carboxyl methyltransferase enzymes. These protein contain a varying number of transmembrane regions. 2008-09-03 15:50:29 2005-03-23 14:15:40 11 1 60 2542 7438 1 116 CL0116 Calycin \N Calycin superfamily Bateman A anon The calycin structural superfamily [1-3] includes the lipocalins, the fatty acid-binding proteins (FABPs). 2008-09-03 15:50:29 2005-03-23 14:57:38 12 662 102 3233 10920 1 117 CL0117 uPAR_Ly6_toxin \N uPAR/Ly6/CD59/snake toxin-receptor superfamily Bateman A anon This superfamily contains snake toxins as well as extracellular cysteine rich domains. 2008-09-03 15:50:29 2005-03-30 14:52:26 10 228 28 268 2910 1 118 CL0118 Ribokinase \N Ribokinase-like superfamily Bateman A anon All of these enzymes are phosphotransferases that have an alcohol group as an acceptor (EC:2.7.1.-). However, 4-amino-5-hydroxymethyl-2-methylpyrimidine phosphate kinase (HMPP kinase) catalyses two phosphorylation reactions: one to a hydroxymethyl group of hydroxymethyl pyrimidine (HMP) and the second to the phosphomethyl group of HMPP [1]. The common structural feature for the enzymes in this superfamily is a central eight-stranded sheet that is flanked by eight structurally conserved helices, five on one side and three on the other [1]. The active site is located in a shallow groove along one edge of the sheet, with the phosphate acceptor hydroxyl group and -phosphate of ATP close together in the middle of the groove, and substrate and ATP binding at the ends [1]. 2008-09-03 15:50:29 2005-04-01 13:48:37 11 400 155 5041 42135 1 121 CL0121 Cystatin Cystatin-like superfamily Bateman A anon This superfamily includes cystatins and cathelicidins [1]. The cystatin superfamily comprises cysteine protease inhibitors that play key regulatory roles in protein degradation processes. The progenitor of this superfamily was most probably intracellular and lacked a signal peptide and disulfide bridges, much like the extant Giardia cystatin. A primordial gene duplication produced two ancestral eukaryotic lineages, cystatins and stefins. Stefins - included in Pfam:PF00031 - remain encoded by a single or a small number of genes throughout the eukaryotes, whereas the cystatins have undergone a more complex and dynamic evolution through numerous gene and domain duplications [2]. 2008-09-03 15:50:29 2005-04-05 16:56:37 11 86 26 392 2322 1 122 CL0122 UTRA Chor_lyase; Chorismate lyase/UTRA superfamily Bateman A anon This clan includes chorismate lyase as well as the UTRA domain 2008-09-03 15:50:29 2005-04-05 17:05:44 10 71 33 3574 14169 1 123 CL0123 HTH Helix-turn-helix clan Bateman A anon This family contains a diverse range of mostly DNA-binding domains that contain a helix-turn-helix motif. 2008-09-03 15:50:29 2005-04-05 17:52:07 17 2812 5699 10949 1020775 1 124 CL0124 Peptidase_PA Peptidase clan PA Bateman A anon This clan contains a diverse set of peptidases with the trypsin fold. 2008-09-03 15:50:29 2005-04-06 15:44:18 14 2540 1159 7202 60527 1 125 CL0125 Peptidase_CA Peptidase clan CA Bateman A anon This clan includes peptidases with the papain-like fold. 2008-09-03 15:50:29 2005-04-06 17:40:48 14 814 1796 6922 76571 1 126 CL0126 Peptidase_MA Peptidase clan MA Bateman A anon Clan MA is one of two zinc-dependent metallopeptidases that contain the HEXXH motif. The two histidines are zinc ligands. The structures of this clan show the active site is between its two sub-domains. 2008-09-03 15:50:29 2005-04-07 08:52:20 17 981 1676 6188 88418 1 127 CL0127 ClpP_crotonase ClpP/Crotonase superfamily Bateman A anon This family includes several peptidases of peptidase clan SK as well as crotonase like proteins. 2008-09-03 15:50:29 2005-04-07 12:12:33 11 1093 331 8610 70137 1 128 CL0128 vWA-like von Willebrand factor type A Finn RD anon To add. 2008-09-03 15:50:29 2005-04-07 17:59:38 11 223 1340 4970 35661 1 129 CL0129 Peptidase_AA \N Peptidase clan AA Bateman A anon This clan contains aspartic peptidases, including the pepsins and retropepsins. These enzymes contains a catalytic dyad composed of two aspartates. In the retropepsins one is provided by each copy of a homodimeric protein, whereas in the pepsin-like peptidases these aspartates come from a single protein composed of two duplicated domains. 2008-09-03 15:50:29 2005-04-08 09:36:39 13 1842 629 2629 138509 1 130 CL0130 Peptidase_AD \N Peptidase clan AD Bateman A anon Members of this clan are peptidases that are integral membrane proteins. The catalytic aspartate is in the conserved GXGD motif. 2008-09-03 15:50:29 2005-04-08 11:13:31 10 5 33 3841 6449 1 131 CL0131 DoxD-like DoxD-like Mistry J anon The families in this clan are all membrane proteins. The DoxD family is found on enzymes involved in elemental sulphur oxidation [1]. The other families in this clan are poorly characterised. 2008-09-03 15:50:29 2005-04-08 11:57:52 10 0 56 3235 10608 1 132 CL0132 AbrB \N AbrB/MraZ DNA-binding domain Bateman A anon This superfamily includes the DNA-binding domain of AbrB as well as the presumed DNA-binding protein MraZ (per. comm. A Andreeva and A Murzin). 2008-09-03 15:50:29 2005-04-08 13:53:50 12 50 24 3586 11690 1 133 CL0133 AT14A-like \N AT14A-like Mistry J anon This clan contains plant proteins. DUF677 family members are AT14A-like proteins that have sequence similarity to fungal, insect and human integrins [1]. The other members of this clan are poorly characterised. 2008-09-03 15:50:29 2005-04-08 14:00:57 10 0 14 26 879 1 135 CL0135 Arrestin_N-like \N Arrestin_N-like Mistry J anon The families in this clan are involved in vacuolar protein trafficking, G protein signal termination and sporulation. The Arrestin N terminal domain has an Ig-like beta sandwich fold which binds to receptors and impairs their capacity to active G proteins [1]. Arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways [2]. 2008-09-03 15:50:29 2005-04-12 14:27:53 10 34 59 768 5476 1 136 CL0136 Plasmid_toxin \N Plasmid toxin-antitoxin system Mistry J anon The families in this clan are plasmid encoded toxins involved in plasmid maintenance. The plasmid encodes both a toxin and an antitoxin. Upon loss of the plasmid the antitoxin is inactivated more rapidly than the toxin. This allows the toxin to interact with its target thus killing the cell or impeding growth. 2008-09-03 15:50:29 2005-04-14 09:55:24 11 91 44 3558 18077 1 137 CL0137 HAD HAD superfamily Bateman A anon This clan represents the haloacid dehalogenase (HAD) superfamily that includes a diverse range of enzymes that use an asp carboxylate as a nucleophile [1]. 2008-09-03 15:50:29 2005-04-15 16:57:28 14 780 821 6664 120193 1 139 CL0139 GADPH_aa-bio_dh \N Amino acid biosynthesis and glycosomal dehydrogenase Mistry J anon This clan contains the C terminal domains of dehydrogenase enzymes involved in the biosynthesis of arginine, aspartate and aspartate derived amino acids. It also contains the C terminal domain of GAPDH, a dehydrogenase involved in glycolysis and gluconeogenesis. 2008-09-03 15:50:29 2005-04-18 13:42:09 10 525 39 9289 24271 1 140 CL0140 Viral_NABP \N Viral nucleic acid binding Mistry J anon This clan contains viral nucleic acid binding protein families. Two of the families in this clan are known to contain zinc finger motifs [1][2]. 2008-09-03 15:50:29 2005-04-18 14:24:28 10 0 5 71 578 1 141 CL0141 MtN3-like MtN3-like, vesicle-trafficking cargo-receptors Mistry J anon The clan forms a large and diverse family of proteins with seven transmembrane helices, common topology and, most likely, similar function. Their coding genes exist in all eukaryota and in several prokaryota. Some are responsible for metabolic diseases (cystinosis, congenital disorder of glycosylation), others are candidate genes for genetic disorders (cleft lip and palate, certain forms of cancer) or solute uptake and efflux (SWEETs) and many have not yet been assigned a function. Comparison with the properties of well-annotated clan members suggests that the proteins could be involved in protein trafficking and serve as cargo receptors in vesicle trafficking [3]. 2008-09-03 15:50:29 2005-04-19 09:24:39 11 0 74 1425 7738 1 142 CL0142 Membrane_trans Membrane and transport protein Mistry J anon This clan contains membrane proteins involved in the transport of molecules including amino acids sugars and signalling molecules. It also includes integral membrane cell cycle proteins and some putative ammonia monooxygenases. 2008-09-03 15:50:29 2005-04-20 13:47:04 11 10 78 4928 72098 1 143 CL0143 B_Fructosidase Beta fructosidase superfamily Mistry J anon This beta fructosidase superfamily [4] is composed of glycosyl hydrolase families. The members of this clan adopt a five-bladed beta-propeller fold [2-3]. The beta-fructosidase superfamily is also known as furanosidase superfamily [4]. 2008-09-03 15:50:29 2005-04-25 11:24:48 15 191 281 2992 11034 1 144 CL0144 Periplas_BP Periplas_BP-like; Periplasmic binding protein like Mistry J anon This clan includes proteins involved in chemotaxis, membrane transport of sugars and allocrites, and the LacI family transcriptional regulators. It also includes some antigenic basic membrane lipoproteins. 2008-09-03 15:50:29 2005-04-25 15:04:24 12 462 379 4786 70533 1 145 CL0145 Golgi-transport \N Golgi-transport Mistry J anon This clan contains families that are involved in intracellular transport and signalling.\ \ Arfaptins are proteins which interact with small GTPases involved in vesicular budding at the Golgi complex. They form an elongated dimer of three helix coiled coils and are structurally very similar to the BAR domain [1][2]. The Sec34 family is involved in tethering vesicles to the Golgi [3]. 2008-09-03 15:50:29 2005-04-25 16:18:47 14 49 165 766 6140 1 146 CL0146 Herpes_glyco \N Herpes glycoprotein Mistry J anon This clan contains herpes envelope glycoproteins [1][2]. 2008-09-03 15:50:29 2005-04-25 16:48:44 11 0 3 78 288 1 147 CL0147 Traffic \N Trafficking protein Mistry J anon The members of this clan are involved in protein trafficking. The Sec20 family are integral membrane proteins involved in ER to Golgi transport [1] and V-SNARES are involved in membrane fusion [2]. 2008-09-03 15:50:29 2005-04-26 10:46:36 10 7 29 336 2129 1 148 CL0148 Viral_Gag \N Viral Gag protein Mistry J anon This clan contains Gag proteins which are involved in viral assembly and replication [1][2]. 2008-09-03 15:50:29 2005-04-27 12:13:23 10 185 102 374 45347 1 149 CL0149 CoA-acyltrans \N CoA-dependent acyltransferase superfamily Finn RD anon All characterised families in this clan are involved in CoA-dependent acyltransferase. All families have a characteristic HXXXD motif. 2008-09-03 15:50:29 2005-04-27 14:03:23 11 193 1986 4892 38354 1 151 CL0151 PK_TIM \N Pyruvate kinase-like TIM barrel superfamily Bateman A anon This superfamily consists of a number of TIM barrel domains found in enzymes such as pyruvate kinase, malate synthase and citrate lyase. 2008-09-03 15:50:29 2005-05-03 14:40:00 11 507 138 6403 40855 1 153 CL0153 dUTPase \N dUTPase like superfamily Bateman A anon This clan contains dUTPase and many viral proteins that appear to be related. dUTPases are important in virus replication. 2008-09-03 15:50:29 2005-05-04 09:16:52 10 268 42 4817 8241 1 154 CL0154 C2 C2 superfamily Bateman A anon This superfamily includes C2 domains and C2-like domains. 2008-09-03 15:50:29 2005-05-04 15:46:49 10 222 791 571 27805 1 155 CL0155 CBM_14_19 \N Carbohydrate binding domain 14/19 clan Bateman A anon This clan includes two different carbohydrate binding modules. 2008-09-03 15:50:29 2005-05-04 15:50:34 10 1 158 335 6959 1 156 CL0156 Nucleocapsid \N Mononegaviral nucleocapsid superfamily Bateman A anon This clan contains paramyxoviral and ebola type virus nucleocapsid proteins. 2008-09-03 15:50:29 2005-05-04 16:11:33 10 4 2 470 5978 1 157 CL0157 Kleisin \N Kleisin superfamily Bateman A anon The kleisin superfamily includes ScpA, Scc1, Rec8, and Barren [1]. Scc1 interacts with SMC proteins through N- and C-terminal domains to form a ring-like structure [1]. 2008-09-03 15:50:29 2005-05-04 16:26:02 10 4 11 3187 3510 1 158 CL0158 GH_CE \N Glycoside hydrolase/deacetylase superfamily Bateman A anon This superfamily contains diverse enzymes that act on carbohydrates including both hydrolases and deacetylases. 2008-09-03 15:50:29 2005-05-04 16:44:41 11 125 249 4786 20178 1 159 CL0159 E-set Ig-like fold superfamily (E-set) Finn RD, Bateman A anon This clan includes a diverse range of domains that have an Ig-like fold and appear to be distantly related to each other. The clan includes: PKD domains, cadherins and several families of bacterial Ig-like domains as well as viral tail fibre proteins. it also includes several Fibronectin type III domain-containing families. 2008-12-15 16:59:57 2005-05-09 16:19:14 15 1072 9593 6497 213896 1 160 CL0160 Methionine_synt Cobalamin-independent synthase Finn RD anon The N-terminal and C-terminal cobalamin-independent synthase domains are structurally similar, adopting a TIM beta/alpha barrel. However, the two domain perform functionally different roles. The N-terminal domain and C-terminal domains both define a catalytic cleft in the enzyme. The N-terminal domain is thought to bind the substrate, in particular, the negatively charged polyglutamate chain. The N-terminal domain is also thought to stabilise a loop from the C-terminal domain. The C-terminal domain contains the active site residues[1]. 2008-09-03 15:50:29 2005-05-09 16:58:23 10 56 28 4455 12453 1 161 CL0161 GAF \N GAF domain-like Finn RD anon A clan of related transcriptional regulator domains. 2008-09-03 15:50:29 2005-05-09 18:32:14 11 220 3047 6850 47618 1 162 CL0162 FBA \N F-box associated Finn RD anon Clan containing related F-box associated families. 2008-09-03 15:50:29 2005-05-09 18:41:19 10 0 47 50 1557 1 163 CL0163 Calcineurin Calcineurin-like phosphoesterase superfamily Bateman A anon This clan contains the calcineurin-like phosphoesterases. This clan also includes the apparently inactive homologues from the small DNA polymerase subunits [1]. 2008-09-03 15:50:29 2005-05-10 16:59:39 10 274 576 5495 51255 1 164 CL0164 CUB \N CUB clan Bateman A anon This clan contains the CUB domain [1,2]. 2008-09-03 15:50:29 2005-05-10 17:23:26 12 31 892 190 13225 1 165 CL0165 Cache \N Cache-like domain Finn RD anon The Cache domain an extracellular domain that is thought to have a role in small-molecule recognition in a wide range of proteins, including the animal Ca(2+)-channel subunits and a class of prokaryotic chemotaxis receptors [1]. 2008-09-03 15:50:29 2005-05-10 17:26:33 10 40 407 2883 12801 1 166 CL0166 PRD \N PRD domain superfamily Bateman A anon The PRD domain (for PTS Regulation Domain), is the phosphorylatable regulatory domain found in bacterial transcriptional antiterminator of the BglG family as well as in activators such as MtlR and LevR. The PRD domain is phosphorylated on a conserved histidine residue. PRD-containing proteins are involved in the regulation of catabolic operons in Gram+ and Gram- bacteria and are often characterised by a short N-terminal effector domain that binds to either RNA (CAT-RBD for antiterminators (Pfam:PF03123, see also comments for this family)) or DNA (for activators), and a duplicated PRD module which is phosphorylated on conserved histidines by the sugar phosphotransferase system (PTS) in response to the availability of carbon source. The phosphorylations are thought to modify the stability of the dimeric proteins and thereby the RNA- or DNA-binding activity of the effector domain. 2008-09-03 15:50:29 2005-05-11 14:46:29 11 9 110 2060 16551 1 167 CL0167 Zn_Beta_Ribbon Zinc beta-ribbon Finn RD anon A clan of zinc-binding ribbon domains. 2008-09-03 15:50:29 2005-05-11 16:19:40 14 628 1382 6834 87461 1 168 CL0168 PAN \N PAN-like Finn RD anon PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions [1]. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs. 2008-09-03 15:50:29 2005-05-11 16:56:50 14 59 399 404 5383 1 169 CL0169 Rep \N Rep-like domain Bateman A anon This clan includes replication proteins for viruses and plasmids. This domain is known to bind DNA. The members of this clan have three motifs. The central HXH is conserved in most families in the clan. 2008-09-03 15:50:29 2005-05-13 18:23:32 10 25 62 3938 11895 1 170 CL0170 Peptidase_MD \N Peptidase MD Finn RD anon This clan is comprised of carboxypeptidases and the N-terminal domain from Sonic hedgehog proteins. The structure of the latter is similar to the peptidases, but the N-terminal domain of hedgehog has been demonstrate not to be involved in peptidase activity, but is more likely involved in signal transduction [1]. 2008-09-03 15:50:29 2005-05-19 09:30:19 10 53 142 3407 9216 1 171 CL0171 Phospoesterase \N inositol polyphosphate 1 phosphatase like superfamily Bateman A anon Members of this clan show metal-dependent / lithium sensitive phosphomonoesterase activity. The clan includes inositol polyphosphate 1 phosphatase and fructose 1,6-bisphosphatase [1]. 2008-09-03 15:50:29 2005-05-19 17:12:45 10 289 56 4585 15156 1 172 CL0172 Thioredoxin Thioredoxin-like; Thioredoxin-like Mistry J anon This clan contains families related to the thioredoxin family. Thioredoxins are small enzymes that are involved in redox reactions via the reversible oxidation of an active centre disulfide bond. The thioredoxin fold consists of a 3 layer alpha/beta/alpha sandwich and a central beta sheet. 2008-09-03 15:50:29 2005-05-20 15:54:37 16 2327 1079 7198 142836 1 173 CL0173 STIR \N STIR superfamily Fenech M anon Both members of this clan are thought to be involved in TOLL/IL1R-like pathways, by mediating protein-protein interactions between pathway components. The N-termini of SEFIR and TIR domains are similar, but the domains are more divergent towards the C-terminus [1]. 2008-09-03 15:50:29 2005-08-11 10:18:06 10 27 1046 1412 7537 1 174 CL0174 TetR_C \N TetR protein, C-terminal domain-like Fenech M anon This clan features families of transcriptional regulators for multidrug efflux pumps, which belong to the TetR superfamily. They are induced by the presence of a variety of factors, such as antibiotics or organic solvents. The C-terminal region featured in these families is thought to contain the inducer-binding site; the divergent sequences in this region allow for the binding of a variety of different inducers [1-4]. 2008-09-03 15:50:29 2005-08-11 10:47:22 9 244 37 2464 10523 1 175 CL0175 TRASH TRASH superfamily Fenech M anon TRASH-like domains contain well-conserved cysteine residues that are thought to be involved in metal coordination. These domains are thus expected to be involved in metal trafficking and heavy-metal resistance. It has been suggested that the members adopt a 'treble-clef' fold, with 3/4 beta strands preceding a C-terminal alpha helix [1]. 2008-09-03 15:50:29 2005-08-15 13:59:13 10 149 389 2803 12513 1 176 CL0176 Chemosens_recp \N Chemosensory 7tm receptor superfamily Finn R, Fenech M anon The members of this clan are families of various gustatory and odorant receptors. They are described as being seven-transmembrane receptors, and in fact all show characteristic regions of hydrophobicity on the alignment. 2008-09-03 15:50:29 2005-08-15 14:20:45 9 0 50 129 5871 1 177 CL0177 PBP \N Periplasmic binding protein clan Bateman A anon Periplasmic binding proteins (PBPs) consist of two large lobes that close around the bound ligand. This architecture is reiterated in transcriptional regulators, such as the lac repressors. In the process of evolution, genes encoding the PBPs have fused with genes for integral membrane proteins. Thus, diverse mammalian receptors contain extracellular ligand binding domains that are homologous to the PBPs; these include glutamate/glycine-gated ion channels such as the NMDA receptor, G protein-coupled receptors, including metabotropic glutamate, GABA-B, calcium sensing, and pheromone receptors, and atrial natriuretic peptide-guanylate cyclase receptors [2]. 2008-09-03 15:50:29 2005-08-22 13:15:50 15 1664 837 5964 242652 1 178 CL0178 PUA \N PUA/ASCH superfamily Bateman A anon This clan consists of the RNA binding PUA domain and ASCH domain. It also contains uncharacterised protein families. 2008-09-03 15:50:29 2005-08-22 16:11:13 15 169 216 5007 19669 1 179 CL0179 ATP-grasp \N ATP-grasp superfamily Bateman A anon The ATP-grasp domain is found in a wide variety of carboxylate-amine/thiol ligases [1]. It is composed of two subdomains, with ATP being bound in the cleft between the two. 2008-09-03 15:50:29 2005-08-23 14:07:58 13 397 518 7468 61819 1 181 CL0181 ABC-2 \N ABC-2-transporter-like clan Fenech M anon These families are similar to the ABC-2 transporter subfamily, as described in [1] (Pfam:PF01061). Members of this family are involved in drug transport and resistance. CcmB protein family (Pfam:PF03379) members are also transporters; they are required for haem export into the periplasm [2]. 2008-09-03 15:50:29 2005-08-23 16:06:40 9 3 239 5481 49701 1 182 CL0182 IT \N IT (Ion Transporter) superfamily Bateman A anon This superfamily of secondary carriers specific for cationic and anionic compounds, has been termed the ion transporter (IT) superfamily [1]. 2008-09-03 15:50:29 2005-08-23 18:40:04 12 0 126 4842 49966 1 183 CL0183 PAS_Fold PAS; PAS domain clan Bateman A anon This clan contains PAS domains that are found in a wide variety of bacterial signaling proteins. 2008-09-03 15:50:29 2005-08-24 15:20:49 13 311 6949 5998 88093 1 184 CL0184 DMT Drug/Metabolite transporter superfamily Bateman A anon This clan contains a variety of transporters which have 4, 5, 9 or 10 membrane spanning helices. Many of the 10 membrane spanning transporters appear to be a duplication of the 5 spanning unit [1]. Many of these families contain a characteristic glycine rich motif close to the C-terminus. 2008-09-03 15:50:29 2005-08-25 11:36:38 10 12 341 5620 114318 1 186 CL0186 Beta_propeller Beta propeller clan Bateman A anon This large clan contains proteins that contain beta propellers. These are composed of between 6 and 8 repeats. The individual repeats are composed of a four stranded sheet. The clan includes families such as WD40 Pfam:PF00400 where the individual repeats are modeled. The clan also includes families where the entire propeller is modeled such as Pfam:PF02239 usually because the individual repeats are not discernible. These proteins carry out a very wide diversity of functions including catalysis. 2008-09-03 15:50:29 2005-08-26 09:32:54 13 1163 8240 11212 356413 1 187 CL0187 LysM \N LysM-like domain Fenech M anon The LysM domain (Pfam:PF01476) is thought to be a general peptidoglycan-binding module. Although originally described in bacterial proteins, it has been also found in some eukaryotic sequences. It takes up a beta-alpha-alpha-beta conformation, with the beta strands forming an antiparallel beta sheet and the two alpha helices packing on one side of this sheet [1]. 2008-09-03 15:50:29 2005-08-26 09:44:43 10 8 614 4599 29820 1 188 CL0188 CH \N Calponin homology domain Fenech M anon The calponin homology (CH) domain is found in a variety of contexts, ranging from proteins involved in signalling pathways to cytoskeletal proteins. They seem to have diverse cellular functions, which are thought to include actin binding, involvement in the MAP kinase signalling pathway, and regulation of GEF activity in Rho family GTPase pathways. Structurally, they are organised into three layers, with two parallel alpha helices in the core being sandwiched between another two helices, one on each side [1]. 2008-09-03 15:50:29 2005-08-30 18:32:49 9 107 636 512 11687 1 189 CL0189 Endonuclease \N Endonuclease V-like superfamily Wuster A anon This clan contains DNA repair proteins. In E. coli endonuclease V initiates DNA repair of deaminated DNA bases and has similarity to motifs required for the catalytic activity of the UvrC endonuclease [1]. 2008-09-03 15:50:29 2005-08-31 14:14:38 9 20 27 4568 5628 1 190 CL0190 HSP20 HSP20-like chaperone superfamily Fenech M anon The small heat shock proteins (sHSPs) prevent protein aggregation during heat shock and oppose regulated cell death. A conserved arginine residue in the HSP20/alpha-crystallin domain (Pfam:PF00011) has in fact been implicated in the development of cataracts and myopathies [1]. The CS family (Pfam:PF04969) includes proteins that are known to bind HSP90 [2], as well as p23 (Swiss:Q15185), which is an HSP90 co-chaperone [3]. 2008-09-03 15:50:29 2005-09-01 14:25:02 11 201 170 4085 13158 1 191 CL0191 POTRA \N POTRA domain superfamily Fenech M anon The polypeptide-transport-associated (POTRA) domain is predicted to be organised into three beta-strands and two alpha helices, the latter being found between strands 2 and 3. It is usually found associated with a beta-barrel outer membrane domain. It is thought to have a chaperone-like function; the proteins it is found in are involved in processes as diverse as bacterial septation and protein transport across membranes [1]. 2008-09-03 15:50:29 2005-09-01 14:31:22 10 20 53 4354 20069 1 192 CL0192 GPCR_A Family A G protein-coupled receptor-like superfamily Fenech M anon This clan contains various seven-transmembrane receptors and related proteins. A major member is Pfam:PF00001, members of which have been considered to be typical members of the rhodopsin superfamily. Many members of this clan are Caenorhabditis proteins, suggesting great expansion of the relevant families in these nematode worms. 2008-09-03 15:50:29 2005-09-02 12:58:56 12 304 924 9484 92105 1 193 CL0193 MBB Outer membrane beta-barrel protein superfamily Bateman A anon This clan gathers together a large set of beta barrel membrane proteins.Although these proteins have different numbers of beta strands in the barrel they have significant sequence similarity between families. 2008-09-03 15:50:29 2005-09-05 10:13:31 13 356 515 4665 118601 1 194 CL0194 DNA_pol_B-like \N DNA polymerase B like Mistry J anon DNA polymerases replicate DNA by adding nucleotide triphosphate (dNTP) residues to the 5'-end of a growing chain of DNA. They use a complementary DNA chain as a template.` 2008-09-03 15:50:29 2005-09-05 13:42:41 9 181 97 2623 7651 1 195 CL0195 DBL \N Duff-binding like superfamily Bateman A anon This clan includes DBL (Duffy-binding like) domains from a variety of plasmodium surface proteins. 2008-09-03 15:50:29 2005-09-05 15:25:46 9 4 55 9 832 1 196 CL0196 DSRM \N DSRM-like clan Bateman A anon This clan contains RNA-binding domains. 2008-09-03 15:50:29 2005-09-08 18:03:50 11 267 170 5456 15069 1 197 CL0197 GME \N GME superfamily Bateman A, Shirai H anon This superfamily contains a number of related enzymes such as AstB, peptidyl-arginine deiminase, arginine deiminase and amidinotransferase [1,2]. 2008-09-03 15:50:29 2005-09-15 15:15:35 9 117 27 3437 6472 1 198 CL0198 HHH Helix-hairpin-helix superfamily Bateman A anon This superfamily includes Helix-hairpin-helix DNA-binding domains. 2008-09-03 15:50:29 2005-09-16 17:04:15 15 655 587 6368 81507 1 199 CL0199 DPBB \N Double Psi beta barrel glucanase Bateman A anon The DPBB fold is often an enzymatic domain. The members of this family are quite diverse, and if catalytic this family may contain several different functions [1,2]. This clan represents the barwin like barrels. 2008-09-03 15:50:29 2005-09-16 17:23:52 11 29 104 3343 9071 1 200 CL0200 Prefoldin \N Prefoldin GriffithsJones S, Finn RD, Mistry J anon The Prefoldin domain forms a coiled-coil structure that is involved in substrate-binding in the the chaperone co-factor prefoldin (PFD). Each PFD is assembled from two alpha and four beta subunits. Each alpha subunit contains two, and each beta subunit one, central beta-hairpin that is flanked N- and C-terminally by coiled-coil helices. The N-terminal regions, the prefoldin domain, are found facing into the central cavity of the chaperone. Here exposed hydrophobic patches form an interaction with the substrate (an unfolded protein) [1]. 2008-09-03 15:50:29 2005-09-19 13:51:59 9 9 52 526 2970 1 201 CL0201 Peptidase_SH \N Peptidase clan SH Bateman A anon This clan includes the serine peptidase assemblin from herpes virus as well as other viral peptidase families predicted to be related [1]. 2008-09-03 15:50:29 2005-09-20 12:38:50 9 47 16 1555 2100 1 202 CL0202 GBD Galactose-binding domain-like superfamily Bateman A anon This large superfamily contains beta sandwich domains with a jelly roll topology. Many of these families are involved in carbohydrate recognition. Despite sharing little sequence similarity they do share a weak sequence motif, with a conserved bulge in the C-terminal beta sheet. The probable role of this bulge is in bending of the beta sheet that contains the bulge. This enables the curvature of the sheet forming the sugar binding site [1]. 2008-09-03 15:50:29 2005-09-20 16:50:13 10 807 2917 4159 38880 1 203 CL0203 CBD \N Carbohydrate binding domain superfamily Bateman A anon This superfamily includes several carbohydrate binding domains. These domains have a beta sandwich structure. 2008-09-03 15:50:29 2005-09-21 11:02:43 11 97 423 950 3904 1 204 CL0204 Adhesin \N Bacterial adhesin superfamily Bateman A anon This superfamily includes a variety of bacterial adhesins that have a jelly-roll beta-barrel fold [1]. These domains are involved in sugar recognition. 2008-09-03 15:50:29 2005-09-21 11:17:50 10 156 136 1489 19890 1 205 CL0205 Di-copper \N Di-copper centre-containing domain Bateman A anon This superfamily includes tyrosinases and hemocyanins that share a di-copper centre [1]. 2008-09-03 15:50:29 2005-09-23 12:59:27 10 106 90 1801 4482 1 206 CL0206 TRB \N Transcriptional repressor beta-barrel domain Bateman A anon This beta-barrel domain is found at the C-terminus of a variety of transcriptional repressor proteins. 2008-09-03 15:50:29 2005-09-23 14:02:56 10 132 32 4239 8501 1 207 CL0207 Rhomboid-like \N Integral membrane protein / protease Mistry J anon This clan contains proteins from both bacteria and eukaryotes. The Rhomboid protein is an intramembrane serine protease which is involved in epidermal growth factor (EGF)-dependent signalling pathways [1]. The DER1 family is involved in degradation of misfolded ER proteins [2]. 2008-09-03 15:50:29 2005-12-01 11:27:30 9 16 91 4052 9025 1 208 CL0208 UBC \N Ubiquitin conjugating enzyme like superfamily Bateman A anon This superfamily includes a diverse set of proteins that bind to ubiquitin [1]. 2008-09-03 15:50:29 2005-12-02 15:22:57 10 294 238 870 12063 1 209 CL0209 Bet_V_1_like \N Bet V 1 like Mistry J anon The Bet_V_I family is composed of sequences related to the major Birch (Betula verrucose) pollen antigen Betv1. This allergen is known to cause hayfever, dermatitis, asthma and occasionally anaphylactic shock. The other families in this clan share the same structure as Betv1 which is composed of antiparallel beta sheets and alpha helices. There is a cavity between the beta sheet and a long C terminal helix. The cavity appears to play roles in the binding of lipid molecules [1][2][3] which seems a common feature of the families in this clan. 2008-11-07 17:26:25 2005-12-02 17:50:32 10 443 245 4418 23609 1 210 CL0210 HNOX-like \N Heme NO and oxygen binding like Mistry J anon This clan contains families that bind small molecules and are predominantly involved in signalling. Members include the heme NO binding domain. This domain is related to soluble guanylate cyclases and is mainly alpha helical in structure.\ Other members of this clan include V4R, which is predicted to be a small molecule binding domain, and a domain often found adjacent to this that is found on activators of aromatic catabolism, and on signalling molecules. 2008-09-03 15:50:29 2005-12-06 09:55:10 10 54 47 793 1692 1 212 CL0212 SNARE \N SNARE-like superfamily Bateman A anon This clan includes part of the SNARE like superfamily. 2008-09-03 15:50:29 2005-12-08 17:07:43 8 44 96 372 5553 1 213 CL0213 ShK-like \N Sea anemone toxin k like Mistry J anon Members of this clan include the Crisp domain which is involved in ryanodine receptor Ca2+ signalling, and the ShK domain which is named after the ShK channel inhibitor toxin. Both domains are cysteine rich and contain multiple disulphide bonds [1][2][3]. 2008-09-03 15:50:29 2006-01-03 16:32:02 8 27 121 194 2516 1 214 CL0214 UBA \N UBA superfamily Bateman A anon This superfamily includes domains related to the UBA domain. These domains are often involved in ubiquitin binding. 2008-09-03 15:50:29 2006-01-05 15:54:42 12 148 350 5009 18121 1 217 CL0217 Rotavirus_VP7 \N Rotavirus VP7 protein Bateman A anon This clan consists of several Rotavirus major outer capsid protein VP7 sequences. The rotavirus capsid is composed of three concentric protein layers. Proteins VP4 and VP7 comprise the outer layer. VP4 forms spikes and is the viral attachment protein. VP7 is a glycoprotein and the major constituent of the outer protein layer [1]. 2008-09-03 15:50:29 2006-01-24 14:08:46 8 28 2 702 4849 1 218 CL0218 ox_reductase_C \N Oxidoreductase C terminal like Mistry J anon This clan contains the C terminal region of oxidoreductase proteins and putative oxidoreductase proteins. Families in this clan form an alpha/beta structure and are usually found adjacent to an N terminal Rossman fold. 2008-09-03 15:50:29 2006-01-26 11:16:09 11 193 37 3633 12363 1 219 CL0219 RNase_H \N Ribonuclease H-like superfamily Bateman A anon This clan includes a diverse set of nucleases that share a similar structure to Ribonuclease H. 2008-09-03 15:50:29 2006-01-31 17:56:29 13 1007 2352 7614 190937 1 220 CL0220 EF_hand \N EF-hand like superfamily Bateman A anon The EF hand is a calcium binding domain found in a wide variety of proteins [1]. 2008-09-03 15:50:29 2006-02-01 09:23:33 11 1170 2215 3113 56360 1 221 CL0221 RRM \N RRM-like clan Bateman A anon This clan contains families that are related to the RNA recognition motif domains. However, not all these families are RNA binding. 2008-09-03 15:50:29 2006-03-05 12:30:44 10 736 1162 5818 73704 1 222 CL0222 MviN_MATE \N MviN, MATE-like superfamily Bateman A anon This superfamily consists of a variety of integral membrane protein families. The MATE family are known to be transporters. Other proteins have been implicated in virulence and polysaccharide biosynthesis. 2008-09-03 15:50:29 2006-03-05 13:21:19 7 4 109 5014 53746 1 223 CL0223 MACRO \N MACRO domain superfamily Bateman A anon This superfamily includes the Macro domain as well as the amino terminal domain from peptidase M17 proteins. 2008-09-03 15:50:29 2006-03-06 10:58:15 7 147 130 4353 8980 1 224 CL0224 DHQS \N Dehydroquinate synthase-like superfamily Bateman A anon This superfamily includes Dehydroquinate synthase and Iron containing alcohol dehydrogenase which have a similar active site organisation [1]. 2008-09-03 15:50:29 2006-03-06 13:57:58 7 116 47 4875 18804 1 225 CL0225 FtsL \N FtsL-like superfamily Bateman A anon This clan includes two proteins that are known to interact, FtsL and DivIC which are part of a trimeric complex with DivIB [2]. DivIC and FtsL are bacterial proteins essential for cell division. 2008-09-03 15:50:29 2006-03-06 15:46:31 7 0 9 3920 6384 1 226 CL0226 M6PR \N Mannose 6-phosphate receptor Bateman A anon This clan includes cation dependent and independent mannose 6-phosphate receptors. 2008-09-03 15:50:29 2006-03-06 16:32:14 8 48 95 319 2892 1 227 CL0227 Enolase_N \N Enolase N-terminal domain-like superfamily Bateman A anon This domain is found at the N-terminus of the catalytic Tim barrel-like domain in enolase and other enzymes. 2008-09-03 15:50:29 2006-03-06 16:37:34 8 887 42 5636 14408 1 228 CL0228 Acyltransferase Acyltransferase clan Bateman A anon This clan includes several families of related acyltransferases. 2008-09-03 15:50:29 2006-03-06 16:43:34 7 2 169 4885 24321 1 229 CL0229 RING \N Ring-finger/U-box superfamily Bateman A anon This clan includes the Ring zinc finger domains as well as the U-box domain that appears to have lost the zinc coordinating cysteine residues [1]. 2008-09-03 15:50:29 2006-03-06 16:54:46 10 159 1983 4407 49901 1 230 CL0230 HO \N Heme oxygenase-like superfamily Bateman A anon This clan includes the Heme oxygenase family as well as the TENA/THI-4/PQQC family that are less well characterised [2]. 2008-09-03 15:50:29 2006-03-06 16:57:26 7 195 41 2847 5166 1 231 CL0231 MazG \N all-alpha NTP pyrophosphohydrolase superfamily Bateman A anon This superfamily includes MazG, HisE and dimeric dUTPases (Not yet in Pfam) [1]. 2008-09-03 15:50:29 2006-03-06 17:03:44 8 112 46 4768 13122 1 232 CL0232 NifU \N NifU C-terminal domain-like superfamily Bateman A anon This clan includes the C-terminal domain of NifU as well as a large family of uncharacterised domains. 2008-09-03 15:50:29 2006-03-06 17:10:22 7 28 49 4450 9837 1 233 CL0233 SufE_NifU \N SufE/NifU superfamily Bateman A anon This clan includes iron sulfur cluster assembly proteins. 2008-09-03 15:50:29 2006-03-06 17:18:24 7 30 26 4741 7304 1 234 CL0234 CTPT \N CTP transferase-like superfamily Bateman A anon This clan includes the integral membrane CTP transferase family as well as a large family of uncharacterised proteins that may also function as nucleotidyltransferases. 2008-09-03 15:50:29 2006-03-06 17:30:58 7 0 20 4821 6518 1 235 CL0235 PspA \N PspA/ESCRT-III Bateman A anon This clan includes PspA like proteins that are transcriptional activators as well as Snf7, a protein involved in cellular trafficking. 2008-09-03 15:50:29 2006-03-07 09:10:43 8 24 36 2194 5578 1 236 CL0236 PDDEXK PD-(D/E)XK nuclease superfamily Bateman A anon This clan includes a large number of nuclease families related to holliday junction resolvases [1,2]. 2012-10-03 14:09:06 2006-03-07 10:09:50 16 584 895 6041 71985 1 237 CL0237 HD_PDEase \N HD/PDEase superfamily Bateman A anon This clan includes a range of phosphohydrolase enzymes with a common helical fold. 2008-09-03 15:50:29 2006-03-07 17:08:35 7 446 682 5155 45938 1 238 CL0238 PP2C \N PP2C-like superfamily Bateman A anon This clan includes the PP2C family of phosphatases as well as the SpoIIE family. This suggests SpoIIE proteins may also be phosphatases. 2008-09-03 15:50:29 2006-03-08 17:11:21 7 81 723 3947 17476 1 239 CL0239 Insulin \N Insulin-like superfamily Bateman A anon This superfamily includes the insulin like hormones. 2008-09-03 15:50:29 2006-03-09 13:10:30 7 895 8 341 1563 1 240 CL0240 PFK \N PFK-like superfamily Bateman A anon This clan includes two SCOP superfamilies. Strong similarities between NAD kinases, DAG kinase, sphingosine kinase and PFK have previously been shown[1]. 2008-09-03 15:50:29 2006-03-09 16:35:37 7 146 173 5088 20115 1 241 CL0241 ABC_membrane \N ABC transporter membrane domain clan Bateman A anon This clan includes families that are the membrane components of ABC transporter complexes. In general these regions are composed of six transmembrane helices [1]. 2008-09-03 15:50:29 2006-03-09 17:38:52 7 22 211 5258 54224 1 242 CL0242 DNA_primase_lrg \N DNA primase large subunit like Mistry J anon This clan contains the large subunit of archaeal and eukaryotic DNA primase, an enzyme which synthesises the oligoribonucleotide primers essential to DNA replication. The large subunit of DNA primase forms interactions with the small subunit and the structure implicates that it is not directly involved in catalysis, but plays a roles in correctly positioning the primase/DNA complex, and in the transfer of RNA to DNA polymerase [1]. The clan also contains the Lef-2 family, which is required for the expression of late genes. There is some evidence to suggest that LEF2 binds to both DNA and the DNA primase small subunit LEF-1 [3]. 2008-09-03 15:50:29 2006-04-21 14:57:17 8 11 11 506 609 1 243 CL0243 AEP Archaeo-eukaryotic primase Mistry J anon This clan includes the small subunit of 2 and eukaryotic DNA primase, and primase-helicase proteins from bacteriophages and plasmids. All known cellular life forms use primases to synthesis a short RNA primer which is extended during DNA replication by a polymerase. Bacterial DNA primase adopts a different fold to archaeal and eukaryotic primases and belongs to a different superfamily. 2008-09-03 15:50:29 2006-04-24 16:40:55 9 21 52 1612 2766 1 244 CL0244 PGBD \N PGBD superfamily Bateman A anon This clan consists of small putative peptidoglycan binding domains composed of three alpha helices. 2008-09-03 15:50:29 2006-05-16 17:51:06 8 23 345 3348 11009 1 245 CL0245 EDD \N EDD superfamily Bateman A anon The EDD superfamily was identified as an evolutionarily conserved domain (EDD) common to three different folds: mannose transporter EIIA domain (EIIA-man), dihydroxyacetone kinase (Dak), and DegV [1]. Both Dak and EIIA-man perform similar phosphotransfer reactions, suggesting a phosphotransferase activity for the DegV-like family of proteins, whose function other than lipid binding revealed in the crystal structure remains unknown [1]. 2008-09-03 15:50:29 2006-05-17 09:35:18 8 92 62 3559 18450 1 246 CL0246 ISOCOT_Fold NagB-like; Isomerase,CoA transferase & Translation initiation factor Superfamily Bateman A, Anantharaman V anon This superfamily contains a variety of enzymes and non-enzymatic ligand binding domains. 2008-09-03 15:50:29 2006-05-17 13:20:27 7 279 161 5233 53370 1 247 CL0247 2H \N 2H phosphoesterase superfamily Bateman A anon This clan includes a number of phosphoesterases that contain an internal duplication. 2008-09-03 15:50:29 2006-05-17 15:53:15 8 24 78 2901 4878 1 248 CL0248 ParBc \N ParB-like superfamily Bateman A anon This superfamily includes nucleases related to ParB as well as uncharacterised proteins. 2008-09-03 15:50:29 2006-06-02 16:12:35 7 28 97 4788 13574 1 249 CL0249 Phage_tail_L \N Phage minor tail protein L clan Bateman A anon This clan includes the phage minor tail protein L as well as a group of uncharacterised proteins that are also presumably phage components. 2008-09-03 15:50:29 2006-06-02 18:35:28 7 0 7 706 1564 1 250 CL0250 GAD \N GAD domain superfamily Bateman A anon This domain is found as an insert within aspartyl-tRNA synthetase as well as GatB proteins. 2008-09-03 15:50:29 2006-07-27 15:00:10 6 16 17 4522 4847 1 251 CL0251 MORN \N MORN repeat Mistry J anon The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000;6:11-22).\ A MORN-repeat protein has been identified in the parasite Toxoplasma gondiis as dynamic component of cell division apparatus [1].\ It has been hypothesised to function as a linker protein between certain membrane regions and the parasite's cytoskeleton [1]. 2008-09-03 15:50:29 2006-07-31 12:59:47 7 27 334 1412 30871 1 252 CL0252 NfeD-like \N NfeD like Mistry J anon This clan includes the NfeD family which contains several proteins described as nodulation efficiency protein D (NfeD). The nfe genes (nfeA, nfeB, and nfeD) are involved in the nodulation efficiency and competitiveness of the Sinorhizobium meliloti strain GR4 on alfalfa roots [1]. The specific function the NfeD family is unknown although it is unlikely that NfeD is specifically involved in nodulation as the family contains several different archaeal and bacterial species most of which are not symbionts. 2008-09-03 15:50:29 2006-07-31 13:35:37 6 3 8 2902 4042 1 254 CL0254 THDP-binding Thiamin diphosphate-binding superfamily Mistry J, Bateman A anon This clan includes pyruvate dehydrogenases, branched chain alpha-keto acid decarboxylases, phosphoketolases and the pyrimidine binding region of transketolases. 2008-09-03 15:50:29 2006-07-31 17:23:48 8 598 271 5650 93998 1 255 CL0255 ATP_synthase ATP synthase F0 subunit Mistry J anon This clan contains subunits of the F0 complex of ATP-synthase. The F0 complex is the non-catalytic unit of ATPase and is involved in proton translocation across membranes. 2008-09-03 15:50:29 2006-08-01 13:45:46 8 29 78 10710 23627 1 256 CL0256 Enolase_TIM \N Enolase like TIM barrel Mistry J anon This clan contains enzymes which adopt a TIM barrel fold. 2008-09-03 15:50:29 2006-08-01 15:00:35 6 1004 67 6025 21543 1 257 CL0257 Acetyltrans Acetyltrans-like; N-acetyltransferase like Mistry J anon This clan contains families related to N-acetyltransferases. N-acetyltransferases catalyse the transfer of acetyl groups from acetyl-CoA to arylamines. 2008-09-03 15:50:29 2006-08-02 14:16:08 8 664 967 6524 128815 1 258 CL0258 DALR \N DALR superfamily Bateman A anon Members of this family are anticodon binding domains from various tRNA synthetases. 2008-09-03 15:50:29 2006-08-17 18:00:51 6 13 28 4898 11747 1 259 CL0259 OstA \N OstA superfamily Bateman A anon This superfamily includes the OstA family as well as a large family of uncharacterised proteins. 2008-09-03 15:50:29 2006-08-22 10:05:45 6 11 32 2407 6093 1 260 CL0260 NTP_transf \N Nucleotidyltransferase superfamily Bateman A anon This clan contains a diverse set of nucleotidyltransferase enzymes. 2008-09-03 15:50:29 2006-08-22 16:46:46 7 338 577 5508 42917 1 261 CL0261 NUDIX NUDIX superfamily Bateman A anon This superfamily contains the NUDIX family and one related family. 2008-09-03 15:50:29 2006-08-24 18:58:30 6 360 321 5236 48986 1 262 CL0262 Trigger_C \N Trigger factor/SurA domain Bateman A anon This helical domain is found in two families of chaperones. It is found at the N terminus of the SurA proteins and at the C-terminus of the trigger factors where presumably it shares a common but as yet unknown function. 2008-09-03 15:50:29 2006-08-30 10:54:44 6 20 54 4464 9322 1 263 CL0263 His-Me_finger His-Me finger endonuclease superfamily Bateman A anon This superfamily defined originally by SCOP contains a diverse range of endonucleases. Later Grishin identified the MH1 domain as belonging to the superfamily [1]. 2008-09-03 15:50:29 2006-09-06 17:41:01 7 141 462 5328 22334 1 264 CL0264 SGNH_hydrolase \N SGNH hydrolase superfamily Bateman A anon This superfamily contains a diversity of hydrolytic enzyme activities. 2008-09-03 15:50:29 2006-09-07 09:23:58 6 122 427 4610 21200 1 265 CL0265 HIT \N HIT superfamily Bateman A anon The HIT superfamily are a superfamily of nucleotide hydrolases and transferases, which act on the alpha-phosphate of ribonucleotides [1]. 2008-09-03 15:50:29 2006-11-09 16:05:44 6 147 99 4930 15333 1 266 CL0266 PH \N PH domain-like superfamily Bateman A anon Members of this clan share a PH-like fold. Many families in this clan bind to short peptide motifs in proteins and are involved in signalling. 2008-09-03 15:50:29 2006-11-10 10:42:30 8 419 1526 2586 41992 1 267 CL0267 S11_L18p \N Ribosomal protein S11/L18p superfamily Bateman A anon This superfamily includes two ribosomal proteins S11 and L18p as well as a domain from eukaryotic peptide chain release factor. This superfamily is likely to share an RNA-binding function. 2008-09-03 15:50:29 2006-11-10 14:46:27 7 451 37 5925 12981 1 268 CL0268 Pec_lyase-like Pec_lyase; Pectate_lyase; Pectate lyase-like beta helix Bateman A anon This superfamily all contain a right handed beta helix similar to that first found in pectate lyase [1]. 2008-09-03 15:50:29 2006-11-10 15:10:57 6 217 1681 4399 45644 1 269 CL0269 Maf \N Maf/Ham1 superfamily Bateman A anon This superfamily includes the Maf-like proteins and ITPases related to YjjX [1]. 2008-09-03 15:50:29 2006-11-10 17:10:08 7 21 29 3802 6399 1 270 CL0270 Iso_DH \N Isocitrate/Isopropylmalate dehydrogenase-like superfamily Bateman A anon This superfamily of enzymes form dimers and have an active site between the two halves. 2008-09-03 15:50:29 2006-11-10 17:50:17 6 323 67 6496 27220 1 271 CL0271 F-box \N F-box-like domain Bateman A anon This clan includes classical F-boxes and the PRANC domain found in pox ankyrin proteins. 2008-09-03 15:50:29 2006-11-22 13:28:07 6 29 805 512 18595 1 272 CL0272 RGS \N RGS-like superfamily Bateman A anon This clan includes RGS domains that possess an alpha helical fold. 2008-09-03 15:50:29 2006-11-27 17:23:03 6 92 152 303 4401 1 273 CL0273 CYTH \N CYTH-like phosphatase superfamily Bateman A anon CyaB like adenylyl cyclase and the mammalian thiamine triphosphatases define a novel superfamily of catalytic domains called the CYTH domain that is present in all three superkingdoms of life. The catalytic core of these enzymes contain a novel alpha beta scaffold with 6 conserved acidic residues and 4 basic residues [1]. 2008-09-03 15:50:29 2006-12-05 13:50:23 6 48 23 3410 3949 1 274 CL0274 WRKY-GCM1 \N WRKY-GCM1 superfamily Bateman A anon WRKY and GCM1 are metal chelating DNA-binding domains (DBD) which share a four stranded fold [1]. We present evidence that they share a stabilising core, which suggests a possible origin from a BED finger-like intermediate that was in turn ultimately derived from a C2H2 Zn-finger domain [1]. 2008-09-03 15:50:29 2006-12-05 13:57:43 7 5 215 443 6439 1 275 CL0275 HAS-barrel \N HAS-barrel superfamily Bateman A anon The HAS barrel is named after HerA-ATP Synthase. In ATP synthases, this domain is implicated in the assembly of the catalytic toroid and docking of accessory subunits, such as the subunit of the ATP synthase complex. Similar roles in docking of the functional partner, the NurA nuclease, and assembly of the HerA toroid complex appear likely for the HAS-barrel of the HerA family [1]. 2008-09-03 15:50:29 2006-12-06 11:13:24 8 401 41 11124 22310 1 276 CL0276 Nucleot_cyclase \N Nucleotide cyclase superfamily Bateman A anon This superfamily includes adenylyl cyclase and the GGDEF domain [1]. 2008-09-03 15:50:29 2006-12-06 14:41:38 7 119 2538 3679 52683 1 277 CL0277 FAD-oxidase_C \N FAD-linked oxidase C-terminal domain superfamily Bateman A anon This clan consists of a duplicated subdomain in a variety of FAD-liked oxidase/dehydrogenase enzymes. 2008-09-03 15:50:29 2006-12-07 10:48:02 6 129 128 3530 14475 1 278 CL0278 AIG2 \N AIG2/ChaC-like superfamily Bateman A anon The structure consists of a five-stranded beta-barrel surrounded by two alpha-helices and a small beta-sheet.\ Conservation of residues in a hydrophilic cavity able to bind small ligands in some members suggests that this may also serve as an active site. 2008-09-03 15:50:29 2006-12-12 14:39:36 7 20 48 2219 4183 1 279 CL0279 GatB_YqeY \N YqeY-like superfamily Bateman A anon This superfamily includes a domain from GatB as well as one from YqeY. Although being structurally distinct they share a common sequence relationship. 2008-09-03 15:50:29 2007-01-24 12:42:11 6 38 17 4060 6337 1 280 CL0280 PIN \N PIN domain superfamily Bateman A anon This superfamily contains a variety of nuclease enzymes, including PIN domains and the FLAP exonucleases. 2008-09-03 15:50:29 2007-01-25 17:50:12 6 117 241 5231 29721 1 281 CL0281 CCT \N CCT like-motif Bateman A anon This clan includes the CCT motif as well as a related motif that is similar to the first half of the CCT motif. 2008-09-03 15:50:29 2007-01-29 13:51:28 6 21 28 152 1759 1 282 CL0282 Serum_albumin \N Serum albumin superfamily Bateman A anon This superfamily includes serum albumin and related families. 2008-09-03 15:50:29 2007-01-30 15:32:33 6 113 8 71 738 1 283 CL0283 LigB \N LigB-like superfamily Bateman A anon This clan includes the LigB subunit of the aromatic ring opening dioxygenase LigAB [1]. The clan also includes the Memo-like proteins. 2008-09-03 15:50:29 2007-02-12 17:08:37 6 13 17 2285 4015 1 284 CL0284 Allatostatin \N Allatostatin superfamily Mistry J anon Allatostatins are pleiotropic neuropeptides. In some insects they are known to inhibit the synthesis of juvenile hormone, an important regulator of development and reproduction. The full role of allatostatins in hormone production is still unclear [1]. 2008-09-03 15:50:29 2007-05-04 14:22:41 5 0 16 41 455 1 285 CL0285 YycI_YycH \N YycI/YycH superfamily Bateman A anon Both, YycH and YycI are always found in a pair on the chromosome, downstream of the essential histidine kinase YycG. Additionally, both proteins share a function in regulating the YycG kinase with which they appear to form a ternary complex. Structural studies show that these two protein families share two related domains. 2008-09-03 15:50:29 2007-05-14 11:16:06 6 13 4 723 1328 1 286 CL0286 GCS \N gamma-glutamylcysteine synthetase/glutamine synthetase clan Bateman A, Pei J anon This clan represents a superfamily of carboxylate-amine/ammonia ligases [1] that includes Gamma-Glutamylcysteine synthetase (gamma-GCS) and glutamine synthetase (GS). Gamma-Glutamylcysteine synthetase (gamma-GCS) catalyses the first step in the de novo biosynthesis of glutathione. 2008-09-03 15:50:29 2007-08-10 14:04:49 5 394 88 9114 22193 1 287 CL0287 Transthyretin \N Transthyretin superfamily Bateman A anon This clan unifies several SCOP superfamilies that all share a 7 stranded beta sandwich fold. 2008-09-03 15:50:29 2007-08-14 16:23:44 6 929 1227 4080 40864 1 288 CL0288 DAP_epimerase \N DAP epimerase superfamily Bateman A anon This superfamily includes DAP epimerase and proline racemase as well as the PrpF protein. It has been suggested that this fold may have evolved from the HotDog fold [1]. 2008-09-03 15:50:29 2007-08-17 11:58:37 5 61 43 3955 13393 1 289 CL0289 FBD \N Folate binding domain Bateman A anon This folate binding domain is found in the GCV T protein as well as the sarcosine oxidase gamma subunit [1]. 2008-09-03 15:50:29 2007-08-17 13:17:41 5 63 59 4863 12811 1 290 CL0290 EPT_RTPC \N EPT/RTPC-like superfamily Bateman A anon This superfamily includes Enolpyruvate transferase (EPT) and RNA 3'-terminal phosphate cyclase (RTPC). 2008-09-03 15:50:29 2007-08-17 13:33:07 5 195 54 4992 12126 1 291 CL0291 KNTase_C Nucleotidyltransferase substrate binding domain Bateman A anon This alpha helical domain is found associated with a variety of nucleotidyltransferase domains. 2008-09-03 15:50:29 2007-08-20 15:49:55 6 35 91 3566 11834 1 292 CL0292 LysE LysE transporter superfamily Bateman A anon This clan includes a diverse range of transporter families [1]. 2008-09-03 15:50:29 2007-10-04 12:03:53 5 0 68 4993 52424 1 293 CL0293 CDC \N Cholesterol-dependent cytolysin superfamily Bateman A anon This superfamily includes the MACPF domain as well as the Cholesterol-dependent cytolysins [1]. 2008-09-03 15:50:29 2007-10-19 12:42:17 5 21 68 720 2339 1 294 CL0294 Sec10 \N Sec10-like superfamily Bateman A anon This superfamily includes large proteins that are parts of the conserved oligomeric Golgi complex and exocyst complex. 2008-09-03 15:50:29 2007-11-20 17:02:32 5 0 34 298 1198 1 295 CL0295 Vps51 Vps51 domain superfamily Bateman A anon This clan includes an N-terminal domain from several vesicle transport proteins that are related to Vps51. 2008-09-03 15:50:29 2007-11-20 17:11:52 6 11 168 380 6736 1 296 CL0296 GroES \N GroES-like superfamily Bateman A anon This superfamily includes the GroES protein as well as the N-terminal GroES-like domain from Alcohol dehydrogenase. 2008-09-03 15:50:29 2008-02-08 16:40:31 4 569 527 6115 49027 1 297 CL0297 PhoU \N PhoU-like superfamily Bateman A anon This superfamily includes PhoU and its relatives that contain a three helical bundle domain structure. 2008-09-03 15:50:29 2008-02-08 17:40:34 4 21 32 4083 13356 1 298 CL0298 tRNA_bind_arm \N tRNA-binding arm superfamily Bateman A anon This domain is found in Phe and Ser tRNA synthetases at the N-terminus, and at the C-terminus of Val tRNA synthetase. The domain is composed of two helices. 2008-09-03 15:50:29 2008-02-12 16:57:18 4 43 35 4877 13679 1 299 CL0299 Peptidase_SF \N Peptidase clan SF Mistry J, Rawlings N anon This clan includes the peptidase S24 and S26 families. These families adopt a mainly beta fold. Members of the family S24 have an additional C-terminal domain containing a bundle of three helices presumably important for binding DNA. 2008-09-03 15:50:29 2008-02-25 12:55:51 4 47 75 5096 20383 1 300 CL0300 TAT \N Twin-Arginine Translocation Motif Bateman A anon This motif is found in a wide range of secreted proteins. It is named after the conserved pair of arginines that is followed by a hydrophobic stretch. 2008-09-03 15:50:29 2008-02-29 15:32:23 4 26 172 1437 2441 1 301 CL0301 PA14 \N PA14 superfamily Bateman A anon This clan includes the PA14 domain and related families. 2008-09-03 15:50:29 2008-03-03 17:31:48 5 32 336 731 2126 1 302 CL0302 Arginase \N Arginase/deacetylase superfamily Bateman A anon This superfamily includes arginase enzymes as well as histone deacetylases and related enzymes [1]. 2008-09-03 15:50:29 2008-04-30 14:26:28 5 410 86 3456 11530 1 303 CL0303 H2TH \N Helix-two-turns-helix superfamily Bateman A anon This domain is thought to play a role in binding nucleic acids. It is DNA binding in nucleases and RNA-binding in ribosomal S13. 2008-09-03 15:50:29 2008-05-28 16:24:30 4 289 53 5430 13968 1 304 CL0304 CheY \N CheY-like superfamily Bateman A anon This clan includes the CheY-like response regulators from bacteria [1-2]. 2008-09-03 15:50:29 2008-06-04 14:50:30 4 475 3478 5545 154594 1 305 CL0305 PTH2 \N Peptidyl-tRNA hydrolase II superfamily Bateman A anon This clan includes Peptidyl-tRNA hydrolase II as well as a large family of uncharacterised proteins called DUF2000. A structure for DUF2000 shows a similar structure to PTH2. It is not clear if the DUF2000 family are also Peptidyl-tRNA hydrolases. Both families contain a conserved positively charged residue close to the amino terminus that may be part of the active site. 2008-09-03 15:50:29 2008-06-05 14:00:28 4 24 15 963 1328 1 306 CL0306 HeH \N LEM/SAP HeH motif Bateman A anon This superfamily includes protein domains with the helix-extended loop-helix (HeH) structure. 2008-09-03 15:50:29 2008-09-03 10:49:37 3 74 240 4102 7665 1 307 CL0307 FUSC Fusaric acid resistance protein-like superfamily Bateman A anon Members of this clan are likely to be integral membrane bound transporters. 2008-09-05 13:12:16 2008-09-05 14:12:16 3 0 73 3485 13824 1 308 CL0308 DMSO_reductase \N Dimethyl sulfoxide reductase type II family Coggill P anon This clan includes members that are type II dimethyl sulfoxide reductase families, all of which are also membrane anchor proteins belonging to the iron-sulfur molybdoenzyme (CISM) family [1]. 2008-09-23 15:56:29 2008-09-23 16:56:29 3 8 25 1456 4376 1 310 CL0310 DinB DinB-like superfamily Bateman A anon This superfamily are thought to be metalloenzymes. They possess a four helical bundle core structure with a beta hairpin. Members of the superfamily have a predicted active site composed of three histidines that chelate Nickel or Zinc. In some cases these histidines are replaced with Aspartate or Glutamate. Mostly they form a dimeric structure. The dinB gene is one of the DNA-damage-induced genes and the corresponding protein, DinB, is the founding member of the clan. The protein contains a four-helix up-down-down-up bundle that has previously been described in the literature in three disparate proteins: the enzyme MDMPI (mycothiol-dependent maleylpyruvate isomerase), YfiT and TTHA0303, a member of a small DUF (domain of unknown function). Most (but not all) clan members seem to have the ability to coordinate a metal ion using a conserved histidine-triad motif. The proteins that share the fold exhibit four different quaternary structures: monomeric and three different dimeric forms [1]. 2008-10-08 12:01:30 2008-10-08 13:01:30 3 36 73 2472 9715 1 311 CL0311 SCP2 \N SCP-2 sterol transfer superfamily Bateman A anon This superfamily includes the SCP2 family as well as a domain from the mycothiol dependent maleylpyruvate isomerase. 2008-10-13 14:19:21 2008-10-13 15:19:21 3 29 62 2115 4976 1 312 CL0312 HemS_ChuX \N Heme iron utilization protein-like superfamily Bateman A anon This superfamily includes HemS and ChuX like protein families. 2008-10-22 16:49:13 2008-10-22 17:49:13 3 13 11 760 1473 1 314 CL0314 PP-binding \N ACP-like superfamily Bateman A anon \N 2008-11-19 12:30:53 2008-11-19 12:30:53 3 127 3226 5406 42314 1 315 CL0315 Gx_transp Gx transporter superfamily Bateman A anon This superfamily includes a wide range of transporters that contain many conserved glycine residues in the presumed transmembrane regions. 2008-11-21 16:40:49 2008-11-21 16:40:49 3 4 132 4338 23854 1 316 CL0316 Acyl_transf_3 \N Membrane acyl transferase superfamily Bateman A anon This superfamily includes a wide variety of integral membrane acyltransferase enzymes that often acylate sugars. 2008-11-24 13:14:42 2008-11-24 13:14:42 3 0 72 4090 17608 1 317 CL0317 Multiheme_cytos Multiheme cytochrome superfamily Coggill P anon This family includes cytochromes that contain multiple CxxCH motifs. 2008-12-10 17:38:20 2008-12-10 17:38:20 3 247 280 1655 11292 1 318 CL0318 Cytochrome-c Cytochrome c superfamily Coggill P anon This family includes proteins where a covalently-bound haem completes the core. The core is three helices in an open folded leaf formation. The members are monodomain cytochromes. 2008-12-12 14:30:33 2008-12-12 14:30:33 3 552 457 3452 28903 1 319 CL0319 SHS2 Rob_SOUL; SHS2 domain Bateman A, Anantharaman V anon SHS2 is a novel domain with a simple fold containing a core of 3 strands, forming a curved sheet, and a single helix in a strand-helix-strand-strand (SHS2) configuration [1]. SHS2 is found in the bacterial cell division ATPase FtsA, the archaeo-eukaryotic RNA polymerase subunit Rpb7p, the GyrI superfamily, and the uncharacterized MTH1598/Tm1083-like proteins [1]. The fold exists as single copy versions in FtsA (where it is inserted into the RNAseH fold), Rbp7p and Dodecin [1]. It is found as a diad in the GyrI superfamily. In MTH1598/Tm1083-like proteins two copies of SHS2 are found with one inserted into another [1]. The single-copy versions in FtsA and Rbp7 mediate protein–protein interactions, while the one in Dodecin is a small molecule binding domain. The GyrI also binds small molecule, while the MTH1598 is predicted to be enzymatic [1]. 2008-12-12 19:39:43 2008-12-12 19:39:43 3 250 83 4602 13173 1 320 CL0320 PepSY \N PepSY domain-like superfamily Bateman A anon This family includes the PepSY domain as well as a family of uncharacterised proteins. 2008-12-15 13:39:08 2008-12-15 13:39:08 3 53 124 3218 10624 1 321 CL0321 PLAT \N PLAT domain like superfamily Bateman A anon This domain has an 8-stranded sandwich structure. 2008-12-15 14:40:00 2008-12-15 14:40:00 3 91 192 284 3462 1 322 CL0322 RND_permease \N RND permease superfamily Bateman A anon Different members of the RND superfamily have been shown to transport hydrophobic drugs, fatty acids, bile salts, organic solvents, heavy metals, autoinducers and lipooligosaccharides in bacteria [1]. 2008-12-18 12:52:06 2008-12-18 12:52:06 3 91 154 4816 43065 1 323 CL0323 Patatin \N Patatin/FabD/lysophospholipase-like superfamily Bateman A anon This superfamily of enzymes contains a Ser/Asp catalytic dyad. Members of this superfamily are all serine acylhydrolase enzymes. 2008-12-18 16:43:32 2008-12-18 16:43:32 3 75 1413 5254 25679 1 324 CL0324 Homing_endonuc \N Homing endonuclease-like superfamily Bateman A anon This superfamily includes a variety of LAGLIDADG-like homing endonuclease like families. 2009-01-08 17:06:53 2009-01-08 17:06:53 3 109 309 2676 6450 1 325 CL0325 Form_Glyc_dh \N Formate/glycerate dehydrogenase catalytic domain-like superfamily Bateman A anon This superfamily includes the catalytic domain of a variety of dehydrogenase enzymes. The domain has a flavodoxin-like fold and contains an inserted Rossman fold NAD-binding domain. 2009-01-09 10:02:56 2009-01-09 10:02:56 3 370 93 4952 24576 1 326 CL0326 Reo_sigma \N Virus attachment protein superfamily Bateman A anon This superfamily includes virus attachment proteins that share a common beta sandwich domain. 2009-01-09 10:44:19 2009-01-09 10:44:19 3 291 20 282 678 1 327 CL0327 Pilus \N Pilus subunit Mistry J anon This is a clan contains bacterial pilus subunits and proteins involved in secretion. Pili proteins enable the transfer of plasmid between bacteria. The families in this clan adopt an alpha helical structure which is packed against a beta sheet [2-3]. 2009-01-12 11:15:14 2009-01-12 11:15:14 3 115 552 3218 21160 1 328 CL0328 2heme_cytochrom Transmembrane di-heme cytochrome superfamily Bateman A anon This superfamily includes a variety of different heme binding cytochromes. 2009-05-08 19:48:23 \N 3 111 270 42880 138619 1 329 CL0329 S5 \N Ribosomal protein S5 domain 2-like superfamily Bateman A anon This superfamily contains a wide range of families that possess a structure similar to the second domain of ribosomal S5 protein. 2009-05-08 19:52:37 \N 3 764 344 6126 79017 1 330 CL0330 AVL9 Late secretory pathway transport machinery Bateman A anon Members of this clan are involved in vesicle formation/trafficking. 2009-05-08 19:52:58 \N 3 2 129 304 2924 1 331 CL0331 EpsM General secretion pathway protein M Coggill P anon These families are involved in the general secretory pathways of bacteria and are normally membrane-bound. 2009-05-08 19:53:19 \N 3 6 33 1882 7823 1 332 CL0332 AcetylDC-like Acetyl-decarboxylase like superfamily Coggill P anon These families are double psi-beta barrel structures. 2009-05-08 19:53:39 \N 3 175 114 3736 19949 1 333 CL0333 gCrystallin gCrystallin-like; Gamma-Crystallin-like superfamily Coggill P anon This superfamily includes a number of mammalian crystallins as well as ancestral beta gamma-crystallin precursor structures. 2009-05-08 19:54:54 \N 3 107 65 320 3580 1 334 CL0334 THBO-biosyn Tetrahydrobiopterin biosynthesis-like enzyme superfamily Coggill P anon The families in this clan bind purine or ptein in topologically similar sites between subunits. 2009-05-08 19:57:58 \N 3 423 52 4568 17543 1 335 CL0335 FumRed-TM Fumarate reductase respiratory complex transmembrane subunits Coggill P anon This superfamily constitutes two distinct families: in one family the common fold is contained in a single-chain subunit, in the other it is formed by two chains. 2009-05-08 19:58:20 \N 3 154 20 3092 7355 1 336 CL0336 FMN-binding FMN-binding split barrel superfamily Coggill P anon This includes those related to the ferredoxin reductase-like FAD-binding domain and those that are Pyridoxine 5'-phosphate oxidase (PNP)-like. 2009-05-08 19:58:45 \N 3 224 139 4634 22171 1 337 CL0337 RF \N Release factor superfamily Coggill P anon These families are peptide chain release factors. 2009-05-08 19:59:09 \N 3 19 24 4802 12227 1 339 CL0339 PFL-like PFL-like glycyl radical enzyme superfamily Coggill P anon The N- and C-terminal halves of the structure have similar topologies but in some cases only one is represented by the members here, viz; the C-terminal domain of the R1 subunit of ribonucleotide reductase, and the N-terminal of PFL. The full-length structure is modelled by NRDD. 2009-05-08 19:59:52 \N 3 156 130 5379 23959 1 340 CL0340 PTase-anion_tr Phosphotransferase/anion transport protein superfamily Coggill P anon The families here are the cytoplasmic regions of anion transporter proteins. 2009-05-08 20:00:18 \N 3 25 126 3482 19584 1 341 CL0341 LDH_C \N LDH C-terminal domain-like superfamily Bateman A anon This superfamily includes the C-terminal domain of lactate/malate dehydrogenase as well as the C-terminal domain of the glycosyl hydrolase 4 family. 2009-05-08 20:00:39 \N 3 454 19 7644 15125 1 342 CL0342 TolB_N TolB, N-terminal domain Bateman A anon Members of this superfamily appear to behave like the N-terminal fold of the TolB transport-portal complex protein, which is beta-stranded. 2009-05-08 20:00:59 \N 3 23 67 2109 6978 1 343 CL0343 MHC MHC antigen-recognition domain Coggill P anon This superfamily includes all the Class I-related antigen-recognition domain families. 2009-05-08 20:01:29 \N 3 1002 27 843 46029 1 344 CL0344 4Fe-4S 4Fe-4S ferredoxins Bateman A anon Superfamily includes proteins containing domains which bind to iron-sulfur clusters. Members include bacterial ferredoxins, various dehydrogenases and various reductases. The structure of the domain is an alpha-antiparallel beta sandwich. 2009-05-08 20:01:52 \N 3 366 2266 6690 129668 1 345 CL0345 Aerolisin_ETX Aerolysin/ETX pore-forming domain superfamily Coggill P anon This superfamily includes pore-forming venoms and toxins from bacteria, plants, insects and fish. 2009-05-08 20:02:15 \N 3 26 17 136 416 1 346 CL0346 Ribo_L29 \N Ribosomal protein L29, L29p, superfamily Coggill P anon Superfamily includes Ribosomal protein L29 family and its corresponding mitochondrial ribosomal family, L47. 2009-05-08 20:02:37 \N 3 242 12 4897 5412 1 347 CL0347 Tetraspannin Tetraspannin-like Mistry J, Finn RD anon This clan includes the tetraspanin family which contains four transmembrane regions. The CD20 family also has four transmembrane regions, but its members are not considered true tetraspanins as they lack nearly all of the key functional tetraspanin residues [1]. 2009-05-08 20:04:53 \N 3 8 50 746 6471 1 348 CL0348 Phage_tail Phage virion morphogenesis superfamily Finn RD, Coggill P anon Families involved in joining the tail to the head of the phage as well as those completing the head are included herein. 2009-05-08 20:05:12 \N 3 0 8 1841 2993 1 349 CL0349 DprA MoCo carrier protein-like superfamily Bateman A anon Known family members of this superfamily are required for natural chromosomal and plasmid transformation. DprA is a new member of the recombination-mediator protein family, dedicated to natural bacterial transformation [1]. Superfamily includes lysine_decarboxylases. 2009-05-08 20:05:29 \N 3 59 53 4587 10964 1 350 CL0350 PRC-barrel \N PRC-barrel like superfamily Finn RD, Coggill P anon The PRC-barrel is an all beta barrel domain found in the photosynthetic reaction centre subunit H of the purple bacteria [1]. 2009-05-08 20:05:48 \N 3 122 34 4098 7238 1 351 CL0351 CHCH Coiled-coil helix coiled-coil helix superfamily Bateman A anon The conserved [coiled coil 1]-[helix 1]-[coiled coil 2]-[helix 2] domain (CHCH domain) superfamily members include NADH-ubiquinone oxidoreductases, some cytochrome oxidases and yeast mitochondrial ribosomal proteins. Within each helix of the CHCH domain there are two cysteines present in a C-X9-C motif. 2009-05-08 20:06:07 \N 3 58 39 353 2801 1 352 CL0352 EsxAB WXG100-A/WXG100-B dimer Finn RD anon The WXG100 protein secretion system (Wss) is responsible for the secretion of WXG100 proteins (PF06013), such as ESAT-10 (6 kDa early secreted antigenic target) and CFP-10 (10 kDa culture filtrate protein) in Mycobacterium tuberculosis or EsxA (ESAT-6-like extracellularly secreted protein A) and EsxB in Staphylococcus aureus. These two proteins, generally encoded in the same gene cluster, form a 1:1 heterodimeric complex. These proteins are virulence factors involved in host-pathogen interaction [1], as demonstrated in Mycobacterium tuberculosis, Staphylococcus aureus or Bacillus anthracis. The Wss is encoded in many other Gram-positive (monoderm) bacteria. This superfamily contains a number of DUFs which are closely related and may or may not represent the same family of proteins. 2009-05-08 20:06:25 \N 3 34 135 901 12295 1 353 CL0353 TIMP-like \N TIMP-like superfamily Bateman A anon This superfamily consists of the C-terminal domains of netrins, complement proteins C3, C4, C5, secreted frizzled-related proteins, and type I procollagen C-proteinase enhancer proteins, as well as the homologous N-terminal domains of tissue inhibitors of metalloproteinases (TIMPs). 2009-05-08 20:06:43 \N 3 69 60 208 1675 1 354 CL0354 bBprotInhib \N beta-Barrel protease inhibitors Coggill P anon Superfamily consists of both metalloprotease- inhibitors and staphostatins. 2009-05-08 20:07:02 \N 2 12 6 424 573 1 355 CL0355 CheC-like CheC-like superfamily Bateman A, Tuff TJ anon The chemotactic response regulator superfamily are CheY-P phosphatases. Their structure is two intertwined alpha-beta-(X)-beta(2) motifs. This superfamily comprises two classes of proteins each shown to interact with the chemotaxis response regulator CheY: the FliM switch proteins and the CheC-type phosphatases [1]. FliM is a component of the flagellar switch found across the bacteria and is responsible for binding CheY-P and changing the rotational direction of the flagella. The N-terminal domain is CheC-like and the C-terminal shares the SpoA domain with FliN and FliY. The CheC family is broadly broken down into three phosphatase subfamilies: CheC, CheX, and FliY. All three have an active site consensus sequence of D/S-X(3)-E-X(2)-N-X(22)-P. 2009-05-08 20:07:19 \N 3 20 27 2027 5059 1 356 CL0356 AMP_N-like Creatinase/prolidase N-terminal domain superfamily Coggill P anon Bacterial amino-peptidases and creatinases, where the fold is a ribonuclease H-like motif, are grouped in this superfamily. 2009-05-08 20:07:37 \N 3 91 34 4808 10134 1 357 CL0357 SMAD-FHA SMAD/FHA domain superfamily Bateman A anon Superfamily members carry a few short helices inserted in loops within the 11 strands in 2 sheets (greek-key) of the parent fold. 2009-05-08 20:07:54 \N 3 135 399 2102 12462 1 359 CL0359 Intron-mat_II \N Type II intron maturase-like superfamily Bateman A anon Superfamily includes a variety of transcription factors that bind intron RNA during reverse transcription and splicing. 2009-05-08 20:08:30 \N 3 14 51 26168 39344 1 360 CL0360 MTH1187-YkoF MTH1187/YkoF-like superfamily Coggill P anon Putative cell-wall biogenesis proteins and HMP-binding proteins, all with the same Ferredoxin fold, are included in this superfamily. 2009-05-08 20:08:47 \N 3 35 7 1666 2146 1 361 CL0361 C2H2-zf Classical C2H2 and C2HC zinc fingers Coggill P anon Superfamily of classical and closely related C2H2 or beta-beta-alpha zinc finger DNA-binding domains. 2009-05-08 20:09:05 \N 3 241 8637 2337 328673 1 362 CL0362 RAMPS-Cas5-like CRISPR-associated (Cas) Repair Associated Mysterious Proteins Bateman A, Coggill P anon This group of families is one of several protein families that are always found associated with prokaryotic CRISPRs, themselves a family of clustered regularly interspaced short palindromic repeats, DNA repeats found in nearly half of all bacterial and archaeal genomes. These DNA repeat regions have a remarkably regular structure: unique sequences of constant size, called spacers, sit between each pair of repeats [1]. It has been shown that the CRISPRs are virus-derived sequences acquired by the host to enable them to resist viral infection. The Cas proteins from the host use the CRISPRs to mediate an antiviral response. After transcription of the CRISPR, a complex of Cas proteins termed Cascade cleaves a CRISPR RNA precursor in each repeat and retains the cleavage products containing the virus-derived sequence. Assisted by the helicase Cas3, these mature CRISPR RNAs then serve as small guide RNAs that enable Cascade to interfere with virus proliferation [2]. Cas5 contains an endonuclease motif, whose inactivation leads to loss of resistance, even in the presence of phage-derived spacers [3]. 2009-05-08 20:09:23 \N 3 24 24 1529 4925 1 363 CL0363 H-int Hedgehog/intein (Hint) superfamily Bateman A anon This superfamily includes Hedgehog C-terminal (Hog) autoprocessing domain and Intein (protein splicing domain) families. 2009-05-08 20:09:41 \N 3 29 477 1142 3298 1 364 CL0364 Leu-IlvD \N LeuD/IlvD-like Bateman A anon Superfamily includes LeuD-like, IlvD/EDD C-terminal domain-like, and AF0055-like families. 2009-05-08 20:09:58 \N 3 36 36 4411 10971 1 365 CL0365 MurF-HprK_N MurF and HprK N-domain-like superfamily Bateman A anon This includes both the MurE/MurF-ligases N-terminal domain and HPr kinase/phosphatase HprK N-terminal domain superfamilies. 2009-05-08 20:10:16 \N 3 14 40 3206 4809 1 366 CL0366 JAB Mov34-like; JAB-like superfamily Bateman A, Iyer LM, Zhang D, Aravind L anon This superfamily includes a number of proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits, regulators of transcription factors and ubiquitination-assisting protein families. In eukaryotes and in prokaryotic cognates of the ubiquitin-based modification pathway, they function as ubiquitin isopeptidases/ deubiquitinases. JAB domains are also found in diverse metabolic pathways in prokaryotes such as siderophore and cysteine biosynthesis. Other distinct versions of the JAB domain, such as RadC are predicted to function as nucleases. Structurally, the JAB domain is related to the nucleotide deaminase and binds a Zinc ion in a similar structural location. 2009-05-08 20:10:33 \N 3 35 121 4285 10806 1 367 CL0367 CI-2 CI-2 family of serine protease inhibitors Bateman A anon This superfamily includes a range of universally found subtilases, that are serine proteases. 2009-05-08 20:10:50 \N 3 54 6 411 761 1 368 CL0368 PhosC-NucP1 \N Phospholipase C/P1 nuclease superfamily Coggill P anon This superfamily includes the Phospholipase C and P1-nuclease families. 2009-05-08 20:11:07 \N 3 26 29 2543 5327 1 369 CL0369 GHD Glycosyl hydrolase domain superfamily Bateman A anon This domain is C-terminal to the catalytic beta/alpha barrel domain. The superfamily includes the C-terminal domain of a number of sugar-lytic families. 2009-05-08 20:11:24 \N 3 345 276 4003 18678 1 370 CL0370 Uteroglobin Uteroglobin-like superfamily Coggill P anon Members of this superfamily are disulfide-linked dimers of two identical chains, with 4 helices in each. They constitute important new cat, rat and rabbit allergens that are contributing to asthma world-wide. 2009-05-08 20:11:41 \N 3 12 2 47 309 1 371 CL0371 Inovirus-Coat \N Inovirus (filamentous phage) major coat protein Coggill P anon Superfamily contains a number of filamentous phage coat-protein families. 2009-05-08 20:11:58 \N 3 29 3 67 82 1 372 CL0372 Hy-ly_N \N Hyaluronate lyase-like catalytic, N-terminal domain Bateman A anon This contains virus envelope protein, Chondroitin AC lyase and hyaluronate lyase families. 2009-05-08 20:12:16 \N 3 44 53 774 1055 1 373 CL0373 Phage-coat Phage coat superfamily Coggill P anon A number of different phage coat-proteins are collected together in this superfamily. 2009-05-08 20:12:33 \N 3 76 25 2757 4895 1 374 CL0374 PEP-carboxyk \N PEP carboxykinase-like superfamily Bateman A anon This includes the PEP carboxykinase C-terminal domain and HPr kinase HprK C-terminal domain families. 2009-05-08 20:12:50 \N 3 100 16 5373 7630 1 375 CL0375 Transporter Transporter superfamily Bateman A anon The members of this superfamily are probably all transporter protein domains. 2009-05-08 20:13:08 \N 3 0 42 409 5210 1 376 CL0376 Oxa1 Cytochrome oxidase biogenesis family Bateman A anon The cytochrome oxidase biogenesis families are membrane transporters akin to the E coli protein YidC. For those proteins whose N-termini must reside in the intermembrane space, export is mediated by the Oxa1p export machinery, machinery that depends upon the membrane potential. Qxa1p homologues are found in all living organisms. TCDB:2.A.9. 2009-05-08 20:13:27 \N 3 0 30 4817 6863 1 377 CL0377 FAH \N Fumarylacetoacetate hydrolase, C-terminal domain, superfamily Bateman A anon Superfamily contains fumarylacetoacetate hydrolase and related enzymes, 2009-05-08 20:13:45 \N 3 104 39 3513 10941 1 378 CL0378 ANL Ac-CoA-synth; ANL superfamily Bateman A anon This superfamily consists of enzymes including luciferase, long chain fatty acid Co-A ligase, acetyl-CoA synthetase and various other closely-related synthetases as well as a plant auxin-responsive promoter family. The name ANL derives from from three of the subfamilies - Acyl-CoA synthetases, the NRPS adenylation domains, and the Luciferase enzymes [1]. Members of this superfamily catalyse the initial adenylation of a carboxylate to form an acyl-AMP intermediate, followed by a second partial reaction, most commonly the formation of a thioester [1]. 2009-05-08 20:14:02 \N 3 149 2269 6257 68785 1 379 CL0379 PgaPase \N Pyroglutamate aminopeptidase superfamily Bateman A anon This is a collection of pyrrolidone carboxyl peptidase or pyroglutamate aminopeptidase families from bacteria and archaea. 2009-05-08 20:14:20 \N 3 60 16 1654 2070 1 380 CL0380 IDO-like Indolic compounds 2,3-dioxygenase-like superfamily Bateman A anon Superfamily contains bacterial tryptophan 2,3-dioxygenase and indoleamine 2,3-dioxygenase-like families. 2009-05-08 20:14:37 \N 3 60 18 837 1644 1 381 CL0381 Metallo-HOrase \N Metallo-hydrolase/oxidoreductase superfamily Bateman A anon This superfamily of enzymes including beta-lactamases, thiolesterases, members of the glyoxalase II family that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid all bind two ions of zinc. An additional family of competence proteins essential for natural transformation do not appear to bind zinc, and might be a transporter involved in DNA uptake. 2009-05-08 20:14:54 \N 3 403 305 5380 50665 1 382 CL0382 DNA-mend \N DNA breaking-rejoining enzyme superfamily Bateman A anon This is a superfamily of DNA recombinases, topoisomerases and integrases. 2009-05-08 20:15:13 \N 3 116 209 6017 46990 1 383 CL0383 PheT-TilS Phenylalanine- and lysidine-tRNA synthetase domain superfamily Coggill P anon Families here are thought to contain a putative tRNA-binding structural motif. The families are the C-terminal domains of tRNA-Ile-lysidine and the phenylalanine-tRNA synthetases. 2009-05-08 20:15:30 \N 3 31 54 4735 8465 1 384 CL0384 PLC PLC-like phosphodiesterases Coggill P anon Superfamily consists of Glycerophosphoryl diester phosphodiesterase and phosphatidylinositol-specific phospholipase C families. 2009-05-08 20:15:48 \N 3 108 279 4173 15000 1 385 CL0385 Hydrophilin \N Hydrophilin-like superfamily Coggill P anon This superfamily includes plant and bacterial hydrophilin families. 2009-05-08 20:16:05 \N 3 0 17 752 2456 1 386 CL0386 Ant-toxin_C \N Superantigen toxins, C-terminal domain superfamily Coggill P anon Superfamily contains bacterial super-antigen toxins and the MAP family. 2009-05-08 20:16:23 \N 3 216 9 194 4767 1 387 CL0387 DHFred \N Dihydrofolate reductase-like Coggill P anon Superfamily contains the dihydrofolate reductases and the RibD C-terminal domain-like including HTP reductase families. 2009-05-08 20:16:40 \N 3 474 41 4883 12238 1 388 CL0388 FadR-C \N Fatty acid responsive transcription factor FadR, C-terminal domain Coggill P anon Superfamily includes C-terminal domain ligand-binding GntR families and families of fatty acid responsive transcription factors. This C-terminal domain, an antiparallel array of six alpha helices, forms a barrel-like structure, while a seventh alpha helix forms a 'lid' at the end closest to the N-terminal domain - a separate, DNA-binding winged-helix, domain. 2009-05-08 20:16:57 \N 3 24 47 3168 21611 1 389 CL0389 TRAF \N TRAF domain-like superfamily Coggill P anon Superfamily has a circularly permuted immunoglobulin-fold topology with extra an extra beta-strand. Families include the Math and the SIAH, or Seven in absentia, members. 2009-05-08 20:17:16 \N 3 106 201 1094 6315 1 390 CL0390 zf-FYVE-PHD FYVE/PHD zinc finger superfamily Coggill P anon Superfamily contains a number of zinc-fingers, of the FYVE/PHD type, which are found in several groups of proteins including myelin-associated oligodendrocytic basic proteins (MOBP) Rabphilins, melanophilins, exophilins and myosin-VIIA and Rab-interacting protein families. 2009-05-08 20:17:33 \N 3 184 1096 1231 20485 1 391 CL0391 CAP_C-like Adenylate cyclase associated (CAP) C terminal like Mistry J anon Families in this clan adopt a beta super helix structure [1-2]. The clan includes the C terminal domain of adenylate cyclase which binds binds actin [1]. 2009-05-08 20:17:50 \N 3 14 29 324 1064 1 392 CL0392 Chaperone-J \N Chaperone J-domain superfamily Coggill P anon The J-domain is found in a number of stress-response proteins. It is found at the N-terminal of Hsc20, DnaJ-chaperone in E. coli, and viral large T-antigen proteins; it is also in Hsc40, mammalian auxilin and in both animal and plant DnaJ proteins. It is also found in degenerate form in Pam16 proteins. 2009-05-08 20:18:07 \N 3 72 636 5592 28318 1 393 CL0393 FucI-AraA_C \N FucI/AraA C-terminal domain-like [50443] Coggill P anon The enzymes in this superfamily function as a hexamer, which is the largest structurally known ketol isomerase, that has no sequence or structural similarity to other ketol isomerases. 2009-05-08 20:18:26 \N 3 24 7 1558 2470 1 394 CL0394 DsrEFH-like \N DsrEFH-like superfamily Bateman A anon This is a superfamily of small proteins from phototrophic sulfur bacteria that are involved in oxidisation of intracellular sulfur. 2009-05-08 20:18:45 \N 3 54 21 2423 6201 1 395 CL0395 Tubby_C Tubby C-terminal domain-like Bateman A anon This superfamily contains the scramblase protein family, the Tub family and the DUF567, a family of plant and bacterial proteins of hitherto unknown function. All members are membrane-tethered transcription factors. 2009-05-08 20:19:04 \N 3 9 44 886 2617 1 396 CL0396 Marvel-like MARVEL domain containing superfamily Bateman A anon The MAL and related proteins for vesicle trafficking and membrane link (MARVEL) domain is a module with a four transmembrane-helix architecture that has been identified in proteins of the myelin and lymphocyte (MAL), physins, gyrins and occludin families. 2009-05-08 20:19:22 \N 3 0 28 346 3075 1 397 CL0397 TusA-like SirA-like; TusA-like superfamily Bateman A anon Member families include sulfurtransferase TusA. 2009-05-08 20:19:39 \N 3 8 61 3075 6124 1 398 CL0398 RMMBL_DRMBL \N RNA/DNA-metabolising metallo-beta-lactamase motif Mistry J anon This clan contains the fifth motif found in RNA and DNA metabolising metallo-beta-lactamases. The fifth motif appears to be specific to function [1]. 2009-05-08 20:19:56 \N 3 52 66 3672 6801 1 399 CL0399 Asp-glut_race Aspartate/glutamate racemase superfamily Bateman A anon Superfamily contains aspartate racemase, glutamate racemase, hydantoin racemase and arylmalonate decarboxylase families from fungi, plants, bacteria and archaeal species. 2009-05-08 20:20:14 \N 3 108 17 4346 8931 1 400 CL0400 GG-leader \N Double-Glycine leader-peptide cleavage motif Coggill P anon This is a collection of short bacterial families that carry a distinctive GG-cleavage motif. Conservation C-terminal to the GG-motif is not apparent. However, the families are all interconnected with critical virulence attributes of one kind or another. 2009-05-08 20:20:32 \N 3 5 10 475 1960 1 401 CL0401 AsmA-like AsmA-like OmpF regulator protein superfamily Bateman A anon Families in this collection are AsmA-like. Mutations in the AsmA gene restore the assembly of OmpF, a trimeric outer membrane porin from E coli and related bacteria necessary for the cytotoxic action of group-A colicins. 2009-05-08 20:20:49 \N 3 0 60 2432 11866 1 402 CL0402 Cdc48_2-like Cdc48 domain 2-like Coggill P anon Superfamily contains C-terminal domains of N-ethylmaleimide sensitive fusion proteins, VCP-like ATPases, membrane fusion ATPase p97 domain 2, peroxisome biogenesis factor 1 (PEX-1), domain 2, and ubiquitin fusion degradation protein UFD1 families. 2009-05-08 20:21:06 \N 3 52 46 547 1899 1 403 CL0403 ADC-like \N Acetoacetate decarboxylase-like Finn RD anon Superfamily contains the acetoacetate decarboxylase enzyme family EC:4.1.1.4, and a family of uncharacterized proteins from bacteria. 2009-05-08 20:21:23 \N 3 20 14 682 968 1 404 CL0404 BPD_transp_1 BPD_transp_1-like; BPD transporter like Mistry J, Finn RD anon This clan contains families that are involved in transport of molecules across membranes. It includes the bacterial binding protein-dependent transport system inner membrane component, Pfam:PF00528, which is ATP dependent system involved in transport of a range of substrates [1-2]. 2009-05-08 20:21:41 \N 3 36 162 5100 199771 1 405 CL0405 DNA_b-psBarrel DNA-bdg_psBarrel; DNA-binding pseudo-barrel domain Finn RD anon Superfamily consists of type II restriction endonuclease effector (N-term) domain and plant B3 DNA binding domain families. 2009-05-08 20:21:58 \N 3 5 51 174 2348 1 406 CL0406 YjbJ-CsbD-like YjbJ-CsbD-like superfamily Finn RD anon CsbD is a bacterial general stress response protein. It's expression is mediated by sigma-B, an alternative sigma factor [1]. The role of CsbD in stress response is unclear. YjbJ is a hypothetical protein with a similar structure. 2009-05-08 20:22:16 \N 3 2 5 2526 5640 1 407 CL0407 TBP-like TATA-binding protein like Mistry J, Finn RD anon TBP is a transcription factor whose DNA binding fold is composed of a curved antiparallel beta-sheet [1]. This fold is also found in the N terminal region of DNA repair glycosylases. The N terminal domain of DNA glycosylase has only a single copy of the fold, whereas TBP contains a duplication of this fold [2-3]. 2009-05-08 20:22:35 \N 3 138 32 2007 3848 1 408 CL0408 PUP \N Purine and uridine phosphorylase superfamily Bateman A anon superfamily contains a number of purine nucleoside phosphorylase, uridine nucleoside phosphorylase, and various nucleosidase families of proteins. 2009-05-08 20:22:53 \N 3 825 229 4830 16130 1 409 CL0409 GAP \N GTPase activation domain superfamily Finn RD anon Superfamily contains BCR-homology GTPase activation domain (BH-domain) and p120GAP domain-like, including the GAP related domain of neurofibromin, families. 2009-05-08 20:23:11 \N 3 42 410 365 9451 1 410 CL0410 LEF-8-like LEF-8 like region of RNA polymerase Rpb2 Mistry J, Finn RD anon Late expression factor 8 (LEF-8) is one of the primary components of RNA polymerase produced by polyhedrosis viruses. LEF-8 shows homology to domain 6 of the second largest subunit of prokaryotic DNA-directed RNA polymerase[1]. 2009-05-08 20:23:28 \N 3 138 141 12554 17032 1 411 CL0411 Vir Antigenic variants from Plasmodium cell-surface Finn RD anon Several families of paralogous proteins are included in this superfamily, largely from Plasmodium species. The genome expresses great numbers of them, and they vary subtly from each other. 2009-05-08 20:23:45 \N 3 0 7 9 1914 1 412 CL0412 Frag1-like \N Frag1 like Mistry J, Finn RD anon This clan contains the Frag1/DRA</Sfk1 Pfam family (Pfam:PF10277) and related families. 2009-05-08 20:24:02 \N 3 0 22 963 1905 1 413 CL0413 Toprim-like Toprim domain Finn RD anon This is families that have a conserved region from DNA primase. This corresponds to the Toprim domain common to DnaG primases, topoisomerases, OLD family nucleases and RecR proteins [1]. 2009-05-08 20:24:21 \N 3 91 322 8438 38368 1 414 CL0414 TerB AB-resistance; Tellurits and antibiotic resistance superfamily Finn RD anon A number of families which show resistance to antibiotics or tellurite are included here. 2012-09-28 17:52:30 \N 3 5 36 1833 3980 1 416 CL0416 Anoctamin-like \N Transmembrane protein families of the Anoctamin type Finn RD, Coggill P anon The families are transmembrane families, where some of the members have sodium tolerance activity. 2009-05-08 20:25:15 \N 3 0 98 322 3409 1 417 CL0417 BIR-like \N BIR-like domains Mistry J, Finn RD anon Clan of domains all involved in binding nucleic acid and sharing the sequence motif C3HC. 2009-05-08 20:25:32 \N 3 187 45 401 2356 1 418 CL0418 GIY-YIG \N GIY-YIG endonuclease superfamily Finn RD, Coggill P anon Based on the analysis of genomic distribution, patterns of domain fusions and phylogenetic considerations for individual families, an evolutionary scenario is proposed that explains the emergence and development of the major branches of the GIY-YIG superfamily that links the Slx-type with the UvrC-like endonucleases. Most families appear to target DNA. The GIY-YIG domain has been quite successful in forming monomeric nucleases that utilise additional domains to recognise its DNA targets; this collection of domains can range from extremely simple DNA-binding elements (as in the case of I-TevI) to modules with independent enzymatic activities (as in the case of UvrC or the Penelope elements) [1]. 2009-05-08 20:25:49 \N 3 20 115 5010 11825 1 419 CL0419 T3SS-Chaperone \N Type III secretion system types a and b chaperone Finn RD, Coggill P anon Type III secretion systems of the two forms, a - translocation-associated, and b - bacterial flagellum, share homology between comparable parts. This superfamily includes the chaperone proteins FliJ and HrpB7. 2009-05-08 20:26:06 \N 3 1 5 1738 1982 1 420 CL0420 GlpM-like \N Bacterial membrane GlpM-like group Finn RD, Coggill P anon This is a group of membrane proteins of the GlpM type. 2009-05-08 20:26:23 \N 3 0 3 964 1206 1 421 CL0421 LppaM \N Lipo-protein attachment motif superfamily Bateman A anon Superfamily consists of those protein families wherein the lipo-protein attachmnet motif, usually at the N-terminus, is found. regions of sequence downstream of this motif may not be conserved between families in the clan. 2009-05-08 21:19:03 \N 3 0 25 1166 2588 1 422 CL0422 Fibrinogen_C Fibrinogen C-terminal domain-like Finn RD anon This is a collection of families with C-terminal domains that are of similar structure. 2009-05-08 21:19:20 \N 3 239 308 1080 6291 1 423 CL0423 AhpD-like \N AhpD-like superfamily Bateman A anon Superfamily contains CMD-like, and Antioxidant defense protein AhpD families. 2009-05-08 21:19:37 \N 3 112 49 3494 9816 1 424 CL0424 T3SS-hook Type III secretion system-derived superfamily, hook-related Coggill P anon There is homology between the proteins forming the flagellar system and those involved in translocation of effector proteins at host-infection. Both systems, in Gram-negative bacteria, are Sec-independent but ATPase-dependent. There are up to 9 proteins that are homologous between the two systems. It is not clear, however, why HpaP proteins, which are essentially suppressants of HrpB2, an essential pathogenicity factor, should also be homologous to any of these families. 2009-05-08 21:19:55 \N 3 1 12 3179 4912 1 425 CL0425 ComplexI-N \N NADH dehydrogenase I, subunit N Bateman A anon This superfamily contains proteins from the families Oxidored_q1 and NADHdeh_related. The Oxidored_q1 family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane. Many members of the NADHdeh_related family are archaeal and bacterial, indicating the evolutionary origins of ComplexI. 2009-05-08 21:20:12 \N 3 6 58 35038 91907 1 426 CL0426 HRDC-like \N HRDC-like superfamily Coggill P anon Superfamily includes HRDC domain from helicases, RNA polymerase II subunit RBP4, RNase D C-terminal domains, and EXOSC10 HRDC domain-like families. 2009-05-08 21:20:30 \N 3 84 86 3700 6906 1 428 CL0428 TolA-TonB-Cterm TolA/TonB C-terminal Mistry J, Finn RD anon This clan contains the TolA and TonB C terminal domains. These domains adopt a structure containing anti-parallel beta sheets with some alpha helical regions. The TolA family contains an extra N-terminal helix over the common fold, and the TonB is an isolated domain that can form different segment-swapped dimers depending on the fragment length [1]. 2009-05-08 21:28:43 \N 3 17 93 2250 7994 1 429 CL0429 UcrQ-like \N UcrQ-like Finn RD anon The families here include those that are the H and S chains of cytochrome c subunit B1. 2009-05-08 21:29:01 \N 3 51 6 247 299 1 430 CL0430 CopD_like Copper resistance protein D Mistry J anon This clan contains families that are related to the CopD family of proteins.\ CopD, together with CopC, performs copper uptake into the cytoplasm. 2009-05-08 21:29:20 \N 3 0 20 2640 4510 1 431 CL0431 PF Profilin-like superfamily Bateman A anon The families here all show the Profilin-like fold, and represent both the Profilin (actin-binding protein) (55770) and the Roadblock/LC7 domain-type (103196) superfamilies. 2009-05-08 21:29:38 \N 3 97 27 918 3101 1 433 CL0433 SPFH \N SPFH superfamily Bateman A anon This superfamily includes the SPFH family as well as the shoulder domain from the Major Vault Protein [1]. 2009-07-06 09:16:10 \N 3 46 70 4547 14789 1 434 CL0434 Sialidase Sialidase superfamily Bateman A agb This superfamily includes sialidases enzymes. Several viruses use sialic acid as a cell surface receptor for host invasion. These viruses then have cell surface neuraminidase enzymes to cleave sialic acid from cell surface proteins allowing them to leave the host cell after replication. This superfamily are composed of six beta-sheets that form a six-fold beta-propeller structure. Many members of this superfamily contain BNR sequence motifs Pfam:PF02012. 2009-08-03 11:40:27 2009-08-03 12:40:27 3 387 191 30583 37487 1 435 CL0435 NPR Nitrogen permease regulator family Coggill P pcc The several protein families of nitrogen permease regulators, that complex together and regulator TORC1 in response to amino acid starvation are likely to have arisen by a gene duplication event. 2009-08-24 12:51:07 2009-08-24 13:51:07 3 0 11 254 791 1 436 CL0436 FliG Flagellar motor switch family Coggill P pcc This superfamily is characterised by a fragmented superhelix consisting of 3-4 helical motifs and connecting helices. Memebers include the direct flagellar motor as well as the MG2+ families. 2009-08-24 13:45:28 2009-08-24 14:45:28 3 22 37 3585 10762 1 437 CL0437 EF-G_C Transcription elongation factor G C-terminal Coggill P pcc This superfamily is characterised by being an alpha-beta 2-Layer Sandwich. It is found in EF2 proteins from both prokaryotes and eukaryotes, as well as in some tetracycline resistance proteins, peptide chain release factors ], and in the C-terminal region of the bacterial hypothetical protein, YigZ. 2009-08-24 15:16:01 2009-08-24 16:16:01 3 77 100 5888 33570 1 438 CL0438 TTHA10132-281 Transcriptional regulators of TTHA1013 andTTHA0281 type Coggill P pcc These families are characterised by sharing a 2-layer, beta(3)-alpha-beta topology. Domains dimerise via the long C-terminal strand with the formation of a single beta-sheet. 2009-08-24 16:09:48 2009-08-24 17:09:48 3 12 22 1503 2871 1 439 CL0439 NusG-like NusG-like Mistry J jm14 Members of this clan are related to the NusG transcription termination factor. The eukaryotic orthologue of NusG is Spt5 [1]. 2009-08-25 13:27:05 2009-08-25 14:27:05 3 24 30 4806 6515 1 441 CL0441 AlbA RNA-DNA binding Alba-like superfamily Coggill P pcc This is a family of DNA and RNA binding proteins found from archaea to humans. 2009-08-27 10:57:56 2009-08-27 11:57:56 3 32 9 406 878 1 442 CL0442 Tubulin-like Tubulin_C; Tubulin, FtsZ and Misato and their C-termini, GTPase Coggill P pcc This superfamily is characterised by being of alpha-beta two-layer sandwich structures. A segmental duplication early on in the primate lineage has led to the creation of up to three tubulin-like domains in the higher primates. The most N-terminal one is the Misato domain, which probably represents the original domain [3]. 2009-08-28 12:07:36 2009-08-28 13:07:36 3 172 73 11198 35977 1 444 CL0444 YNI YcfA-nrd-intein domain Coggill P pcc This superfamily is characterised by a short approx. 55 residue fold that is found in domains associated with flagellum formation and a short region withiin the homing endonucleases. 2009-09-10 10:49:46 2009-09-10 11:49:46 3 3 8 2112 3558 1 445 CL0445 SNARE-fusion SNARE-fusion membrane complex superfamily Coggill P pcc The SNARE-fusion complex families are characterised by being tetrameric coiled-coil structures. 2009-09-10 14:55:14 2009-09-10 15:55:14 3 106 90 533 8433 1 446 CL0446 Bacteriocin_TLN Translocation domain of colicin-like bacteriocins Coggill P pcc A number of different bacterial species produce bacteriocins to kill competitor species, and the colicin class is a group of complex three-domain structures. The receptor-binding domain recognises and binds to specific cell surface receptors on the target cells; the N-terminal translocation domain interacts with cell membrane proteins to gain access to the cell interior; the C-terminal domain specifies a killing activity, such as pore formation or nuclease activity. This superfamily is a collection of the translocating domains. 2009-11-02 18:54:23 2009-11-02 18:54:23 2 20 31 181 342 1 447 CL0447 Hypoth_1 DUF1304/DUF3784 clan Bateman A agb This superfamily includes two related families of short uncharacterised proteins from bacteria. 2009-11-06 14:19:03 2009-11-06 14:19:03 2 0 9 1408 1614 1 448 CL0448 Cargo_bd_muHD Second domain of Mu2 adaptin subunit (ap50) of ap2 adaptor Coggill P pcc Internalisation of diverse transmembrane cargos from the plasma membrane requires a similarly diverse array of specialized adaptors, this domain is the binding domain of these endocytioc adaptors. The muHD binds directly to the cytosolic tail of the Mid2 (for the transmembrane stress sensor protein Mid2) cargo protein and mediates Mid2 internalization [1]. These are all-beta proteins. 2009-11-10 17:50:06 2009-11-10 17:50:06 2 32 35 501 2637 1 449 CL0449 G-PATCH DExH-box splicing factor binding site Coggill P pcc This superfamily is characterised by domains which are related to the G-patch domain in proteins with RNA-binding/splicing activity. This domain affects RNA-splicing by binding to a DExH-box splicing factor. 2009-11-12 13:38:49 2009-11-12 13:38:49 2 0 141 389 3724 1 450 CL0450 FimbA Fimbriae A and Mfa superfamily Xu Q, Finn RD, Coggill P pcc To add the reference from ActaF when it is published, A conserved fold for fimbriae components revealed by the crystal structure of a putative fimbriae assembly protein (BT1062) from Bacteroides thetaiotaomicron at 2.2 Angstrom resolution. This family includes the fimbrillin-A set of fimbrial proteins and the shorter Mfa1 fimbrial set and their asoociated anchor-proteins Mfa2. 2009-11-25 16:52:50 2009-11-25 16:52:50 2 11 10 96 707 1 451 CL0451 FnI-like von Willebrand Factor like superfamily Coggill P pcc This superfamily is characterised by being disulfide-rich, all-beta with an extra C-terminal domain in the fibronectin-like, Fn1, members. 2009-12-18 14:27:48 2009-12-18 14:27:48 2 8 460 161 4895 1 452 CL0452 Tropomyosin-lke Tropomyosin-like superfamily Coggill P pcc 2009-12-18 15:27:21 2009-12-18 15:27:21 2 54 27 554 1620 1 453 CL0453 Apoptosis-Inhib Inhibitors of suppressors of apoptosis Bax/Bak Coggill P pcc This family includes members which are Bax inhibitors. 2009-12-23 16:41:04 2009-12-23 16:41:04 2 0 14 4078 6492 1 454 CL0454 Protease_inhib Protease inhibitor toxin superfamily Coggill P pcc This superfamily includes several families of toxins which seem to act as protease inuibitors. 2010-01-05 09:18:17 2010-01-05 09:18:17 2 215 445 329 6504 1 455 CL0455 MIM-OM_import Mitochondrial membrane transporting or complex assembly Coggill P pcc This superfamily is characterised by being mitochondrial membrane proteins with importing activities. 2010-01-07 14:15:14 2010-01-07 14:15:14 2 1 8 326 612 1 456 CL0456 Golgi_traff Shuttling between cyosol and Golgi, peripheral membrane family Coggill P pcc This superfamily is characterised by comprising multiple-pass membrane proteins that are associated with various aspects of trafficking of molecules into and out of the Golgi apparatus. 2010-01-08 15:41:51 2010-01-08 15:41:51 2 0 13 303 639 1 457 CL0457 4HB_MCP Aspartate chemoreceptor, signal-transduction ligand-binding Ulrich L, Coggill P pcc This superfamily is characterised by a four-helix bundle structure that forms a ubiquitous sensory module in prokaryotic signal-transduction. The 4HB_MCP is always found between two predicted transmembrane helices indicating that it detects only extracellular signals. In many cases the domain is associated with a cytoplasmic HAMP domain suggesting that most proteins carrying the bundle might share the mechanism of transmembrane signalling which is well-characterised in E coli chemoreceptors. 2010-01-11 14:55:11 2010-01-11 14:55:11 2 16 242 1922 9476 1 458 CL0458 IIaaRS-ABD Class II aaRS Anticodon-binding domain-like Coggill P pcc This superfamily is characterised by including the anticodon-binding domain of Class II aaRS and the Brix domain. 2010-02-01 17:01:44 2010-02-01 17:01:44 2 142 103 4984 18087 1 459 CL0459 BRCT-like BRCT like Coggill P pcc The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. 2010-02-01 17:02:12 2010-02-01 17:02:12 2 112 343 4755 10126 1 461 CL0461 Metallothionein Metal-bound fold, usually iron, metallothionein superfamily Coggill P pcc This superfamily contains related families of metallothioneins, prokaryotes and eukaryotes. 2010-02-03 17:36:45 2010-02-03 17:36:45 2 24 9 332 753 1 462 CL0462 TPA-repeat Transcription elongation factor C-terminal nonapeptide repeat Coggill P pcc This superfamily includes several families of this nonapeptide repeat, that carries a characteristic TPA motif. The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. 2010-02-04 11:33:22 2010-02-04 11:33:22 2 10 36 254 580 1 464 CL0464 5_3_exonuc_C 5'-3'-exonuclease C-terminal sub-region Coggill P pcc This superfamily includes C-terminal domains from a number of DNA-processing enzymes including T4 RNase H, 5' to 3' exonuclease domain of DNA polymerase Taq, T5 5'-exonuclease, Flap endonuclease-1 (Fen-1 nuclease), and other eukaryotic endonucleases. 2010-02-04 17:13:51 2010-02-04 17:13:51 2 38 66 5048 8326 1 465 CL0465 Ank Ankyrin repeat superfamily Bateman A agb The ankyrin repeat is a short sequence region that is about 30-34 amino-acids in length. Multiple copies of the repeat composed of two beta strands and two alpha helices combine to form long arrays. In general these repeats are involved in protein-protein interactions. This superfamily also includes some families that are arrays of several repeats. 2010-02-18 14:19:32 2010-02-18 14:19:32 2 168 6571 3000 134027 1 466 CL0466 PDZ-like PDZ domain-like peptide-binding superfamily Coggill P pcc This superfamily comprises families of PDZ domains, which are peptide binding sites. 2010-02-23 10:29:00 2010-02-23 10:29:00 2 615 901 4966 46701 1 468 CL0468 Malectin-like Malectin-like superfamily Coggill P pcc This superfamily includes several families of Malectin-type domains. 2010-02-24 16:54:44 2010-02-24 16:54:44 2 3 227 339 1899 1 469 CL0469 l-integrase_N lambda integrase N-terminal domain Coggill P pcc This superfamily is characterised by families having 4-5 helices in a bundle of two orthogonally packed alpha-hairpins that are involved in interactions with DNA and proteins. 2010-02-24 17:58:15 2010-02-24 17:58:15 2 29 85 4649 20176 1 470 CL0470 UMP_1 Uncharacterised membrane protein superfamily (UMP1) Bateman A agb This superfamily includes two related families of membrane proteins. One of these is known as DUF1212. As yet there is no known function for any of these proteins. 2010-02-25 14:21:20 2010-02-25 14:21:20 2 0 13 2781 7529 1 471 CL0471 DUF362 DUF362-like superfamily Bateman A agb This superfamily incldues two uncharacterised families: DUF362 and DUF2088. These families contain a number of potential conserved metal binding residues and may be metalloenzymes. 2010-03-01 12:56:12 2010-03-01 12:56:12 2 2 26 685 1641 1 472 CL0472 Peptidase_U Peptidase clan U Bateman A agb This superfamily includes proteins classed as clan U by the MEROPS database. The proteins are membrane bound peptidases. The clan also includes Pfam:PF10086 which is a related family of uncharacterised proteins. 2010-03-11 15:27:58 2010-03-11 15:27:58 2 0 61 3770 14791 1 474 CL0474 SICA_like SICA alpha/beta like extracellular domains Finn RD rdf The SICA (schizont-infected cell agglutination) proteins of P. knowlesi, one of the variant antigen gene families, are associated with parasitic virulence. These proteins are comprised of three domains (alpha, beta and C-terminal). The two extracelullar domains, termed SICA alpha and beta, occur with varying copy number. The alpha domain, although it appears to be related to the beta domain, contains few cysteine residues and is found between 1 and 2 times, compared with the beta domain that occurs between 1 and 10 times (when present) [1]. 2010-03-17 15:36:20 2010-03-17 15:36:20 2 0 38 2 575 1 475 CL0475 Cyclophil-like Cyclophilin-like superfamily Coggill P pcc This superfamily is characterised by being the core, closed, beta barrel of cyclophilin with complex topology, with some family members lacking the N-terminal strand of cyclophilin but retaining a closed barrel. 2010-03-22 18:14:27 2010-03-22 18:14:27 2 310 183 4947 18056 1 476 CL0476 tRNA-IECD_N tRNA-intron endonuclease catalytic domain-like N-term Coggill P pcc This superfamily contains families of the N-termini of tRNA-splicing endonucleases. 2010-04-13 08:59:45 2010-04-13 09:59:45 2 33 16 391 617 1 477 CL0477 TRD DNA methylase specificity domain Bateman A agb This superfamily includes domains that are found associated with restriction modification methylase enzymes. These domains are involved in defining the DNA substrate specificity. 2010-05-04 15:55:02 2010-05-04 16:55:02 2 29 59 3119 12266 1 478 CL0478 ATPase_I_AtpR F_N_ATPase_a; F-type and N-type ATPase, I/AtpR subunit Galperin M, Finn RD rdf These two subunits from the F-type and N-type ATPases have been demonstrated to from functionally distinct components of these two different ATPases[1]. 2010-05-19 12:41:39 2010-05-19 13:41:39 2 0 2 1835 2145 1 479 CL0479 PLD Phospholipase D superfamily Bateman A pcc This superfamily includes proteins that are related to Phospholiase D. 2010-06-17 09:09:37 2010-06-17 10:09:37 2 68 157 5070 25964 1 480 CL0480 Ribosome_L1 Ribosomal L1 protein superfamily Coggill P pcc This is a superfamily of Ribosomal L1 protein families. 2010-06-29 09:40:59 2010-06-29 10:40:59 2 95 18 4960 6004 1 481 CL0481 HUH His-hydrophobic-His tranposase Y1,Y2 superfamily Coggill P pcc This superfamily consists of families of transposases that use tyrosine as nucleophiles and contain a conserved HUH motif where U is a non-conserved hydrophobic residue. Particular examples are the Y1 and Y2 transposases. Y1 transposases come from the IS608 group and the Y2 from the IS91 group. These transposases are nucleases involved in initiating rolling-circle plasmid replication, in initiating replication of certain single-strand (ss) DNA viruses (Rep proteins), and in initiation of conjugative plasmid transfer (relaxases). The IS91 transposase contains two of the three conserved protein sequence motifs identified in RCR proteins, a His-hydrophobic-His (HUH) motif required for metal ion binding, and a YxxxY motif containing two conserved catalytic tyrosine residues. The HUH motif participates in coordinating divalent metal ions essential for catalysis. This clan is also known as the Transposase IS200-like whose members contain extra N-terminal hairpin and C-terminal helix, both or which are involved in dimerisation; there can be helix-swapping in the dimer. 2010-07-13 12:33:47 2010-07-13 13:33:47 2 40 49 2384 6860 1 482 CL0482 Prolamin Di-sulfide bond pairrings Coggill P pcc This superfamily is characterised by families with only slightly differing disulfide-bonding patterning. 2010-07-21 12:41:48 2010-07-21 13:41:48 2 54 35 364 5909 1 483 CL0483 PreATP-grasp Probable substrate-binding preceding ATP-grasp domain Coggill P pcc This superfamily is characterised by bein g the copies of the domain that precedes the ATP-grasp domain common to all superfamily members, and it can contain a substrate-binding function. 2010-07-21 12:47:06 2010-07-21 13:47:06 2 32 18 1652 2776 1 484 CL0484 Peroxisome Peroxisome-like domain Coggill P pcc This superfamily combines peroxisome families and a family found at the C-terminus of metazoan phosphoribosyltransferases. 2010-07-21 12:49:56 2010-07-21 13:49:56 2 0 76 283 1175 1 486 CL0486 Fer2 2Fe-2S iron-sulfur cluster binding domain Bateman A agb The 2Fe-2S ferredoxin family have a general core structure consisting of beta(2)-alpha-beta(2) which abeta-grasp type fold. The domani is around one hundred amino acids with four conserved cysteine residues to which the 2Fe-2S cluster is ligated. 2010-07-24 11:00:26 2010-07-24 12:00:26 2 362 461 5503 37176 1 487 CL0487 FKBP FKBP-like superfamily Bateman A agb This superfamily includes the FKBP domain which catalyses the peptidyl-prolyl cis-trans isomerisation reaction. The superfamily also includes the C-terminal domain of GreA and the c-terminal dmoain of 3-mercaptopyruvate sulfurtransferase. 2010-07-25 11:45:24 2010-07-25 12:45:24 2 104 100 4781 19148 1 488 CL0488 FixH-like FixH-like Coggill P pcc This is a superfamily of FixH-like proteins of unknown function. 2010-07-28 13:48:41 2010-07-28 14:48:41 2 0 16 1183 1566 1 489 CL0489 AFP_III-like Type III antifreeze and spore coat polysaccharide Coggill P pcc This is a superfamily of antifreeze proteins, flagellar FlgA, pilus and spore coat polysaccharide families. 2010-07-29 13:32:26 2010-07-29 14:32:26 2 26 28 2940 6862 1 490 CL0490 PepSY_TM-like PepSY_TM-like Coggill P pcc This is a superfamily of PepSY-transmembrane-like domain families. 2010-08-04 15:51:29 2010-08-04 16:51:29 2 0 70 1771 6672 1 491 CL0491 LYR-like Complex1_LYR-like superfamily Coggill P pcc This is a superfamily of sequences with the characteristic LYR, or similar, motif. In some yeasts this family plays a role in Fe-S cluster biogenesis in mitochondria. 2010-08-19 16:20:34 2010-08-19 17:20:34 2 0 33 323 2362 1 492 CL0492 S4 S4 domain superfamily Bateman A agb This superfamily includes diverse proteins related to the ribosomal S4 protein. 2010-08-27 15:46:09 2010-08-27 16:46:09 2 226 44 10099 38684 1 493 CL0493 PTS_EIIC Phosphotransferase system, EIIC superfamily Bateman A agb 2010-09-02 14:47:12 2010-09-02 15:47:12 2 4 39 2905 27857 1 494 CL0494 DnaA_N DNA-A N-terminal domain-like superfamily Bateman A agb This superfamily includes the N-terminal domain from DnaA and the N-terminal domain from DNA polymerase III alpha subunit. 2010-09-17 12:39:03 2010-09-17 13:39:03 2 2 33 3617 4495 1 496 CL0496 Tad-like Flp pilus-biogenesis Tad-like superfamily Coggill P pcc This is a superfamily of Tad-like protein families all involved in Flp pilus biogenesis. 2010-09-28 15:11:00 2010-09-28 16:11:00 2 0 22 1048 3415 1 497 CL0497 GST_C Glutathione S-transferase, C-terminal domain Bateman A agb This clan represents the C-terminal domain of Glutathione S-transferase. 2010-10-04 10:28:59 2010-10-04 11:28:59 2 839 219 3208 23337 1 498 CL0498 Nribosyltransf N-(deoxy)ribosyltransferase-like superfamily Bateman A agb This superfamily includes the N-deoxyribosyltransferase and ADP-ribosyl cyclase-like families as well as the family that includes the hypothetical protein PA1492 for which the structure is known. PA1942 has a very similar putative active site and is predicted by SCOP to have a deoxyribosyltransferase activity [1]. 2010-10-07 15:46:04 2010-10-07 16:46:04 2 193 14 1181 1635 1 499 CL0499 O-antige_ligase O-antigen-ligase like superfamily Coggill P pcc This superfamily is characterised by a group of bacterial proteins that are membrane proteins, and include O-antigen ligases and putative hydrogen carbonate transporters [1]. 2010-10-07 17:04:39 2010-10-07 18:04:39 2 0 73 3419 7279 1 500 CL0500 Glycine-zipper Glycine-zipper TM superfamily Coggill P pcc This superfamily is characterised by having long glycine-zipper motifs, characteristically GxxxGxxxG [1], often with several contiguous copies. The x following the G is frequently found to be an alanine. This sequence region can be found in both symmetric and non-symmetrical membrane structures, and is relatively common. The fact that such a large fraction of the relatively few known membrane channel structures use the glycine zipper suggests that the sequence motif plays a special role in forming these homo-oligomeric bundles [1]. 2010-10-14 10:24:07 2010-10-14 11:24:07 2 0 44 2301 7226 1 501 CL0501 Gram-pos_anchor Cell wall anchoring Coggill P pcc This superfamily is characterised by families involved in the various mechanisms by which listerial proteins are attached at the bacterial surface and their many functions, including peptidoglycan metabolism, protein processing, adhesion to host cells, and invasion of host tissues. 2010-10-26 16:08:05 2010-10-26 17:08:05 2 20 882 1222 13906 1 502 CL0502 STAS STAS domain superfamily Bateman A agb This superfamily includes proteins that have a STAS domain. 2010-10-28 16:36:18 2010-10-28 17:36:18 2 44 161 3695 13648 1 503 CL0503 SOR Superoxide reductase-like superfamily Bateman A agb According to SCOP this superfamily includes the superoxide reductase domain as well as the C-terminal domain of desulfoferrodoxin. 2010-11-08 17:40:12 2010-11-08 17:40:12 2 55 13 767 1038 1 504 CL0504 Phage_barrel Phage tail beta-barrel superfamily Bateman A agb This superfamily consists of a variety of bacteriophage protein families. Structurally members of this family contain a beta barrel. 2010-11-22 16:23:10 2010-11-22 16:23:10 2 18 106 2048 9306 1 505 CL0505 Pentapeptide Pentapeptide repeat Bateman A agb This clan includes proteins that form a four sided parallel beta helix. They are generally compoased of pentapeptide repeat motifs. 2010-11-26 10:06:54 2010-11-26 10:06:54 2 43 484 2438 18381 1 506 CL0506 Succ_CoA_synth Succinyl-CoA synthetase flavodoxin domain superfamily Bateman A agb This superfamily includes the flavodoxin like domain of succinyl-CoA synthetase and other related enzymes. 2010-12-02 12:48:20 2010-12-02 12:48:20 2 70 59 3740 11176 1 507 CL0507 Fungal_trans Fungal specific transcription factor domain Bateman A agb This putative domain is found in fungal transcription factors. 2010-12-06 17:47:20 2010-12-06 17:47:20 2 0 306 239 14641 1 508 CL0508 YkuD L,D-transpeptidase catalytic domain Bateman A agb This superfamily incldues the L,D-transpeptidase catalytic domain. 2010-12-08 12:28:21 2010-12-08 12:28:21 2 8 84 3166 11690 1 509 CL0509 RBP11-like RBP11-like subunits of RNA polymerase Coggill P pcc This superfamily is characterised by all members carrying RNA Polymerase alpha subunit, chain A, domain 2. The member families form homo- and hetero-dimers. 2010-12-09 14:07:20 2010-12-09 14:07:20 2 280 32 6187 8189 1 511 CL0511 Retroviral_zf Retrovirus zinc finger-like domains Coggill P pcc This superfamily has zinc-finger domains from GAG proteins, HIV nucleocapsids, nucleocapsid proteins from mason-pfizer monkey virus, and from other nucleic-acid binding proteins. Sometimes the domains appears in duplicate. 2011-01-17 16:09:42 2011-01-17 16:09:42 2 46 796 752 52586 1 512 CL0512 CRAL_TRIO CRAL-TRIO domain superfamily Bateman A agb This superfamily includes the CRAL-TRIO domain. 2011-01-21 18:16:17 2011-01-21 18:16:17 2 28 141 377 5546 1 513 CL0513 LCCL-domain LCCL-domain like Coggill P pcc This superfamily is characterised by an unusual fold exemplified in PDB:1jbi. Vertebrate members are the cysteine-rich secretory protein LCCL domain-containing proteins including Colchin. The fungal members are found to be necessary for histone de-acetylation. 2011-01-28 14:34:24 2011-01-28 14:34:24 2 1 51 235 1023 1 515 CL0515 LTXXQ-like LTXXQ-like superfamily Coggill P pcc This is a superfamily of members all carrying the repeated LTxxQ paired sequence-motif region. 2011-02-10 14:04:19 2011-02-10 14:04:19 2 36 8 1376 2943 1 516 CL0516 ISP-domain Rieske-like iron-sulphur domain Coggill P pcc This superfamily is characterised by Rieske Iron-sulfur families of the [2Fe-2S] type including NADH-nitrite reductase small subunit NirD proteins. This domain has an all-beta rubredoxin-like fold. 2011-02-14 16:52:53 2011-02-14 16:52:53 2 328 118 3882 14974 1 517 CL0517 MBOAT-like Membrane-bound O-acyltransferase,MBOAT,superfamily Coggill P pcc This is a large superfamily of different enzymes. All biochemically characterised members of the superfamily encode enzymes that transfer organic acids, typically fatty acids, onto hydroxyl groups of membrane-embedded targets. 2011-02-16 11:16:11 2011-02-16 11:16:11 2 0 40 2323 5182 1 520 CL0520 Keratin_assoc Keratin_associated superfamily Coggill P pcc Families in this clan are cysteine-rich and are from proteins associated with Keratin. 2011-03-09 11:23:29 2011-03-09 11:23:29 2 0 33 65 2969 1 521 CL0521 GOLD-like Sec14-like superfamily of golgi trafficking Coggill P pcc This superfamily is characterised by proteins with a sandwich structure of 8 strands in 2 sheets as a jelly-roll. The structure has similarity to the Nucleoplasmin-like/VP fold. The members are all involved in golgi trafficking. 2011-03-21 16:36:53 2011-03-21 16:36:53 2 5 40 359 2519 1 522 CL0522 YbjQ-like YbjQ-like superfamily Coggill P pcc This superfamily is characterised by chains that form pentamers. Family structures are beta/alpha-propellers with additional short beta-strands that promote oligomeric assembly. Family members appear to be heavy-metal binding proteins. 2011-05-13 14:50:08 2011-05-13 15:50:08 2 25 7 1982 2593 1 523 CL0523 GAG-polyprotein LTR-copia-type polyprtien segments Coggill P pcc This superfamily is characterised by family members that ar derived from retrotransposons of the copia-type. 2011-05-13 16:31:38 2011-05-13 17:31:38 2 0 851 262 10309 1 524 CL0524 MPT63-MPB63 Antigen MPT63/MPB63 (immunoprotective extracellular Coggill P pcc This superfamily is characterised by extracellular proteins with immuno-protectivce receptor domains on their surfaces resembling Ig domains. These are putative lipoproteins. 2011-05-23 13:54:12 2011-05-23 14:54:12 2 5 23 844 1193 1 525 CL0525 pap2 Acid phosphatase/Vanadium-dependent haloperoxidase Coggill P pcc The PAP2 superfamily is characterised by being mult-helical, with the core consisting of a 5-helical bundle. Normally the family will bind cofactor at the beginning of the third helix. The superfamily includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2. The family also includes a variety of haloperoxidases [1,2] that function by oxidising halides in the presence of hydrogen peroxide to form the corresponding hypohalous acids. 2011-06-02 14:59:14 2011-06-02 15:59:14 2 44 106 4454 15593 1 526 CL0526 SUKH SUKH superfamily Zhang D, Iyer LM, Aravind L la_psag SUKH superfamily unites a diverse group of proteins including Smi1/Knr4, PGs2, Fbxo3, Skip16, Syd, herpesviral US22, IRS1 and TRS1, and their bacterial homologs [1]. 2011-07-18 20:15:40 2011-07-18 21:15:40 2 8 49 1725 4317 1 527 CL0527 Sm-like Sm (Small RNA binding protein domain) Anantharaman V la_psag Conserved domain of core RNA-binding proteins of mRNA splicing and rRNA processing complexes, with a SH3-like Beta-barrel 2011-09-10 05:20:17 2011-09-10 06:20:17 2 515 85 2737 9033 1 528 CL0528 DUF1214 DUF1214-like Finn RD rdf Members of this clan usually appear in pairs on the same sequence. In cases where there is only a single example of the domain, there is often a second protein containing a single domain that is located sequentially along the chromosome. The crystal structure of pdb:2P3Y reveals that Pfam:PF06863 and Pfam:PF06742 come together to from a deep, positively charged pocket or cleft. Structural comparisons reveal that this may this pocket that may bind some form of carbohydrate. Conserved residues are predominantly found on the surface of the positively charged cleft [Jayaraman et al. personal communication]. 2011-09-23 21:21:03 2011-09-23 22:21:03 2 9 15 656 2556 1 529 CL0529 FMN-dep-NRtase FMN-dependent nitroreductase-like Coggill P pcc This superfamily is involved in the reduction of nitrogen containing compounds. Members of this family utilise FMN as a cofactor and are often found to be homodimers. Possible characteristics include Oxygen-insensitive NAD(P)H nitroreductase (FMN-dependent nitroreductase) (Dihydropteridine reductase) and NADH dehydrogenase. A number of the proteins are described as oxidoreductases. They are primarily found in bacterial lineages though a number of eukaryotic homologues have been identified: Caenorhabditis elegans, Drosophila melanogaster, mouse and human. These protein are not found in photosynthetic eukaryotes; sequences containing this entry in photosynthetic organisms are possible false positives. 2011-11-17 15:49:14 2011-11-17 15:49:14 1 206 111 4539 16601 1 530 CL0530 DNase_I-like DNase I-like Coggill P pcc This superfamily is characterised by members with an alpha-beta four-layer sandwich structure. It includes DNase I-like families of DNA-repair enzyme exonuclease III, DNA repair endonucleases, deoxyribonuclease I, endonuclease domain of retro-transposons. It also includes families of inositol polyphosphatases and shingomyelins-phosphodiesterases. 2011-11-29 11:19:36 2011-11-29 11:19:36 1 148 496 4823 22404 1 531 CL0531 AMP-binding_C AMP-binding enzyme C-terminal domain superfamily Eberhardt R re3 This superfamily contains domains found at the C-terminus of AMP-binding enzyme. 2011-12-02 13:00:41 2011-12-02 13:00:41 1 111 1767 5099 46854 1 532 CL0532 LIG Lipid-binding Ig-like superfamily Bateman A agb This superfamily includes a number of lipid binding families that have a similar topology to the immunoglobulin fold. The family includes The ML domain Pfam:PF02221. It includes the uncharacterised family Pfam:PF06477. 2011-12-21 10:45:29 2011-12-21 10:45:29 1 68 32 336 2678 1 533 CL0533 PRTase-like PRPP synthetase-associated protein 1 Coggill P pcc This superfamily of phosphoribosyl-transferases (PRTases) and phosphoribosyl-pyrophosphate synthetase-like protein families is characterised by a core fold of three layers, a/b/a with a mixed beta-sheet of six strands. In one of the families consists of two domains of this fold. 2012-01-09 11:27:00 2012-01-09 11:27:00 1 392 122 5229 42734 1 534 CL0534 YjgF-like YjgF-like superfamily Coggill P pcc This superfamily is characterised by proteins that form trimers with three closely packed beta-sheets. Some member proteins are chorismate mutases, others are endoribonucleases active on single-stranded mRNA. It is the highly conserved YjgF/YER057c/UK114 protein superfamily. Homologues of this protein occur in eubacteria, archaea, and eukaryotes. Proteins are functionally diverse and are involved in a variety of enzymatic and non-enzymatic functions. The high sequence and structural similarity between members of this protein superfamily offer an example of minimalistic changes leading to functional diversification. This feature is best exemplified by the three close homologues of YjgF proteins in mammals (human, rat, and goat) with sequence identity better than 85%. These homologues perform different functions, including tumour antigen activity in the goat homologue, translation inhibition in the human and rat homologues (hp14.5 and rp14.5), endoribonuclease activity in rp14.5, calpain activation in the bovine homologue, molecular chaperone activity in DUK114, and involvement in the regulation of purine and removal of toxic metabolites in YjgF7, and involvement in isoleucine biosynthetic pathways (YjgF, YER057c, Ibm1). In addition, members of this protein family have also been shown to regulate mitochondrial maintenance (Ibm1) in yeast. Proteins from the YjgF family in plants are involved in photosynthesis and chromoplastogenesis (CHRD). 2012-01-18 11:20:47 2012-01-18 11:20:47 1 251 69 4391 13028 1 535 CL0535 CBM Carbohydrate-binding superfamily Coggill P pcc This clan includes families of two different carbohydrate binding modules, chitin- and cellulase-binders. 2012-01-23 14:31:27 2012-01-23 14:31:27 1 41 221 1097 3033 1 536 CL0536 HEXAPEP Hexapeptide repeat superfamily Coggill P pcc This superfamily is characterised by superhelical turns made up of three short strands duplicated many times; the sequence hexapeptide repeats correspond to individual strands. These hexapeptide repeat regions occur in a range of transferase enzymes. 2012-01-23 14:56:06 2012-01-23 14:56:06 1 380 348 5071 79539 1 537 CL0537 CCCH_zf CCCH-zinc finger Coggill P pcc This superfamily is characterised by families of proteins with several Cys3His zinc-binding domains in tandem. 2012-01-26 16:19:39 2012-01-26 16:19:39 1 17 263 457 8803 1 538 CL0538 DRMIP-like MAPK-interacting Drmip-like superfamily Coggill P pcc This superfamily contains members that are similar to the MAPK-interacting DRMIP-like family. 2012-02-01 16:26:27 2012-02-01 16:26:27 1 0 20 175 786 1 539 CL0539 RNase_III RNase III domain-like superfamily Coggill P pcc This superfamily is characterised structurally by having a core of 5 helices where one of the helices is surrounded by the others. Many members have the endonuclease catalytic domain. 2012-02-06 18:13:43 2012-02-06 18:13:43 1 62 100 4733 7980 1 540 CL0540 GCP Gamma-tubulin complex superfamily Coggill P pcc This superfamily carries families of proteins all derived via gene-duplication events that act together in the gamma-tubulin complex. This complex is required for microtubule nucleation at the centrosome. 2012-02-09 14:22:48 2012-02-09 14:22:48 1 1 23 316 1604 1 541 CL0541 SH2-like SH2, phosphotyrosine-recognition domain superfamily Coggill P pcc This superfamily is characterised by proteins with the SH2-like fold. The proesence of this domain guides signal-transduction towards the phosphorylated tyrosine residues on its interacting protein-partner. 2012-02-21 16:19:05 2012-02-21 16:19:05 1 387 421 453 9393 1 542 CL0542 RAS_GEF_N Ras guanyl-nucleotide exchange factor activity N-term Coggill P pcc This is the more N-terminal domain of the RAS-GEF superfamily. 2012-02-29 11:57:43 2012-02-29 11:57:43 1 14 157 277 2682 1 543 CL0543 Viral_gly_cn_dm Viral glycoprotein central and dimerisation domains Eberhardt R re3 Flavivirus and alpha virus glycoprotein E/E1 consists of three domains. A dimerisation domain, a central domain and an immunoglobulin-like domain. The dimerisation and central domains are intertwined [1-3]. 2012-03-23 15:37:00 2012-03-23 15:37:00 1 93 37 200 12815 1 544 CL0544 AcylCoA_ox_dh_N Acyl-coenzyme A oxidase/dehydrogenase N-terminal Eberhardt R re3 Acyl-CoA dehydrogenases and acyl-coenzyme A oxidases consist of three domains. An N-terminal all alpha domain, a beta-barrel middle domain and a C-terminal catalytic domain [1-2]. 2012-03-27 08:17:33 2012-03-27 09:17:33 1 223 112 3478 26792 1 545 CL0545 APCOP-app_sub Clathrin (AP) and COPI appendage platform subdomain Coggill P pcc This superfamily is characterised by subdomains from the clathrin and coatomer appendages. The superfamily possesses a single protein/protein interaction site that in yeast binds to the ARFGAP Glo3p, and in mammalian gamma-COP binds to a Glo3p orthologue, ARFGAP2 [1]. 2012-04-18 13:17:34 2012-04-18 14:17:34 1 0 11 303 446 1 546 CL0546 Hexosaminidase beta-N-acetylhexosaminidase-like domain Coggill P pcc This superfamily is characterised by a mixed beta sheet with connection over the free side of the sheet. The fold is like a zincin fold lacking the catalytic centre. 2012-04-30 14:53:24 2012-04-30 15:53:24 1 124 150 1212 3278 1 547 CL0547 GF_recep_C-rich Growth factor receptor Cys-rich Eberhardt R re3 The cysteine-rich regions of growth factor receptor tyrosine kinases consist of eight disulphide-linked modules [1]. 2012-05-01 09:19:56 2012-05-01 10:19:56 1 49 88 140 2125 1 548 CL0548 IHF-likeDNA-bdg IHF-like DNA-binding protein supewrfamily Coggill P pcc This superfamily is characterised by being a dimer of identical subunits of a core of four helices in a bundle, partly opened, capped with a beta-sheet. All members appear to be prokaryotic DNA-binding domains. 2012-05-01 16:00:04 2012-05-01 17:00:04 1 60 13 4459 12213 1 549 CL0549 NicO_HupE_DsbD NicO/HupE/DsdD superfamily Eberhardt R re3 This clan contains the nickel transpot family NicO [1-2] and the HupE/UreJ proteins, which may be involved in nickel binding. NicO and HupE contain a conserved GxxHxxxDH motif, which may bind to nickel. 2012-05-31 10:50:16 2012-05-31 11:50:16 1 0 55 3953 10451 1 550 CL0550 SRCR SRCR-like Eberhardt R re3 2012-07-11 13:20:22 2012-07-11 14:20:22 1 19 402 139 9451 1 551 CL0551 BCLiA Bcl-2 inhibitors of programmed cell death Coggill P pcc This superfamily is characterised by families of proteins that inhibit apoptosis, They are regulated by all BH3-only proteins to promote apoptosis. 2012-08-31 14:55:14 2012-08-31 15:55:14 1 151 28 348 1402 1 552 CL0552 Hect Hect, E3 ligase catalytic domain Coggill P pcc This superfamily is characterised by fmailies with E3-ligase catalytic acitvity. The fold consists of two alpha+beta domains; where the N-terminal domain is an array of helices and beta-hairpins; the C-terminal domain is an a/b sandwich with one left-handed beta-alpha(n)-beta unit. 2012-09-07 15:42:50 2012-09-07 16:42:50 1 23 249 342 4684 1 553 CL0553 HBMR Helical backbone metal receptor superfamily Coggill P pcc This superfamily is characterised by a long alpha helical insertion in the interdomain linker in the Chelatase-like fold. Representative families include the periplasmic ferric siderophore binding proteins, the TroA-like nitrogenase iron-molybdenum proteins and the putative iron(III) transporter family, TM0189-like, or periplasmic-binding family. 2012-09-12 10:08:39 2012-09-12 11:08:39 1 190 51 4828 19772 1