990 resultados para PROTEIN FAMILIES
Resumo:
We propose a new characterization of protein structure based on the natural tetrahedral geometry of the β carbon and a new geometric measure of structural similarity, called visible volume. In our model, the side-chains are replaced by an ideal tetrahedron, the orientation of which is fixed with respect to the backbone and corresponds to the preferred rotamer directions. Visible volume is a measure of the non-occluded empty space surrounding each residue position after the side-chains have been removed. It is a robust, parameter-free, locally-computed quantity that accounts for many of the spatial constraints that are of relevance to the corresponding position in the native structure. When computing visible volume, we ignore the nature of both the residue observed at each site and the ones surrounding it. We focus instead on the space that, together, these residues could occupy. By doing so, we are able to quantify a new kind of invariance beyond the apparent variations in protein families, namely, the conservation of the physical space available at structurally equivalent positions for side-chain packing. Corresponding positions in native structures are likely to be of interest in protein structure prediction, protein design, and homology modeling. Visible volume is related to the degree of exposure of a residue position and to the actual rotamers in native proteins. In this article, we discuss the properties of this new measure, namely, its robustness with respect to both crystallographic uncertainties and naturally occurring variations in atomic coordinates, and the remarkable fact that it is essentially independent of the choice of the parameters used in calculating it. We also show how visible volume can be used to align protein structures, to identify structurally equivalent positions that are conserved in a family of proteins, and to single out positions in a protein that are likely to be of biological interest. These properties qualify visible volume as a powerful tool in a variety of applications, from the detailed analysis of protein structure to homology modeling, protein structural alignment, and the definition of better scoring functions for threading purposes.
Resumo:
Tese de doutoramento, Informática (Bioinformática), Universidade de Lisboa, Faculdade de Ciências, 2015
Resumo:
We describe AMIN (Amidase N-terminal domain), a novel protein domain found specifically in bacterial periplasmic proteins. AMIN domains are widely distributed among peptidoglycan hydrolases and transporter protein families. Based on experimental data, contextual information and phyletic profiles, we suggest that AMIN domains mediate the targeting of periplasmic or extracellular proteins to specific regions of the bacterial envelope.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Sirtuins and hypoxia-inducible transcription factors (HIF) have well-established roles in regulating cellular responses to metabolic and oxidative stress. Recent reports have linked these two protein families by demonstrating that sirtuins can regulate the activity of HIF-1 and HIF-2. Here we investigated the role of SIRT1, a NAD+-dependent deacetylase, in the regulation of HIF-1 activity in hypoxic conditions. Our results show that in hepatocellular carcinoma (HCC) cell lines, hypoxia did not alter SIRT1 mRNA or protein expression, whereas it predictably led to the accumulation of HIF-1α and the up-regulation of its target genes. In hypoxic models in vitro and in in vivo models of systemic hypoxia and xenograft tumor growth, knockdown of SIRT1 protein with shRNA or inhibition of its activity with small molecule inhibitors impaired the accumulation of HIF-1α protein and the transcriptional increase of its target genes. In addition, endogenous SIRT1 and HIF-1α proteins co-immunoprecipitated and loss of SIRT1 activity led to a hyperacetylation of HIF-1α. Taken together, our data suggest that HIF-1α and SIRT1 proteins interact in HCC cells and that HIF-1α is a target of SIRT1 deacetylase activity. Moreover, SIRT1 is necessary for HIF-1α protein accumulation and activation of HIF-1 target genes under hypoxic conditions.
Resumo:
The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.
Resumo:
The iProClass database is an integrated resource that provides comprehensive family relationships and structural and functional features of proteins, with rich links to various databases. It is extended from ProClass, a protein family database that integrates PIR superfamilies and PROSITE motifs. The iProClass currently consists of more than 200 000 non-redundant PIR and SWISS-PROT proteins organized with more than 28 000 superfamilies, 2600 domains, 1300 motifs, 280 post-translational modification sites and links to more than 30 databases of protein families, structures, functions, genes, genomes, literature and taxonomy. Protein and family summary reports provide rich annotations, including membership information with length, taxonomy and keyword statistics, full family relationships, comprehensive enzyme and PDB cross-references and graphical feature display. The database facilitates classification-driven annotation for protein sequence databases and complete genomes, and supports structural and functional genomic research. The iProClass is implemented in Oracle 8i object-relational system and available for sequence search and report retrieval at http://pir.georgetow n.edu/iproclass/.
Resumo:
TIGRFAMs is a collection of protein families featuring curated multiple sequence alignments, hidden Markov models and associated information designed to support the automated functional identification of proteins by sequence homology. We introduce the term ‘equivalog’ to describe members of a set of homologous proteins that are conserved with respect to function since their last common ancestor. Related proteins are grouped into equivalog families where possible, and otherwise into protein families with other hierarchically defined homology types. TIGRFAMs currently contains over 800 protein families, available for searching or downloading at www.tigr.org/TIGRFAMs. Classification by equivalog family, where achievable, complements classification by orthology, superfamily, domain or motif. It provides the information best suited for automatic assignment of specific functions to proteins from large-scale genome sequencing projects.
Resumo:
We identified a novel human homologue of the rat FE65 gene, hFE65L, by screening the cytoplasmic domain of beta-amyloid precursor protein (beta PP) with the "interaction trap." The cytoplasmic domains of the beta PP homologues, APLP1 and APLP2 (amyloid precursor-like proteins), were also tested for interaction with hFE65L. APLP2, but not APLP1, was found to interact with hFE65L. We confirmed these interactions in vivo by successfully coimmunoprecipatating endogenous beta PP and APLP2 from mammalian cells overexpressing a hemagglutinin-tagged fusion of the C-terminal region of hFE65L. We report the existence of a human FE65 gene family and evidence supporting specific interactions between members of the beta PP and FE65 protein families. Sequence analysis of the FE65 human gene family reveals the presence of two phosphotyrosine interaction (PI) domains. Our data show that a single PI domain is sufficient for binding of hFE65L to the cytoplasmic domain of beta PP and APLP2. The PI domain of the protein, Shc, is known to interact with the NPXYp motif found in the cytoplasmic domain of a number of different growth factor receptors. Thus, it is likely that the PI domains present in the C-terminal moiety of the hFE65L protein bind the NPXY motif located in the cytoplasmic domain of beta PP and APLP2.
Resumo:
In the evolution of eukaryotic genes, introns are believed to have played a major role in increasing the probability of favorable duplication events, chance recombinations, and exon shuffling resulting in functional hybrid proteins. As a rule, prokaryotic genes lack introns, and the examples of prokaryotic introns described do not seem to have contributed to gene evolution by exon shuffling. Still, certain protein families in modern bacteria evolve rapidly by recombination of genes, duplication of functional domains, and as shown for protein PAB of the anaerobic bacterial species Peptostreptococcus magnus, by the shuffling of an albumin-binding protein module from group C and G streptococci. Characterization of a protein PAB-related gene in a P. magnus strain with less albumin-binding activity revealed that the shuffled module was missing. Based on this fact and observations made when comparing gene sequences of this family of bacterial surface proteins interacting with albumin and/or immunoglobulin, a model is presented that can explain how this rapid intronless evolution takes place. A new kind of genetic element is introduced: the recer sequence promoting interdomain, in frame recombination and acting as a structure-less flexibility-promoting spacer in the corresponding protein. The data presented also suggest that antibiotics could represent the selective pressure behind the shuffling of protein modules in P. magnus, a member of the indigenous bacterial flora.
Resumo:
The regions surrounding the catalytic amino acids previously identified in a few "retaining" O-glycosyl hydrolases (EC 3.2.1) have been analyzed by hydrophobic cluster analysis and have been used to define sequence motifs. These motifs have been found in more than 150 glycosyl hydrolase sequences representing at least eight established protein families that act on a large variety of substrates. This allows the localization and the precise role of the catalytic residues (nucleophile and acid catalyst) to be predicted for each of these enzymes, including several lysosomal glycosidases. An identical arrangement of the catalytic nucleophile was also found for S-glycosyl hydrolases (myrosinases; EC 3.2.3.1) for which the acid catalyst is lacking. A (beta/alpha)8 barrel structure has been reported for two of the eight families of proteins that have been grouped. It is suggested that the six other families also share this fold at their catalytic domain. These enzymes illustrate how evolutionary events led to a wide diversification of substrate specificity with a similar disposition of identical catalytic residues onto the same ancestral (beta/alpha)8 barrel structure.
Resumo:
The C2 domain is one of the most frequent and widely distributed calcium-binding motifs. Its structure comprises an eight-stranded beta-sandwich with two structural types as if the result of a circular permutation. Combining sequence, structural and modelling information, we have explored, at different levels of granularity, the functional characteristics of several families of C2 domains. At the coarsest level,the similarity correlates with key structural determinants of the C2 domain fold and, at the finest level, with the domain architecture of the proteins containing them, highlighting the functional diversity between the various subfamilies. The functional diversity appears as different conserved surface patches throughout this common fold. In some cases, these patches are related to substrate-binding sites whereas in others they correspond to interfaces of presumably permanent interaction between other domains within the same polypeptide chain. For those related to substrate-binding sites, the predictions overlap with biochemical data in addition to providing some novel observations. For those acting as protein-protein interfaces' our modelling analysis suggests that slight variations between families are a result of not only complementary adaptations in the interfaces involved but also different domain architecture. In the light of the sequence and structural genomic projects, the work presented here shows that modelling approaches along with careful sub-typing of protein families will be a powerful combination for a broader coverage in proteomics. (C) 2003 Elsevier Ltd. All rights reserved.
Resumo:
The EF-hand superfamily of calcium binding proteins includes the S100, calcium binding protein, and troponin subfamilies. This study represents a genome, structure, and expression analysis of the S100 protein family, in mouse, human, and rat. We confirm the high level of conservation between mammalian sequences but show that four members, including S100A12, are present only in the human genome. We describe three new members of the S100 family in the three species and their locations within the S100 genomic clusters and propose a revised nomenclature and phylogenetic relationship between members of the EF-hand superfamily. Two of the three new genes were induced in bone-marrow-derived macrophages activated with bacterial lipopolysaccharide, suggesting a role in inflammation. Normal human and murine tissue distribution profiles indicate that some members of the family are expressed in a specific manner, whereas others are more ubiquitous. Structure-function analysis of the chemotactic properties of murine S100A8 and human S100A12, particularly within the active hinge domain, suggests that the human protein is the functional homolog of the murine protein. Strong similarities between the promoter regions of human S100A12 and murine S100A8 support this possibility. This study provides insights into the possible processes of evolution of the EF-hand protein superfamily. Evolution of the S100 proteins appears to have occurred in a modular fashion, also seen in other protein families such as the C2H2-type zinc-finger family. (C) 2004 Elsevier Inc. All rights reserved.
Resumo:
The G-protein coupled receptors--or GPCRs--comprise simultaneously one of the largest and one of the most multi-functional protein families known to modern-day molecular bioscience. From a drug discovery and pharmaceutical industry perspective, the GPCRs constitute one of the most commercially and economically important groups of proteins known. The GPCRs undertake numerous vital metabolic functions and interact with a hugely diverse range of small and large ligands. Many different methodologies have been developed to efficiently and accurately classify the GPCRs. These range from motif-based techniques to machine learning as well as a variety of alignment-free techniques based on the physiochemical properties of sequences. We review here the available methodologies for the classification of GPCRs. Part of this work focuses on how we have tried to build the intrinsically hierarchical nature of sequence relations, implicit within the family, into an adaptive approach to classification. Importantly, we also allude to some of the key innate problems in developing an effective approach to classifying the GPCRs: the lack of sequence similarity between the six classes that comprise the GPCR family and the low sequence similarity to other family members evinced by many newly revealed members of the family.
Resumo:
Nerve development, which includes axon outgrowth and guidance, is regulated by many protein families, including receptor protein tyrosine phosphatases (RPTP's).Protein tyrosine phosphatase receptor type 0 (PTPRO) is a type III RPTP that is important for axon growth and guidance, as observed in chicks and flies. In order to examine the effects ofPTPRO on mammalian development, standard behavioral tests were used to compare mice lacking the gene for PTPRO (ROKO mice) to wild-type (WT) mice. The ROKO mice showed a significant delay in reacting to a thermal noxious stimulus, hotplate analgesia, when compared to the WT mice suggesting deficient nociceptive function. In a rotarod test for proprioceptive function the ROKO mice exhibited a significant decrease in the amount of time spent on the rotating rod than did the WT mice. Additional proprioception tests were performed including the climb, step reflex, beam, and mesh walk tests. In the climb and step (place) test, the ROKO group had a significantly lower accuracy in performing the tests than did the WT mice. Thus, mice lacking the PTPRO gene showed behavioral deficiencies that reflect impairment in sensory function, specifically for nociception and proprioception.