990 results for PROTEIN FAMILIES
Abstract:
The A mating type genes of the mushroom Coprinus cinereus encode two families of dissimilar homeodomain proteins (HD1 and HD2). The proteins heterodimerize when mating cells fuse to generate a transcriptional regulator that promotes expression of genes required for early steps in sexual development. In previous work we showed that heterodimerization brings together different functional domains of the HD1 and HD2 proteins; a potential activation domain at the C terminus of the HD1 protein and an essential HD2 DNA-binding motif. Two predicted nuclear localization signals (NLS) are present in the HD1 protein but none are in the HD2 protein. We deleted each NLS separately from an HD1 protein and showed that one (NLS1) is essential for normal heterodimer function. Fusion of the NLS sequences to the C terminus of an HD2 protein compensated for their deletion from the HD1 protein partner and permitted the two modified proteins to form a functional transcriptional regulator. The nuclear targeting properties of the A protein NLS sequences were demonstrated by fusing the region that encodes them to the bacterial uidA (β-glucuronidase) gene and showing that β-glucuronidase expression localized to the nuclei of onion epidermal cells. These observations lead to the proposal that heterodimerization regulates entry of the active transcription factor complex to the nucleus.
Abstract:
In the last decade, two tools, one drawn from information theory and the other from artificial neural networks, have proven particularly useful in many different areas of sequence analysis. The work presented herein indicates that these two approaches can be joined in a general fashion to produce a very powerful search engine that is capable of locating members of a given nucleic acid sequence family in either local or global sequence searches. This program can, in turn, be queried for its definition of the motif under investigation, ranking each base in context for its contribution to membership in the motif family. In principle, the method used can be applied to any binding motif, including both DNA and RNA sequence families, given sufficient family size.
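As a rough illustration of the information-theoretic side of such a search, the sketch below computes the per-position information content (in bits) of a small set of aligned nucleic acid sites. It is not the program described in the abstract: the function name `position_information`, the uniform background, the absence of any small-sample correction, and the example sites are all assumptions made here for illustration, and the neural-network component is not shown.

```python
import math
from collections import Counter


def position_information(aligned_sites, alphabet="ACGT"):
    """Return the information content (bits) of each column of equal-length sites.

    Assumes a uniform background over `alphabet`, sites made up only of
    alphabet characters, and no small-sample correction.
    """
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("all sites must have the same length")

    max_bits = math.log2(len(alphabet))  # 2 bits for a four-letter alphabet
    info = []
    for col in range(length):
        counts = Counter(site[col] for site in aligned_sites)
        total = sum(counts[base] for base in alphabet)
        entropy = 0.0
        for base in alphabet:
            if counts[base]:
                p = counts[base] / total
                entropy -= p * math.log2(p)
        # Information is the reduction in uncertainty relative to the background.
        info.append(max_bits - entropy)
    return info


# Hypothetical aligned sites, used only to exercise the function.
sites = ["TATAAT", "TATGAT", "TACAAT", "TATATT"]
for position, bits in enumerate(position_information(sites), start=1):
    print(position, round(bits, 3))
```

Columns that are fully conserved score the maximum of 2 bits, while columns with no preference score 0, which gives one simple way to rank positions by their contribution to family membership.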
Abstract:
MetaFam is a comprehensive relational database of protein family information. This web-accessible resource integrates data from several primary sequence and secondary protein family databases. By pooling together the information from these disparate sources, MetaFam is able to provide the most complete protein family sets available. Users are able to explore the interrelationships among these primary and secondary databases using a powerful graphical visualization tool, MetaFamView. Additionally, users can identify corresponding sequence entries among the sequence databases, obtain a quick summary of corresponding families (and their sequence members) among the family databases, and even attempt to classify their own unassigned sequences. Hypertext links to the appropriate source databases are provided at every level of navigation. Global family database statistics and information are also provided. Public access to the data is available at http://metafam.ahc.umn.edu/.
Abstract:
The Protein Information Resource, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the most comprehensive and expertly annotated protein sequence database in the public domain, the PIR-International Protein Sequence Database. To provide timely and high quality annotation and promote database interoperability, the PIR-International employs rule-based and classification-driven procedures based on controlled vocabulary and standard nomenclature and includes status tags to distinguish experimentally determined from predicted protein features. The database contains about 200 000 non-redundant protein sequences, which are classified into families and superfamilies and their domains and motifs identified. Entries are extensively cross-referenced to other sequence, classification, genome, structure and activity databases. The PIR web site features search engines that use sequence similarity and database annotation to facilitate the analysis and functional identification of proteins. The PIR-International databases and search tools are accessible on the PIR web site at http://pir.georgetown.edu/ and at the MIPS web site at http://www.mips.biochem.mpg.de. The PIR-International Protein Sequence Database and other files are also available by FTP.
Abstract:
PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and the data set comprises 604 families of homologous proteins involving 2739 protein domain structures with each family made up of at least two members. Each member in a family has been structurally aligned with every other member in the same family (pairwise alignment) and all the members in the family are also aligned using simultaneous superposition (multiple alignment). The structural alignments are performed largely automatically, with manual interventions especially in the cases of distantly related proteins, using the program STAMP (version 4.2). Every family is also associated with two dendrograms, calculated using PHYLIP (version 3.5), one based on a structural dissimilarity metric defined for every pairwise alignment and the other based on similarity of topologically equivalent residues. These dendrograms enable easy comparison of sequence and structure-based relationships among the members in a family. Structure-based alignments with the details of structural and sequence similarities, superposed coordinate sets and dendrograms can be accessed conveniently using a web interface. The database can be queried for protein pairs with sequence or structural similarities falling within a specified range. Thus PALI forms a useful resource to help in analysing the relationship between sequence and structure variation at a given level of sequence similarity. PALI also contains over 653 ‘orphans’ (single member families). Using the web interface involving PSI_BLAST and PHYLIP it is possible to associate the sequence of a new protein with one of the families in PALI and generate a phylogenetic tree combining the query sequence and proteins of known 3-D structure. 
The database with the web interfaced search and dendrogram generation tools can be accessed at http://pauling.mbu.iisc.ernet.in/~pali.
Abstract:
There is no control over the information provided with sequences when they are deposited in the sequence databases. Consequently, mistakes can seed the incorrect annotation of other sequences. Grouping genes into families and applying controlled annotation overcomes the problems of incorrect annotation associated with individual sequences. Two databases (http://www.mendel.ac.uk) were created to apply controlled annotation to plant genes and plant ESTs. Mendel-GFDb is a database of plant protein (gene) families based on gapped-BLAST analysis of all sequences in the SWISS-PROT family of databases. Sequences are aligned (ClustalW) and identical and similar residues shaded. The families are visually curated to ensure that one or more criteria, for example overall relatedness and/or domain similarity, relate all sequences within a family. Sequence families are assigned a ‘Gene Family Number’ and a unified description is developed which best describes the family and its members. If authority exists, the gene family is assigned a ‘Gene Family Name’. This information is placed in Mendel-GFDb. Mendel-ESTS is primarily a database of plant ESTs, which have been compared to Mendel-GFDb, completely sequenced genomes and domain databases. This approach associated ESTs with individual sequences and the controlled annotation of gene families and protein domains, with the information being placed in Mendel-ESTS. The controlled annotation applied to genes and ESTs provides a basis from which a plant transcription database can be developed.
Abstract:
Ewes from the Booroola strain of Australian Merino sheep are characterized by high ovulation rate and litter size. This phenotype is due to the action of the FecB^B allele of a major gene named FecB, as determined by statistical analysis of phenotypic data. By genetic analysis of 31 informative half-sib families from heterozygous sires, we showed that the FecB locus is situated in the region of ovine chromosome 6 corresponding to the human chromosome 4q22–23 that contains the bone morphogenetic protein receptor IB (BMPR-IB) gene encoding a member of the transforming growth factor-β (TGF-β) receptor family. A nonconservative substitution (Q249R) in the BMPR-IB coding sequence was found to be associated fully with the hyperprolificacy phenotype of Booroola ewes. In vitro, ovarian granulosa cells from FecB^B/FecB^B ewes were less responsive than granulosa cells from FecB^+/FecB^+ ewes to the inhibitory effect on steroidogenesis of GDF-5 and BMP-4, natural ligands of BMPR-IB. It is suggested that in FecB^B/FecB^B ewes, BMPR-IB would be inactivated partially, leading to an advanced differentiation of granulosa cells and an advanced maturation of ovulatory follicles.
Abstract:
Local anesthetics, commonly used for treating cardiac arrhythmias, pain, and seizures, are best known for their inhibitory effects on voltage-gated Na+ channels. Cardiovascular and central nervous system toxicity are unwanted side-effects from local anesthetics that cannot be attributed to the inhibition of only Na+ channels. Here, we report that extracellular application of the membrane-permeant local anesthetic bupivacaine selectively inhibited G protein-gated inwardly rectifying K+ channels (GIRK:Kir3) but not other families of inwardly rectifying K+ channels (ROMK:Kir1 and IRK:Kir2). Bupivacaine inhibited GIRK channels within seconds of application, regardless of whether channels were activated through the muscarinic receptor or directly via coexpressed G protein Gβγ subunits. Bupivacaine also inhibited alcohol-induced GIRK currents in the absence of functional pertussis toxin-sensitive G proteins. The mutated GIRK1 and GIRK2 (GIRK1/2) channels containing the high-affinity phosphatidylinositol 4,5-bisphosphate (PIP2) domain from IRK1, on the other hand, showed dramatically less inhibition with bupivacaine. Surprisingly, GIRK1/2 channels with high affinity for PIP2 were inhibited by ethanol, like IRK1 channels. We propose that membrane-permeant local anesthetics inhibit GIRK channels by antagonizing the interaction of PIP2 with the channel, which is essential for Gβγ and ethanol activation of GIRK channels.
Abstract:
We present a method for discovering conserved sequence motifs from families of aligned protein sequences. The method has been implemented as a computer program called emotif (http://motif.stanford.edu/emotif). Given an aligned set of protein sequences, emotif generates a set of motifs with a wide range of specificities and sensitivities. emotif can also generate motifs that describe possible subfamilies of a protein superfamily. A disjunction of such motifs can often represent the entire superfamily with high specificity and sensitivity. We have used emotif to generate sets of motifs from all 7,000 protein alignments in the blocks and prints databases. The resulting database, called identify (http://motif.stanford.edu/identify), contains more than 50,000 motifs. For each alignment, the database contains several motifs whose probability of matching a false positive ranges from 10^−10 to 10^−5. Highly specific motifs are well suited for searching entire proteomes while generating very few false predictions. identify assigns biological functions to 25–30% of all proteins encoded by the Saccharomyces cerevisiae genome and by several bacterial genomes. In particular, identify assigned functions to 172 proteins of unknown function in the yeast genome.
Abstract:
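To make the idea of a motif with a given false-positive probability concrete, the sketch below treats a motif as a sequence of allowed residue groups and estimates the chance that a random peptide matches it at one fixed position. This is only a minimal illustration, not emotif's actual algorithm or scoring: the motif representation, the function names `motif_probability` and `motif_to_regex`, the uniform amino acid background, the assumption of independent positions, and the example motif are all hypothetical.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def motif_probability(motif, background=None):
    """Probability that a random peptide matches `motif` at one fixed position.

    `motif` is a list of strings: each string lists the residues allowed at
    that position, and "." means any residue. Positions are treated as
    independent, and the default background is uniform (an assumption).
    """
    if background is None:
        background = {aa: 1 / len(AMINO_ACIDS) for aa in AMINO_ACIDS}
    probability = 1.0
    for group in motif:
        if group != ".":
            probability *= sum(background[aa] for aa in set(group))
    return probability


def motif_to_regex(motif):
    """Compile the motif into a regular expression for scanning sequences."""
    pattern = "".join("." if group == "." else "[" + group + "]" for group in motif)
    return re.compile(pattern)


# Hypothetical motif: [ILV] x G [KR] x [ST]
motif = ["ILV", ".", "G", "KR", ".", "ST"]
print(motif_probability(motif))
print(bool(motif_to_regex(motif).search("MAVAGKDSQ")))
```

Narrowing the residue groups or adding constrained positions lowers the estimated probability, which is the trade-off between specificity and sensitivity that the abstract describes.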
Anacardic acids, a class of secondary compounds derived from fatty acids, are found in a variety of dicotyledonous families. Pest resistance (e.g., spider mites and aphids) in Pelargonium × hortorum (geranium) is associated with high levels (approximately 81%) of unsaturated 22:1 omega 5 and 24:1 omega 5 anacardic acids in the glandular trichome exudate. A single dominant locus controls the production of these omega 5 anacardic acids, which arise from novel 16:1 delta 11 and 18:1 delta 13 fatty acids. We describe the isolation and characterization of a cDNA encoding a unique delta 9 14:0-acyl carrier protein fatty acid desaturase. Several lines of evidence indicated that expression of this desaturase leads to the production of the omega 5 anacardic acids involved in pest resistance. First, its expression was found in pest-resistant, but not susceptible, plants and its expression followed the production of the omega 5 anacardic acids in segregating populations. Second, its expression and the occurrence of the novel 16:1 delta 11 and 18:1 delta 13 fatty acids and the omega 5 anacardic acids were specific to tall glandular trichomes. Third, assays of the recombinant protein demonstrated that this desaturase produced the 14:1 delta 9 fatty acid precursor to the novel 16:1 delta 11 and 18:1 delta 13 fatty acids. Based on our genetic and biochemical studies, we conclude that expression of this delta 9 14:0-ACP desaturase gene is required for the production of omega 5 anacardic acids that have been shown to be necessary for pest resistance in geranium.
Abstract:
Bone morphogenic protein-1 (BMP-1) was originally identified as one of several BMPs that induced new bone formation when implanted into ectopic sites in rodents. BMP-1, however, differed from other BMPs in that its structure was not similar to transforming growth factor beta. Instead, it had a large domain homologous to a metalloendopeptidase isolated from crayfish, an epidermal growth-factor-like domain, and three regions of internal sequence homology referred to as CUB domains. Therefore, BMP-1 was a member of the "astacin families" of zinc-requiring endopeptidases. Many astacins have been shown to play critical roles in embryonic hatching, dorsal/ventral patterning, and early developmental decisions. Here, we have obtained amino acid sequences and isolated cDNA clones for procollagen C-proteinase (EC 3.4.24.19), an enzyme that is essential for the processing of procollagens to fibrillar collagens. The results demonstrate that procollagen C-proteinase is identical to BMP-1.
Abstract:
BACKGROUND Eosinophilic esophagitis (EoE) is a rapidly emerging, chronic inflammatory, genetically impacted disease of the esophagus, defined clinically by symptoms of esophageal dysfunction and, pathologically, by an eosinophil-predominant tissue infiltration. However, in four EoE-families, we have identified patients presenting with EoE-typical and corticosteroid-responsive symptoms, but without tissue eosinophilia. It was the aim of this study to clinically and immunologically characterize these patients with EoE-like disease. METHODS Five patients suffering from an EoE-like disease were evaluated with endoscopic, histologic, functional and quantitative immunohistologic examinations, and mRNA expression determination. RESULTS The frequency of first generation offspring of EoE-like disease patients affected by EoE or EoE-like disease was 40%. Immunofluorescence analysis confirmed an almost complete absence of eosinophils in the esophageal tissues of patients with EoE-like disease, but revealed a considerable T cell infiltration, comparable to EoE. In contrast to EoE, eotaxin-3 mRNA and protein were markedly reduced in EoE-like disease (P < 0.05). The mRNA expression levels of three selected EoE genes (eotaxin-3, MUC4 and CDH26) allowed discrimination between EoE-like disease, EoE and normal epithelium. CONCLUSIONS Patients suffering from "EoE without eosinophilia" do not formally fulfill the diagnostic criteria for EoE. However, their clinical manifestation, immunohistology and gene-expression pattern, plus the fact that they bequeath EoE to their offspring, suggest a uniform underlying pathogenesis. Conventional EoE, with its prominent eosinophilia, therefore appears to be only one phenotype of a broader "inflammatory dysphagia syndrome" spectrum. In this light, the role of the eosinophils, the definition of EoE, and its diagnostic criteria must likely be reconsidered.
Abstract:
With the completion of the human and mouse genome sequences, the task now turns to identifying their encoded transcripts and assigning gene function. In this study, we have undertaken a computational approach to identify and classify all of the protein kinases and phosphatases present in the mouse gene complement. A nonredundant set of these sequences was produced by mining Ensembl gene predictions and publicly available cDNA sequences with a panel of InterPro domains. This approach identified 561 candidate protein kinases and 162 candidate protein phosphatases. This cohort was then analyzed using TribeMCL protein sequence similarity clustering followed by CLUSTALV alignment and hierarchical tree generation. This approach allowed us to (1) distinguish between true members of the protein kinase and phosphatase families and enzymes of related biochemistry, (2) determine the structure of the families, and (3) suggest functions for previously uncharacterized members. The classifications obtained by this approach were in good agreement with previous schemes and allowed us to demonstrate domain associations with a number of clusters. Finally, we comment on the complementary nature of cDNA and genome-based gene detection and the impact of the FANTOM2 transcriptome project.
Abstract:
A protein-truncating variant of CHEK2, 1100delC, is associated with a moderate increase in breast cancer risk. We have determined the prevalence of this allele in index cases from 300 Australian multiple-case breast cancer families, 95% of which had been found to be negative for mutations in BRCA1 and BRCA2. Only two (0.6%) index cases heterozygous for the CHEK2 mutation were identified. All available relatives in these two families were genotyped, but there was no evidence of co-segregation between the CHEK2 variant and breast cancer. Lymphoblastoid cell lines established from a heterozygous carrier contained approximately 20% of the CHEK2 1100delC mRNA relative to wild-type CHEK2 transcript. However, no truncated CHK2 protein was detectable. Analyses of expression and phosphorylation of wild-type CHK2 suggest that the variant is likely to act by haploinsufficiency. Analysis of CDC25A degradation, a downstream target of CHK2, suggests that some compensation occurs to allow normal degradation of CDC25A. Such compensation of the 1100delC defect in CHEK2 might explain the rather low breast cancer risk associated with the CHEK2 variant, compared to that associated with truncating mutations in BRCA1 or BRCA2.
Abstract:
A large number of macrocyclic miniproteins with diverse biological activities have been isolated from the Rubiaceae, Violaceae, and Cucurbitaceae plant families in recent years. Here we report the three-dimensional structure determined using 1H NMR spectroscopy and demonstrate potent insecticidal activity for one of these peptides, kalata B2. This peptide is one of the major components of an extract from the leaves of the plant Oldenlandia affinis. The structure consists of a distorted triple-stranded beta-sheet and a cystine knot arrangement of the disulfide bonds and is similar to those described for other members of the cyclotide family. The unique cyclic and knotted nature of these molecules makes them a fascinating example of topologically complex proteins. Examination of the sequences reveals that they can be separated into two subfamilies, one of which contains a larger number of positively charged residues and has a bracelet-like circularization of the backbone. The second subfamily contains a backbone twist due to a cis-peptidyl-proline bond and may conceptually be regarded as a molecular Möbius strip. Kalata B2 is the second putative member of the Möbius cyclotide family to be structurally characterized and has a cis-peptidyl-proline bond, thus validating the suggested name for this subfamily of cyclotides. The observation that kalata B2 inhibits the growth and development of Helicoverpa armigera larvae suggests a role for the cyclotides in plant defense. A comparison of the sequences and structures of kalata B1 and B2 provides insight into the biological activity of these peptides.