977 resultados para PROTEIN DOMAINS


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Integral membrane proteins (IMPs) contain localization signals necessary for targeting to their resident subcellular compartments. To define signals that mediate localization to the Golgi complex, we have analyzed a resident IMP of the Saccharomyces cerevisiae Golgi complex, guanosine diphosphatase (GDPase). GDPase, which is necessary for Golgi-specific glycosylation reactions, is a type II IMP with a short amino-terminal cytoplasmic domain, a single transmembrane domain (TMD), and a large catalytic lumenal domain. Regions specifying Golgi localization were identified by analyzing recombinant proteins either lacking GDPase domains or containing corresponding domains from type II vacuolar IMPs. Neither deletion nor substitution of the GDPase cytoplasmic domain perturbed Golgi localization. Exchanging the GDPase TMD with vacuolar protein TMDs only marginally affected Golgi localization. Replacement of the lumenal domain resulted in mislocalization of the chimeric protein from the Golgi to the vacuole, but a similar substitution leaving 34 amino acids of the GDPase lumenal domain intact was properly localized. These results identify a major Golgi localization determinant in the membrane-adjacent lumenal region (stem) of GDPase. Although necessary, the stem domain is not sufficient to mediate localization; in addition, a membrane-anchoring domain and either the cytoplasmic or full-length lumenal domain must be present to maintain Golgi residence. The importance of lumenal domain sequences in GDPase Golgi localization and the requirement for multiple hydrophilic protein domains support a model for Golgi localization invoking proteinprotein interactions rather than interactions between the TMD and the lipid bilayer.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Caenorhabditis elegans dynamin is expressed at high levels in neurons and at lower levels in other cell types, consistent with the important role that dynamin plays in the recycling of synaptic vesicles. Indirect immunofluorescence showed that dynamin is concentrated along the dorsal and ventral nerve cords and in the synapse-rich nerve ring. Green fluorescent protein (GFP) fused to the N terminus of dynamin is localized to synapse-rich regions. Furthermore, this chimera was detected along the apical membrane of intestinal cells, in spermathecae, and in coelomocytes. Dynamin localization was not affected by disrupting axonal transport of synaptic vesicles in the unc-104 (kinesin) mutant. To investigate the alternative mechanisms that dynamin might use for translocation to the synapse, we systematically tested the localization of different protein domains by fusion to GFP. Localization of each chimera was measured in one specific neuron, the ALM. The GTPase, a middle domain, and the putative coiled coil each contribute to synaptic localization. Surprisingly, the pleckstrin homology domain and the proline-rich domain, which are known to bind to coated-pit constituents, did not contribute to synaptic localization. The GFP-GTPase chimera was most strongly localized, although the GTPase domain has no known interactions with proteins other than with dynamin itself. Our results suggest that different dynamin domains contribute to axonal transport and the sequestration of a pool of dynamin molecules in synaptic cytosol.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

There is no control over the information provided with sequences when they are deposited in the sequence databases. Consequently mistakes can seed the incorrect annotation of other sequences. Grouping genes into families and applying controlled annotation overcomes the problems of incorrect annotation associated with individual sequences. Two databases (http://www.mendel.ac.uk) were created to apply controlled annotation to plant genes and plant ESTs: Mendel-GFDb is a database of plant protein (gene) families based on gapped-BLAST analysis of all sequences in the SWISS-PROT family of databases. Sequences are aligned (ClustalW) and identical and similar residues shaded. The families are visually curated to ensure that one or more criteria, for example overall relatedness and/or domain similarity relate all sequences within a family. Sequence families are assigned a ‘Gene Family Number’ and a unified description is developed which best describes the family and its members. If authority exists the gene family is assigned a ‘Gene Family Name’. This information is placed in Mendel-GFDb. Mendel-ESTS is primarily a database of plant ESTs, which have been compared to Mendel-GFDb, completely sequenced genomes and domain databases. This approach associated ESTs with individual sequences and the controlled annotation of gene families and protein domains; the information being placed in Mendel-ESTS. The controlled annotation applied to genes and ESTs provides a basis from which a plant transcription database can be developed.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The K homology (KH) module is a widespread RNA-binding motif that has been detected by sequence similarity searches in such proteins as heterogeneous nuclear ribonucleoprotein K (hnRNP K) and ribosomal protein S3. Analysis of spatial structures of KH domains in hnRNP K and S3 reveals that they are topologically dissimilar and thus belong to different protein folds. Thus KH motif proteins provide a rare example of protein domains that share significant sequence similarity in the motif regions but possess globally distinct structures. The two distinct topologies might have arisen from an ancestral KH motif protein by N- and C-terminal extensions, or one of the existing topologies may have evolved from the other by extension, displacement and deletion. C-terminal extension (deletion) requires β-sheet rearrangement through the insertion (removal) of a β-strand in a manner similar to that observed in serine protease inhibitors serpins. Current analysis offers a new look on how proteins can change fold in the course of evolution.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We present an approach for assessing the significance of sequence and structure comparisons by using nearly identical statistical formalisms for both sequence and structure. Doing so involves an all-vs.-all comparison of protein domains [taken here from the Structural Classification of Proteins (scop) database] and then fitting a simple distribution function to the observed scores. By using this distribution, we can attach a statistical significance to each comparison score in the form of a P value, the probability that a better score would occur by chance. As expected, we find that the scores for sequence matching follow an extreme-value distribution. The agreement, moreover, between the P values that we derive from this distribution and those reported by standard programs (e.g., blast and fasta validates our approach. Structure comparison scores also follow an extreme-value distribution when the statistics are expressed in terms of a structural alignment score (essentially the sum of reciprocated distances between aligned atoms minus gap penalties). We find that the traditional metric of structural similarity, the rms deviation in atom positions after fitting aligned atoms, follows a different distribution of scores and does not perform as well as the structural alignment score. Comparison of the sequence and structure statistics for pairs of proteins known to be related distantly shows that structural comparison is able to detect approximately twice as many distant relationships as sequence comparison at the same error rate. The comparison also indicates that there are very few pairs with significant similarity in terms of sequence but not structure whereas many pairs have significant similarity in terms of structure but not sequence.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The guinea pig estrogen sulfotransferase gene has been cloned and compared to three other cloned steroid and phenol sulfotransferase genes (human estrogen sulfotransferase, human phenol sulfotransferase, and guinea pig 3 alpha-hydroxysteroid sulfotransferase). The four sulfotransferase genes demonstrate a common outstanding feature: the splice sites for their 3'-terminal exons are identically located. That is, the 3'-terminal exon splice sites involve a glycine that constitutes the N-terminal glycine of an invariably conserved GXXGXXK motif present in all steroid and phenol sulfotransferases for which primary structures are known. This consistency strongly suggests that all steroid and phenol sulfotransferase genes will be similarly spliced. The GXXGXXK motif forms the active binding site for the universal sulfonate donor 3'-phosphoadenosine 5'-phosphosulfate. Amino acid sequence alignment of 19 cloned steroid and phenol sulfotransferases starting with the GXXGXXK motif indicates that the 3'-terminal exon for each steroid and phenol sulfotransferase gene encodes a similarly sized C-terminal fragment of the protein. Interestingly, on further analysis of the alignment, three distinct amino acid sequence patterns emerge. The presence of the conserved functional GXXGXXK motif suggests that the protein domains encoded by steroid and phenol sulfotransferase 3'-terminal exons have evolved from a common ancestor. Furthermore, it is hypothesized that during the course of evolution, the 3'-terminal exon further diverged into at least three sulfotransferase subdivisions: a phenol or aryl group, an estrogen or phenolic steroid group, and a neutral steroid group.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

During viral infection, fusion of the viral envelope with endosomal membranes and nucleocapsid release were thought to be concomitant events. We show here that for the vesicular stomatitis virus they occur sequentially, at two successive steps of the endocytic pathway. Fusion already occurs in transport intermediates between early and late endosomes, presumably releasing the nucleocapsid within the lumen of intra- endosomal vesicles, where it remains hidden. Transport to late endosomes is then required for the nucleocapsid to be delivered to the cytoplasm. This last step, which initiates infection, depends on the late endosomal lipid lysobisphosphatidic acid ( LBPA) and its putative effector Alix/ AIP1, and is regulated by phosphatidylinositol- 3-phosphate ( PtdIns( 3) P) signalling via the PtdIns( 3) P- binding protein Snx16. We conclude that the nucleocapsid is exported into the cytoplasm after the back- fusion of internal vesicles with the limiting membrane of late endosomes, and that this process is controlled by the phospholipids LBPA and PtdIns( 3) P and their effectors.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The Major Histocompatibility Complex (MHC) comprises the most polymorphic loci in animals. MHC plays an important role during the first steps of the immune response in vertebrates. In humans, MHC molecules (also named human leukocyte antigens, HLA) were initially regarded as class I or class II molecules. Each of them, presents to different T cells subsets. MHC class I molecules, are heterodimers in which the heavy chain (alpha) has three extracellular domains, two of which (alpha 1 and alpha 2) are polymorphic and conform the antigen recognition sites (ARS). The ARS is thought to be subjected to balancing selection for variability, which is the cause of the very high polymorphism of the MHC molecules. Different pathogenic epitopes would be the evolutionary force causing balancing selection. MHC class I genes have been completely sequenced (α1 and α2 protein domains) and thoroughly studied in Gallus gallus (chicken) as well as in mammals. In fact, the MHC locus was first defined in chicken, specifically in the highly consanguineous variety „Leghorn‟. It has been found that, in the case of chickens the MHC genetic region is considerably smaller than it is in mammals (remarkably shorter introns were found in chickens), and is organized quite differently. The noteworthy presence of short introns in chickens; supported the hypothesis that chicken‟s MHC represented a „minimal essential MHC‟. Until now, it has been assumed that chicken (order Galliformes) MHC was similar to all species included in the whole class Aves...

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: The Nme gene family is involved in multiple physiological and pathological processes such as cellular differentiation, development, metastatic dissemination, and cilia functions. Despite the known importance of Nme genes and their use as clinical markers of tumor aggressiveness, the associated cellular mechanisms remain poorly understood. Over the last 20 years, several non-vertebrate model species have been used to investigate Nme functions. However, the evolutionary history of the family remains poorly understood outside the vertebrate lineage. The aim of the study was thus to elucidate the evolutionary history of the Nme gene family in Metazoans. Methodology/Principal Findings: Using a total of 21 eukaryote species including 14 metazoans, the evolutionary history of Nme genes was reconstructed in the metazoan lineage. We demonstrated that the complexity of the Nme gene family, initially thought to be restricted to chordates, was also shared by the metazoan ancestor. We also provide evidence suggesting that the complexity of the family is mainly a eukaryotic innovation, with the exception of Nme8 that is likely to be a choanoflagellate/metazoan innovation. Highly conserved gene structure, genomic linkage, and protein domains were identified among metazoans, some features being also conserved in eukaryotes. When considering the entire Nme family, the starlet sea anemone is the studied metazoan species exhibiting the most conserved gene and protein sequence features with humans. In addition, we were able to show that most of the proteins known to interact with human NME proteins were also found in starlet sea anemone. Conclusion/Significance: Together, our observations further support the association of Nme genes with key cellular functions that have been conserved throughout metazoan evolution. Future investigations of evolutionarily conserved Nme gene functions using the starlet sea anemone could shed new light on a wide variety of key developmental and cellular processes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The Group A Streptococcus (GAS), or Streptococcus pyogenes, is a strict human pathogen that colonizes a variety of sites within the host. Infections can vary from minor and easily treatable, to life-threatening, invasive forms of disease. In order to adapt to niches, GAS utilizes environmental cues, such as carbohydrates, to coordinate the expression of virulence factors. Research efforts to date have focused on identifying how either components of the phosphoenolpyruvate-phosphotransferase system (PTS) or global transcriptional networks affect the regulation of virulence factors, but not the synergistic relationship between the two. The present study investigates the role of a putative PTS-fructose operon encoded by fruRBA and its role in virulence in the M1T1 strain 5448. Growth in fructose resulted in induction of fruRBA. RT-PCR showed that fruRBA formed an operon, which was repressed by FruR in the absence of fructose. Growth and carbon utilization profiles revealed that although the entire fruRBA operon was required for growth in fructose, FruA was the main fructose transporter. The ability of both ΔfruR and ΔfruB mutants to survive in whole human blood or neutrophils was impaired. However, the phenotypes were not reproduced in murine whole blood or in a mouse intraperitoneal infection, indicating a human-specific mechanism. While it is known that the PTS can affect activity of the Mga virulence regulator, further characterization of the mechanism by which sugars and its protein domains affect activity have not been studied. Transcriptional studies revealed that the core Mga regulon is activated more in a glucose-rich than a glucose-poor environment. This activation correlates with the differential phosphorylation of Mga at its PTS regulatory domains (PRDs). Using a 5448 mga mutant, transcriptome studies in THY or C media established that the Mga regulon reflects the media used. Interestingly, Mga regulates phage-encoded DNases in a low glucose environment. We also show that Mga activity is dependent on C-terminal amino acid interactions that aid in the formation of homodimers. Overall, the studies presented sought to define how external environmental cues, specifically carbohydrates, control complex regulatory networks used by GAS, contribute to pathogenesis, and aid in adaptation to various nutrient conditions encountered.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Analyzing large-scale gene expression data is a labor-intensive and time-consuming process. To make data analysis easier, we developed a set of pipelines for rapid processing and analysis poplar gene expression data for knowledge discovery. Of all pipelines developed, differentially expressed genes (DEGs) pipeline is the one designed to identify biologically important genes that are differentially expressed in one of multiple time points for conditions. Pathway analysis pipeline was designed to identify the differentially expression metabolic pathways. Protein domain enrichment pipeline can identify the enriched protein domains present in the DEGs. Finally, Gene Ontology (GO) enrichment analysis pipeline was developed to identify the enriched GO terms in the DEGs. Our pipeline tools can analyze both microarray gene data and high-throughput gene data. These two types of data are obtained by two different technologies. A microarray technology is to measure gene expression levels via microarray chips, a collection of microscopic DNA spots attached to a solid (glass) surface, whereas high throughput sequencing, also called as the next-generation sequencing, is a new technology to measure gene expression levels by directly sequencing mRNAs, and obtaining each mRNA’s copy numbers in cells or tissues. We also developed a web portal (http://sys.bio.mtu.edu/) to make all pipelines available to public to facilitate users to analyze their gene expression data. In addition to the analyses mentioned above, it can also perform GO hierarchy analysis, i.e. construct GO trees using a list of GO terms as an input.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The present work describes the molecular characterization of five circular plasmids found in the human clinical strain Lactococcus garvieae 21881. The plasmids were designated pGL1-pGL5, with molecular sizes of 4,536 bp, 4,572 bp, 12,948 bp, 14,006 bp and 68,798 bp, respectively. Based on detailed sequence analysis, some of these plasmids appear to be mosaics composed of DNA obtained by modular exchange between different species of lactic acid bacteria. Based on sequence data and the derived presence of certain genes and proteins, the plasmid pGL2 appears to replicate via a rolling-circle mechanism, while the other four plasmids appear to belong to the group of lactococcal theta-type replicons. The plasmids pGL1, pGL2 and pGL5 encode putative proteins related with bacteriocin synthesis and bacteriocin secretion and immunity. The plasmid pGL5 harbors genes (txn, orf5 and orf25) encoding proteins that could be considered putative virulence factors. The gene txn encodes a protein with an enzymatic domain corresponding to the family actin-ADP-ribosyltransferases toxins, which are known to play a key role in pathogenesis of a variety of bacterial pathogens. The genes orf5 and orf25 encode two putative surface proteins containing the cell wall-sorting motif LPXTG, with mucin-binding and collagen-binding protein domains, respectively. These proteins could be involved in the adherence of L. garvieae to mucus from the intestine, facilitating further interaction with intestinal epithelial cells and to collagenous tissues such as the collagen-rich heart valves. To our knowledge, this is the first report on the characterization of plasmids in a human clinical strain of this pathogen.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We explore the fuse of information on co-occurrence of domains in multi-domain proteins in predicting protein-protein interactions. The basic premise of our work is the assumption that domains co-occurring in a polypeptide chain undergo either structural or functional interactions among themselves. In this study we use a template dataset of domains in multidomain proteins and predict protein-protein interactions in a target organism. We note that maximum number of correct predictions of interacting protein domain families (158) is made in S. cerevisiae when the dataset of closely related organisms is used as the template followed by the more diverse dataset of bacterial proteins (48) and a dataset of randomly chosen proteins (23). We conclude that use of multi-domain information from organisms closely-related to the target can aid prediction of interacting protein families.