5 resultados para Codon Usage
em University of Queensland eSpace - Australia
Resumo:
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms. (c) 2005 Elsevier Ltd. All rights reserved.
Resumo:
Translational pausing may occur due to a number of mechanisms, including the presence of non-optimal codons, and it is thought to play a role in the folding of specific polypeptide domains during translation and in the facilitation of signal peptide recognition during see-dependent protein targeting. In this whole genome analysis of Escherichia coli we have found that non-optimal codons in the signal peptide-encoding sequences of secretory genes are overrepresented relative to the mature portions of these genes; this is in addition to their overrepresentation in the 5'-regions of genes encoding non-secretory proteins. We also find increased non-optimal codon usage at the 3' ends of most E. coli genes, in both non-secretory and secretory sequences. Whereas presumptive translational pausing at the 5' and 3' ends of E. coli messenger RNAs may clearly have a general role in translation, we suggest that it also has a specific role in sec-dependent protein export, possibly in facilitating signal peptide recognition. This finding may have important implications for our understanding of how the majority of non-cytoplasmic proteins are targeted, a process that is essential to all biological cells. (C) 2004 Elsevier Inc. All rights reserved.
Resumo:
Schistosoma japonicum paramyosin, a 97 kDa myofibrillar protein, is a recognized vaccine candidate against schistosomiasis. To improve its expression and to identify protective epitopic regions on paramyosin, the published Chinese Schistosoma japonicum paramyosin cDNA sequence was redesigned using Pichia codon usage and divided into four overlapping fragments (fragments 1, 2, 3, 4) of 747, 651, 669 and 678 bp, respectively. These gene fragments were synthesized and expressed in Pichia pastoris (fragments 2 and 3) or E. coli (fragments 1 and 4). The recombinant proteins were produced at high level and purified using a two-step process involving Ni-NTA affinity chromatography and gel filtration. BALB/c mice were immunized subcutaneously three times at 2-week-intervals with the purified proteins formulated in adjuvant Quil A. The protein fragments were highly immunogenic, inducing high, though variable, ELISA antibody titres, and each was shown to resemble native paramyosin in terms of its recognition by the anti-fragment antibodies in Western blotting. The immunized mice were subjected to cercarial challenge 2 weeks after the final injection and promising protective efficacy in terms of significant reductions in worm burdens, worm-pair numbers and liver eggs in the vaccinated mice resulted. There was no apparent correlation between the antibody titres generated and protective efficacy, as all fragments produced effective but similar levels of protection.
Resumo:
Failure to express soluble proteins in bacteria is mainly attributed to the properties of the target protein itself, as well as the choice of the vector, the purification tag and the linker between the tag and protein, and codon usage. The expression of proteins with fusion tags to facilitate subsequent purification steps is a widely used procedure in the production of recombinant proteins. However, the additional residues can affect the properties of the protein; therefore, it is often desirable to remove the tag after purification. This is usually done by engineering a cleavage site between the tag and the encoded protein that is recognised by a site-specific protease, such as the one from tobacco etch virus (TEV). In this study, we investigated the effect of four different tags on the bacterial expression and solubility of nine mouse proteins. Two of the four engineered constructs contained hexahistidine tags with either a long or short linker. The other two constructs contained a TEV cleavage site engineered into the linker region. Our data show that inclusion of the TEV recognition site directly downstream of the recombination site of the Invitrogen Gateway vector resulted, in a loss of solubility of the nine mouse proteins. Our work suggests that one needs to be very careful when making modifications to expression vectors and combining different affinity and fusion tags and cleavage sites: (c) 2006 Elsevier Inc. All rights reserved.
Resumo:
Cystic fibrosis is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene, which encodes a chloride channel present in many cells. In cardiomyocytes, we report that multiple exon 1 usage and alternative splicing produces four CFTR transcripts, with different 5'-untranslated regions, CFTRTRAD-139, CFTR-1C/-1A, CFTR-1C, and CFTR-1B. CFTR transcripts containing the novel upstream exons (exons -1C, -1B, and -1A) represent more than 90% of cardiac expressed CFTR mRNA. Regulation of cardiac CFTR expression, in response to developmental and pathological stimuli, is exclusively due to the modulation of CFTR-1C and CFTR-1C/-1A expression. Upstream open reading frames have been identified in the 5'-untranslated regions of all CFTR transcripts that, in conjunction with adjacent stem-loop structures, modulate the efficiency of translation initiation at the AUG codon of the main CFTR coding region in CFTRTRAD-139 and CFTR-1C/-1A transcripts. Exon(-1A), only present in CFTR-1C/-1A transcripts, encodes an AUG codon that is in-frame with the main CFTR open reading frame, the efficient translation of which produces a novel CFTR protein isoform with a curtailed amino terminus. As the expression of this CFTR transcript parallels the spatial and temporal distribution of the cAMP-activated whole-cell current density in normal and diseased hearts, we suggest that CFTR-1C/-1A provides the molecular basis for the cardiac cAMP-activated chloride channel. Our findings provide further insight into the complex nature of in vivo CFTR expression, to which multiple mRNA transcripts, protein isoforms, and post-transcriptional regulatory mechanisms are now added.