5 resultados para norm-based coding

em National Center for Biotechnology Information - NCBI


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Many small bacterial, archaebacterial, and eukaryotic genomes have been sequenced, and the larger eukaryotic genomes are predicted to be completely sequenced within the next decade. In all genomes sequenced to date, a large portion of these organisms’ predicted protein coding regions encode polypeptides of unknown biochemical, biophysical, and/or cellular functions. Three-dimensional structures of these proteins may suggest biochemical or biophysical functions. Here we report the crystal structure of one such protein, MJ0577, from a hyperthermophile, Methanococcus jannaschii, at 1.7-Å resolution. The structure contains a bound ATP, suggesting MJ0577 is an ATPase or an ATP-mediated molecular switch, which we confirm by biochemical experiments. Furthermore, the structure reveals different ATP binding motifs that are shared among many homologous hypothetical proteins in this family. This result indicates that structure-based assignment of molecular function is a viable approach for the large-scale biochemical assignment of proteins and for discovering new motifs, a basic premise of structural genomics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Photosystem II is a reaction center protein complex located in photosynthetic membranes of plants, algae, and cyanobacteria. Using light energy, photosystem II catalyzes the oxidation of water and the reduction of plastoquinone, resulting in the release of molecular oxygen. A key component of photosystem II is cytochrome b559, a membrane-embedded heme protein with an unknown function. The cytochrome is unusual in that a heme links two separate polypeptide subunits, α and β, either as a heterodimer (αβ) or as two homodimers (α2 and β2). To determine the structural organization of cytochrome b559 in the membrane, we used site-directed mutagenesis to fuse the coding regions of the two respective genes in the cyanobacterium Synechocystis sp. PCC 6803. In this construction, the C terminus of the α subunit (9 kDa) is attached to the N terminus of the β subunit (5 kDa) to form a 14-kDa αβ fusion protein that is predicted to have two membrane-spanning α-helices with antiparallel orientations. Cells containing the αβ fusion protein grow photoautotrophically and assemble functional photosystem II complexes. Optical spectroscopy shows that the αβ fusion protein binds heme and is incorporated into photosystem II. These data support a structural model of cytochrome b559 in which one heme is coordinated to an α2 homodimer and a second heme is coordinated to a β2 homodimer. In this model, each photosystem II complex contains two cytochrome b559 hemes, with the α2 heme located near the stromal side of the membrane and the β2 heme located near the lumenal side.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

β-Galactosidases (EC 3.2.1.23) constitute a widespread family of enzymes characterized by their ability to hydrolyze terminal, nonreducing β-d-galactosyl residues from β-d-galactosides. Several β-galactosidases, sometimes referred to as exo-galactanases, have been purified from plants and shown to possess in vitro activity against extracted cell wall material via the release of galactose from wall polymers containing β(1→4)-d-galactan. Although β-galactosidase II, a protein present in tomato (Lycopersicon esculentum Mill.) fruit during ripening and capable of degrading tomato fruit galactan, has been purified, cloning of the corresponding gene has been elusive. We report here the cloning of a cDNA, pTomβgal 4 (accession no. AF020390), corresponding to β-galactosidase II, and show that its corresponding gene is expressed during fruit ripening. Northern-blot analysis revealed that the β-galactosidase II gene transcript was detectable at the breaker stage of ripeness, maximum at the turning stage, and present at decreasing levels during the later stages of normal tomato fruit ripening. At the turning stage of ripeness, the transcript was present in all fruit tissues and was highest in the outermost tissues (including the peel). Confirmation that pTomβgal 4 codes for β-galactosidase II was derived from matching protein and deduced amino acid sequences. Furthermore, analysis of the deduced amino acid sequence of pTomβgal 4 suggested a high probability for secretion based on the presence of a hydrophobic leader sequence, a leader-sequence cleavage site, and three possible N-glycosylation sites. The predicted molecular mass and isoelectric point of the pTomβgal 4-encoded mature protein were similar to those reported for the purified β-galactosidase II protein from tomato fruit.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, a new way to think about, and to construct, pairwise as well as multiple alignments of DNA and protein sequences is proposed. Rather than forcing alignments to either align single residues or to introduce gaps by defining an alignment as a path running right from the source up to the sink in the associated dot-matrix diagram, we propose to consider alignments as consistent equivalence relations defined on the set of all positions occurring in all sequences under consideration. We also propose constructing alignments from whole segments exhibiting highly significant overall similarity rather than by aligning individual residues. Consequently, we present an alignment algorithm that (i) is based on segment-to-segment comparison instead of the commonly used residue-to-residue comparison and which (ii) avoids the well-known difficulties concerning the choice of appropriate gap penalties: gaps are not treated explicity, but remain as those parts of the sequences that do not belong to any of the aligned segments. Finally, we discuss the application of our algorithm to two test examples and compare it with commonly used alignment methods. As a first example, we aligned a set of 11 DNA sequences coding for functional helix-loop-helix proteins. Though the sequences show only low overall similarity, our program correctly aligned all of the 11 functional sites, which was a unique result among the methods tested. As a by-product, the reading frames of the sequences were identified. Next, we aligned a set of ribonuclease H proteins and compared our results with alignments produced by other programs as reported by McClure et al. [McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571-592]. Our program was one of the best scoring programs. However, in contrast to other methods, our protein alignments are independent of user-defined parameters.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We had earlier identified the pcnB locus as the gene for the major Escherichia coli poly(A) polymerase (PAP I). In this report, we describe the disruption and identification of a candidate gene for a second poly(A) polymerase (PAP II) by an experimental strategy which was based on the assumption that the viability of E. coli depends on the presence of either PAP I or PAP II. The coding region thus identified is the open reading frame f310, located at about 87 min on the E. coli chromosome. The following lines of evidence support f310 as the gene for PAP II: (i) the deduced peptide encoded by f310 has a molecular weight of 36,300, similar to the molecular weight of 35,000 estimated by gel filtration of PAP II; (ii) the deduced f310 product is a relatively hydrophobic polypeptide with a pI of 9.4, consistent with the properties of partially purified PAP II; (iii) overexpression of f310 leads to the formation of inclusion bodies whose solubilization and renaturation yields poly(A) polymerase activity that corresponds to a 35-kDa protein as shown by enzyme blotting; and (iv) expression of a f310 fusion construct with hexahistidine at the N-terminus of the coding region allowed purification of a poly(A) polymerase fraction whose major component is a 36-kDa protein. E. coli PAP II has no significant sequence homology either to PAP I or to the viral and eukaryotic poly(A) polymerases, suggesting that the bacterial poly(A) polymerases have evolved independently. An interesting feature of the PAP II sequence is the presence of sets of two paired cysteine and histidine residues that resemble the RNA binding motifs seen in some other proteins.