20 results for Assignments
in National Center for Biotechnology Information - NCBI
Abstract:
The parasitic bacterium Mycoplasma genitalium has a small, reduced genome with close to a basic set of genes. As a first step toward determining the families of protein domains that form the products of these genes, we have used the multiple sequence programs psi-blast and geanfammer to match the sequences of the 467 gene products of M. genitalium to the sequences of the domains that form proteins of known structure [Protein Data Bank (PDB) sequences]. PDB sequences (274) match all of 106 M. genitalium sequences and some parts of another 85; thus, 41% of its total sequences are matched in all or part. The evolutionary relationships of the PDB domains that match M. genitalium are described in the structural classification of proteins (SCOP) database. Using this information, we show that the domains in the matched M. genitalium sequences come from 114 superfamilies and that 58% of them have arisen by gene duplication. This level of duplication is more than twice that found by using pairwise sequence comparisons. The PDB domain matches also describe the domain structure of the matched sequences: just over a quarter contain one domain and the rest have combinations of two or more domains.
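The coverage figure quoted in this abstract can be checked with a few lines of arithmetic. The sketch below uses only the counts stated in the abstract; the variable names are illustrative.

```python
# Counts taken from the abstract above; variable names are illustrative.
total_sequences = 467   # M. genitalium gene products
fully_matched = 106     # sequences matched over their whole length
partly_matched = 85     # sequences matched over some part

matched = fully_matched + partly_matched
percent_matched = 100 * matched / total_sequences

print(f"{matched} of {total_sequences} sequences matched ({percent_matched:.0f}%)")
```

The result rounds to the 41% reported in the abstract.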
Abstract:
The electron density map of the small ribosomal subunit from Thermus thermophilus, constructed at 4.5 Å resolution, shows the recognizable morphology of this particle, as well as structural features that were interpreted as ribosomal RNA and proteins. Unbiased assignments, carried out by quantitative covalent binding of heavy atom compounds at predetermined sites, led to the localization of the surface of the ribosomal protein S13 at a position compatible with previous assignments, whereas the surface of S11 was localized at a distance of about twice its diameter from the site suggested for its center by neutron scattering. Proteins S5 and S7, whose structures have been determined crystallographically, were visually placed in the map with no alterations in their conformations. Regions suitable to host the fold of protein S15 were detected in several positions, all at a significant distance from the location of this protein in the neutron scattering map. Targeting the 16S RNA region, where mRNA docks to allow the formation of the initiation complex by a mercurated mRNA analog, led to the characterization of its vicinity.
Abstract:
The pupal defensive secretion of the 24-pointed ladybird beetle, Subcoccinella vigintiquatuorpunctata, consists of a mixture of macrocyclic polyamines, dominated by the three dimeric, 30-membered macrocycles 11-13, derived from the two building blocks 11-(2-hydroxyethylamino)-5-tetradecenoic acid (9) and 11-(2-hydroxyethylamino)-5,8-tetradecadienoic acid (10). Smaller amounts of the four possible cyclic trimers of 9 and 10 were also detected, corresponding to 45-membered macrocycles. Structural assignments were based on NMR-spectroscopic investigations and HPLC–MS analyses. In addition, the all-S absolute configuration of the S. vigintiquatuorpunctata macrocycles was determined by comparison of derivatives of the natural material with enantiomerically pure synthetic samples. Comparing this alkaloid mixture with that of the pupal defensive secretion in related ladybird beetle species indicates that the degree of oligomerization of the 2-hydroxyethylamino carboxylic acid building blocks can be carefully controlled by the insects.
Abstract:
13C-selective NMR, combined with inhibitor perturbation experiments, shows that the Cɛ1—H proton of the catalytic histidine in resting α-lytic protease and subtilisin BPN′ resonates, when protonated, at 9.22 ppm and 9.18 ppm, respectively, which is outside the normal range for such protons and ≈0.6 to 0.8 ppm further downfield than previously reported. They also show that the previous α-lytic protease assignments [Markley, J. L., Neves, D. E., Westler, W. M., Ibanez, I. B., Porubcan, M. A. & Baillargeon, M. W. (1980) Front. Protein Chem. 10, 31–61] were to signals from inactive or denatured protein. Simulations of linewidth vs. pH demonstrate that the true signal is more difficult to detect than corresponding signals from inactive derivatives, owing to higher imidazole pKa values and larger chemical shift differences between protonated and neutral forms. A compilation and analysis of available NMR data indicates that the true Cɛ1—H signals from other serine proteases are similarly displaced downfield, with past assignments to more upfield signals probably in error. The downfield displacement of these proton resonances is shown to be consistent with an H-bond involving the histidine Cɛ1—H as donor, confirming the original hypothesis of Derewenda et al. [Derewenda, Z. S., Derewenda, U. & Kobos, P. M. (1994) J. Mol. Biol. 241, 83–93], which was based on an analysis of literature x-ray crystal structures of serine hydrolases. The invariability of this H-bond among enzymes containing Asp-His-Ser triads indicates functional importance. Here, we propose that it enables a reaction-driven imidazole ring flip mechanism, overcoming a major dilemma inherent in all previous mechanisms, namely how these enzymes catalyze both the formation and productive breakdown of tetrahedral intermediates.
Abstract:
The prion diseases seem to be caused by a conformational change of the prion protein (PrP) from the benign cellular form PrPC to the infectious scrapie form PrPSc; thus, detailed information about PrP structure may provide essential insights into the mechanism by which these diseases develop. In this study, the secondary structure of the recombinant Syrian hamster PrP of residues 29–231 [PrP(29–231)] is investigated by multidimensional heteronuclear NMR. Chemical shift index analysis and nuclear Overhauser effect data show that PrP(29–231) contains three helices and possibly one short β-strand. Most striking is the random-coil nature of chemical shifts for residues 30–124 in the full-length PrP. Although the secondary structure elements are similar to those found in mouse PrP fragment PrP(121–231), the secondary structure boundaries of PrP(29–231) are different from those in mouse PrP(121–231) but similar to those found in the structure of Syrian hamster PrP(90–231). Comparison of resonance assignments of PrP(29–231) and PrP(90–231) indicates that there may be transient interactions between the additional residues and the structured core. Backbone dynamics studies done by using the heteronuclear [1H]-15N nuclear Overhauser effect indicate that almost half of PrP(29–231), residues 29–124, is highly flexible. This plastic region could feature in the conversion of PrPC to PrPSc by template-assisted formation of β-structure.
Abstract:
To study the origin and evolution of biochemical pathways in microorganisms, we have developed methods and software for automatic, large-scale reconstructions of phylogenetic relationships. We define the complete set of phylogenetic trees derived from the proteome of an organism as the phylome and introduce the term phylogenetic connection as a concept that describes the relative relationships between taxa in a tree. A query facility has been incorporated into the system to allow searches for defined categories of trees within the phylome. As a complement, we have developed the pyphy system for visualising the results of complex queries on phylogenetic connections, genomic locations and functional assignments in a graphical format. Our phylogenomics approach, which links phylogenetic information to the flow of biochemical pathways within and among microbial species, has been used to examine more than 8000 phylogenetic trees from seven microbial genomes. The results have revealed a rich web of phylogenetic connections. However, the separation of Bacteria and Archaea into two distinct domains remains robust.
Abstract:
The Helix Research Institute (HRI) in Japan is releasing 4356 HUman Novel Transcripts and related information in the newly established HUNT database. The institute is a joint research project principally funded by the Japanese Ministry of International Trade and Industry, and the clones were sequenced in the governmental New Energy and Industrial Technology Development Organization (NEDO) Human cDNA Sequencing Project. The HUNT database contains an extensive amount of annotation from advanced analysis and represents an essential bioinformatics contribution towards understanding of the gene function. The HRI human cDNA clones were obtained from full-length enriched cDNA libraries constructed with the oligo-capping method and have resulted in novel full-length cDNA sequences. A large fraction has little similarity to any proteins of known function and to obtain clues about possible function we have developed original analysis procedures. Any putative function deduced here can be validated or refuted by complementary analysis results. The user can also extract information from specific categories like PROSITE patterns, PFAM domains, PSORT localization, transmembrane helices and clones with GENIUS structure assignments. The HUNT database can be accessed at http://www.hri.co.jp/HUNT.
Abstract:
One challenge presented by large-scale genome sequencing efforts is effective display of uniform information to the scientific community. The Comprehensive Microbial Resource (CMR) contains robust annotation of all complete microbial genomes and allows for a wide variety of data retrievals. The bacterial information has been placed on the Web at http://www.tigr.org/CMR for retrieval using standard web browsing technology. Retrievals can be based on protein properties such as molecular weight or hydrophobicity, GC-content, functional role assignments and taxonomy. The CMR also has special web-based tools to allow data mining using pre-run homology searches, whole genome dot-plots, batch downloading and traversal across genomes using a variety of datatypes.
Abstract:
The 2H,13C,15N-labeled, 148-residue integral membrane protein OmpX from Escherichia coli was reconstituted with dihexanoyl phosphatidylcholine (DHPC) in mixed micelles of molecular mass of about 60 kDa. Transverse relaxation-optimized spectroscopy (TROSY)-type triple resonance NMR experiments and TROSY-type nuclear Overhauser enhancement spectra were recorded in 2 mM aqueous solutions of these mixed micelles at pH 6.8 and 30°C. Complete sequence-specific NMR assignments for the polypeptide backbone thus have been obtained. The 13C chemical shifts and the nuclear Overhauser effect data then resulted in the identification of the regular secondary structure elements of OmpX/DHPC in solution and in the collection of an input of conformational constraints for the computation of the global fold of the protein. The same type of polypeptide backbone fold is observed in the presently determined solution structure and the previously reported crystal structure of OmpX determined in the presence of the detergent n-octyltetraoxyethylene. Further structure refinement will have to rely on the additional resonance assignment of partially or fully protonated amino acid side chains, but the present data already demonstrate that relaxation-optimized NMR techniques open novel avenues for studies of structure and function of integral membrane proteins.
Abstract:
Pseudogenes are non-functioning copies of genes in genomic DNA, which may result either from reverse transcription of an mRNA transcript (processed pseudogenes) or from gene duplication and subsequent disablement (non-processed pseudogenes). As pseudogenes are apparently ‘dead’, they usually have a variety of obvious disablements (e.g., insertions, deletions, frameshifts and truncations) relative to their functioning homologs. We have derived an initial estimate of the size, distribution and characteristics of the pseudogene population in the Caenorhabditis elegans genome, performing a survey in ‘molecular archaeology’. Corresponding to the 18 576 annotated proteins in the worm (i.e., in Wormpep18), we have found an estimated total of 2168 pseudogenes, about one for every eight genes. Few of these appear to be processed. Details of our pseudogene assignments are available from http://bioinfo.mbb.yale.edu/genome/worm/pseudogene. The population of pseudogenes differs significantly from that of genes in a number of respects: (i) pseudogenes are distributed unevenly across the genome relative to genes, with a disproportionate number on chromosome IV; (ii) the density of pseudogenes is higher on the arms of the chromosomes; (iii) the amino acid composition of pseudogenes is midway between that of genes and (translations of) random intergenic DNA, with enrichment of Phe, Ile, Leu and Lys, and depletion of Asp, Ala, Glu and Gly relative to the worm proteome; and (iv) the most common protein folds and families differ somewhat between genes and pseudogenes: whereas the most common fold found in the worm proteome is the immunoglobulin fold, the most common ‘pseudofold’ is the C-type lectin. In addition, the size of a gene family bears little overall relationship to the size of its corresponding pseudogene complement, indicating a highly dynamic genome. There are in fact a number of families associated with large populations of pseudogenes. For example, one family of seven-transmembrane receptors (represented by gene B0334.7) has one pseudogene for every four genes, and another uncharacterized family (represented by gene B0403.1) is approximately two-thirds pseudogenic. Furthermore, over a hundred apparent pseudogenic fragments do not have any obvious homologs in the worm.
Abstract:
Spectral changes in the photocycle of the photoactive yellow protein (PYP) are investigated by using ab initio multiconfigurational second-order perturbation theory at the available experimentally determined structures. Using the dark ground-state crystal structure [Genick, U. K., Soltis, S. M., Kuhn, P., Canestrelli, I. L. & Getzoff, E. D. (1998) Nature (London) 392, 206–209], the ππ* transition to the lowest excited state is related to the typical blue-light absorption observed at 446 nm. The different nature of the second excited state (nπ*) is consistent with the alternative route detected at 395-nm excitation. The results suggest the low-temperature photoproduct PYPHL as the most plausible candidate for the assignment of the cryogenically trapped early intermediate (Genick et al.). We cannot establish, however, a successful correspondence between the theoretical spectrum for the nanosecond time-resolved x-ray structure [Perman, B., Šrajer, V., Ren, Z., Teng, T., Pradervand, C., et al. (1998) Science 279, 1946–1950] and any of the spectroscopic photoproducts known to date. It is fully confirmed that the colorless light-activated intermediate recorded by millisecond time-resolved crystallography [Genick, U. K., Borgstahl, G. E. O., Ng, K., Ren, Z., Pradervand, C., et al. (1997) Science 275, 1471–1475] is protonated, nicely matching the spectroscopic features of the photoproduct PYPM. The overall contribution demonstrates that a combined analysis of high-level theoretical results and experimental data can be of great value for making assignments of detected intermediates in a photocycle.
Abstract:
We used Computer-Assisted Personalized Approach (CAPA), a networked teaching and learning tool that generates computer individualized homework problem sets, in our large-enrollment introductory plant physiology course. We saw significant improvement in student examination performance with regular homework assignments, with CAPA being an effective and efficient substitute for hand-graded homework. Using CAPA, each student received a printed set of similar but individualized problems of a conceptual (qualitative) and/or quantitative nature with quality graphics. Because each set of problems is unique, students were encouraged to work together to clarify concepts but were required to do their own work for credit. Students could enter answers multiple times without penalty, and they were able to obtain immediate feedback and hints until the due date. These features increased student time on task, allowing higher course standards and student achievement in a diverse student population. CAPA handles routine tasks such as grading, recording, summarizing, and posting grades. In anonymous surveys, students indicated an overwhelming preference for homework in CAPA format, citing several features such as immediate feedback, multiple tries, and on-line accessibility as reasons for their preference. We wrote and used more than 170 problems on 17 topics in introductory plant physiology, cataloging them in a computer library for general access. Representative problems are compared and discussed.
Abstract:
The Ensatina eschscholtzii complex of plethodontid salamanders, a well-known “ring species,” is thought to illustrate stages in the speciation process. Early research, based on morphology and coloration, has been extended by the incorporation of studies of protein variation and mitochondrial DNA sequences. The new data show that the complex includes a number of geographically and genetically distinct components that are at or near the species level. The complex is old and apparently has undergone instances of range contraction, isolation, differentiation, and then expansion and secondary contact. While the hypothesis that speciation is retarded by gene flow around the ring is not supported by molecular data, the general biogeographical hypothesis is supported. There is evidence of a north to south range expansion along two axes, with secondary contact and completion of the ring in southern California. Current research targets regions once thought to show primary intergradation, but which molecular markers reveal to be zones of secondary contact. Here emphasis is on the subspecies E. e. xanthoptica, which is involved in four distinct secondary contacts in central California. There is evidence of renewed genetic interactions upon recontact, with greater genetic differentiation within xanthoptica than between it and some of the interacting populations. The complex presents a full array of intermediate conditions between well-marked species and geographically variable populations. Geographically differentiated segments represent a diversity of depths of time of isolation and admixture, reflecting the complicated geomorphological history of California. Ensatina illustrates the continuing difficulty in making taxonomic assignments in complexes studied during species formation.
Abstract:
A whole genome cattle-hamster radiation hybrid cell panel was used to construct a map of 54 markers located on bovine chromosome 5 (BTA5). Of the 54 markers, 34 are microsatellites selected from the cattle linkage map and 20 are genes. Among the 20 mapped genes, 10 are new assignments that were made by using the comparative mapping by annotation and sequence similarity strategy. A LOD-3 radiation hybrid framework map consisting of 21 markers was constructed. The relatively low retention frequency of markers on this chromosome (19%) prevented unambiguous ordering of the other 33 markers. The length of the map is 398.7 cR, corresponding to a ratio of ≈2.8 cR5,000/cM. Type I genes were binned for comparison of gene order among cattle, humans, and mice. Multiple internal rearrangements within conserved syntenic groups were apparent upon comparison of gene order on BTA5 and HSA12 and HSA22. A similarly high number of rearrangements were observed between BTA5 and MMU6, MMU10, and MMU15. The detailed comparative map of BTA5 should facilitate identification of genes affecting economically important traits that have been mapped to this chromosome and should contribute to our understanding of mammalian chromosome evolution.
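The marker counts and map ratio in this abstract can be cross-checked with a short calculation. This is only a sketch based on the numbers quoted above. The implied linkage length it prints is an inference from the rounded ratio, not a value reported in the abstract.

```python
# Numbers quoted in the abstract above; variable names are illustrative.
total_markers = 54
microsatellites, genes = 34, 20
framework_markers, unordered_markers = 21, 33
map_length_cr = 398.7   # radiation hybrid map length in cR
cr_per_cm = 2.8         # approximate ratio given in the abstract

# The two breakdowns of the marker set should both add up to 54.
counts_consistent = (
    microsatellites + genes == total_markers
    and framework_markers + unordered_markers == total_markers
)

# Inferred, not reported: linkage length implied by the rounded ratio.
implied_cm = map_length_cr / cr_per_cm

print(f"marker counts consistent: {counts_consistent}")
print(f"implied linkage length: roughly {implied_cm:.0f} cM")
```

Because the ratio is given only approximately, the implied length should be read as a rough estimate.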
Abstract:
The question of whether proteins originate from random sequences of amino acids is addressed. A statistical analysis is performed in terms of blocked and random walk values formed by binary hydrophobic assignments of the amino acids along the protein chains. Theoretical expectations of these variables from random distributions of hydrophobicities are compared with those obtained from functional proteins. The results, which are based upon proteins in the SWISS-PROT database, convincingly show that the amino acid sequences in proteins differ from what is expected from random sequences in a statistically significant way. By performing Fourier transforms on the random walks, one obtains additional evidence for nonrandomness of the distributions. We have also analyzed results from a synthetic model containing only two amino acid types, hydrophobic and hydrophilic. With reasonable criteria on good folding properties in terms of thermodynamical and kinetic behavior, sequences that fold well are isolated. Performing the same statistical analysis on the sequences that fold well indicates similar deviations from randomness as for the functional proteins. The deviations from randomness can be interpreted as originating from anticorrelations in terms of an Ising spin model for the hydrophobicities. Our results, which differ from some previous investigations using other methods, might have an impact on how permissive the protein folding process is with respect to sequence specificity: only sequences with nonrandom hydrophobicity distributions fold well. Other distributions give rise to energy landscapes with poor folding properties and hence did not survive evolution.
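The idea of a random walk built from binary hydrophobic assignments can be illustrated with a minimal sketch. It assumes a simple convention of +1 for hydrophobic and -1 for hydrophilic residues, and a hypothetical hydrophobic residue set. The abstract gives neither, so the authors' actual definitions may differ (for example, they may subtract the mean before summing).

```python
from itertools import accumulate

# Assumption: an illustrative set of hydrophobic residues (one-letter codes).
# The abstract does not say which residues the authors treated as hydrophobic.
HYDROPHOBIC = frozenset("AVLIMFWC")

def binary_assignment(sequence):
    """Map each residue to +1 (hydrophobic) or -1 (hydrophilic)."""
    return [1 if residue in HYDROPHOBIC else -1 for residue in sequence.upper()]

def random_walk(sequence):
    """Running sum of the binary assignments along the chain."""
    return list(accumulate(binary_assignment(sequence)))

# Hypothetical seven-residue peptide, used only as an example input.
walk = random_walk("AVKDLLE")
print(walk)
```

A walk that drifts far from zero signals long runs of one residue type, whereas frequent alternation keeps it close to zero. Statistics of this kind are what the abstract compares against expectations for random sequences.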