963 resultados para Multiple Sequence Alignment


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In most strains of Saccharomyces cerevisiae the mitochondrial gene COX1, for subunit 1 of cytochrome oxidase, contains multiple exons and introns. Processing of COX1 primary transcript requires accessory proteins factors, some of which are encoded by nuclear genes and others by reading frames residing in some of the introns of the COX1 and COB genes. Here we show that the low molecular weight protein product of open reading frame YLR204W, for which we propose the name COX24, is also involved in processing of COX1 RNA intermediates. The growth defect of cox24 mutants is partially rescued in strains harboring mitochondrial DNA lacking introns. Northern blot analyses of mitochondrial transcripts indicate cox24 null mutants to be blocked in processing of introns aI2 and aI3. The dependence of intron aI3 excision on Cox24p is also supported by the growth properties of the cox24 mutant harboring mitochondrial DNA with different intron compositions. The intermediate phenotype of the cox24 mutant in the background of intronless mitochondrial DNA, however, suggests that in addition to its role in splicing of the COX1 pre-mRNA, Cox24p still has another function. Based on the analysis of a cox14-cox24 double mutant, we propose that the other function of Cox24p is related to translation of the COX1 mRNA. © 2006 by The American Society for Biochemistry and Molecular Biology, Inc.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Patógenos transmitidos por carrapatos atingem uma variedade de hospedeiros vertebrados. Para identificar os agentes patogênicos transmitidos por carrapatos entre cães soropositivos para Leishmania infantum no município Campo Grande-MS, foi realizado um estudo sorológico e molecular para a detecção de Ehrlichia canis, Anaplasma platys e Babesia vogeli em 60 amostras de soro e baço, respectivamente. Adicionalmente, foi realizado o diagnóstico confirmatório de L. infantum por meio de técnicas sorológicas e moleculares. Também foi realizado o alinhamento e análise filogenética das sequências para indicar a identidade das espécies de parasitas que infectam esses animais. Anticorpos IgG anti-Ehrlichia spp., anti-B. vogeli e anti-L. infantum foram detectados em 39 (65%), 49 (81,6%) e 60 (100%) dos cães amostrados, respectivamente. Vinte e sete (45%), cinquenta e quatro (90%), cinquenta e três (88,3%), dois (3,3%) e um (1,6%) cães mostraram-se positivos na PCR para E. canis, Leishmania spp., Leishmania donovani complex, Babesia sp. e Anaplasma sp., respectivamente. Após o seqüenciamento, os amplicons mostraram 99% de similaridade com isolados de E. canis, B. vogeli e A. platys e Leishmania chagasi. Os resultados deste estudo indicaram que os cães soropositivos para L. infantum de Campo Grande, MS, são expostos a vários agentes transmitidos por carrapatos, e, portanto, devem ser incluídos no diagnóstico diferencial em cães com suspeita clínica de leishmaniose.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. A commonly adopted procedure for annotating protein sequences relies on the "inheritance through homology" based on the notion that similar sequences share similar functions and structures. This procedure consists in the assignment of sequences to a specific group of functionally related sequences which had been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity are due to multi-domain proteins, to proteins that share common domains but that do not necessarily share the same function, to the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validate a system that contributes to sequence annotation by taking advantage of a validated transfer through inheritance procedure of the molecular functions and of the structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicity answers to the problem of multi-domain proteins annotation and allows a fine grain division of the whole set of proteomes used, that ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates on the length of protein sequences within clusters ensures that multi-domain proteins when present can be templates for sequences of similar length. This annotation procedure includes the possibility of reliably transferring statistically validated functions and structures to sequences considering information available in the present data bases of molecular functions and structures.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

ABSTRACTDie vorliegende Arbeit befasste sich mit der Reinigung,heterologen Expression, Charakterisierung, molekularenAnalyse, Mutation und Kristallisation des EnzymsVinorin-Synthase. Das Enzym spielt eine wichtige Rolle inder Ajmalin-Biosynthese, da es in einerAcetyl-CoA-abhängigen Reaktion die Umwandlung desSarpagan-Alkaloids 16-epi-Vellosimin zu Vinorin unterBildung des Ajmalan-Grundgerüstes katalysiert. Nach der Reinigung der Vinorin-Synthase ausHybrid-Zellkulturen von Rauvolfia serpentina/Rhazya strictamit den fünf chromatographischen TrennmethodenAnionenaustauschchromatographie an SOURCE 30Q, HydrophobeInteraktionen Chromatographie an SOURCE 15PHE,Chromatographie an MacroPrep Ceramic Hydroxyapatit,Anionenaustauschchromatographie an Mono Q undGrößenausschlußchromatographie an Superdex 75 konnte dieVinorin-Synthase aus 2 kg Zellkulturgewebe 991fachangereichert werden.Das nach der Reinigung angefertigte SDS-Gel ermöglichte eineklare Zuordnung der Protein-Bande als Vinorin-Synthase.Der Verdau der Enzymbande mit der Endoproteinase LysC unddie darauffolgende Sequenzierung der Spaltpeptide führte zuvier Peptidsequenzen. Der Datenbankvergleich (SwissProt)zeigte keinerlei Homologien zu Sequenzen bekannterPflanzenenzyme. Mit degenerierten Primern, abgeleitet voneinem der erhaltenen Peptidfragmente und einer konserviertenRegion bekannter Acetyltransferasen gelang es, ein erstescDNA-Fragment der Vinorin-Synthase zu amplifizieren. Mit derMethode der RACE-PCR wurde die Nukleoidsequenzvervollständigt, was zu einem cDNA-Vollängenklon mit einerGröße von 1263 bp führte, der für ein Protein mit 421Aminosäuren (46 kDa) codiert.Das Vinorin-Synthase-Gen wurde in den pQE2-Expressionsvektorligiert, der für einen N-terminalen 6-fachen His-tagcodiert. Anschließend wurde sie erstmals erfolgreich in E.coli im mg-Maßstab exprimiert und bis zur Homogenitätgereinigt. Durch die erfolgreiche Überexpression konnte dieVinorin-Synthase eingehend charakterisiert werden. DerKM-Wert für das Substrat Gardneral wurde mit 20 µM, bzw.41.2 µM bestimmt und Vmax betrug 1 pkat, bzw. 1.71 pkat.Nach erfolgreicher Abspaltung des His-tags wurden diekinetischen Parameter erneut bestimmt (KM- Wert 7.5 µM, bzw.27.52 µM, Vmax 0.7 pkat, bzw. 1.21 pkat). Das Co-Substratzeigt einen KM- Wert von 60.5 µM (Vmax 0.6 pkat). DieVinorin-Synthase besitzt ein Temperatur-Optimum von 35 °Cund ein pH-Optimum bei 7.8.Homologievergleiche mit anderen Enzymen zeigten, dass dieVinorin-Synthase zu einer noch kleinen Familie von bisher 10Acetyltransferasen gehört. Alle Enzyme der Familie haben einHxxxD und ein DFGWG-Motiv zu 100 % konserviert. Basierendauf diesen Homologievergleichen und Inhibitorstudien wurden11 in dieser Proteinfamilie konservierte Aminosäuren gegenAlanin ausgetauscht, um so die Aminosäuren einer in derLiteratur postulierten katalytischen Triade(Ser/Cys-His-Asp) zu identifizieren.Die Mutation aller vorhandenen konservierten Serine undCysteine resultierte in keiner Mutante, die zumvollständigen Aktivitätsverlust des Enzyms führte. Nur dieMutationen H160A und D164A resultierten in einemvollständigen Aktivitätsverlust des Enzyms. Dieses Ergebniswiderlegt die Theorie einer katalytischen Triade und zeigte,dass die Aminosäuren H160A und D164A exklusiv an derkatalytischen Reaktion beteiligt sind.Zur Überprüfung dieser Ergebnisse und zur vollständigenAufklärung des Reaktionsmechanismus wurde dieVinorin-Synthase kristallisiert. Die bis jetzt erhaltenenKristalle (Kristallgröße in µm x: 150, y: 200, z: 200)gehören der Raumgruppe P212121 (orthorhombisch primitiv) anund beugen bis 3.3 Å. Da es bis jetzt keine Kristallstruktureines zur Vinorin-Synthase homologen Proteins gibt, konntedie Struktur noch nicht vollständig aufgeklärt werden. ZurLösung des Phasenproblems wird mit der Methode der multiplenanomalen Dispersion (MAD) jetzt versucht, die ersteKristallstruktur in dieser Enzymfamilie aufzuklären.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

OBJECTIVE: A severely virilized 46, XX newborn girl was referred to our center for evaluation and treatment of congenital adrenal hyperplasia (CAH) because of highly elevated 17alpha-hydroxyprogesterone levels at newborn screening; biochemical tests confirmed the diagnosis of salt-wasting CAH. Genetic analysis revealed that the girl was compound heterozygote for a previously reported Q318X mutation in exon 8 and a novel insertion of an adenine between nucleotides 962 and 963 in exon 4 of the CYP21A2 gene. This 962_963insA mutation created a frameshift leading to a stop codon at amino acid 161 of the P450c21 protein. AIM AND METHODS: To better understand structure-function relationships of mutant P450c21 proteins, we performed multiple sequence alignments of P450c21 with three mammalian P450s (P450 2C8, 2C9 and 2B4) with known structures as well as with human P450c17. Comparative molecular modeling of human P450c21 was then performed by MODELLER using the X-ray crystal structure of rabbit P450 2B4 as a template. RESULTS: The new three dimensional model of human P450c21 and the sequence alignment were found to be helpful in predicting the role of various amino acids in P450c21, especially those involved in heme binding and interaction with P450 oxidoreductase, the obligate electron donor. CONCLUSION: Our model will help in analyzing the genotype-phenotype relationship of P450c21 mutations which have not been tested for their functional activity in an in vitro assay.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Cells must rapidly sense and respond to a wide variety of potentially cytotoxic external stressors to survive in a constantly changing environment. In a search for novel genes required for stress tolerance in Saccharomyces cerevisiae, we identified the uncharacterized open reading frame YER139C as a gene required for growth at 37 degrees C in the presence of the heat shock mimetic formamide. YER139C encodes the closest yeast homolog of the human RPAP2 protein, recently identified as a novel RNA polymerase II (RNAPII)-associated factor. Multiple lines of evidence support a role for this gene family in transcription, prompting us to rename YER139C RTR1 (regulator of transcription). The core RNAPII subunits RPB5, RPB7, and RPB9 were isolated as potent high-copy-number suppressors of the rtr1Delta temperature-sensitive growth phenotype, and deletion of the nonessential subunits RPB4 and RPB9 hypersensitized cells to RTR1 overexpression. Disruption of RTR1 resulted in mycophenolic acid sensitivity and synthetic genetic interactions with a number of genes involved in multiple phases of transcription. Consistently, rtr1Delta cells are defective in inducible transcription from the GAL1 promoter. Rtr1 constitutively shuttles between the cytoplasm and nucleus, where it physically associates with an active RNAPII transcriptional complex. Taken together, our data reveal a role for members of the RTR1/RPAP2 family as regulators of core RNAPII function.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Historically morphological features were used as the primary means to classify organisms. However, the age of molecular genetics has allowed us to approach this field from the perspective of the organism's genetic code. Early work used highly conserved sequences, such as ribosomal RNA. The increasing number of complete genomes in the public data repositories provides the opportunity to look not only at a single gene, but at organisms' entire parts list. ^ Here the Sequence Comparison Index (SCI) and the Organism Comparison Index (OCI), algorithms and methods to compare proteins and proteomes, are presented. The complete proteomes of 104 sequenced organisms were compared. Over 280 million full Smith-Waterman alignments were performed on sequence pairs which had a reasonable expectation of being related. From these alignments a whole proteome phylogenetic tree was constructed. This method was also used to compare the small subunit (SSU) rRNA from each organism and a tree constructed from these results. The SSU rRNA tree by the SCI/OCI method looks very much like accepted SSU rRNA trees from sources such as the Ribosomal Database Project, thus validating the method. The SCI/OCI proteome tree showed a number of small but significant differences when compared to the SSU rRNA tree and proteome trees constructed by other methods. Horizontal gene transfer does not appear to affect the SCI/OCI trees until the transferred genes make up a large portion of the proteome. ^ As part of this work, the Database of Related Local Alignments (DaRLA) was created and contains over 81 million rows of sequence alignment information. DaRLA, while primarily used to build the whole proteome trees, can also be applied shared gene content analysis, gene order analysis, and creating individual protein trees. ^ Finally, the standard BLAST method for analyzing shared gene content was compared to the SCI method using 4 spirochetes. The SCI system performed flawlessly, finding all proteins from one organism against itself and finding all the ribosomal proteins between organisms. The BLAST system missed some proteins from its respective organism and failed to detect small ribosomal proteins between organisms. ^

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Background: Protein tertiary structure can be partly characterized via each amino acid's contact number measuring how residues are spatially arranged. The contact number of a residue in a folded protein is a measure of its exposure to the local environment, and is defined as the number of C-beta atoms in other residues within a sphere around the C-beta atom of the residue of interest. Contact number is partly conserved between protein folds and thus is useful for protein fold and structure prediction. In turn, each residue's contact number can be partially predicted from primary amino acid sequence, assisting tertiary fold analysis from sequence data. In this study, we provide a more accurate contact number prediction method from protein primary sequence. Results: We predict contact number from protein sequence using a novel support vector regression algorithm. Using protein local sequences with multiple sequence alignments (PSI-BLAST profiles), we demonstrate a correlation coefficient between predicted and observed contact numbers of 0.70, which outperforms previously achieved accuracies. Including additional information about sequence weight and amino acid composition further improves prediction accuracies significantly with the correlation coefficient reaching 0.73. If residues are classified as being either contacted or non-contacted, the prediction accuracies are all greater than 77%, regardless of the choice of classification thresholds. Conclusion: The successful application of support vector regression to the prediction of protein contact number reported here, together with previous applications of this approach to the prediction of protein accessible surface area and B-factor profile, suggests that a support vector regression approach may be very useful for determining the structure-function relation between primary sequence and higher order consecutive protein structural and functional properties.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

A 16S rRNA gene database (http://greengenes.bl.gov) addresses limitations of public repositories by providing chimera screening, standard alignment, and taxonomic classification using multiple published taxonomies. It was found that there is incongruent taxonomic nomenclature among curators even at the phylum level. Putative chimeras were identified in 3% of environmental sequences and in 0.2% of records derived from isolates. Environmental sequences were classified into 100 phylum-level lineages in the Archaea and Bacteria.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

At present, little is known about signal transduction mechanisms in schistosomes, which cause the disease of schistosomiasis. The mitogen-activated protein kinase (MAPK) signaling pathways, which are evolutionarily conserved from yeast to Homo sapiens, play key roles in multiple cellular processes. Here, we reconstructed the hypothetical MAPK signaling pathways in Schistosoma japonicum and compared the schistosome pathways with those of model eukaryote species. We identified 60 homologous components in the S. japoncium MAPK signaling pathways. Among these, 27 were predicted to be full-length sequences. Phylogenetic analysis of these proteins confirmed the evolutionary conservation of the MAPK signaling pathways. Remarkably, we identified S. japonicum homologues of GTP-binding protein beta and alpha-I subunits in the yeast mating pathway, which might be involved in the regulation of different life stages and female sexual maturation processes as well in schistosomes. In addition, several pathway member genes, including ERK, JNK, Sja-DSP, MRAS and RAS, were determined through quantitative PCR analysis to be expressed in a stage-specific manner, with ERK, JNK and their inhibitor Sja-DSP markedly upregulated in adult female schistosomes. (c) 2006 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Subunit vaccine discovery is an accepted clinical priority. The empirical approach is time- and labor-consuming and can often end in failure. Rational information-driven approaches can overcome these limitations in a fast and efficient manner. However, informatics solutions require reliable algorithms for antigen identification. All known algorithms use sequence similarity to identify antigens. However, antigenicity may be encoded subtly in a sequence and may not be directly identifiable by sequence alignment. We propose a new alignment-independent method for antigen recognition based on the principal chemical properties of protein amino acid sequences. The method is tested by cross-validation on a training set of bacterial antigens and external validation on a test set of known antigens. The prediction accuracy is 83% for the cross-validation and 80% for the external test set. Our approach is accurate and robust, and provides a potent tool for the in silico discovery of medically relevant subunit vaccines.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The cyanobacterial population in the Cajati waste stabilization pond system (WSP) from Sao Paulo State, Brazil was assessed by cell isolation and direct microscope counting techniques. Ten strains, belonging to five genera (Synechococcus, Merismopedia, Leptolyngbya, Limnothrix, and Nostoc), were isolated and identified by morphological and molecular analyses. Morphological identification of the isolated strains was congruent with their phylogenetic analyses based on 16S rDNA gene sequences. Six cyanobacterial genera (Synechocystis, Aphanocapsa, Merismopedia, Lyngbya, Phormidium, and Pseudanabaena) were identified by direct microscope inspection. Both techniques were complementary, since, of the six genera identified by direct microscopic inspection, only Merismopedia was isolated, and the four other isolated genera were not detected by direct inspection. Direct microscope counting of preserved cells showed that cyanobacteria were the dominant members (> 90%) of the phytoplankton community during both periods evaluated (summer and autumn). ELISA tests specific for hepatotoxicmicrocystins gave positive results for six strains (Synechococcus CENA108, Merismopedia CENA106, Leptolyngbya CENA103, Leptolyngbya CENA112, Limnothrix CENA109, and Limnothrix CENA110), and for wastewater samples collected from raw influent (3.70 mu g microcystins/l) and treated effluent (3.74 mu g microcystins/l) in summer. Our findings indicate that toxic cyanobacteria in WSP systems are of concern, since the treated effluent containing cyanotoxins will be discharged into rivers, irrigation channels, estuaries, or reservoirs, and can affect human and animal health.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The production of hydrogen from soft-drink wastewater in two upflow anaerobic packed-bed reactors was evaluated. The results show that soft-drink wastewater is a good source for hydrogen generation. Data from both reactors indicate that the reactor without medium containing macro- and micronutrients (R2) provided a higher hydrogen yield (3.5 mol H(2) mol(-1) of sucrose) as compared to the reactor (R1) with a nutrient-containing medium (3.3 mol H(2) mol(-1) of sucrose). Reactor R2 continuously produced hydrogen, whereas reactor R1 exhibited a short period of production and produced lower amounts of hydrogen. Better hydrogen production rates and percentages of biogas were also observed for reactor R2, which produced 0.4 L h(-1) L(-1) and 15.8% of H(2), compared to reactor R1, which produced 0.2 L h(-1) L(-1) and 2.6% of H(2). The difference in performance between the reactors was likely due to changes in the metabolic pathway for hydrogen production and decreases in bed porosity as a result of excessive biomass growth in reactor R1. Molecular biological analyses of samples from reactors R1 and R2 indicated the presence of several microorganisms, including Clostridium (91% similarity), Enterobacter (93% similarity) and Klebsiella (97% similarity). Copyright (C) 2011, Hydrogen Energy Publications, LLC. Published by Elsevier Ltd. All rights reserved.