943 resultados para RNA secondary structure


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Os flavivírus são conhecidos por seu complexo ciclo biológico e importância na saúde pública e na economia mundial. Os aspectos ecológicos e quadros clínicos estão estreitamente relacionados à filogenia e evolução dos flavivírus. Este trabalho objetiva a caracterização molecular dos genomas dos flavivírus Bussuquara (VBSQ), Iguape (VIGU), Ilhéus (VILH) e Rocio (VROC), determinando relações filogenéticas com os demais integrantes do gênero Flavivirus. Foi realizado o seqüenciamento completo da região codificadora (ORF) e regiões não codificantes (RNC) 5’ e 3’; análise da estrutura secundária do RNA viral e das sequências conservadas da 3’RNC; determinação dos sítios de clivagem, glicosilação, resíduos Cis e motivos conservados na poliproteína; e as análises de similaridade e filogenética. Os genomas dos VBSQ, VIGU, VILH e VROC apresentaram a mesma organização que os demais flavivírus, medindo 10.815 nt, 10.922 nt, 10.775 nt, 10.794 nt, respectivamente. O padrão das sequências conservadas da 3’RNC do VBSQ foi RCS2-CS2-CS1, enquanto que para os VIGU, VILH e VROC foram CS3-RCS2-CS2-CS1. As características das estruturas secundárias do RNAs dos flavivírus em estudo foram similares aos demais flavivírus. O número dos sítios de glicosilação das proteínas PrM, E e NS1 foi distinto entre os flavivírus brasileiros, porém o padrão 6,12,12 dos resíduos de Cis e do sítios de clivagem permaneceram conservados. Na proteína E, alterações aminoacídicas pontuais foram observadas no peptídeo de fusão dos VBSQ, VIGU e VROC, e a sequência do tripepídeo RGD foi distinta para os quatro vírus em estudo. Os motivos determinantes das atividades de MTase-SAM da NS5, bem como da helicase e protease da NS3, permanecem conservados. Dentre os oito motivos da polimerase viral (NS5), somente os motivos V, VI e VII possuem alguma substituição nucleotídica para o VILH e VROC. As análises de similaridade mostram que VBSQ apresenta maior relação com VIGU enquanto que o VILH e VROC são mais relacionados entre si, porém sendo consideradas espécies virais distintas. Com base nas análises filogenéticas, características moleculares do genoma e biológicas, propõem-se a formação de três grupos genéticos: o grupo Rocio, que agrupa VROC e VILH; o grupo Bussuquara formado pelos VBSQ e Vírus naranjal e o grupo Aroa que inclui o Vírus Aroa e VIGU.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The Dipteran a native Brazilian insect that has become a valuable model system for developmental biology research because it provides an interesting opportunity to study a different type of insect oogenesis. Sequences from a cDNA library that was constructed with poly A + RNA from the ovaries of larvae at different ages were analyzed. Molecular characterization confirmed interesting findings, such as the presence of . The gene encodes a conserved RNA-binding protein that is required during early development for the maintenance and division of the primordial germ cells of Diptera. plays an important role in specifying the posterior regions of insect embryos and is important for abdomen formation. In the present work, we showed the spatial and temporal expression profiles of this important gene, which is involved in oogenesis and early development. Data mining techniques were used to obtain the complete sequence of . Bioinformatic tools were used to determine the following: (1) the secondary structure of the 3'-untranslated region of the mRNA, (2) the encoded protein of the isolated gene, (3) the conserved zinc-finger domains of the Nanos protein, and (4) phylogenetic analyses. Furthermore, RNA in situ hybridization and immunolocalization were used to determine mRNA and protein expression in the tissues that were studied and to define as a germ cell molecular marker.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Abstract Background HCV is prevalent throughout the world. It is a major cause of chronic liver disease. There is no effective vaccine and the most common therapy, based on Peginterferon, has a success rate of ~50%. The mechanisms underlying viral resistance have not been elucidated but it has been suggested that both host and virus contribute to therapy outcome. Non-structural 5A (NS5A) protein, a critical virus component, is involved in cellular and viral processes. Methods The present study analyzed structural and functional features of 345 sequences of HCV-NS5A genotypes 1 or 3, using in silico tools. Results There was residue type composition and secondary structure differences between the genotypes. In addition, second structural variance were statistical different for each response group in genotype 3. A motif search indicated conserved glycosylation, phosphorylation and myristoylation sites that could be important in structural stabilization and function. Furthermore, a highly conserved integrin ligation site was identified, and could be linked to nuclear forms of NS5A. ProtFun indicated NS5A to have diverse enzymatic and nonenzymatic activities, participating in a great range of cell functions, with statistical difference between genotypes. Conclusion This study presents new insights into the HCV-NS5A. It is the first study that using bioinformatics tools, suggests differences between genotypes and response to therapy that can be related to NS5A protein features. Therefore, it emphasizes the importance of using bioinformatics tools in viral studies. Data acquired herein will aid in clarifying the structure/function of this protein and in the development of antiviral agents.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The study of protein fold is a central problem in life science, leading in the last years to several attempts for improving our knowledge of the protein structures. In this thesis this challenging problem is tackled by means of molecular dynamics, chirality and NMR studies. In the last decades, many algorithms were designed for the protein secondary structure assignment, which reveals the local protein shape adopted by segments of amino acids. In this regard, the use of local chirality for the protein secondary structure assignment was demonstreted, trying to correlate as well the propensity of a given amino acid for a particular secondary structure. The protein fold can be studied also by Nuclear Magnetic Resonance (NMR) investigations, finding the average structure adopted from a protein. In this context, the effect of Residual Dipolar Couplings (RDCs) in the structure refinement was shown, revealing a strong improvement of structure resolution. A wide extent of this thesis is devoted to the study of avian prion protein. Prion protein is the main responsible of a vast class of neurodegenerative diseases, known as Bovine Spongiform Encephalopathy (BSE), present in mammals, but not in avian species and it is caused from the conversion of cellular prion protein to the pathogenic misfolded isoform, accumulating in the brain in form of amiloyd plaques. In particular, the N-terminal region, namely the initial part of the protein, is quite different between mammal and avian species but both of them contain multimeric sequences called Repeats, octameric in mammals and hexameric in avians. However, such repeat regions show differences in the contained amino acids, in particular only avian hexarepeats contain tyrosine residues. The chirality analysis of avian prion protein configurations obtained from molecular dynamics reveals a high stiffness of the avian protein, which tends to preserve its regular secondary structure. This is due to the presence of prolines, histidines and especially tyrosines, which form a hydrogen bond network in the hexarepeat region, only possible in the avian protein, and thus probably hampering the aggregation.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. A commonly adopted procedure for annotating protein sequences relies on the "inheritance through homology" based on the notion that similar sequences share similar functions and structures. This procedure consists in the assignment of sequences to a specific group of functionally related sequences which had been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity are due to multi-domain proteins, to proteins that share common domains but that do not necessarily share the same function, to the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validate a system that contributes to sequence annotation by taking advantage of a validated transfer through inheritance procedure of the molecular functions and of the structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicity answers to the problem of multi-domain proteins annotation and allows a fine grain division of the whole set of proteomes used, that ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates on the length of protein sequences within clusters ensures that multi-domain proteins when present can be templates for sequences of similar length. This annotation procedure includes the possibility of reliably transferring statistically validated functions and structures to sequences considering information available in the present data bases of molecular functions and structures.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Grass carp reovirus (GCRV) is a member of the Aquareovirus genus of the family Reoviridae, a large family of double-stranded RNA (dsRNA) viruses infecting plants, insects, fishes and mammals. We report the first subnanometer-resolution three-dimensional structures of both GCRV core and virion by cryoelectron microscopy. These structures have allowed the delineation of interactions among the over 1000 molecules in this enormous macromolecular machine and a detailed comparison with other dsRNA viruses at the secondary-structure level. The GCRV core structure shows that the inner proteins have strong structural similarities with those of orthoreoviruses even at the level of secondary-structure elements, indicating that the structures involved in viral dsRNA interaction and transcription are highly conserved. In contrast, the level of similarity in structures decreases in the proteins situated in the outer layers of the virion. The proteins involved in host recognition and attachment exhibit the least similarities to other members of Reoviridae. Furthermore, in GCRV, the RNA-translocating turrets are in an open state and lack a counterpart for the sigma1 protein situated on top of the close turrets observed in mammalian orthoreovirus. Interestingly, the distribution and the organization of GCRV core proteins resemble those of the cytoplasmic polyhedrosis virus, a cypovirus and the structurally simplest member of the Reoviridae family. Our results suggest that GCRV occupies a unique structure niche between the simpler cypoviruses and the considerably more complex mammalian orthoreovirus, thus providing an important model for understanding the structural and functional conservation and diversity of this enormous family of dsRNA viruses.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We have used three beta-thalassemic mutations, IVS2-654, -705 and -745, that create aberrant 5' splice sites (5' ss) and activate a common cryptic 3' ss further upstream in intron 2 of the human beta-globin gene to optimize a generally applicable exon-skipping strategy using antisense derivatives of U7 small nuclear RNA (snRNA). Introducing a modified U7 snRNA gene carrying an antisense sequence against the cryptic 3' ss into cultured cells expressing the mutant beta-globin genes, restored correct beta-globin mRNA splicing for all three mutations, but the efficiency was much weaker for IVS2-654 than for the other mutations. The length of antisense sequence influenced the efficiency with an optimum of approximately 24 nucleotides. Combining two antisense sequences directed against different target sites in intron 2, either on separate antisense RNAs or, even better, on a single U7 snRNA, significantly enhanced the efficiency of splicing correction. One double-target U7 RNA was expressed on stable transformation resulting in permanent and efficient suppression of the IVS2-654 mutation and production of beta-globin. These results suggest that forcing the aberrant exon into a looped secondary structure may strongly promote its exclusion from the mRNA and that this approach may be used generally to induce exon skipping.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Essential biological processes are governed by organized, dynamic interactions between multiple biomolecular systems. Complexes are thus formed to enable the biological function and get dissembled as the process is completed. Examples of such processes include the translation of the messenger RNA into protein by the ribosome, the folding of proteins by chaperonins or the entry of viruses in host cells. Understanding these fundamental processes by characterizing the molecular mechanisms that enable then, would allow the (better) design of therapies and drugs. Such molecular mechanisms may be revealed trough the structural elucidation of the biomolecular assemblies at the core of these processes. Various experimental techniques may be applied to investigate the molecular architecture of biomolecular assemblies. High-resolution techniques, such as X-ray crystallography, may solve the atomic structure of the system, but are typically constrained to biomolecules of reduced flexibility and dimensions. In particular, X-ray crystallography requires the sample to form a three dimensional (3D) crystal lattice which is technically di‑cult, if not impossible, to obtain, especially for large, dynamic systems. Often these techniques solve the structure of the different constituent components within the assembly, but encounter difficulties when investigating the entire system. On the other hand, imaging techniques, such as cryo-electron microscopy (cryo-EM), are able to depict large systems in near-native environment, without requiring the formation of crystals. The structures solved by cryo-EM cover a wide range of resolutions, from very low level of detail where only the overall shape of the system is visible, to high-resolution that approach, but not yet reach, atomic level of detail. In this dissertation, several modeling methods are introduced to either integrate cryo-EM datasets with structural data from X-ray crystallography, or to directly interpret the cryo-EM reconstruction. Such computational techniques were developed with the goal of creating an atomic model for the cryo-EM data. The low-resolution reconstructions lack the level of detail to permit a direct atomic interpretation, i.e. one cannot reliably locate the atoms or amino-acid residues within the structure obtained by cryo-EM. Thereby one needs to consider additional information, for example, structural data from other sources such as X-ray crystallography, in order to enable such a high-resolution interpretation. Modeling techniques are thus developed to integrate the structural data from the different biophysical sources, examples including the work described in the manuscript I and II of this dissertation. At intermediate and high-resolution, cryo-EM reconstructions depict consistent 3D folds such as tubular features which in general correspond to alpha-helices. Such features can be annotated and later on used to build the atomic model of the system, see manuscript III as alternative. Three manuscripts are presented as part of the PhD dissertation, each introducing a computational technique that facilitates the interpretation of cryo-EM reconstructions. The first manuscript is an application paper that describes a heuristics to generate the atomic model for the protein envelope of the Rift Valley fever virus. The second manuscript introduces the evolutionary tabu search strategies to enable the integration of multiple component atomic structures with the cryo-EM map of their assembly. Finally, the third manuscript develops further the latter technique and apply it to annotate consistent 3D patterns in intermediate-resolution cryo-EM reconstructions. The first manuscript, titled An assembly model for Rift Valley fever virus, was submitted for publication in the Journal of Molecular Biology. The cryo-EM structure of the Rift Valley fever virus was previously solved at 27Å-resolution by Dr. Freiberg and collaborators. Such reconstruction shows the overall shape of the virus envelope, yet the reduced level of detail prevents the direct atomic interpretation. High-resolution structures are not yet available for the entire virus nor for the two different component glycoproteins that form its envelope. However, homology models may be generated for these glycoproteins based on similar structures that are available at atomic resolutions. The manuscript presents the steps required to identify an atomic model of the entire virus envelope, based on the low-resolution cryo-EM map of the envelope and the homology models of the two glycoproteins. Starting with the results of the exhaustive search to place the two glycoproteins, the model is built iterative by running multiple multi-body refinements to hierarchically generate models for the different regions of the envelope. The generated atomic model is supported by prior knowledge regarding virus biology and contains valuable information about the molecular architecture of the system. It provides the basis for further investigations seeking to reveal different processes in which the virus is involved such as assembly or fusion. The second manuscript was recently published in the of Journal of Structural Biology (doi:10.1016/j.jsb.2009.12.028) under the title Evolutionary tabu search strategies for the simultaneous registration of multiple atomic structures in cryo-EM reconstructions. This manuscript introduces the evolutionary tabu search strategies applied to enable a multi-body registration. This technique is a hybrid approach that combines a genetic algorithm with a tabu search strategy to promote the proper exploration of the high-dimensional search space. Similar to the Rift Valley fever virus, it is common that the structure of a large multi-component assembly is available at low-resolution from cryo-EM, while high-resolution structures are solved for the different components but lack for the entire system. Evolutionary tabu search strategies enable the building of an atomic model for the entire system by considering simultaneously the different components. Such registration indirectly introduces spatial constrains as all components need to be placed within the assembly, enabling the proper docked in the low-resolution map of the entire assembly. Along with the method description, the manuscript covers the validation, presenting the benefit of the technique in both synthetic and experimental test cases. Such approach successfully docked multiple components up to resolutions of 40Å. The third manuscript is entitled Evolutionary Bidirectional Expansion for the Annotation of Alpha Helices in Electron Cryo-Microscopy Reconstructions and was submitted for publication in the Journal of Structural Biology. The modeling approach described in this manuscript applies the evolutionary tabu search strategies in combination with the bidirectional expansion to annotate secondary structure elements in intermediate resolution cryo-EM reconstructions. In particular, secondary structure elements such as alpha helices show consistent patterns in cryo-EM data, and are visible as rod-like patterns of high density. The evolutionary tabu search strategy is applied to identify the placement of the different alpha helices, while the bidirectional expansion characterizes their length and curvature. The manuscript presents the validation of the approach at resolutions ranging between 6 and 14Å, a level of detail where alpha helices are visible. Up to resolution of 12 Å, the method measures sensitivities between 70-100% as estimated in experimental test cases, i.e. 70-100% of the alpha-helices were correctly predicted in an automatic manner in the experimental data. The three manuscripts presented in this PhD dissertation cover different computation methods for the integration and interpretation of cryo-EM reconstructions. The methods were developed in the molecular modeling software Sculptor (http://sculptor.biomachina.org) and are available for the scientific community interested in the multi-resolution modeling of cryo-EM data. The work spans a wide range of resolution covering multi-body refinement and registration at low-resolution along with annotation of consistent patterns at high-resolution. Such methods are essential for the modeling of cryo-EM data, and may be applied in other fields where similar spatial problems are encountered, such as medical imaging.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The recA gene is essential for SOS response induction, for inducible DNA repair and for homologous recombination in E. coli. The level of recA expression is significant for these functions. A basal level of about 1000 molecules of RecA protein is sufficient for homologous recombination of the cell and is essential for the induction of the SOS response. Based on previous observations, two models regarding the origin of the basal RecA protein were postulated. One was that it comes from the leaky expression of the LexA repressed promoter. The other was that it is from another weak but constitutive promoter. The first part of this thesis is to study these possibilities. An $\Omega$ cartridge containing the transcription terminator of gene 32 of T4 phage was exploited to define a second promoter for recA expression. Insertion of this $\Omega$ cartridge downstream of the known promoter gave rise to only minor expression. Purification and N-terminus sequencing of the RecA protein from the insertion mutant did not support the existence of a second promoter. To determine whether the basal RecA is due to the leaky expression of the known LexA repressed promoter, recA expression of a SOS induction minus strain (basal level expression of recA) was compared with that of a recA promoter down mutation recA1270. The result demonstrated that there is leaky expression from the LexA repressed promoter. All the evidence supports the conclusion that there is only one promoter for both basal and induced expression levels of recA.^ Several translation enhancer sequences which are complementary to different regions of the 16S rRNA were found to exist in recA mRNA. The leader sequence of recA mRNA is highly complementary to a region of the 16S rRNA. Thus it appeared that recA expression could be regulated at post-transcriptional levels. The second part of this thesis is focused on the study of the post-transcriptional control of recA expression. Deletions of the complementary regions were created to examine their effect on recA expression. The results indicated that all of the complementary regions were important for the normal expression of recA and their effects were post-transcriptional. RNA secondary structures of wild type recA mRNA was inspected and a stem-loop structure was revealed. The expression down mutations at codon 10 and 11 were found to stabilize this structure. The conclusions of the second part of this thesis are that there is post-transcriptional control for recA expression and the leader sequence of recA mRNA plays more than one role in the control of recA expression. ^

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The proprotein convertases are a family of at least seven calcium-dependent endoproteases that process a wide variety of precursor proteins in the secretory pathway. All members of this family possess an N-terminal proregion, a subtilisin-like catalytic module, and an additional downstream well-conserved region of ≈150 amino acid residues, the P domain, which is not found in any other subtilase. The pro and catalytic domains cannot be expressed in the absence of the P domains; their thermodynamic instability may be attributable to the presence of large numbers of negatively charged Glu and Asp side chains in the substrate binding region for recognition of multibasic residue cleavage sites. Based on secondary structure predictions, we here propose that the P domains consist of 8-stranded β-barrels with well-organized inner hydrophobic cores, and therefore are independently folded components of the proprotein convertases. We hypothesize further that the P domains are integrated through strong hydrophobic interactions with the catalytic domains, conferring structural stability and regulating the properties and activity of the convertases. A molecular model of these interdomain interactions is proposed in this report.