907 resultados para secondary structure detection


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Motivation: Conformational flexibility is essential to the function of many proteins, e.g. catalytic activity. To assist efforts in determining and exploring the functional properties of a protein, it is desirable to automatically identify regions that are prone to undergo conformational changes. It was recently shown that a probabilistic predictor of continuum secondary structure is more accurate than categorical predictors for structurally ambivalent sequence regions, suggesting that such models are suited to characterize protein flexibility. Results: We develop a computational method for identifying regions that are prone to conformational change directly from the amino acid sequence. The method uses the entropy of the probabilistic output of an 8-class continuum secondary structure predictor. Results for 171 unique amino acid sequences with well-characterized variable structure (identified in the 'Macromolecular movements database') indicate that the method is highly sensitive at identifying flexible protein regions, but false positives remain a problem. The method can be used to explore conformational flexibility of proteins (including hypothetical or synthetic ones) whose structure is yet to be determined experimentally.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models. Results: Several probabilistic models not only successfully estimate the continuum secondary structure, but also provide a categorical output on par with models directly trained on categorical data. Importantly, models trained on the continuum secondary structure are also better than their categorical counterparts at identifying the conformational state for structurally ambivalent residues. Conclusion: Cascaded probabilistic neural networks trained on the continuum secondary structure exhibit better accuracy in structurally ambivalent regions of proteins, while sustaining an overall classification accuracy on par with standard, categorical prediction methods.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Our previous studies using trans-complementation analysis of Kunjin virus (KUN) full-length cDNA clones harboring in-frame deletions in the NS3 gene demonstrated the inability of these defective complemented RNAs to be packaged into virus particles (W. J. Liu, P. L. Sedlak, N. Kondratieva, and A. A. Khromykh, J. Virol. 76:10766-10775). In this study we aimed to establish whether this requirement for NS3 in RNA packaging is determined by the secondary RNA structure of the NS3 gene or by the essential role of the translated NS3 gene product. Multiple silent mutations of three computer-predicted stable RNA structures in the NS3 coding region of KUN replicon RNA aimed at disrupting RNA secondary structure without affecting amino acid sequence did not affect RNA replication and packaging into virus-like particles in the packaging cell line, thus demonstrating that the predicted conserved RNA structures in the NS3 gene do not play a role in RNA replication and/or packaging. In contrast, double frameshift mutations in the NS3 coding region of full-length KUN RNA, producing scrambled NS3 protein but retaining secondary RNA structure, resulted in the loss of ability of these defective RNAs to be packaged into virus particles in complementation experiments in KUN replicon-expressing cells. Furthermore, the more robust complementation-packaging system based on established stable cell lines producing large amounts of complemented replicating NS3-deficient replicon RNAs and infection with KUN virus to provide structural proteins also failed to detect any secreted virus-like particles containing packaged NS3-deficient replicon RNAs. These results have now firmly established the requirement of KUN NS3 protein translated in cis for genome packaging into virus particles.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The high demanding computational requirements necessary to carry out protein motion simulations make it difficult to obtain information related to protein motion. On the one hand, molecular dynamics simulation requires huge computational resources to achieve satisfactory motion simulations. On the other hand, less accurate procedures such as interpolation methods, do not generate realistic morphs from the kinematic point of view. Analyzing a protein's movement is very similar to serial robots; thus, it is possible to treat the protein chain as a serial mechanism composed of rotational degrees of freedom. Recently, based on this hypothesis, new methodologies have arisen, based on mechanism and robot kinematics, to simulate protein motion. Probabilistic roadmap method, which discretizes the protein configurational space against a scoring function, or the kinetostatic compliance method that minimizes the torques that appear in bonds, aim to simulate protein motion with a reduced computational cost. Results: In this paper a new viewpoint for protein motion simulation, based on mechanism kinematics is presented. The paper describes a set of methodologies, combining different techniques such as structure normalization normalization processes, simulation algorithms and secondary structure detection procedures. The combination of all these procedures allows to obtain kinematic morphs of proteins achieving a very good computational cost-error rate, while maintaining the biological meaning of the obtained structures and the kinematic viability of the obtained motion. Conclusions: The procedure presented in this paper, implements different modules to perform the simulation of the conformational change suffered by a protein when exerting its function. The combination of a main simulation procedure assisted by a secondary structure process, and a side chain orientation strategy, allows to obtain a fast and reliable simulations of protein motion.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Sensitive remote homology detection and accurate alignments especially in the midnight zone of sequence similarity are needed for better function annotation and structural modeling of proteins. An algorithm, AlignHUSH for HMM-HMM alignment has been developed which is capable of recognizing distantly related domain families The method uses structural information, in the form of predicted secondary structure probabilities, and hydrophobicity of amino acids to align HMMs of two sets of aligned sequences. The effect of using adjoining column(s) information has also been investigated and is found to increase the sensitivity of HMM-HMM alignments and remote homology detection. Results: We have assessed the performance of AlignHUSH using known evolutionary relationships available in SCOP. AlignHUSH performs better than the best HMM-HMM alignment methods and is observed to be even more sensitive at higher error rates. Accuracy of the alignments obtained using AlignHUSH has been assessed using the structure-based alignments available in BaliBASE. The alignment length and the alignment quality are found to be appropriate for homology modeling and function annotation. The alignment accuracy is found to be comparable to existing methods for profile-profile alignments. Conclusions: A new method to align HMMs has been developed and is shown to have better sensitivity at error rates of 10% and above when compared to other available programs. The proposed method could effectively aid obtaining clues to functions of proteins of yet unknown function. A web-server incorporating the AlignHUSH method is available at http://crick.mbu.iisc.ernet.in/similar to alignhush/

Relevância:

100.00% 100.00%

Publicador:

Resumo:

It is extremely difficult to explore mRNA folding structure by biological experiments. In this report, we use stochastic sampling and folding simulation to test the existence of the stable secondary structural units of-mRNA, look for the folding units, and explore the probabilistic stabilization of the units. Using this method, We made simulations for all possible local optimum secondary structures of a single strand mRNA within a certain range, and searched for the common parts of the secondary structures. The consensus secondary structure units (CSSUs) extracted from the above method are mainly hairpins, with a few single strands. These CSSUs suggest that the mRNA folding units could be relatively stable and could perform specific biological function. The significance of these observations for the mRNA folding problem in general is also discussed. (c) 2004 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

L’évolution des protéines est un domaine important de la recherche en bioinformatique et catalyse l'intérêt de trouver des outils d'alignement qui peuvent être utilisés de manière fiable et modéliser avec précision l'évolution d'une famille de protéines. TM-Align (Zhang and Skolnick, 2005) est considéré comme l'outil idéal pour une telle tâche, en termes de rapidité et de précision. Par conséquent, dans cette étude, TM-Align a été utilisé comme point de référence pour faciliter la détection des autres outils d'alignement qui sont en mesure de préciser l'évolution des protéines. En parallèle, nous avons élargi l'actuel outil d'exploration de structures secondaires de protéines, Helix Explorer (Marrakchi, 2006), afin qu'il puisse également être utilisé comme un outil pour la modélisation de l'évolution des protéines.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Abstract Presently, Hop stunt viroid(HSVd) and Citrus exocortis viroid (CEVd) are the only viroids reported to infect grapevines (Vitis spp.) in Brazil, among the seven viroid species already reported infecting this host in other countries. All grapevine viroid diseases are graft-transmissible and can induce losses especiallywhenassociatedwithviruses.Theaimofthisworkwas to confirm infection by Grapevine yellow speckle viroid 1(GYSVd-1) in grapevine samples exhibiting yellow speckle symptoms in the leaves and in asymptomatic samples sequenced by next generation sequencing (NGS). The occurrence of this viroid in Brazil was further investigated in a second study. Total RNAs and dsRNAs were extracted from five symptomatic plants and 16 asymptomatic samples, respectively. Specific primers were used for RT-PCR and amplified DNA fragments were cloned and sequenced by the Sanger method. Eleven complete nucleotide sequences of GYSVd-1 isolates (366 ?367 nt) were obtained from NGS and from RT-PCR amplicons. Comparisons showed high identities (95.9 ?100 %) among ten isolates and an identity of 87.2 ?90.4 % with a divergent isolate (RM-BR). Phylogenetic analyses placed GYSVd-1 isolates in four clusters (types 1, 2, 3 and 4). All GYSVd-1 infections were confirmed by conventional RT-PCR and RT-qPCR using specific oligonucleo-tides and a labeled probe. This is the first report and molecular characterization of GYSVd-1 infecting grapevines in Brazil, and our survey indicates that this viroid could be widespread in the major grape producing regions of Brazil. Keywords GYSVd-1 . Incidence . Next generation sequencing. Secondary structure. Vine.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Genomic and proteomic analyses have attracted a great deal of interests in biological research in recent years. Many methods have been applied to discover useful information contained in the enormous databases of genomic sequences and amino acid sequences. The results of these investigations inspire further research in biological fields in return. These biological sequences, which may be considered as multiscale sequences, have some specific features which need further efforts to characterise using more refined methods. This project aims to study some of these biological challenges with multiscale analysis methods and stochastic modelling approach. The first part of the thesis aims to cluster some unknown proteins, and classify their families as well as their structural classes. A development in proteomic analysis is concerned with the determination of protein functions. The first step in this development is to classify proteins and predict their families. This motives us to study some unknown proteins from specific families, and to cluster them into families and structural classes. We select a large number of proteins from the same families or superfamilies, and link them to simulate some unknown large proteins from these families. We use multifractal analysis and the wavelet method to capture the characteristics of these linked proteins. The simulation results show that the method is valid for the classification of large proteins. The second part of the thesis aims to explore the relationship of proteins based on a layered comparison with their components. Many methods are based on homology of proteins because the resemblance at the protein sequence level normally indicates the similarity of functions and structures. However, some proteins may have similar functions with low sequential identity. We consider protein sequences at detail level to investigate the problem of comparison of proteins. The comparison is based on the empirical mode decomposition (EMD), and protein sequences are detected with the intrinsic mode functions. A measure of similarity is introduced with a new cross-correlation formula. The similarity results show that the EMD is useful for detection of functional relationships of proteins. The third part of the thesis aims to investigate the transcriptional regulatory network of yeast cell cycle via stochastic differential equations. As the investigation of genome-wide gene expressions has become a focus in genomic analysis, researchers have tried to understand the mechanisms of the yeast genome for many years. How cells control gene expressions still needs further investigation. We use a stochastic differential equation to model the expression profile of a target gene. We modify the model with a Gaussian membership function. For each target gene, a transcriptional rate is obtained, and the estimated transcriptional rate is also calculated with the information from five possible transcriptional regulators. Some regulators of these target genes are verified with the related references. With these results, we construct a transcriptional regulatory network for the genes from the yeast Saccharomyces cerevisiae. The construction of transcriptional regulatory network is useful for detecting more mechanisms of the yeast cell cycle.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The caseins (αs1, αs2, β, and κ) are phosphoproteins present in bovine milk that have been studied for over a century and whose structures remain obscure. Here we describe the chemical synthesis and structure elucidation of the N-terminal segment (1–44) of bovine κ-casein, the protein which maintains the micellar structure of the caseins. κ-Casein (1–44) was synthesised by highly optimised Boc solid-phase peptide chemistry and characterised by mass spectrometry. Structure elucidation was carried out by circular dichroism and nuclear magnetic resonance spectroscopy. CD analysis demonstrated that the segment was ill defined in aqueous medium but in 30% trifluoroethanol it exhibited considerable helical structure. Further, NMR analysis showed the presence of a helical segment containing 26 residues which extends from Pro8 to Arg34. This is the first report which demonstrates extensive secondary structure within the casein class of proteins.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

While many measures of viewpoint goodness have been proposed in computer graphics, none have been evaluated for ribbon representations of protein secondary structure. To fill this gap, we conducted a user study on Amazon’s Mechanical Turk platform, collecting human viewpoint preferences from 65 participants for 4 representative su- perfamilies of protein domains. In particular, we evaluated viewpoint entropy, which was previously shown to be a good predictor for human viewpoint preference of other, mostly non-abstract objects. In a second study, we asked 7 molecular biology experts to find the best viewpoint of the same protein domains and compared their choices with viewpoint entropy. Our results show that viewpoint entropy overall is a significant predictor of human viewpoint preference for ribbon representations of protein secondary structure. However, the accuracy is highly dependent on the complexity of the structure: while most participants agree on good viewpoints for small, non-globular structures with few secondary structure elements, viewpoint preference varies considerably for complex structures. Finally, experts tend to choose viewpoints of both low and high viewpoint entropy to emphasize different aspects of the respective structure.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Hantaviruses, members of the genus Hantavirus in the Bunyaviridae family, are enveloped single-stranded RNA viruses with tri-segmented genome of negative polarity. In humans, hantaviruses cause two diseases, hemorrhagic fever with renal syndrome (HFRS) and hantavirus pulmonary syndrome (HPS), which vary in severity depending on the causative agent. Each hantavirus is carried by a specific rodent host and is transmitted to humans through excreta of infected rodents. The genome of hantaviruses encodes four structural proteins: the nucleocapsid protein (N), the glycoproteins (Gn and Gc), and the polymerase (L) and also the nonstructural protein (NSs). This thesis deals with the functional characterization of hantavirus N protein with regard to its structure. Structural studies of the N protein have progressed slowly and the crystal structure of the whole protein is still not available, therefore biochemical assays coupled with bioinformatical modeling proved essential for studying N protein structure and functions. Presumably, during RNA encapsidation, the N protein first forms intermediate trimers and then oligomers. First, we investigated the role of N-terminal domain in the N protein oligomerization. The results suggested that the N-terminal region of the N protein forms a coiled-coil, in which two antiparallel alpha helices interact via their hydrophobic seams. Hydrophobic residues L4, I11, L18, L25 and V32 in the first helix and L44, V51, L58 and L65 in the second helix were crucial for stabilizing the structure. The results were consistent with the head-to-head, tail-to-tail model for hantavirus N protein trimerization. We demonstrated that an intact coiled-coil structure of the N terminus is crucial for the oligomerization capacity of the N protein. We also added new details to the head-to-head, tail-to-tail model of trimerization by suggesting that the initial step is based on interaction(s) between intact intra-molecular coiled-coils of the monomers. We further analyzed the importance of charged aa residues located within the coiled-coil for the N protein oligomerization. To predict the interacting surfaces of the monomers we used an upgraded in silico model of the coiled-coil domain that was docked into a trimer. Next the predicted target residues were mutated. The results obtained using the mammalian two-hybrid assay suggested that conserved charged aa residues within the coiled-coil make a substantial contribution to the N protein oligomerization. This contribution probably involves the formation of interacting surfaces of the N monomers and also stabilization of the coiled-coil via intramolecular ionic bridging. We proposed that the tips of the coiled-coils are the first to come into direct contact and thus initiate tight packing of the three monomers into a compact structure. This was in agreement with the previous results showing that an increase in ionic strength abolished the interaction between N protein molecules. We also showed that residues having the strongest effect on the N protein oligomerization are not scattered randomly throughout the coiled-coil 3D model structure, but form clusters. Next we found evidence for the hantaviral N protein interaction with the cytoplasmic tail of the glycoprotein Gn. In order to study this interaction we used the GST pull-down assay in combination with mutagenesis technique. The results demonstrated that intact, properly folded zinc fingers of the Gn protein cytoplasmic tail as well as the middle domain of the N protein (that includes aa residues 80 248 and supposedly carries the RNA-binding domain) are essential for the interaction. Since hantaviruses do not have a matrix protein that mediates the packaging of the viral RNA in other negatve stranded viruses (NSRV), hantaviral RNPs should be involved in a direct interaction with the intraviral domains of the envelope-embedded glycoproteins. By showing the N-Gn interaction we provided the evidence for one of the crucial steps in the virus replication at which RNPs are directed to the site of the virus assembly. Finally we started analysis of the N protein RNA-binding region, which is supposedly located in the middle domain of the N protein molecule. We developed a model for the initial step of RNA-binding by the hantaviral N protein. We hypothesized that the hantaviral N protein possesses two secondary structure elements that initiate the RNA encapsidation. The results suggest that amino acid residues (172-176) presumably act as a hook to catch vRNA and that the positively charged interaction surface (aa residues 144-160) enhances the initial N-RNA interacation. In conclusion, we elucidated new functions of hantavirus N protein. Using in silico modeling we predicted the domain structure of the protein and using experimental techniques showed that each domain is responsible for executing certain function(s). We showed that intact N terminal coiled-coil domain is crucial for oligomerization and charged residues located on its surface form a interaction surface for the N monomers. The middle domain is essential for interaction with the cytoplasmic tail of the Gn protein and RNA binding.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Solution structures of a 23 residue glycopeptide II (KIS* RFLLYMKNLLNRIIDDMVEQ, where * denotes the glycan Gal-beta-(1-3)-alpha-GalNAc) and its deglycosylated counterpart I derived from the C-terminal leucine zipper domain of low molecular weight human salivary mucin (MUC7) were studied using CD, NMR spectroscopy and molecular modeling. The peptide I was synthesized using the Fmoc chemistry following the conventional procedure and the glycopeptide II was synthesized incorporating the O-glycosylated building block (N alpha-Fmoc-Ser-[Ac-4,-beta-D-Gal-(1,3)-Ac(2)alpha-D-GalN(3)]-OPfp) at the appropriate position in stepwise assembly of peptide chain. Solution structures of these glycosylated and nonglycosylated peptides were studied in water and in the presence of 50% of an organic cosolvent, trifluoroethanol (TFE) using circular dichroism (CD), and in 50% TFE using two-dimensional proton nuclear magnetic resonance (2D H-1 NMR) spectroscopy. CD spectra in aqueous medium indicate that the apopeptide I adapts, mostly, a beta-sheet conformation whereas the glycopeptide II assumes helical structure. This transition in the secondary structure, upon glycosylation, demonstrates that the carbohydrate moiety exerts significant effect on the peptide backbone conformation. However, in 50% TFE both the peptides show pronounced helical structure. Sequential and medium range NOEs, C alpha H chemical shift perturbations, (3)J(NH:C alpha H) couplings and deuterium exchange rates of the amide proton resonances in water containing 50% TFE indicate that the peptide I adapts alpha-helical structure from Ile2-Val21 and the glycopeptide II adapts alpha-helical structure from Ser3-Glu22. The observation of continuous stretch of helix in both the peptides as observed by both NMR and CD spectroscopy strongly suggests that the C-terminal domain of MUC7 with heptad repeats of leucines or methionine residues may be stabilized by dimeric leucine zipper motif. The results reported herein may be invaluable in understanding the aggregation (or dimerization) of MUC7 glycoprotein which would eventually have implications in determining its structure-function relationship.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The conformation of amino acid side chains as observed in well-determined structures of globular proteins has earlier been extensively investigated. In contrast, the structural features of the polypeptide backbone that result from the occurrence of specific amino acids along the polypeptide have not been analysed. In this article, we present the statistically significant features in the backbone geometry that appear to be a consequence of the occurrence of rotamers of different amino acid side chains by analysing 102 well-refined structures that form a random collection of proteins. It is found that the persistence of helical segments around each residue is influenced by the residue type. Several residues exert asymmetrical influence between the carboxyl and amino terminal polypeptide segments. The degree to which secondary structures depart from an average geometry also appears to depend on residue type. These departures are correlated to the corresponding Chou and Fasman parameters of amino acid residues. The frequency distribution of the side chain rotamers is influenced by polypeptide secondary structure. In turn, the rotamer conformation of side chain affects the extension of the secondary structure of the backbone. The strongest correlation is found between the occurrence of g+ conformation and helix propagation on the carboxyl side of many residues.