971 resultados para secondary structure detection


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background Small RNA sequencing is commonly used to identify novel miRNAs and to determine their expression levels in plants. There are several miRNA identification tools for animals such as miRDeep, miRDeep2 and miRDeep*. miRDeep-P was developed to identify plant miRNA using miRDeep’s probabilistic model of miRNA biogenesis, but it depends on several third party tools and lacks a user-friendly interface. The objective of our miRPlant program is to predict novel plant miRNA, while providing a user-friendly interface with improved accuracy of prediction. Result We have developed a user-friendly plant miRNA prediction tool called miRPlant. We show using 16 plant miRNA datasets from four different plant species that miRPlant has at least a 10% improvement in accuracy compared to miRDeep-P, which is the most popular plant miRNA prediction tool. Furthermore, miRPlant uses a Graphical User Interface for data input and output, and identified miRNA are shown with all RNAseq reads in a hairpin diagram. Conclusions We have developed miRPlant which extends miRDeep* to various plant species by adopting suitable strategies to identify hairpin excision regions and hairpin structure filtering for plants. miRPlant does not require any third party tools such as mapping or RNA secondary structure prediction tools. miRPlant is also the first plant miRNA prediction tool that dynamically plots miRNA hairpin structure with small reads for identified novel miRNAs. This feature will enable biologists to visualize novel pre-miRNA structure and the location of small RNA reads relative to the hairpin. Moreover, miRPlant can be easily used by biologists with limited bioinformatics skills.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Coleoptera is the most diverse group of insects with over 360,000 described species divided into four suborders: Adephaga, Archostemata, Myxophaga, and Polyphaga. In this study, we present six new complete mitochondrial genome (mtgenome) descriptions, including a representative of each suborder, and analyze the evolution of mtgenomes from a comparative framework using all available coleopteran mtgenomes. We propose a modification of atypical cox1 start codons based on sequence alignment to better reflect the conservation observed across species as well as findings of TTG start codons in other genes. We also analyze tRNA-Ser(AGN) anticodons, usually GCU in arthropods, and report a conserved UCU anticodon as a possible synapomorphy across Polyphaga. We further analyze the secondary structure of tRNA-Ser(AGN) and present a consensus structure and an updated covariance model that allows tRNAscan-SE (via the COVE software package) to locate and fold these atypical tRNAs with much greater consistency. We also report secondary structure predictions for both rRNA genes based on conserved stems. All six species of beetle have the same gene order as the ancestral insect. We report noncoding DNA regions, including a small gap region of about 20 bp between tRNA-Ser(UCN) and nad1 that is present in all six genomes, and present results of a base composition analysis.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We present a machine learning model that predicts a structural disruption score from a protein s primary structure. SCHEMA was introduced by Frances Arnold and colleagues as a method for determining putative recombination sites of a protein on the basis of the full (PDB) description of its structure. The present method provides an alternative to SCHEMA that is able to determine the same score from sequence data only. Circumventing the need for resolving the full structure enables the exploration of yet unresolved and even hypothetical sequences for protein design efforts. Deriving the SCHEMA score from a primary structure is achieved using a two step approach: first predicting a secondary structure from the sequence and then predicting the SCHEMA score from the predicted secondary structure. The correlation coefficient for the prediction is 0.88 and indicates the feasibility of replacing SCHEMA with little loss of precision.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The predicted secondary structure of sub-genomic RNA in dengue virus defective interfering (D.I.) particles from patients, or generated in vitro, resembled that of the 3′ and 5′ regions of wild type dengue virus (DENV) genomes. While these structures in the sub-genomic RNA were found to be essential for its replication, their nucleotide sequences were not, so long as any new sequences maintained wild type RNA secondary structure. These observations suggested that these sub-genomic fragments of RNA from dengue viruses were replicated in the same manner as the full length genomes of their wild type, “helper”, viruses and that they probably represent the smallest fragments of DENV RNA that can be replicated during a natural infection. While D.I. particles containing sub-genomic RNA are completely parasitic, the relationship between wild type and D.I. DENV may be symbiotic, with the D.I. particles enhancing the transmission of infectious DENV.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background Strand specific RNAseq data is now more common in RNAseq projects. Visualizing RNAseq data has become an important matter in Analysis of sequencing data. The most widely used visualization tool is the UCSC genome browser that introduced the custom track concept that enabled researchers to simultaneously visualize gene expression at a particular locus from multiple experiments. Our objective of the software tool is to provide friendly interface for visualization of RNAseq datasets. Results This paper introduces a visualization tool (RNASeqBrowser) that incorporates and extends the functionality of the UCSC genome browser. For example, RNASeqBrowser simultaneously displays read coverage, SNPs, InDels and raw read tracks with other BED and wiggle tracks -- all being dynamically built from the BAM file. Paired reads are also connected in the browser to enable easier identification of novel exon/intron borders and chimaeric transcripts. Strand specific RNAseq data is also supported by RNASeqBrowser that displays reads above (positive strand transcript) or below (negative strand transcripts) a central line. Finally, RNASeqBrowser was designed for ease of use for users with few bioinformatic skills, and incorporates the features of many genome browsers into one platform. Conclusions The features of RNASeqBrowser: (1) RNASeqBrowser integrates UCSC genome browser and NGS visualization tools such as IGV. It extends the functionality of the UCSC genome browser by adding several new types of tracks to show NGS data such as individual raw reads, SNPs and InDels. (2) RNASeqBrowser can dynamically generate RNA secondary structure. It is useful for identifying non-coding RNA such as miRNA. (3) Overlaying NGS wiggle data is helpful in displaying differential expression and is simple to implement in RNASeqBrowser. (4) NGS data accumulates a lot of raw reads. Thus, RNASeqBrowser collapses exact duplicate reads to reduce visualization space. Normal PC’s can show many windows of NGS individual raw reads without much delay. (5) Multiple popup windows of individual raw reads provide users with more viewing space. This avoids existing approaches (such as IGV) which squeeze all raw reads into one window. This will be helpful for visualizing multiple datasets simultaneously. RNASeqBrowser and its manual are freely available at http://www.australianprostatecentre.org/research/software/rnaseqbrowser webcite or http://sourceforge.net/projects/rnaseqbrowser/ webcite

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Copy number variations (CNVs) as described in the healthy population are purported to contribute significantly to genetic heterogeneity. Recent studies have described CNVs using lymphoblastoid cell lines or by application of specifically developed algorithms to interrogate previously described data. However, the full extent of CNVs remains unclear. Using high-density SNP array, we have undertaken a comprehensive investigation of chromosome 18 for CNV discovery and characterisation of distribution and association with chromosome architecture. We identified 399 CNVs, of which loss represents 98%, 58% are less than 2.5 kb in size and 71% are intergenic. Intronic deletions account for the majority of copy number changes with gene involvement. Furthermore, one-third of CNVs do not have putative breakpoints within repetitive sequences. We conclude that replicative processes, mediated either by repetitive elements or microhomology, account for the majority of CNVs in the healthy population. Genomic instability involving the formation of a non-B structure is demonstrated in one region.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The structural stabilizing property of 2,2,2-trifluoroethanol (TFE) in peptides has been widely demonstrated, More recently, TFE has been shown to enhance secondary structure content in globular proteins, and to influence quaternary interactions in protein multimers. The molecular mechanisms by which TFE exerts its Influence on peptide and protein structures remain poorly understood. The present analysis integrates the known physical properties of TFE with a variety of experimental observations on the interaction of TFE with peptides and proteins and on the properties of fluorocarbons. Two features of TFE, namely the hydrophobicity of the trifluoromethyl group and the hydrogen bonding character (strong donor and poor acceptor), emerge as the most important factors for rationalising the observed effects of TFE. A model is proposed for TFE interaction with peptides which involves an initial replacement of the hydration shell by fluoroalcohol molecules, a process driven by apolar interactions and favourable entropy of dehydration. Subsequent bifurcated hydrogen-bond formation with peptide carbonyl groups, which leave intramolecular interactions unaffected, promotes secondary structure formation.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Elucidation of the detailed structural features and sequence requirements for iv helices of various lengths could be very important in understanding secondary structure formation in proteins and, hence. in the protein folding mechanism. An algorithm to characterize the geometry of an alpha helix from its C-alpha coordinates has been developed and used to analyze the structures of long cu helices (number of residues greater than or equal to 25) found in globular proteins, the crystal structure coordinates of which are available from the Brookhaven Protein Data Bank, Ail long a helices can be unambiguously characterized as belonging to one of three classes: linear, curved, or kinked, with a majority being curved. Analysis of the sequences of these helices reveals that the long alpha helices have unique sequence characteristics that distinguish them from the short alpha helices in globular proteins, The distribution and statistical propensities of individual amino acids to occur in long alpha heices are different from those found in short alpha helices, with amino acids having longer side chains and/or having a greater number of functional groups occurring more frequently in these helices, The sequences of the long alpha helices can be correlated with their gross structural features, i.e., whether they are curved, linear, or kinked, and in case of the curved helices, with their curvature.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We study the secondary structure of RNA determined by Watson-Crick pairing without pseudo-knots using Milnor invariants of links. We focus on the first non-trivial invariant, which we call the Heisenber invariant. The Heisenberg invariant, which is an integer, can be interpreted in terms of the Heisenberg group as well as in terms of lattice paths. We show that the Heisenberg invariant gives a lower bound on the number of unpaired bases in an RNA secondary structure. We also show that the Heisenberg invariant can predict allosteric structures for RNA. Namely, if the Heisenberg invariant is large, then there are widely separated local maxima (i.e., allosteric structures) for the number of Watson-Crick pairs found.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The far-ultraviolet region circular dichroic spectrumof serine hydroxymethyltransferase from monkey liver showed that the protein is in an α-helical conformation. The near ultraviolet circular dichoric spectrum revealed two negative bands originating from the tertiary conformational environment of the aromatic amino acid residues. Addition of urea or guanidinium chloride perturbed the characteristic fluorescence and far ultraviolet circular dichroic spectrum of the enzyme. The decrease in (θ)222 and enzyme activity followed identical patterns with increasing concentrations of urea, whereas with guanidinium chloride, the loss of enzyme activity preceded the loss of secondary structure. 2-Chloroethanol, trifluoroethanol and sodium dodecyl sulphate enhanced the mean residue ellipticity values. In addition, sodium dodecyl sulphate also caused a perturbation of the fluorescence emission spectrum of the enzyme. Extremes of pH decreased the – (θ)222 value. Plots of –(θ)222and enzyme activity as a function of pH showed maximal values at pH 7.4-7.5. These results suggested the prevalence of "conformational flexibility" in the structure of serine hydroxymethyltransferase.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The homogeneous serine hydroxymethyltransferase from monkey liver was optimally activate at 60°C and the Arrhenius plot for the enzyme was nonlinear with a break at 15°C. The monkey liver enzyme showed high thermal stability of 62°C, as monitored by circular dichroism at 222 nm, absorbance at 280 nm and enzyme activity. The enzyme exhibited a sharp co-operative thermal transition in the range of 50°-70° (Tm= 65°C), as monitored by circular dichroism. L-Serine protected the enzyme against both thermal inactivation and thermal disruption of the secondary structure. The homotropic interactions of tetrahydrofolate with the enzyme was abolished at high temperatures (at 70°C, the Hill coefficient value was 1.0). A plot of h values vs. assay temperature of tetrahydrofolate saturation experiments, showed the presence of an intermediate conformer with an h value of 1.7 in the temperature range of 45°-60°C. Inclusion of a heat denaturation step in the scheme employed for the purification of serine hydroxymethyltransferase resulted in the loss of cooperative interactions with tetrahydrofolate. The temperature effects on the serine hydroxylmethyltransferase, reported for the first time, lead to a better understanding of the heat induced alterations in conformation and activity for this oligomeric protein.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Comparative studies on protein structures form an integral part of protein crystallography. Here, a fast method of comparing protein structures is presented. Protein structures are represented as a set of secondary structural elements. The method also provides information regarding preferred packing arrangements and evolutionary dynamics of secondary structural elements. This information is not easily obtained from previous methods. In contrast to those methods, the present one can be used only for proteins with some secondary structure. The method is illustrated with globin folds, cytochromes and dehydrogenases as examples.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The galactose-binding lectin from the seeds of the jequirity plant (Abrus precatorius) was subjected to various chemical modifications in order to detect the amino acid residues involved in its binding activity. Modification of lysine, tyrosine, arginine, histidine, glutamic acid and aspartic acid residues did not affect the carbohydratebinding activity of the agglutinin. However, modification of tryptophan residues carried out in native and denaturing conditions with N-bromosuccinimide and 2- hydroxy-5-nitrobenzyl bromide led to a complete loss of its carbohydrate-binding activity. Under denaturing conditions 30 tryptophan residues/molecule were modified by both reagents, whereas only 16 and 18 residues/molecule were available for modification by N-bromosuccinimide and 2-hydroxy-5-nitrobenzyl bromide respectively under native conditions. The relative loss in haemagglutinating activity after the modification of tryptophan residues indicates that two residues/molecule are required for the carbohydrate-binding activity of the agglutinin. A partial protection was observed in the presence of saturating concentrations of lactose (0.15 M). The decrease in fluorescence intensity of Abrus agglutinin on modification of tryptophan residues is linear in the absence of lactose and shows a biphasic pattern in the presence of lactose, indicating that tryptophan residues go from a similar to a different molecular environment on saccharide binding. The secondary structure of the protein remains practically unchanged upon modification of tryptophan residues, as indicated by c.d. and immunodiffusion studies, confirming that the loss in activity is due to modification only.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

CXCL-8 (Interleukin 8) is a CXC chemokine with a central role in the human immune response. We have undertaken extensive in silico analyses to elucidate the interactions of CXCL-8 with its various binding partners, which are crucial for its biological function. Sequence and structure analyses showed that residues in the thirdq β-sheet and basic residues in the heparin binding site are highly variable, while residues in the second β-sheet are highly conserved. Molecular dynamics simulations in aqueous solution of dimeric CXCL-8 have been performed with starting geometries from both X-ray and NMR structures showed shearing movements between the two antiparallel C-terminal helices. Dynamic conservation analyses of these simulations agreed with experimental data indicating that structural differences between the two structures at quaternary level arise from changes in the secondary structure of the N-terminal loop, the 310-helix, the 30s, 40s, and 50s loops and the third β-sheet, resulting in a different interhelical separation. Nevertheless, the observation of these different states indicates that CXCL-8 has the potential to undergo conformational changes, and it seems likely that this feature is relevant to the mode of binding of glycosaminoglycan (GAG) mimetics such as cyclitols. Simulations of the receptor peptide fragment−CXCL-8 complex identified several specific interactions of the receptor peptide with CXCL-8 that could be exploited in the structure-based design of competitive peptides and nonpeptidic molecules targeting CXCL-8 for combating inflammatory diseases. Simulations of the CXCL-8 dimer complexed with a 24-mer heparin fragment and of the CXCL-8−receptor peptide complex revealed that Arg60, Lys64, and Arg68 in the dimer bind to cyclitols in a horseshoe pattern, defining a region which is spatially distinct from the receptor binding site. There appears to be an optimum number of sulfates and an optimum length of alkyl spacers required for the interaction of cyclitol inhibitors with the dimeric form of CXCL-8. Calculation of the binding affinities of cyclitol inhibitors reflected satisfactorily the ranking of experimentally determined inhibitory potencies. The findings of these molecular modeling studies will help in the search for inhibitors which can modulate various CXCL-8 biological activities and serve as an excellent model system to study CXC-inhibitor interactions.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The novel multidomain organization in the multimeric Escherichia coli AHAS I (ilvBN) enzyme has been dissected to generate polypeptide fragments. These fragments when cloned, expressed and purified reassemble in the presence of cofactors to yield a catalytically competent enzyme. Structural characterization of AHAS has been impeded due to the fact that the holoenzyme is prone to dissociation leading to heterogeneity in samples. Our approach has enabled the structural characterization using high-resolution nuclear magnetic resonance methods. Near complete sequence specific NMR assignments for backbone H-N, N-15, C-13 alpha and C-13(beta) atoms of the FAD binding domain of ilvB have been obtained on samples isotopically enriched in H-2, C-13 and N-15. The secondary structure determined on the basis of observed C-13(alpha) secondary chemical shifts and sequential NOEs indicates that the secondary structure of the FAD binding domain of E. coli AHAS large Subunit (ilvB) is similar to the structure of this domain in the catalytic subunit of yeast AHAS. Protein-protein interactions involving the regulatory subunit (ilvN) and the domains of the catalytic subunit (ilvB) were studied using circular dichroic and isotope edited solution nuclear magnetic resonance spectroscopic methods. Observed changes in circular dichroic spectra indicate that the regulatory subunit (ilvN) interacts with ilvB alpha and ilvB beta domains of the catalytic subunit and not with the ilvB gamma domain. NMR chemical shift mapping methods show that ilvN binds close to the FAD binding site in ilvB beta and proximal to the intrasubunit ilvB alpha/ilvB beta domain interface. The implication of this interaction on the role of the regulatory subunit oil the activity of the holoenzyme is discussed. NMR studies of the regulatory domains show that these domains are structured in solution. Preliminary evidence for the interaction of ilvN with the metabolic end product of the pathway, viz., valine is also presented.