151 resultados para Backbone-cyclized Proteins Database


Relevância:

30.00% 30.00%

Publicador:

Resumo:

The cyclotides are a large family of plant proteins that have a cyclic backbone and a knotted arrangement of three conserved disulfide bonds. Despite the apparent complexity of their cystine knot motif it is possible to efficiently fold these proteins, as exemplified by oxidative folding studies on the prototypic cyclotide, kalata B1. This mini-review reports on the current understanding of the folding process in cyclotides. The synthesis and folding of these molecules paves the way for their application as stable molecular templates.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In recent years, there have been increasing numbers of transcripts identified that do not encode proteins, many of which are developmentally regulated and appear to have regulatory functions. Here, we describe the construction of a comprehensive mammalian noncoding RNA database (RNAdb) which contains over 800 unique experimentally studied noncoding RNAs (ncRNAs), including many associated with diseases and/or developmental processes. The database is available at http://research.imb.uq. edu.au/RNAdb and is searchable by many criteria. It includes microRNAs and snoRNAs, but not infrastructural RNAs, such as rRNAs and tRNAs, which are catalogued elsewhere. The database also includes over 1100 putative antisense ncRNAs and almost 20000 putative ncRNAs identified in high-quality murine and human cDNA libraries, with more to be added in the near future. Many of these RNAs are large, and many are spliced, some alternatively. The database will be useful as a foundation for the emerging field of RNomics and the characterization of the roles of ncRNAs in mammalian gene expression and regulation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The presence and location of intramolecular disulphide bonds are a key determinant of the structure and function of proteins. Intramolecular disulphide bonds in proteins have previously been analyzed under the assumption that there is no clear relationship between disulphide arrangement and disulphide concentration. To investigate this, a set of sequence nonhomologous protein chains containing one or more intramolecular disulphide bonds was extracted from the Protein Data Bank, and the arrangements of the bonds, Protein Data Bank header, and Structural Characterization of Proteins fold were analyzed as a function of intramolecular, containing proteins were disulphide bond concentration. Two populations of intramolecular disulphide bond-containing identified, with a naturally occurring partition at 25 residues per bond. These populations were named intramolecular disulphide bond-rich and -poor. Benefits of partitioning were illustrated by three results: (1) rich chains most frequently contained three disulphides, explaining the plateaux in extant disulphide frequency distributions; (2) a positive relationship between median chain length and the number of disulphides, only seen when the data were partitioned-, and (3) the most common bonding pattern for chains with three disulphide bonds was based on the most common for two, only when the data were partitioned. The two populations had different headers, folds, bond arrangements, and chain lengths. Associations between IDSB concentration, IDSB bonding pattern, loop sizes, SCOP fold, and PDB header were also found. From this, we found that intramolecular disulphide bond-rich and -poor proteins follow different bonding rules, and must be considered separately to generate meaningful models of bond formation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Candida albicans is a pathogen commonly infecting patients who receive immunosuppressive drug therapy, long-term catheterization, or those who suffer from acquired immune deficiency syndrome (AIDS). The major factor accountable for pathogenicity of C. albicans is host immune status. Various virulence molecules, or factors, of are also responsible for the disease progression. Virulence proteins are published in public databases but they normally lack detailed functional annotations. We have developed CandiVF, a specialized database of C. albicans virulence factors (http://antigen.i2r.a-star.edu.sg/Templar/DB/CandiVF/) to facilitate efficient extraction and analysis of data aimed to assist research on immune responses, pathogenesis, prevention, and control of candidiasis. CandiVF contains a large number of annotated virulence proteins, including secretory, cell wall-associated, membrane, cytoplasmic, and nuclear proteins. This database has in-built bioinformatics tools including keyword and BLAST search, visualization of 3D-structures, HLA-DR epitope prediction, virulence descriptors, and virulence factors ontology.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cyclotides are peptides from plants of the Rubiaceae and Violaceae families that have the unusual characteristic of a macrocylic backbone. They are further characterized by their incorporation of a cystine knot in which two disulfides, along with the intervening backbone residues, form a ring through which a third disulfide is threaded. The cyclotides have been found in every Violaceae species screened to date but are apparently present in only a few Rubiaceae species. The selective distribution reported so far raises questions about the evolution of the cyclotides within the plant kingdom. In this study, we use a combined bioinformatics and expression analysis approach to elucidate the evolution and distribution of the cyclotides in the plant kingdom and report the discovery of related sequences widespread in the Poaceae family, including crop plants such as rice ( Oryza sativa), maize ( Zea mays), and wheat ( Triticum aestivum), which carry considerable economic and social importance. The presence of cyclotide-like sequences within these plants suggests that the cyclotides may be derived from an ancestral gene of great antiquity. Quantitative RT-PCR was used to show that two of the discovered cyclotide-like genes from rice and barley ( Hordeum vulgare) have tissue-specific expression patterns.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Anisotropic magnetic susceptibility tensors chi of paramagnetic metal ions are manifested in pseudocontact shifts, residual dipolar couplings, and other paramagnetic observables that present valuable long-range information for structure determinations of protein-ligand complexes. A program was developed for automatic determination of the chi-tensor anisotropy parameters and amide resonance assignments in proteins labeled with paramagnetic metal ions. The program requires knowledge of the three-dimensional structure of the protein, the backbone resonance assignments of the diamagnetic protein, and a pair of 2D N-15-HSQC or 3D HNCO spectra recorded with and without paramagnetic metal ion. It allows the determination of reliable chi-tensor anisotropy parameters from 2D spectra of uniformly N-15-labeled proteins of fairly high molecular weight. Examples are shown for the 185-residue N-terminal domain of the subunit epsilon from E. coli DNA polymerase III in complex with the subunit theta and La3+ in its diamagnetic and Dy3+, Tb3+, and Er3+ in its paramagnetic form.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The aim of this work was to elucidate the oxidative folding mechanism of the macrocyclic cystine knot protein MCoTI-II. We aimed to investigate how the six-cysteine residues distributed on the circular backbone of the reduced unfolded peptide recognize their correct partner and join up to form a complex cystine-knotted topology. To answer this question, we studied the oxidative folding of the naturally occurring peptide using a range of spectroscopic methods. For both oxidative folding and reductive unfolding, the same disulfide intermediate species was prevalent and was characterized to be a native-like two-disulfide intermediate in which the Cys(1)-Cys(18) disulfide bond was absent. Overall, the folding pathway of this head-to-tail cyclized protein was found to be similar to that of linear cystine knot proteins from the squash family of trypsin inhibitors. However, the pathway differs in an important way from that of the cyclotide kalata B1, in that the equivalent two-disulfide intermediate in that case is not a direct precursor of the native protein. The size of the embedded ring within the cystine knot motif appears to play a crucial role in the folding pathway. Larger rings contribute to the independence of disulfides and favor an on-pathway native-like intermediate that has a smaller energy barrier to cross to form the native fold. The fact that macrocyclic proteins are readily able to fold to a complex knotted structure in vitro in the absence of chaperones makes them suitable as protein engineering scaffolds that have remarkable stability.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: The residue-wise contact order (RWCO) describes the sequence separations between the residues of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered as a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable important information to reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. Results: We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences. Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. Conclusion: The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Application of a computational membrane organization prediction pipeline, MemO, identified putative type II membrane proteins as proteins predicted to encode a single alpha-helical transmembrane domain (TMD) and no signal peptides. MemO was applied to RIKEN's mouse isoform protein set to identify 1436 non-overlapping genomic regions or transcriptional units (TUs), which encode exclusively type II membrane proteins. Proteins with overlapping predicted InterPro and TMDs were reviewed to discard false positive predictions resulting in a dataset comprised of 1831 transcripts in 1408 TUs. This dataset was used to develop a systematic protocol to document subcellular localization of type II membrane proteins. This approach combines mining of published literature to identify subcellular localization data and a high-throughput, polymerase chain reaction (PCR)-based approach to experimentally characterize subcellular localization. These approaches have provided localization data for 244 and 169 proteins. Type II membrane proteins are localized to all major organelle compartments; however, some biases were observed towards the early secretory pathway and punctate structures. Collectively, this study reports the subcellular localization of 26% of the defined dataset. All reported localization data are presented in the LOCATE database (http://www.locate.imb.uq.edu.au).

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Short proteins play key roles in cell signalling and other processes, but their abundance in the mammalian proteome is unknown. Current catalogues of mammalian proteins exhibit an artefactual discontinuity at a length of 100 aa, so that protein abundance peaks just above this length and falls off sharply below it. To clarify the abundance of short proteins, we identify proteins in the FANTOM collection of mouse cDNAs by analysing synonymous and nonsynonymous substitutions with the computer program CRITICA. This analysis confirms that there is no real discontinuity at length 100. Roughly 10% of mouse proteins are shorter than 100 aa, although the majority of these are variants of proteins longer than 100 aa. We identify many novel short proteins, including a dark matter'' subset containing ones that lack detectable homology to other known proteins. Translation assays confirm that some of these novel proteins can be translated and localised to the secretory pathway.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cyclotides are a fascinating family of plant-derived peptides characterized by their head-to-tail cyclized backbone and knotted arrangement of three disulfide bonds. This conserved structural architecture, termed the CCK (cyclic cystine knot), is responsible for their exceptional resistance to thermal, chemical and enzymatic degradation. Cyclotides have a variety of biological activities, but their insecticidal activities suggest that their primary function is in plant defence. In the present study, we determined the cyclotide content of the sweet violet Viola odorata, a member of the Violaceae family. We identified 30 cyclotides from the aerial parts and roots of this plant, 13 of which are novel sequences. The new sequences provide information about the natural diversity of cyclotides and the role of particular residues in defining structure and function. As many of the biological activities of cyclotides appear to be associated with membrane interactions, we used haemolytic activity as a marker of bioactivity for a selection of the new cyclotides. The new cyclotides were tested for their ability to resist proteolysis by a range of enzymes and, in common with other cyclotides, were completely resistant to trypsin, pepsin and thermolysin. The results show that while biological activity varies with the sequence, the proteolytic stability of the framework does not, and appears to be an inherent feature of the cyclotide framework. The structure of one of the new cyclotides, cycloviolacin O14, was determined and shown to contain the CCK motif. This study confirms that cyclotides may be regarded as a natural combinatorial template that displays a variety of peptide epitopes most likely targeted to a range of plant pests and pathogens.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A novel strategy for fast NMR resonance assignment of N-15 HSQC spectra of proteins is presented. It requires the structure coordinates of the protein, a paramagnetic center, and one or more residue-selectively N-15-labeled samples. Comparison of sensitive undecoupled N-15 HSQC spectra recorded of paramagnetic and diamagnetic samples yields data for every cross-peak on pseudocontact shift, paramagnetic relaxation enhancement, cross-correlation between Curie-spin and dipole-dipole relaxation, and residual dipolar coupling. Comparison of these four different paramagnetic quantities with predictions from the three-dimensional structure simultaneously yields the resonance assignment and the anisotropy of the susceptibility tensor of the paramagnetic center. The method is demonstrated with the 30 kDa complex between the N-terminal domain of the epsilon subunit and the theta subunit of Escherichia Coll DNA polymerase III. The program PLATYPUS was developed to perform the assignment, provide a measure of reliability of the assignment, and determine the susceptibility tensor anisotropy.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

As part of ACIAR project ASEM/2003/052, Improving Financial Returns to Smallholder Tree Farmers in the Philippines, plantations of timber trees in Leyte Island, the Philippines were located using a systematic survey of the island. The survey was undertaken in order to compile a database of plantations which could be used to guide the planning of project activities. In addition to recording a range of qualitative and quantitative information for each plantation, the survey spatially referenced each site using a Global Positioning System (GPS) to electronic maps of the island which were held in a Geographical Information System (GIS). Microsoft Excel and Mapsource® software were used as the software links between GPS coordinates and the GIS. Mapping of farm positions was complicated by different datums being used for maps of Leyte Island and this caused GPS positions to be displaced from equivalent positions on the map. Photos of the sites were hyperlinked to their map positions in the GIS in order to assist staff to recall site characteristics.