Biblioteca Digital

159 resultados para MAIN-SEQUENCE STARS

Cascaded walks in protein sequence space: use of artificial sequences in remote homology detection between natural proteins

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Over the past two decades, many ingenious efforts have been made in protein remote homology detection. Because homologous proteins often diversify extensively in sequence, it is challenging to demonstrate such relatedness through entirely sequence-driven searches. Here, we describe a computational method for the generation of `protein-like' sequences that serves to bridge gaps in protein sequence space. Sequence profile information, as embodied in a position-specific scoring matrix of multiply aligned sequences of bona fide family members, serves as the starting point in this algorithm. The observed amino acid propensity and the selection of a random number dictate the selection of a residue for each position in the sequence. In a systematic manner, and by applying a `roulette-wheel' selection approach at each position, we generate parent family-like sequences and thus facilitate an enlargement of sequence space around the family. When generated for a large number of families, we demonstrate that they expand the utility of natural intermediately related sequences in linking distant proteins. In 91% of the assessed examples, inclusion of designed sequences improved fold coverage by 5-10% over searches made in their absence. Furthermore, with several examples from proteins adopting folds such as TIM, globin, lipocalin and others, we demonstrate that the success of including designed sequences in a database positively sensitized methods such as PSI-BLAST and Cascade PSI-BLAST and is a promising opportunity for enormously improved remote homology recognition using sequence information alone.

Progressive structure-based alignment of homologous proteins: Adopting sequence comparison strategies

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Comparison of multiple protein structures has a broad range of applications in the analysis of protein structure, function and evolution. Multiple structure alignment tools (MSTAs) are necessary to obtain a simultaneous comparison of a family of related folds. In this study, we have developed a method for multiple structure comparison largely based on sequence alignment techniques. A widely used Structural Alphabet named Protein Blocks (PBs) was used to transform the information on 3D protein backbone conformation as a ID sequence string. A progressive alignment strategy similar to CLUSTALW was adopted for multiple PB sequence alignment (mulPBA). Highly similar stretches identified by the pairwise alignments are given higher weights during the alignment. The residue equivalences from PB based alignments are used to obtain a three dimensional fit of the structures followed by an iterative refinement of the structural superposition. Systematic comparisons using benchmark datasets of MSTAs underlines that the alignment quality is better than MULTIPROT, MUSTANG and the alignments in HOMSTRAD, in more than 85% of the cases. Comparison with other rigid-body and flexible MSTAs also indicate that mulPBA alignments are superior to most of the rigid-body MSTAs and highly comparable to the flexible alignment methods. (C) 2012 Elsevier Masson SAS. All rights reserved.

Sequence and structural basis for chromosomal fragility during translocations in cancer

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Chromosomal aberration is considered to be one of the major characteristic features in many cancers. Chromosomal translocation, one type of genomic abnormality, can lead to deregulation of critical genes involved in regulating important physiological functions such as cell proliferation and DNA repair. Although chromosomal translocations were thought to be random events, recent findings suggest that certain regions in the human genome are more susceptible to breakage than others. The possibility of deviation from the usual B-DNA conformation in such fragile regions has been an active area of investigation. This review summarizes the factors that contribute towards the fragility of these regions in the chromosomes, such as DNA sequences and the role of different forms of DNA structures. Proteins responsible for chromosomal fragility, and their mechanism of action are also discussed. The effect of positioning of chromosomes within the nucleus favoring chromosomal translocations and the role of repair mechanisms are also addressed.

Draft Genome Sequence of Staphylococcus aureus ST672, an Emerging Disease Clone from India

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report the draft genome sequence of methicillin-resistant Staphylococcus aureus (MRSA) strain ST672, an emerging disease clone in India, from a septicemia patient. The genome size is about 2.82 Mb with 2,485 open reading frames (ORFs). The staphylococcal cassette chromosome mec (SCCmec) element (type V) and immune evasion cluster appear to be different from those of strain ST772 on preliminary examination.

Generation and analysis of drought stressed subtracted expressed sequence tags from safflower (Carthamus tinctorius L.)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Drought is the most crucial environmental factor that limits productivity of many crop plants. Exploring novel genes and gene combinations is of primary importance in plant drought tolerance research. Stress tolerant genotypes/species are known to express novel stress responsive genes with unique functional significance. Hence, identification and characterization of stress responsive genes from these tolerant species might be a reliable option to engineer the drought tolerance. Safflower has been found to be a relatively drought tolerant crop and thus, it has been the choice of study to characterize the genes expressed under drought stress. In the present study, we have evaluated differential drought tolerance of two cultivars of safflower namely, A1 and Nira using selective physiological marker traits and we have identified cultivar A1 as relatively drought tolerant. To identify the drought responsive genes, we have constructed a stress subtracted cDNA library from cultivar A1 following subtractive hybridization. Analysis of similar to 1,300 cDNA clones resulted in the identification of 667 unique drought responsive ESTs. Protein homology search revealed that 521 (78 %) out of 667 ESTs showed significant similarity to known sequences in the database and majority of them previously identified as drought stress-related genes and were found to be involved in a variety of cellular functions ranging from stress perception to cellular protection. Remaining 146 (22 %) ESTs were not homologous to known sequences in the database and therefore, they were considered to be unique and novel drought responsive genes of safflower. Since safflower is a stress-adapted oil-seed crop this observation has great relevance. In addition, to validate the differential expression of the identified genes, expression profiles of selected clones were analyzed using dot blot (reverse northern), and northern blot analysis. We showed that these clones were differentially expressed under different abiotic stress conditions. The implications of the analyzed genes in abiotic stress tolerance are discussed in our study.

The defect sequence for contractive tuples

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We introduce the defect sequence for a contractive tuple of Hilbert space operators and investigate its properties. The defect sequence is a sequence of numbers, called defect dimensions associated with a contractive tuple. We show that there are upper bounds for the defect dimensions. The tuples for which these upper bounds are obtained, are called maximal contractive tuples. The upper bounds are different in the non-commutative and in the commutative case. We show that the creation operators on the full Fock space and the coordinate multipliers on the Drury-Arveson space are maximal. We also study pure tuples and see how the defect dimensions play a role in their irreducibility. (C) 2012 Elsevier Inc. All rights reserved.

Structural and molecular basis of interaction of HCV non-structural protein 5A with human casein kinase 1 alpha and PKR

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Interaction of non-structural protein 5A (NS5A) of Hepatitis C virus (HCV) with human kinases namely, casein kinase 1 alpha (ck1 alpha) and protein kinase R (PKR) have different functional implications such as regulation of viral replication and evasion of interferon induced immune response respectively. Understanding the structural and molecular basis of interactions of the viral protein with two different human kinases can be useful in developing strategies for treatment against HCV. Results: Serine 232 of NS5A is known to be phosphorylated by human ck1 alpha. A structural model of NS5A peptide containing phosphoacceptor residue Serine 232 bound to ck1 alpha has been generated using the known 3-D structures of kinase-peptide complexes. The substrate interacting residues in ck1 alpha has been identified from the model and these are found to be conserved well in the ck1 family. ck1 alpha - substrate peptide complex has also been used to understand the structural basis of association between ck1 alpha and its other viral stress induced substrate, tumour suppressor p53 transactivation domain which has a crystal structure available. Interaction of NS5A with another human kinase PKR is primarily genotype specific. NS5A from genotype 1b has been shown to interact and inhibit PKR whereas NS5A from genotype 2a/3a are unable to bind and inhibit PKR efficiently. This is one of the main reasons for the varied response to interferon therapy in HCV patients across different genotypes. Using PKR crystal structure, sequence alignment and evolutionary trace analysis some of the critical residues responsible for the interaction of NS5A 1b with PKR have been identified. Conclusions: The substrate interacting residues in ck1 alpha have been identified using the structural model of kinase substrate peptide. The PKR interacting NS5A 1b residues have also been predicted using PKR crystal structure, NS5A sequence analysis along with known experimental results. Functional significance and nature of interaction of interferon sensitivity determining region and variable region 3 of NS5A in different genotypes with PKR which was experimentally shown are also supported by the findings of evolutionary trace analysis. Designing inhibitors to prevent this interaction could enable the HCV genotype 1 infected patients respond well to interferon therapy.

Improved Detection of Remote Homologues Using Cascade PSI-BLAST: Influence of Neighbouring Protein Families on Sequence Coverage

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Development of sensitive sequence search procedures for the detection of distant relationships between proteins at superfamily/fold level is still a big challenge. The intermediate sequence search approach is the most frequently employed manner of identifying remote homologues effectively. In this study, examination of serine proteases of prolyl oligopeptidase, rhomboid and subtilisin protein families were carried out using plant serine proteases as queries from two genomes including A. thaliana and O. sativa and 13 other families of unrelated folds to identify the distant homologues which could not be obtained using PSI-BLAST. Methodology/Principal Findings: We have proposed to start with multiple queries of classical serine protease members to identify remote homologues in families, using a rigorous approach like Cascade PSI-BLAST. We found that classical sequence based approaches, like PSI-BLAST, showed very low sequence coverage in identifying plant serine proteases. The algorithm was applied on enriched sequence database of homologous domains and we obtained overall average coverage of 88% at family, 77% at superfamily or fold level along with specificity of similar to 100% and Mathew's correlation coefficient of 0.91. Similar approach was also implemented on 13 other protein families representing every structural class in SCOP database. Further investigation with statistical tests, like jackknifing, helped us to better understand the influence of neighbouring protein families. Conclusions/Significance: Our study suggests that employment of multiple queries of a family for the Cascade PSI-BLAST searches is useful for predicting distant relationships effectively even at superfamily level. We have proposed a generalized strategy to cover all the distant members of a particular family using multiple query sequences. Our findings reveal that prior selection of sequences as query and the presence of neighbouring families can be important for covering the search space effectively in minimal computational time. This study also provides an understanding of the `bridging' role of related families.

Non-polio enteroviruses and their association with acute diarrhea in children in India

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A causative agent in approximately 40% of diarrhea] cases. still remains unidentified. Though many enteroviruses (EVs) are transmitted through fecal-oral route and replicate in the intestinal cells, their association with acute diarrhea has not so far been recognized due to lack of detailed epidemiological investigations. This long-term, detailed molecular epidemiological study aims to conclusively determine the association of non-polio enteroviruses (NPEVs) with acute diarrhea in comaparison with rotavirus (RV) in children. Diarrheal stool specimens from 2161 children aged 0-2 years and 169 children between 2 and 9 years, and 1800 normal stool samples from age-matched healthy children between 0 and 9 years were examined during 2008-2012 for enterovirus (oral polio vaccine strains (OPVs) and NPEVs). Enterovirus serotypes were identified by complete VP1 gene sequence analysis. Enterovirus and rotavirus were detected in 19.01% (380/2330) and 13.82% (322/2330) diarrheal stools. During the study period, annual prevalence of EV- and RV-associated diarrhea ranged between 8% and 22%, but with contrasting seasonal prevalence with RV predominating during winter months and NPEV prevailing in other seasons. NPEVs are associated with epidemics-like outbreaks during which they are detected in up to 50% of diarrheic children, and in non-epidemic seasons in 0-10% of the patients. After subtraction of OPV-positive diarrheal cases (1.81%), while NPEVs are associated with about 17% of acute diarrhea, about 6% of healthy children showed asymptomatic NPEV excretion. Of 37 NPEV serotypes detected in diarrheal children, seven echovirus types 1, 7, 11, 13, 14, 30 and 33 are frequently observed, with Ell being more prevalent followed by E30. In conclusion, NPEVs are significantly associated with acute diarrhea, and NPEVs and rotavirus exhibit contrasting seasonal predominance. This study signifies the need for a new direction of research on enteroviruses involving systematic analysis of their contribution to diarrheal burden. (C) 2013 Elsevier B.V. All rights reserved.

The sequence and structure of snake gourd (Trichosanthes anguina) seed lectin, a three-chain nontoxic homologue of type II RIPs

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The sequence and structure of snake gourd seed lectin (SGSL), a nontoxic homologue of type II ribosome-inactivating proteins (RIPs), have been determined by mass spectrometry and X-ray crystallography, respectively. As in type II RIPs, the molecule consists of a lectin chain made up of two beta-trefoil domains. The catalytic chain, which is connected through a disulfide bridge to the lectin chain in type II RIPs, is cleaved into two in SGSL. However, the integrity of the three-dimensional structure of the catalytic component of the molecule is preserved. This is the first time that a three-chain RIP or RIP homologue has been observed. A thorough examination of the sequence and structure of the protein and of its interactions with the bound methyl-alpha-galactose indicate that the nontoxicity of SGSL results from a combination of changes in the catalytic and the carbohydrate-binding sites. Detailed analyses of the sequences of type II RIPs of known structure and their homologues with unknown structure provide valuable insights into the evolution of this class of proteins. They also indicate some variability in carbohydrate-binding sites, which appears to contribute to the different levels of toxicity exhibited by lectins from various sources.

Common recognition principles across diverse sequence and structural families of sialic acid binding proteins

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Sialic acids form a large family of 9-carbon monosaccharides and are integral components of glycoconjugates. They are known to bind to a wide range of receptors belonging to diverse sequence families and fold classes and are key mediators in a plethora of cellular processes. Thus, it is of great interest to understand the features that give rise to such a recognition capability. Structural analyses using a non-redundant data set of known sialic acid binding proteins was carried out, which included exhaustive binding site comparisons and site alignments using in-house algorithms, followed by clustering and tree computation, which has led to derivation of sialic acid recognition principles. Although the proteins in the data set belong to several sequence and structure families, their binding sites could be grouped into only six types. Structural comparison of the binding sites indicates that all sites contain one or more different combinations of key structural features over a common scaffold. The six binding site types thus serve as structural motifs for recognizing sialic acid. Scanning the motifs against a non-redundant set of binding sites from PDB indicated the motifs to be specific for sialic acid recognition. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. As an example analysis, a genome-wide scan for the motifs in structures of Mycobacterium tuberculosis proteome identified 17 hits that contain combinations of the features, suggesting a possible function of sialic acid binding by these proteins.

Filling-in Void and Sparse Regions in Protein Sequence Space by Protein-Like Artificial Sequences Enables Remarkable Enhancement in Remote Homology Detection Capability

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Protein functional annotation relies on the identification of accurate relationships, sequence divergence being a key factor. This is especially evident when distant protein relationships are demonstrated only with three-dimensional structures. To address this challenge, we describe a computational approach to purposefully bridge gaps between related protein families through directed design of protein-like ``linker'' sequences. For this, we represented SCOP domain families, integrated with sequence homologues, as multiple profiles and performed HMM-HMM alignments between related domain families. Where convincing alignments were achieved, we applied a roulette wheel-based method to design 3,611,010 protein-like sequences corresponding to 374 SCOP folds. To analyze their ability to link proteins in homology searches, we used 3024 queries to search two databases, one containing only natural sequences and another one additionally containing designed sequences. Our results showed that augmented database searches showed up to 30% improvement in fold coverage for over 74% of the folds, with 52 folds achieving all theoretically possible connections. Although sequences could not be designed between some families, the availability of designed sequences between other families within the fold established the sequence continuum to demonstrate 373 difficult relationships. Ultimately, as a practical and realistic extension, we demonstrate that such protein-like sequences can be ``plugged-into'' routine and generic sequence database searches to empower not only remote homology detection but also fold recognition. Our richly statistically supported findings show that complementary searches in both databases will increase the effectiveness of sequence-based searches in recognizing all homologues sharing a common fold. (C) 2013 Elsevier Ltd. All rights reserved.

Rapid determination of main constituents of packed juices by reverse phase-high performance liquid chromatography: an insight in to commercial fruit drinks

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The present work reports the compositional analysis of thirteen different packed fruit juices using high performance liquid chromatography (HPLC). Vitamin C, organic acids (citric and malic) and sugars (fructose, glucose and sucrose) were separated, analyzed and quantified using different reverse phase methods. A new rapid reverse phase HPLC method was developed for routine analysis of vitamin C in fruit juices. The precision results of the methods showed that the relative standard deviations of the repeatability and reproducibility were < 0.05 and < 0.1 respectively. Correlation coefficient of the calibration models developed was found to be higher than 0.99 in each case. It has been found that the content of Vitamin C was less variable amongst different varieties involved in the study. It is also observed that in comparison to fresh juices, the packed juices contain lesser amounts of vitamin C. Citric acid was found as the major organic acids present in packed juices while maximum portion of sugars was of sucrose. Comparison of the amount of vitamin C, organic acids and sugars in same fruit juice of different commercial brands is also reported.

Specific Sequence of a Beta Turn in Human La Protein May Contribute to Species Specificity of Hepatitis C Virus

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Human La protein is known to be an essential host factor for translation and replication of hepatitis C virus (HCV) RNA. Previously, we have demonstrated that residues responsible for interaction of human La protein with the HCV internal ribosomal entry site (IRES) around the initiator AUG within stem-loop IV form a beta-turn in the RNA recognition motif (RRM) structure. In this study, sequence alignment and mutagenesis suggest that the HCV RNA-interacting beta-turn is conserved only in humans and chimpanzees, the species primarily known to be infected by HCV. A 7-mer peptide corresponding to the HCV RNA-interacting region of human La inhibits HCV translation, whereas another peptide corresponding to the mouse La sequence was unable to do so. Furthermore, IRES-mediated translation was found to be significantly high in the presence of recombinant human La protein in vitro in rabbit reticulocyte lysate. We observed enhanced replication with HCV subgenomic and full-length replicons upon overexpression of either human La protein or a chimeric mouse La protein harboring a human La beta-turn sequence in mouse cells. Taken together, our results raise the possibility of creating an immunocompetent HCV mouse model using human-specific cell entry factors and a humanized form of La protein.

RES-TOCSY: A facile approach for accurate determination of magnitudes, and relative signs of (n)J(HF)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The RES-TOCSY experiment for accurate determination of heteronuclear (n)J(HF) is reported. The main feature of the proposed technique is the accurate measurement of magnitudes of heteronuclear couplings from the displacement of cross sections of the 2D spectrum and their relative signs from the slopes of their displacement vectors. The experiment is highly advantageous as the couplings of smaller magnitudes hidden within line widths could also be accurately determined, and also in situations when the spectrum does not display any coupling fine structures. The efficient utility of the developed pulse sequence is unambiguously established on fluorine containing aromatic and aliphatic molecules. (C) 2014 Elsevier B.V. All rights reserved.

«
1
2
3
4
5
6
7
8
9
10
11
»