Biblioteca Digital

997 resultados para intrinsically disordered sequences

Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.

An algorithm to find all palindromic sequences in proteins

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A palindrome is a set of characters that reads the same forwards and backwards. Since the discovery of palindromic peptide sequences two decades ago, little effort has been made to understand its structural, functional and evolutionary significance. Therefore, in view of this, an algorithm has been developed to identify all perfect palindromes (excluding the palindromic subset and tandem repeats) in a single protein sequence. The proposed algorithm does not impose any restriction on the number of residues to be given in the input sequence. This avant-garde algorithm will aid in the identification of palindromic peptide sequences of varying lengths in a single protein sequence.

Promotion of Folding in Hybrid Peptides through Unconstrained gamma Residues: Structural Characterization of Helices in (alpha gamma gamma)(n) and (alpha gamma alpha)(n) Sequences

Relevância:

20.00% 20.00%

Publicador:

Majorana fermions in superconducting 1D systems having periodic, quasiperiodic, and disordered potentials

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present a unified study of the effect of periodic, quasiperiodic, and disordered potentials on topological phases that are characterized by Majorana end modes in one-dimensional p-wave superconducting systems. We define a topological invariant derived from the equations of motion for Majorana modes and, as our first application, employ it to characterize the phase diagram for simple periodic structures. Our general result is a relation between the topological invariant and the normal state localization length. This link allows us to leverage the considerable literature on localization physics and obtain the topological phase diagrams and their salient features for quasiperiodic and disordered systems for the entire region of parameter space. DOI: 10.1103/PhysRevLett.110.146404

C12-Helix development in ()n sequences - spectroscopic characterization of Boc-Aib-4(R)Val]-OMe oligomers

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The solution conformations of the -hybrid oligopeptides Boc-Aib-4(R)Val]n-OMe (n = 1-8) in organic solvents have been probed by NMR, IR, and CD spectroscopic methods. In the solid state, this peptide series favors C12-helical conformations, which are backbone-expanded analogues of 310 helices in -peptide sequences. NMR studies of the six- (n = 3) and 16-residue (n = 8) peptides reveal that only two NH protons attached the N-terminus residues Aib(1) and 4(R)Val(2) are solvent-exposed. Sequential NiH-Ni+1H NOEs characteristic of local helical conformations are also observed at the residues. IR studies establish that chain extension leads to a large enhancement in the intensities of the hydrogen-bonded NH stretching bands (3343-3280 cm-1), which suggest elongation of intramolecularly hydrogen-bonded structures. The development of C12-helical structures upon lengthening of the sequence is supported by the NMR and IR observations. The CD spectra of the ()n peptides reveal a negative maximum at ca. 206 nm and a positive maximum at ca. 192 nm, spectral feature that are distinct from those of 310 helices in -peptides.

Detection of Hepatitis B DNA Sequences on Polyelectrolyte Based Non-Covalently Functionalized Flexible Plastic Substrates

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Development of simple functionalization methods to attach biomolecules such as proteins and DNA on inexpensive substrates is important for widespread use of low cost, disposable biosensors. Here, we describe a method based on polyelectrolyte multilayers to attach single stranded DNA molecules to conventional glass slides as well as a completely non-standard substrate, namely flexible plastic transparency sheets. We then use the functionalized transparency sheets to specifically detect single stranded Hepatitis B DNA sequences from samples. We also demonstrate a blocking method for reducing non-specific binding of target DNA sequences using negatively charged polyelectrolyte molecules. The polyelectrolyte based functionalization method, which relies on surface charge as opposed to covalent surface linkages, could be an attractive platform to develop assays on inexpensive substrates for low cost biosensing.

RepEx: Repeat extractor for biological sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Genomic sequences are far from being random but are made up of systematically ordered and information rich patterns. These repeated sequence patterns have been vastly utilized for their fundamental importance in understanding the genome function and organization. To this end, a comprehensive toolkit, RepEx, has been developed which extracts repeat (inverted, everted and mirror) patterns from the given genome sequence(s) without any constraints. The toolkit can also be used to fetch the inverted repeats present in the protein sequence (s). Further, it is capable of extracting exact and degenerate repeats with a user defined spacer intervals. It is remarkably more precise and sensitive when compared to the existing tools. An example with comprehensive case studies and a performance evaluation of the proposed toolkit has been presented to authenticate its efficiency and accuracy. (C) 2013 Elsevier Inc. All rights reserved.

Filling-in Void and Sparse Regions in Protein Sequence Space by Protein-Like Artificial Sequences Enables Remarkable Enhancement in Remote Homology Detection Capability

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Protein functional annotation relies on the identification of accurate relationships, sequence divergence being a key factor. This is especially evident when distant protein relationships are demonstrated only with three-dimensional structures. To address this challenge, we describe a computational approach to purposefully bridge gaps between related protein families through directed design of protein-like ``linker'' sequences. For this, we represented SCOP domain families, integrated with sequence homologues, as multiple profiles and performed HMM-HMM alignments between related domain families. Where convincing alignments were achieved, we applied a roulette wheel-based method to design 3,611,010 protein-like sequences corresponding to 374 SCOP folds. To analyze their ability to link proteins in homology searches, we used 3024 queries to search two databases, one containing only natural sequences and another one additionally containing designed sequences. Our results showed that augmented database searches showed up to 30% improvement in fold coverage for over 74% of the folds, with 52 folds achieving all theoretically possible connections. Although sequences could not be designed between some families, the availability of designed sequences between other families within the fold established the sequence continuum to demonstrate 373 difficult relationships. Ultimately, as a practical and realistic extension, we demonstrate that such protein-like sequences can be ``plugged-into'' routine and generic sequence database searches to empower not only remote homology detection but also fold recognition. Our richly statistically supported findings show that complementary searches in both databases will increase the effectiveness of sequence-based searches in recognizing all homologues sharing a common fold. (C) 2013 Elsevier Ltd. All rights reserved.

Universal Conductance Fluctuations as a direct probe to valley coherence and universality class of disordered graphene

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We demonstrate that the universal conductance fluctuations (UCF) can be used as a direct probe to study the valley quantum states in disordered graphene. The UCF magnitude in graphene is suppressed by a factor of four at high carrier densities where the short-range disorder essentially breaks the valley degeneracy of the K and K' valleys, leading to a density dependent crossover of symmetry class from symplectic near the Dirac point to orthogonal at high densities.

Temperature dependent magnetic, dielectric and Raman studies of partially disordered La2NiMnO6

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report a detailed magnetic, dielectric and Raman studies on partially disordered and biphasic double perovskite La2NiMnO6. DC and AC magnetic susceptibility measurements show two magnetic anomalies at T-C1 similar to 270 K and T-C2 similar to 240 K, which may indicate the ferromagnetic ordering of the monoclinic and rhombohedral phases, respectively. A broad peak at a lower temperature (T-sg similar to 70 K) is also observed indicating a spin-glass transition due to partial anti-site disorder of Ni2+ and Mn4+ ions. Unlike the pure monoclinic phase, the biphasic compound exhibits a broad but a clear dielectric anomaly around 270 K which is a signature of magneto-dielectric effect. Temperature-dependent Raman studies between the temperature range 12-300 K in a wide spectral range from 220 cm(-1) to 1530 cm(-1) reveal a strong renormalization of the first as well as second-order Raman modes associated with the (Ni/Mn)O-6 octahedra near T-C1 implying a strong spin-phonon coupling. In addition, an anomaly is seen in the vicinity of spin-glass transition temperature in the temperature dependence of the frequency of the anti-symmetric stretching vibration of the octahedra. (C) 2014 Elsevier Ltd. All rights reserved.

Analytical Closed-Form Expressions for Harmonic Distortion Corresponding to Novel Switching Sequences for Neutral-Point-Clamped Inverters

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Analytical closed-form expressions for harmonic distortion factors corresponding to various pulsewidth modulation (PWM) techniques for a two-level inverter have been reported in the literature. This paper derives such analytical closed-form expressions, pertaining to centered space-vector PWM (CSVPWM) and eight different advanced bus-clamping PWM (ABCPWM) schemes, for a three-level neutral-point-clamped (NPC) inverter. These ABCPWM schemes switch each phase at twice the nominal switching frequency in certain intervals of the line cycle while clamping each phase to one of the dc terminals over certain other intervals. The harmonic spectra of the output voltages, corresponding to the eight ABCPWM schemes, are studied and compared experimentally with that of CSVPWM over the entire modulation range. The measured values of weighted total harmonic distortion (WTHD) of the line voltage V-WTHD are used to validate the analytical closed-form expressions derived. The analytical expressions, pertaining to two of the ABCPWM methods, are also validated by measuring the total harmonic distortion (THD) in the line current I-THD on a 2.2-kW constant volts-per-hertz induction motor drive.

Cubic Sieve Congruence of the Discrete Logarithm Problem, and fractional part sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The Cubic Sieve Method for solving the Discrete Logarithm Problem in prime fields requires a nontrivial solution to the Cubic Sieve Congruence (CSC) x(3) equivalent to y(2)z (mod p), where p is a given prime number. A nontrivial solution must also satisfy x(3) not equal y(2)z and 1 <= x, y, z < p(alpha), where alpha is a given real number such that 1/3 < alpha <= 1/2. The CSC problem is to find an efficient algorithm to obtain a nontrivial solution to CSC. CSC can be parametrized as x equivalent to v(2)z (mod p) and y equivalent to v(3)z (mod p). In this paper, we give a deterministic polynomial-time (O(ln(3) p) bit-operations) algorithm to determine, for a given v, a nontrivial solution to CSC, if one exists. Previously it took (O) over tilde (p(alpha)) time in the worst case to determine this. We relate the CSC problem to the gap problem of fractional part sequences, where we need to determine the non-negative integers N satisfying the fractional part inequality {theta N} < phi (theta and phi are given real numbers). The correspondence between the CSC problem and the gap problem is that determining the parameter z in the former problem corresponds to determining N in the latter problem. We also show in the alpha = 1/2 case of CSC that for a certain class of primes the CSC problem can be solved deterministically in <(O)over tilde>(p(1/3)) time compared to the previous best of (O) over tilde (p(1/2)). It is empirically observed that about one out of three primes is covered by the above class. (C) 2013 Elsevier B.V. All rights reserved.

Robust dielectric properties of B-site size-disordered hexagonal Ln(2)CuTiO(6) (Ln = Y, Dy, Ho, Er, and Yb)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Hexagonal Ln(2)CuTiO(6) (Ln = Y, Dy, Ho, Er, and Yb) exhibits a rare combination of interesting dielectric properties, in the form of relatively large dielectric constants (epsilon' > 30), low losses, and extremely small temperature and frequency dependencies, over large ranges of temperature and frequency Choudhury et al., Appl. Phys. Lett. 96, 162903 (2010) and Choudhury et al., Phys. Rev. B 82, 134203 (2010)], making these compounds promising as high-k dielectric materials. The authors present a brief review of the existing literature on this interesting class of oxides, complimenting it with spectroscopic data in conjunction with first-principles calculation results, revealing a novel mechanism underlying these robust dielectric properties. These show that the large size differences in Cu2+ and Ti4+ at the B-site, aided by an inherent random distribution of CuO5 and TiO5 polyhedral units, frustrates the ferroelectric instability, inherent to the noncentrosymmetric P6(3) cm space group of this system, and gives rise to the observed relatively large dielectric constant values. Additionally, the phononic contributions to the dielectric constant are dominated primarily by mid-frequency (>100 cm(-1)) polar modes, involving mainly Ti4+ 3d(0) ions. In contrast, the soft polar phonon modes with frequencies typically less than 100 cm(-1), usually responsible for dielectric properties of materials, are found to be associated with non-d(0) Cu2+ ions and to contribute very little, giving rise to the remarkable temperature stability of dielectric properties of these compounds. (C) 2014 American Vacuum Society.

Use of a structural alphabet to find compatible folds for amino acid sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as Protein Blocks (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales-up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa.

NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection

Relevância:

20.00% 20.00%

Publicador:

Resumo:

NrichD ( ext-link-type=''uri'' xlink:href=''http://proline.biochem.iisc.ernet.in/NRICHD/'' xlink:type=''simple''>http://proline.biochem.iisc.ernet.in/NRICHD/)< /named-content> is a database of computationally designed protein-like sequences, augmented into natural sequence databases that can perform hops in protein sequence space to assist in the detection of remote relationships. Establishing protein relationships in the absence of structural evidence or natural `intermediately related sequences' is a challenging task. Recently, we have demonstrated that the computational design of artificial intermediary sequences/linkers is an effective approach to fill naturally occurring voids in protein sequence space. Through a large-scale assessment we have demonstrated that such sequences can be plugged into commonly employed search databases to improve the performance of routinely used sequence search methods in detecting remote relationships. Since it is anticipated that such data sets will be employed to establish protein relationships, two databases that have already captured these relationships at the structural and functional domain level, namely, the SCOP database and the Pfam database, have been `enriched' with these artificial intermediary sequences. NrichD database currently contains 3 611 010 artificial sequences that have been generated between 27 882 pairs of families from 374 SCOP folds. The data sets are freely available for download. Additional features include the design of artificial sequences between any two protein families of interest to the user.

«
1
2
...
11
12
13
14
15
16
17
...
66
67
»