44 resultados para NUCLEOTIDE-SEQUENCES


Relevância:

20.00% 20.00%

Publicador:

Resumo:

We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

MicroRNAs (miRNA) are recognized posttranscriptional gene repressors involved in the control of almost every biological process. Allelic variants in these regions may be an important source of phenotypic diversity and contribute to disease susceptibility. We analyzed the genomic organization of 325 human miRNAs (release 7.1, miRBase) to construct a panel of 768 single-nucleotide polymorphisms (SNPs) covering approximately 1 Mb of genomic DNA, including 131 isolated miRNAs (40%) and 194 miRNAs arranged in 48 miRNA clusters, as well as their 5-kb flanking regions. Of these miRNAs, 37% were inside known protein-coding genes, which were significantly associated with biological functions regarding neurological, psychological or nutritional disorders. SNP coverage analysis revealed a lower SNP density in miRNAs compared with the average of the genome, with only 24 SNPs located in the 325 miRNAs studied. Further genotyping of 340 unrelated Spanish individuals showed that more than half of the SNPs in miRNAs were either rare or monomorphic, in agreement with the reported selective constraint on human miRNAs. A comparison of the minor allele frequencies between Spanish and HapMap population samples confirmed the applicability of this SNP panel to the study of complex disorders among the Spanish population, and revealed two miRNA regions, hsa-mir-26a-2 in the CTDSP2 gene and hsa-mir-128-1 in the R3HDM1 gene, showing geographical allelic frequency variation among the four HapMap populations, probably because of differences in natural selection. The designed miRNA SNP panel could help to identify still hidden links between miRNAs and human disease.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Single nucleotide polymorphisms (SNPs) are the most frequent type of sequence variation between individuals, and represent a promising tool for finding genetic determinants of complex diseases and understanding the differences in drug response. In this regard, it is of particular interest to study the effect of non-synonymous SNPs in the context of biological networks such as cell signalling pathways. UniProt provides curated information about the functional and phenotypic effects of sequence variation, including SNPs, as well as on mutations of protein sequences. However, no strategy has been developed to integrate this information with biological networks, with the ultimate goal of studying the impact of the functional effect of SNPs in the structure and dynamics of biological networks. Results: First, we identified the different challenges posed by the integration of the phenotypic effect of sequence variants and mutations with biological networks. Second, we developed a strategy for the combination of data extracted from public resources, such as UniProt, NCBI dbSNP, Reactome and BioModels. We generated attribute files containing phenotypic and genotypic annotations to the nodes of biological networks, which can be imported into network visualization tools such as Cytoscape. These resources allow the mapping and visualization of mutations and natural variations of human proteins and their phenotypic effect on biological networks (e.g. signalling pathways, protein-protein interaction networks, dynamic models). Finally, an example on the use of the sequence variation data in the dynamics of a network model is presented. Conclusion: In this paper we present a general strategy for the integration of pathway and sequence variation data for visualization, analysis and modelling purposes, including the study of the functional impact of protein sequence variations on the dynamics of signalling pathways. This is of particular interest when the SNP or mutation is known to be associated to disease. We expect that this approach will help in the study of the functional impact of disease-associated SNPs on the behaviour of cell signalling pathways, which ultimately will lead to a better understanding of the mechanisms underlying complex diseases.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Sequential randomized prediction of an arbitrary binary sequence isinvestigated. No assumption is made on the mechanism of generating the bit sequence. The goal of the predictor is to minimize its relative loss, i.e., to make (almost) as few mistakes as the best ``expert'' in a fixed, possibly infinite, set of experts. We point out a surprising connection between this prediction problem and empirical process theory. First, in the special case of static (memoryless) experts, we completely characterize the minimax relative loss in terms of the maximum of an associated Rademacher process. Then we show general upper and lower bounds on the minimaxrelative loss in terms of the geometry of the class of experts. As main examples, we determine the exact order of magnitude of the minimax relative loss for the class of autoregressive linear predictors and for the class of Markov experts.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We consider adaptive sequential lossy coding of bounded individual sequences when the performance is measured by the sequentially accumulated mean squared distortion. Theencoder and the decoder are connected via a noiseless channel of capacity $R$ and both are assumed to have zero delay. No probabilistic assumptions are made on how the sequence to be encoded is generated. For any bounded sequence of length $n$, the distortion redundancy is defined as the normalized cumulative distortion of the sequential scheme minus the normalized cumulative distortion of the best scalarquantizer of rate $R$ which is matched to this particular sequence. We demonstrate the existence of a zero-delay sequential scheme which uses common randomization in the encoder and the decoder such that the normalized maximum distortion redundancy converges to zero at a rate $n^{-1/5}\log n$ as the length of the encoded sequence $n$ increases without bound.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We construct spectral sequences in the framework of Baues-Wirsching cohomology and homology for functors between small categories and analyze particular cases including Grothendieck fibrations. We also give applications to more classical cohomology and homology theories including Hochschild-Mitchell cohomology and those studied before by Watts, Roos, Quillen and others

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The turbot (Scophthalmus maximus) is a commercially valuable flatfish and one of the most promising aquaculture species in Europe. Two transcriptome 454-pyrosequencing runs were used in order to detect Single Nucleotide Polymorphisms (SNPs) in genesrelated to immune response and gonad differentiation. A total of 866 true SNPs were detected in 140 different contigs representing 262,093 bp as a whole. Only one true SNP was analyzed in each contig. One hundred and thirteen SNPs out of the 140 analyzed were feasible (genotyped), while Ш were polymorphic in a wild population. Transition/transversion ratio (1.354) was similar to that observed in other fish studies. Unbiased gene diversity (He) estimates ranged from 0.060 to 0.510 (mean = 0.351), minimum allele frequency (MAF) from 0.030 to 0.500 (mean = 0.259) and all loci were in Hardy-Weinberg equilibrium after Bonferroni correction. A large number of SNPs (49) were located in the coding region, 33 representing synonymous and 16 non-synonymous changes. Most SNP-containing genes were related to immune response and gonad differentiation processes, and could be candidates for functional changes leading to phenotypic changes. These markers will be useful for population screening to look for adaptive variation in wild and domestic turbot

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Sponge morphological plasticity has been a long-standing source of taxonomic difficulty. In the Caribbean, several morphotypes of the sponge Callyspongia vaginalis have been observed. To determine the taxonomic status of three of these morphotypes and their relationship with the congeneric species C. plicifera and C. fallax, we compared the spicule composition, spongin fiber skeleton and sequenced fragments of the mitochondrial genes 16S and COI and nuclear genes 28S and 18S ribosomal RNA. Phylogenetic analyses with ribosomal markers 18S and 28S rRNA confirmed the position of our sequences within the Callyspongiidae. None of the genetic markers provided evidence for consistent differentiation among the three morphotypes of C. vaginalis and C. fallax, and only C. plicifera stood as a distinct species. The 16S mtDNA gene was the most variable molecular marker for this group, presenting a nucleotide variability (π = 0.024) higher than that reported for COI. Unlike recent studies for other sponge genera, our results indicate that species in the genus Callyspongia maintain a high degree of phenotypic plasticity, and that morphological characteristics may not reflect reproductive boundaries in C. vaginalis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The alternatives used for minimizing the usage of chlorine dioxide in bleaching sequences included a hot acid hydrolysis (Ahot) stage, the use of hot chlorine dioxide (Dhot) and ozone stages at medium consistency and high consistency (Zmc and Zhc), in addition to stages with atmospheric hydrogen peroxide (P) and pressurized hydrogen peroxide (PO). The results were interpreted based on the cost of the chemical products, bleaching process yields and on minimizing the environmental impact of the bleaching process. In spite of some process restrictions, high ISO brightness levels were kept around 90 % brightness. Additionally, the inclusion of stages like acid hydrolysis, pressurized peroxide and ozone in the bleaching sequences provided an increase in operating flexibility, aimed at reducing environmental impact (ECF Light). The Dhot(EOP)D(PO) sequence presented lower operating cost for ISO brightness above 92 %. However, this kind of sequence was not allowed for closing the wastewater circuit, even partially. For ISO brightness level around 91%, the AhotZhcDP sequence presented a lower operating cost than the others

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We provide a description of the interpolating and sampling sequences on a space of holomorphic functions on a finite Riemann surface, where a uniform growth restriction is imposed on the holomorphic functions.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Consider the celebrated Lyness recurrence $x_{n+2}=(a+x_{n+1})/x_{n}$ with $a\in\Q$. First we prove that there exist initial conditions and values of $a$ for which it generates periodic sequences of rational numbers with prime periods $1,2,3,5,6,7,8,9,10$ or $12$ and that these are the only periods that rational sequences $\{x_n\}_n$ can have. It is known that if we restrict our attention to positive rational values of $a$ and positive rational initial conditions the only possible periods are $1,5$ and $9$. Moreover 1-periodic and 5-periodic sequences are easily obtained. We prove that for infinitely many positive values of $a,$ positive 9-period rational sequences occur. This last result is our main contribution and answers an open question left in previous works of Bastien \& Rogalski and Zeeman. We also prove that the level sets of the invariant associated to the Lyness map is a two-parameter family of elliptic curves that is a universal family of the elliptic curves with a point of order $n, n\ge5,$ including $n$ infinity. This fact implies that the Lyness map is a universal normal form for most birrational maps on elliptic curves.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Image registration has been proposed as an automatic method for recovering cardiac displacement fields from Tagged Magnetic Resonance Imaging (tMRI) sequences. Initially performed as a set of pairwise registrations, these techniques have evolved to the use of 3D+t deformation models, requiring metrics of joint image alignment (JA). However, only linear combinations of cost functions defined with respect to the first frame have been used. In this paper, we have applied k-Nearest Neighbors Graphs (kNNG) estimators of the -entropy (H ) to measure the joint similarity between frames, and to combine the information provided by different cardiac views in an unified metric. Experiments performed on six subjects showed a significantly higher accuracy (p < 0.05) with respect to a standard pairwise alignment (PA) approach in terms of mean positional error and variance with respect to manually placed landmarks. The developed method was used to study strains in patients with myocardial infarction, showing a consistency between strain, infarction location, and coronary occlusion. This paper also presentsan interesting clinical application of graph-based metric estimators, showing their value for solving practical problems found in medical imaging.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The repetitive DNA sequences found at telomeres and centromeres play a crucial role in the structure and function of eukaryotic chromosomes. This role may be related to the tendency observed in many repetitive DNAs to adopt non-canonical structures. Although there is an increasing recognition of the importance of DNA quadruplexes in chromosome biology, the co-existence of different quadruplex-forming elements in the same DNA structure is still a matter of debate. Here we report the structural study of the oligonucleotide d(TCGTTTCGT) and its cyclic analog d. Both sequences form dimeric quadruplex structures consisting of a minimal i-motif capped, at both ends, by a slipped minor groove-aligned G:T:G:T tetrad. These mini i-motifs, which do not exhibit the characteristic CD spectra of other i-motif structures, can be observed at neutral pH, although they are more stable under acidic conditions. This finding is particularly relevant since these oligonucleotide sequences do not contain contiguous cytosines. Importantly, these structures resemble the loop moiety adopted by an 11-nucleotide fragment of the conserved centromeric protein B (CENP-B) box motif, which is the binding site for the CENP-B.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The repetitive DNA sequences found at telomeres and centromeres play a crucial role in the structure and function of eukaryotic chromosomes. This role may be related to the tendency observed in many repetitive DNAs to adopt non-canonical structures. Although there is an increasing recognition of the importance of DNA quadruplexes in chromosome biology, the co-existence of different quadruplex-forming elements in the same DNA structure is still a matter of debate. Here we report the structural study of the oligonucleotide d(TCGTTTCGT) and its cyclic analog d. Both sequences form dimeric quadruplex structures consisting of a minimal i-motif capped, at both ends, by a slipped minor groove-aligned G:T:G:T tetrad. These mini i-motifs, which do not exhibit the characteristic CD spectra of other i-motif structures, can be observed at neutral pH, although they are more stable under acidic conditions. This finding is particularly relevant since these oligonucleotide sequences do not contain contiguous cytosines. Importantly, these structures resemble the loop moiety adopted by an 11-nucleotide fragment of the conserved centromeric protein B (CENP-B) box motif, which is the binding site for the CENP-B.