12 resultados para statistical significance

em National Center for Biotechnology Information - NCBI


Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this study, we estimate the statistical significance of structure prediction by threading. We introduce a single parameter ɛ that serves as a universal measure determining the probability that the best alignment is indeed a native-like analog. Parameter ɛ takes into account both length and composition of the query sequence and the number of decoys in threading simulation. It can be computed directly from the query sequence and potential of interactions, eliminating the need for sequence reshuffling and realignment. Although our theoretical analysis is general, here we compare its predictions with the results of gapless threading. Finally we estimate the number of decoys from which the native structure can be found by existing potentials of interactions. We discuss how this analysis can be extended to determine the optimal gap penalties for any sequence-structure alignment (threading) method, thus optimizing it to maximum possible performance.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The distribution of optimal local alignment scores of random sequences plays a vital role in evaluating the statistical significance of sequence alignments. These scores can be well described by an extreme-value distribution. The distribution’s parameters depend upon the scoring system employed and the random letter frequencies; in general they cannot be derived analytically, but must be estimated by curve fitting. For obtaining accurate parameter estimates, a form of the recently described ‘island’ method has several advantages. We describe this method in detail, and use it to investigate the functional dependence of these parameters on finite-length edge effects.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

A statistical modeling approach is proposed for use in searching large microarray data sets for genes that have a transcriptional response to a stimulus. The approach is unrestricted with respect to the timing, magnitude or duration of the response, or the overall abundance of the transcript. The statistical model makes an accommodation for systematic heterogeneity in expression levels. Corresponding data analyses provide gene-specific information, and the approach provides a means for evaluating the statistical significance of such information. To illustrate this strategy we have derived a model to depict the profile expected for a periodically transcribed gene and used it to look for budding yeast transcripts that adhere to this profile. Using objective criteria, this method identifies 81% of the known periodic transcripts and 1,088 genes, which show significant periodicity in at least one of the three data sets analyzed. However, only one-quarter of these genes show significant oscillations in at least two data sets and can be classified as periodic with high confidence. The method provides estimates of the mean activation and deactivation times, induced and basal expression levels, and statistical measures of the precision of these estimates for each periodic transcript.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

We present an approach for assessing the significance of sequence and structure comparisons by using nearly identical statistical formalisms for both sequence and structure. Doing so involves an all-vs.-all comparison of protein domains [taken here from the Structural Classification of Proteins (scop) database] and then fitting a simple distribution function to the observed scores. By using this distribution, we can attach a statistical significance to each comparison score in the form of a P value, the probability that a better score would occur by chance. As expected, we find that the scores for sequence matching follow an extreme-value distribution. The agreement, moreover, between the P values that we derive from this distribution and those reported by standard programs (e.g., blast and fasta validates our approach. Structure comparison scores also follow an extreme-value distribution when the statistics are expressed in terms of a structural alignment score (essentially the sum of reciprocated distances between aligned atoms minus gap penalties). We find that the traditional metric of structural similarity, the rms deviation in atom positions after fitting aligned atoms, follows a different distribution of scores and does not perform as well as the structural alignment score. Comparison of the sequence and structure statistics for pairs of proteins known to be related distantly shows that structural comparison is able to detect approximately twice as many distant relationships as sequence comparison at the same error rate. The comparison also indicates that there are very few pairs with significant similarity in terms of sequence but not structure whereas many pairs have significant similarity in terms of structure but not sequence.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In populations that are small and asexual, mutations with slight negative effects on fitness will drift to fixation more often than in large or sexual populations in which they will be eliminated by selection. If such mutations occur in substantial numbers, the combined effects of long-term asexuality and small population size may result in substantial accumulation of mildly deleterious substitutions. Prokaryotic endosymbionts of animals that are transmitted maternally for very long periods are effectively asexual and experience smaller effective population size than their free-living relatives. The contrast between such endosymbionts and related free-living bacteria allows us to test whether a population structure imposing frequent bottlenecks and asexuality does lead to an accumulation of slightly deleterious substitutions. Here we show that several independently derived insect endosymbionts, each with a long history of maternal transmission, have accumulated destabilizing base substitutions in the highly conserved 16S rRNA. Stabilities of Domain I of this subunit are 15–25% lower in endosymbionts than in closely related free-living bacteria. By mapping destabilizing substitutions onto a reconstructed phylogeny, we show that decreased ribosomal stability has evolved separately in each endosymbiont lineage. Our phylogenetic approach allows us to demonstrate statistical significance for this pattern: becoming endosymbiotic predictably results in decreased stability of rRNA secondary structure.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The physical validity of the hypothesis of (redshift-dependent) luminosity evolution in galaxies is tested by statistical analysis of an intensively studied complete high-redshift sample of normal galaxies. The necessity of the evolution hypothesis in the frame of big-bang cosmology is confirmed at a high level of statistical significance; however, this evolution is quantitatively just as predicted by chronometric cosmology, in which there is no such evolution. Since there is no direct observational means to establish the evolution postulated in big-bang studies of higher-redshift galaxies, and the chronometric predictions involve no adjustable parameters (in contrast to the two in big-bang cosmology), the hypothesized evolution appears from the standpoint of conservative scientific methodology as a possible theoretical artifact.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The relative abundance of alternatively spliced long (γ2L) and short (γ2S) mRNAs of the γ2 subunit of the γ-amino butyrate type A (GABAA) receptor was examined in dorsolateral prefrontal cortex of schizophrenics and matched controls by using in situ hybridization histochemistry and semiquantitative reverse transcription–PCR (RT-PCR) amplification. A cRNA probe identifying both mRNAs showed that the transcripts are normally expressed at moderately high levels in the prefrontal cortex. Consistent with previous studies, overall levels of γ2 transcripts in prefrontal cortex of brains from schizophrenics were reduced by 28.0%, although this reduction did not reach statistical significance. RT-PCR, performed under nonsaturating conditions on total RNA from the same blocks of tissue used for in situ hybridization histochemistry, revealed a marked reduction in the relative proportion of γ2S transcripts in schizophrenic brains compared with controls. In schizophrenics, γ2S transcripts had fallen to 51.7% (±7.9% SE; P < 0.0001) relative to control levels. Levels of γ2L transcripts showed only a small and nonsignificant reduction of 16.9% (±12.0% SE, P > 0.05). These findings indicate differential transcriptional regulation of two functionally distinct isoforms of one of the major GABAA receptor subunits in the prefrontal cortex of schizophrenics. The specific reduction in relative abundance of γ2S mRNAs and the associated relative increase in γ2L mRNAs should result in functionally less active GABAA receptors and have severe consequences for cortical integrative function.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The energy of DNA deformation plays a crucial and active role in its packaging and its function in the cell. Considerable effort has gone into developing methodologies capable of evaluating the local sequence-directed curvature and flexibility of a DNA chain. These studies thus far have focused on DNA constructs expressly tailored either with anomalous flexibility or curvature tracts. Here we demonstrate that these two structural properties can be mapped also along the chain of a “natural” DNA with any sequence on the basis of its scanning force microscope (SFM) images. To know the orientation of the sequence of the investigated DNA molecules in their SFM images, we prepared a palindromic dimer of the long DNA molecule under study. The palindromic symmetry also acted as an internal gauge of the statistical significance of the analysis carried out on the SFM images of the dimer molecules. It was found that although the curvature modulus is not efficient in separating static and dynamic contributions to the curvature of the population of molecules, the curvature taken with its direction (its sign in two dimensions) permits the direct separation of the intrinsic curvature from the flexibility contributions. The sequence-dependent flexibility seems to vary monotonically with the chain's intrinsic curvature; the chain rigidity was found to modulate as its local thermodynamic stability and does not correlate with the dinucleotide chain rigidities evaluation made from x-ray data by other authors.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

There is a need for faster and more sensitive algorithms for sequence similarity searching in view of the rapidly increasing amounts of genomic sequence data available. Parallel processing capabilities in the form of the single instruction, multiple data (SIMD) technology are now available in common microprocessors and enable a single microprocessor to perform many operations in parallel. The ParAlign algorithm has been specifically designed to take advantage of this technology. The new algorithm initially exploits parallelism to perform a very rapid computation of the exact optimal ungapped alignment score for all diagonals in the alignment matrix. Then, a novel heuristic is employed to compute an approximate score of a gapped alignment by combining the scores of several diagonals. This approximate score is used to select the most interesting database sequences for a subsequent Smith–Waterman alignment, which is also parallelised. The resulting method represents a substantial improvement compared to existing heuristics. The sensitivity and specificity of ParAlign was found to be as good as Smith–Waterman implementations when the same method for computing the statistical significance of the matches was used. In terms of speed, only the significantly less sensitive NCBI BLAST 2 program was found to outperform the new approach. Online searches are available at http://dna.uio.no/search/

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We discuss two tests of the hypothesis that the first genes were assembled from exons. The hypothesis of exon shuffling in the progenote predicts that intron phases will be correlated so that exons will be an integer number of codons and predicts that the exons will be correlated with compact regions of polypeptide chain. These predictions have been tested on ancient conserved proteins (proteins without introns in prokaryotes but with introns in eukaryotes) and hold with high statistical significance. We conclude that introns are correlated with compact features of proteins 15-, 22-, or 30-amino acid residues long, as was predicted by “The Exon Theory of Genes.”

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Dystrophic cardiac calcinosis, an age-related cardiomyopathy that occurs among certain inbred strains of mice, involves myocardial injury, necrosis, and calcification. Using a complete linkage map approach and quantitative trait locus analysis, we sought to identify genetic loci determining dystrophic cardiac calcinosis in an F2 intercross of resistant C57BL/6J and susceptible C3H/HeJ inbred strains. We identified a single major locus, designated Dyscalc, located on proximal chromosome 7 in a region syntenic with human chromosomes 19q13 and 11p15. The statistical significance of Dyscalc (logarithm of odds score 14.6) was tested by analysis of permuted trait data. Analysis of BxH recombinant inbred strains confirmed the mapping position. The inheritance pattern indicated that this locus influences susceptibility of cells both to enter necrosis and to subsequently undergo calcification.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Myofibroblasts, defined by their expression of smooth muscle alpha-actin, appear at corneal and dermal incisions and promote wound contraction. We report here that cultured fibroblasts differentiate into myofibroblasts by a cell density-dependent mechanism. Fibroblasts seeded at low density (5 cells per mm2) produced a cell culture population consisting of 70-80% myofibroblasts, 5-7 days after seeding. In contrast, fibroblasts seeded at high density (500 cells per mm2) produced cultures with only 5-10% myofibroblasts. When the myofibroblast-enriched cultures were subsequently passaged at high density, the smooth muscle alpha-actin phenotype was lost within 3 days. Furthermore, initially 60% of the low density-cultured cells incorporated BrdUrd compared to 30% of cells passaged at high density. Media from myofibroblast-enriched cultures had more latent and active transforming growth factor beta (TGF-beta) than did media from fibroblast-enriched cultures. Although there was a trend towards increased numbers of myofibroblasts after addition of exogenous TGF-beta, the results did not reach statistical significance. We conclude that myofibroblast differentiation can be induced in fibroblasts by plating at low density. We propose a cell density-dependent model of myofibroblast differentiation during wounding and healing in which at least two factors interact: loss of cell contact and the presence of TGF-beta.