21 results for NUCLEOTIDE EXCISION


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Theoretical and empirical studies were conducted on the pattern of nucleotide and amino acid substitution in evolution, taking into account the effects of mutation at the nucleotide level and purifying selection at the amino acid level. A theoretical model for predicting the evolutionary change in electrophoretic mobility of a protein was also developed by using information on the pattern of amino acid substitution. The specific problems studied and the main results obtained are as follows: (1) Estimation of the pattern of nucleotide substitution in DNA nuclear genomes. The patterns of point mutation and nucleotide substitution among the four nucleotides are inferred from the evolutionary changes of pseudogenes and functional genes, respectively. Both patterns are non-random: the rate of change varies considerably with the nucleotide pair, and in both cases transitions occur somewhat more frequently than transversions. In protein evolution, substitution occurs more often between amino acids with similar physico-chemical properties than between dissimilar amino acids. (2) Estimation of the pattern of nucleotide substitution in RNA genomes. The majority of mutations in retroviruses accumulate at the reverse transcription stage. Selection at the amino acid level is very weak, and almost non-existent between synonymous codons. The pattern of mutation is very different from that in DNA genomes. Nevertheless, the pattern of purifying selection at the amino acid level is similar to that in DNA genomes, although selection intensity is much weaker. (3) Evaluation of the determinants of molecular evolutionary rates in protein-coding genes. Based on rates of nucleotide substitution for mammalian genes, the rate of amino acid substitution of a protein is determined by its amino acid composition. The content of glycine is shown to correlate strongly and negatively with the rate of substitution. 
Empirical formulae, called indices of mutability, are developed in order to predict the rate of molecular evolution of a protein from data on its amino acid sequence. (4) Studies on the evolutionary patterns of electrophoretic mobility of proteins. A theoretical model was constructed that predicts the electric charge of a protein at any given pH and its isoelectric point from data on its primary and quaternary structures. Using this model, the evolutionary change in electrophoretic mobilities of different proteins and the expected amount of electrophoretically hidden genetic variation were studied. In the absence of selection for the pI value, proteins will on average evolve toward a mildly basic pI. (Abstract shortened with permission of author.)
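The abstract does not describe its charge model in detail, so the following is only a minimal sketch of the general idea: estimating the net charge of a protein at a given pH, and from that its isoelectric point (pI), using the Henderson-Hasselbalch equation. The pKa values are approximate textbook figures chosen for illustration, the function names are hypothetical, and quaternary structure (which the dissertation's model takes into account) is ignored here.

```python
# Approximate textbook pKa values; these are assumptions for illustration,
# not the values used in the dissertation.
POSITIVE_PKA = {"K": 10.5, "R": 12.5, "H": 6.0}            # basic side chains
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}   # acidic side chains
N_TERM_PKA = 9.0
C_TERM_PKA = 2.0


def net_charge(sequence, ph):
    """Estimate the net charge of a one-letter amino acid sequence at a pH."""
    # The N-terminal amino group carries +1 when protonated.
    charge = 1.0 / (1.0 + 10 ** (ph - N_TERM_PKA))
    # The C-terminal carboxyl group carries -1 when deprotonated.
    charge -= 1.0 / (1.0 + 10 ** (C_TERM_PKA - ph))
    for residue in sequence.upper():
        if residue in POSITIVE_PKA:
            charge += 1.0 / (1.0 + 10 ** (ph - POSITIVE_PKA[residue]))
        elif residue in NEGATIVE_PKA:
            charge -= 1.0 / (1.0 + 10 ** (NEGATIVE_PKA[residue] - ph))
    return charge


def isoelectric_point(sequence, low=0.0, high=14.0, tol=1e-4):
    """Find the pH in [low, high] at which the estimated net charge is zero."""
    # Net charge decreases monotonically with pH, so bisection is sufficient.
    while high - low > tol:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0
```

Under these assumptions, a lysine-rich peptide yields a higher estimated pI than an aspartate-rich one, and replacing a charged residue with a neutral one shifts the estimated charge, which is the kind of change that alters electrophoretic mobility.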

Relevance:

20.00% 20.00%

Publisher:

Abstract:

With hundreds of single nucleotide polymorphisms (SNPs) in a candidate gene and millions of SNPs across the genome, selecting an informative subset of SNPs to maximize the ability to detect genotype-phenotype association is of great interest and importance. In addition, with a large number of SNPs, analytic methods are needed that allow investigators to control the false positive rate resulting from large numbers of SNP genotype-phenotype analyses. This dissertation uses simulated data to explore methods for selecting SNPs for genotype-phenotype association studies. I examined the pattern of linkage disequilibrium (LD) across a candidate gene region and used this pattern to aid in localizing a disease-influencing mutation. The results indicate that the r² measure of linkage disequilibrium is preferred over the common D′ measure for use in genotype-phenotype association studies. Using step-wise linear regression, the best predictor of the quantitative trait was not usually the single functional mutation. Rather, it was a SNP that was in high linkage disequilibrium with the functional mutation. Next, I compared three strategies for selecting SNPs for application to phenotype association studies: based on measures of linkage disequilibrium, based on a measure of haplotype diversity, and random selection. The results demonstrate that SNPs selected based on maximum haplotype diversity are more informative and yield higher power than randomly selected SNPs or SNPs selected based on low pair-wise LD. The data also indicate that for genes with small contribution to the phenotype, it is more prudent for investigators to increase their sample size than to continuously increase the number of SNPs in order to improve statistical power. When typing large numbers of SNPs, researchers are faced with the challenge of utilizing an appropriate statistical method that controls the type I error rate while maintaining adequate power. 
We show that an empirical genotype based multi-locus global test that uses permutation testing to investigate the null distribution of the maximum test statistic maintains a desired overall type I error rate while not overly sacrificing statistical power. The results also show that when the penetrance model is simple the multi-locus global test does as well as or better than the haplotype analysis. However, for more complex models, haplotype analyses offer advantages. The results of this dissertation will be of utility to human geneticists designing large-scale multi-locus genotype-phenotype association studies.
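The two linkage disequilibrium measures compared in this abstract can be illustrated with a short sketch using their standard definitions for a pair of biallelic loci. The function name and the example frequencies are hypothetical and are not taken from the dissertation's simulations.

```python
def ld_measures(p_ab, p_a, p_b):
    """Return (D', r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a:  frequency of allele A at the first locus
    p_b:  frequency of allele B at the second locus
    """
    # Raw disequilibrium coefficient.
    d = p_ab - p_a * p_b
    # D' scales |D| by the largest value it could take given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    # r^2 is the squared correlation between alleles at the two loci.
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r_squared = d * d / denom if denom > 0 else 0.0
    return d_prime, r_squared


# Hypothetical example: a rare allele A that occurs only on B-bearing haplotypes.
d_prime, r_squared = ld_measures(p_ab=0.1, p_a=0.1, p_b=0.5)
```

In this hypothetical case D′ is at its maximum while r² is low, because r² also reflects how well one locus predicts the other when allele frequencies differ. That property is one commonly cited reason for preferring r² when the goal is to detect association through a proxy SNP.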

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Systemic sclerosis (SSc), or scleroderma, is a complex disease whose etiopathogenesis remains unelucidated. Fibrosis in multiple organs is a key feature of SSc, and studies have shown that the transforming growth factor-β (TGF-β) pathway has a crucial role in fibrotic responses. For a complex disease such as SSc, expression quantitative trait loci (eQTL) analysis is a powerful tool for identifying genetic variations that affect expression of genes involved in the disease. In this study, a multilevel model is described to perform a multivariate eQTL analysis for identifying genetic variation (SNPs) specifically associated with the expression of three members of the TGF-β pathway: CTGF, SPARC and COL3A1. The uniqueness of this model is that all three genes were included in one model, rather than one gene being examined at a time. A protein might contribute to multiple pathways, and this approach allows the identification of important genetic variations linked to multiple genes belonging to the same pathway. In this study, 29 SNPs were identified, 16 of which are located in known genes. Exploring the roles of these genes in TGF-β regulation will help elucidate the etiology of SSc, which will in turn help to better manage this complex disease.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Like other simple retroviruses, the murine sarcoma virus ts110 (MuSVts110) displays an inefficient mode of genome splicing. But, unlike the splicing phenotype of other retroviruses, the splicing event effected upon the transcript of MuSVts110 is temperature sensitive. Previous work in this laboratory has established that the conditionally defective nature of MuSVts110 RNA splicing is mediated in cis by features in the viral transcript. Here we show that the 5′ splice site of the MuSVts110 transcript acts as a point of control of the overall splicing efficiency at both permissive and nonpermissive temperatures for splicing. We strengthened and simultaneously weakened the nucleotide structure of the 5′ splice site in an attempt to elucidate the differential effects that each of the two known critical splicing components that interact with the 5′ splice site has on the overall efficiency of intron excision. We found that a transversion of the sixth nucleotide, resulting in the formation of a near-consensus 5′ splice site, dramatically increased the overall efficiency of MuSVts110 RNA splicing and abrogated the thermosensitive nature of this splicing event. Various secondary mutations within this original transversion mutant, designed to selectively decrease specific splicing component interactions, led to recovery of inefficient and thermosensitive splicing. We have further shown that a sequence of 415 nucleotides lying in the downstream exon of the viral RNA and hypothesized to act as an element in the temperature-dependent inhibition of splicing displays a functional redundancy throughout its length; loss and/or replacement of any one sequence of 100 nucleotides within this sequence does not, with one exception detailed below, diminish the degree to which MuSVts110 RNA is inhibited from splicing at the restrictive temperature. 
One specific deletion, though, fortuitously juxtaposed and activated cryptic consensus splicing signals for the excision of a cryptic intron within the downstream exon and markedly potentiated, across a newly defined cryptic exon, the splicing event effected upon the upstream, native intron. We have exploited this mutant of MuSVts110 to further an understanding of the process of exon definition and intron definition and show that the polypyrimidine tract and consensus 3′ splice site, as well as the 5′ splice site, within the intron at the 3′ flank of the defined exon are required for the exon's definition, implying that definition of the downstream intron is required for the in vivo definition of the proximal, upstream exon. Finally, we have shown, through the construction of heterologous mutants of MuSVts110 employing a foreign 3′ end-forming sequence, that efficiency of transcript splicing can be increased, to a degree which abrogates its thermosensitive nature, in direct proportion to increasing proximity of the 3′ end-forming signal to the terminal 3′ splice site.