28 resultados para DNA data banks


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Academic and industrial research in the late 90s have brought about an exponential explosion of DNA sequence data. Automated expert systems are being created to help biologists to extract patterns, trends and links from this ever-deepening ocean of information. Two such systems aimed on retrieving and subsequently utilizing phylogenetically relevant information have been developed in this dissertation, the major objective of which was to automate the often difficult and confusing phylogenetic reconstruction process. ^ Popular phylogenetic reconstruction methods, such as distance-based methods, attempt to find an optimal tree topology (that reflects the relationships among related sequences and their evolutionary history) by searching through the topology space. Various compromises between the fast (but incomplete) and exhaustive (but computationally prohibitive) search heuristics have been suggested. An intelligent compromise algorithm that relies on a flexible “beam” search principle from the Artificial Intelligence domain and uses the pre-computed local topology reliability information to adjust the beam search space continuously is described in the second chapter of this dissertation. ^ However, sometimes even a (virtually) complete distance-based method is inferior to the significantly more elaborate (and computationally expensive) maximum likelihood (ML) method. In fact, depending on the nature of the sequence data in question either method might prove to be superior. Therefore, it is difficult (even for an expert) to tell a priori which phylogenetic reconstruction method—distance-based, ML or maybe maximum parsimony (MP)—should be chosen for any particular data set. ^ A number of factors, often hidden, influence the performance of a method. For example, it is generally understood that for a phylogenetically “difficult” data set more sophisticated methods (e.g., ML) tend to be more effective and thus should be chosen. However, it is the interplay of many factors that one needs to consider in order to avoid choosing an inferior method (potentially a costly mistake, both in terms of computational expenses and in terms of reconstruction accuracy.) ^ Chapter III of this dissertation details a phylogenetic reconstruction expert system that selects a superior proper method automatically. It uses a classifier (a Decision Tree-inducing algorithm) to map a new data set to the proper phylogenetic reconstruction method. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

DNA sequence variation is currently a major source of data for studying human origins, evolution, and demographic history, and for detecting linkage association of complex diseases. In this dissertation, I investigated DNA variation in worldwide populations from two ∼10 kb autosomal regions on 22q11.2 (noncoding) and 1q24 (introns). A total of 75 variant sites were found among 128 human sequences in the 22q11.2 region, yielding an estimate of 0.088% for nucleotide diversity (π), and a total of 52 variant sites were found among 122 human sequences in the 1q24 region with an estimated π value of 0.057%. The data from these two regions and a 10 kb noncoding region on Xq13.3 all show a strong excess of low-frequency variants in comparison to that expected from an equilibrium population, indicating a relatively recent population expansion. The effective population sizes estimated from the three regions were 11,000, 12,700, and 8,600, respectively, which are close to the commonly used value of 10,000. In each of the two autosomal regions, the age of the most recent common ancestor (MRCA) was estimated to be older than 1 million years among all the sequences and ∼600,000 years among non-African sequences, providing first evidence from autosomal noncoding or intronic regions for a genetic history of humans much more ancient than the emergence of modern humans. The ancient genetic history of humans indicates no severe bottleneck during the evolution of humans in the last half million years; otherwise, much of the ancient genetic history would have been lost during a severe bottleneck. This study strongly suggests that both the “out of Africa” and the multiregional models are too simple for explaining the evolution of modern humans. A compilation of genome-wide data revealed that nucleotide diversity is highest in autosomal regions, intermediate in X-linked regions, and lowest in Y-linked regions. The data suggest the existence of background selection or selective sweep on Y-linked loci. In general, the nucleotide diversity in humans is low compared to that in chimpanzee and Drosophila populations. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

DNA interstrand crosslinks (ICLs) are among the most toxic type of damage to a cell. Many ICL-inducing agents are widely used as therapeutic agents, e.g. cisplatin, psoralen. A bettor understanding of the cellular mechanism that eliminates ICLs is important for the improvement of human health. However, ICL repair is still poorly understood in mammals. Using a triplex-directed site-specific ICL model, we studied the roles of mismatch repair (MMR) proteins in ICL repair in human cells. We are also interested in using psoralen-conjugated triplex-forming oligonucleotides (TFOs) to direct ICLs to a specific site in targeted DNA and in the mammalian genomes. ^ MSH2 protein is the common subunit of two MMR recognition complexes, and MutSα and MutSβ. We showed that MSH2 deficiency renders human cell hypersensitive to psoralen ICLs. MMR recognition complexes bind specifically to triplex-directed psoralen ICLs in vitro. Together with the fact that psoralen ICL-induced repair synthesis is dramatically decreased in MSH2 deficient cell extracts, we demonstrated that MSH2 function is critical for the recognition and processing of psoralen ICLs in human cells. Interestingly, lack of MSH2 does not reduce the level of psoralen ICL-induced mutagenesis in human cells, suggesting that MSH2 does not contribute to error-generating repair of psoralen ICLs, and therefore, may represent a novel error-free mechanism for repairing ICLs. We also studied the role of MLH1, anther key protein in MMR, in the processing of psoralen ICLs. MLH1-deficient human cells are more resistant to psoralen plus UVA treatment. Importantly, MLH1 function is not required for the mutagenic repair of psoralen ICLs, suggesting that it is not involved in the error-generating repair of this type of DNA damage in human cells. ^ These are the first data indicating mismatch repair proteins may participate in a relatively error-free mechanism for processing psoralen ICL in human cells. Enhancement of MMR protein function relative to nucleotide excision repair proteins may reduce the mutagenesis caused by DNA ICLs in humans. ^ In order to specifically target ICLs to mammalian genes, we identified novel TFO target sequences in mouse and human genomes. Using this information, many critical mammalian genes can now be targeted by TFOs.^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Hereditary nonpolyposis colorectal cancer (HNPCC) is an autosomal dominant disease caused by germline mutations in DNA mismatch repair(MMR) genes. The nucleotide excision repair(NER) pathway plays a very important role in cancer development. We systematically studied interactions between NER and MMR genes to identify NER gene single nucleotide polymorphism (SNP) risk factors that modify the effect of MMR mutations on risk for cancer in HNPCC. We analyzed data from polymorphisms in 10 NER genes that had been genotyped in HNPCC patients that carry MSH2 and MLH1 gene mutations. The influence of the NER gene SNPs on time to onset of colorectal cancer (CRC) was assessed using survival analysis and a semiparametric proportional hazard model. We found the median age of onset for CRC among MMR mutation carriers with the ERCC1 mutation was 3.9 years earlier than patients with wildtype ERCC1(median 47.7 vs 51.6, log-rank test p=0.035). The influence of Rad23B A249V SNP on age of onset of HNPCC is age dependent (likelihood ratio test p=0.0056). Interestingly, using the likelihood ratio test, we also found evidence of genetic interactions between the MMR gene mutations and SNPs in ERCC1 gene(C8092A) and XPG/ERCC5 gene(D1104H) with p-values of 0.004 and 0.042, respectively. An assessment using tree structured survival analysis (TSSA) showed distinct gene interactions in MLH1 mutation carriers and MSH2 mutation carriers. ERCC1 SNP genotypes greatly modified the age onset of HNPCC in MSH2 mutation carriers, while no effect was detected in MLH1 mutation carriers. Given the NER genes in this study play different roles in NER pathway, they may have distinct influences on the development of HNPCC. The findings of this study are very important for elucidation of the molecular mechanism of colon cancer development and for understanding why some mutation carriers of the MSH2 and MLH1 gene develop CRC early and others never develop CRC. Overall, the findings also have important implications for the development of early detection strategies and prevention as well as understanding the mechanism of colorectal carcinogenesis in HNPCC. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The DNA replication polymerases δ and ϵ have an inherent proofreading mechanism in the form of a 3'→5' exonuclease. Upon recognition of errant deoxynucleotide incorporation into DNA, the nascent primer terminus is partitioned to the exonuclease active site where the incorrectly paired nucleotide is excised before resumption of polymerization. The goal of this project was to identify the cellular and molecular consequences of an exonuclease deficiency. The proofreading capability of model system MEFs with EXOII mutations was abolished without altering polymerase function.^ It was hypothesized that 3'→5' exonucleases of polymerases δ and ϵ are critical for prevention of replication stress and important for sensitization to nucleoside analogs. To test this hypothesis, two aims were formulated: Determine the effect of the exonuclease active site mutation on replication related molecular signaling and identify the molecular consequences of an exonuclease deficiency when replication is challenged with nucleoside analogs.^ Via cell cycle studies it was determined that larger populations of exonuclease deficient cells are in the S-phase. There was an increase in levels of replication proteins, cell population growth and DNA synthesis capacity without alteration in cell cycle progression. These findings led to studies of proteins involved in checkpoint activation and DNA damage sensing. Finally, collective modifications at the level of DNA replication likely affect the strand integrity of DNA at the chromosomal level.^ Gemcitabine, a DNA directed nucleoside analog is a substrate of polymerases δ and ϵ and exploits replication to become incorporated into DNA. Though accumulation of gemcitabine triphosphate was similar in all cell types, incorporation into DNA and rates of DNA synthesis were increased in exonuclease defective cells and were not consistent with clonogenic survival. This led to molecular signaling investigations which demonstrated an increase in S-phase cells and activation of a DNA damage response upon gemcitabine treatment.^ Collectively, these data indicate that the loss of exonuclease results in a replication stress response that is likely required to employ other repair mechanisms to remove unexcised mismatches introduced into DNA during replication. When challenged with nucleoside analogs, this ongoing stress response coupled with repair serves as a resistance mechanism to cell death.^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The difficulty of detecting differential gene expression in microarray data has existed for many years. Several correction procedures try to avoid the family-wise error rate in multiple comparison process, including the Bonferroni and Sidak single-step p-value adjustments, Holm's step-down correction method, and Benjamini and Hochberg's false discovery rate (FDR) correction procedure. Each multiple comparison technique has its advantages and weaknesses. We studied each multiple comparison method through numerical studies (simulations) and applied the methods to the real exploratory DNA microarray data, which detect of molecular signatures in papillary thyroid cancer (PTC) patients. According to our results of simulation studies, Benjamini and Hochberg step-up FDR controlling procedure is the best process among these multiple comparison methods and we discovered 1277 potential biomarkers among 54675 probe sets after applying the Benjamini and Hochberg's method to PTC microarray data.^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Next-generation DNA sequencing platforms can effectively detect the entire spectrum of genomic variation and is emerging to be a major tool for systematic exploration of the universe of variants and interactions in the entire genome. However, the data produced by next-generation sequencing technologies will suffer from three basic problems: sequence errors, assembly errors, and missing data. Current statistical methods for genetic analysis are well suited for detecting the association of common variants, but are less suitable to rare variants. This raises great challenge for sequence-based genetic studies of complex diseases.^ This research dissertation utilized genome continuum model as a general principle, and stochastic calculus and functional data analysis as tools for developing novel and powerful statistical methods for next generation of association studies of both qualitative and quantitative traits in the context of sequencing data, which finally lead to shifting the paradigm of association analysis from the current locus-by-locus analysis to collectively analyzing genome regions.^ In this project, the functional principal component (FPC) methods coupled with high-dimensional data reduction techniques will be used to develop novel and powerful methods for testing the associations of the entire spectrum of genetic variation within a segment of genome or a gene regardless of whether the variants are common or rare.^ The classical quantitative genetics suffer from high type I error rates and low power for rare variants. To overcome these limitations for resequencing data, this project used functional linear models with scalar response to develop statistics for identifying quantitative trait loci (QTLs) for both common and rare variants. To illustrate their applications, the functional linear models were applied to five quantitative traits in Framingham heart studies. ^ This project proposed a novel concept of gene-gene co-association in which a gene or a genomic region is taken as a unit of association analysis and used stochastic calculus to develop a unified framework for testing the association of multiple genes or genomic regions for both common and rare alleles. The proposed methods were applied to gene-gene co-association analysis of psoriasis in two independent GWAS datasets which led to discovery of networks significantly associated with psoriasis.^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

DNA ligase and DNA polymerase play important roles in DNA replication, repair, and recombination. Frequencies of spontaneous and chemical- and physical-induced mutations are correlated to the fidelity of DNA replication. This dissertation elucidates the mechanisms of the DNA ligation reaction by DNA ligases and demonstrates that human DNA ligase I and DNA polymerase $\alpha$ are the molecular targets for two metal ions, Zn$\sp{2+}$ and Cd$\sp{2+},$ and an anticancer drug, F-ara-ATP.^ Human DNA ligases were purified to homogeneity and their AMP binding domains were mapped. Although their AMP-binding domains are similar, there could be difference between the two ligases in their DNA binding domains.^ The formation of the AMP-DNA intermediate and the successive ligation reaction by human DNA ligases were analyzed. Both reactions showed their substrate specificity for ligases I and II, required Mg2+, and were inhibited by ATP.^ A protein inhibitor from HeLa cells and specific for human DNA ligase I but not ligase II and T4 ligase was discovered. It reversibly inhibited DNA ligation activity but not the AMP-binding activity due to the formation of a reversible ligase I-inhibitor complex.^ F-ara-ATP inhibited human DNA ligase I activity by competing with ATP for the AMP-binding site of DNA ligase I, forming a ligase I-F-ara-AMP complex, as well as when it was incorporated at 3$\sp\prime$-terminus of DNA nick by DNA polymerase $\alpha.$^ All steps of the DNA ligation reaction were inhibited by Zn$\sp{2+}$ and Cd$\sp{2+}$ in a concentration-dependent manner. Both ions did not show the ability to change the fidelity of DNA ligation reaction catalyzed by human DNA ligase I. However, Zn$\sp{2+}$ and Cd$\sp{2+}$ showed their contradictory effects on the fidelity of the reaction by human DNA polymerase $\alpha.$ Zn$\sp{2+}$ decreased the frequency of misinsertion but less affected that of mispair extension. On the contrary, Cd$\sp{2+}$ increased the frequencies of both misinsertion and mispair extension at very low concentration. Our data provided strong evidence in the molecular mechanisms for the mutagenicity of zinc and cadmium, and were comparable with the results previously reported. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This dissertation examines the biological functions and the regulation of expression of DNA ligase I by studying its expression under different conditions.^ The gene expression of DNA ligase I was induced two- to four-fold in S-phase lymphoblastoid cells but was decreased to 15% of control after administration of a DNA damaging agent, 4-nitroquinoline-1-oxide. When cells were induced into differentiation, the expression level of DNA ligase I was decreased to less than 15% of that of the control cells. When the gene of DNA ligase I was examined for tissue specific expression in adult rats, high levels of DNA ligase I mRNA were observed in testis (8-fold), intermediate levels in ovary and brain (4-fold), and low levels were found in intestine, spleen, and liver (1- to 2-fold).^ In confluent cells of normal skin fibroblasts, UV irradiation induced the gene expression of DNA ligase I at 24 and 48 h. The induction of DNA ligase I gene expression requires active p53 protein. Introducing a vector containing the wild type p53 protein in the cells caused an induction of the DNA ligase I protein 24 h after the treatment.^ Our results indicate that, in addition to the regulation by phosphorylation/dephosphorylation, cellular DNA ligase I activity can be regulated at the gene transcription level, and the p53 tumor suppresser is one of the transcription factors for the DNA ligase I gene. Also, our results suggest that DNA ligase I is involved in DNA repair as well as in DNA replication.^ Also, as an early attempt to clone the human homolog of the yeast CDC9 gene which has been shown to be involved in DNA replication, DNA repair, and DNA recombination, we have identified a human gene with mRNA of 1.7 kb. This dissertation studies the gene regulation and the possible biological functions of this new human gene by examining its expression at different stages of the cell cycle, during cell differentiation, and in cellular response to DNA damage.^ The new gene that we recently identified from human cells is highly expressed in brain and reproductive organs (BRE). This BRE gene encodes an mRNA of 1.7-1.9 kb, with an open reading frame of 1,149 bp, and gives rise to a deduced polypeptide of 383 amino acid residues. No extensive homology was found between BRE and sequences from the EMBL-Gene Banks. BRE showed tissue-specific expression in adult rats. The steady state mRNA levels were high in testis (5-6 fold), ovary and brain (3-4 fold) compared to the spleen level, but low in intestine and liver (1-2 fold). The expression of this gene is responsive to DNA damage and/or retinoic acid (RA) treatment. Treatment of fibroblast cells with UV irradiation and 4-nitroquinoline-1-oxide caused more than 90% and 50% decreases in BRE mRNA, respectively. Similar decreases in BRE expression were observed after treatment of the brain glioma cell line U-251 and the promyelocytic cell line HL-60 with retinoic acid. (Abstract shortened by UMI). ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A clone of the primary Eco R1 family of human DNA sequences has been used as an indicator sequence for detecting alterations induced by a toxic agent. Specific clones of this family have been examined and compared to the consensus sequence to determine the normal variability of this family. Though variations were observed, data indicated that such clones can be used to study induced DNA modifications. This DNA was exposed to the toxic agent dimethyl sulfate under various conditions and a distinct pattern of aberrations was shown to occur. It is suggested that this approach be used to characterize patterns of damage induced by various agents in the ultimate development of a system capable of monitoring human genotoxic exposure. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Quantitative real-time polymerase chain reaction (qPCR) is a sensitive gene quantitation method that has been widely used in the biological and biomedical fields. The currently used methods for PCR data analysis, including the threshold cycle (CT) method, linear and non-linear model fitting methods, all require subtracting background fluorescence. However, the removal of background fluorescence is usually inaccurate, and therefore can distort results. Here, we propose a new method, the taking-difference linear regression method, to overcome this limitation. Briefly, for each two consecutive PCR cycles, we subtracted the fluorescence in the former cycle from that in the later cycle, transforming the n cycle raw data into n-1 cycle data. Then linear regression was applied to the natural logarithm of the transformed data. Finally, amplification efficiencies and the initial DNA molecular numbers were calculated for each PCR run. To evaluate this new method, we compared it in terms of accuracy and precision with the original linear regression method with three background corrections, being the mean of cycles 1-3, the mean of cycles 3-7, and the minimum. Three criteria, including threshold identification, max R2, and max slope, were employed to search for target data points. Considering that PCR data are time series data, we also applied linear mixed models. Collectively, when the threshold identification criterion was applied and when the linear mixed model was adopted, the taking-difference linear regression method was superior as it gave an accurate estimation of initial DNA amount and a reasonable estimation of PCR amplification efficiencies. When the criteria of max R2 and max slope were used, the original linear regression method gave an accurate estimation of initial DNA amount. Overall, the taking-difference linear regression method avoids the error in subtracting an unknown background and thus it is theoretically more accurate and reliable. This method is easy to perform and the taking-difference strategy can be extended to all current methods for qPCR data analysis.^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The combitiatorial approach restriction endonuclease protection selection and amplification REPSA was successfully used to determine ideal DNA interactions sites of covalent ligands. Unlike most other combinatorial methods, REPSA is based on inhibition of enzymatic cleavage by specific ligand-DNA complexes, which enables identification of binding sites of various ligands. However, the inherent nature of this technique posses a problem during selection of binding sites of covalent ligands. By modifying the technique according to the nature of the ligand, we demonstrate the flexibility of REPSA in identifying the preferred binding sites for monocovalent ligands, topoisomerase I and tallimustine, and the bicovalent ligand topoisomerase II. From among the preferred binding sites, we identified the consensus binding sequence of camptothecin induced topoisomerase I cleavage as ‘aGWT/Gc’, and tallimustine consensus sequences as ‘GTTCTA’ and ‘TTTTTTC’. We have shown for the first time that preferential binding of tallimustine occurs at sequences not previously reported. Furthermore, our data indicate that tallimustine is a novel DNA minor groove, guanine-specific alkylating agent. ^ Additionally, we have demonstrated in vivo that sequence-specific covalent DNA-binding small molecules have the ability to regulate transcription by inhibiting RNA polymerase II. Tallimustine, binding to its preferred sequences located in the 5′ untranslated region were an effective impediment for transcribing polymerase II. The ability of covalent binding small molecules to target predetermined DNA sequences located downstream of the promoter suggests a general approach for regulation of gene expression. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Nitric oxide is involved in a multitude of processes including regulation of vascular tone, neurotransmission, immunity, and cancer. Evidence suggests that nitric oxide exhibits anti-apoptotic activity in melanoma cells. Our laboratory showed that tumor expression of inducible nitric oxide synthase correlated strongly with poor survival in stage III and IV melanoma patients, suggesting an antagonistic role for nitric oxide in melanoma response to therapy. Therefore, the hypothesis that endogenously produced nitric oxide antagonizes chemotherapy-induced apoptosis was formed. Using cisplatin as a model for DNA damage in melanoma cell lines, the capacity of nitric oxide to regulate cell growth and apoptotic responses to cisplatin treatment was examined. The depletion of endogenously generated nitric oxide resulted in changes in cell cycle regulation and enhanced cisplatin-induced apoptosis in melanoma cells. Since nitric oxide was shown to be involved in the regulation of p53 stability, conformation and DNA binding activity, whether signaling through wild-type p53 in melanoma cells is regulated by nitric oxide was tested. Cisplatin-induced p53 accumulation and p21Waf1/Cip1/Sdi1 expression in nitric oxide-depleted melanoma cells were found to be strongly suppressed. When p53 binding to the p21Waf1/Cip1/Sdi1 promoter was examined, it was found that nitric oxide depletion significantly reduced the cisplatin-induced formation of p53-DNA complexes. These results suggest that nitric oxide is required for activation of wild-type p53 after DNA damage in melanoma cells. Finally, whether signaling through p53 controls melanoma response to DNA damage was examined. Transfection of a plasmid containing a dominant negative form of mutated p53 inhibited p21 Waf1/Cip1/Sdi1 expression and concomitantly enhanced apoptosis after cisplatin treatment. These data suggest that the induction of wild-type p53 protects melanoma cells against DNA damage via the up-regulation of p21 Waf1/Cip1/Sdi1. Together, these data strongly support the model that endogenous nitric oxide is required for p53 activation and p21Waf1/Cip1/Sdi1 expression after DNA damage, which can enhance melanoma resistance to therapy. Thus, in context of melanoma cells with wild-type p53 , low levels of endogenous constitutively-produced nitric oxide appear to facilitate the activation of p53 in response to DNA damage, thereby allowing for cell cycle arrest via p21Waf1/Cip1/Sdi1 induction, adequate DNA repair, and ultimately enhanced resistance to apoptosis. ^