13 resultados para DNA VARIATION
em DigitalCommons@The Texas Medical Center
Resumo:
DNA sequence variation is currently a major source of data for studying human origins, evolution, and demographic history, and for detecting linkage association of complex diseases. In this dissertation, I investigated DNA variation in worldwide populations from two ∼10 kb autosomal regions on 22q11.2 (noncoding) and 1q24 (introns). A total of 75 variant sites were found among 128 human sequences in the 22q11.2 region, yielding an estimate of 0.088% for nucleotide diversity (π), and a total of 52 variant sites were found among 122 human sequences in the 1q24 region with an estimated π value of 0.057%. The data from these two regions and a 10 kb noncoding region on Xq13.3 all show a strong excess of low-frequency variants in comparison to that expected from an equilibrium population, indicating a relatively recent population expansion. The effective population sizes estimated from the three regions were 11,000, 12,700, and 8,600, respectively, which are close to the commonly used value of 10,000. In each of the two autosomal regions, the age of the most recent common ancestor (MRCA) was estimated to be older than 1 million years among all the sequences and ∼600,000 years among non-African sequences, providing first evidence from autosomal noncoding or intronic regions for a genetic history of humans much more ancient than the emergence of modern humans. The ancient genetic history of humans indicates no severe bottleneck during the evolution of humans in the last half million years; otherwise, much of the ancient genetic history would have been lost during a severe bottleneck. This study strongly suggests that both the “out of Africa” and the multiregional models are too simple for explaining the evolution of modern humans. A compilation of genome-wide data revealed that nucleotide diversity is highest in autosomal regions, intermediate in X-linked regions, and lowest in Y-linked regions. The data suggest the existence of background selection or selective sweep on Y-linked loci. In general, the nucleotide diversity in humans is low compared to that in chimpanzee and Drosophila populations. ^
Resumo:
The purpose of this research is to develop a new statistical method to determine the minimum set of rows (R) in a R x C contingency table of discrete data that explains the dependence of observations. The statistical power of the method will be empirically determined by computer simulation to judge its efficiency over the presently existing methods. The method will be applied to data on DNA fragment length variation at six VNTR loci in over 72 populations from five major racial groups of human (total sample size is over 15,000 individuals; each sample having at least 50 individuals). DNA fragment lengths grouped in bins will form the basis of studying inter-population DNA variation within the racial groups are significant, will provide a rigorous re-binning procedure for forensic computation of DNA profile frequencies that takes into account intra-racial DNA variation among populations. ^
Resumo:
Objectives. The chief goal of this study was to analyze copy number variation (CNV) in breast cancer tumors from 25 African American women with early stage breast cancer (BC) using molecular inversion probes (MIP) in order to: (1) compare the degree of CNV in tumors compared to normal lymph nodes, and (2) determine whether gains and/or losses of genes in specific chromosomes differ between pathologic subtypes of breast cancer defined by known prognostic markers, (3) determine whether gains/losses in CN are associated with known oncogenes or tumor suppressor genes, and (4) determine whether increased gains/losses in CN for specific chromosomes were associated with differences in breast cancer recurrence. ^ Methods. Twenty to 37 nanograms of DNA extracted from 25 formalin-fixed paraffin embedded (FFPE) tumor samples and matched normal lymph nodes were added to individual tubes. Oligonucleotide probes with recognition sequences at each terminus were hybridized with a genomic target sequence to form a circular structure. Probes are released from genomic DNA obtained from FFPE samples, and those which have been correctly "circularized" in the proper allele/nucleotide reaction combination are amplified using polymerase chain reaction (PCR) primers. Amplicons were fluorescently labeled and the tag sequences released from the genome homology regions by treatment with uracil-N-glycosylase to cleave the probe at the site where uracils are present, and detected using a complementary tag array developed by Affymetrix. ^ Results. Analysis of CN gains and losses from tumors and normal tissues showed marked differences in tumors with numerous chromosomes affected. Similar changes were not observed in normal lymph nodes. When tumors were stratified into four groups based on expression or lack of expression of the estrogen receptor and HER2/neu, distinct patterns of CNV for different chromosomes were observed. Gains or losses in CN for specific chromosomes correlated with amplifications/deletions of particular oncogenes or tumor suppressor genes (i.e. such as found on chromosome 17) known to be associated with aggressive tumor phenotype and poor prognosis. There was a trend for increases in CN observed for chromosome 17 to correlate inversely with time to recurrence of BC (p=0.14 for trend). CNV was also observed for chromosomes 5, 8, 10, 11, and 16, which are known sites for several breast cancer susceptibility alleles. ^ Conclusions. This study is the first to validate the MIP technique, to correlate differences in gene expression with known prognostic tumor markers, and to correlate significant increases/decreases in CN with known tumor markers associated with prognosis. The results of this study may have far reaching public health implications towards identifying new high-risk groups based on genomic differences in CNP, both with respect to prognosis and response to therapy, and to eventually identify new therapeutic targets for prevention and treatment of this disease. ^
Resumo:
The effect of circadian variation on susceptibility to the chemical induction of cancer was assessed utilizing the mouse pulmonary adenoma bioassay. Different groups of male A/Jax mice (standardized for rhythm analysis with light from 0600-1800 and darkness from 1800-0600) each received a single timed i.p. injection of urethan (Bioassay I: 0.25, 0.5 or 1.0 mg/g body weight; Bioassay II: 0.75, 1.0, 1.25 mg/g body weight; Bioassay III: 1.0 mg/g body weight) at the following times, 0100, 0500, 0900, 1300, 1700 or 2100. Mice were sacrificed 16 weeks after treatment. The tumorigenic effect of urethan on the lungs (lung surface pulmonary adenomas) was assessed. In addition, mortality, body weight changes and the anesthetic effect of urethan were determined. The rhythmic pattern of DNA synthesis in the lung and the comparative rhythmic pattern in the liver were assessed using a tritiated thymidine incorporation assay.^ In the first adenoma bioassay, the lung tumorigenic response in mice given the highest dose of urethan exhibited a 12-hour rhythm with a major peak in tumor yield at 0100 and a secondary peak at 1300; reduced yields occurred at 0500-0900 and 2100. The second adenoma bioassay, studied at a 6-month seasonal divergence in time from the first study showed a peak at 1300 but not at 0100. The mice from the third adenoma bioassay, studied at an 11-month seasonal divergence in time from the 2nd study showed an increase in tumor yield during the rest cycle (0900-1700).^ This study found a definite suggestion of a low amplitude rhythm in susceptibility to urethan induced effects. The acute toxic and pharmacological effects correlated to exhibit a maximal effect during dark hours (activity span). This rhythmicity might be explained by an alteration in the amplitude of hepatic metabolism. The chronic carcinogenic response exhibited an opposite pattern. Urethan induced tumor response was greater during daylight hours (rest cycle). This correlated with the slight elevation in DNA synthetic activity found in the lung and liver which might be responsible for the increase in carcinogenic response. (Abstract shortened with permission of author.) ^
Resumo:
The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.
Resumo:
Methylating agents are involved in carcinogenesis, and the DNA repair protein O(6)-methylguanine-DNA methyltransferase (MGMT) removes methyl group from O(6)-methylguanine. Genetic variation in DNA repair genes has been shown to contribute to susceptibility to squamous cell carcinoma of the head and neck (SCCHN). We hypothesize that MGMT polymorphisms are associated with risk of SCCHN. In a hospital-based case-control study of 721 patients with SCCHN and 1234 cancer-free controls frequency-matched by age, sex and ethnicity, we genotyped four MGMT polymorphisms, two in exon 3, 16195C>T and 16286C>T and two in the promoter region, 45996G>T and 46346C>A. We found that none of these polymorphisms alone had a significant effect on risk of SCCHN. However, when these four polymorphisms were evaluated together by the number of putative risk genotypes (i.e. 16195CC, 16286CC, 45996GT+TT, and 46346CA+AA), a statistically significantly increased risk of SCCHN was associated with the combined genotypes with three to four risk genotypes, compared with those with zero to two risk genotypes (adjusted odds ratio (OR)=1.27; 95% confidence interval (CI)=1.05-1.53). This increased risk was also more pronounced among young subjects (OR=1.81; 95% CI=1.11-2.96), men (OR=1.24; 95% CI=1.00-1.55), ever smokers (OR=1.25; 95%=1.01-1.56), ever drinkers (OR=1.29; 95% CI=1.04-1.60), patients with oropharyngeal cancer (OR=1.45; 95% CI=1.12-1.87), and oropharyngeal cancer with regional lymph node metastasis (OR=1.52; 95% CI=1.16-1.89). In conclusion, our results suggest that any one of MGMT variants may not have a substantial effect on SCCHN risk, but a joint effect of several MGMT variants may contribute to risk and progression of SCCHN, particularly for oropharyngeal cancer, in non-Hispanic whites.
Resumo:
Individuals with Lynch syndrome are predisposed to cancer due to an inherited DNA mismatch repair gene mutation. However, there is significant variability observed in disease expression likely due to the influence of other environmental, lifestyle, or genetic factors. Polymorphisms in genes encoding xenobiotic-metabolizing enzymes may modify cancer risk by influencing the metabolism and clearance of potential carcinogens from the body. In this retrospective analysis, we examined key candidate gene polymorphisms in CYP1A1, EPHX1, GSTT1, GSTM1, and GSTP1 as modifiers of age at onset of colorectal cancer among 257 individuals with Lynch syndrome. We found that subjects heterozygous for CYP1A1 I462V (c.1384A>G) developed colorectal cancer 4 years earlier than those with the homozygous wild-type genotype (median ages, 39 and 43 years, respectively; log-rank test P = 0.018). Furthermore, being heterozygous for the CYP1A1 polymorphisms, I462V and Msp1 (g.6235T>C), was associated with an increased risk for developing colorectal cancer [adjusted hazard ratio for AG relative to AA, 1.78; 95% confidence interval, 1.16-2.74; P = 0.008; hazard ratio for TC relative to TT, 1.53; 95% confidence interval, 1.06-2.22; P = 0.02]. Because homozygous variants for both CYP1A1 polymorphisms were rare, risk estimates were imprecise. None of the other gene polymorphisms examined were associated with an earlier onset age for colorectal cancer. Our results suggest that the I462V and Msp1 polymorphisms in CYP1A1 may be an additional susceptibility factor for disease expression in Lynch syndrome because they modify the age of colorectal cancer onset by up to 4 years.
Resumo:
BACKGROUND: Enterococcus faecalis has emerged as a major hospital pathogen. To explore its diversity, we sequenced E. faecalis strain OG1RF, which is commonly used for molecular manipulation and virulence studies. RESULTS: The 2,739,625 base pair chromosome of OG1RF was found to contain approximately 232 kilobases unique to this strain compared to V583, the only publicly available sequenced strain. Almost no mobile genetic elements were found in OG1RF. The 64 areas of divergence were classified into three categories. First, OG1RF carries 39 unique regions, including 2 CRISPR loci and a new WxL locus. Second, we found nine replacements where a sequence specific to V583 was substituted by a sequence specific to OG1RF. For example, the iol operon of OG1RF replaces a possible prophage and the vanB transposon in V583. Finally, we found 16 regions that were present in V583 but missing from OG1RF, including the proposed pathogenicity island, several probable prophages, and the cpsCDEFGHIJK capsular polysaccharide operon. OG1RF was more rapidly but less frequently lethal than V583 in the mouse peritonitis model and considerably outcompeted V583 in a murine model of urinary tract infections. CONCLUSION: E. faecalis OG1RF carries a number of unique loci compared to V583, but the almost complete lack of mobile genetic elements demonstrates that this is not a defining feature of the species. Additionally, OG1RF's effects in experimental models suggest that mediators of virulence may be diverse between different E. faecalis strains and that virulence is not dependent on the presence of mobile genetic elements.
Resumo:
Lyme disease Borrelia can infect humans and animals for months to years, despite the presence of an active host immune response. The vls antigenic variation system, which expresses the surface-exposed lipoprotein VlsE, plays a major role in B. burgdorferi immune evasion. Gene conversion between vls silent cassettes and the vlsE expression site occurs at high frequency during mammalian infection, resulting in sequence variation in the VlsE product. In this study, we examined vlsE sequence variation in B. burgdorferi B31 during mouse infection by analyzing 1,399 clones isolated from bladder, heart, joint, ear, and skin tissues of mice infected for 4 to 365 days. The median number of codon changes increased progressively in C3H/HeN mice from 4 to 28 days post infection, and no clones retained the parental vlsE sequence at 28 days. In contrast, the decrease in the number of clones with the parental vlsE sequence and the increase in the number of sequence changes occurred more gradually in severe combined immunodeficiency (SCID) mice. Clones containing a stop codon were isolated, indicating that continuous expression of full-length VlsE is not required for survival in vivo; also, these clones continued to undergo vlsE recombination. Analysis of clones with apparent single recombination events indicated that recombinations into vlsE are nonselective with regard to the silent cassette utilized, as well as the length and location of the recombination event. Sequence changes as small as one base pair were common. Fifteen percent of recovered vlsE variants contained "template-independent" sequence changes, which clustered in the variable regions of vlsE. We hypothesize that the increased frequency and complexity of vlsE sequence changes observed in clones recovered from immunocompetent mice (as compared with SCID mice) is due to rapid clearance of relatively invariant clones by variable region-specific anti-VlsE antibody responses.
Resumo:
I have undertaken measurements of the genetic (or inherited) and nongenetic (or noninherited) components of the variability of metastasis formation and tumor diameter doubling time in more than 100 metastatic lines from each of three murine tumors (sarcoma SANH, sarcoma SA4020, and hepatocarcinoma HCA-I) syngeneic to C3Hf/Kam mice. These lines were isolated twice from lung metastases and analysed immediately thereafter to obtain the variance to spontaneous lung metastasis and tumor diameter doubling time. Additional studies utilized cells obtained from within 4 passages of isolation. Under the assumption that no genetic differences in metastasis formation or diameter doubling time existed among the cells of a given line, the variance within a line would estimate nongenetic variation. The variability derived from differences between lines would represent genetic origin. The estimates of the genetic contribution to the variation of metastasis and tumor diameter doubling time were significantly greater than zero, but only in the metastatic lines of tumor SANH was genetic variation the major source of metastatic variability (contributing 53% of the variability). In the tumor cell lines of SA4020 and HCA-I, however, the contribution of nongenetic factors predominated over genetic factors in the variability of the number of metastasis and tumor diameter doubling time. A number of other parameters examined, such as DNA content, karyotype, and selection and variance analysis with passage in vivo, indicated that genetic differences existed within the cell lines and that these differences were probably created by genetic instability. The mean metastatic propensity of the lines may have increased somewhat during their isolation and isotransplantation, but the variance was only slightly affected, if at all. Analysis of the DNA profiles of the metastatic lines of SA4020 and HCA-I revealed differences between these lines and their primary parent tumors, but not among the SANH lines and their parent tumor. Furthermore, there was a direct correlation between the extent of genetic influence on metastasis formation and the ability of the tumor cells to develop resistance to cisplatinum. Thus although nongenetic factors might predominate in contributing to metastasis formation, it is probably genetic variation and genetic instability that cause the progression of tumor cells to a more metastatic phenotype and leads to the emergence of drug resistance. ^
Resumo:
Models of DNA sequence evolution and methods for estimating evolutionary distances are needed for studying the rate and pattern of molecular evolution and for inferring the evolutionary relationships of organisms or genes. In this dissertation, several new models and methods are developed.^ The rate variation among nucleotide sites: To obtain unbiased estimates of evolutionary distances, the rate heterogeneity among nucleotide sites of a gene should be considered. Commonly, it is assumed that the substitution rate varies among sites according to a gamma distribution (gamma model) or, more generally, an invariant+gamma model which includes some invariable sites. A maximum likelihood (ML) approach was developed for estimating the shape parameter of the gamma distribution $(\alpha)$ and/or the proportion of invariable sites $(\theta).$ Computer simulation showed that (1) under the gamma model, $\alpha$ can be well estimated from 3 or 4 sequences if the sequence length is long; and (2) the distance estimate is unbiased and robust against violations of the assumptions of the invariant+gamma model.^ However, this ML method requires a huge amount of computational time and is useful only for less than 6 sequences. Therefore, I developed a fast method for estimating $\alpha,$ which is easy to implement and requires no knowledge of tree. A computer program was developed for estimating $\alpha$ and evolutionary distances, which can handle the number of sequences as large as 30.^ Evolutionary distances under the stationary, time-reversible (SR) model: The SR model is a general model of nucleotide substitution, which assumes (i) stationary nucleotide frequencies and (ii) time-reversibility. It can be extended to SRV model which allows rate variation among sites. I developed a method for estimating the distance under the SR or SRV model, as well as the variance-covariance matrix of distances. Computer simulation showed that the SR method is better than a simpler method when the sequence length $L>1,000$ bp and is robust against deviations from time-reversibility. As expected, when the rate varies among sites, the SRV method is much better than the SR method.^ The evolutionary distances under nonstationary nucleotide frequencies: The statistical properties of the paralinear and LogDet distances under nonstationary nucleotide frequencies were studied. First, I developed formulas for correcting the estimation biases of the paralinear and LogDet distances. The performances of these formulas and the formulas for sampling variances were examined by computer simulation. Second, I developed a method for estimating the variance-covariance matrix of the paralinear distance, so that statistical tests of phylogenies can be conducted when the nucleotide frequencies are nonstationary. Third, a new method for testing the molecular clock hypothesis was developed in the nonstationary case. ^
Resumo:
Double minutes (dm) are small chromatin particles of 0.3 microns diameter found only in the metaphase cells of human and murine tumors. Dm are unique cytogenetic structures since their numbers per cell show wide variation. At cell division, dm are retained despite the lack of centromeres. In squash preparations, dm show clustering often in association with chromosomes. Human carcinoma cell line SW613-S18 was found to have large numbers of dm and biological characteristics favorable for mitotic synchronization and chromosome isolation experiments.^ S18 cells were synchronized to mitosis with metabolic and mitotic blocking compounds. Mitotic cells were lysed to release chromosomes and dm from the mitotic spindle and the resulting suspensions were fractionated to enrich for dm. The DNA in enriched fractions was characterized. The reassociation kinetics of dm-DNA driven with placental human DNA was similar to the reassociation curve of labeled placental DNA under similar conditions. In situ hybridization of dm-DNA to tumor and normal metaphase cells showed grain localization over the entire karyotype. Dm-DNA was shown by pulse chase DNA replication experiments to replicate during early and mid S-phase of the cell cycle, but not in late S-phase. In addition, BrdUrd incorporation studies showed that dm-DNA replicates only once during the S-phase. Premature chromosome condensation studies suggest the basis of numerical heterogeneity of dm is nondisjunction, not anomalous or unscheduled DNA replication.^ These data and previous cytochemical banding studies of dm in SW613-S18 indicate that dm-DNA is chromosomal in origin. No evidence of gene amplification was found in the DNA reassociation data. It is likely that dm-DNA represents the pale-staining G-band regions of the human karyotype in this cell line. ^
Resumo:
I-compounds are newly discovered covalent DNA modifications detected by the $\sp{32}$P-postlabeling assay. They are age-dependent, tissue-specific and sex-different. The origin(s), chemistry and function(s) of I-compounds are unknown. The total level of I-compounds in 8-10 month old rat liver is 1 adduct in 10$\sp7$ nucleotides, which is not neglectable. It is proposed that I-compounds may play a role in spontaneous tumorigenesis and aging.^ In the present project, I-compounds were investigated by several different approaches. (1) Dietary modulation of I-compounds. (2) Comparison of I-compounds with persistent carcinogen DNA adducts and 5-methylcytosine. (3) Strain differences of I-compounds in relation to organ site spontaneous tumorigenesis. (4) Effects of nongenotoxic hepatocarcinogenes on I-compounds.^ It was demonstrated that the formation of I-compounds is diet-related. Rats fed natural ingredient diet exhibited more complex I-spot patterns and much higher levels than rats fed purified diet. Variation of major nutrients (carbohydrate, protein and fat) in the diet, produced quantitative differences in I-compounds of rat liver and kidney DNAs. Physiological level of vitamin E in the diet reduced intensity of one I-spot compared with vitamin E deficient diet. However, extremely high level of vitamin E in the diet gave extra spot and enhanced the intensities of some I-spots.^ In regenerating rat liver, I-compounds levels were reduced, as carcinogen DNA adducts, but not 5-methylcytosine, i.e. a normal DNA modification.^ Animals with higher incidences of spontaneous tumor or degenerative diseases tended to have a lower level of I-compounds.^ Choline devoid diet induced a drastic reduction of I-compound level in rat liver compared with choline supplemented diet. I-compound levels were reduced after multi-doses of carbon tetrachloride (CCl$\sb4$) exposure in rats and single dose exposure in mice. An inverse relationship was observed between I-compound level and DNA replication rate. CCl$\sb4$-related DNA adduct was detected in mice liver and intensities of some I-spots were enhanced 24 h after a single dose exposure.^ The mechanisms and explanations of these observations will be discussed. I-compounds are potentially useful indicators in carcinogenesis studies. ^