279 resultados para nucleotide repeat
Resumo:
The proportion of functional sequence in the human genome is currently a subject of debate. The most widely accepted figure is that approximately 5% is under purifying selection. In Drosophila, estimates are an order of magnitude higher, though this corresponds to a similar quantity of sequence. These estimates depend on the difference between the distribution of genomewide evolutionary rates and that observed in a subset of sequences presumed to be neutrally evolving. Motivated by the widening gap between these estimates and experimental evidence of genome function, especially in mammals, we developed a sensitive technique for evaluating such distributions and found that they are much more complex than previously apparent. We found strong evidence for at least nine well-resolved evolutionary rate classes in an alignment of four Drosophila species and at least seven classes in an alignment of four mammals, including human. We also identified at least three rate classes in human ancestral repeats. By positing that the largest of these ancestral repeat classes is neutrally evolving, we estimate that the proportion of nonneutrally evolving sequence is 30% of human ancestral repeats and 45% of the aligned portion of the genome. However, we also question whether any of the classes represent neutrally evolving sequences and argue that a plausible alternative is that they reflect variable structure-function constraints operating throughout the genomes of complex organisms.
Resumo:
Background Phylogeographic reconstruction of some bacterial populations is hindered by low diversity coupled with high levels of lateral gene transfer. A comparison of recombination levels and diversity at seven housekeeping genes for eleven bacterial species, most of which are commonly cited as having high levels of lateral gene transfer shows that the relative contributions of homologous recombination versus mutation for Burkholderia pseudomallei is over two times higher than for Streptococcus pneumoniae and is thus the highest value yet reported in bacteria. Despite the potential for homologous recombination to increase diversity, B. pseudomallei exhibits a relative lack of diversity at these loci. In these situations, whole genome genotyping of orthologous shared single nucleotide polymorphism loci, discovered using next generation sequencing technologies, can provide very large data sets capable of estimating core phylogenetic relationships. We compared and searched 43 whole genome sequences of B. pseudomallei and its closest relatives for single nucleotide polymorphisms in orthologous shared regions to use in phylogenetic reconstruction. Results Bayesian phylogenetic analyses of >14,000 single nucleotide polymorphisms yielded completely resolved trees for these 43 strains with high levels of statistical support. These results enable a better understanding of a separate analysis of population differentiation among >1,700 B. pseudomallei isolates as defined by sequence data from seven housekeeping genes. We analyzed this larger data set for population structure and allele sharing that can be attributed to lateral gene transfer. Our results suggest that despite an almost panmictic population, we can detect two distinct populations of B. pseudomallei that conform to biogeographic patterns found in many plant and animal species. That is, separation along Wallace's Line, a biogeographic boundary between Southeast Asia and Australia. Conclusion We describe an Australian origin for B. pseudomallei, characterized by a single introduction event into Southeast Asia during a recent glacial period, and variable levels of lateral gene transfer within populations. These patterns provide insights into mechanisms of genetic diversification in B. pseudomallei and its closest relatives, and provide a framework for integrating the traditionally separate fields of population genetics and phylogenetics for other bacterial species with high levels of lateral gene transfer.
Resumo:
Background Chlamydia pneumoniae is a widespread pathogen causing upper and lower respiratory tract infections in addition to a range of other diseases in humans and animals. Previous whole genome analyses have focused on four essentially clonal (> 99% identity) C. pneumoniae human genomes (AR39, CWL029, J138 and TW183), providing relatively little insight into strain diversity and evolution of this species. Results We performed individual gene-by-gene comparisons of the recently sequenced C. pneumoniae koala genome and four C. pneumoniae human genomes to identify species-specific genes, and more importantly, to gain an insight into the genetic diversity and evolution of the species. We selected genes dispersed throughout the chromosome, representing genes that were specific to C. pneumoniae, genes with a demonstrated role in chlamydial biology and/or pathogenicity (n = 49), genes encoding nucleotide salvage or amino acid biosynthesis proteins (n = 6), and extrachromosomal elements (9 plasmid and 2 bacteriophage genes). Conclusions We have identified strain-specific differences and targets for detection of C. pneumoniae isolates from both human and animal origin. Such characterisation is necessary for an improved understanding of disease transmission and intervention.
Resumo:
Genetic research of complex diseases is a challenging, but exciting, area of research. The early development of the research was limited, however, until the completion of the Human Genome and HapMap projects, along with the reduction in the cost of genotyping, which paves the way for understanding the genetic composition of complex diseases. In this thesis, we focus on the statistical methods for two aspects of genetic research: phenotype definition for diseases with complex etiology and methods for identifying potentially associated Single Nucleotide Polymorphisms (SNPs) and SNP-SNP interactions. With regard to phenotype definition for diseases with complex etiology, we firstly investigated the effects of different statistical phenotyping approaches on the subsequent analysis. In light of the findings, and the difficulties in validating the estimated phenotype, we proposed two different methods for reconciling phenotypes of different models using Bayesian model averaging as a coherent mechanism for accounting for model uncertainty. In the second part of the thesis, the focus is turned to the methods for identifying associated SNPs and SNP interactions. We review the use of Bayesian logistic regression with variable selection for SNP identification and extended the model for detecting the interaction effects for population based case-control studies. In this part of study, we also develop a machine learning algorithm to cope with the large scale data analysis, namely modified Logic Regression with Genetic Program (MLR-GEP), which is then compared with the Bayesian model, Random Forests and other variants of logic regression.
Resumo:
Although numerous genetic and acquired factors are appreciated as risk factors for venous thromboembolism (VTE) [1,2], only recently have male gender [3,4], dyslipoproteinemia [5], and silent atherosclerotic vascular disease [6] been linked to VTE. We recently found that high-density lipoprotein (HDL) deficiency is a key feature of a pattern of dyslipoproteinemia that is associated with VTE in males, and we found that the common TaqI B1 variation in the cholesteryl ester transfer protein (CETP) gene is significantly linked to VTE [5]. However, the TaqI B1/B2 single nucleotide polymorphism (SNP) itself is unlikely to affect directly CETP activity, but it is linked to nonsynonymous CETP SNPs Ala373Pro and Arg451Gln [7–9]. Here, we demonstrate that these two CETP variations are associated with VTE and low plasma HDL levels in males.
Resumo:
We have previously reported the use of a novel mini-sequencing protocol for detection of the factor V Leiden variant, the first nucleotide change (FNC) technology. This technology is based on a single nucleotide extension of a primer, which is hybridized immediately adjacent to the site of mutation. The extended nucleotide that carries a reporter molecule (fluorescein) has the power to discriminate the genotype at the site of mutation. More recently, the prothrombin 20210 and thermolabile methylene tetrahydrofolate reductase (MTHFR) 677 variants have been identified as possible risk factors associated with thrombophilia. This study describes the use of the FNC technology in a combined assay to detect factor V, prothrombin and MTHFR variants in a population of Australian blood donors, and describes the objective numerical methodology used to determine genotype cut-off values for each genetic variation. Using FNC to test 500 normal blood donors, the incidence of Factor V Leiden was 3.6% (all heterozygous), that of prothrombin 20210 was 2.8% (all heterozygous) and that of MTHFR was 10% (homozygous). The combined FNC technology offers a simple, rapid, automatable DNA-based test for the detection of these three important mutations that are associated with familial thrombophilia. (C) 2000 Lippincott Williams and Wilkins.
Resumo:
Background Nitric oxide is released by immune, epithelial and endothelial cells, and plays an important part in the pathophysiology of asthma. Objective To investigate the association of inducible nitric oxide synthases (iNOS) gene repeat polymorphisms with asthma. Methods 230 families with asthma (842 individuals) were recruited to identify and establish the genetic association of iNOS repeats with asthma and associated phenotypes. Serum nitric oxide levels in selected individuals were measured and correlated with specific genotypes. Multiple logistic regression analysis was performed to determine the effect of age and sex. Results A total of four repeats—a (CCTTT)n promoter repeat, a novel intron 2 (GT)n repeat (BV680047), an intron 4 (GT)n repeat (AFM311ZB1) and an intron 5 (CA)n repeat (D17S1878)—were identified and genotyped. A significant transmission distortion to the probands with asthma was seen for allele 3 of the AFM311ZB1 gene (p = 0.006). This allele was also found to be significantly associated with percentage blood eosinophils (p<0.001) and asthma severity (p = 0.04). Moreover, it was functionally correlated with high serum nitric oxide levels (p = 0.006). Similarly, the promoter repeat was found to be associated with serum total immunoglobulin (Ig)E (p = 0.028). Individuals carrying allele 4 of this repeat have high serum IgE (p<0.001) and nitric oxide levels (p = 0.03). Conclusion This is the first study to identify the repeat polymorphisms in the iNOS gene that are associated with severity of asthma and eosinophils. The functional relevance of the associated alleles with serum nitric oxide levels was also shown. Therefore, these results could be valuable in elucidating the role of nitric oxide in asthma pathogenesis.
Resumo:
BACKGROUND Endometriosis is a polygenic disease with a complex and multifactorial aetiology that affects 8-10% of women of reproductive age. Epidemiological data support a link between endometriosis and cancers of the reproductive tract. Fibroblast growth factor receptor 2 (FGFR2) has recently been implicated in both endometrial and breast cancer. Our previous studies on endometriosis identified significant linkage to a novel susceptibility locus on chromosome 10q26 and the FGFR2 gene maps within this linkage region. We therefore hypothesized that variation in FGFR2 may contribute to the risk of endometriosis. METHODS We genotyped 13 single nucleotide polymorphisms (SNPs) densely covering a 27 kb region within intron 2 of FGFR2 including two SNPs (rs2981582 and rs1219648) significantly associated with breast cancer and a total 40 tagSNPs across 150 kb of the FGFR2 gene. SNPs were genotyped in 958 endometriosis cases and 959 unrelated controls. RESULTS We found no evidence for association between endometriosis and FGFR2 intron 2 SNPs or SNP haplotypes and no evidence for association between endometriosis and variation across the FGFR2 gene. CONCLUSIONS Common variation in the breast-cancer implicated intron 2 and other highly plausible causative candidate regions of FGFR2 do not appear to be a major contributor to endometriosis susceptibility in our large Australian sample.
Resumo:
The progesterone receptor (PR) is a candidate gene for the development of endometriosis, a complex disease with strong hormonal features, common in women of reproductive age. We typed the 306 base pair Alu insertion (AluIns) polymorphism in intron G of PR in 101 individuals, estimated linkage disequilibrium (LD) between five single-nucleotide polymorphisms (SNPs) across the PR locus in 980 Australian triads (endometriosis case and two parents) and used transmission disequilibrium testing (TDT) for association with endometriosis. The five SNPs showed strong pairwise LD, and the AluIns was highly correlated with proximal SNPs rs1042839 (Δ2 = 0.877, D9 = 1.00, P < 0.0001) and rs500760 (Δ2 = 0.438, D9 = 0.942, P < 0.0001). TDT showed weak evidence of allelic association between endometriosis and rs500760 (P = 0.027) but not in the expected direction. We identified a common susceptibility haplotype GGGCA across the five SNPs (P = 0.0167) in the whole sample, but likelihood ratio testing of haplotype transmission and non-transmission of the AluIns and flanking SNPs showed no significant pattern. Further, analysis of our results pooled with those from two previous studies suggested that neither the T2 allele of the AluIns nor the T1/T2 genotype was associated with endometriosis.
Resumo:
Retrotransposons are a class of transposable elements that represent a major fraction of the repetitive DNA of most eukaryotes. Their abundance stems from their expansive replication strategies. We screened and isolated sequence fragments of long terminal repeat (LTR), gypsy-like reverse transcriptase (rt) and gypsy-like envelope (env) domains, and two partial sequences of non-LTR retrotransposons, long interspersed element (LINE), in the clonally propagated allohexaploid sweet potato (Ipomoea batatas (L.) Lam.) genome. Using dot-blot hybridization, these elements were found to be present in the ~1597 Mb haploid sweet potato genome with copy numbers ranging from ~50 to ~4100 as observed in the partial LTR (IbLtr-1) and LINE (IbLi-1) sequences, respectively. The continuous clonal propagation of sweet potato may have contributed to such a multitude of copies of some of these genomic elements. Interestingly, the isolated gypsy-like env and gypsy-like rt sequence fragments, IbGy-1 (~2100 copies) and IbGy-2 (~540 copies), respectively, were found to be homologous to the Bagy-2 cDNA sequences of barley (Hordeum vulgare L.). Although the isolated partial sequences were found to be homologous to other transcriptionally active elements, future studies are required to determine whether they represent elements that are transcriptionally active under normal and (or) stressful conditions.
Resumo:
The CDKN2 gene, encoding the cyclin-dependent kinase inhibitor p16, is a tumour suppressor gene that maps to chromosome band 9p21-p22. The most common mechanism of inactivation of this gene in human cancers is through homozygous deletion; however, in a smaller proportion of tumours and tumour cell lines intragenic mutations occur. In this study we have compiled a database of over 120 published point mutations in the CDKN2 gene from a wide variety of tumour types. A further 50 deletions, insertions, and splice mutations in CDKN2 have also been compiled. Furthermore, we have standardised the numbering of all mutations according to the full-length 156 amino acid form of p16. From this study we are able to define several hot spots, some of which occur at conserved residues within the ankyrin domains of p16. While many of the hotspots are shared by a number of cancers, the relative importance of each position varies, possibly reflecting the role of different carcinogens in the development of certain tumours. As reported previously, the mutational spectrum of CDKN2 in melanomas differs from that of internal malignancies and supports the involvement of UV in melanoma tumorigenesis. Notably, 52% of all substitutions in melanoma-derived samples occurred at just six nucleotide positions. Nonsense mutations comprise a comparatively high proportion of mutations present in the CDKN2 gene, and possible explanations for this are discussed.
Resumo:
Germ-line mutations in CDKN2A have been shown to predispose to cutaneous malignant melanoma. We have identified 2 new melanoma kindreds which carry a duplication of a 24bp repeat present in the 5' region of CDKN2A previously identified in melanoma families from Australia and the United States. This mutation has now been reported in 5 melanoma families from 3 continents: Europe, North America, and Australasia. The M53I mutation in exon 2 of CDKN2A has also been documented in 5 melanoma families from Australia and North America. The aim of this study was to determine whether the occurrence of the mutations in these families from geographically diverse populations represented mutation hotspots within CDKN2A or were due to common ancestors. Haplotypes of 11 microsatellite markers flanking CDKN2A were constructed in 5 families carrying the M53I mutation and 5 families carrying the 24bp duplication. There were some differences in the segregating haplotypes due primarily to recombinations and mutations within the short tandem-repeat markers; however, the data provide evidence to indicate that there were at least 3 independent 24bp duplication events and possibly only 1 original M53I mutation. This is the first study to date which indicates common founders in melanoma families from different continents.
Resumo:
A series of high-performance polycarbonates have been prepared with glass-transition temperatures and decomposition temperatures that are tunable by varying the repeat-unit chemical structure. Patterning of the polymers with extreme UV lithography has been achieved by taking advantage of direct photoinduced chain scission of the polymer chains, which results in a molecular-weight based solubility switch. After selective development of the irradiated regions of the polymers, feature sizes as small as 28.6 nm have been printed and the importance of resist-developer interactions for maximizing image quality has been demonstrated.
Resumo:
This paper reports the feasibility and methodological considerations of using the Short Message System Experience Sampling (SMS-ES) Method, which is an experience sampling research method developed to assist researchers to collect repeat measures of consumers’ affective experiences. The method combines SMS with web-based technology in a simple yet effective way. It is described using a practical implementation study that collected consumers’ emotions in response to using mobile phones in everyday situations. The method is further evaluated in terms of the quality of data collected in the study, as well as against the methodological considerations for experience sampling studies. These two evaluations suggest that the SMS-ES Method is both a valid and reliable approach for collecting consumers’ affective experiences. Moreover, the method can be applied across a range of for-profit and not-for-profit contexts where researchers want to capture repeated measures of consumers’ affective experiences occurring over a period of time. The benefits of the method are discussed to assist researchers who wish to apply the SMS-ES Method in their own research designs.