931 resultados para genomic
Resumo:
In the last few years, two paradigms underlying human evolution have crumbled. Modern humans have not totally replaced previous hominins without any admixture, and the expected signatures of adaptations to new environments are surprisingly lacking at the genomic level. Here we review current evidence about archaic admixture and lack of strong selective sweeps in humans. We underline the need to properly model differential admixture in various populations to correctly reconstruct past demography. We also stress the importance of taking into account the spatial dimension of human evolution, which proceeded by a series of range expansions that could have promoted both the introgression of archaic genes and background selection.
Resumo:
Background Levels of differentiation among populations depend both on demographic and selective factors: genetic drift and local adaptation increase population differentiation, which is eroded by gene flow and balancing selection. We describe here the genomic distribution and the properties of genomic regions with unusually high and low levels of population differentiation in humans to assess the influence of selective and neutral processes on human genetic structure. Methods Individual SNPs of the Human Genome Diversity Panel (HGDP) showing significantly high or low levels of population differentiation were detected under a hierarchical-island model (HIM). A Hidden Markov Model allowed us to detect genomic regions or islands of high or low population differentiation. Results Under the HIM, only 1.5% of all SNPs are significant at the 1% level, but their genomic spatial distribution is significantly non-random. We find evidence that local adaptation shaped high-differentiation islands, as they are enriched for non-synonymous SNPs and overlap with previously identified candidate regions for positive selection. Moreover there is a negative relationship between the size of islands and recombination rate, which is stronger for islands overlapping with genes. Gene ontology analysis supports the role of diet as a major selective pressure in those highly differentiated islands. Low-differentiation islands are also enriched for non-synonymous SNPs, and contain an overly high proportion of genes belonging to the 'Oncogenesis' biological process. Conclusions Even though selection seems to be acting in shaping islands of high population differentiation, neutral demographic processes might have promoted the appearance of some genomic islands since i) as much as 20% of islands are in non-genic regions ii) these non-genic islands are on average two times shorter than genic islands, suggesting a more rapid erosion by recombination, and iii) most loci are strongly differentiated between Africans and non-Africans, a result consistent with known human demographic history.
Resumo:
The recently accomplished complete genomic sequence analysis of the type strain PG1 of Mycoplasma mycoides subsp. mycoides small-colony type revealed four large repeated segments of 24, 13, 12, and 8 kb that are flanked by insertion sequence (IS) elements. Genetic analysis of type strain PG1 and African, European, and Australian field and vaccine strains revealed that the 24-kb genetic locus is repeated only in PG1 and not in other M. mycoides subsp. mycoides SC strains. In contrast, the 13-kb genetic locus was found duplicated in some strains originating from Africa and Australia but not in strains that were isolated from the European outbreaks. The 12- and 8-kb genetic loci were found in two and three copies, respectively, in all 28 strains analyzed. The flanking IS elements are assumed to lead to these tandem duplications, thus contributing to genomic plasticity. This aspect must be considered when designing novel diagnostic approaches and recombinant vaccines.
Resumo:
High-throughput gene expression technologies such as microarrays have been utilized in a variety of scientific applications. Most of the work has been on assessing univariate associations between gene expression with clinical outcome (variable selection) or on developing classification procedures with gene expression data (supervised learning). We consider a hybrid variable selection/classification approach that is based on linear combinations of the gene expression profiles that maximize an accuracy measure summarized using the receiver operating characteristic curve. Under a specific probability model, this leads to consideration of linear discriminant functions. We incorporate an automated variable selection approach using LASSO. An equivalence between LASSO estimation with support vector machines allows for model fitting using standard software. We apply the proposed method to simulated data as well as data from a recently published prostate cancer study.
Resumo:
The advent of experimental techniques capable of probing biomolecules and cells at high levels of resolution has led to a rapid change in the methods used for the analysis of experimental molecular biology data. In this article we give an overview over visualization techniques and methods that can be used to assess various aspects of genomic data.
Resumo:
The last few years have seen the advent of high-throughput technologies to analyze various properties of the transcriptome and proteome of several organisms. The congruency of these different data sources, or lack thereof, can shed light on the mechanisms that govern cellular function. A central challenge for bioinformatics research is to develop a unified framework for combining the multiple sources of functional genomics information and testing associations between them, thus obtaining a robust and integrated view of the underlying biology. We present a graph theoretic approach to test the significance of the association between multiple disparate sources of functional genomics data by proposing two statistical tests, namely edge permutation and node label permutation tests. We demonstrate the use of the proposed tests by finding significant association between a Gene Ontology-derived "predictome" and data obtained from mRNA expression and phenotypic experiments for Saccharomyces cerevisiae. Moreover, we employ the graph theoretic framework to recast a surprising discrepancy presented in Giaever et al. (2002) between gene expression and knockout phenotype, using expression data from a different set of experiments.
Resumo:
DNA sequence copy number has been shown to be associated with cancer development and progression. Array-based Comparative Genomic Hybridization (aCGH) is a recent development that seeks to identify the copy number ratio at large numbers of markers across the genome. Due to experimental and biological variations across chromosomes and across hybridizations, current methods are limited to analyses of single chromosomes. We propose a more powerful approach that borrows strength across chromosomes and across hybridizations. We assume a Gaussian mixture model, with a hidden Markov dependence structure, and with random effects to allow for intertumoral variation, as well as intratumoral clonal variation. For ease of computation, we base estimation on a pseudolikelihood function. The method produces quantitative assessments of the likelihood of genetic alterations at each clone, along with a graphical display for simple visual interpretation. We assess the characteristics of the method through simulation studies and through analysis of a brain tumor aCGH data set. We show that the pseudolikelihood approach is superior to existing methods both in detecting small regions of copy number alteration and in accurately classifying regions of change when intratumoral clonal variation is present.
Resumo:
Thirteen spontaneous multiple-antibiotic-resistant (Mar) mutants of Escherichia coli AG100 were isolated on Luria-Bertani (LB) agar in the presence of tetracycline (4 microg/ml). The phenotype was linked to insertion sequence (IS) insertions in marR or acrR or unstable large tandem genomic amplifications which included acrAB and which were bordered by IS3 or IS5 sequences. Five different lon mutations, not related to the Mar phenotype, were also found in 12 of the 13 mutants. Under specific selective conditions, most drug-resistant mutants appearing late on the selective plates evolved from a subpopulation of AG100 with lon mutations. That the lon locus was involved in the evolution to low levels of multidrug resistance was supported by the following findings: (i) AG100 grown in LB broth had an important spontaneous subpopulation (about 3.7x10(-4)) of lon::IS186 mutants, (ii) new lon mutants appeared during the selection on antibiotic-containing agar plates, (iii) lon mutants could slowly grow in the presence of low amounts (about 2x MIC of the wild type) of chloramphenicol or tetracycline, and (iv) a lon mutation conferred a mutator phenotype which increased IS transposition and genome rearrangements. The association between lon mutations and mutations causing the Mar phenotype was dependent on the medium (LB versus MacConkey medium) and the antibiotic used for the selection. A previously reported unstable amplifiable high-level resistance observed after the prolonged growth of Mar mutants in a low concentration of tetracycline or chloramphenicol can be explained by genomic amplification.
Resumo:
A key energy-saving adaptation to chronic hypoxia that enables cardiomyocytes to withstand severe ischemic insults is hibernation, i.e., a reversible arrest of contractile function. Whereas hibernating cardiomyocytes represent the critical reserve of dysfunctional cells that can be potentially rescued, a lack of a suitable animal model has hampered insights on this medically important condition. We developed a transgenic mouse system for conditional induction of long-term hibernation and a system to rescue hibernating cardiomyocytes at will. Via myocardium-specific induction (and, in turn, deinduction) of a VEGF-sequestering soluble receptor, we show that VEGF is indispensable for adjusting the coronary vasculature to match increased oxygen consumption and exploit this finding to generate a hypoperfused heart. Importantly, ensuing ischemia is tunable to a level at which large cohorts of cardiomyocytes are driven to enter a hibernation mode, without cardiac cell death. Relieving the VEGF blockade even months later resulted in rapid revascularization and full recovery of contractile function. Furthermore, we show that left ventricular remodeling associated with hibernation is also fully reversible. The unique opportunity to uncouple hibernation from other ischemic heart phenotypes (e.g., infarction) was used to determine the genetic program of hibernation; uncovering hypoxia-inducible factor target genes associated with metabolic adjustments and induced expression of several cardioprotective genes. Autophagy, specifically self-digestion of mitochondria, was identified as a key prosurvival mechanism in hibernating cardiomyocytes. This system may lend itself for examining the potential utility of treatments to rescue dysfunctional cardiomyocytes and reverse maladaptive remodeling.
Resumo:
Mutations in the FBN1 gene are the major cause of Marfan syndrome (MFS), an autosomal dominant connective tissue disorder, which displays variable manifestations in the cardiovascular, ocular, and skeletal systems. Current molecular genetic testing of FBN1 may miss mutations in the promoter region or in other noncoding sequences as well as partial or complete gene deletions and duplications. In this study, we tested for copy number variations by successively applying multiplex ligation-dependent probe amplification (MLPA) and the Affymetrix Human Mapping 500 K Array Set, which contains probes for approximately 500,000 single-nucleotide polymorphisms (SNPs) across the genome. By analyzing genomic DNA of 101 unrelated individuals with MFS or related phenotypes in whom standard genetic testing detected no mutation, we identified FBN1 deletions in two patients with MFS. Our high-resolution approach narrowed down the deletion breakpoints. Subsequent sequencing of the junctional fragments revealed the deletion sizes of 26,887 and 302,580 bp, respectively. Surprisingly, both deletions affect the putative regulatory and promoter region of the FBN1 gene, strongly indicating that they abolish transcription of the deleted allele. This expectation of complete loss of function of one allele, i.e. true haploinsufficiency, was confirmed by transcript analyses. Our findings not only emphasize the importance of screening for large genomic rearrangements in comprehensive genetic testing of FBN1 but, importantly, also extend the molecular etiology of MFS by providing hitherto unreported evidence that true haploinsufficiency is sufficient to cause MFS.
Resumo:
Hardwoods comprise about half of the biomass of forestlands in North America and present many uses including economic, ecological and aesthetic functions. Forest trees rely on the genetic variation within tree populations to overcome the many biotic, abiotic, anthropogenic factors which are further worsened by climate change, that threaten their continued survival and functionality. To harness these inherent genetic variations of tree populations, informed knowledge of the genomic resources and techniques, which are currently lacking or very limited, are imperative for forest managers. The current study therefore aimed to develop genomic microsatellite markers for the leguminous tree species, honey locust, Gleditsia triacanthos L. and test their applicability in assessing genetic variation, estimation of gene flow patterns and identification of a full-sib mapping population. We also aimed to test the usefulness of already developed nuclear and gene-based microsatellite markers in delineation of species and taxonomic relationships between four of the taxonomically difficult Section Lobatae species (Quercus coccinea, Q. ellipsoidalis, Q. rubra and Q. velutina. We recorded 100% amplification of G. triacanthos genomic microsatellites developed using Illumina sequencing techniques in a panel of seven unrelated individuals with 14 of these showing high polymorphism and reproducibility. When characterized in 36 natural population samples, we recorded 20 alleles per locus with no indication for null alleles at 13 of the 14 microsatellites. This is the first report of genomic microsatellites for this species. Honey locust trees occur in fragmented populations of abandoned farmlands and pastures and is described as essentially dioecious. Pollen dispersal if the main source of gene flow within and between populations with the ability to offset the effects of random genetic drift. Factors known to influence gene include fragmentation and degree of isolation, which make the patterns gene flow in fragmented populations of honey locust a necessity for their sustainable management. In this follow-up study, we used a subset of nine of the 14 developed gSSRs to estimate gene flow and identify a full-sib mapping population in two isolated fragments of honey locust. Our analyses indicated that the majority of the seedlings (65-100% - at both strict and relaxed assignment thresholds) were sired by pollen from outside the two fragment populations. Only one selfing event was recorded confirming the functional dioeciousness of honey locust and that the seed parents are almost completely outcrossed. From the Butternut Valley, TN population, pollen donor genotypes were reconstructed and used in paternity assignment analyses to identify a relatively large full-sib family comprised of 149 individuals, proving the usefulness of isolated forest fragments in identification of full-sib families. In the Ames Plantation stand, contemporary pollen dispersal followed a fat-tailed exponential-power distribution, an indication of effective gene flow. Our estimate of δ was 4,282.28 m, suggesting that insect pollinators of honey locust disperse pollen over very long distances. The high proportion of pollen influx into our sampled population implies that our fragment population forms part of a large effectively reproducing population. The high tendency of oak species to hybridize while still maintaining their species identity make it difficult to resolve their taxonomic relationships. Oaks of the section Lobatae are famous in this regard and remain unresolved at both morphological and genetic markers. We applied 28 microsatellite markers including outlier loci with potential roles in reproductive isolation and adaptive divergence between species to natural populations of four known interfertile red oaks, Q. coccinea, Q. ellpsoidalis, Q. rubra and Q. velutina. To better resolve the taxonomic relationships in this difficult clade, we assigned individual samples to species, identified hybrids and introgressive forms and reconstructed phylogenetic relationships among the four species after exclusion of genetically intermediate individuals. Genetic assignment analyses identified four distinct species clusters, with Q. rubra most differentiated from the three other species, but also with a comparatively large number of misclassified individuals (7.14%), hybrids (7.14%) and introgressive forms (18.83%) between Q. ellipsoidalis and Q. velutina. After the exclusion of genetically intermediate individuals, Q. ellipsoidalis grouped as sister species to the largely parapatric Q. coccinea with high bootstrap support (91 %). Genetically intermediate forms in a mixed species stand were located proximate to both potential parental species, which supports recent hybridization of Q. velutina with both Q. ellipsoidalis and Q. rubra. Analyses of genome-wide patterns of interspecific differentiation can provide a better understanding of speciation processes and taxonomic relationships in this taxonomically difficult group of red oak species.
Resumo:
Unique and shared cytogenetic abnormalities have been documented for marginal zone lymphomas (MZLs) arising at different sites. Recently, homozygous deletions of the chromosomal band 6q23, involving the tumor necrosis factor alpha-induced protein 3 (TNFAIP3, A20) gene, a negative regulator of NF-kappaB, were described in ocular adnexal MZL, suggesting a role for A20 as a tumor suppressor in this disease. Here, we investigated inactivation of A20 by DNA mutations or deletions in a panel of extranodal MZL (EMZL), nodal MZL (NMZL), and splenic MZL (SMZL). Inactivating mutations encoding truncated A20 proteins were identified in 6 (19%) of 32 MZLs, including 2 (18%) of 11 EMZLs, 3 (33%) of 9 NMZLs, and 1 (8%) of 12 SMZLs. Two additional unmutated nonsplenic MZLs also showed monoallelic or biallelic A20 deletions by fluorescent in situ hybridization (FISH) and/or SNP-arrays. Thus, A20 inactivation by either somatic mutation and/or deletion represents a common genetic aberration across all MZL subtypes, which may contribute to lymphomagenesis by inducing constitutive NF-kappaB activation.