896 resultados para Whole genome mapping
Resumo:
Plant reproduction depends on the concerted activation of many genes to ensure correct communication between pollen and pistil. Here, we queried the whole transcriptome of Arabidopsis (Arabidopsis thaliana) in order to identify genes with specific reproductive functions. We used the Affymetrix ATH1 whole genome array to profile wild-type unpollinated pistils and unfertilized ovules. By comparing the expression profile of pistils at 0.5, 3.5, and 8.0 h after pollination and applying a number of statistical and bioinformatics criteria, we found 1,373 genes differentially regulated during pollen-pistil interactions. Robust clustering analysis grouped these genes in 16 time-course clusters representing distinct patterns of regulation. Coregulation within each cluster suggests the presence of distinct genetic pathways, which might be under the control of specific transcriptional regulators. A total of 78% of the regulated genes were expressed initially in unpollinated pistil and/or ovules, 15% were initially detected in the pollen data sets as enriched or preferentially expressed, and 7% were induced upon pollination. Among those, we found a particular enrichment for unknown transcripts predicted to encode secreted proteins or representing signaling and cell wall-related proteins, which may function by remodeling the extracellular matrix or as extracellular signaling molecules. A strict regulatory control in various metabolic pathways suggests that fine-tuning of the biochemical and physiological cellular environment is crucial for reproductive success. Our study provides a unique and detailed temporal and spatial gene expression profile of in vivo pollen-pistil interactions, providing a framework to better understand the basis of the molecular mechanisms operating during the reproductive process in higher plants.
Resumo:
Klebsiella pneumoniae U25 is a multidrug resistant strain isolated from a tertiary care hospital in Chennai, India. Here, we report the complete annotated genome sequence of strain U25 obtained using PacBio RSII. This is the first report of the whole genome of K. pneumoniae species from Chennai. It consists of a single circular chromosome of size 5,491,870-bp and two plasmids of size 211,813 and 172,619-bp. The genes associated with multidrug resistance were identified. The chromosome of U25 was found to have eight antibiotic resistant genes [blaOXA-1, blaSHV-28, aac(6’)1b-cr, catB3, oqxAB, dfrA1]. The plasmid pMGRU25-001 was found to have only one resistant gene (catA1) while plasmid pMGRU25-002 had 20 resistant genes [strAB, aadA1, aac(6’)-Ib, aac(3)-IId, sul1,2, blaTEM-1A,1B, blaOXA-9, blaCTX-M-15, blaSHV-11, cmlA1, erm(B), mph(A)]. A mutation in the porin OmpK36 was identified which is likely to be associated with the intermediate resistance to carbapenems in the absence of carbapenemase genes. U25 is one of the few K. pneumoniae strains to harbour clustered regularly interspaced short palindromic repeats (CRISPR) systems. Two CRISPR arrays corresponding to Cas3 family helicase were identified in the genome. When compared to K. pneumoniae NTUHK2044, a transposase gene InsH of IS5-13 was found inserted.
Resumo:
2016
Resumo:
2016
Resumo:
Cytosine methylation is important for transposon silencing and epigenetic regulation of endogenous genes, although the extent to which this DNA modification functions to regulate the genome is still unknown. Here we report the first comprehensive DNA methylation map of an entire genome, at 35 base pair resolution, using the flowering plant Arabidopsis thaliana as a model. We find that pericentromeric heterochromatin, repetitive sequences, and regions producing small interfering RNAs are heavily methylated. Unexpectedly, over one-third of expressed genes contain methylation within transcribed regions, whereas only approximately 5% of genes show methylation within promoter regions. Interestingly, genes methylated in transcribed regions are highly expressed and constitutively active, whereas promoter-methylated genes show a greater degree of tissue-specific expression. Whole-genome tiling-array transcriptional profiling of DNA methyltransferase null mutants identified hundreds of genes and intergenic noncoding RNAs with altered expression levels, many of which may be epigenetically controlled by DNA methylation.
Resumo:
A whole-genome scan was conducted to map quantitative trait loci (QTL) for BSE resistance or susceptibility. Cows from four half-sib families were included and 173 microsatellite markers were used to construct a 2835-cM (Kosambi) linkage map covering 29 autosomes and the pseudoautosomal region of the sex chromosome. Interval mapping by linear regression was applied and extended to a multiple-QTL analysis approach that used identified QTL on other chromosomes as cofactors to increase mapping power. In the multiple-QTL analysis, two genome-wide significant QTL (BTA17 and X/Y ps) and four genome-wide suggestive QTL (BTA1, 6, 13, and 19) were revealed. The QTL identified here using linkage analysis do not overlap with regions previously identified using TDT analysis. One factor that may explain the disparity between the results is that a more extensive data set was used in the present study. Furthermore, methodological differences between TDT and linkage analyses may affect the power of these approaches.
Resumo:
Genome-wide association studies (GWAS) have identified around 60 common variants associated with multiple sclerosis (MS), but these loci only explain a fraction of the heritability of MS. Some missing heritability may be caused by rare variants that have been suggested to play an important role in the aetiology of complex diseases such as MS. However current genetic and statistical methods for detecting rare variants are expensive and time consuming. 'Population-based linkage analysis' (PBLA) or so called identity-by-descent (IBD) mapping is a novel way to detect rare variants in extant GWAS datasets. We employed BEAGLE fastIBD to search for rare MS variants utilising IBD mapping in a large GWAS dataset of 3,543 cases and 5,898 controls. We identified a genome-wide significant linkage signal on chromosome 19 (LOD = 4.65; p = 1.9×10-6). Network analysis of cases and controls sharing haplotypes on chromosome 19 further strengthened the association as there are more large networks of cases sharing haplotypes than controls. This linkage region includes a cluster of zinc finger genes of unknown function. Analysis of genome wide transcriptome data suggests that genes in this zinc finger cluster may be involved in very early developmental regulation of the CNS. Our study also indicates that BEAGLE fastIBD allowed identification of rare variants in large unrelated population with moderate computational intensity. Even with the development of whole-genome sequencing, IBD mapping still may be a promising way to narrow down the region of interest for sequencing priority. © 2013 Lin et al.
Resumo:
The successful completion of the Human Genome Project (HGP) was an unprecedented scientific advance that has become an invaluable resource in the search for genes that cause monogenic and common (polygenic) diseases. Prior to the HGP, linkage analysis had successfully mapped many disease genes for monogenic disorders; however, the limitations of this approach were particularly evident for identifying causative genes in rare genetic disorders affecting lifespan and/or reproductive fitness, such as skeletal dysplasias. In this review, we illustrate the challenges of mapping disease genes in such conditions through the ultra-rare disorder fibrodysplasia ossificans progressiva (FOP) and we discuss the advances that are being made through current massively parallel (“next generation”) sequencing (MPS) technologies.
Resumo:
To gain insight into the mechanisms by which the Myb transcription factor controls normal hematopoiesis and particularly, how it contributes to leukemogenesis, we mapped the genome-wide occupancy of Myb by chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) in ERMYB myeloid progenitor cells. By integrating the genome occupancy data with whole genome expression profiling data, we identified a Myb-regulated transcriptional program. Gene signatures for leukemia stem cells, normal hematopoietic stem/progenitor cells and myeloid development were overrepresented in 2368 Myb regulated genes. Of these, Myb bound directly near or within 793 genes. Myb directly activates some genes known critical in maintaining hematopoietic stem cells, such as Gfi1 and Cited2. Importantly, we also show that, despite being usually considered as a transactivator, Myb also functions to repress approximately half of its direct targets, including several key regulators of myeloid differentiation, such as Sfpi1 (also known as Pu.1), Runx1, Junb and Cebpb. Furthermore, our results demonstrate that interaction with p300, an established coactivator for Myb, is unexpectedly required for Myb-mediated transcriptional repression. We propose that the repression of the above mentioned key pro-differentiation factors may contribute essentially to Myb's ability to suppress differentiation and promote self-renewal, thus maintaining progenitor cells in an undifferentiated state and promoting leukemic transformation. © 2011 The Author(s).
Resumo:
The sequential nature of gel-based marker systems entails low throughput and high costs per assay. Commonly used marker systems such as SSR and SNP are also dependent on sequence information. These limitations result in high cost per data point and significantly limit the capacity of breeding programs to obtain sufficient return on investment to justify the routine use of marker-assisted breeding for many traits and particularly quantitative traits. Diversity Arrays Technology (DArT™) is a cost effective hybridisation-based marker technology that offers a high multiplexing level while being independent of sequence information. This technology offers sorghum breeding programs an alternative approach to whole-genome profiling. We report on the development, application, mapping and utility of DArT™ markers for sorghum germplasm. Results: A genotyping array was developed representing approximately 12,000 genomic clones using PstI+BanII complexity with a subset of clones obtained through the suppression subtractive hybridisation (SSH) method. The genotyping array was used to analyse a diverse set of sorghum genotypes and screening a Recombinant Inbred Lines (RIL) mapping population. Over 500 markers detected variation among 90 accessions used in a diversity analysis. Cluster analysis discriminated well between all 90 genotypes. To confirm that the sorghum DArT markers behave in a Mendelian manner, we constructed a genetic linkage map for a cross between R931945-2-2 and IS 8525 integrating DArT and other marker types. In total, 596 markers could be placed on the integrated linkage map, which spanned 1431.6 cM. The genetic linkage map had an average marker density of 1/2.39 cM, with an average DArT marker density of 1/3.9 cM. Conclusion: We have successfully developed DArT markers for Sorghum bicolor and have demonstrated that DArT provides high quality markers that can be used for diversity analyses and to construct medium-density genetic linkage maps. The high number of DArT markers generated in a single assay not only provides a precise estimate of genetic relationships among genotypes, but also their even distribution over the genome offers real advantages for a range of molecular breeding and genomics applications.
Resumo:
With complete sets of chromosome-specific painting probes derived from flow-sorted chromosomes of human and grey squirrel (Sciurus carolinensis), the whole genome homologies between human and representatives of tree squirrels (Sciurus carolinensis, Callosciurus erythraeus), flying squirrels (Petaurista albiventer) and chipmunks (Tamias sibiricus) have been defined by cross-species chromosome painting. The results show that, unlike the highly rearranged karyotypes of mouse and rat, the karyotypes of squirrels are highly conserved. Two methods have been used to reconstruct the genome phylogeny of squirrels with the laboratory rabbit (Oryctolagus cuniculus) as the out-group: ( 1) phylogenetic analysis by parsimony using chromosomal characters identified by comparative cytogenetic approaches; ( 2) mapping the genome rearrangements onto recently published sequence-based molecular trees. Our chromosome painting results, in combination with molecular data, show that flying squirrels are phylogenetically close to New World tree squirrels. Chromosome painting and G-banding comparisons place chipmunks ( Tamias sibiricus), with a derived karyotype, outside the clade comprising tree and flying squirrels. The superorder Glires (order Rodentia + order Lagomorpha) is firmly supported by two conserved syntenic associations between human chromosomes 1 and 10p homologues, and between 9 and 11 homologues.
Resumo:
The Indian muntjac (Muntiacus muntjak vaginalis) has a karyotype of 2n=6 in the female and 7 in the male, the karyotypic evolution of which through extensive tandem fusions and several centric fusions has been well-documented by recent molecular cytogenetic studies. In an attempt to define the fusion orientations of conserved chromosomal segments and the molecular mechanisms underlying the tandem fusions, we have constructed a highly redundant (more than six times of whole genome coverage) bacterial artificial chromosome (BAC) library of Indian muntjac. The BAC library contains 124,800 clones with no chromosome bias and has an average insert DNA size of 120 kb. A total of 223 clones have been mapped by fluorescent in situ hybridization onto the chromosomes of both Indian muntjac and Chinese muntjac and a high-resolution comparative map has been established. Our mapping results demonstrate that all tandem fusions that occurred during the evolution of Indian muntjac karyotype from the acrocentric 2n=70 hypothetical ancestral karyotype are centromere-telomere (head-tail) fusions.
Resumo:
Chromatin immunoprecipitation (ChIP) allows enrichment of genomic regions which are associated with specific transcription factors, histone modifications, and indeed any other epitopes which are present on chromatin. The original ChIP methods used site-specific PCR and Southern blotting to confirm which regions of the genome were enriched, on a candidate basis. The combination of ChIP with genomic tiling arrays (ChIP-chip) allowed a more unbiased approach to map ChIP-enriched sites. However, limitations of microarray probe design and probe number have a detrimental impact on the coverage, resolution, sensitivity, and cost of whole-genome tiling microarray sets for higher eukaryotes with large genomes. The combination of ChIP with high-throughput sequencing technology has allowed more comprehensive surveys of genome occupancy, greater resolution, and lower cost for whole genome coverage. Herein, we provide a comparison of high-throughput sequencing platforms and a survey of ChIP-seq analysis tools, discuss experimental design, and describe a detailed ChIP-seq method.Chromatin immunoprecipitation (ChIP) allows enrichment of genomic regions which are associated with specific transcription factors, histone modifications, and indeed any other epitopes which are present on chromatin. The original ChIP methods used site-specific PCR and Southern blotting to confirm which regions of the genome were enriched, on a candidate basis. The combination of ChIP with genomic tiling arrays (ChIP-chip) allowed a more unbiased approach to map ChIP-enriched sites. However, limitations of microarray probe design and probe number have a detrimental impact on the coverage, resolution, sensitivity, and cost of whole-genome tiling microarray sets for higher eukaryotes with large genomes. The combination of ChIP with high-throughput sequencing technology has allowed more comprehensive surveys of genome occupancy, greater resolution, and lower cost for whole genome coverage. Herein, we provide a comparison of high-throughput sequencing platforms and a survey of ChIP-seq analysis tools, discuss experimental design, and describe a detailed ChIP-seq method.
Resumo:
Numerous CCT domain genes are known to control flowering in plants. They belong to the CONSTANS-like (COL) and PREUDORESPONSE REGULATOR (PRR) gene families, which in addition to a CCT domain possess B-box or response-regulator domains, respectively. Ghd7 is the most recently identified COL gene to have a proven role in the control of flowering time in the Poaceae. However, as it lacks B-box domains, its inclusion within the COL gene family, technically, is incorrect. Here, we show Ghd7 belongs to a larger family of previously uncharacterized Poaceae genes which possess just a single CCT domain, termed here CCT MOTIF FAMILY (CMF) genes. We molecularly describe the CMF (and related COL and PRR) gene families in four sequenced Poaceae species, as well as in the draft genome assembly of barley (Hordeum vulgare). Genetic mapping of the ten barley CMF genes identified, as well as twelve previously unmapped HvCOL and HvPRR genes, finds the majority map to colinear positions relative to their Poaceae orthologues. Combined inter-/intra-species comparative and phylogenetic analysis of CMF, COL and PRR gene families indicates they evolved prior to the monocot/dicot divergence ~200 mya, with Poaceae CMF evolution described as the interplay between whole genome duplication in the ancestral cereal, and subsequent clade-specific mutation, deletion and duplication events. Given the proven role of CMF genes in the modulation of cereals flowering, the molecular, phylogenetic and comparative analysis of the Poaceae CMF, COL and PRR gene families presented here provides the foundation from which functional investigation can be undertaken.
Resumo:
This thesis develops and evaluates statistical methods for different types of genetic analyses, including quantitative trait loci (QTL) analysis, genome-wide association study (GWAS), and genomic evaluation. The main contribution of the thesis is to provide novel insights in modeling genetic variance, especially via random effects models. In variance component QTL analysis, a full likelihood model accounting for uncertainty in the identity-by-descent (IBD) matrix was developed. It was found to be able to correctly adjust the bias in genetic variance component estimation and gain power in QTL mapping in terms of precision. Double hierarchical generalized linear models, and a non-iterative simplified version, were implemented and applied to fit data of an entire genome. These whole genome models were shown to have good performance in both QTL mapping and genomic prediction. A re-analysis of a publicly available GWAS data set identified significant loci in Arabidopsis that control phenotypic variance instead of mean, which validated the idea of variance-controlling genes. The works in the thesis are accompanied by R packages available online, including a general statistical tool for fitting random effects models (hglm), an efficient generalized ridge regression for high-dimensional data (bigRR), a double-layer mixed model for genomic data analysis (iQTL), a stochastic IBD matrix calculator (MCIBD), a computational interface for QTL mapping (qtl.outbred), and a GWAS analysis tool for mapping variance-controlling loci (vGWAS).