896 resultados para Whole genome mapping


Relevância:

90.00% 90.00%

Publicador:

Resumo:

A novel canine muscular dystrophy in Landseer dogs was observed. We had access to five affected dogs from two litters. The clinical signs started at a few weeks of age and the severe progressive muscle weakness led to euthanasia between 5 and 15 months of age. The pedigrees of the affected dogs suggested a monogenic autosomal recessive inheritance of the trait. Linkage and homozygosity mapping indicated two potential genome segments for the causative variant on chromosomes 10 and 31 harboring a total of 4.8 Mb of DNA or 0.2% of the canine genome. Using the illumina sequencing technology we obtained a whole genome sequence from one affected Landseer. Variants were called with respect to the dog reference genome and compared to the genetic variants of 170 control dogs from other breeds. The affected Landseer dog was homozygous for a single private non-synonymous variant in the critical intervals, a nonsense variant in the COL6A1 gene (Chr31:39,303,964G>T; COL6A1:c.289G>T; p.E97*). Genotypes at this variant showed perfect concordance with the muscular dystrophy phenotype in all five cases and more than one thousand control dogs. Variants in the human COL6A1 gene cause Bethlem myopathy or Ullrich congenital muscular dystrophy. We therefore conclude that the identified canine COL6A1 variant is most likely causative for the observed muscular dystrophy in Landseer dogs. Based on the nature of the genetic variant in Landseer dogs and their severe clinical phenotype these dogs represent a model for human Ullrich congenital muscular dystrophy.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We observed a hereditary phenotype in Alaskan Huskies, which was characterized by polyneuropathy with ocular abnormalities and neuronal vacuolation (POANV). The affected dogs developed a progressive severe ataxia, which led to euthanasia between 8 and 16 months of age. The pedigrees were consistent with a monogenic autosomal recessive inheritance. We localized the causative genetic defect to a 4 Mb interval on chromosome 19 by a combined linkage and homozygosity mapping approach. Whole genome sequencing of one affected dog, an obligate carrier and an unrelated control revealed a 218 bp SINE insertion into exon 7 of the RAB3GAP1 gene. The SINE insertion was perfectly associated with the disease phenotype in a cohort of 43 Alaskan Huskies and it was absent from 541 control dogs of diverse other breeds. The SINE insertion induced aberrant splicing and led to a transcript with a greatly altered exon 7. RAB3GAP1 loss-of-function variants in humans cause Warburg Micro Syndrome 1 (WARBM1), which is characterized by additional developmental defects compared to canine POANV, whereas Rab3gap1 deficient mice have a much milder phenotype than either humans or dogs. Thus the RAB3GAP1 mutant Alaskan Huskies provide an interesting intermediate phenotype that may help to better understand the function of RAB3GAP1 in development. Furthermore, the identification of the presumed causative genetic variant will enable genetic testing to avoid the non-intentional breeding of affected dogs.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Whole exome sequencing (WES) is increasingly used in research and diagnostics. WES users expect coverage of the entire coding region of known genes as well as sufficient read depth for the covered regions. It is, however, unknown which recent WES platform is most suitable to meet these expectations. We present insights into the performance of the most recent standard exome enrichment platforms from Agilent, NimbleGen and Illumina applied to six different DNA samples by two sequencing vendors per platform. Our results suggest that both Agilent and NimbleGen overall perform better than Illumina and that the high enrichment performance of Agilent is stable among samples and between vendors, whereas NimbleGen is only able to achieve vendor- and sample-specific best exome coverage. Moreover, the recent Agilent platform overall captures more coding exons with sufficient read depth than NimbleGen and Illumina. Due to considerable gaps in effective exome coverage, however, the three platforms cannot capture all known coding exons alone or in combination, requiring improvement. Our data emphasize the importance of evaluation of updated platform versions and suggest that enrichment-free whole genome sequencing can overcome the limitations of WES in sufficiently covering coding exons, especially GC-rich regions, and in characterizing structural variants.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Klebsiella pneumoniaesequence type (ST) 307, carryingblaKPC-3,blaCTX-M-15,blaOXA-1,aac(6')-Ib-cr, andqnrB1 genes, is replacing the predominant hyperepidemic ST258 clone in Italy. Whole-genome and complete plasmid sequencing of one ST307 strain was performed and new features were identified.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Linkage and association studies are major analytical tools to search for susceptibility genes for complex diseases. With the availability of large collection of single nucleotide polymorphisms (SNPs) and the rapid progresses for high throughput genotyping technologies, together with the ambitious goals of the International HapMap Project, genetic markers covering the whole genome will be available for genome-wide linkage and association studies. In order not to inflate the type I error rate in performing genome-wide linkage and association studies, multiple adjustment for the significant level for each independent linkage and/or association test is required, and this has led to the suggestion of genome-wide significant cut-off as low as 5 × 10 −7. Almost no linkage and/or association study can meet such a stringent threshold by the standard statistical methods. Developing new statistics with high power is urgently needed to tackle this problem. This dissertation proposes and explores a class of novel test statistics that can be used in both population-based and family-based genetic data by employing a completely new strategy, which uses nonlinear transformation of the sample means to construct test statistics for linkage and association studies. Extensive simulation studies are used to illustrate the properties of the nonlinear test statistics. Power calculations are performed using both analytical and empirical methods. Finally, real data sets are analyzed with the nonlinear test statistics. Results show that the nonlinear test statistics have correct type I error rates, and most of the studied nonlinear test statistics have higher power than the standard chi-square test. This dissertation introduces a new idea to design novel test statistics with high power and might open new ways to mapping susceptibility genes for complex diseases. ^

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In population studies, most current methods focus on identifying one outcome-related SNP at a time by testing for differences of genotype frequencies between disease and healthy groups or among different population groups. However, testing a great number of SNPs simultaneously has a problem of multiple testing and will give false-positive results. Although, this problem can be effectively dealt with through several approaches such as Bonferroni correction, permutation testing and false discovery rates, patterns of the joint effects by several genes, each with weak effect, might not be able to be determined. With the availability of high-throughput genotyping technology, searching for multiple scattered SNPs over the whole genome and modeling their joint effect on the target variable has become possible. Exhaustive search of all SNP subsets is computationally infeasible for millions of SNPs in a genome-wide study. Several effective feature selection methods combined with classification functions have been proposed to search for an optimal SNP subset among big data sets where the number of feature SNPs far exceeds the number of observations. ^ In this study, we take two steps to achieve the goal. First we selected 1000 SNPs through an effective filter method and then we performed a feature selection wrapped around a classifier to identify an optimal SNP subset for predicting disease. And also we developed a novel classification method-sequential information bottleneck method wrapped inside different search algorithms to identify an optimal subset of SNPs for classifying the outcome variable. This new method was compared with the classical linear discriminant analysis in terms of classification performance. Finally, we performed chi-square test to look at the relationship between each SNP and disease from another point of view. ^ In general, our results show that filtering features using harmononic mean of sensitivity and specificity(HMSS) through linear discriminant analysis (LDA) is better than using LDA training accuracy or mutual information in our study. Our results also demonstrate that exhaustive search of a small subset with one SNP, two SNPs or 3 SNP subset based on best 100 composite 2-SNPs can find an optimal subset and further inclusion of more SNPs through heuristic algorithm doesn't always increase the performance of SNP subsets. Although sequential forward floating selection can be applied to prevent from the nesting effect of forward selection, it does not always out-perform the latter due to overfitting from observing more complex subset states. ^ Our results also indicate that HMSS as a criterion to evaluate the classification ability of a function can be used in imbalanced data without modifying the original dataset as against classification accuracy. Our four studies suggest that Sequential Information Bottleneck(sIB), a new unsupervised technique, can be adopted to predict the outcome and its ability to detect the target status is superior to the traditional LDA in the study. ^ From our results we can see that the best test probability-HMSS for predicting CVD, stroke,CAD and psoriasis through sIB is 0.59406, 0.641815, 0.645315 and 0.678658, respectively. In terms of group prediction accuracy, the highest test accuracy of sIB for diagnosing a normal status among controls can reach 0.708999, 0.863216, 0.639918 and 0.850275 respectively in the four studies if the test accuracy among cases is required to be not less than 0.4. On the other hand, the highest test accuracy of sIB for diagnosing a disease among cases can reach 0.748644, 0.789916, 0.705701 and 0.749436 respectively in the four studies if the test accuracy among controls is required to be at least 0.4. ^ A further genome-wide association study through Chi square test shows that there are no significant SNPs detected at the cut-off level 9.09451E-08 in the Framingham heart study of CVD. Study results in WTCCC can only detect two significant SNPs that are associated with CAD. In the genome-wide study of psoriasis most of top 20 SNP markers with impressive classification accuracy are also significantly associated with the disease through chi-square test at the cut-off value 1.11E-07. ^ Although our classification methods can achieve high accuracy in the study, complete descriptions of those classification results(95% confidence interval or statistical test of differences) require more cost-effective methods or efficient computing system, both of which can't be accomplished currently in our genome-wide study. We should also note that the purpose of this study is to identify subsets of SNPs with high prediction ability and those SNPs with good discriminant power are not necessary to be causal markers for the disease.^

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Whole-genome duplication approximately 108 years ago was proposed as an explanation for the many duplicated chromosomal regions in Saccharomyces cerevisiae. Here we have used computer simulations and analytic methods to estimate some parameters describing the evolution of the yeast genome after this duplication event. Computer simulation of a model in which 8% of the original genes were retained in duplicate after genome duplication, and 70–100 reciprocal translocations occurred between chromosomes, produced arrangements of duplicated chromosomal regions very similar to the map of real duplications in yeast. An analytical method produced an independent estimate of 84 map disruptions. These results imply that many smaller duplicated chromosomal regions exist in the yeast genome in addition to the 55 originally reported. We also examined the possibility of determining the original order of chromosomal blocks in the ancestral unduplicated genome, but this cannot be done without information from one or more additional species. If the genome sequence of one other species (such as Kluyveromyces lactis) were known it should be possible to identify 150–200 paired regions covering the whole yeast genome and to reconstruct approximately two-thirds of the original order of blocks of genes in yeast. Rates of interchromosome translocation in yeast and mammals appear similar despite their very different rates of homologous recombination per kilobase.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We have developed high-density DNA microarrays of yeast ORFs. These microarrays can monitor hybridization to ORFs for applications such as quantitative differential gene expression analysis and screening for sequence polymorphisms. Automated scripts retrieved sequence information from public databases to locate predicted ORFs and select appropriate primers for amplification. The primers were used to amplify yeast ORFs in 96-well plates, and the resulting products were arrayed using an automated micro arraying device. Arrays containing up to 2,479 yeast ORFs were printed on a single slide. The hybridization of fluorescently labeled samples to the array were detected and quantitated with a laser confocal scanning microscope. Applications of the microarrays are shown for genetic and gene expression analysis at the whole genome level.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The ARKdb genome databases provide comprehensive public repositories for genome mapping data from farmed species and other animals (http://www.thearkdb.org) providing a resource similar in function to that offered by GDB or MGD for human or mouse genome mapping data, respectively. Because we have attempted to build a generic mapping database, the system has wide utility, particularly for those species for which development of a specific resource would be prohibitive. The ARKdb genome database model has been implemented for 10 species to date. These are pig, chicken, sheep, cattle, horse, deer, tilapia, cat, turkey and salmon. Access to the ARKdb databases is effected via the World Wide Web using the ARKdb browser and Anubis map viewer. The information stored includes details of loci, maps, experimental methods and the source references. Links to other information sources such as PubMed and EMBL/GenBank are provided. Responsibility for data entry and curation is shared amongst scientists active in genome research in the species of interest. Mirror sites in the United States are maintained in addition to the central genome server at Roslin.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The evolution of novelty in tightly integrated biological systems, such as hormones and their receptors, seems to challenge the theory of natural selection: it has not been clear how a new function for any one part (such as a ligand) can be selected for unless the other members of the system (e.g., a receptor) are already present. Here I show—based on identification and phylogenetic analysis of steroid receptors in basal vertebrates and reconstruction of the sequences and functional attributes of ancestral proteins—that the first steroid receptor was an estrogen receptor, followed by a progesterone receptor. Genome mapping and phylogenetic analyses indicate that the full complement of mammalian steroid receptors evolved from these ancient receptors by two large-scale genome expansions, one before the advent of jawed vertebrates and one after. Specific regulation of physiological processes by androgens and corticoids are relatively recent innovations that emerged after these duplications. These findings support a model of ligand exploitation in which the terminal ligand in a biosynthetic pathway is the first for which a receptor evolves; selection for this hormone also selects for the synthesis of intermediates despite the absence of receptors, and duplicated receptors then evolve affinity for these substances. In this way, novel hormone-receptor pairs are created, and an integrated system of increasing complexity elaborated. This model suggests that ligands for some “orphan” receptors may be found among intermediates in the synthesis of ligands for phylogenetically related receptors.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

"March 1994."

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The SOX family of transcription factors are found throughout the animal kingdom and are important in a variety of developmental contexts. Genome analysis has identified 20 Sox genes in human and mouse, which can be subdivided into 8 groups, based on sequence comparison and intron-exon structure. Most of the SOX groups identified in mammals are represented by a single SOX sequence in invertebrate model organisms, suggesting a duplication and divergence mechanism has operated during vertebrate evolution. We have now analysed the Sox gene complement in the pufferfish, Fugu rubripes, in order to shed further light on the diversity and origins of the Sox gene family. Major differences were found between the Sox family in Fugu and those in humans and mice. In particular, Fugu does not have orthologues of Sry, Sox,15 and Sox30, which appear to be specific to mammals, while Sox19, found in Fugu and zebrafish but absent in mammals, seems to be specific to fishes. Six mammalian Sox genes are represented by two copies each in Fugu, indicating a large-scale gene duplication in the fish lineage. These findings point to recent Sox gene loss, duplication and divergence occurring during the evolution of tetrapod and teleost lineages, and provide further evidence for large-scale segmental or a whole-genome duplication occurring early in the radiation of teleosts. (C) 2004 Elsevier B.V. All rights reserved.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Acknowledgements This study was funded by a Natural Environment Research Council grant (NERC, project code: NBAF704). FML is funded by a NERC Doctoral Training Grant (Project Reference: NE/L50175X/1). RLS was an undergraduate student at the University of Aberdeen and benefitted from financial support from the School of Biological Sciences. DJM is indebted to Dr. Steven Weiss (University of Graz, Austria), Dr. Takashi Yada (National Research Institute of Fisheries Science, Japan), Dr. Robert Devlin (Fisheries and Oceans Canada, Canada), Prof. Samuel Martin (University of Aberdeen, UK), Mr. Neil Lincoln (Environment Agency, UK) and Prof. Colin Adams/Mr. Stuart Wilson (University of Glasgow, UK) for providing salmonid material or assisting with its sampling. We are grateful to staff at the Centre for Genomics Research (University of Liverpool, UK) (i.e. NERC Biomolecular Analysis Facility – Liverpool; NBAF-Liverpool) for performing sequence capture/Illumina sequencing and providing us with details on associated methods that were incorporated into the manuscript. Finally, we are grateful to the organizers of the Society of Experimental Biology Satellite meeting 'Genome-powered perspectives in integrative physiology and evolutionary biology' (held in Prague, July 2015) for inviting us to contribute to this special edition of Marine Genomics and hosting a really stimulating meeting.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The Arabidopsis root apical meristem (RAM) is a complex tissue capable of generating all the cell types that ultimately make up the root. The work presented in this thesis takes advantage of the versatility of high-throughput sequencing to address two independent questions about the root meristem. Although a lot of information is known regarding the cell fate decisions that occur at the RAM, cortex specification and differentiation remain poorly understood. In the first part of this thesis, I used an ethylmethanesulfonate (EMS) mutagenized marker line to perform a forward genetics screen. The goal of this screen was to identify novel genes involved in the specification and differentiation of the cortex tissue. Mapping analysis from the results obtained in this screen revealed a new allele of BRASSINOSTEROID4 with abnormal marker expression in the cortex tissue. Although this allele proved to be non-cortex specific, this project highlights new technology that allows mapping of EMS-generated mutations without the need to map-cross or back-cross. In the second part of this thesis, using fluorescence activated cell sorting (FACS) coupled with high throughput sequencing, my collaborators and I generated single-base resolution whole genome DNA methylomes, mRNA transcriptomes, and smallRNA transcriptomes for six different populations of cell types in the Arabidopsis root meristem. We were able to discover that the columella is hypermethylated in the CHH context within transposable elements. This hypermethylation is accompanied by upregulation of the RNA-dependent DNA methylation pathway (RdDM), including higher levels of 24-nt silencing RNAs (siRNAs). In summary, our studies demonstrate the versatility of high-throughput sequencing as a method for identifying single mutations or to perform complex comparative genomic analyses.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Valuable genetic variation for bean breeding programs is held within the common bean secondary gene pool which consists of Phaseolus albescens, P. coccineus, P. costaricensis, and P. dumosus. However, the use of close relatives for bean improvement is limited due to the lack of knowledge about genetic variation and genetic plasticity of many of these species. Characterisation and analysis of the genetic diversity is necessary among beans' wild relatives; in addition, conflicting phylogenies and relationships need to be understood and a hypothesis of a hybrid origin of P. dumosus needs to be tested. This thesis research was orientated to generate information about the patterns of relationships among the common bean secondary gene pool, with particular focus on the species Phaseolus dumosus. This species displays a set of characteristics of agronomic interest, not only for the direct improvement of common bean but also as a source of valuable genes for adaptation to climate change. Here I undertake the first comprehensive study of the genetic diversity of P. dumosus as ascertained from both nuclear and chloroplast genome markers. A germplasm collection of the ancestral forms of P. dumosus together with wild, landrace and cultivar representatives of all other species of the common bean secondary gene pool, were used to analyse genetic diversity, phylogenetic relationships and structure of P. dumosus. Data on molecular variation was generated from sequences of cpDNA loci accD-psaI spacer, trnT-trnL spacer, trnL intron and rps14-psaB spacer and from the nrDNA the ITS region. A whole genome DArT array was developed and used for the genotyping of P. dumosus and its closes relatives. 4208 polymorphic markers were generated in the DArT array and from those, 742 markers presented a call rate >95% and zero discordance. DArT markers revealed a moderate genetic polymorphism among P. dumosus samples (13% of polymorphic loci), while P. coccineus presented the highest level of polymorphism (88% of polymorphic loci). At the cpDNA one ancestral haplotype was detected among all samples of all species in the secondary genepool. The ITS region of P. dumosus revealed high homogeneity and polymorphism bias to P. coccineus genome. Phylogenetic reconstructions made with Maximum likelihood and Bayesian methods confirmed previously reported discrepancies among the nuclear and chloroplast genomes of P. dumosus. The outline of relationships by hybridization networks displayed a considerable number of interactions within and between species. This research provides compelling evidence that P. dumosus arose from hybridisation between P. vulgaris and P. coccineus and confirms that P. costaricensis has likely been involved in the genesis or backcrossing events (or both) in the history of P. dumosus. The classification of the specie P. persistentus was analysed based on cpDNA and ITS sequences, the results found this species to be highly related to P. vulgaris but not too similar to P. leptostachyus as previously proposed. This research demonstrates that wild types of the secondary genepool carry a significant genetic variation which makes this a valuable genetic resource for common bean improvement. The DArT array generated in this research is a valuable resource for breeding programs since it has the potential to be used in several approaches including genotyping, discovery of novel traits, mapping and marker-trait associations. Efforts should be made to search for potential populations of P. persistentus and to increase the collection of new populations of P. dumosus, P. albescens and P. costaricensis that may provide valuable traits for introgression into common bean and other Phaseolus crops.