988 results for Sequence Assembly


Relevance:

80.00%

Publisher:

Abstract:

BACKGROUND: Several approaches can be used to determine the order of loci on chromosomes and hence develop maps of the genome. However, all mapping approaches are prone to errors arising either from technical deficiencies or from a lack of statistical support to distinguish between alternative orders of loci. The accuracy of the genome maps could be improved, in principle, if information from different sources were combined to produce integrated maps. The publicly available bovine genomic sequence assembly with 6x coverage (Btau_2.0) is based on whole genome shotgun sequence data and limited mapping data; however, it is recognised that this assembly is a draft that contains errors. Correcting the sequence assembly requires extensive additional mapping information to improve the reliability of the ordering of sequence scaffolds on chromosomes. The radiation hybrid (RH) map described here has been contributed to the international sequencing project to aid this process. RESULTS: An RH map for the 30 bovine chromosomes is presented. The map was built using the Roslin 3000-rad RH panel (BovGen RH map) and contains 3966 markers including 2473 new loci in addition to 262 amplified fragment-length polymorphisms (AFLP) and 1231 markers previously published with the first generation RH map. Sequences of the mapped loci were aligned with published bovine genome maps to identify inconsistencies. In addition to differences in the order of loci, several cases were observed where the chromosomal assignment of loci differed between maps. All the chromosome maps were aligned with the current 6x bovine assembly (Btau_2.0) and 2898 loci were unambiguously located in the bovine sequence. The order of loci on the RH map for BTA 5, 7, 16, 22, 25 and 29 differed substantially from the assembled bovine sequence. Of the 2898 loci unambiguously identified in the bovine sequence assembly, 131 mapped to different chromosomes in the BovGen RH map.
CONCLUSION: Alignment of the BovGen RH map with other published RH and genetic maps showed higher consistency in marker order and chromosome assignment than with the current 6x sequence assembly. This suggests that the bovine sequence assembly could be significantly improved by incorporating additional independent mapping information.
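
The comparison described in this abstract (loci assigned to different chromosomes, and loci whose order differs between the RH map and the sequence assembly) can be illustrated with a small sketch. This is not the authors' procedure: the function name `compare_maps`, the marker names, and the positions are all hypothetical, and pairwise order agreement is only one simple way to quantify order consistency.

```python
from itertools import combinations


def compare_maps(rh_map, assembly):
    """Compare marker placement between two maps.

    Each map is a dict: marker name -> (chromosome, position).
    Returns the markers placed on different chromosomes and, per
    chromosome, the fraction of marker pairs whose relative order agrees.
    """
    shared = sorted(set(rh_map) & set(assembly))
    reassigned = [m for m in shared if rh_map[m][0] != assembly[m][0]]

    # Group the consistently assigned markers by chromosome.
    by_chrom = {}
    for m in shared:
        if m not in reassigned:
            by_chrom.setdefault(rh_map[m][0], []).append(m)

    concordance = {}
    for chrom, markers in by_chrom.items():
        pairs = list(combinations(markers, 2))
        if not pairs:
            continue
        # A pair agrees when both maps place the two markers in the same order.
        agree = sum(
            (rh_map[a][1] - rh_map[b][1]) * (assembly[a][1] - assembly[b][1]) > 0
            for a, b in pairs
        )
        concordance[chrom] = agree / len(pairs)
    return reassigned, concordance


# Hypothetical toy data: RH positions in cR, assembly positions in bp.
rh_map = {"M1": ("5", 10.0), "M2": ("5", 25.0), "M3": ("5", 40.0), "M4": ("7", 12.0)}
assembly = {"M1": ("5", 1000), "M2": ("5", 5000), "M3": ("5", 3000), "M4": ("9", 800)}
reassigned, concordance = compare_maps(rh_map, assembly)
```

In the toy data, one marker sits on a different chromosome in the two maps, and one of the three marker pairs on chromosome 5 is in reversed order.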

Relevance:

70.00%

Publisher:

Abstract:

With the advent of cheaper and faster DNA sequencing technologies, assembly methods have greatly changed. Instead of outputting reads that are thousands of base pairs long, new sequencers parallelize the task by producing read lengths between 35 and 400 base pairs. Reconstructing an organism’s genome from these millions of reads is a computationally expensive task. Our algorithm solves this problem by organizing and indexing the reads using n-grams, which are short, fixed-length DNA sequences of length n. These n-grams are used to efficiently locate putative read joins, thereby eliminating the need to perform an exhaustive search over all possible read pairs. Our goal was to develop a novel n-gram method for the assembly of genomes from next-generation sequencers. Specifically, a probabilistic, iterative approach was utilized to determine the most likely reads to join through development of a new metric that models the probability of any two arbitrary reads being joined together. Tests were run using simulated short read data based on randomly created genomes ranging in length from 10,000 to 100,000 nucleotides with 16 to 20x coverage. We were able to successfully re-assemble entire genomes up to 100,000 nucleotides in length.
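
The idea of using an n-gram index to locate putative read joins without comparing every pair of reads can be sketched as follows. This is a simplified illustration, not the algorithm from the abstract: it accepts only exact suffix-prefix overlaps and omits the probabilistic join metric and the iterative procedure. The function names and the toy reads are assumptions made for the example.

```python
from collections import defaultdict


def build_ngram_index(reads, n):
    """Map each n-gram to the (read id, offset) positions where it occurs."""
    index = defaultdict(list)
    for rid, read in enumerate(reads):
        for i in range(len(read) - n + 1):
            index[read[i:i + n]].append((rid, i))
    return index


def candidate_joins(reads, n):
    """Return (a, b, overlap) where a suffix of read a equals a prefix of read b.

    Only reads that share the leading n-gram of read b are examined, so no
    exhaustive search over all read pairs is needed.
    """
    index = build_ngram_index(reads, n)
    joins = []
    for b, read_b in enumerate(reads):
        for a, offset in index.get(read_b[:n], []):
            if a == b:
                continue
            overlap = len(reads[a]) - offset
            # Confirm that the whole suffix of read a matches the prefix of read b.
            if reads[a][offset:] == read_b[:overlap]:
                joins.append((a, b, overlap))
    return joins


# Toy reads drawn from the made-up sequence ACGTTGCATGGA.
reads = ["ACGTTGCA", "TTGCATGG", "GCATGGA"]
joins = candidate_joins(reads, 4)
```

Here the index lookup proposes two joins: read 0 followed by read 1 with a 5-base overlap, and read 1 followed by read 2 with a 6-base overlap.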

Relevance:

60.00%

Publisher:

Abstract:

The genomic plasticity of the human chromosome 8p23.1 region is highly influenced by two groups of complex segmental duplications (SDs), termed REPD and REPP, that mediate different kinds of rearrangements. Part of the difficulty in explaining the wide range of phenotypes associated with 8p23.1 rearrangements is that REPP and REPD are not yet well characterized, probably due to their polymorphic status. Here, we describe a novel primate-specific gene family, named FAM90A (family with sequence similarity 90), found within these SDs. According to the current human reference sequence assembly, the FAM90A family includes 24 members along the 8p23.1 region plus a single member on chromosome 12p13.31, showing copy number variation (CNV) between individuals. These genes can be classified into subfamilies I and II, which differ in their upstream and 5′-untranslated region sequences, but both share the same open reading frame and are ubiquitously expressed. Sequence analysis and comparative fluorescence in situ hybridization studies showed that FAM90A subfamily II underwent a large expansion in the hominoid lineage, whereas subfamily I members were likely generated sometime around the divergence of orangutan and African great apes by a fusion process. In addition, the analysis of the Ka/Ks ratios provides evidence of functional constraint of some FAM90A genes in all species. The characterization of the FAM90A gene family contributes to a better understanding of the structural polymorphism of the human 8p23.1 region and constitutes a good example of how SDs, CNVs and rearrangements within themselves can promote the formation of new gene sequences with potential functional consequences.

Relevance:

60.00%

Publisher:

Abstract:

Background: A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results: Of the 7,867 Malus SNP markers on the array, 1,823 (23.2 %) were heterozygous in one of the two parents of the progeny, 1,007 (12.8 %) were heterozygous in both parental genotypes, whilst just 2.8 % of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7 % of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or misassignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions: We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites.
The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
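
The percentages and the final marker density reported in this abstract can be re-derived from the stated counts. The short check below uses only numbers given in the abstract; the variable names are illustrative. It suggests that the 13.7 % figure corresponds to 311 of the 2,272 mapped SNP markers.

```python
# Counts taken from the abstract.
total_array_snps = 7867
het_one_parent = 1823
het_both_parents = 1007
map_length_cm = 1282.2
mapped_snps = 2272
mapped_ssrs = 306
s_locus = 1
conflicting = 311


def pct(part, whole):
    """Percentage of part in whole, rounded to one decimal place."""
    return round(100 * part / whole, 1)


pct_one_parent = pct(het_one_parent, total_array_snps)      # reported as 23.2 %
pct_both_parents = pct(het_both_parents, total_array_snps)  # reported as 12.8 %
pct_conflicting = pct(conflicting, mapped_snps)             # reported as 13.7 %

# Final marker density: map length divided by all mapped markers.
total_markers = mapped_snps + mapped_ssrs + s_locus
density = round(map_length_cm / total_markers, 1)           # reported as 0.5 cM/marker
```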

Relevance:

60.00%

Publisher:

Abstract:

The imprints of domestication and breed development on the genomes of livestock likely differ from those of companion animals. A deep draft sequence assembly of shotgun reads from a single Hereford female and comparative sequences sampled from six additional breeds were used to develop probes to interrogate 37,470 single-nucleotide polymorphisms (SNPs) in 497 cattle from 19 geographically and biologically diverse breeds. These data show that cattle have undergone a rapid recent decrease in effective population size from a very large ancestral population, possibly due to bottlenecks associated with domestication, selection, and breed formation. Domestication and artificial selection appear to have left detectable signatures of selection within the cattle genome, yet the current levels of diversity within breeds are at least as great as exists within humans.

Relevance:

60.00%

Publisher:

Abstract:

We report the first radiation hybrid map of the river buffalo X chromosome generated from a recently constructed river buffalo (Bubalus bubalis) whole-genome radiation hybrid panel (BBURH5000). This map contains a total of 33 cattle-derived markers, including 10 genes, four ESTs and 19 microsatellites. The markers are distributed in two linkage groups: LG1 contains eight markers spanning 125.6 cR, and LG2 contains 25 markers spanning 366.3 cR. LG1 contains six markers in common with bovine sequence assembly BUILD 3.1. With the exception of BMS2152, the order of these markers on our BBUX map is shuffled when compared to the cow X chromosome (Bos taurus; BTAX). From LG2, two markers (AMELX and BL22) map to a more distal portion of BTAX compared to BBUX. In addition, two pairs of LG2 markers exhibit inversions compared to BTAX (ILSTS017 and ATRX; XBM38 and PPEF1). Alternatively, when compared to the most recent bovine RH map (Bov-Gen 3000rads), BL1098 and BMS2227 from LG1 as well as PLS3 and BMS1820 from LG2 showed inverted positions on the BBUX map. These discrepancies in buffalo and cattle maps may reflect evolutionary divergence of the chromosomes or mapping errors in one of the two species. Although the set of mapped markers does not cover the entire X chromosome, this map is a starting point for the construction of a high-resolution map, which is necessary for characterization of small rearrangements that might have occurred between the Bubalus bubalis and Bos taurus X chromosomes.

Relevance:

60.00%

Publisher:

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance:

60.00%

Publisher:

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance:

60.00%

Publisher:

Abstract:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevance:

60.00%

Publisher:

Abstract:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevance:

60.00%

Publisher:

Abstract:

This work presents a fast, simple, standardized and planned method for building family homes, intended to help reduce the housing deficit and, at the same time, to serve families who wish to invest in building their own houses now that our country is experiencing economic prosperity. The method is widely used today because it can be deployed on a large scale and in a standardized pattern, which significantly reduces construction time. Concrete walls cast in place depend on specific processes. For this reason, a well-trained crew is required so that the construction process can be carried out at a fast pace, with better results than other construction methods. This project addresses all the materials used to build cast-in-place concrete walls, their assembly sequence, application, types, characteristics and performance. Finally, a method used by construction companies for placing the concrete is also presented. Although there is no specific Brazilian standard for this type of construction, a study on this topic will soon be released. Therefore, the studies for this project were based on the Brazilian standards in force at the time, on construction magazines, and on books about the more common building methods.

Relevance:

60.00%

Publisher:

Abstract:

BACKGROUND: The neuronal ceroid lipofuscinoses (NCL) are a heterogeneous group of inherited progressive neurodegenerative diseases in different mammalian species. Tibetan Terrier and Polish Owczarek Nizinny (PON) dogs show rare late-onset NCL variants with autosomal recessive inheritance, which cannot be explained by mutations of known human NCL genes. These dog breeds represent animal models for human late-onset NCL. In mice, the chloride channel 3 gene (Clcn3), which encodes an intracellular chloride channel, has been described as causing a phenotype similar to NCL. RESULTS: Two full-length cDNA splice variants of the canine CLCN3 gene are reported. The current canine whole genome sequence assembly was used for gene structure analyses and revealed 13 coding CLCN3 exons in 52 kb of genomic sequence. Sequence analysis of the coding exons and flanking intron regions of CLCN3 using six NCL-affected Tibetan Terrier dogs and an NCL-affected Polish Owczarek Nizinny (PON) dog, as well as eight healthy Tibetan Terrier dogs, revealed 13 SNPs. No consistent CLCN3 haplotype was associated with NCL. CONCLUSION: For the examined animals, we excluded the complete coding region and adjacent intronic regions of canine CLCN3 as harboring disease-causing mutations. It therefore seems unlikely that a mutation in this gene is responsible for the late-onset NCL phenotype in these two dog breeds.

Relevance:

60.00%

Publisher:

Abstract:

The development of a completely annotated sheep genome sequence is a key need for understanding the phylogenetic relationships and genetic diversity among the many different sheep breeds worldwide and for identifying genes controlling economically and physiologically important traits. The ovine genome sequence assembly will be crucial for developing optimized breeding programs based on highly productive, healthy sheep phenotypes that are adapted to modern breeding and production conditions. Scientists and breeders around the globe have been contributing to this goal by generating genomic and cDNA libraries, performing genome-wide and trait-associated analyses of polymorphism, expression analysis, genome sequencing, and by developing virtual and physical comparative maps. The International Sheep Genomics Consortium (ISGC), an informal network of sheep genomics researchers, is playing a major role in coordinating many of these activities. In addition to serving as an essential tool for monitoring chromosome abnormalities in specific sheep populations, ovine molecular cytogenetics provides physical anchors which link and order genome regions, such as sequence contigs, genes and polymorphic DNA markers to ovine chromosomes. Likewise, molecular cytogenetics can contribute to the process of defining evolutionary breakpoints between related species. The selective expansion of the sheep cytogenetic map, using loci to connect maps and identify chromosome bands, can substantially contribute to improving the quality of the annotated sheep genome sequence and will also accelerate its assembly. Furthermore, identifying major morphological chromosome anomalies and micro-rearrangements, such as gene duplications or deletions, that might occur between different sheep breeds and other Ovis species will also be important to understand the diversity of sheep chromosome structure and its implications for cross-breeding. 
To date, 566 loci have been assigned to specific chromosome regions in sheep and the new cytogenetic map is presented as part of this review. This review will also summarize the current cytogenomic status of the sheep genome, describe current activities in the sheep cytogenomics research sector, and will discuss the cytogenomics data in context with other major sheep genomics projects.

Relevance:

40.00%

Publisher:

Abstract:

Programa Doutoral em Líderes para as Indústrias Tecnológicas

Relevance:

40.00%

Publisher:

Abstract:

Monomer-sequence information in synthetic copolyimides can be recognised by tweezer-type molecules binding to adjacent triplet-sequences on the polymer chains. In the present paper different tweezer-molecules are found to have different sequence-selectivities, as demonstrated in solution by 1H NMR spectroscopy and in the solid state by single crystal X-ray analyses of tweezer-complexes with linear and macrocyclic oligo-imides. This work provides clear-cut confirmation of polyimide chain-folding and adjacent-tweezer-binding. It also reveals a new and entirely unexpected mechanism for sequence-recognition which, by analogy with a related process in biomolecular information processing, may be termed "frameshift-reading". The ability of one particular tweezer-molecule to detect, with exceptionally high sensitivity, long-range sequence-information in chain-folding aromatic copolyimides, is readily explained by this novel process.