61 resultados para Contig
Resumo:
In this paper we describe the assembly and restriction map of a 1.05-Mb cosmid contig spanning the candidate region for familial Mediterranean fever (FMF), a recessively inherited disorder of inflammation localized to 16p13.3. Using a combination of cosmid walking and screening for P1, PAC, BAG, and YAC clones, we have generated a contig of genomic clones spanning similar to 1050 kb that contains the FMF critical region. The map consists of 179 cosmid, 15 P1, 10 PAC, 3 BAG, and 17 YAC clones, anchored by 27 STS markers. Eight additional STSs have been developed from the similar to 700 kb immediately centromeric to this genomic region. Five of the 35 STSs are microsatellites that have not been previously reported. NotI and EcoRI mapping of the overlapping cosmids, hybridization of restriction fragments from cosmids to one another, and STS analyses have been used to validate the assembly of the contig. Our contig totally subsumes the 250-kb interval recently reported, by founder haplotype analysis, to contain the FMF gene. Thus, our high-resolution clone map provides an ideal resource for transcriptional mapping toward the eventual identification of this disease gene. (C) 1997 Academic Press.
Resumo:
Nucleotide composition analyses of bacterial genomes such as cumulative GC skew highlight the atypical, strongly asymmetric architecture of the recently published chromosome of Idiomarina loihiensis L2TR, suggesting that an inversion of a 600-kb chromosomal segment occurred. The presence of 3.4-kb inverted repeated sequences at the borders of the putative rearrangement supports this hypothesis. Reverting in silico this segment restores (1) a symmetric chromosome architecture; (2) the co-orientation of transcription of all rRNA operons with DNA replication; and (3) a better conservation of gene order between this chromosome and other gamma-proteobacterial ones. Finally, long-range PCRs encompassing the ends of the 600-kb segment reveal the existence of the reverted configuration but not of the published one. This demonstrates how cumulative nucleotide-skew analyses can validate genome assemblies.
Resumo:
DNA assembly is among the most fundamental and difficult problems in bioinformatics. Near optimal assembly solutions are available for bacterial and small genomes, however assembling large and complex genomes especially the human genome using Next-Generation-Sequencing (NGS) technologies is shown to be very difficult because of the highly repetitive and complex nature of the human genome, short read lengths, uneven data coverage and tools that are not specifically built for human genomes. Moreover, many algorithms are not even scalable to human genome datasets containing hundreds of millions of short reads. The DNA assembly problem is usually divided into several subproblems including DNA data error detection and correction, contig creation, scaffolding and contigs orientation; each can be seen as a distinct research area. This thesis specifically focuses on creating contigs from the short reads and combining them with outputs from other tools in order to obtain better results. Three different assemblers including SOAPdenovo [Li09], Velvet [ZB08] and Meraculous [CHS+11] are selected for comparative purposes in this thesis. Obtained results show that this thesis’ work produces comparable results to other assemblers and combining our contigs to outputs from other tools, produces the best results outperforming all other investigated assemblers.
Resumo:
The bovine RPCI-42 BAC library was screened to construct a sequence-ready ~4 Mb single contig of 92 BAC clones on BTA 1q12. The contig covers the region between the genes KRTAP8P1 and CLIC6. This genomic segment in cattle is of special interest as it contains the dominant gene responsible for the hornless or polled phenotype in cattle. The construction of the BAC contig was initiated by screening the bovine BAC library with heterologous cDNA probes derived from 12 human genes of the syntenic region on HSA 21q22. Contig building was facilitated by BAC end sequencing and chromosome walking. During the construction of the contig, 165 BAC end sequences and 109 single-copy STS markers were generated. For comparative mapping of 25 HSA 21q22 genes, genomic PCR primers were designed from bovine EST sequences and the gene-associated STSs mapped on the contig. Furthermore, bovine BAC end sequence comparisons against the human genome sequence revealed significant matches to HSA 21q22 and allowed the in silico mapping of two new genes in cattle. In total, 31 orthologues of human genes located on HSA 21q22 were directly mapped within the bovine BAC contig, of which 16 genes have been cloned and mapped for the first time in cattle. In contrast to the existing comparative bovine-human RH maps of this region, these results provide a better alignment and reveal a completely conserved gene order in this 4 Mb segment between cattle, human and mouse. The mapping of known polled linked BTA 1q12 microsatellite markers allowed the integration of the physical contig map with existing linkage maps of this region and also determined the exact order of these markers for the first time. Our physical map and transcript map may be useful for positional cloning of the putative polled gene in cattle.
Resumo:
A high-resolution physical and genetic map of a major fruit weight quantitative trait locus (QTL), fw2.2, has been constructed for a region of tomato chromosome 2. Using an F2 nearly isogenic line mapping population (3472 individuals) derived from Lycopersicon esculentum (domesticated tomato) × Lycopersicon pennellii (wild tomato), fw2.2 has been placed near TG91 and TG167, which have an interval distance of 0.13 ± 0.03 centimorgan. The physical distance between TG91 and TG167 was estimated to be ≤ 150 kb by pulsed-field gel electrophoresis of tomato DNA. A physical contig composed of six yeast artificial chromosomes (YACs) and encompassing fw2.2 was isolated. No rearrangements or chimerisms were detected within the YAC contig based on restriction fragment length polymorphism analysis using YAC-end sequences and anchored molecular markers from the high-resolution map. Based on genetic recombination events, fw2.2 could be narrowed down to a region less than 150 kb between molecular markers TG91 and HSF24 and included within two YACs: YAC264 (210 kb) and YAC355 (300 kb). This marks the first time, to our knowledge, that a QTL has been mapped with such precision and delimited to a segment of cloned DNA. The fact that the phenotypic effect of the fw2.2 QTL can be mapped to a small interval suggests that the action of this QTL is likely due to a single gene. The development of the high-resolution genetic map, in combination with the physical YAC contig, suggests that the gene responsible for this QTL and other QTLs in plants can be isolated using a positional cloning strategy. The cloning of fw2.2 will likely lead to a better understanding of the molecular biology of fruit development and to the genetic engineering of fruit size characteristics.
Resumo:
For many agronomically important plant genes, only their position on a genetic map is known. In the absence of an efficient transposon tagging system, such genes have to be isolated by map-based cloning. In bread wheat Triticum aestivum, the genome is hexaploid, has a size of 1.6 × 1010 bp, and contains more than 80% of repetitive sequences. So far, this genome complexity has not allowed chromosome walking and positional cloning. Here, we demonstrate that chromosome walking using bacterial artificial chromosome (BAC) clones is possible in the diploid wheat Triticum monococcum (Am genome). BAC end sequences were mostly repetitive and could not be used for the first walking step. New probes corresponding to rare low-copy sequences were efficiently identified by low-pass DNA sequencing of the BACs. Two walking steps resulted in a physical contig of 450 kb on chromosome 1AmS. Genetic mapping of the probes derived from the BAC contig demonstrated perfect colinearity between the physical map of T. monococcum and the genetic map of bread wheat on chromosome 1AS. The contig genetically spans the Lr10 leaf rust disease resistance locus in bread wheat, with 0.13 centimorgans corresponding to 300 kb between the closest flanking markers. Comparison of the genetic to physical distances has shown large variations within 350 kb of the contig. The physical contig can now be used for the isolation of the orthologous regions in bread wheat. Thus, subgenome chromosome walking in wheat can produce large physical contigs and saturate genomic regions to support positional cloning.
Resumo:
We have constructed a physical map of human chromosome 22q using bacterial artificial chromosome (BAC) clones. The map consists of 613 chromosome 22-specific BAC clones that have been localized and assembled into contigs using 452 landmarks, 346 of which were previously ordered and mapped to specific regions of the q arm of the chromosome by means of chromosome 22-specific yeast artificial chromosome clones. The BAC-based map provides immediate access to clones that are stable and convenient for direct genome analysis. The approach to rapidly developing marker-specific BAC contigs is relatively straightforward and can be extended to generate scaffold BAC contig maps of the rest of the chromosomes. These contigs will provide substrates for sequencing the entire human genome. We discuss how to efficiently close contig gaps using the end sequences of BAC clone inserts.
Resumo:
Many human malignant cells lack methylthioadenosine phosphorylase (MTAP) enzyme activity. The gene (MTAP) encoding this enzyme was previously mapped to the short arm of chromosome 9, band p21-22, a region that is frequently deleted in multiple tumor types. To clone candidate tumor suppressor genes from the deleted region on 9p21-22, we have constructed a long-range physical map of 2.8 megabases for 9p21 by using overlapping yeast artificial chromosome and cosmid clones. This map includes the type IIFN gene cluster, the recently identified candidate tumor suppressor genes CDKN2 (p16INK4A) and CDKN2B (p15INK4B), and several CpG islands. In addition, we have identified other transcription units within the yeast artificial chromosome contig. Sequence analysis of a 2.5-kb cDNA clone isolated from a CpG island that maps between the IFN genes and CDKN2 reveals a predicted open reading frame of 283 amino acids followed by 1302 nucleotides of 3' untranslated sequence. This gene is evolutionarily conserved and shows significant amino acid homologies to mouse and human purine nucleoside phosphorylases and to a hypothetical 25.8-kDa protein in the pet gene (coding for cytochrome bc1 complex) region of Rhodospirillum rubrum. The location, expression pattern, and nucleotide sequence of this gene suggest that it codes for the MTAP enzyme.
Resumo:
It has been proposed that common aphidicolin-inducible fragile sites, in general, predispose to specific chromosomal breakage associated with deletion, amplification, and/or translocation in certain forms of cancer. Although this appears to be the case for the fragile site FRA3B and may be the case for FRA7G, it is not Set clear whether this association is a general property of this class of fragile site. The major aim of the present study was to determine whether the FRA16D chromosomal fragile site locus has a role to play in predisposing DNA sequences within and adjacent to the fragile site to DNA instability (such as deletion or translocation), which could lead to or be associated with neoplasia. We report the localization of FRA16D within a contig of cloned DNA and demonstrate that this fragile site coincides with a region of homozygous deletion in a gastric adenocarcinoma cell line and is bracketed by translocation breakpoints in multiple myeloma, as reported previously (Chesi, M., et al., Blood, 91: 4457-4463, 1998), Therefore, given similar findings at the FRA3B and FRA7G fragile sites, it is likely that common aphidicolin-inducible fragile sites exhibit the general property of localized DNA instability in cancer cells.
Resumo:
Familial Mediterranean fever (FMF) is a recessive disorder of inflammation caused by mutations in a gene (designated MEFV) on chromosome 16p13.3, We have recently constructed a 1-Mb cosmid contig that includes the FMF critical region. Here we show genotype data for 12 markers from our physical map, including 5 newly identified microsatellites, in FMF families. Intrafamilial recombinations placed MEFV in the similar to 285 kb between D16S468/D16S3070 and D16S3376. We observed significant linkage disequilibrium in the North African Jewish population, and historical recombinants in the founder haplotype placed MEFV between D16S3082 and D16S3373 (similar to 200 kb). In smaller panels of Iraqi Jewish, Arab, and Armenian families, there were significant allelic associations only for D16S3370 and D16S2617 among the Armenians. A sizable minority of Iraqi Jewish and Armenian carrier chromosomes appeared to be derived from the North African Jewish ancestral haplotype. We observed a unique FMF haplotype common to Iraqi Jews, Arabs, and Armenians and two other haplotypes restricted to either the Iraqi Jewish or the Armenian population. These data support the view that a few major mutations account for a large percentage of the cases of FMF and suggest that same of these mutations arose before the affected Middle Eastern populations diverged from one another. (C) 1997 Academic Press.
Resumo:
A cross between two different races (race 7 x race 25) of the soybean root and stem rot pathogen Phytophthora sojae was analyzed to characterize the genomic region flanking two cosegregating avirulence genes, Anur4 and Anur6. Both genes cosegregated in the ratio of 82:17 (avirulent:virulent) in an F-2 population, suggestive of a single locus controlling both phenotypes. A chromosome walk was commenced from RAPD marker OPE7.1C, 2.0 cM distant from the Anur4/6 locus. Three overlapping cosmids were isolated which included genetic markers that flank the Anur4/6 locus. The chromosome walk spanned a physical distance of 67 kb which represented a genetic map distance of 22.3cM, an average recombination frequency of 3.0kb/cM and 11.7-fold greater than the predicted average recombination frequency of 35.3 kb/cM for the entire P. sojae genome. Six genes (cDNA clones) expressed from the Anur4/6 genomic region encompassed by the cosmid contig were identified. Single nucleotide polymorphisms and restriction fragment length polymorphisms showed these six genes were closely linked to the Anur4/6 locus. Physical mapping of the cDNA clones within the cosmid contig made it possible to deduce the precise linkage order of the cDNAs. None of the six cDNA clones appear to be candidates for Anur4/6. We conclude that two of these cDNA clones flank a physical region of approximately 24 kb and 4.3 cM that appears to include the Anur4/6 locus. (C) 2003 Elsevier Inc. All rights reserved.
Resumo:
The last Crypto-Jews (Marranos) are the survivors of Spanish Jews who were persecuted in the late fifteenth century, escaped to Portugal and were forced to convert to save their lives. Isolated groups still exist in mountainous areas such as Belmonte in the Beira-Baixa province of Portugal. We report here the genetic study of a highly consanguineous endogamic population of Crypto-Jews of Belmonte affected with autosomal recessive retinitis pigmentosa (RP). A genome-wide search for homozygosity allowed us to localize the disease gene to chromosome 15q22-q24 (Zmax=2.95 at θ=0 at the D15S131 locus). Interestingly, the photoreceptor cell-specific nuclear receptor (PNR) gene, the expression of which is restricted to the outer nuclear layer of retinal photoreceptor cells, was found to map to the YAC contig encompassing the disease locus. A search for mutations allowed us to ascribe the RP of Crypto-Jews of Belmonte to a homozygous missense mutation in the PNR gene. Preliminary haplotype studies support the view that this mutation is relatively ancient but probably occurred after the population settled in Belmonte.
Resumo:
Ape chromosomes homologous to human chromosomes 14 and 15 were generated by a fission event of an ancestral submetacentric chromosome, where the two chromosomes were joined head-to-tail. The hominoid ancestral chromosome most closely resembles the macaque chromosome 7. In this work, we provide insights into the evolution of human chromosomes 14 and 15, performing a comparative study between macaque boundary region 14/15 and the orthologous human regions. We construct a 1.6-Mb contig of macaque BAC clones in the region orthologous to the ancestral hominoid fission site and use it to define the structural changes that occurred on human 14q pericentromeric and 15q subtelomeric regions. We characterize the novel euchromatin-heterochromatin transition region (∼20 Mb) acquired during the neocentromere establishment on chromosome 14, and find it was mainly derived through pericentromeric duplications from ancestral hominoid chromosomes homologous to human 2q14-qter and 10. Further, we show a relationship between evolutionary hotspots and low-copy repeat loci for chromosome 15, revealing a possible role of segmental duplications not only in mediating but also in "stitching" together rearrangement breakpoints.
Resumo:
Recent technological progress has greatly facilitated de novo genome sequencing. However, de novo assemblies consist in many pieces of contiguous sequence (contigs) arranged in thousands of scaffolds instead of small numbers of chromosomes. Confirming and improving the quality of such assemblies is critical for subsequent analysis. We present a method to evaluate genome scaffolding by aligning independently obtained transcriptome sequences to the genome and visually summarizing the alignments using the Cytoscape software. Applying this method to the genome of the red fire ant Solenopsis invicta allowed us to identify inconsistencies in 7%, confirm contig order in 20% and extend 16% of scaffolds.Scripts that generate tables for visualization in Cytoscape from FASTA sequence and scaffolding information files are publicly available at https://github.com/ksanao/TGNet.
Resumo:
The turbot (Scophthalmus maximus) is a commercially valuable flatfish and one of the most promising aquaculture species in Europe. Two transcriptome 454-pyrosequencing runs were used in order to detect Single Nucleotide Polymorphisms (SNPs) in genesrelated to immune response and gonad differentiation. A total of 866 true SNPs were detected in 140 different contigs representing 262,093 bp as a whole. Only one true SNP was analyzed in each contig. One hundred and thirteen SNPs out of the 140 analyzed were feasible (genotyped), while Ш were polymorphic in a wild population. Transition/transversion ratio (1.354) was similar to that observed in other fish studies. Unbiased gene diversity (He) estimates ranged from 0.060 to 0.510 (mean = 0.351), minimum allele frequency (MAF) from 0.030 to 0.500 (mean = 0.259) and all loci were in Hardy-Weinberg equilibrium after Bonferroni correction. A large number of SNPs (49) were located in the coding region, 33 representing synonymous and 16 non-synonymous changes. Most SNP-containing genes were related to immune response and gonad differentiation processes, and could be candidates for functional changes leading to phenotypic changes. These markers will be useful for population screening to look for adaptive variation in wild and domestic turbot