41 results for Whole genome mapping


Relevance:

80.00%

Abstract:

Clusters of orthologous groups [COGs; Tatusov, R. L., Koonin, E. V. & Lipman, D. J. (1997) Science 278, 631–637] were identified for a set of 13 completely sequenced herpesviruses. Each COG represented a family of gene products conserved across several herpes genomes. These families were defined without using an arbitrary threshold criterion based on sequence similarity. The COG technique was modified so that variable stringency in COG construction was possible. High stringencies identify a core set of highly conserved genes. Varying COG stringency reveals differences in the degree of conservation between functional classes of genes. The COG data were used to construct whole-genome phylogenetic trees based on gene content. These trees agree well with trees based on other methods and are robust when tested by bootstrap analysis. The COG data also were used to construct a reciprocal tree that clustered genes with similar phylogenetic profiles. This clustering may give clues to genes with related functions or with related histories of acquisition and loss during herpesvirus evolution.
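The gene-content comparison described in this abstract can be illustrated with a small sketch. It assumes each genome is reduced to the set of COGs it contains and uses Jaccard distance as the dissimilarity measure; the abstract does not say which distance the authors used, and the genome names and COG memberships below are invented for illustration only. A matrix like this could then be passed to any standard tree-building method.

```python
from itertools import combinations

# Hypothetical presence/absence data: genome -> set of COG identifiers.
# Names and memberships are made up; they are not from the paper.
gene_content = {
    "virus_A": {"cog1", "cog2", "cog3", "cog4"},
    "virus_B": {"cog1", "cog2", "cog3"},
    "virus_C": {"cog1", "cog5", "cog6"},
}


def jaccard_distance(a, b):
    """Return 1 - |A & B| / |A | B|; 0 means identical gene content."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def distance_matrix(content):
    """Return {(g1, g2): distance} for every unordered pair of genomes."""
    return {
        (g1, g2): jaccard_distance(content[g1], content[g2])
        for g1, g2 in combinations(sorted(content), 2)
    }


def core_cogs(content):
    """COGs present in every genome (the most stringent 'core set')."""
    return set.intersection(*content.values())


distances = distance_matrix(gene_content)
for pair, d in distances.items():
    print(pair, round(d, 3))
print("core:", sorted(core_cogs(gene_content)))
```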

Relevance:

80.00%

Abstract:

The molecular identity and function of the Drosophila melanogaster Y-linked fertility factors have long eluded researchers. Although the D. melanogaster genome sequence was recently completed, the fertility factors still were not identified, in part because of low cloning efficiency of heterochromatic Y sequences. Here we report a method for iterative BLAST searching to assemble heterochromatic genes from shotgun assemblies, and we successfully identify kl-2 and kl-3 as 1β- and γ-dynein heavy chains, respectively. Our conclusions are supported by formal genetics with X-Y translocation lines. Reverse transcription–PCR was successful in linking together unmapped sequence fragments from the whole-genome shotgun assembly, although some sequences were missing altogether from the shotgun effort and had to be generated de novo. We also found a previously undescribed Y gene, polycystine-related (PRY). The closest paralogs of kl-2, kl-3, and PRY (and also of kl-5) are autosomal and not X-linked, suggesting that the evolution of the Drosophila Y chromosome has been driven by an accumulation of male-related genes arising de novo from the autosomes.

Relevance:

80.00%

Abstract:

One challenge presented by large-scale genome sequencing efforts is effective display of uniform information to the scientific community. The Comprehensive Microbial Resource (CMR) contains robust annotation of all complete microbial genomes and allows for a wide variety of data retrievals. The bacterial information has been placed on the Web at http://www.tigr.org/CMR for retrieval using standard web browsing technology. Retrievals can be based on protein properties such as molecular weight or hydrophobicity, GC-content, functional role assignments and taxonomy. The CMR also has special web-based tools to allow data mining using pre-run homology searches, whole genome dot-plots, batch downloading and traversal across genomes using a variety of datatypes.

Relevance:

80.00%

Abstract:

We present here the complete genome sequence of a common avian clone of Pasteurella multocida, Pm70. The genome of Pm70 is a single circular chromosome 2,257,487 base pairs in length and contains 2,014 predicted coding regions, 6 ribosomal RNA operons, and 57 tRNAs. Genome-scale evolutionary analyses based on pairwise comparisons of 1,197 orthologous sequences between P. multocida, Haemophilus influenzae, and Escherichia coli suggest that P. multocida and H. influenzae diverged ≈270 million years ago and the γ subdivision of the proteobacteria radiated about 680 million years ago. Two previously undescribed open reading frames, accounting for ≈1% of the genome, encode large proteins with homology to the virulence-associated filamentous hemagglutinin of Bordetella pertussis. Consistent with the critical role of iron in the survival of many microbial pathogens, in silico and whole-genome microarray analyses identified more than 50 Pm70 genes with a potential role in iron acquisition and metabolism. Overall, the complete genomic sequence and preliminary functional analyses provide a foundation for future research into the mechanisms of pathogenesis and host specificity of this important multispecies pathogen.

Relevance:

80.00%

Abstract:

For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that insert between genes. These retroelements are less abundant in smaller genome plants, including rice and sorghum. Although 5- to 200-kb blocks of methylated, presumably heterochromatic, retrotransposons flank most maize genes, rice and sorghum genes are often adjacent. Similar genes are commonly found in the same relative chromosomal locations and orientations in each of these three species, although there are numerous exceptions to this collinearity (i.e., rearrangements) that can be detected at the levels of both the recombinational map and cloned DNA. Evolutionarily conserved sequences are largely confined to genes and their regulatory elements. Our results indicate that a knowledge of grass genome structure will be a useful tool for gene discovery and isolation, but the general rules and biological significance of grass genome organization remain to be determined. Moreover, the nature and frequency of exceptions to the general patterns of grass genome structure and collinearity are still largely unknown and will require extensive further investigation.

Relevance:

80.00%

Abstract:

Unlike many pathogens that are overtly toxic to their hosts, the primary virulence determinant of Mycobacterium tuberculosis appears to be its ability to persist for years or decades within humans in a clinically latent state. Since early in the 20th century latency has been linked to hypoxic conditions within the host, but the response of M. tuberculosis to a hypoxic signal remains poorly characterized. The M. tuberculosis α-crystallin (acr) gene is powerfully and rapidly induced at reduced oxygen tensions, providing us with a means to identify regulators of the hypoxic response. Using a whole genome microarray, we identified >100 genes whose expression is rapidly altered by defined hypoxic conditions. Numerous genes involved in biosynthesis and aerobic metabolism are repressed, whereas a high proportion of the induced genes have no known function. Among the induced genes is an apparent operon that includes the putative two-component response regulator pair Rv3133c/Rv3132c. When we interrupted expression of this operon by targeted disruption of the upstream gene Rv3134c, the hypoxic regulation of acr was eliminated. These results suggest a possible role for Rv3132c/3133c/3134c in mycobacterial latency.

Relevance:

80.00%

Abstract:

Caenorhabditis elegans is an ideal organism for the study of the molecular basis of fundamental biological processes such as germ-line development, especially because of availability of the whole genome sequence and applicability of the RNA interference (RNAi) technique. To identify genes involved in germ-line development, we produced subtracted cDNA pools either enriched for or deprived of the cDNAs from germ-line tissues. We then performed differential hybridization on the high-density cDNA grid, on which about 7,600 nonoverlapping expressed sequence tag (EST) clones were spotted, to identify a set of genes specifically expressed in the germ line. One hundred and sixty-eight clones were then tested with the RNAi technique. Of these, 15 clones showed sterility with a variety of defects in germ-line development. Seven of them led to the production of unfertilized eggs, because of defects in spermatogenesis (4 clones), or defects in the oocytes (3 clones). The other 8 clones led to failure of oogenesis. These failures were caused by germ-line proliferation defect (Glp phenotype), meiotic arrest, and defects in sperm–oocyte switch (Mog phenotype) among others. These results demonstrate the efficacy of the screening strategy using the EST library combined with the RNAi technique in C. elegans.

Relevance:

80.00%

Abstract:

The whole genome sequence (1.83 Mbp) of Haemophilus influenzae strain Rd was searched to identify tandem oligonucleotide repeat sequences. Loss or gain of one or more nucleotide repeats through a recombination-independent slippage mechanism is known to mediate phase variation of surface molecules of pathogenic bacteria, including H. influenzae. This facilitates evasion of host defenses and adaptation to the varying microenvironments of the host. We reasoned that iterative nucleotides could identify novel genes relevant to microbe-host interactions. Our search of the Rd genome sequence identified 9 novel loci with multiple (range 6-36, mean 22) tandem tetranucleotide repeats. All were found to be located within putative open reading frames and included homologues of hemoglobin-binding proteins of Neisseria, a glycosyltransferase (lgtC gene product) of Neisseria, and an adhesin of Yersinia. These tetranucleotide repeat sequences were also shown to be present in two other epidemiologically different H. influenzae type b strains, although the number and distribution of repeats was different. Further characterization of the lgtC gene showed that it was involved in phenotypic switching of a lipopolysaccharide epitope and that this variable expression was associated with changes in the number of tetranucleotide repeats. Mutation of lgtC resulted in attenuated virulence of H. influenzae in an infant rat model of invasive infection. These data indicate the rapidity, economy, and completeness with which whole genome sequences can be used to investigate the biology of pathogenic bacteria.
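A search for tandem tetranucleotide repeats of the kind described here can be sketched with a regular-expression backreference. This is only an illustration, not the authors' search procedure: the function name, the default minimum of 6 copies (taken from the lower end of the reported 6-36 range), and the test sequence are all assumptions.

```python
import re


def find_tetranucleotide_repeats(seq, min_copies=6):
    """Return (start, unit, copies) for each run of tandem 4-base repeats.

    A run qualifies when the same 4-base unit occurs at least `min_copies`
    times in a row. Units that are themselves shorter repeats (e.g. AAAA
    or ATAT) are skipped, since they are mono- or dinucleotide runs.
    """
    pattern = re.compile(r"([ACGT]{4})\1{%d,}" % (min_copies - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        if unit[:2] == unit[2:]:
            continue  # homopolymer or dinucleotide repeat, not a true 4-mer
        copies = len(match.group(0)) // 4
        hits.append((match.start(), unit, copies))
    return hits


# Hypothetical test sequence: 8 copies of CAAT flanked by other bases.
example = "GG" + "CAAT" * 8 + "TT"
print(find_tetranucleotide_repeats(example))
```

Note that the reported unit depends on the phase at which the scan first matches, so CAAT, AATC, ATCA, and TCAA can all describe the same physical repeat tract.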

Relevance:

80.00%

Abstract:

An assay that allows measurement of absolute induction frequencies for DNA double-strand breaks (dsbs) in defined regions of the genome and that quantitates rejoining of correct DNA ends has been used to study repair of dsbs in normal human fibroblasts after x-irradiation. The approach involves hybridization of single-copy DNA probes to Not I restriction fragments separated according to size by pulsed-field gel electrophoresis. Induction of dsbs is quantitated from the decrease in the intensity of the hybridizing restriction fragment and an accumulation of a smear below the band. Rejoining of dsbs results in reconstitution of the intact restriction fragment only if correct DNA ends are joined. By comparing results from this technique with results from a conventional electrophoresis assay that detects all rejoining events, it is possible to quantitate the misrejoining frequency. Three Not I fragments on the long arm of chromosome 21 were investigated with regard to dsb induction, yielding an identical induction rate of 5.8 × 10⁻³ breaks per megabase pair per Gy. Correct dsb rejoining was measured for two of these Not I fragments after initial doses of 80 and 160 Gy. The misrejoining frequency was about 25% for both fragments and was independent of dose. This result appears to be representative for the whole genome as shown by analysis of the entire Not I fragment distribution. The correct rejoining events primarily occurred within the first 2 h, while the misrejoining kinetics included a much slower component, with about half of the events occurring between 2 and 24 h. These misrejoining kinetics are similar to those previously reported for production of exchange aberrations in interphase chromosomes.
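The link between band intensity and break induction can be made concrete with a short calculation. The sketch below assumes that breaks are randomly (Poisson) distributed along the DNA, so that the fraction of a restriction fragment remaining intact is exp(-rate × size × dose). The abstract does not state this model, and the 2 Mbp fragment size is a hypothetical value; only the induction rate and the two doses come from the text.

```python
import math

# Induction rate reported in the abstract: breaks per Mbp per Gy.
INDUCTION_RATE = 5.8e-3


def expected_breaks(fragment_mbp, dose_gy, rate=INDUCTION_RATE):
    """Mean number of double-strand breaks in one fragment at a given dose."""
    return rate * fragment_mbp * dose_gy


def fraction_intact(fragment_mbp, dose_gy, rate=INDUCTION_RATE):
    """Poisson probability that a fragment carries zero breaks (assumed model)."""
    return math.exp(-expected_breaks(fragment_mbp, dose_gy, rate))


def induction_rate_from_band(fraction, fragment_mbp, dose_gy):
    """Invert the model: estimate breaks/Mbp/Gy from the remaining band fraction."""
    return -math.log(fraction) / (fragment_mbp * dose_gy)


# Hypothetical 2 Mbp fragment at the two doses mentioned in the abstract.
for dose in (80, 160):
    print(dose, "Gy:", round(fraction_intact(2.0, dose), 3), "of band remains")
```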

Relevance:

40.00%

Abstract:

The mouse is the best model system for the study of mammalian genetics and physiology. Because of the feasibility and importance of studying genetic crosses, the mouse genetic map has received tremendous attention in recent years. It currently contains over 14,000 genetically mapped markers, including 700 mutant loci, 3500 genes, and 6500 simple sequence length polymorphisms (SSLPs). The mutant loci and genes allow insights and correlations concerning physiology and development. The SSLPs provide highly polymorphic anchor points that allow inheritance to be traced in any cross and provide a scaffold for assembling physical maps. Adequate physical mapping resources--notably large-insert yeast artificial chromosome (YAC) libraries--are available to support positional cloning projects based on the genetic map, but a comprehensive physical map is still a few years away. Large-scale sequencing efforts have not yet begun in mouse, but comparative sequence analysis between mouse and human is likely to provide tremendous information about gene structure and regulation.

Relevance:

40.00%

Abstract:

Fluorescence in situ hybridization (FISH) is a powerful tool for physical mapping in human and other mammalian species. However, application of the FISH technique has been limited in plant species, especially for mapping single- or low-copy DNA sequences, due to inconsistent signal production in plant chromosome preparations. Here we demonstrate that bacterial artificial chromosome (BAC) clones can be mapped readily on rice (Oryza sativa L.) chromosomes by FISH. Repetitive DNA sequences in BAC clones can be suppressed efficiently by using rice genomic DNA as a competitor in the hybridization mixture. BAC clones as small as 40 kb were successfully mapped. To demonstrate the application of the FISH technique in physical mapping of plant genomes, both anonymous BAC clones and clones closely linked to a rice bacterial blight-resistance locus, Xa21, were chosen for analysis. The physical location of Xa21 and the relationships among the linked clones were established, thus demonstrating the utility of FISH in plant genome analysis.

Relevance:

30.00%

Abstract:

KIF (kinesin superfamily) proteins are microtubule-dependent molecular motors that play important roles in intracellular transport and cell division. The extent to which KIFs are involved in various transporting phenomena, as well as their regulation mechanism, are unknown. The identification of 16 new KIFs in this report doubles the existing number of KIFs known in the mouse. Conserved nucleotide sequences in the motor domain were amplified by PCR using cDNAs of mouse nervous tissue, kidney, and small intestine as templates. The new KIFs were studied with respect to their expression patterns in different tissues, chromosomal location, and molecular evolution. Our results suggest that (i) there is no apparent tendency among related subclasses of KIFs of cosegregation in chromosomal mapping, and (ii) according to their tissue distribution patterns, KIFs can be divided into two classes–i.e., ubiquitous and specific tissue-dominant. Further characterization of KIFs may elucidate unknown fundamental phenomena underlying intracellular transport. Finally, we propose a straightforward nomenclature system for the members of the mouse kinesin superfamily.

Relevance:

30.00%

Abstract:

Multiple-complete-digest mapping is a DNA mapping technique based on complete-restriction-digest fingerprints of a set of clones that provides highly redundant coverage of the mapping target. The maps assembled from these fingerprints order both the clones and the restriction fragments. Maps are coordinated across three enzymes in the examples presented. Starting with yeast artificial chromosome contigs from the 7q31.3 and 7p14 regions of the human genome, we have produced cosmid-based maps spanning more than one million base pairs. Each yeast artificial chromosome is first subcloned into cosmids at a redundancy of ×15–30. Complete-digest fragments are electrophoresed on agarose gels, poststained, and imaged on a fluorescent scanner. Aberrant clones that are not representative of the underlying genome are rejected in the map construction process. Almost every restriction fragment is ordered, allowing selection of minimal tiling paths with clone-to-clone overlaps of only a few thousand base pairs. These maps demonstrate the practicality of applying the experimental and software-based steps in multiple-complete-digest mapping to a target of significant size and complexity. We present evidence that the maps are sufficiently accurate to validate both the clones selected for sequencing and the sequence assemblies obtained once these clones have been sequenced by a “shotgun” method.
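The core comparison behind fingerprint-based mapping, deciding whether two clones overlap from the restriction fragments they share, can be sketched as follows. This is a simplified illustration and not the authors' software: the greedy matching, the 2% size tolerance, and the fragment sizes are assumptions, and a real map would combine evidence from all three enzymes.

```python
def shared_fragments(fp_a, fp_b, tolerance=0.02):
    """Count fragments in fp_a that pair with a distinct fragment in fp_b.

    Fingerprints are lists of fragment sizes in base pairs from one complete
    digest. Two fragments match when their sizes differ by no more than
    `tolerance` (relative) of the larger one. Each fragment in fp_b can be
    used only once (greedy matching over sorted sizes).
    """
    remaining = sorted(fp_b)
    shared = 0
    for size in sorted(fp_a):
        for i, other in enumerate(remaining):
            if abs(size - other) <= tolerance * max(size, other):
                shared += 1
                del remaining[i]
                break
    return shared


# Hypothetical single-enzyme fingerprints for three cosmid clones.
clone_a = [1200, 3400, 5100, 8000]
clone_b = [3410, 5090, 8000, 9500]
clone_c = [700, 15000]

print(shared_fragments(clone_a, clone_b))  # candidate overlap
print(shared_fragments(clone_a, clone_c))  # no evidence of overlap
```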

Relevance:

30.00%

Abstract:

The region of human chromosome 22q11 is prone to rearrangements. The resulting chromosomal abnormalities are involved in Velo-cardio-facial and DiGeorge syndromes (VCFS and DGS) (deletions), “cat eye” syndrome (duplications), and certain types of tumors (translocations). As a prelude to the development of mouse models for VCFS/DGS by generating targeted deletions in the mouse genome, we examined the organization of genes from human chromosome 22q11 in the mouse. Using genetic linkage analysis and detailed physical mapping, we show that genes from a relatively small region of human 22q11 are distributed on three mouse chromosomes (MMU6, MMU10, and MMU16). Furthermore, although the region corresponding to about 2.5 megabases of the VCFS/DGS critical region is located on mouse chromosome 16, the relative organization of the region is quite different from that in humans. Our results show that the instability of the 22q11 region is not restricted to humans but may have been present throughout evolution. The results also underscore the importance of detailed comparative mapping of genes in mice and humans as a prerequisite for the development of mouse models of human diseases involving chromosomal rearrangements.

Relevance:

30.00%

Abstract:

We present an approach to map large numbers of Tc1 transposon insertions in the genome of Caenorhabditis elegans. Strains have been described that contain up to 500 polymorphic Tc1 insertions. From these we have cloned and shotgun sequenced over 2000 Tc1 flanks, resulting in an estimated set of 400 or more distinct Tc1 insertion alleles. Alignment of these sequences revealed a weak Tc1 insertion site consensus sequence that was symmetric around the invariant TA target site and reads CAYATATRTG. The Tc1 flanking sequences were compared with 40 Mbp of a C. elegans genome sequence. We found 151 insertions within the sequenced area, a density of ≈1 Tc1 insertion in every 265 kb. As the rest of the C. elegans genome sequence is obtained, remaining Tc1 alleles will fall into place. These mapped Tc1 insertions can serve two functions: (i) insertions in or near genes can be used to isolate deletion derivatives that have that gene mutated; and (ii) they represent a dense collection of polymorphic sequence-tagged sites. We demonstrate a strategy to use these Tc1 sequence-tagged sites in fine-mapping mutations.
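Two statements in this abstract can be checked directly: that the consensus CAYATATRTG is symmetric around the central TA, and that 151 insertions in 40 Mbp correspond to roughly one insertion per 265 kb. The sketch below uses the standard IUPAC codes R (A or G) and Y (C or T); the helper names and the example sites are illustrative assumptions.

```python
# IUPAC complements and expansions for the symbols used in the consensus.
IUPAC_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "R": "Y", "Y": "R"}
IUPAC_BASES = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT"}

CONSENSUS = "CAYATATRTG"  # from the abstract


def reverse_complement(seq):
    """Reverse-complement a sequence written with A, C, G, T, R, Y."""
    return "".join(IUPAC_COMPLEMENT[base] for base in reversed(seq))


def matches_consensus(site, consensus=CONSENSUS):
    """True if every base of `site` is allowed by the IUPAC consensus."""
    return len(site) == len(consensus) and all(
        base in IUPAC_BASES[symbol] for base, symbol in zip(site, consensus)
    )


# A symmetric (palindromic) site equals its own reverse complement.
print(reverse_complement(CONSENSUS) == CONSENSUS)

# Insertion density from the reported counts.
insertions = 151
sequenced_kb = 40_000  # 40 Mbp
kb_per_insertion = sequenced_kb / insertions
print(round(kb_per_insertion), "kb per insertion")
```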