924 resultados para UCSC genome browser
Resumo:
The genome of the pufferfish (Fugu rubripes) (400 Mb) is approximately 7.5 times smaller than the human genome, but it has a similar gene repertoire to that of man. If regions of the two genomes exhibited conservation of gene order (i.e., were syntenic), it should be possible to reduce dramatically the effort required for identification of candidate genes in human disease loci by sequencing syntenic regions of the compact Fugu genome. We have demonstrated that three genes (dihydrolipoamide succinyltransferase, S31iii125, and S20i15), which are linked to FOS in the familial Alzheimer disease focus (AD3) on human chromosome 14, have homologues in the Fugu genome adjacent to Fugu cFOS. The relative gene order of cFOS, S31iii125, and S20i15 was the same in both genomes, but in Fugu these three genes lay within a 12.4-kb region, compared to >600 kb in the human AD3 locus. These results demonstrate the conservation of synteny between the genomes of Fugu and man and highlight the utility of this approach for sequence-based identification of genes in human disease loci.
Resumo:
We report several classes of human interspersed repeats that resemble fossils of DNA transposons, elements that move by excision and reintegration in the genome, whereas previously characterized mammalian repeats all appear to have accumulated by retrotransposition, which involves an RNA intermediate. The human genome contains at least 14 families and > 100,000 degenerate copies of short (180-1200 bp) elements that have 14- to 25-bp terminal inverted repeats and are flanked by either 8 bp or TA target site duplications. We describe two ancient 2.5-kb elements with coding capacity, Tigger1 and -2, that closely resemble pogo, a DNA transposon in Drosophila, and probably were responsible for the distribution of some of the short elements. The deduced pogo and Tigger proteins are related to products of five DNA transposons found in fungi and nematodes, and more distantly, to the Tc1 and mariner transposases. They also are very similar to the major mammalian centromere protein CENP-B, suggesting that this may have a transposase origin. We further identified relatively low-copy-number mariner elements in both human and sheep DNA. These belong to two subfamilies previously identified in insect genomes, suggesting lateral transfer between diverse species.
Resumo:
An efficient method of constructing recombinant adenoviruses (Ads) has been established. The expression unit to be introduced into recombinant Ad was first inserted into the unique Swa I site of the full-length Ad genome cloned in a cassette cosmid. The cassette bearing the expression unit was then cotransfected into human embryonic kidney 293 cells together with the Ad DNA-terminal protein complex digested at several sites with Eco T22I or Ase I/EcoRI. The use of the parent Ad DNA-terminal protein complex instead of the deproteinized Ad genome DNA allowed very efficient recovery of the desired recombinant Ad, and the above restriction digestion drastically reduced regeneration of the parent virus. Several hundred virus clones were readily obtained in each experiment, and about 70% of the clones were the desired recombinant viruses. Furthermore, because the cassette contained the full-length Ad genome, any position of the genome could be easily modified to develop a new vector design. We established construction systems for two types of Ad vectors, the E1-substitution type and the E4-insertion type. This method may greatly facilitate the application of recombinant Ads and should be useful for further improvement of Ad vectors.
Resumo:
Integration of viral DNA into the host nuclear genome, although not unusual in bacterial and animal systems, has surprisingly not been reported for plants. We have discovered geminvirus-related DNA (GRD) sequences, in the form of distinct sets of multiple direct repeats comprising three related repeat classes, situated in a unique locus in the Nicotiana tabacum (tobacco) nuclear genome. The organization of these sequences is similar or identical in eight different tobacco cultivars we have examined. DNA sequence analysis reveals that each repeat has sequences most resembling those of the New World geminiviral DNA replication origin plus the adjacent AL1 gene, encoding the viral replication protein. We believe these GRD sequences originated quite recently in Nicotiana evolution through integration of geminiviral DNA by some combination of the processes of illegitimate recombination, amplification, deletions, and rearrangements. These events must have occurred in plant tissue that was subsequently able to contribute to meristematic tissue yielding gametes. GRD may have been retained in tobacco by selection or by random fixation in a small evolving population. Although we cannot detect transcription of these sequences, this does not exclude the possibility that they may originally have been expressed.
Resumo:
In cells simultaneously infected with any two of the three reovirus serotypes ST1, ST2, and ST3, up to 15% of the yields are intertypic reassortants that contain all possible combinations of parental genome segments. We have now found that not all genome segments in reassortants are wild type. In reassortants that possess more ST1 than ST3 genome segments, all ST1 genome segments appear to be wild type, but the incoming ST3 genome segments possess mutations that make them more similar to the ST1 genome segments that they replace. In reassortants resulting from crosses of the more distantly related ST3 and ST2 viruses that possess a majority of ST3 genome segments, all incoming ST2 genome segments are wild type, but the ST3 S4 genome segment possesses two mutations, G74 to A and G624 to A, that function as acceptance signals. Recognition of these signals has far-reaching implications for the construction of reoviruses with novel properties and functions.
Resumo:
We have characterized a family of repetitive DNA elements with homology to the MgPa cellular adhesion operon of Mycoplasma genitalium, a bacterium that has the smallest known genome of any free-living organism. One element, 2272 bp in length and flanked by DNA with no homology to MgPa, was completely sequenced. At least four others were partially sequenced. The complete element is a composite of six regions. Five of these regions show sequence similarity with nonadjacent segments of genes of the MgPa operon. The sixth region, located near the center of the element, is an A+T-rich sequence that has only been found in this repeat family. Open reading frames are present within the five individual regions showing sequence homology to MgPa and the adjacent open reading frame 3 (ORF3) gene. However, termination codons are found between adjacent regions of homology to the MgPa operon and in the A+T-rich sequence. Thus, these repetitive elements do not appear to be directly expressible protein coding sequences. The sequence of one region from five different repetitive elements was compared with the homologous region of the MgPa gene from the type strain G37 and four newly isolated M. genitalium strains. Recombination between repetitive elements of strain G37 and the MgPa operon can explain the majority of polymorphisms within our partial sequences of the MgPa genes of the new isolates. Therefore, we propose that the repetitive elements of M. genitalium provide a reservoir of sequence that contributes to antigenic variation in proteins of the MgPa cellular adhesion operon.
Resumo:
Biologists require genetic as well as molecular tools to decipher genomic information and ultimately to understand gene function. The Berkeley Drosophila Genome Project is addressing these needs with a massive gene disruption project that uses individual, genetically engineered P transposable elements to target open reading frames throughout the Drosophila genome. DNA flanking the insertions is sequenced, thereby placing an extensive series of genetic markers on the physical genomic map and associating insertions with specific open reading frames and genes. Insertions from the collection now lie within or near most Drosophila genes, greatly reducing the time required to identify new mutations and analyze gene functions. Information revealed from these studies about P element site specificity is being used to target the remaining open reading frames.
Resumo:
Arabidopsis thaliana is a small flowering plant that is a member of the family cruciferae. It has many characteristics--diploid genetics, rapid growth cycle, relatively low repetitive DNA content, and small genome size--that recommend it as the model for a plant genome project. The current status of the genetic and physical maps, as well as efforts to sequence the genome, are presented. Examples are given of genes isolated by using map-based cloning. The importance of the Arabidopsis project for plant biology in general is discussed.
Resumo:
The physical map of the 100-Mb Caenorhabditis elegans genome consists of 17,500 cosmids and 3500 yeast artificial chromosomes (YACs). A total of 22.5 Mb has been sequenced, with the remainder expected by 1998. A further 15.5 Mb of unfinished sequence is freely available online: because the areas sequenced so far are relatively gene rich, about half the 13,000 genes can now be scanned. More than a quarter of the genes are represented by expressed sequence tags (ESTs). All information pertaining to the genome is publicly available in the ACeDB data base.
Resumo:
In this paper, we describe the accomplishments of the initial phase of the Human Genome Project, with particular attention to the progress made toward achieving the defined goals for constructing genetic and physical maps of the human genome and determining the sequence of human DNA, identifying the complete set of human genes, and analyzing the need for adequate policies for using the information about human genetics in ways that maximize the benefits for individuals and society.
Resumo:
The mouse is the best model system for the study of mammalian genetics and physiology. Because of the feasibility and importance of studying genetic crosses, the mouse genetic map has received tremendous attention in recent years. It currently contains over 14,000 genetically mapped markers, including 700 mutant loci, 3500 genes, and 6500 simple sequence length polymorphisms (SSLPs). The mutant loci and genes allow insights and correlations concerning physiology and development. The SSLPs provide highly polymorphic anchor points that allow inheritance to be traced in any cross and provide a scaffold for assembling physical maps. Adequate physical mapping resources--notably large-insert yeast artificial chromosome (YAC) libraries--are available to support positional cloning projects based on the genetic map, but a comprehensive physical map is still a few years away. Large-scale sequencing efforts have not yet begun in mouse, but comparative sequence analysis between mouse and human is likely to provide tremendous information about gene structure and regulation.
Resumo:
Previous investigations from our laboratory showed that the genomes of plants, like those of vertebrates, are mosaics of isochores, i.e., of very long DNA segments that are compositionally homogeneous and that can be subdivided into a small number of families characterized by different GC levels (GC is the mole fraction of guanine+cytosine). Compositional DNA fractions corresponding to different isochore families were used to investigate, by hybridization with appropriate probes, the gene distribution in vertebrate genomes. Here we report such a study on the genome of a plant, maize. The gene distribution that we found is most striking, in that almost all genes are present in isochores covering an extremely narrow (1-2%) GC range and only representing 10-20% of the genome. This gene distribution, which seems to characterize other Gramineae as well, is remarkably different from the gene distribution previously found in vertebrate genomes.
Resumo:
Frequencies of meiotic configurations in cytogenetic stocks are dependent on chiasma frequencies in segments defined by centromeres, breakpoints, and telomeres. The expectation maximization algorithm is proposed as a general method to perform maximum likelihood estimations of the chiasma frequencies in the intervals between such locations. The estimates can be translated via mapping functions into genetic maps of cytogenetic landmarks. One set of observational data was analyzed to exemplify application of these methods, results of which were largely concordant with other comparable data. The method was also tested by Monte Carlo simulation of frequencies of meiotic configurations from a monotelodisomic translocation heterozygote, assuming six different sample sizes. The estimate averages were always close to the values given initially to the parameters. The maximum likelihood estimation procedures can be extended readily to other kinds of cytogenetic stocks and allow the pooling of diverse cytogenetic data to collectively estimate lengths of segments, arms, and chromosomes.