16 resultados para genome project
em National Center for Biotechnology Information - NCBI
Resumo:
Progress in agricultural and environmental technologies is hampered by a slower rate of gene discovery in plants than animals. The vast pool of genes in plants, however, will be an important resource for insertion of genes, via biotechnological procedures, into an array of plants, generating unique germ plasms not achievable by conventional breeding. It just became clear that genomes of grasses have evolved in a manner analogous to Lego blocks. Large chromosome segments have been reshuffled and stuffer pieces added between genes. Although some genomes have become very large, the genome with the fewest stuffer pieces, the rice genome, is the Rosetta Stone of all the bigger grass genomes. This means that sequencing the rice genome as anchor genome of the grasses will provide instantaneous access to the same genes in the same relative physical position in other grasses (e.g., corn and wheat), without the need to sequence each of these genomes independently. (i) The sequencing of the entire genome of rice as anchor genome for the grasses will accelerate plant gene discovery in many important crops (e.g., corn, wheat, and rice) by several orders of magnitudes and reduce research and development costs for government and industry at a faster pace. (ii) Costs for sequencing entire genomes have come down significantly. Because of its size, rice is only 12% of the human or the corn genome, and technology improvements by the human genome project are completely transferable, translating in another 50% reduction of the costs. (iii) The physical mapping of the rice genome by a group of Japanese researchers provides a jump start for sequencing the genome and forming an international consortium. Otherwise, other countries would do it alone and own proprietary positions.
Resumo:
Since 1991, the Rice Genome Research Program in Japan has carried out rice genomics, such as large-scale cDNA analysis, construction of a fine-scale restriction fragment length polymorphism map, and physical mapping of the rice genome with yeast artificial chromosome clones. These studies have made a great impact on research into grass genomes and made rice a model plant for other cereal crop research. Starting in 1998, the Rice Genome Research Program will step into a new stage of genomics—that of genome sequencing. This project eventually should reveal all of the genomic sequence information in the rice plant and be an indispensable aid in understanding the genomics of other grass species.
Resumo:
Biologists require genetic as well as molecular tools to decipher genomic information and ultimately to understand gene function. The Berkeley Drosophila Genome Project is addressing these needs with a massive gene disruption project that uses individual, genetically engineered P transposable elements to target open reading frames throughout the Drosophila genome. DNA flanking the insertions is sequenced, thereby placing an extensive series of genetic markers on the physical genomic map and associating insertions with specific open reading frames and genes. Insertions from the collection now lie within or near most Drosophila genes, greatly reducing the time required to identify new mutations and analyze gene functions. Information revealed from these studies about P element site specificity is being used to target the remaining open reading frames.
Resumo:
In this paper, we describe the accomplishments of the initial phase of the Human Genome Project, with particular attention to the progress made toward achieving the defined goals for constructing genetic and physical maps of the human genome and determining the sequence of human DNA, identifying the complete set of human genes, and analyzing the need for adequate policies for using the information about human genetics in ways that maximize the benefits for individuals and society.
Resumo:
Plant genome research is needed as the foundation for an entirely new level of efficiency and success in the application of genetics and breeding to crop plants and products from crop plants. Genetic improvements in crop plants beyond current capabilities are needed to meet the growing world demand not only for more food, but also a greater diversity of food, higher-quality food, and safer food, produced on less land, while conserving soil, water, and genetic resources. Plant biology research, which is poised for dramatic advances, also depends fundamentally on plant genome research. The current Arabidopsis Genome Project has proved of immediate value to plant biology research, but a much greater effort is needed to ensure the full benefits of plant biology and especially plant genome research to agriculture. International cooperation is critical, both because genome projects are too large for any one country and the information forthcoming is of benefit to the world and not just the countries that do the work. Recent research on grass genomes has revealed that, because of extensive senteny and colinearity within linkage groups that make up the chromosomes, new information on the genome of one grass can be used to understand the genomes and predict the location of genes on chromosomes of the other grasses. Genome research applied to grasses as a group thereby can increase the efficiency and effectiveness of breeding for improvement of each member of this group, which includes wheat, corn, and rice, the world’s three most important sources of food.
Resumo:
Arabidopsis thaliana is a small flowering plant that is a member of the family cruciferae. It has many characteristics--diploid genetics, rapid growth cycle, relatively low repetitive DNA content, and small genome size--that recommend it as the model for a plant genome project. The current status of the genetic and physical maps, as well as efforts to sequence the genome, are presented. Examples are given of genes isolated by using map-based cloning. The importance of the Arabidopsis project for plant biology in general is discussed.
Resumo:
The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) is maintained at the European Bioinformatics Institute (EBI) in an international collaboration with the DNA Data Bank of Japan (DDBJ) and GenBank at the NCBI (USA). Data is exchanged amongst the collaborating databases on a daily basis. The major contributors to the EMBL database are individual authors and genome project groups. Webin is the preferred web-based submission system for individual submitters, whilst automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO). Database releases are produced quarterly. Network services allow free access to the most up-to-date data collection via ftp, email and World Wide Web interfaces. EBI’s Sequence Retrieval System (SRS), a network browser for databanks in molecular biology, integrates and links the main nucleotide and protein databases plus many specialized databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, BLAST) are available which allow external users to compare their own sequences against the latest data in the EMBL Nucleotide Sequence Database and SWISS-PROT.
Resumo:
In response to a need for a general catalog of genome variation to address the large-scale sampling designs required by association studies, gene mapping and evolutionary biology, the National Center for Biotechnology Information (NCBI) has established the dbSNP database [S.T.Sherry, M.Ward and K.Sirotkin (1999) Genome Res., 9, 677–679]. Submissions to dbSNP will be integrated with other sources of information at NCBI such as GenBank, PubMed, LocusLink and the Human Genome Project data. The complete contents of dbSNP are available to the public at website: http://www.ncbi.nlm.nih.gov/SNP. The complete contents of dbSNP can also be downloaded in multiple formats via anonymous FTP at ftp://ncbi.nlm.nih.gov/snp/.
Resumo:
The Human Genome Project has generated extensive map and sequence data for a large number of Bacterial Artificial Chromosome (BAC) clones. In order to maximize the efficient use of the data and to minimize the redundant work for the research community, The Institute for Genomic Research (TIGR) comprehensive BAC resource (cBACr) (http://www.tigr.org/tdb/BacResource/BAC_resource_intro.html) was built as an expansion of the TIGR human BAC ends database. This resource collects, integrates and reports the information on library, maps, sequence, annotation and functions for each human and mouse BAC. The current database contains 635 016 human BACs and 265 617 mouse BACs that were characterized by various approaches, among which 22 705 human clones and 1000 mouse clones have sequence and annotation data.
Resumo:
Simple phylogenetic tests were applied to a large data set of nucleotide sequences from two nuclear genes and a region of the mitochondrial genome of Trypanosoma cruzi, the agent of Chagas' disease. Incongruent gene genealogies manifest genetic exchange among distantly related lineages of T. cruzi. Two widely distributed isoenzyme types of T. cruzi are hybrids, their genetic composition being the likely result of genetic exchange between two distantly related lineages. The data show that the reference strain for the T. cruzi genome project (CL Brener) is a hybrid. Well-supported gene genealogies show that mitochondrial and nuclear gene sequences from T. cruzi cluster, respectively, in three or four distinct clades that do not fully correspond to the two previously defined major lineages of T. cruzi. There is clear genetic differentiation among the major groups of sequences, but genetic diversity within each major group is low. We estimate that the major extant lineages of T. cruzi have diverged during the Miocene or early Pliocene (3–16 million years ago).
Resumo:
The development of a highly reliable physical map with landmark sites spaced an average of 100 kbp apart has been a central goal of the Human Genome Project. We have approached the physical mapping of human chromosome 11 with this goal as a primary target. We have focused on strategies that would utilize yeast artificial chromosome (YAC) technology, thus permitting long-range coverage of hundreds of kilobases of genomic DNA, yet we sought to minimize the ambiguities inherent in the use of this technology, particularly the occurrence of chimeric genomic DNA clones. This was achieved through the development of a chromosome 11-specific YAC library from a human somatic cell hybrid line that has retained chromosome 11 as its sole human component.To maximize the efficiency of YAC contig assembly and extension, we have employed an Alu-PCR-based hybridization screening system. This system eliminates many of the more costly and time-consuming steps associated with sequence tagged site content mapping such as sequencing, primer production, and hierarchical screening, resulting in greater efficiency with increased throughput and reduced cost. Using these approaches, we have achieved YAC coverage for >90% of human chromosome 11, with an average intermarker distance of <100 kbp. Cytogenetic localization has been determined for each contig by fluorescent in situ hybridization and/or sequence tagged site content. The YAC contigs that we have generated should provide a robust framework to move forward to sequence-ready templates for the sequencing efforts of the Human Genome Project as well as more focused positional cloning on chromosome 11.
Resumo:
The challenge of the Human Genome Project is to increase the rate of DNA sequence acquisition by two orders of magnitude to complete sequencing of the human genome by the year 2000. The present work describes a rapid detection method using a two-dimensional optical wave guide that allows measurement of real-time binding or melting of a light-scattering label on a DNA array. A particulate label on the target DNA acts as a light-scattering source when illuminated by the evanescent wave of the wave guide and only the label bound to the surface generates a signal. Imaging/visual examination of the scattered light permits interrogation of the entire array simultaneously. Hybridization specificity is equivalent to that obtained with a conventional system using autoradiography. Wave guide melting curves are consistent with those obtained in the liquid phase and single-base discrimination is facile. Dilution experiments showed an apparent lower limit of detection at 0.4 nM oligonucleotide. This performance is comparable to the best currently known fluorescence-based systems. In addition, wave guide detection allows manipulation of hybridization stringency during detection and thereby reduces DNA chip complexity. It is anticipated that this methodology will provide a powerful tool for diagnostic applications that require rapid cost-effective detection of variations from known sequences.
Resumo:
We report a general mass spectrometric approach for the rapid identification and characterization of proteins isolated by preparative two-dimensional polyacrylamide gel electrophoresis. This method possesses the inherent power to detect and structurally characterize covalent modifications. Absolute sensitivities of matrix-assisted laser desorption ionization and high-energy collision-induced dissociation tandem mass spectrometry are exploited to determine the mass and sequence of subpicomole sample quantities of tryptic peptides. These data permit mass matching and sequence homology searching of computerized peptide mass and protein sequence data bases for known proteins and design of oligonucleotide probes for cloning unknown proteins. We have identified 11 proteins in lysates of human A375 melanoma cells, including: alpha-enolase, cytokeratin, stathmin, protein disulfide isomerase, tropomyosin, Cu/Zn superoxide dismutase, nucleoside diphosphate kinase A, galaptin, and triosephosphate isomerase. We have characterized several posttranslational modifications and chemical modifications that may result from electrophoresis or subsequent sample processing steps. Detection of comigrating and covalently modified proteins illustrates the necessity of peptide sequencing and the advantages of tandem mass spectrometry to reliably and unambiguously establish the identity of each protein. This technology paves the way for studies of cell-type dependent gene expression and studies of large suites of cellular proteins with unprecedented speed and rigor to provide information complementary to the ongoing Human Genome Project.