78 resultados para genome analysis

em National Center for Biotechnology Information - NCBI


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The recent ability to sequence whole genomes allows ready access to all genetic material. The approaches outlined here allow automated analysis of sequence for the synthesis of optimal primers in an automated multiplex oligonucleotide synthesizer (AMOS). The efficiency is such that all ORFs for an organism can be amplified by PCR. The resulting amplicons can be used directly in the construction of DNA arrays or can be cloned for a large variety of functional analyses. These tools allow a replacement of single-gene analysis with a highly efficient whole-genome analysis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present a method for discovering conserved sequence motifs from families of aligned protein sequences. The method has been implemented as a computer program called emotif (http://motif.stanford.edu/emotif). Given an aligned set of protein sequences, emotif generates a set of motifs with a wide range of specificities and sensitivities. emotif also can generate motifs that describe possible subfamilies of a protein superfamily. A disjunction of such motifs often can represent the entire superfamily with high specificity and sensitivity. We have used emotif to generate sets of motifs from all 7,000 protein alignments in the blocks and prints databases. The resulting database, called identify (http://motif.stanford.edu/identify), contains more than 50,000 motifs. For each alignment, the database contains several motifs having a probability of matching a false positive that range from 10−10 to 10−5. Highly specific motifs are well suited for searching entire proteomes, while generating very few false predictions. identify assigns biological functions to 25–30% of all proteins encoded by the Saccharomyces cerevisiae genome and by several bacterial genomes. In particular, identify assigned functions to 172 of proteins of unknown function in the yeast genome.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm2 DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Fluorescence in situ hybridization (FISH) is a powerful tool for physical mapping in human and other mammalian species. However, application of the FISH technique has been limited in plant species, especially for mapping single- or low-copy DNA sequences, due to inconsistent signal production in plant chromosome preparations. Here we demonstrate that bacterial artificial chromosome (BAC) clones can be mapped readily on rice (Oryza sativa L.) chromosomes by FISH. Repetitive DNA sequences in BAC clones can be suppressed efficiently by using rice genomic DNA as a competitor in the hybridization mixture. BAC clones as small as 40 kb were successfully mapped. To demonstrate the application of the FISH technique in physical mapping of plant genomes, both anonymous BAC clones and clones closely linked to a rice bacterial blight-resistance locus, Xa21, were chosen for analysis. The physical location of Xa21 and the relationships among the linked clones were established, thus demonstrating the utility of FISH in plant genome analysis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A homogeneous DNA diagnostic assay based on template-directed primer extension detected by fluorescence resonance energy transfer, named template-directed dye-terminator incorporation (TDI) assay, has been developed for mutation detection and high throughput genome analysis. Here, we report the successful application of the TDI assay to detect mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene, the human leukocyte antigen H (HLA-H) gene, and the receptor tyrosin kinase (RET) protooncogene that are associated with cystic fibrosis, hemochromatosis, and multiple endocrine neoplasia, type 2, respectively. Starting with total human DNA, the samples are amplified by the PCR followed by enzymatic degradation of excess primers and deoxyribonucleoside triphosphates before the primer extension reaction is performed. All these standardized steps are performed in the same tube, and the fluorescence changes are monitored in real time, making it a useful clinical DNA diagnostic method.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We have asked whether comparative genome analysis and rat transgenesis can be used to identify functional regulatory domains in the gene locus encoding the hypothalamic neuropeptides oxytocin (OT) and vasopressin. Isotocin (IT) and vasotocin (VT) are the teleost homologues of these genes. A contiguous stretch of 46 kb spanning the Fugu IT-VT locus has been sequenced, and nine putative genes were found. Unlike the OT and vasopressin genes, which are closely linked in the mammalian genome in a tail-to-tail orientation, Fugu IT and VT genes are linked head to tail and are separated by five genes. When a cosmid containing the Fugu IT-VT locus was introduced into the rat genome, we found that the Fugu IT gene was specifically expressed in rat hypothalamic oxytocinergic neurons and mimicked the response of the endogenous OT gene to an osmotic stimulus. These data show that cis-acting elements and trans-acting factors mediating the cell-specific and physiological regulation of the OT and IT genes are conserved between mammals and fish. The combination of Fugu genome analysis and transgenesis in a mammal is a powerful tool for identifying and analyzing conserved vertebrate regulatory elements.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Large quantities of DNA sequence information about plant genes are rapidly accumulating in public databases, but to progress from DNA sequence to biological function a mutant allele for each of the genes ideally should be available. Here we describe a gene trap construct that allowed us to disrupt transcribed genes with a high efficiency in Arabidopsis thaliana. In the T-DNA vector used, the expression of a bacterial reporter gene coding for neomycin phosphotransferase II (nptII) depends on the in vivo generation of a translation fusion upon the T-DNA integration into the Arabidopsis genome. Analysis of 20 selected transgenic lines showed that 12 lines are T-DNA insertion mutants. The disrupted genes analyzed encoded ribosomal proteins (three lines), aspartate tRNA synthase, DNA ligase, basic-domain leucine zipper DNA binding protein, ATP-binding cassette transporter, and five proteins of unknown function. Four tagged genes were new for Arabidopsis. The results presented here suggest that gene trapping, using nptII as a reporter gene, can be as high as 80% and opens novel perspectives for systematic gene tagging in A. thaliana.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Here we present the successful application of the microarray technology platform to the analysis of DNA polymorphisms. Using the rice genome as a model, we demonstrate the potential of a high-throughput genome analysis method called Diversity Array Technology, DArT‘. In the format presented here the technology is assaying for the presence (or amount) of a specific DNA fragment in a representation derived from the total genomic DNA of an organism or a population of organisms. Two different approaches are presented: the first involves contrasting two representations on a single array while the second involves contrasting a representation with a reference DNA fragment common to all elements of the array. The Diversity Panels created using this method allow genetic fingerprinting of any organism or group of organisms belonging to the gene pool from which the panel was developed. Diversity Arrays enable rapid and economical application of a highly parallel, solid-state genotyping technology to any genome or complex genomic mixtures.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In order to support the structural genomic initiatives, both by rapidly classifying newly determined structures and by suggesting suitable targets for structure determination, we have recently developed several new protocols for classifying structures in the CATH domain database (http://www.biochem.ucl.ac.uk/bsm/cath). These aim to increase the speed of classification of new structures using fast algorithms for structure comparison (GRATH) and to improve the sensitivity in recognising distant structural relatives by incorporating sequence information from relatives in the genomes (DomainFinder). In order to ensure the integrity of the database given the expected increase in data, the CATH Protein Family Database (CATH-PFDB), which currently includes 25 320 structural domains and a further 160 000 sequence relatives has now been installed in a relational ORACLE database. This was essential for developing more rigorous validation procedures and for allowing efficient querying of the database, particularly for genome analysis. The associated Dictionary of Homologous Superfamilies [Bray,J.E., Todd,A.E., Pearl,F.M.G., Thornton,J.M. and Orengo,C.A. (2000) Protein Eng., 13, 153–165], which provides multiple structural alignments and functional information to assist in assigning new relatives, has also been expanded recently and now includes information for 903 homo­logous superfamilies. In order to improve coverage of known structures, preliminary classification levels are now provided for new structures at interim stages in the classification protocol. Since a large proportion of new structures can be rapidly classified using profile-based sequence analysis [e.g. PSI-BLAST: Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 3389–3402], this provides preliminary classification for easily recognisable homologues, which in the latest release of CATH (version 1.7) represented nearly three-quarters of the non-identical structures.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Macromolecular transport systems in bacteria currently are classified by function and sequence comparisons into five basic types. In this classification system, type II and type IV secretion systems both possess members of a superfamily of genes for putative NTP hydrolase (NTPase) proteins that are strikingly similar in structure, function, and sequence. These include VirB11, TrbB, TraG, GspE, PilB, PilT, and ComG1. The predicted protein product of tadA, a recently discovered gene required for tenacious adherence of Actinobacillus actinomycetemcomitans, also has significant sequence similarity to members of this superfamily and to several unclassified and uncharacterized gene products of both Archaea and Bacteria. To understand the relationship of tadA and tadA-like genes to those encoding the putative NTPases of type II/IV secretion, we used a phylogenetic approach to obtain a genealogy of 148 NTPase genes and reconstruct a scenario of gene superfamily evolution. In this phylogeny, clear distinctions can be made between type II and type IV families and their constituent subfamilies. In addition, the subgroup containing tadA constitutes a novel and extremely widespread subfamily of the family encompassing all putative NTPases of type IV secretion systems. We report diagnostic amino acid residue positions for each major monophyletic family and subfamily in the phylogenetic tree, and we propose an easy method for precisely classifying and naming putative NTPase genes based on phylogeny. This molecular key-based method can be applied to other gene superfamilies and represents a valuable tool for genome analysis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Complete genome sequences are providing a framework to allow the investigation of biological processes by the use of comprehensive approaches. Genome analysis also is having a dramatic impact on medicine through its identification of genes and mutations involved in disease and the elucidation of entire microbial gene sets. Studies of the sequences of model organisms, such as that of the nematode worm Caenorhabditis elegans, are providing extraordinary insights into development and differentiation that aid the study of these processes in humans. The field of functional genomics seeks to devise and apply technologies that take advantage of the growing body of sequence information to analyze the full complement of genes and proteins encoded by an organism.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Four members of the canine olfactory receptor gene family were characterized. The predicted proteins shared 40-64% identity with previously identified olfactory receptors. The four subfamilies identified in Southern hybridization experiments had as few as 2 and as many as 20 members. All four genes were expressed exclusively in olfactory epithelium. Expression of multiple members of the larger subfamilies was detected, suggesting that most if not all of the cross-hybridizing bands in genomic Southern blots represented actively transcribed olfactory receptor genes. Analysis of large DNA fragments using Southern blots of pulsed-field gels indicated that subfamily members were clustered together, and that two of the subfamilies were closely linked in the dog genome. Analysis of the four olfactory receptor gene subfamilies in 26 breeds of dog provided evidence that the number of genes per subfamily was stable in spite of differential selection on the basis of olfactory acuity in scent hounds, sight hounds, and toy breeds.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Microsatellites, tandem arrays of short (2-5 bp) nucleotide motifs, are present in high numbers in most eukaryotic genomes. We have characterized the physical distribution of microsatellites on chromosomes of sugar beet (Beta vulgaris L.). Each microsatellite sequence shows a characteristic genomic distribution and motif-dependent dispersion, with site-specific amplification on one to seven pairs of centromeres or intercalary chromosomal regions and weaker, dispersed hybridization along chromosomes. Exclusion of some microsatellites from 18S-5.8S-25S rRNA gene sites, centromeres, and intercalary sites was observed. In-gel and in situ hybridization patterns are correlated, with highly repeated restriction fragments indicating major centromeric sites of microsatellite arrays. The results have implications for genome evolution and the suitability of particular microsatellite markers for genetic mapping and genome analysis.