972 results for Oligonucleotide Array Sequence Analysis
Abstract:
The hypothesis that chromosomal fragile sites may be “weak links” that result in hot spots for cancer-specific chromosome rearrangements was supported by the discovery that numerous cancer cell homozygous deletions and a familial translocation map within the FHIT gene, which encompasses the common fragile site, FRA3B. Sequence analysis of 276 kb of the FRA3B/FHIT locus and 22 associated cancer cell deletion endpoints shows that this locus is a frequent target of homologous recombination between long interspersed nuclear element sequences resulting in FHIT gene internal deletions, probably as a result of carcinogen-induced damage at FRA3B fragile sites.
Abstract:
In the last decade, two tools, one drawn from information theory and the other from artificial neural networks, have proven particularly useful in many different areas of sequence analysis. The work presented herein indicates that these two approaches can be joined in a general fashion to produce a very powerful search engine that is capable of locating members of a given nucleic acid sequence family in either local or global sequence searches. This program can, in turn, be queried for its definition of the motif under investigation, ranking each base in context for its contribution to membership in the motif family. In principle, the method used can be applied to any binding motif, including both DNA and RNA sequence families, given sufficient family size.
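As a rough illustration of the information-theoretic half of this idea, the sketch below scores each position of a set of aligned sites by its information content in bits. It is not the authors' program: the function name `position_information`, the example sites, and the omission of any small-sample correction or neural network component are all assumptions made for brevity.

```python
import math
from collections import Counter


def position_information(aligned_sites):
    """Return the information content (in bits) of each column of aligned DNA sites.

    Assumes a four-letter alphabet with equal background frequencies, so a
    fully conserved column scores 2 bits and a uniform column scores 0 bits.
    No small-sample correction is applied (a simplification).
    """
    if not aligned_sites:
        raise ValueError("need at least one site")
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("all sites must have the same length")

    info = []
    for col in range(length):
        counts = Counter(site[col] for site in aligned_sites)
        total = sum(counts.values())
        # Shannon entropy of the observed base frequencies in this column.
        entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
        info.append(2.0 - entropy)
    return info


if __name__ == "__main__":
    # Hypothetical aligned binding sites, for illustration only.
    sites = ["TATAAT", "TATGAT", "TACAAT", "TATACT"]
    for position, bits in enumerate(position_information(sites), start=1):
        print(f"position {position}: {bits:.2f} bits")
```

Positions with higher scores are more strongly conserved across the family and so contribute more to recognizing a member; ranking bases "in context", as the abstract describes, would require the trained model itself.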
Abstract:
Chromosome 7q22 has been the focus of many cytogenetic and molecular studies aimed at delineating regions commonly deleted in myeloid leukemias and myelodysplastic syndromes. We have compared a gene-dense, GC-rich sub-region of 7q22 with the orthologous region on mouse chromosome 5. A physical map of 640 kb of genomic DNA from mouse chromosome 5 was derived from a series of overlapping bacterial artificial chromosomes. A 296 kb segment from the physical map, spanning Ache to Tfr2, was compared with 267 kb of human sequence. We identified a conserved linkage of 12 genes including an open reading frame flanked by Ache and Asr2, a novel cation-chloride cotransporter interacting protein Cip1, Ephb4, Zan and Perq1. While some of these genes have been previously described, in each case we present new data derived from our comparative sequence analysis. Adjacent unfinished sequence data from the mouse contains an orthologous block of 10 additional genes including three novel cDNA sequences that we subsequently mapped to human 7q22. Methods for displaying comparative genomic information, including unfinished sequence data, are becoming increasingly important. We supplement our printed comparative analysis with a new, Web-based program called Laj (local alignments with java). Laj provides interactive access to archived pairwise sequence alignments via the WWW. It displays synchronized views of a dot-plot, a percent identity plot, a nucleotide-level local alignment and a variety of relevant annotations. Our mouse–human comparison can be viewed at http://web.uvic.ca/~bioweb/laj.html. Laj is available at http://bio.cse.psu.edu/, along with online documentation and additional examples of annotated genomic regions.
Abstract:
While genome sequencing projects are advancing rapidly, EST sequencing and analysis remains a primary research tool for the identification and categorization of gene sequences in a wide variety of species and an important resource for annotation of genomic sequence. The TIGR Gene Indices (http://www.tigr.org/tdb/tgi.shtml) are a collection of species-specific databases that use a highly refined protocol to analyze EST sequences in an attempt to identify the genes represented by that data and to provide additional information regarding those genes. Gene Indices are constructed by first clustering, then assembling EST and annotated gene sequences from GenBank for the targeted species. This process produces a set of unique, high-fidelity virtual transcripts, or Tentative Consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to mapping and genomic sequence data, to provide links between orthologous and paralogous genes and as a resource for comparative sequence analysis.
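The "cluster first, then assemble" step can be pictured with a toy single-linkage clustering in which two sequences are linked whenever they share enough k-mers. This is only a sketch under stated assumptions: the abstract does not describe the actual TIGR protocol at this level, and the function names and the `k` and `min_shared` parameters are hypothetical.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def cluster_sequences(seqs, k=8, min_shared=3):
    """Group sequences into single-linkage clusters by shared k-mer count.

    Two sequences are linked when they share at least `min_shared` distinct
    k-mers. Returns a sorted list of clusters, each a list of indices into
    `seqs`. Both thresholds are illustrative assumptions.
    """
    parent = list(range(len(seqs)))

    def find(i):
        # Find the cluster representative, with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def union(i, j):
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[root_j] = root_i

    kmer_sets = [kmers(s, k) for s in seqs]
    # All-against-all comparison; fine for a toy example, too slow at scale.
    for i in range(len(seqs)):
        for j in range(i + 1, len(seqs)):
            if len(kmer_sets[i] & kmer_sets[j]) >= min_shared:
                union(i, j)

    clusters = {}
    for i in range(len(seqs)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


if __name__ == "__main__":
    # Hypothetical EST-like fragments, for illustration only.
    ests = ["ACGTACGTTGCA", "GTACGTTGCAAT", "TTTTTTTTTTTT"]
    print(cluster_sequences(ests, k=4, min_shared=3))
```

Each resulting cluster would then be passed to an assembler to produce a single consensus sequence, which corresponds to the Tentative Consensus step described above.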
Abstract:
SINE (short interspersed element) insertion analysis elucidates contentious aspects in the phylogeny of toothed whales and dolphins (Odontoceti), especially river dolphins. Here, we characterize 25 informative SINEs inserted into unique genomic loci during evolution of odontocetes to construct a cladogram, and determine a total of 2.8 kb per taxon of the flanking sequences of these SINE loci to estimate divergence times among lineages. We demonstrate that: (i) Odontocetes are monophyletic; (ii) Ganges River dolphins, beaked whales, and ocean dolphins diverged (in this order) after sperm whales; (iii) three other river dolphin taxa, namely the Amazon, La Plata, and Yangtze river dolphins, form a monophyletic group with Yangtze River dolphins being the most basal; and (iv) the rapid radiation of extant cetacean lineages occurred some 28–33 million years B.P., in strong accord with the fossil record. The combination of SINE and flanking sequence analysis suggests a topology and set of divergence times for odontocete relationships, offering alternative explanations for several long-standing problems in cetacean evolution.
Abstract:
The tobacco N and Arabidopsis RPS2 genes, among several recently cloned disease-resistance genes, share a highly conserved structure, a nucleotide-binding site (NBS). Using degenerate oligonucleotide primers for the NBS region of N and RPS2, we have amplified and cloned the NBS sequences from soybean. Each of these PCR-derived NBS clones detected low- or moderate-copy soybean DNA sequences and belongs to 1 of 11 different classes. Sequence analysis showed that all PCR clones encode three motifs (P-loop, kinase-2, and kinase-3a) of NBS nearly identical to those in N and RPS2. The intervening region between P-loop and kinase-3a of the 11 classes has high (26% average) amino acid sequence similarity to the N gene although not as high (19% average) to RPS2. These 11 classes represent a superfamily of NBS-containing soybean genes that are homologous to N and RPS2. Each class or subfamily was assessed for its positional association with known soybean disease-resistance genes through near-isogenic line assays, followed by linkage analysis in F2 populations using restriction fragment length polymorphisms. Five of the 11 subfamilies have thus far been mapped to the vicinity of known soybean genes for resistance to potyviruses (Rsv1 and Rpv), Phytophthora root rot (Rps1, Rps2, and Rps3), and powdery mildew (rmd). The conserved N- or RPS2-homologous NBS sequences and their positional associations with mapped soybean-resistance genes suggest that a number of the soybean disease-resistance genes may belong to this superfamily. The candidate subfamilies of NBS-containing genes identified by genetic mapping should greatly facilitate the molecular cloning of disease-resistance genes.
Abstract:
We have investigated genetic differences between the closely related pathogenic Neisseria species, Neisseria meningitidis and Neisseria gonorrhoeae, as a novel approach to the elucidation of the genetic basis for their different pathogenicities. N. meningitidis is a major cause of cerebrospinal meningitis, whereas N. gonorrhoeae is the agent of gonorrhoea. The technique of representational difference analysis was adapted to the search for genes present in the meningococcus but absent from the gonococcus. The libraries achieved are comprehensive and specific in that they contain sequences corresponding to the presently identified meningococcus-specific genes (capsule, frp, rotamase, and opc) but lack genes more or less homologous between the two species, e.g., ppk and pilC1. Of 35 randomly chosen clones specific to N. meningitidis, DNA sequence analysis has confirmed that the large majority have no homology with published neisserial sequences. Mapping of the cloned DNA fragments onto the chromosome of N. meningitidis strain Z2491 has revealed a nonrandom distribution of meningococcus-specific sequences. Most of the genetic differences between the meningococcus and gonococcus appear to be clustered in three distinct regions, one of which (region 1) contains the capsule-related genes. Region 3 was found only in strains of serogroup A, whereas region 2 is present in a variety of meningococci belonging to different serogroups. At a time when bacterial genomes are being sequenced, we believe that this technique is a powerful tool for a rapid and directed analysis of the genetic basis of inter- or intraspecific phenotypic variations.
Abstract:
In this paper, a reverse-transcriptase PCR-based protocol suitable for efficient expression analysis of multigene families is presented. The method combines restriction fragment length polymorphism (RFLP) technology with a gene family-specific version of mRNA differential display and hence is called "RFLP-coupled domain-directed differential display." With this method, expression of all members of a multigene family at many different developmental stages, in diverse tissues and even in different organisms, can be displayed on one gel. Moreover, bands of interest, representing gene family members, are directly accessible to sequence analysis, without the need for subcloning. The method thus enables a detailed, high-resolution expression analysis of known gene family members as well as the identification and characterization of new ones. Here the technique was used to analyze differential expression of MADS-box genes in male and female inflorescences of maize (Zea mays ssp. mays). Six different MADS-box genes could be identified, being either specifically expressed in the female sex or preferentially expressed in male or female inflorescences, respectively. Other possible applications of the method are discussed.
Abstract:
The current RIKEN transcript set represents a significant proportion of the mouse transcriptome but transcripts expressed in the innate and acquired immune systems are poorly represented. In the present study we have assessed the complexity of the transcriptome expressed in mouse macrophages before and after treatment with lipopolysaccharide, a global regulator of macrophage gene expression, using existing RIKEN 19K arrays. By comparison to array profiles of other cells and tissues, we identify a large set of macrophage-enriched genes, many of which have obvious functions in endocytosis and phagocytosis. In addition, a significant number of LPS-inducible genes were identified. The data suggest that macrophages are a complex source of mRNA for transcriptome studies. To assess complexity and identify additional macrophage expressed genes, cDNA libraries were created from purified populations of macrophage and dendritic cells, a functionally related cell type. Sequence analysis revealed a high incidence of novel mRNAs within these cDNA libraries. These studies provide insights into the depths of transcriptional complexity still untapped amongst products of inducible genes, and identify macrophage and dendritic cell populations as a starting point for sampling the inducible mammalian transcriptome.
Abstract:
An anaerobic landfill leachate bioreactor was operated with crystalline cellulose and sterile landfill leachate until a steady state was reached. Cellulose hydrolysis, acidogenesis, and methanogenesis were measured. Microorganisms attached to the cellulose surfaces were hypothesized to be the cellulose hydrolyzers. 16S rRNA gene clone libraries were prepared from this attached fraction and also from the mixed fraction (biomass associated with cellulose particles and in the planktonic phase). Both clone libraries were dominated by Firmicutes phylum sequences (100% of the attached library and 90% of the mixed library), and the majority fell into one of five lineages of the clostridia. Clone group 1 (most closely related to Clostridium stercorarium), clone group 2 (most closely related to Clostridium thermocellum), and clone group 5 (most closely related to Bacteroides cellulosolvens) comprised sequences in Clostridium group III. Clone group 3 sequences were in Clostridium group XIVa (most closely related to Clostridium sp. strain XB90). Clone group 4 sequences were affiliated with a deeply branching clostridial lineage peripherally associated with Clostridium group VI. This monophyletic group comprises a new Clostridium cluster, designated cluster VIa. Specific fluorescence in situ hybridization (FISH) probes for the five groups were designed and synthesized, and it was demonstrated in FISH experiments that bacteria targeted by the probes for clone groups 1, 2, 4, and 5 were very abundant on the surfaces of the cellulose particles and likely the key cellulolytic microorganisms in the landfill bioreactor. The FISH probe for clone group 3 targeted cells in the planktonic phase, and these organisms were hypothesized to be glucose fermenters.
Abstract:
In this study, we propose a novel method to predict the solvent accessible surface areas of transmembrane residues. For both transmembrane alpha-helix and beta-barrel residues, the correlation coefficients between the predicted and observed accessible surface areas are around 0.65. On the basis of predicted accessible surface areas, residues exposed to the lipid environment or buried inside a protein can be identified by using certain cutoff thresholds. We have extensively examined our approach based on different definitions of accessible surface areas and a variety of sets of control parameters. Given that experimentally determining the structures of membrane proteins is very difficult and membrane proteins are actually abundant in nature, our approach is useful for theoretically modeling membrane protein tertiary structures, particularly for modeling the assembly of transmembrane domains. This approach can be used to annotate the membrane proteins in proteomes to provide extra structural and functional information.
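The two evaluation steps mentioned here, correlating predicted with observed accessible surface areas and labelling residues with a cutoff, can be sketched as follows. This is a minimal illustration, not the authors' predictor: the function names, the example values, and the cutoff passed to `classify_exposure` are hypothetical, since the abstract does not state the thresholds used.

```python
import math


def pearson(xs, ys):
    """Return the Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def classify_exposure(predicted_asa, cutoff):
    """Label each residue 'exposed' if its predicted ASA is >= cutoff, else 'buried'.

    The cutoff is an assumed parameter; the source does not give a value.
    """
    return ["exposed" if asa >= cutoff else "buried" for asa in predicted_asa]


if __name__ == "__main__":
    # Hypothetical predicted and observed surface areas, for illustration only.
    predicted = [12.0, 48.5, 30.2, 75.1, 5.4]
    observed = [20.3, 40.0, 35.8, 60.7, 10.1]
    print(f"correlation: {pearson(predicted, observed):.2f}")
    print(classify_exposure(predicted, cutoff=30.0))
```

Varying the cutoff trades off how many lipid-exposed residues are recovered against how many buried residues are mislabelled, which is presumably why the authors examined several threshold settings.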
Abstract:
Full-length genome sequences of five virulent and five avirulent strains of Newcastle disease virus (NDV) isolated between 1998 and 2002 in Victoria and New South Wales, Australia, were determined. Comparisons between these strains revealed that the coding sequences of the haemagglutinin-neuraminidase (HN), matrix (M) and phosphoprotein (P) genes appeared to be more variable than those of the fusion (F), nucleocapsid (N) and RNA-dependent RNA replicase (L) genes. Sequence analysis of a number of other isolates made during the recent virulent NDV outbreaks also identified a number of variants with altered F gene cleavage sites, which resulted in altered biological properties of those viruses. Quasispecies analysis of a number of field isolates indicated the presence of virulent virus in one particular isolate. Gene sequence analysis of the progenitor virus isolated in 1998 showed very little sequence variation when compared to that of a progenitor-like virus isolated in 2001, demonstrating that, in the field, viral genome sequence variation appears to be biologically restricted to that of a consensus sequence. (c) 2005 Elsevier B.V. All rights reserved.
Abstract:
Recently identified genes located downstream (3') of the msmEF (transport encoding) gene cluster, msmGH, and located 5' of the structural genes for methanesulfonate monooxygenase (MSAMO) are described from Methylosulfonomonas methylovora. Sequence analysis of the derived polypeptide sequences encoded by these genes revealed a high degree of identity to ABC-type transporters. MsmE showed similarity to a putative periplasmic substrate binding protein, MsmF resembled an integral membrane-associated protein, and MsmG was a putative ATP-binding enzyme. MsmH was thought to be the cognate permease component of the sulfonate transport system. The close association of these putative transport genes to the MSAMO structural genes msmABCD suggested a role for these genes in transport of methanesulfonic acid (MSA) into M. methylovora. msmEFGH and msmABCD constituted two operons for the coordinated expression of MSAMO and the MSA transporter systems. Reverse-transcription-PCR analysis of msmABCD and msmEFGH revealed differential expression of these genes during growth on MSA and methanol. The msmEFGH operon was constitutively expressed, whereas MSA induced expression of msmABCD. A mutant defective in msmE had considerably slower growth rates than the wild type, thus supporting the proposed role of MsmE in the transport of MSA into M. methylovora.