972 resultados para Biological Sequence Analysis
Resumo:
While genome sequencing projects are advancing rapidly, EST sequencing and analysis remains a primary research tool for the identification and categorization of gene sequences in a wide variety of species and an important resource for annotation of genomic sequence. The TIGR Gene Indices (http://www.tigr.org/tdb/tgi.shtml) are a collection of species-specific databases that use a highly refined protocol to analyze EST sequences in an attempt to identify the genes represented by that data and to provide additional information regarding those genes. Gene Indices are constructed by first clustering, then assembling EST and annotated gene sequences from GenBank for the targeted species. This process produces a set of unique, high-fidelity virtual transcripts, or Tentative Consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to mapping and genomic sequence data, to provide links between orthologous and paralogous genes and as a resource for comparative sequence analysis.
Resumo:
Human synovial sarcoma has been shown to exclusively harbor the chromosomal translocation t(X;18) that produces the chimeric gene SYT-SSX. However, the role of SYT-SSX in cellular transformation remains unclear. In this study, we have established 3Y1 rat fibroblast cell lines that constitutively express SYT, SSX1, and SYT-SSX1 and found that SYT-SSX1 promoted growth rate in culture, anchorage-independent growth in soft agar, and tumor formation in nude mice. Deletion of the N-terminal 181 amino acids of SYT-SSX1 caused loss of its transforming activity. Furthermore, association of SYT-SSX1 with the chromatin remodeling factor hBRM/hSNF2α, which regulates transcription, was demonstrated in both SYT-SSX1-expressing 3Y1 cells and in the human synovial sarcoma cell line HS-SY-II. The binding region between the two molecules was shown to reside within the N-terminal 181 amino acids stretch (aa 1–181) of SYT-SSX1 and 50 amino acids (aa 156–205) of hBRM/hSNF2α and we found that the overexpression of this binding region of hBRM/hSNF2α significantly suppressed the anchorage-independent growth of SYT-SSX1-expressing 3Y1 cells. To analyze the transcriptional regulation by SYT-SSX1, we established conditional expression system of SYT-SSX1 and examined the gene expression profiles. The down-regulation of potential tumor suppressor DCC was observed among 1,176 genes analyzed by microarray analysis, and semi-quantitative reverse transcription–PCR confirmed this finding. These data clearly demonstrate transforming activity of human oncogene SYT-SSX1 and also involvement of chromatin remodeling factor hBRM/hSNF2α in human cancer.
Resumo:
We have determined the solution structure of the C-terminal quarter of human poly(A)-binding protein (hPABP). The protein fragment contains a protein domain, PABC [for poly(A)-binding protein C-terminal domain], which is also found associated with the HECT family of ubiquitin ligases. By using peptides derived from PABP interacting protein (Paip) 1, Paip2, and eRF3, we show that PABC functions as a peptide binding domain. We use chemical shift perturbation analysis to identify the peptide binding site in PABC and the major elements involved in peptide recognition. From comparative sequence analysis of PABC-binding peptides, we formulate a preliminary PABC consensus sequence and identify human ataxin-2, the protein responsible for type 2 spinocerebellar ataxia (SCA2), as a potential PABC ligand.
Resumo:
Previously conducted sequence analysis of Arabidopsis thaliana (ecotype Columbia-0) reported an insertion of 270-kb mtDNA into the pericentric region on the short arm of chromosome 2. DNA fiber-based fluorescence in situ hybridization analyses reveal that the mtDNA insert is 618 ± 42 kb, ≈2.3 times greater than that determined by contig assembly and sequencing analysis. Portions of the mitochondrial genome previously believed to be absent were identified within the insert. Sections of the mtDNA are repeated throughout the insert. The cytological data illustrate that DNA contig assembly by using bacterial artificial chromosomes tends to produce a minimal clone path by skipping over duplicated regions, thereby resulting in sequencing errors. We demonstrate that fiber-fluorescence in situ hybridization is a powerful technique to analyze large repetitive regions in the higher eukaryotic genomes and is a valuable complement to ongoing large genome sequencing projects.
Resumo:
Filamentous fungi are a large group of diverse and economically important microorganisms. Large-scale gene disruption strategies developed in budding yeast are not applicable to these organisms because of their larger genomes and lower rate of targeted integration (TI) during transformation. We developed transposon-arrayed gene knockouts (TAGKO) to discover genes and simultaneously create gene disruption cassettes for subsequent transformation and mutant analysis. Transposons carrying a bacterial and fungal drug resistance marker are used to mutagenize individual cosmids or entire libraries in vitro. Cosmids are annotated by DNA sequence analysis at the transposon insertion sites, and cosmid inserts are liberated to direct insertional mutagenesis events in the genome. Based on saturation analysis of a cosmid insert and insertions in a fungal cosmid library, we show that TAGKO can be used to rapidly identify and mutate genes. We further show that insertions can create alterations in gene expression, and we have used this approach to investigate an amino acid oxidation pathway in two important fungal phytopathogens.
Resumo:
Dystrobrevin is a component of the dystrophin-associated protein complex and has been shown to interact directly with dystrophin, α1-syntrophin, and the sarcoglycan complex. The precise role of α-dystrobrevin in skeletal muscle has not yet been determined. To study α-dystrobrevin's function in skeletal muscle, we used the yeast two-hybrid approach to look for interacting proteins. Three overlapping clones were identified that encoded an intermediate filament protein we subsequently named desmuslin (DMN). Sequence analysis revealed that DMN has a short N-terminal domain, a conserved rod domain, and a long C-terminal domain, all common features of type 6 intermediate filament proteins. A positive interaction between DMN and α-dystrobrevin was confirmed with an in vitro coimmunoprecipitation assay. By Northern blot analysis, we find that DMN is expressed mainly in heart and skeletal muscle, although there is some expression in brain. Western blotting detected a 160-kDa protein in heart and skeletal muscle. Immunofluorescent microscopy localizes DMN in a stripe-like pattern in longitudinal sections and in a mosaic pattern in cross sections of skeletal muscle. Electron microscopic analysis shows DMN colocalized with desmin at the Z-lines. Subsequent coimmunoprecipitation experiments confirmed an interaction with desmin. Our findings suggest that DMN may serve as a direct linkage between the extracellular matrix and the Z-discs (through plectin) and may play an important role in maintaining muscle cell integrity.
Resumo:
The transcriptional effects of deregulated myc gene overexpression are implicated in tumorigenesis in a spectrum of experimental and naturally occurring neoplasms. In follicles of the chicken bursa of Fabricius, myc induction of B-cell neoplasia requires a target cell population present during early bursal development and progresses through preneoplastic transformed follicles to metastatic lymphomas. We developed a chicken immune system cDNA microarray to analyze broad changes in gene expression that occur during normal embryonic B-cell development and during myc-induced neoplastic transformation in the bursa. The number of mRNAs showing at least 3-fold change was greater during myc-induced lymphomagenesis than during normal development, and hierarchical cluster analysis of expression patterns revealed that levels of several hundred mRNAs varied in concert with levels of myc overexpression. A set of 41 mRNAs were most consistently elevated in myc-overexpressing preneoplastic and neoplastic cells, most involved in processes thought to be subject to regulation by Myc. The mRNAs for another cluster of genes were overexpressed in neoplasia independent of myc expression level, including a small subset with the expression signature of embryonic bursal lymphocytes. Overexpression of myc, and some of the genes overexpressed with myc, may be important for generation of preneoplastic transformed follicles. However, expression profiles of late metastatic tumors showed a large variation in concert with myc expression levels, and some showed minimal myc overexpression. Therefore, high-level myc overexpression may be more important in the early induction of these lymphomas than in maintenance of late-stage metastases.
Resumo:
For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that insert between genes. These retroelements are less abundant in smaller genome plants, including rice and sorghum. Although 5- to 200-kb blocks of methylated, presumably heterochromatic, retrotransposons flank most maize genes, rice and sorghum genes are often adjacent. Similar genes are commonly found in the same relative chromosomal locations and orientations in each of these three species, although there are numerous exceptions to this collinearity (i.e., rearrangements) that can be detected at the levels of both the recombinational map and cloned DNA. Evolutionarily conserved sequences are largely confined to genes and their regulatory elements. Our results indicate that a knowledge of grass genome structure will be a useful tool for gene discovery and isolation, but the general rules and biological significance of grass genome organization remain to be determined. Moreover, the nature and frequency of exceptions to the general patterns of grass genome structure and collinearity are still largely unknown and will require extensive further investigation.
Resumo:
Snake-venom α-bungarotoxin is a member of the α-neurotoxin family that binds with very high affinity to the nicotinic acetylcholine receptor (AChR) at the neuromuscular junction. The structure of the complex between α-bungarotoxin and a 13-mer peptide (WRYYESSLEPYPD) that binds the toxin with high affinity, thus inhibiting its interactions with AChR with an IC50 of 2 nM, has been solved by 1H-NMR spectroscopy. The bound peptide folds into a β-hairpin structure created by two antiparallel β-strands, which combine with the already existing triple-stranded β-sheet of the toxin to form a five-stranded intermolecular, antiparallel β-sheet. Peptide residues Y3P, E5P, and L8P have the highest intermolecular contact area, indicating their importance in the binding of α-bungarotoxin; W1P, R2P, and Y4P also contribute significantly to the binding. A large number of characteristic hydrogen bonds and electrostatic and hydrophobic interactions are observed in the complex. The high-affinity peptide exhibits inhibitory potency that is better than any known peptide derived from AChR, and is equal to that of the whole α-subunit of AChR. The high degree of sequence similarity between the peptide and various types of AChRs implies that the binding mode found within the complex might possibly mimic the receptor binding to the toxin. The design of the high-affinity peptide was based on our previous findings: (i) the detection of a lead peptide (MRYYESSLKSYPD) that binds α-bungarotoxin, using a phage-display peptide library, (ii) the information about the three-dimensional structure of α-bungarotoxin/lead-peptide complex, and (iii) the amino acid sequence analysis of different AChRs.
Resumo:
Support for molecular biology researchers has been limited to traditional library resources and services in most academic health sciences libraries. The University of Washington Health Sciences Libraries have been providing specialized services to this user community since 1995. The library recruited a Ph.D. biologist to assess the molecular biological information needs of researchers and design strategies to enhance library resources and services. A survey of laboratory research groups identified areas of greatest need and led to the development of a three-pronged program: consultation, education, and resource development. Outcomes of this program include bioinformatics consultation services, library-based and graduate level courses, networking of sequence analysis tools, and a biological research Web site. Bioinformatics clients are drawn from diverse departments and include clinical researchers in need of tools that are not readily available outside of basic sciences laboratories. Evaluation and usage statistics indicate that researchers, regardless of departmental affiliation or position, require support to access molecular biology and genetics resources. Centralizing such services in the library is a natural synergy of interests and enhances the provision of traditional library resources. Successful implementation of a library-based bioinformatics program requires both subject-specific and library and information technology expertise.
Resumo:
We have investigated genetic differences between the closely related pathogenic Neisseria species, Neisseria meningitidis and Neisseria gonorrhoeae, as a novel approach to the elucidation of the genetic basis for their different pathogenicities. N. meningitidis is a major cause of cerebrospinal meningitis, whereas N. gonorrhoeae is the agent of gonorrhoea. The technique of representational difference analysis was adapted to the search for genes present in the meningococcus but absent from the gonococcus. The libraries achieved are comprehensive and specific in that they contain sequences corresponding to the presently identified meningococcus-specific genes (capsule, frp, rotamase, and opc) but lack genes more or less homologous between the two species, e.g., ppk and pilC1. Of 35 randomly chosen clones specific to N. meningitidis, DNA sequence analysis has confirmed that the large majority have no homology with published neisserial sequences. Mapping of the cloned DNA fragments onto the chromosome of N. meningitidis strain Z2491 has revealed a nonrandom distribution of meningococcus-specific sequences. Most of the genetic differences between the meningococcus and gonococcus appear to be clustered in three distinct regions, one of which (region 1) contains the capsule-related genes. Region 3 was found only in strains of serogroup A, whereas region 2 is present in a variety of meningococci belonging to different serogroups. At a time when bacterial genomes are being sequenced, we believe that this technique is a powerful tool for a rapid and directed analysis of the genetic basis of inter- or intraspecific phenotypic variations.
Resumo:
The retinoblastoma protein (RB) has been proposed to function as a negative regulator of cell proliferation by complexing with cellular proteins such as the transcription factor E2F. To study the biological consequences of the RB/E2F-1 interaction, point mutants of E2F-1 which fail to bind to RB were isolated by using the yeast two-hybrid system. Sequence analysis revealed that within the minimal 18-amino acid peptide of E2F-1 required for RB binding, five residues, Tyr (position 411), Glu (419), and Asp-Leu-Phe (423-425), are critical. These amino acids are conserved among the known E2F family members. While mutation of any of these five amino acids abolished binding to RB, all mutants retained their full transactivation potential. Expression of mutated E2F-1, when compared with that of wild-type, significantly accelerated entry into S phase and subsequent apoptosis. These results provide direct genetic evidence for the biological significance of the RB/E2F interaction and strongly suggest that the interplay between RB and E2F is critical for proper cell cycle progression.
Resumo:
Bombesin is a tetradecapeptide originally isolated from frog skin and demonstrated to have a wide range of actions in mammals. Based on structural homology and similar biological activities, gastrin-releasing peptide (GRP) has been considered the mammalian equivalent of bombesin. We previously reported that frogs have both GRP and bombesin, which therefore are distinct peptides. We now report the cloning of a bombesin receptor subtype (BB4) that has higher affinity for bombesin than GRP. PCR was used to amplify cDNAs related to the known bombesin receptors from frog brain. Sequence analysis of the amplified cDNAs revealed 3 classes of receptor subtypes. Based on amino acid homology, two classes were clearly the amphibian homologs of the GRP and neuromedin B receptors. The third class was unusual and a full-length clone was isolated from a Bombina orientalis brain cDNA library. Expression of the receptor in Xenopus oocytes demonstrated that the receptor responded to picomolar concentrations of [Phe13]-bombesin, the form of bombesin most prevalent in frog brain. The relative rank potency of bombesin-like peptides for this receptor was [Phe13]bombesin > [Leu13]bombesin > GRP > neuromedin B. In contrast, the rank potency for the GRP receptor is GRP > [Leu13]bombesin > [Phe13]bombesin > neuromedin B. Transient expression in CHOP cells gave a Ki for [Phe13]bombesin of 0.2 nM versus a Ki of 2.1 nM for GRP. Distribution analysis showed that this receptor was expressed only in brain, consistent with the distribution of [Phe13]-bombesin. Thus, based on distribution and affinity, this bombesin receptor is the receptor for [Phe13]bombesin. Phylogenetic analysis suggests that this receptor separated prior to separation of the GRP and neuromedin B receptors; thus, BB4 receptors and their cognate ligands may also exist in mammals.
Resumo:
In this paper, a reverse-transcriptase PCR-based protocol suitable for efficient expression analysis of multigene families is presented. The method combines restriction fragment length polymorphism (RFLP) technology with a gene family-specific version of mRNA differential display and hence is called "RFLP-coupled domain-directed differential display. "With this method, expression of all members of a multigene family at many different developmental stages, in diverse tissues and even in different organisms, can be displayed on one gel. Moreover, bands of interest, representing gene family members, are directly accessible to sequence analysis, without the need for subcloning. The method thus enables a detailed, high-resolution expression analysis of known gene family members as well as the identification and characterization of new ones. Here the technique was used to analyze differential expression of MADS-box genes in male and female inflorescences of maize (Zea mays ssp. mays). Six different MADS-box genes could be identified, being either specifically expressed in the female sex or preferentially expressed in male or female inflorescences, respectively. Other possible applications of the method are discussed.
Resumo:
Flow cytometry, in combination with advances in bead coding technologies, is maturing as a powerful high-throughput approach for analyzing molecular interactions. Applications of this technology include antibody assays and single nucleotide polymorphism mapping. This review describes the recent development of a microbead flow cytometric approach to analyze RNA-protein interactions and discusses emerging bead coding strategies that together will allow genome-wide identification of RNA-protein complexes. The microbead flow cytometric approach is flexible and provides new opportunities for functional genomic studies and small-molecule screening.