9 resultados para SEQUENCING DATA

em National Center for Biotechnology Information - NCBI


Relevância:

60.00% 60.00%

Publicador:

Resumo:

The structures of glycans N-linked to Arabidopsis proteins have been fully identified. From immuno- and affinodetections on blots, chromatography, nuclear magnetic resonance, and glycosidase sequencing data, we show that Arabidopsis proteins are N-glycosylated by high-mannose-type N-glycans from Man5GlcNAc2 to Man9GlcNAc2, and by xylose- and fucose (Fuc)-containing oligosaccharides. However, complex biantenary structures containing the terminal Lewis a epitope recently reported in the literature (A.-C. Fitchette-Lainé, V. Gomord, M. Cabanes, J.-C. Michalski, M. Saint Macary, B. Foucher, B. Cavalier, C. Hawes, P. Lerouge, and L. Faye [1997] Plant J 12: 1411–1417) were not detected. A similar study was done on the Arabidopsis mur1 mutant, which is affected in the biosynthesis of l-Fuc. In this mutant, one-third of the Fuc residues of the xyloglucan has been reported to be replaced by l-galactose (Gal) (E. Zablackis, W.S. York, M. Pauly, S. Hantus, W.D. Reiter, C.C.S. Chapple, P. Albersheim, and A. Darvill [1996] Science 272: 1808–1810). N-linked glycans from the mutant were identified and their structures were compared with those isolated from the wild-type plants. In about 95% of all N-linked glycans from the mur1 plant, l-Fuc residues were absent and were not replaced by another monosaccharide. However, in the remaining 5%, l-Fuc was found to be replaced by a hexose residue. From nuclear magnetic resonance and mass spectrometry data of the mur1 N-glycans, and by analogy with data reported on mur1 xyloglucan, this subpopulation of N-linked glycans was proposed to be l-Gal-containing N-glycans resulting from the replacement of l-Fuc by l-Gal.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Four new members of the fibroblast growth factor (FGF) family, referred to as fibroblast growth factor homologous factors (FHFs), have been identified by a combination of random cDNA sequencing, data base searches, and degenerate PCR. Pairwise comparisons between the four FHFs show between 58% and 71% amino acid sequence identity, but each FHF shows less than 30% identity when compared with other FGFs. Like FGF-1 (acidic FGF) and FGF-2 (basic FGF), the FHFs lack a classical signal sequence and contain clusters of basic residues that can act as nuclear localization signals. In transiently transfected 293 cells FHF-1 accumulates in the nucleus and is not secreted. Each FHF is expressed in the developing and adult nervous systems, suggesting a role for this branch of the FGF family in nervous system development and function.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The pufferfish Fugu rubripes has a genome ≈7.5 times smaller than that of mammals but with a similar number of genes. Although conserved synteny has been demonstrated between pufferfish and mammals across some regions of the genome, there is some controversy as to what extent Fugu will be a useful model for the human genome, e.g., [Gilley, J., Armes, N. & Fried, M. (1997) Nature (London) 385, 305–306]. We report extensive conservation of synteny between a 1.5-Mb region of human chromosome 11 and <100 kb of the Fugu genome in three overlapping cosmids. Our findings support the idea that the majority of DNA in the region of human chromosome 11p13 is intergenic. Comparative analysis of three unrelated genes with quite different roles, WT1, RCN1, and PAX6, has revealed differences in their structural evolution. Whereas the human WT1 gene can generate 16 protein isoforms via a combination of alternative splicing, RNA editing, and alternative start site usage, our data predict that Fugu WT1 is capable of generating only two isoforms. This raises the question of the extent to which the evolution of WT1 isoforms is related to the evolution of the mammalian genitourinary system. In addition, this region of the Fugu genome shows a much greater overall compaction than usual but with significant noncoding homology observed at the PAX6 locus, implying that comparative genomics has identified regulatory elements associated with this gene.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

An mAb was raised to the C5 phagosomal antigen in Paramecium multimicronucleatum. To determine its function, the cDNA and genomic DNA encoding C5 were cloned. This antigen consisted of 315 amino acid residues with a predicted molecular weight of 36,594, a value similar to that determined by SDS-PAGE. Sequence comparisons uncovered a low but significant homology with a Schizosaccharomyces pombe protein and the C-terminal half of the β-fructofuranosidase protein of Zymomonas mobilis. Lacking an obvious transmembrane domain or a possible signal sequence at the N terminus, C5 was predicted to be a soluble protein, whereas immunofluorescence data showed that it was present on the membranes of vesicles and digestive vacuoles (DVs). In cells that were minimally permeabilized but with intact DVs, C5 was found to be located on the cytosolic surface of the DV membranes. Immunoblotting of proteins from the purified and KCl-washed DVs showed that C5 was tightly bound to the DV membranes. Cryoelectron microscopy also confirmed that C5 was on the cytosolic surface of the discoidal vesicles, acidosomes, and lysosomes, organelles known to fuse with the membranes of the cytopharynx, the DVs of stages I (DV-I) and II (DV-II), respectively. Although C5 was concentrated more on the mature than on the young DV membranes, the striking observation was that the cytopharyngeal membrane that is derived from the discoidal vesicles was almost devoid of C5. Approximately 80% of the C5 was lost from the discoidal vesicle-derived membrane after this membrane fused with the cytopharyngeal membrane. Microinjection of the mAb to C5 greatly inhibited the fusion of the discoidal vesicles with the cytopharyngeal membrane and thus the incorporation of the discoidal vesicle membranes into the DV membranes. Taken together, these results suggest that C5 is a membrane protein that is involved in binding and/or fusion of the discoidal vesicles with the cytopharyngeal membrane that leads to DV formation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c (12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13–V20 that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈10-kDa size, such as products of limited proteolysis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The release of vast quantities of DNA sequence data by large-scale genome and expressed sequence tag (EST) projects underlines the necessity for the development of efficient and inexpensive ways to link sequence databases with temporal and spatial expression profiles. Here we demonstrate the power of linking cDNA sequence data (including EST sequences) with transcript profiles revealed by cDNA-AFLP, a highly reproducible differential display method based on restriction enzyme digests and selective amplification under high stringency conditions. We have developed a computer program (GenEST) that predicts the sizes of virtual transcript-derived fragments (TDFs) of in silico-digested cDNA sequences retrieved from databases. The vast majority of the resulting virtual TDFs could be traced back among the thousands of TDFs displayed on cDNA-AFLP gels. Sequencing of the corresponding bands excised from cDNA-AFLP gels revealed no inconsistencies. As a consequence, cDNA sequence databases can be screened very efficiently to identify genes with relevant expression profiles. The other way round, it is possible to switch from cDNA-AFLP gels to sequences in the databases. Using the restriction enzyme recognition sites, the primer extensions and the estimated TDF size as identifiers, the DNA sequence(s) corresponding to a TDF with an interesting expression pattern can be identified. In this paper we show examples in both directions by analyzing the plant parasitic nematode Globodera rostochiensis. Various novel pathogenicity factors were identified by combining ESTs from the infective stage juveniles with expression profiles of ∼4000 genes in five developmental stages produced by cDNA-AFLP.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Previously conducted sequence analysis of Arabidopsis thaliana (ecotype Columbia-0) reported an insertion of 270-kb mtDNA into the pericentric region on the short arm of chromosome 2. DNA fiber-based fluorescence in situ hybridization analyses reveal that the mtDNA insert is 618 ± 42 kb, ≈2.3 times greater than that determined by contig assembly and sequencing analysis. Portions of the mitochondrial genome previously believed to be absent were identified within the insert. Sections of the mtDNA are repeated throughout the insert. The cytological data illustrate that DNA contig assembly by using bacterial artificial chromosomes tends to produce a minimal clone path by skipping over duplicated regions, thereby resulting in sequencing errors. We demonstrate that fiber-fluorescence in situ hybridization is a powerful technique to analyze large repetitive regions in the higher eukaryotic genomes and is a valuable complement to ongoing large genome sequencing projects.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We report a general mass spectrometric approach for the rapid identification and characterization of proteins isolated by preparative two-dimensional polyacrylamide gel electrophoresis. This method possesses the inherent power to detect and structurally characterize covalent modifications. Absolute sensitivities of matrix-assisted laser desorption ionization and high-energy collision-induced dissociation tandem mass spectrometry are exploited to determine the mass and sequence of subpicomole sample quantities of tryptic peptides. These data permit mass matching and sequence homology searching of computerized peptide mass and protein sequence data bases for known proteins and design of oligonucleotide probes for cloning unknown proteins. We have identified 11 proteins in lysates of human A375 melanoma cells, including: alpha-enolase, cytokeratin, stathmin, protein disulfide isomerase, tropomyosin, Cu/Zn superoxide dismutase, nucleoside diphosphate kinase A, galaptin, and triosephosphate isomerase. We have characterized several posttranslational modifications and chemical modifications that may result from electrophoresis or subsequent sample processing steps. Detection of comigrating and covalently modified proteins illustrates the necessity of peptide sequencing and the advantages of tandem mass spectrometry to reliably and unambiguously establish the identity of each protein. This technology paves the way for studies of cell-type dependent gene expression and studies of large suites of cellular proteins with unprecedented speed and rigor to provide information complementary to the ongoing Human Genome Project.