62 resultados para expressed sequences tag

em National Center for Biotechnology Information - NCBI


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Expressed sequence tags (ESTs) are randomly sequenced cDNA clones. Currently, nearly 3 million human and 2 million mouse ESTs provide valuable resources that enable researchers to investigate the products of gene expression. The EST databases have proven to be useful tools for detecting homologous genes, for exon mapping, revealing differential splicing, etc. With the increasing availability of large amounts of poorly characterised eukaryotic (notably human) genomic sequence, ESTs have now become a vital tool for gene identification, sometimes yielding the only unambiguous evidence for the existence of a gene expression product. However, BLAST-based Web servers available to the general user have not kept pace with these developments and do not provide appropriate tools for querying EST databases with large highly spliced genes, often spanning 50 000–100 000 bases or more. Here we describe Gene2EST (http://woody.embl-heidelberg.de/gene2est/), a server that brings together a set of tools enabling efficient retrieval of ESTs matching large DNA queries and their subsequent analysis. RepeatMasker is used to mask dispersed repetitive sequences (such as Alu elements) in the query, BLAST2 for searching EST databases and Artemis for graphical display of the findings. Gene2EST combines these components into a Web resource targeted at the researcher who wishes to study one or a few genes to a high level of detail.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Nerve growth factor-induced differentiation of adrenal chromaffin PC-12 cells to a neuronal phenotype involves alterations in gene expression and represents a model system to study neuronal differentiation. We have used the expressed-sequence-tag approach to identify approximately 600 differentially expressed mRNAs in untreated and nerve growth factor-treated PC-12 cells that encode proteins with diverse structural and biochemical functions. Many of these mRNAs encode proteins belonging to cellular pathways not previously known to be regulated by nerve growth factor. Comparative expressed-sequence-tag analysis provides a basis for surveying global changes in gene-expression patterns in response to biological signals at an unprecedented scale, is a powerful tool for identifying potential interactions between different cellular pathways, and allows the gene-expression profiles of individual genes belonging to a particular pathway to be followed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits.isb-sib.ch).

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The CCAAT motif is found in the promoters of many eukaryotic genes. In yeast a single complex of three proteins, termed HAP2, HAP3, and HAP5, binds to this sequence, and in mammals the three components of the equivalent complex (called variously NF-Y, CBF, or CP1) are also represented by single genes. Here we report the presence of multiple genes for each of the components of the CCAAT-binding complex, HAP2,3,5, from Arabidopsis. Three independent Arabidopsis HAP subunit 2 (AtHAP2) cDNAs were cloned by functional complementation of a yeast hap2 mutant, and two independent forms each of AtHAP3 and AtHAP5 cDNAs were detected in the expressed sequence tag database. Additional homologs (two of AtHAP3 and one of AtHAP5) have been identified from available Arabidopsis genomic sequences. Northern-blot analysis indicated ubiquitous expression for each AtHAP2 and AtHAP5 cDNA in a range of tissues, whereas expression of each AtHAP3 cDNA was under developmental and/or environmental regulation. The unexpected presence of multiple forms of each HAP homolog in Arabidopsis, compared with the single genes in yeast and vertebrates, suggests that the HAP2,3,5 complex may play diverse roles in gene transcription in higher plants.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Cathepsin B (CTSB) is overexpressed in tumors of the lung, prostate, colon, breast, and stomach. However, evidence of primary genomic alterations in the CTSB gene during tumor initiation or progression has been lacking. We have found a novel amplicon at 8p22–23 that results in CTSB overexpression in esophageal adenocarcinoma. Amplified genomic NotI–HinfI fragments were identified by two-dimensional DNA electrophoresis. Two amplified fragments (D4 and D5) were cloned and yielded unique sequences. Using bacterial artificial chromosome clones containing either D4 or D5, fluorescent in situ hybridization defined a single region of amplification involving chromosome bands 8p22–23. We investigated the candidate cancer-related gene CTSB, and potential coamplified genes from this region including farnesyl-diphosphate farnesyltransferase (FDFT1), arylamine N-acetyltransferase (NAT-1), lipoprotein lipase (LPL), and an uncharacterized expressed sequence tag (D8S503). Southern blot analysis of 66 esophageal adenocarcinomas demonstrated only CTSB and FDFT1 were consistently amplified in eight (12.1%) of the tumors. Neither NAT-1 nor LPL were amplified. Northern blot analysis showed overexpression of CTSB and FDFT1 mRNA in all six of the amplified esophageal adenocarcinomas analyzed. CTSB mRNA overexpression also was present in two of six nonamplified tumors analyzed. However, FDFT1 mRNA overexpression without amplification was not observed. Western blot analysis confirmed CTSB protein overexpression in tumor specimens with CTSB mRNA overexpression compared with either normal controls or tumors without mRNA overexpression. Abundant extracellular expression of CTSB protein was found in 29 of 40 (72.5%) of esophageal adenocarcinoma specimens by using immunohistochemical analysis. The finding of an amplicon at 8p22–23 resulting in CTSB gene amplification and overexpression supports an important role for CTSB in esophageal adenocarcinoma and possibly in other tumors.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The identification of cDNA clones from genomic regions known to contain human genes is usually the rate-limiting factor in positional cloning strategies. We demonstrate here that human genes present on yeast artificial chromosomes (YACs) are transcribed in yeast host cells. We have used the arbitrarily primed RNA (RAP) fingerprinting method to identify human-specific, transcribed sequences from YACs located in the 13q12 chromosome region. By comparing the RAP fingerprints generated using defined, arbitrary primers from various fragmented YACs, megaYACs, and host yeast, we were able to identify and map 20 products transcribed from the human YAC inserts. This method, therefore, permits the simultaneous isolation and mapping of novel expressed sequences directly from whole YACs.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The Drosophila retinal degeneration C (rdgC) gene encodes an unusual protein serine/threonine phosphatase in that it contains at least two EF-hand motifs at its carboxy terminus. By a combination of large-scale sequencing of human retina cDNA clones and searches of expressed sequence tag and genomic DNA databases, we have identified two sequences in mammals [Protein Phosphatase with EF-hands-1 and 2 (PPEF-1 and PPEF-2)] and one in Caenorhabditis elegans (PPEF) that closely resemble rdgC. In the adult, PPEF-2 is expressed specifically in retinal rod photoreceptors and the pineal. In the retina, several isoforms of PPEF-2 are predicted to arise from differential splicing. The isoform that most closely resembles rdgC is localized to rod inner segments. Together with the recently described localization of PPEF-1 transcripts to primary somatosensory neurons and inner ear cells in the developing mouse, these data suggest that the PPEF family of protein serine/threonine phosphatases plays a specific and conserved role in diverse sensory neurons.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Understanding the effects of the external environment on bacterial gene expression can provide valuable insights into an array of cellular mechanisms including pathogenesis, drug resistance, and, in the case of Mycobacterium tuberculosis, latency. Because of the absence of poly(A)+ mRNA in prokaryotic organisms, studies of differential gene expression currently must be performed either with large amounts of total RNA or rely on amplification techniques that can alter the proportional representation of individual mRNA sequences. We have developed an approach to study differences in bacterial mRNA expression that enables amplification by the PCR of a complex mixture of cDNA sequences in a reproducible manner that obviates the confounding effects of selected highly expressed sequences, e.g., ribosomal RNA. Differential expression using customized amplification libraries (DECAL) uses a library of amplifiable genomic sequences to convert total cellular RNA into an amplified probe for gene expression screens. DECAL can detect 4-fold differences in the mRNA levels of rare sequences and can be performed on as little as 10 ng of total RNA. DECAL was used to investigate the in vitro effect of the antibiotic isoniazid on M. tuberculosis, and three previously uncharacterized isoniazid-induced genes, iniA, iniB, and iniC, were identified. The iniB gene has homology to cell wall proteins, and iniA contains a phosphopantetheine attachment site motif suggestive of an acyl carrier protein. The iniA gene is also induced by the antibiotic ethambutol, an agent that inhibits cell wall biosynthesis by a mechanism that is distinct from isoniazid. The DECAL method offers a powerful new tool for the study of differential gene expression.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

G-substrate, an endogenous substrate for cGMP-dependent protein kinase, exists almost exclusively in cerebellar Purkinje cells, where it is possibly involved in the induction of long-term depression. A G-substrate cDNA was identified by screening expressed sequence tag databases from a human brain library. The deduced amino acid sequence of human G-substrate contained two putative phosphorylation sites (Thr-68 and Thr-119) with amino acid sequences [KPRRKDT(p)PALH] that were identical to those reported for rabbit G-substrate. G-substrate mRNA was expressed almost exclusively in the cerebellum as a single transcript. The human G-substrate gene was mapped to human chromosome 7p15 by radiation hybrid panel analysis. In vitro translation products of the cDNA showed an apparent molecular mass of 24 kDa on SDS/PAGE which was close to that of purified rabbit G-substrate (23 kDa). Bacterially expressed human G-substrate is a heat-stable and acid-soluble protein that cross-reacts with antibodies raised against rabbit G-substrate. Recombinant human G-substrate was phosphorylated efficiently by cGMP-dependent protein kinase exclusively at Thr residues, and it was recognized by antibodies specific for rabbit phospho-G-substrate. The amino acid sequences surrounding the sites of phosphorylation in G-substrate are related to those around Thr-34 and Thr-35 of the dopamine- and cAMP-regulated phosphoprotein DARPP-32 and inhibitor-1, respectively, two potent inhibitors of protein phosphatase 1. However, purified G-substrate phosphorylated by cGMP-dependent protein kinase inhibited protein phosphatase 2A more effectively than protein phosphatase 1, suggesting a distinct role as a protein phosphatase inhibitor.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Paired Ig-like receptors (PIR) that can reciprocally modulate cellular activation have been described in mammals. In the present study, we searched expressed sequence tag databases for PIR relatives to identify chicken expressed sequence tags predictive of ≈25% amino acid identity to mouse PIR. Rapid amplification of cDNA ends (RACE)-PCR extension of expressed sequence-tag sequences using chicken splenic cDNA as a template yielded two distinct cDNAs, the sequence analysis of which predicted protein products with related extracellular Ig-like domains. Chicken Ig-like receptor (CHIR)-A was characterized by its transmembrane segment with a positively charged histidine residue and short cytoplasmic tail, thereby identifying CHIR-A as a candidate-activating receptor. Conversely, CHIR-B was characterized by its nonpolar transmembrane segment and cytoplasmic tail with two immunoreceptor tyrosine-based inhibitory motifs, indicating that it may serve as an inhibitory receptor. The use of CHIR amino acid sequences in a search for other PIR relatives led to the recognition of mammalian Fc receptors as distantly related genes. Comparative analyses based on amino acid sequences and three-dimensional protein structures provided molecular evidence for common ancestry of the PIR and Fc receptor gene families.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Natural killer (NK) cells express C-type lectin-like receptors, encoded in the NK gene complex, that interact with major histocompatibility complex class I and either inhibit or activate functional activity. Human NK cells express heterodimers consisting of CD94 and NKG2 family molecules, whereas murine NK cells express homodimers belonging to the Ly-49 family. The corresponding orthologues for other species, however, have not been described. In this report, we used probes derived from the expressed sequence tag database to clone C57BL/6-derived cDNAs homologous to human NKG2-D and CD94. Among normal tissues, murine NKG2-D and CD94 transcripts are highly expressed only in activated NK cells, including both Ly-49A+ and Ly-49A− subpopulations. Additionally, mNKG2-D is expressed in murine NK cell clones KY-1 and KY-2, whereas mCD94 expression is observed only in KY-1 cells but not KY-2. Last, we have finely mapped the physical location of the Cd94 (centromeric) and Nkg2d (telomeric) genes between Cd69 and the Ly49 cluster in the NK complex. Thus, these data indicate the expanding complexity of the NK complex and the corresponding repertoire of C-type lectin-like receptors on murine NK cells.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We performed a genome-wide analysis of gene expression in primary human CD15+ myeloid progenitor cells. By using the serial analysis of gene expression (SAGE) technique, we obtained quantitative information for the expression of 37,519 unique SAGE-tag sequences. Of these unique tags, (i) 25% were detected at high and intermediate levels, whereas 75% were present as single copies, (ii) 53% of the tags matched known expressed sequences, 34% of which were matched to more than one known expressed sequence, and (iii) 47% of the tags had no matches and represent potentially novel genes. The correct genes were confirmed by application of the generation of longer cDNA fragments from SAGE tags for gene identification (GLGI) technique for high-copy tags with multiple matches. A set of genes known to be important in myeloid differentiation were expressed at various levels and used different spliced forms. This study provides a normal baseline for comparison of gene expression in myeloid diseases. The strategy of using SAGE and GLGI techniques in this study has broad applications to the genome-wide identification of expressed genes.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The release of vast quantities of DNA sequence data by large-scale genome and expressed sequence tag (EST) projects underlines the necessity for the development of efficient and inexpensive ways to link sequence databases with temporal and spatial expression profiles. Here we demonstrate the power of linking cDNA sequence data (including EST sequences) with transcript profiles revealed by cDNA-AFLP, a highly reproducible differential display method based on restriction enzyme digests and selective amplification under high stringency conditions. We have developed a computer program (GenEST) that predicts the sizes of virtual transcript-derived fragments (TDFs) of in silico-digested cDNA sequences retrieved from databases. The vast majority of the resulting virtual TDFs could be traced back among the thousands of TDFs displayed on cDNA-AFLP gels. Sequencing of the corresponding bands excised from cDNA-AFLP gels revealed no inconsistencies. As a consequence, cDNA sequence databases can be screened very efficiently to identify genes with relevant expression profiles. The other way round, it is possible to switch from cDNA-AFLP gels to sequences in the databases. Using the restriction enzyme recognition sites, the primer extensions and the estimated TDF size as identifiers, the DNA sequence(s) corresponding to a TDF with an interesting expression pattern can be identified. In this paper we show examples in both directions by analyzing the plant parasitic nematode Globodera rostochiensis. Various novel pathogenicity factors were identified by combining ESTs from the infective stage juveniles with expression profiles of ∼4000 genes in five developmental stages produced by cDNA-AFLP.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Poly(ADP)-ribose polymerase (PADPRP) has been purified to apparent homogeneity from suspension cultures of the maize (Zea mays) callus line. The purified enzyme is a single polypeptide of approximately 115 kD, which appears to dimerize through an S-S linkage. The catalytic properties of the maize enzyme are very similar to those of its animal counterpart. The amino acid sequences of three tryptic peptides were obtained by microsequencing. Antibodies raised against peptides from maize PADPRP cross-reacted specifically with the maize enzyme but not with the enzyme from human cells, and vice versa. We have also characterized a 3.45-kb expressed-sequence-tag clone that contains a full-length cDNA for maize PADPRP. An open reading frame of 2943 bp within this clone encodes a protein of 980 amino acids. The deduced amino acid sequence of the maize PADPRP shows 40% to 42% identity and about 50% similarity to the known vertebrate PADPRP sequences. All important features of the modular structure of the PADPRP molecule, such as two zinc fingers, a putative nuclear localization signal, the automodification domain, and the NAD+-binding domain, are conserved in the maize enzyme. Northern-blot analysis indicated that the cDNA probe hybridizes to a message of about 4 kb.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Phospholipase A2 (PLA2) was purified about 180,000 times compared with the starting soluble-protein extract from developing elm (Ulmus glabra) seeds. On sodium dodecyl sulfate-polyacrylamide gel electrophoresis the purified fraction showed a single protein band with a mobility that corresponded to 15 kD, from which activity could be recovered. When analyzed by matrix-assisted laser-desorption ionization-time-of-flight mass spectrometry, the enzyme had a deduced mass of 13,900 D. A 53-amino acid-long N-terminal sequence was determined and aligned with other sequences, giving 62% identity to the deduced amino acid sequence of some rice (Oryza sativa) expressed sequence tag clones. The purified enzyme had an alkaline pH optimum and required Ca2+ for activity. It was unusually stable with regard to heat, acidity, and organic solvents but was sensitive to disulfide bond-reducing agents. The enzyme is a true PLA2, neither hydrolyzing the sn-1 position of phosphatidylcholine nor having any activity toward lysophosphatidylcholine or diacylglycerol. The biochemical data and amino acid sequence alignments indicate that the enzyme is related to the well-characterized family of animal secretory PLA2s and, to our knowledge, is the first plant enzyme of this type to be described.