974 resultados para Expressed sequence tag analysis


Relevância:

100.00% 100.00%

Publicador:

Resumo:

We report a high-quality draft sequence of the genome of the horse (Equus caballus). The genome is relatively repetitive but has little segmental duplication. Chromosomes appear to have undergone few historical rearrangements: 53% of equine chromosomes show conserved synteny to a single human chromosome. Equine chromosome 11 is shown to have an evolutionary new centromere devoid of centromeric satellite DNA, suggesting that centromeric function may arise before satellite repeat accumulation. Linkage disequilibrium, showing the influences of early domestication of large herds of female horses, is intermediate in length between dog and human, and there is long-range haplotype sharing among breeds.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background Simple Sequence Repeats (SSRs) are widely used in population genetic studies but their classical development is costly and time-consuming. The ever-increasing available DNA datasets generated by high-throughput techniques offer an inexpensive alternative for SSRs discovery. Expressed Sequence Tags (ESTs) have been widely used as SSR source for plants of economic relevance but their application to non-model species is still modest. Methods Here, we explored the use of publicly available ESTs (GenBank at the National Center for Biotechnology Information-NCBI) for SSRs development in non-model plants, focusing on genera listed by the International Union for the Conservation of Nature (IUCN). We also search two model genera with fully annotated genomes for EST-SSRs, Arabidopsis and Oryza, and used them as controls for genome distribution analyses. Overall, we downloaded 16 031 555 sequences for 258 plant genera which were mined for SSRsand their primers with the help of QDD1. Genome distribution analyses in Oryza and Arabidopsis were done by blasting the sequences with SSR against the Oryza sativa and Arabidopsis thaliana reference genomes implemented in the Basal Local Alignment Tool (BLAST) of the NCBI website. Finally, we performed an empirical test to determine the performance of our EST-SSRs in a few individuals from four species of two eudicot genera, Trifolium and Centaurea. Results We explored a total of 14 498 726 EST sequences from the dbEST database (NCBI) in 257 plant genera from the IUCN Red List. We identify a very large number (17 102) of ready-to-test EST-SSRs in most plant genera (193) at no cost. Overall, dinucleotide and trinucleotide repeats were the prevalent types but the abundance of the various types of repeat differed between taxonomic groups. Control genomes revealed that trinucleotide repeats were mostly located in coding regions while dinucleotide repeats were largely associated with untranslated regions. Our results from the empirical test revealed considerable amplification success and transferability between congenerics. Conclusions The present work represents the first large-scale study developing SSRs by utilizing publicly accessible EST databases in threatened plants. Here we provide a very large number of ready-to-test EST-SSR (17 102) for 193 genera. The cross-species transferability suggests that the number of possible target species would be large. Since trinucleotide repeats are abundant and mainly linked to exons they might be useful in evolutionary and conservation studies. Altogether, our study highly supports the use of EST databases as an extremely affordable and fast alternative for SSR developing in threatened plants.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Although more than 100 genes associated with inherited retinal disease have been mapped to chromosomal locations, less than half of these genes have been cloned. This text includes identification and evaluation of candidate genes for three autosomal dominant forms of inherited retinal degeneration: atypical vitelliform macular dystrophy (VMD1), cone-rod dystrophy (CORD), and retinitis pigmentosa (RP). ^ VMD1 is a disorder characterized by complete penetrance but extremely variable expressivity, and includes macular or peripheral retinal lesions and peripappilary abnormalitites. In 1984, linkage was reported between VMD1 and soluble glutamate-pyruvate transaminase GPT); however, placement of GPT to 8q24 on linkage maps had been debated, and VMD1 did not show linkage to microsatellite markers in that region. This study excluded linkage between the loci by cloning GPT, identifying the nucleotide substitution associated with the GPT sozymes, and by assaying VMD1 family samples with an RFLP designed to detect the substitution. In addition, linkage of VMD1 to the known dominant macular degeneration loci was excluded. ^ CORD is characterized by early onset of color-vision deficiency, and decreased visual acuity, However, this retinal degeneration progresses to no light perception, severe macular lesion, and “bone-spicule” accumulations in the peripheral retina. In this study, the disorder in a large Texan family was mapped to the CORD2 locus of 19q13, and a mutation in the retina/pineal-specific cone-rod homeobox gene (CRX) was identified as the disease cause. In addition, mutations in CRX were associated with significantly different retinal disease phenotypes, including retinitis pigmentosa and Leber congenital amaurosis. ^ Many of the mutations leading to inherited retinal disorders have been identified in genes like CRX, which are expressed predominantly in the retina and pineal gland. Therefore, a combination of database analysis and laboratory investigation was used to identify 26 novel retina/pineal-specific expressed sequence tag (EST) clusters as candidate genes for inherited retinal disorders. Eight of these genes were mapped into the candidate regions of inherited retinal degeneration loci. ^ Two of the eight clusters mapped into the retinitis pigmentosa RP13 candidate region of 17p13, and were both determined to represent a single gene that is highly expressed in photoreceptors. This gene, the Ah receptor-interacting like protein-1 (AIPL1), was cloned, characterized, and screened for mutations in RP13 patient DNA samples. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cathepsin B (CTSB) is overexpressed in tumors of the lung, prostate, colon, breast, and stomach. However, evidence of primary genomic alterations in the CTSB gene during tumor initiation or progression has been lacking. We have found a novel amplicon at 8p22–23 that results in CTSB overexpression in esophageal adenocarcinoma. Amplified genomic NotI–HinfI fragments were identified by two-dimensional DNA electrophoresis. Two amplified fragments (D4 and D5) were cloned and yielded unique sequences. Using bacterial artificial chromosome clones containing either D4 or D5, fluorescent in situ hybridization defined a single region of amplification involving chromosome bands 8p22–23. We investigated the candidate cancer-related gene CTSB, and potential coamplified genes from this region including farnesyl-diphosphate farnesyltransferase (FDFT1), arylamine N-acetyltransferase (NAT-1), lipoprotein lipase (LPL), and an uncharacterized expressed sequence tag (D8S503). Southern blot analysis of 66 esophageal adenocarcinomas demonstrated only CTSB and FDFT1 were consistently amplified in eight (12.1%) of the tumors. Neither NAT-1 nor LPL were amplified. Northern blot analysis showed overexpression of CTSB and FDFT1 mRNA in all six of the amplified esophageal adenocarcinomas analyzed. CTSB mRNA overexpression also was present in two of six nonamplified tumors analyzed. However, FDFT1 mRNA overexpression without amplification was not observed. Western blot analysis confirmed CTSB protein overexpression in tumor specimens with CTSB mRNA overexpression compared with either normal controls or tumors without mRNA overexpression. Abundant extracellular expression of CTSB protein was found in 29 of 40 (72.5%) of esophageal adenocarcinoma specimens by using immunohistochemical analysis. The finding of an amplicon at 8p22–23 resulting in CTSB gene amplification and overexpression supports an important role for CTSB in esophageal adenocarcinoma and possibly in other tumors.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Adaptor protein complexes (APs) function as vesicle coat components in different membrane traffic pathways; however, there are a number of pathways for which there is still no candidate coat. To find novel coat components related to AP complexes, we have searched the expressed sequence tag database and have identified, cloned, and sequenced a new member of each of the four AP subunit families. We have shown by a combination of coimmunoprecipitation and yeast two-hybrid analysis that these four proteins (ε, β4, μ4, and ς4) are components of a novel adaptor-like heterotetrameric complex, which we are calling AP-4. Immunofluorescence reveals that AP-4 is localized to ∼10–20 discrete dots in the perinuclear region of the cell. This pattern is disrupted by treating the cells with brefeldin A, indicating that, like other coat proteins, the association of AP-4 with membranes is regulated by the small GTPase ARF. Immunogold electron microscopy indicates that AP-4 is associated with nonclathrin-coated vesicles in the region of the trans-Golgi network. The μ4 subunit of the complex specifically interacts with a tyrosine-based sorting signal, indicating that, like the other three AP complexes, AP-4 is involved in the recognition and sorting of cargo proteins with tyrosine-based motifs. AP-4 is of relatively low abundance, but it is expressed ubiquitously, suggesting that it participates in a specialized trafficking pathway but one that is required in all cell types.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

G-substrate, an endogenous substrate for cGMP-dependent protein kinase, exists almost exclusively in cerebellar Purkinje cells, where it is possibly involved in the induction of long-term depression. A G-substrate cDNA was identified by screening expressed sequence tag databases from a human brain library. The deduced amino acid sequence of human G-substrate contained two putative phosphorylation sites (Thr-68 and Thr-119) with amino acid sequences [KPRRKDT(p)PALH] that were identical to those reported for rabbit G-substrate. G-substrate mRNA was expressed almost exclusively in the cerebellum as a single transcript. The human G-substrate gene was mapped to human chromosome 7p15 by radiation hybrid panel analysis. In vitro translation products of the cDNA showed an apparent molecular mass of 24 kDa on SDS/PAGE which was close to that of purified rabbit G-substrate (23 kDa). Bacterially expressed human G-substrate is a heat-stable and acid-soluble protein that cross-reacts with antibodies raised against rabbit G-substrate. Recombinant human G-substrate was phosphorylated efficiently by cGMP-dependent protein kinase exclusively at Thr residues, and it was recognized by antibodies specific for rabbit phospho-G-substrate. The amino acid sequences surrounding the sites of phosphorylation in G-substrate are related to those around Thr-34 and Thr-35 of the dopamine- and cAMP-regulated phosphoprotein DARPP-32 and inhibitor-1, respectively, two potent inhibitors of protein phosphatase 1. However, purified G-substrate phosphorylated by cGMP-dependent protein kinase inhibited protein phosphatase 2A more effectively than protein phosphatase 1, suggesting a distinct role as a protein phosphatase inhibitor.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Paired Ig-like receptors (PIR) that can reciprocally modulate cellular activation have been described in mammals. In the present study, we searched expressed sequence tag databases for PIR relatives to identify chicken expressed sequence tags predictive of ≈25% amino acid identity to mouse PIR. Rapid amplification of cDNA ends (RACE)-PCR extension of expressed sequence-tag sequences using chicken splenic cDNA as a template yielded two distinct cDNAs, the sequence analysis of which predicted protein products with related extracellular Ig-like domains. Chicken Ig-like receptor (CHIR)-A was characterized by its transmembrane segment with a positively charged histidine residue and short cytoplasmic tail, thereby identifying CHIR-A as a candidate-activating receptor. Conversely, CHIR-B was characterized by its nonpolar transmembrane segment and cytoplasmic tail with two immunoreceptor tyrosine-based inhibitory motifs, indicating that it may serve as an inhibitory receptor. The use of CHIR amino acid sequences in a search for other PIR relatives led to the recognition of mammalian Fc receptors as distantly related genes. Comparative analyses based on amino acid sequences and three-dimensional protein structures provided molecular evidence for common ancestry of the PIR and Fc receptor gene families.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Natural killer (NK) cells express C-type lectin-like receptors, encoded in the NK gene complex, that interact with major histocompatibility complex class I and either inhibit or activate functional activity. Human NK cells express heterodimers consisting of CD94 and NKG2 family molecules, whereas murine NK cells express homodimers belonging to the Ly-49 family. The corresponding orthologues for other species, however, have not been described. In this report, we used probes derived from the expressed sequence tag database to clone C57BL/6-derived cDNAs homologous to human NKG2-D and CD94. Among normal tissues, murine NKG2-D and CD94 transcripts are highly expressed only in activated NK cells, including both Ly-49A+ and Ly-49A− subpopulations. Additionally, mNKG2-D is expressed in murine NK cell clones KY-1 and KY-2, whereas mCD94 expression is observed only in KY-1 cells but not KY-2. Last, we have finely mapped the physical location of the Cd94 (centromeric) and Nkg2d (telomeric) genes between Cd69 and the Ly49 cluster in the NK complex. Thus, these data indicate the expanding complexity of the NK complex and the corresponding repertoire of C-type lectin-like receptors on murine NK cells.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Limitation of water loss and control of gas exchange is accomplished in plant leaves via stomatal guard cells. Stomata open in response to light when an increase in guard cell turgor is triggered by ions and water influx across the plasma membrane. Recent evidence demonstrating the existence of ATP-binding cassette proteins in plants led us to analyze the effect of compounds known for their ability to modulate ATP-sensitive potassium channels (K-ATP) in animal cells. By using epidermal strip bioassays and whole-cell patch-clamp experiments with Vicia faba guard cell protoplasts, we describe a pharmacological profile that is specific for the outward K+ channel and very similar to the one described for ATP-sensitive potassium channels in mammalian cells. Tolbutamide and glibenclamide induced stomatal opening in bioassays and in patch-clamp experiments, a specific inhibition of the outward K+ channel by these compounds was observed. Conversely, application of potassium channel openers such as cromakalim or RP49356 triggered stomatal closure. An apparent competition between sulfonylureas and potassium channel openers occurred in bioassays, and outward potassium currents, previously inhibited by glibenclamide, were partially recovered after application of cromakalim. By using an expressed sequence tag clone from an Arabidopsis thaliana homologue of the sulfonylurea receptor, a 7-kb transcript was detected by Northern blot analysis in guard cells and other tissues. Beside the molecular evidence recently obtained for the expression of ATP-binding cassette protein transcripts in plants, these results give pharmacological support to the presence of a sulfonylurea-receptor-like protein in the guard-cell plasma membrane tightly involved in the outward potassium channel regulation during stomatal movements.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits.isb-sib.ch).

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Efficient motility of the eukaryotic flagellum requires precise temporal and spatial control of its constituent dynein motors. The central pair and its associated structures have been implicated as important members of a signal transduction cascade that ultimately regulates dynein arm activity. To identify central pair components involved in this process, we characterized a Chlamydomonas motility mutant (pf6-2) obtained by insertional mutagenesis. pf6-2 flagella twitch ineffectively and lack the 1a projection on the C1 microtubule of the central pair. Transformation with constructs containing a full-length, wild-type copy of the PF6 gene rescues the functional, structural, and biochemical defects associated with the pf6 mutation. Sequence analysis indicates that the PF6 gene encodes a large polypeptide that contains numerous alanine-rich, proline-rich, and basic domains and has limited homology to an expressed sequence tag derived from a human testis cDNA library. Biochemical analysis of an epitope-tagged PF6 construct demonstrates that the PF6 polypeptide is an axonemal component that cosediments at 12.6S with several other polypeptides. The PF6 protein appears to be an essential component required for assembly of some of these polypeptides into the C1-1a projection.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The release of vast quantities of DNA sequence data by large-scale genome and expressed sequence tag (EST) projects underlines the necessity for the development of efficient and inexpensive ways to link sequence databases with temporal and spatial expression profiles. Here we demonstrate the power of linking cDNA sequence data (including EST sequences) with transcript profiles revealed by cDNA-AFLP, a highly reproducible differential display method based on restriction enzyme digests and selective amplification under high stringency conditions. We have developed a computer program (GenEST) that predicts the sizes of virtual transcript-derived fragments (TDFs) of in silico-digested cDNA sequences retrieved from databases. The vast majority of the resulting virtual TDFs could be traced back among the thousands of TDFs displayed on cDNA-AFLP gels. Sequencing of the corresponding bands excised from cDNA-AFLP gels revealed no inconsistencies. As a consequence, cDNA sequence databases can be screened very efficiently to identify genes with relevant expression profiles. The other way round, it is possible to switch from cDNA-AFLP gels to sequences in the databases. Using the restriction enzyme recognition sites, the primer extensions and the estimated TDF size as identifiers, the DNA sequence(s) corresponding to a TDF with an interesting expression pattern can be identified. In this paper we show examples in both directions by analyzing the plant parasitic nematode Globodera rostochiensis. Various novel pathogenicity factors were identified by combining ESTs from the infective stage juveniles with expression profiles of ∼4000 genes in five developmental stages produced by cDNA-AFLP.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The biological significance of DNA amplification in cancer is thought to be due to the selection of increased expression of a single or few important genes. However, systematic surveys of the copy number and expression of all genes within an amplified region of the genome have not been performed. Here we have used a combination of molecular, genomic, and microarray technologies to identify target genes for 17q23, a common region of amplification in breast cancers with poor prognosis. Construction of a 4-Mb genomic contig made it possible to define two common regions of amplification in breast cancer cell lines. Analysis of 184 primary breast tumors by fluorescence in situ hybridization on tissue microarrays validated these results with the highest amplification frequency (12.5%) observed for the distal region. Based on GeneMap'99 information, 17 known genes and 26 expressed sequence tags were localized to the contig. Analysis of genomic sequence identified 77 additional transcripts. A comprehensive analysis of expression levels of these transcripts in six breast cancer cell lines was carried out by using complementary DNA microarrays. The expression patterns varied from one cell line to another, and several overexpressed genes were identified. Of these, RPS6KB1, MUL, APPBP2, and TRAP240 as well as one uncharacterized expressed sequence tag were located in the two common amplified regions. In summary, comprehensive analysis of the 17q23 amplicon revealed a limited number of highly expressed genes that may contribute to the more aggressive clinical course observed in breast cancer patients with 17q23-amplified tumors.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Poly(ADP)-ribose polymerase (PADPRP) has been purified to apparent homogeneity from suspension cultures of the maize (Zea mays) callus line. The purified enzyme is a single polypeptide of approximately 115 kD, which appears to dimerize through an S-S linkage. The catalytic properties of the maize enzyme are very similar to those of its animal counterpart. The amino acid sequences of three tryptic peptides were obtained by microsequencing. Antibodies raised against peptides from maize PADPRP cross-reacted specifically with the maize enzyme but not with the enzyme from human cells, and vice versa. We have also characterized a 3.45-kb expressed-sequence-tag clone that contains a full-length cDNA for maize PADPRP. An open reading frame of 2943 bp within this clone encodes a protein of 980 amino acids. The deduced amino acid sequence of the maize PADPRP shows 40% to 42% identity and about 50% similarity to the known vertebrate PADPRP sequences. All important features of the modular structure of the PADPRP molecule, such as two zinc fingers, a putative nuclear localization signal, the automodification domain, and the NAD+-binding domain, are conserved in the maize enzyme. Northern-blot analysis indicated that the cDNA probe hybridizes to a message of about 4 kb.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide.