38 resultados para Noncoding Sequences
Resumo:
There are 481 segments longer than 200 base pairs (bp) that are absolutely conserved (100% identity with no insertions or deletions) between orthologous regions of the human, rat, and mouse genomes. Nearly all of these segments are also conserved in the chicken and dog genomes, with an average of 95 and 99% identity, respectively. Many are also significantly conserved in fish. These ultraconserved elements of the human genome are most often located either overlapping exons in genes involved in RNA processing or in introns or nearby genes involved in the regulation of transcription and development. Along with more than 5000 sequences of over 100 bp that are absolutely conserved among the three sequenced mammals, these represent a class of genetic elements whose functions and evolutionary origins are yet to be determined, but which are more highly conserved between these species than are proteins and appear to be essential for the ontogeny of mammals and other vertebrates.
Resumo:
Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in intron 2, 5 kb downstream of the core BRCA1 promoter. The functionality of these elements was examined using homologous-recombination-based mutagenesis of reporter gene-tagged cosmids incorporating these regions and flanking sequences from the BRCA1 locus. This showed that CNS-1 and CNS-2 have differential transcriptional regulatory activity in epithelial cell lines. Mutation of CNS-1 significantly reduced reporter gene expression to 30% of control levels. Conversely mutation of CNS-2 increased expression to 200% of control levels. Regulation is at the level of transcription and shows promoter specificity. Both elements also specifically bind nuclear proteins in vitro. These studies demonstrate that the combination of comparative genomics and functional analysis is a successful strategy to identify novel regulatory elements and provide the first direct evidence that conserved noncoding sequences in BRCA1 regulate gene expression. (c) 2005 Elsevier Inc. All rights reserved.
Resumo:
Although MYB overexpression in colorectal cancer (CRC) is known to be a prognostic indicator for poor survival, the basis for this overexpression is unclear. Among multiple levels of MYB regulation, the most dynamic is the control of transcriptional elongation by sequences within intron I. The authors have proposed that this regulatory sequence is transcribed into an RNA stem-loop and 19-residue polyuridine tract, and is subject to mutation in CRC. When this region was examined in colorectal and breast carcinoma cell lines and tissues, the authors found frequent mutations only in CRC. It was determined that these mutations allowed increased transcription compared with the wild type sequence. These data suggest that this MYB regulatory region within intron I is subject to mutations in CRC but not breast cancer, perhaps consistent with the mutagenic insult that occurs within the colon and not mammary tissue. In CRC, these mutations may contribute to MYB overexpression, highlighting the importance of noncoding sequences in the regulation of key cancer genes. (c) 2006 Wiley-Liss, Inc.
Resumo:
Despite the presence of over 3 million transposons separated on average by similar to 500 bp, the human and mouse genomes each contain almost 1000 transposon-free regions (TFRs) over 10 kb in length. The majority of human TFRs correlate with orthologous TFRs in the mouse, despite the fact that most transposons are lineage specific. Many human TFRs also overlap with orthologous TFRs in the marsupial opossum, indicating that these regions have remained refractory to transposon insertion for long evolutionary periods. Over 90% of the bases covered by TFRs are noncoding, much of which is not highly conserved. Most TFRs are not associated with unusual nucleotide composition, but are significantly associated with genes encoding developmental regulators, suggesting that they represent extended regions of regulatory information that are largely unable to tolerate insertions, a conclusion difficult to reconcile with current conceptions of gene regulation.
Resumo:
With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA)-RNA molecules that are not translated to protein products-has become more evident. A subclass of ncRNA transcripts are encoded by highly regulated, multi-exon, transcriptional units, are processed like typical protein-coding mRNAs and are increasingly implicated in regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein coding potential, we extracted 4280 transcripts as the largest-candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.
Resumo:
The nuclear import of simian-virus-40 large T-antigen (tumour antigen) is enhanced via phosphorylation by the protein kinase CK2 at Ser(112) in the vicinity of the NLS (nuclear localization sequence). To determine the structural basis of the effect of the sequences flanking the basic cluster KKKRK, and the effect of phosphorylation on the recognition of the NLS by the nuclear import factor importin-alpha (Impalpha), we co-crystallized non-autoinhibited Impalpha with peptides corresponding to the phosphorylated and non-phosphorylated forms of the NLS, and determined the crystal structures of the complexes. The structures show that the amino acids N-terminally flanking the basic cluster make specific contacts with the receptor that are distinct from the interactions between bipartite NLSs and Impalpha. We confirm the important role of flanking sequences using binding assays. Unexpectedly, the regions of the peptides containing the phosphorylation site do not make specific contacts with the receptor. Binding assays confirm that phosphorylation does not increase the affinity of the T-antigen NLS to Impalpha. We conclude that the sequences flanking the basic clusters in NLSs play a crucial role in nuclear import by modulating the recognition of the NLS by Impalpha, whereas phosphorylation of the T-antigen enhances nuclear import by a mechanism that does not involve a direct interaction of the phosphorylated residue with Impalpha.
Resumo:
Although the importance of CD4(+) T cell responses to human cytonnegalovirus (HCMV) has recently been recognized in transplant and immunosuppressed patients, the precise specificity and nature of this response has remained largely unresolved. In the present study we have isolated CD4(+) CTL which recognize epitopes from HCMV glycoproteins gB and gH in association with two different HLA-DR antigens, DRA1*0101/DRB1*0701 (DR7) and DRA1*0101/DRB1*1101 (DR11). Comparison of amino acid sequences of HICMV isolates revealed that the gB and gH epitope sequences recognized by human CD4(+) T cells were not only conserved in clinical isolates from HCMV but also in CMV isolates from higher primates (chimpanzee, rhesus and baboon). Interestingly, these epitope sequences from chimpanzee, rhesus and baboon CMV are efficiently recognized by human CD4(+) CTL. More importantly, we show that gB-specific T cells from humans can also efficiently lyse pepticle-sensitized Patr-DR7(+) cells from chimpanzees. These findings suggest that conserved gB and gH epitopes should be considered while designing a prophylactic vaccine against HCMV. In addition, they also provide a functional basis for the conservation of MHC class 11 lineages between humans and Old World primates and open the possibility for the use of such primate models in vaccine development against HCMV.
Resumo:
Sugarcane moth borers are a diverse group of species occurring in several genera, but predominately within the Noctuidae and Pyraloidea. They cause economic loss in sugarcane and other crops through damage to stems and stalks by larval boring. Partial sequence data from two mitochondrial genes, COII and 16S, were used to construct a molecular phylogeny based on 26 species from ten genera and six tribes. The Noctuidae were found to be monophyletic, providing molecular support for the taxonomy within this subfamily. However, the Pyraloidea are paraphyletic, with the noctuids splitting Galleriinae and Schoenobiinae from the Crambinae. This supports the separation of the Pyralidae and Crambinae, but does not support the concept of the incorporation of the Schoenobiinae in the Crambidae. Of the three crambine genera examined, Diatraea was monophyletic, Chilo paraphyletic, and Eoreuma was basal to the other two genera. Within the Noctuidae, Sesamia and Bathytricha were monophyletic, with Busseola basal to Bathytricha. Many species in this study (both noctuids and pyraloids) had different biotypes within collection localities and across their distribution; however the individual biotypes were not phylogenetically informative. These data highlight the need for taxonomic revisions at all taxon levels and provide a basis for the development of DNA-based diagnostics for rapidly identifying many species at any developmental stage. This ability is vital, as the species are an incursion threat to Australia and have the potential to cause significant losses to the sugar industry.
Resumo:
The folding of HIV gp41 into a 6-helix bundle drives virus-cell membrane fusion. To examine the structural relationship between the 6-helix bundle core domain and other regions of gp41, we expressed in Escherichia coli, the entire ectodomain of HIV-2(ST) gp41 as a soluble, trimeric maltose-binding protein (MBP)/gp41 chimera. Limiting proteolysis indicated that the Cys-591-Cys-597 disulfide-bonded region is outside a core domain comprising two peptides, Thr-529-Trp-589 and Val-604-Ser-666. A biochemical examination of MBP/gp41 chimeras encompassing these core peptides; indicated that the N-terminal polar segment, 521-528, and C-terminal membrane-proximal segment, 658-666, cooperate in stabilizing the ectodomain. A functional interaction between sequences outside the gp41 core may contribute energy to membrane fusion. (C) 2004 Published by Elsevier B.V. on behalf of the Federation of European Biochemical Societies.
Resumo:
Translational pausing may occur due to a number of mechanisms, including the presence of non-optimal codons, and it is thought to play a role in the folding of specific polypeptide domains during translation and in the facilitation of signal peptide recognition during see-dependent protein targeting. In this whole genome analysis of Escherichia coli we have found that non-optimal codons in the signal peptide-encoding sequences of secretory genes are overrepresented relative to the mature portions of these genes; this is in addition to their overrepresentation in the 5'-regions of genes encoding non-secretory proteins. We also find increased non-optimal codon usage at the 3' ends of most E. coli genes, in both non-secretory and secretory sequences. Whereas presumptive translational pausing at the 5' and 3' ends of E. coli messenger RNAs may clearly have a general role in translation, we suggest that it also has a specific role in sec-dependent protein export, possibly in facilitating signal peptide recognition. This finding may have important implications for our understanding of how the majority of non-cytoplasmic proteins are targeted, a process that is essential to all biological cells. (C) 2004 Elsevier Inc. All rights reserved.
Resumo:
Phylogenetic relationships within the Capsalidae (Monogenea) were examined Using large subunit ribosomal DNA sequences from 17 capsalid species (representing 7 genera, 5 subfamilies), 2 outgroup taxa (Monocotylidae) plus Udonella caligorum (Udonellidae). Trees were constructed using maximum likelihood, minimum evolution and maximum parsimony algorithms. An initial tree, generated from sequences 315 bases long, Suggests that Capsalinae, Encotyllabinae, Entobdellinae and Trochopodinae are monophyletic, but that Benedeniinae is paraphyletic. Analyses indicate that Neobenedenia, currently in the Benedeniinae, should perhaps be placed in 2 separate subfamily. An additional analysis was made which omitted 3 capsalid taxa (for which only short sequences were available) and all outgroup taxa because of alignment difficulties. Sequence length increased to 693 bases and good branch support was achieved. The Benedeniinae was again paraphyletic. Higher-level classification of the Capsalidae, evolution of the Entobdellinae and issues of species identity in Neobenedenia are discussed.
Resumo:
The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at ftp://fantom2.gsc.riken.go.jp/RTPS/. (C). 2004 Elsevier Inc. All rights reserved.
Resumo:
To investigate the role of the hepatitis C virus internal ribosome entry site (HCV IRES) domain IV in translation initiation and regulation, two chimeric IRES elements were constructed to contain the reciprocal domain IV in the otherwise HCV and classical swine fever virus IRES elements. This permitted an examination of the role of domain IV in the control of HCV translation. A specific inhibitor of the HCV IRES, vitamin B-12 was shown to inhibit translation directed by all IRES elements which contained domain IV from the HCV and the GB virus B IRES elements, whereas the HCV core protein could only suppress translation from the wild-type HCV IRES. Thus, the mechanisms of translation inhibition by vitamin B-12 and the core protein differ, and they target different regions of the IRES.
Resumo:
The promoter regions of plant pararetroviruses direct transcription of the full-length viral genome into a pregenomic RNA that is an intermediate in the replication of the virus. It serves as template for reverse transcription and as polycistronic mRNA for translation to viral proteins. We have identified functional promoter elements in the intergenic region of the Cavendish isolate of Banana streak virus (BSV-Cav), a member of the genus Badnavirus. Potential binding sites for plant transcription factors were found both upstream and downstream of the transcription start site by homology search in the PLACE database of plant cis-acting elements. The functionality of these putative cis-acting elements was tested by constructing loss-of-function and regain-of-function mutant promoters whose activity was quantified in embryogenic sugarcane suspension cells. Four regions that are important for activity of the BSV-Cav promoter were identified: the region containing an as-l-like element, the region around-141 and down to -77, containing several putative transcription factor binding sites, the region including the CAAT-box, and the leader region. The results could help explain the high BSV-Cav promoter activity that was observed previously in transgenic sugarcane plants and give more insight into the plant cell-mediated replication of the viral genome in banana streak disease. (C) 2004 Elsevier B.V. All rights reserved.