26 resultados para SPLICING

em University of Queensland eSpace - Australia


Relevância:

20.00% 20.00%

Publicador:

Resumo:

We analyzed the FANTOM2 clone set of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences. We developed a new computational procedure to identify and classify the forms of splice variation evident in this data set and organized the results into a publicly accessible database that can be used for future expression array construction, structural genomics, and analyses of the mechanism and regulation of alternative splicing. Statistical analysis shows that at least 41% and possibly as much as 60% of multiexon genes in mouse have multiple splice forms. Of the transcription units with multiple splice forms, 49% contain transcripts in which the apparent use of an alternative transcription start (stop) is accompanied by alternative splicing of the initial (terminal) exon. This implies that alternative transcription may frequently induce alternative splicing. The fact that 73% of all exons with splice variation fall within the annotated coding region indicates that most splice variation is likely to affect the protein form. Finally, we compared the set of constitutive (present in all transcripts) exons with the set of cryptic (present only in some transcripts) exons and found statistically significant differences in their length distributions, the nucleoticle distributions around their splice junctions, and the frequencies of occurrence of several short sequence motifs.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Limited but significant sequence similarity has been observed between an uncharacterized human protein, SIN1, and the S. pombe SIN1, Dictyostelium RIP3 and S. cerevisiae AVO1 proteins. The human Sin1 gene has been automatically predicted (MAPKAP1; GenBank accession number NM_024117); however, this sequence appears to be incomplete. In this study, we have cloned and characterized the full-length human Sin1 mRNA and identified a highly conserved domain that defines the family of SIN1 orthologues, members of which are widely distributed in the fungal and metazoan kingdoms. We demonstrate that Sin1 transcripts can use alternative polyadenylation signals and describe a number of Sin1 splice variants that potentially encode functionally different isoforms. (C) 2004 Elsevier B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Alternative splicing is widespread in mammalian gene expression, and variant splice patterns are often specific to different stages of development, particular tissues or a disease state. There is a need to systematically collect data on alternatively spliced exons, introns and splice isoforms, and to annotate this data. The Alternative Splicing Database consortium has been addressing this need, and is committed to maintaining and developing a value-added database of alternative splice events, and of experimentally verified regulatory mechanisms that mediate splice variants. In this paper we present two of the products from this project: namely, a database of computationally delineated alternative splice events as seen in alignments of EST/cDNA sequences with genome sequences, and a database of alternatively spliced exons collected from literature. The reported splice events are from nine different organisms and are annotated for various biological features including expression states and cross-species conservation. The data are presented on our ASD web pages (http://www.ebi.ac.uk/asd).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In violation of the 'one gene, one polypeptide' rule, alternative splicing allows individual genes to produce multiple protein isoforms - thereby playing a central part in generating complex proteomes. Alternative splicing also has a largely hidden function in quantitative gene control, by targeting RNAs for nonsense-mediated decay. Traditional gene-by-gene investigations of alternative splicing mechanisms are now being complemented by global approaches. These promise to reveal details of the nature and operation of cellular codes that are constituted by combinations of regulatory elements in pre-mRNA substrates and by cellular complements of splicing regulators, which together determine regulated splicing pathways.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Recently, we identified a large number of ultraconserved (uc) sequences in noncoding regions of human, mouse, and rat genomes that appear to be essential for vertebrate and amniote ontogeny. Here, we used similar methods to identify ultraconserved genomic regions between the insect species Drosophila melanogaster and Drosophila pseudoobscura, as well as the more distantly related Anopheles gambiae. As with vertebrates, ultraconserved sequences in insects appear to Occur primarily in intergenic and intronic sequences, and at intron-exon junctions. The sequences are significantly associated with genes encoding developmental regulators and transcription factors, but are less frequent and are smaller in size than in vertebrates. The longest identical, nongapped orthologous match between the three genomes was found within the homothorax (hth) gene. This sequence spans an internal exon-intron junction, with the majority located within the intron, and is predicted to form a highly stable stem-loop RNA structure. Real-time quantitative PCR analysis of different hth splice isoforms and Northern blotting showed that the conserved element is associated with a high incidence of intron retention in hth pre-mRNA, suggesting that the conserved intronic element is critically important in the post-transcriptional regulation of hth expression in Diptera.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The central dogma of biology holds that genetic information normally flows from DNA to RNA to protein. As a consequence it has been generally assumed that genes generally code for proteins, and that proteins fulfil not only most structural and catalytic but also most regulatory functions, in all cells, from microbes to mammals. However, the latter may not be the case in complex organisms. A number of startling observations about the extent of non-protein-coding RNA (ncRNA) transcription in the higher eukaryotes and the range of genetic and epigenetic phenomena that are RNA-directed suggests that the traditional view of the structure of genetic regulatory systems in animals and plants may be incorrect. ncRNA dominates the genomic output of the higher organisms and has been shown to control chromosome architecture, mRNA turnover and the developmental timing of protein expression, and may also regulate transcription and alternative splicing. This paper re-examines the available evidence and suggests a new framework for considering and understanding the genomic programming of biological complexity, autopoletic development and phenotypic variation. BioEssays 25:930-939,2003. (C) 2003 Wiley Periodicals, Inc.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The chromodomain is 40-50 amino acids in length and is conserved in a wide range of chromatic and regulatory proteins involved in chromatin remodeling. Chromodomain-containing proteins can be classified into families based on their broader characteristics, in particular the presence of other types of domains, and which correlate with different subclasses of the chromodomains themselves. Hidden Markov model (HMM)-generated profiles of different subclasses of chromodomains were used here to identify sequences encoding chromodomain-containing proteins in the mouse transcriptome and genome. A total of 36 different loci encoding proteins containing chromodomains, including 17 novel loci, were identified. Six of these loci (including three apparent pseudogenes, a novel HP1 ortholog, and two novel Msl-3 transcription factor-like proteins) are not present in the human genome, whereas the human genome contains four loci (two CDY orthologs and two apparent CDY pseuclogenes) that are not present in mouse. A number of these loci exhibit alternative splicing to produce different isoforms, including 43 novel variants, some of which lack the chromodomain. The likely functions of these proteins are discussed in relation to the known functions of other chromodomain-containing proteins within the same family.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Classic Hodgkin's lymphoma (HL) tissue contains a small population of morphologically distinct malignant cells called Hodgkin and Reed-Sternberg (HRS) cells, associated with the development of HL. Using 3'-rapid amplification of cDNA ends ( RACE) we identified an alternative mRNA for the DEC-205 multilectin receptor in the HRS cell line L428. Sequence analysis revealed that the mRNA encodes a fusion protein between DEC-205 and a novel C-type lectin DCL-1. Although the 7.5-kb DEC-205 and 4.2-kb DCL-1 mRNA were expressed independently in myeloid and B lymphoid cell lines, the DEC-205/DCL-1 fusion mRNA (9.5 kb) predominated in the HRS cell lines ( L428, KM-H2, and HDLM-2). The DEC-205 and DCL-1 genes comprising 35 and 6 exons, respectively, are juxtaposed on chromosome band 2q24 and separated by only 5.4 kb. We determined the DCL-1 transcription initiation site within the intervening sequence by 5'-RACE, confirming that DCL-1 is an independent gene. Two DEC-205/DCL-1 fusion mRNA variants may result from cotranscription of DEC-205 and DCL-1, followed by splicing DEC-205 exon 35 or 34-35 along with DCL-1 exon 1. The resulting reading frames encode the DEC-205 ectodomain plus the DCL-1 ectodomain, the transmembrane, and the cytoplasmic domain. Using DCL-1 cytoplasmic domain-specific polyclonal and DEC-205 monoclonal antibodies for immunoprecipitation/Western blot analysis, we showed that the fusion mRNA is translated into a DEC-205/DCL-1 fusion protein, expressed in the HRS cell lines. These results imply an unusual transcriptional control mechanism in HRS cells, which cotranscribe an mRNA containing DEC-205 and DCL-1 prior to generating the intergenically spliced mRNA to produce a DEC-205/DCL-1 fusion protein.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Objective-To evaluate the haplotype distribution associated with the copper toxicosis gene and the segregation of the mutated allele in a Bedlington Terrier population in Australia. Animals-131 Bedlington Terriers. Procedure-Samples of DNA and RNA were obtained from each dog. Genetic status of each dog was evaluated by use of the DNA markers C04107; single nucleotide polymorphism (SNP), which was adjacent to exon 2 of Murr1; and a deletion marker for exon 2. A subgroup of the study population was evaluated by use of biochemical and histologic techniques to elucidate the correlation between genotype and phenotype. Results-We identified a recombination between the C04107 marker and Murr1 and a variation in a nucleotide in the splice site of exon 2 in our Bedlington Terrier cohort. Furthermore, we identified a novel haplotype associated with copper toxicosis in this cohort. Conclusions and Clinical Relevance-Our findings indicate that the deletion of exon 2 was not the sole cause of copper toxicosis, although only exon 2 deletion of Murr1 has been responsible for copper toxicosis in Bedlington Terriers. Although we failed to find a novel mutation in our cohort, we identified an affected dog family with an intact exon 2. Furthermore, we found that an SNP in the 5' splicing site of exon 2 may or may not be associated with a novel mutation of the Murr1 gene or other genes. Loss of linkage between the C04107 marker and the Murr1 gene was also identified in a certain family of dogs.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at ftp://fantom2.gsc.riken.go.jp/RTPS/. (C). 2004 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Cystic fibrosis is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene, which encodes a chloride channel present in many cells. In cardiomyocytes, we report that multiple exon 1 usage and alternative splicing produces four CFTR transcripts, with different 5'-untranslated regions, CFTRTRAD-139, CFTR-1C/-1A, CFTR-1C, and CFTR-1B. CFTR transcripts containing the novel upstream exons (exons -1C, -1B, and -1A) represent more than 90% of cardiac expressed CFTR mRNA. Regulation of cardiac CFTR expression, in response to developmental and pathological stimuli, is exclusively due to the modulation of CFTR-1C and CFTR-1C/-1A expression. Upstream open reading frames have been identified in the 5'-untranslated regions of all CFTR transcripts that, in conjunction with adjacent stem-loop structures, modulate the efficiency of translation initiation at the AUG codon of the main CFTR coding region in CFTRTRAD-139 and CFTR-1C/-1A transcripts. Exon(-1A), only present in CFTR-1C/-1A transcripts, encodes an AUG codon that is in-frame with the main CFTR open reading frame, the efficient translation of which produces a novel CFTR protein isoform with a curtailed amino terminus. As the expression of this CFTR transcript parallels the spatial and temporal distribution of the cAMP-activated whole-cell current density in normal and diseased hearts, we suggest that CFTR-1C/-1A provides the molecular basis for the cardiac cAMP-activated chloride channel. Our findings provide further insight into the complex nature of in vivo CFTR expression, to which multiple mRNA transcripts, protein isoforms, and post-transcriptional regulatory mechanisms are now added.