1000 resultados para noncoding DNA
Resumo:
We have determined the sequence of the first 1371 nucleotides at the 5' end of the genome of mouse mammary tumor virus using molecularly cloned proviral DNA of the GR virus strain. The most likely initiation codon used for the gag gene of mouse mammary tumor virus is the first one, located 312 nucleotides from the 5' end of the viral RNA. The 5' splicing site for the subgenomic mRNA's is located approximately 288 nucleotides downstream from the 5' end of the viral RNA. From the DNA sequence the amino acid sequence of the N-terminal half of the gag precursor protein, including p10 and p21, was deduced (353 amino acids).
Resumo:
A number of experimental methods have been reported for estimating the number of genes in a genome, or the closely related coding density of a genome, defined as the fraction of base pairs in codons. Recently, DNA sequence data representative of the genome as a whole have become available for several organisms, making the problem of estimating coding density amenable to sequence analytic methods. Estimates of coding density for a single genome vary widely, so that methods with characterized error bounds have become increasingly desirable. We present a method to estimate the protein coding density in a corpus of DNA sequence data, in which a ‘coding statistic’ is calculated for a large number of windows of the sequence under study, and the distribution of the statistic is decomposed into two normal distributions, assumed to be the distributions of the coding statistic in the coding and noncoding fractions of the sequence windows. The accuracy of the method is evaluated using known data and application is made to the yeast chromosome III sequence and to C.elegans cosmid sequences. It can also be applied to fragmentary data, for example a collection of short sequences determined in the course of STS mapping.
Resumo:
We performed whole genome sequencing in 16 unrelated patients with autosomal recessive retinitis pigmentosa (ARRP), a disease characterized by progressive retinal degeneration and caused by mutations in over 50 genes, in search of pathogenic DNA variants. Eight patients were from North America, whereas eight were Japanese, a population for which ARRP seems to have different genetic drivers. Using a specific workflow, we assessed both the coding and noncoding regions of the human genome, including the evaluation of highly polymorphic SNPs, structural and copy number variations, as well as 69 control genomes sequenced by the same procedures. We detected homozygous or compound heterozygous mutations in 7 genes associated with ARRP (USH2A, RDH12, CNGB1, EYS, PDE6B, DFNB31, and CERKL) in eight patients, three Japanese and five Americans. Fourteen of the 16 mutant alleles identified were previously unknown. Among these, there was a 2.3-kb deletion in USH2A and an inverted duplication of ∼446 kb in EYS, which would have likely escaped conventional screening techniques or exome sequencing. Moreover, in another Japanese patient, we identified a homozygous frameshift (p.L206fs), absent in more than 2,500 chromosomes from ethnically matched controls, in the ciliary gene NEK2, encoding a serine/threonine-protein kinase. Inactivation of this gene in zebrafish induced retinal photoreceptor defects that were rescued by human NEK2 mRNA. In addition to identifying a previously undescribed ARRP gene, our study highlights the importance of rare structural DNA variations in Mendelian diseases and advocates the need for screening approaches that transcend the analysis of the coding sequences of the human genome.
Resumo:
In sharp contrast to birds and mammals, most cold-blooded vertebrates have homomorphic (morphologically undifferentiated) sex chromosomes. This might result either from recurrent X-Y recombination (occurring e.g. during occasional events of sex reversal) or from frequent turnovers (during which sex-determining genes are overthrown by new autosomal mutations). Evidence for turnovers is indeed mounting in fish, but very few have so far been documented in amphibians, possibly because of practical difficulties in identifying sex chromosomes. Female heterogamety (ZW) has long been established in Bufo bufo, based on sex reversal and crossing experiments. Here, we investigate a sex-linked marker identified from a laboratory cross between Palearctic green toads (Bufo viridis subgroup). The F(1) offspring produced by a female Bufo balearicus and a male Bufo siculus were phenotypically sexed, displaying an even sex ratio. A sex-specific marker detected in highly reproducible AFLP genotypes was cloned. Sequencing revealed a noncoding, microsatellite-containing fragment. Reamplification and genotyping of families of this and a reciprocal cross showed B. siculus to be male heterogametic (XY) and suggested the same system for B. balearicus. Our results thus reveal a cryptic heterogametic transition within bufonid frogs and help explain patterns of hybrid fitness within the B. viridis subgroup. Turnovers of genetic sex-determination systems may be more frequent in amphibians than previously thought and thus contribute to the prevalence of homomorphic sex chromosomes in this group.
Resumo:
In order to contribute to the debate about southern glacial refugia used by temperate species and more northern refugia used by boreal or cold-temperate species, we examined the phylogeography of a widespread snake species (Vipera berus) inhabiting Europe up to the Arctic Circle. The analysis of the mitochondrial DNA (mtDNA) sequence variation in 1043 bp of the cytochrome b gene and in 918 bp of the noncoding control region was performed with phylogenetic approaches. Our results suggest that both the duplicated control region and cytochrome b evolve at a similar rate in this species. Phylogenetic analysis showed that V. berus is divided into three major mitochondrial lineages, probably resulting from an Italian, a Balkan and a Northern (from France to Russia) refugial area in Eastern Europe, near the Carpathian Mountains. In addition, the Northern clade presents an important substructure, suggesting two sequential colonization events in Europe. First, the continent was colonized from the three main refugial areas mentioned above during the Lower-Mid Pleistocene. Second, recolonization of most of Europe most likely originated from several refugia located outside of the Mediterranean peninsulas (Carpathian region, east of the Carpathians, France and possibly Hungary) during the Mid-Late Pleistocene, while populations within the Italian and Balkan Peninsulas fluctuated only slightly in distribution range, with larger lowland populations during glacial times and with refugial mountain populations during interglacials, as in the present time. The phylogeographical structure revealed in our study suggests complex recolonization dynamics of the European continent by V. berus, characterized by latitudinal as well as altitudinal range shifts, driven by both climatic changes and competition with related species.
Resumo:
DnaSP, DNA Sequence Polymorphism, is a software package for the analysis of nucleotide polymorphism from aligned DNA sequence data. DnaSP can estimate several measures of DNA sequence variation within and between populations (in noncoding, synonymous or nonsynonymous sites, or in various sorts of codon positions), as well as linkage disequilibrium, recombination, gene flow and gene conversion parameters. DnaSP can also carry out several tests of neutrality: Hudson, Kreitman and Aguadé (1987), Tajima (1989), McDonald and Kreitman (1991), Fu and Li (1993), and Fu (1997) tests. Additionally, DnaSP can estimate the confidence intervals of some test-statistics by the coalescent. The results of the analyses are displayed on tabular and graphic form.
Resumo:
Genomic sequence comparison across species has enabled the elucidation of important coding and regulatory sequences encoded within DNA. Of particular interest are the noncoding regulatory sequences, which influence gene transcriptional and posttranscriptional processes. A phylogenetic footprinting strategy was employed to identify noncoding conservation patterns of 39 human and bovine orthologous genes. Seventy-three conserved noncoding sequences were identified that shared greater than 70% identity over at least 100 bp. Thirteen of these conserved sequences were also identified in the mouse genome. Evolutionary conservation of noncoding sequences across diverse species may have functional significance, and these conserved sequences may be good candidates for regulatory elements.
Resumo:
Pós-graduação em Biotecnologia - IQ
Resumo:
Abstract Background Pancreatic ductal adenocarcinoma (PDAC) is known by its aggressiveness and lack of effective therapeutic options. Thus, improvement in current knowledge of molecular changes associated with pancreatic cancer is urgently needed to explore novel venues of diagnostics and treatment of this dismal disease. While there is mounting evidence that long noncoding RNAs (lncRNAs) transcribed from intronic and intergenic regions of the human genome may play different roles in the regulation of gene expression in normal and cancer cells, their expression pattern and biological relevance in pancreatic cancer is currently unknown. In the present work we investigated the relative abundance of a collection of lncRNAs in patients' pancreatic tissue samples aiming at identifying gene expression profiles correlated to pancreatic cancer and metastasis. Methods Custom 3,355-element spotted cDNA microarray interrogating protein-coding genes and putative lncRNA were used to obtain expression profiles from 38 clinical samples of tumor and non-tumor pancreatic tissues. Bioinformatics analyses were performed to characterize structure and conservation of lncRNAs expressed in pancreatic tissues, as well as to identify expression signatures correlated to tissue histology. Strand-specific reverse transcription followed by PCR and qRT-PCR were employed to determine strandedness of lncRNAs and to validate microarray results, respectively. Results We show that subsets of intronic/intergenic lncRNAs are expressed across tumor and non-tumor pancreatic tissue samples. Enrichment of promoter-associated chromatin marks and over-representation of conserved DNA elements and stable secondary structure predictions suggest that these transcripts are generated from independent transcriptional units and that at least a fraction is under evolutionary selection, and thus potentially functional. Statistically significant expression signatures comprising protein-coding mRNAs and lncRNAs that correlate to PDAC or to pancreatic cancer metastasis were identified. Interestingly, loci harboring intronic lncRNAs differentially expressed in PDAC metastases were enriched in genes associated to the MAPK pathway. Orientation-specific RT-PCR documented that intronic transcripts are expressed in sense, antisense or both orientations relative to protein-coding mRNAs. Differential expression of a subset of intronic lncRNAs (PPP3CB, MAP3K14 and DAPK1 loci) in metastatic samples was confirmed by Real-Time PCR. Conclusion Our findings reveal sets of intronic lncRNAs expressed in pancreatic tissues whose abundance is correlated to PDAC or metastasis, thus pointing to the potential relevance of this class of transcripts in biological processes related to malignant transformation and metastasis in pancreatic cancer.
Resumo:
The down-regulation of the tumor-suppressor gene RASSF1A has been shown to increase cell proliferation in several tumors. RASSF1A expression is regulated through epigenetic events involving the polycomb repressive complex 2 (PRC2); however, the molecular mechanisms modulating the recruitment of this epigenetic modifier to the RASSF1 locus remain largely unknown. Here, we identify and characterize ANRASSF1, an endogenous unspliced long noncoding RNA (lncRNA) that is transcribed from the opposite strand on the RASSF1 gene locus in several cell lines and tissues and binds PRC2. ANRASSF1 is transcribed through RNA polymerase II and is 5'-capped and polyadenylated; it exhibits nuclear localization and has a shorter half-life compared with other lncRNAs that bind PRC2. ANRASSF1 endogenous expression is higher in breast and prostate tumor cell lines compared with non-tumor, and an opposite pattern is observed for RASSF1A. ANRASSF1 ectopic overexpression reduces RASSF1A abundance and increases the proliferation of HeLa cells, whereas ANRASSF1 silencing causes the opposite effects. These changes in ANRASSF1 levels do not affect the RASSF1C isoform abundance. ANRASSF1 overexpression causes a marked increase in both PRC2 occupancy and histone H3K27me3 repressive marks, specifically at the RASSF1A promoter region. No effect of ANRASSF1 overexpression was detected on PRC2 occupancy and histone H3K27me3 at the promoter regions of RASSF1C and the four other neighboring genes, including two well-characterized tumor suppressor genes. Additionally, we demonstrated that ANRASSF1 forms an RNA/DNA hybrid and recruits PRC2 to the RASSF1A promoter. Together, these results demonstrate a novel mechanism of epigenetic repression of the RASSF1A tumor suppressor gene involving antisense unspliced lncRNA, in which ANRASSF1 selectively represses the expression of the RASSF1 isoform overlapping the antisense transcript in a location-specific manner. In a broader perspective, our findings suggest that other non-characterized unspliced intronic lncRNAs transcribed in the human genome might contribute to a location-specific epigenetic modulation of genes.
Resumo:
Editing of RNA changes the read-out of information from DNA by altering the nucleotide sequence of a transcript. One type of RNA editing found in all metazoans uses double-stranded RNA (dsRNA) as a substrate and results in the deamination of adenosine to give inosine, which is translated as guanosine. Editing thus allows variant proteins to be produced from a single pre-mRNA. A mechanism by which dsRNA substrates form is through pairing of intronic and exonic sequences before the removal of noncoding sequences by splicing. Here we report that the RNA editing enzyme, human dsRNA adenosine deaminase (DRADA1, or ADAR1) contains a domain (Zα) that binds specifically to the left-handed Z-DNA conformation with high affinity (KD = 4 nM). As formation of Z-DNA in vivo occurs 5′ to, or behind, a moving RNA polymerase during transcription, recognition of Z-DNA by DRADA1 provides a plausible mechanism by which DRADA1 can be targeted to a nascent RNA so that editing occurs before splicing. Analysis of sequences related to Zα has allowed identification of motifs common to this class of nucleic acid binding domain.
Resumo:
Mouse skin tumors contain activated c-H-ras oncogenes, often caused by point mutations at codons 12 and 13 in exon 1 and codons 59 and 61 in exon 2. Mutagenesis by the noncoding apurinic sites can produce G-->T and A-->T transversions by DNA misreplication with more frequent insertion of deoxyadenosine opposite the apurinic site. Papillomas were induced in mouse skin by several aromatic hydrocarbons, and mutations in the c-H-ras gene were determined to elucidate the relationship among DNA adducts, apurinic sites, and ras oncogene mutations. Dibenzo[a,l]pyrene (DB[a,l]P), DB[a,l]P-11,12-dihydrodiol, anti-DB[a,l]P-11,12-diol-13,14-epoxide, DB[a,l]P-8,9-dihydrodiol, 7,12-dimethylbenz[a]anthracene (DMBA), and 1,2,3,4-tetrahydro-DMBA consistently induced a CAA-->CTA mutation in codon 61 of the c-H-ras oncogene. Benzo[a]pyrene induced a GGC-->GTC mutation in codon 13 in 54% of tumors and a CAA-->CTA mutation in codon 61 in 15%. The pattern of mutations induced by each hydrocarbon correlated with its profile of DNA adducts. For example, both DB[a,l]P and DMBA primarily form DNA adducts at the N-3 and/or N-7 of deoxyadenosine that are lost from the DNA by depurination, generating apurinic sites. Thus, these results support the hypothesis that misreplication of unrepaired apurinic sites generated by loss of hydrocarbon-DNA adducts is responsible for transforming mutations leading to papillomas in mouse skin.
Resumo:
Large numbers of noncoding RNA transcripts (ncRNAS) are being revealed by complementary DNA cloning and genome tiling array studies in animals. The big and as yet largely unanswered question is whether these transcripts are relevant. A paper by Willingham et al. shows the way forward by developing a strategy for large-scale functional screening of ncRNAs, involving small interfering RNA knockdowns in cell-based screens, which identified a previously unidentified ncRNA repressor of the transcription factor NFAT. It appears likely that ncRNAs constitute a critical hidden layer of gene regulation in complex organisms, the understanding of which requires new approaches in functional genomics.
Resumo:
To detect the presence of male DNA in vaginal samples collected from survivors of sexual violence and stored on filter paper. A pilot study was conducted to evaluate 10 vaginal samples spotted on sterile filter paper: 6 collected at random in April 2009 and 4 in October 2010. Time between sexual assault and sample collection was 4-48hours. After drying at room temperature, the samples were placed in a sterile envelope and stored for 2-3years until processing. DNA extraction was confirmed by polymerase chain reaction for human β-globin, and the presence of prostate-specific antigen (PSA) was quantified. The presence of the Y chromosome was detected using primers for sequences in the TSPY (Y7/Y8 and DYS14) and SRY genes. β-Globin was detected in all 10 samples, while 2 samples were positive for PSA. Half of the samples amplified the Y7/Y8 and DYS14 sequences of the TSPY gene and 30% amplified the SRY gene sequence of the Y chromosome. Four male samples and 1 female sample served as controls. Filter-paper spots stored for periods of up to 3years proved adequate for preserving genetic material from vaginal samples collected following sexual violence.
Resumo:
The Fourier transform-infrared (FT-IR) signature of dry samples of DNA and DNA-polypeptide complexes, as studied by IR microspectroscopy using a diamond attenuated total reflection (ATR) objective, has revealed important discriminatory characteristics relative to the PO2(-) vibrational stretchings. However, DNA IR marks that provide information on the sample's richness in hydrogen bonds have not been resolved in the spectral profiles obtained with this objective. Here we investigated the performance of an all reflecting objective (ARO) for analysis of the FT-IR signal of hydrogen bonds in DNA samples differing in base richness types (salmon testis vs calf thymus). The results obtained using the ARO indicate prominent band peaks at the spectral region representative of the vibration of nitrogenous base hydrogen bonds and of NH and NH2 groups. The band areas at this spectral region differ in agreement with the DNA base richness type when using the ARO. A peak assigned to adenine was more evident in the AT-rich salmon DNA using either the ARO or the ATR objective. It is concluded that, for the discrimination of DNA IR hydrogen bond vibrations associated with varying base type proportions, the use of an ARO is recommended.