27 resultados para Human Genome
Resumo:
Background: A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results: In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions: This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them.
Resumo:
The finished version of the human genome sequence was completed in 2003, and this event initiated a revolution in medical practice, which is usually referred to as the age of genomic or personalized medicine. Genomic medicine aims to be predictive, personalized, preventive, and also participative (4Ps). It offers a new approach to several pathological conditions, although its impact so far has been more evident in mendelian diseases. This article briefly reviews the potential advantages of this approach, and also some issues that may arise in the attempt to apply the accumulated knowledge from genomic medicine to clinical practice in emerging countries. The advantages of applying genomic medicine into clinical practice are obvious, enabling prediction, prevention, and early diagnosis and treatment of several genetic disorders. However, there are also some issues, such as those related to: (a) the need for approval of a law equivalent to the Genetic Information Nondiscrimination Act, which was approved in 2008 in the USA; (b) the need for private and public funding for genetics and genomics; (c) the need for development of innovative healthcare systems that may substantially cut costs (e.g. costs of periodic medical followup); (d) the need for new graduate and postgraduate curricula in which genomic medicine is emphasized; and (e) the need to adequately inform the population and possible consumers of genetic testing, with reference to the basic aspects of genomic medicine.
Resumo:
The down-regulation of the tumor-suppressor gene RASSF1A has been shown to increase cell proliferation in several tumors. RASSF1A expression is regulated through epigenetic events involving the polycomb repressive complex 2 (PRC2); however, the molecular mechanisms modulating the recruitment of this epigenetic modifier to the RASSF1 locus remain largely unknown. Here, we identify and characterize ANRASSF1, an endogenous unspliced long noncoding RNA (lncRNA) that is transcribed from the opposite strand on the RASSF1 gene locus in several cell lines and tissues and binds PRC2. ANRASSF1 is transcribed through RNA polymerase II and is 5'-capped and polyadenylated; it exhibits nuclear localization and has a shorter half-life compared with other lncRNAs that bind PRC2. ANRASSF1 endogenous expression is higher in breast and prostate tumor cell lines compared with non-tumor, and an opposite pattern is observed for RASSF1A. ANRASSF1 ectopic overexpression reduces RASSF1A abundance and increases the proliferation of HeLa cells, whereas ANRASSF1 silencing causes the opposite effects. These changes in ANRASSF1 levels do not affect the RASSF1C isoform abundance. ANRASSF1 overexpression causes a marked increase in both PRC2 occupancy and histone H3K27me3 repressive marks, specifically at the RASSF1A promoter region. No effect of ANRASSF1 overexpression was detected on PRC2 occupancy and histone H3K27me3 at the promoter regions of RASSF1C and the four other neighboring genes, including two well-characterized tumor suppressor genes. Additionally, we demonstrated that ANRASSF1 forms an RNA/DNA hybrid and recruits PRC2 to the RASSF1A promoter. Together, these results demonstrate a novel mechanism of epigenetic repression of the RASSF1A tumor suppressor gene involving antisense unspliced lncRNA, in which ANRASSF1 selectively represses the expression of the RASSF1 isoform overlapping the antisense transcript in a location-specific manner. In a broader perspective, our findings suggest that other non-characterized unspliced intronic lncRNAs transcribed in the human genome might contribute to a location-specific epigenetic modulation of genes.
Resumo:
A family of detoxifying enzymes called aldehyde dehydrogenases (ALDHs) has been a subject of recent interest, as its role in detoxifying aldehydes that accumulate through metabolism and to which we are exposed from the environment has been elucidated. Although the human genome has 19 ALDH genes, one ALDH emerges as a particularly important enzyme in a variety of human pathologies. This ALDH, ALDH2, is located in the mitochondrial matrix with much known about its role in ethanol metabolism. Less known is a new body of research to be discussed in this review, suggesting that ALDH2 dysfunction may contribute to a variety of human diseases including cardiovascular diseases, diabetes, neurodegenerative diseases, stroke, and cancer. Recent studies suggest that ALDH2 dysfunction is also associated with Fanconi anemia, pain, osteoporosis, and the process of aging. Furthermore, an ALDH2 inactivating mutation (termed ALDH2*2) is the most common single point mutation in humans, and epidemiological studies suggest a correlation between this inactivating mutation and increased propensity for common human pathologies. These data together with studies in animal models and the use of new pharmacological tools that activate ALDH2 depict a new picture related to ALDH2 as a critical health-promoting enzyme.
Resumo:
Abstract Background RNAs transcribed from intronic regions of genes are involved in a number of processes related to post-transcriptional control of gene expression. However, the complement of human genes in which introns are transcribed, and the number of intronic transcriptional units and their tissue expression patterns are not known. Results A survey of mRNA and EST public databases revealed more than 55,000 totally intronic noncoding (TIN) RNAs transcribed from the introns of 74% of all unique RefSeq genes. Guided by this information, we designed an oligoarray platform containing sense and antisense probes for each of 7,135 randomly selected TIN transcripts plus the corresponding protein-coding genes. We identified exonic and intronic tissue-specific expression signatures for human liver, prostate and kidney. The most highly expressed antisense TIN RNAs were transcribed from introns of protein-coding genes significantly enriched (p = 0.002 to 0.022) in the 'Regulation of transcription' Gene Ontology category. RNA polymerase II inhibition resulted in increased expression of a fraction of intronic RNAs in cell cultures, suggesting that other RNA polymerases may be involved in their biosynthesis. Members of a subset of intronic and protein-coding signatures transcribed from the same genomic loci have correlated expression patterns, suggesting that intronic RNAs regulate the abundance or the pattern of exon usage in protein-coding messages. Conclusion We have identified diverse intronic RNA expression patterns, pointing to distinct regulatory roles. This gene-oriented approach, using a combined intron-exon oligoarray, should permit further comparative analysis of intronic transcription under various physiological and pathological conditions, thus advancing current knowledge about the biological functions of these noncoding RNAs.
Resumo:
Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease’s etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
Resumo:
Background: Malaria caused by Plasmodium vivax is an experimentally neglected severe disease with a substantial burden on human health. Because of technical limitations, little is known about the biology of this important human pathogen. Whole genome analysis methods on patient-derived material are thus likely to have a substantial impact on our understanding of P. vivax pathogenesis and epidemiology. For example, it will allow study of the evolution and population biology of the parasite, allow parasite transmission patterns to be characterized, and may facilitate the identification of new drug resistance genes. Because parasitemias are typically low and the parasite cannot be readily cultured, on-site leukocyte depletion of blood samples is typically needed to remove human DNA that may be 1000X more abundant than parasite DNA. These features have precluded the analysis of archived blood samples and require the presence of laboratories in close proximity to the collection of field samples for optimal pre-cryopreservation sample preparation. Results: Here we show that in-solution hybridization capture can be used to extract P. vivax DNA from human contaminating DNA in the laboratory without the need for on-site leukocyte filtration. Using a whole genome capture method, we were able to enrich P. vivax DNA from bulk genomic DNA from less than 0.5% to a median of 55% (range 20%-80%). This level of enrichment allows for efficient analysis of the samples by whole genome sequencing and does not introduce any gross biases into the data. With this method, we obtained greater than 5X coverage across 93% of the P. vivax genome for four P. vivax strains from Iquitos, Peru, which is similar to our results using leukocyte filtration (greater than 5X coverage across 96% of the genome). Conclusion: The whole genome capture technique will enable more efficient whole genome analysis of P. vivax from a larger geographic region and from valuable archived sample collections.
Resumo:
Human cells are constantly exposed to DNA damage. Without repair, damage can result in genetic instability and eventually cancer. The strong association between the lack of DNA damage repair, mutations and cancer is dramatically demonstrated by a number of cancer-prone human syndromes, such as xeroderma pigmentosum (XP), ataxia-telangiectasia (AT) and Fanconi anemia (FA). This review focuses on the historical discoveries related with these three diseases and describes their impact on the understanding of DNA repair mechanisms and the causes of human cancer. As deficiencies in DNA repair are also often related with progeria symptoms, unrepaired damage and aging are somehow related. Several other pathologies associated with DNA repair defects, genetic instability and increased cancer risk are also discussed. In fact, studies with cells from these many syndromes have helped in understanding important levels of protection against cancer and aging, although little help has actually been conferred to the patients in terms of therapy. Finally, the recent advances in combined basic and translational research on DNA repair and chemotherapy are presented.
Resumo:
Characterization of population genetic variation and structure can be used as tools for research in human genetics and population isolates are of great interest. The aim of the present study was to characterize the genetic structure of Xavante Indians and compare it with other populations. The Xavante, an indigenous population living in Brazilian Central Plateau, is one of the largest native groups in Brazil. A subset of 53 unrelated subjects was selected from the initial sample of 300 Xavante Indians. Using 86,197 markers, Xavante were compared with all populations of HapMap Phase III and HGDP-CEPH projects and with a Southeast Brazilian population sample to establish its population structure. Principal Components Analysis showed that the Xavante Indians are concentrated in the Amerindian axis near other populations of known Amerindian ancestry such as Karitiana, Pima, Surui and Maya and a low degree of genetic admixture was observed. This is consistent with the historical records of bottlenecks experience and cultural isolation. By calculating pair-wise F-st statistics we characterized the genetic differentiation between Xavante Indians and representative populations of the HapMap and from HGDP-CEPH project. We found that the genetic differentiation between Xavante Indians and populations of Ameridian, Asian, European, and African ancestry increased progressively. Our results indicate that the Xavante is a population that remained genetically isolated over the past decades and can offer advantages for genome-wide mapping studies of inherited disorders.
Resumo:
Pathogenic Leptospira is the etiological agent of leptospirosis, a life-threatening disease that affects populations worldwide. Surface proteins have the potential to promote several activities, including adhesion. This work aimed to study the leptospiral coding sequence (CDS) LIC11087, genome annotated as hypothetical outer membrane protein. The LIC11087 gene was cloned and expressed in Escherichia coil BL21 (DE3) strain by using the expression vector pAE. The recombinant protein tagged with N-terminal 6XHis was purified by metal-charged chromatography and characterized by circular dichroism (CD) spectroscopy. The recombinant protein has the ability to mediate attachment to the extracellular matrix (ECM) components, laminin and plasma fibronectin, and was named Lsa30 (Leptospiral surface adhesin of 30 kDa). Lsa30 binds to laminin and to plasma fibronectin in a dose-dependent and saturable manner, with dissociation equilibrium constants (K-D) of 292 +/- 24 nM and 157 +/- 35 nM, respectively. Moreover, the Lsa30 is a plasminogen (PLC) receptor, capable of generating plasmin, in the presence of activator. This protein may interfere with the complement cascade by interacting with C4bp regulator. The Lsa30 is probably a new surface protein of Leptospira as revealed by immunofluorescence assays with living organisms and the reactivity with antibodies present in serum samples of experimentally infected hamsters. Thus, Lsa30 is a novel versatile protein that may play a role in mediating adhesion and may help pathogenic Leptospira to overcome tissue barriers and to escape the immune system. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Intron splicing is one of the most important steps involved in the maturation process of a pre-mRNA. Although the sequence profiles around the splice sites have been studied extensively, the levels of sequence identity between the exonic sequences preceding the donor sites and the intronic sequences preceding the acceptor sites has not been examined as thoroughly. In this study we investigated identity patterns between the last 15 nucleotides of the exonic sequence preceding the 5' splice site and the intronic sequence preceding the 3' splice site in a set of human protein-coding genes that do not exhibit intron retention. We found that almost 60% of consecutive exons and introns in human protein-coding genes share at least two identical nucleotides at their 3' ends and, on average, the sequence identity length is 2.47 nucleotides. Based on our findings we conclude that the 3' ends of exons and introns tend to have longer identical sequences within a gene than when being taken from different genes. Our results hold even if the pairs are non-consecutive in the transcription order. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Abstract Background Plasmodium vivax is the most widely distributed human malaria, responsible for 70–80 million clinical cases each year and large socio-economical burdens for countries such as Brazil where it is the most prevalent species. Unfortunately, due to the impossibility of growing this parasite in continuous in vitro culture, research on P. vivax remains largely neglected. Methods A pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of P. vivax was performed. To do so, 1,184 clones from a cDNA library constructed with parasites obtained from 10 different human patients in the Brazilian Amazon were sequenced. Sequences were automatedly processed to remove contaminants and low quality reads. A total of 806 sequences with an average length of 586 bp met such criteria and their clustering revealed 666 distinct events. The consensus sequence of each cluster and the unique sequences of the singlets were used in similarity searches against different databases that included P. vivax, Plasmodium falciparum, Plasmodium yoelii, Plasmodium knowlesi, Apicomplexa and the GenBank non-redundant database. An E-value of <10-30 was used to define a significant database match. ESTs were manually assigned a gene ontology (GO) terminology Results A total of 769 ESTs could be assigned a putative identity based upon sequence similarity to known proteins in GenBank. Moreover, 292 ESTs were annotated and a GO terminology was assigned to 164 of them. Conclusion These are the first ESTs reported for P. vivax and, as such, they represent a valuable resource to assist in the annotation of the P. vivax genome currently being sequenced. Moreover, since the GC-content of the P. vivax genome is strikingly different from that of P. falciparum, these ESTs will help in the validation of gene predictions for P. vivax and to create a gene index of this malaria parasite.