30 resultados para The cancer genome atlas
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
Transposons are abundant components of eukaryotic genomes, and play important role in genome evolution. The knowledge about these elements should contribute to the understanding of their impact on the host genomes. The hAT transposon superfamily is one of the best characterized superfamilies in diverse organisms, nevertheless, a detailed study of these elements was never carried in sugarcane. To address this question we analyzed 32 cDNAs similar to that of hAT superfamily of transposons previously identified in the sugarcane transcriptome. Our results revealed that these hAT-like transposases cluster in one highly homogeneous and other more heterogeneous lineage. We present evidences that support the hypothesis that the highly homogeneous group is a domesticated transposase while the remainder of the lineages are composed of transposon units. The first is common to grasses, clusters significantly with domesticated transposases from Arabidopsis, rice and sorghum and is expressed in different tissues of two sugarcane cultivars analyzed. In contrast, the more heterogeneous group represents at least two transposon lineages. We recovered five genomic versions of one lineage, characterizing a novel transposon family with conserved DDE motif, named SChAT. These results indicate the presence of at least three distinct lineages of hAT-like transposase paralogues in sugarcane genome, including a novel transposon family described in Saccharum and a domesticated transposase. Taken together, these findings permit to follow the diversification of some hAT transposase paralogues in sugarcane, aggregating knowledge about the co-evolution of transposons and their host genomes.
Resumo:
The non-classical human leukocyte antigen (HLA) class I genes present a very low rate of variation. So far, only 10 HLA-E alleles encoding three proteins have been described, but only two are frequently found in worldwide populations. Because of its historical background, Brazilians are very suitable for population genetic studies. Therefore, 104 bone marrow donors from Brazil were evaluated for HLA-E exons 14. Seven variation sites were found, including two known single nucleotide polymorphisms (SNPs) at positions +424 and +756 and five new SNPs at positions +170 (intron 1), +1294 (intron 3), +1625, +1645 and +1857 (exon 4). Haplotyping analysis did show eight haplotypes, three of them known as E*01:01:01, E*01:03:01 and E*01:03:02:01 and five HLA-E new alleles that carry the new variation sites. The HLA-E*01:01:01 allele was the predominant haplotype (62.50%), followed by E*01:03:02:01 (24.52%). Selective neutrality tests have disclosed an interesting pattern of selective pressures in which balancing selection is probably shaping allele frequency distributions at an SNP at exon 3 (codon 107), sequence diversity at exon 4 and the non-coding regions is facing significant purifying pressure. Even in an admixed population such as the Brazilian one, the HLA-E locus is very conserved, presenting few polymorphic SNPs in the coding region.
Resumo:
Madrepora is one of the most ecologically important genera of reef-building scleractinians in the deep sea, occurring from tropical to high-latitude regions. Despite this, the taxonomic affinities and relationships within the genus Madrepora remain unclear. To clarify these issues, we sequenced the mitochondrial (mt) genome of the most widespread Madrepora species, M. oculata, and compared this with data for other scleractinians. The architecture of the M. oculara mt genome was very similar to that of other scleractinians, except for a novel gene rearrangement affecting only cox2 and cox3. This pattern of gene organization was common to four geographically distinct M. oculata individuals as well as the congeneric species M. minutiseptum, but was not shared by other genera that are closely related on the basis of cox1 sequence analysis nor other oculinids, suggesting that it might be unique to Madrepora. (C) 2012 Elsevier Inc. All rights reserved.
Resumo:
Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease’s etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
Resumo:
Abstract Background From shotgun libraries used for the genomic sequencing of the phytopathogenic bacterium Xanthomonas axonopodis pv. citri (XAC), clones that were representative of the largest possible number of coding sequences (CDSs) were selected to create a DNA microarray platform on glass slides (XACarray). The creation of the XACarray allowed for the establishment of a tool that is capable of providing data for the analysis of global genome expression in this organism. Findings The inserts from the selected clones were amplified by PCR with the universal oligonucleotide primers M13R and M13F. The obtained products were purified and fixed in duplicate on glass slides specific for use in DNA microarrays. The number of spots on the microarray totaled 6,144 and included 768 positive controls and 624 negative controls per slide. Validation of the platform was performed through hybridization of total DNA probes from XAC labeled with different fluorophores, Cy3 and Cy5. In this validation assay, 86% of all PCR products fixed on the glass slides were confirmed to present a hybridization signal greater than twice the standard deviation of the deviation of the global median signal-to-noise ration. Conclusions Our validation of the XACArray platform using DNA-DNA hybridization revealed that it can be used to evaluate the expression of 2,365 individual CDSs from all major functional categories, which corresponds to 52.7% of the annotated CDSs of the XAC genome. As a proof of concept, we used this platform in a previously work to verify the absence of genomic regions that could not be detected by sequencing in related strains of Xanthomonas.
Resumo:
Abstract Background Recent medical and biological technology advances have stimulated the development of new testing systems that have been providing huge, varied amounts of molecular and clinical data. Growing data volumes pose significant challenges for information processing systems in research centers. Additionally, the routines of genomics laboratory are typically characterized by high parallelism in testing and constant procedure changes. Results This paper describes a formal approach to address this challenge through the implementation of a genetic testing management system applied to human genome laboratory. We introduced the Human Genome Research Center Information System (CEGH) in Brazil, a system that is able to support constant changes in human genome testing and can provide patients updated results based on the most recent and validated genetic knowledge. Our approach uses a common repository for process planning to ensure reusability, specification, instantiation, monitoring, and execution of processes, which are defined using a relational database and rigorous control flow specifications based on process algebra (ACP). The main difference between our approach and related works is that we were able to join two important aspects: 1) process scalability achieved through relational database implementation, and 2) correctness of processes using process algebra. Furthermore, the software allows end users to define genetic testing without requiring any knowledge about business process notation or process algebra. Conclusions This paper presents the CEGH information system that is a Laboratory Information Management System (LIMS) based on a formal framework to support genetic testing management for Mendelian disorder studies. We have proved the feasibility and showed usability benefits of a rigorous approach that is able to specify, validate, and perform genetic testing using easy end user interfaces.
Resumo:
Abstract Background Pancreatic ductal adenocarcinoma (PDAC) is known by its aggressiveness and lack of effective therapeutic options. Thus, improvement in current knowledge of molecular changes associated with pancreatic cancer is urgently needed to explore novel venues of diagnostics and treatment of this dismal disease. While there is mounting evidence that long noncoding RNAs (lncRNAs) transcribed from intronic and intergenic regions of the human genome may play different roles in the regulation of gene expression in normal and cancer cells, their expression pattern and biological relevance in pancreatic cancer is currently unknown. In the present work we investigated the relative abundance of a collection of lncRNAs in patients' pancreatic tissue samples aiming at identifying gene expression profiles correlated to pancreatic cancer and metastasis. Methods Custom 3,355-element spotted cDNA microarray interrogating protein-coding genes and putative lncRNA were used to obtain expression profiles from 38 clinical samples of tumor and non-tumor pancreatic tissues. Bioinformatics analyses were performed to characterize structure and conservation of lncRNAs expressed in pancreatic tissues, as well as to identify expression signatures correlated to tissue histology. Strand-specific reverse transcription followed by PCR and qRT-PCR were employed to determine strandedness of lncRNAs and to validate microarray results, respectively. Results We show that subsets of intronic/intergenic lncRNAs are expressed across tumor and non-tumor pancreatic tissue samples. Enrichment of promoter-associated chromatin marks and over-representation of conserved DNA elements and stable secondary structure predictions suggest that these transcripts are generated from independent transcriptional units and that at least a fraction is under evolutionary selection, and thus potentially functional. Statistically significant expression signatures comprising protein-coding mRNAs and lncRNAs that correlate to PDAC or to pancreatic cancer metastasis were identified. Interestingly, loci harboring intronic lncRNAs differentially expressed in PDAC metastases were enriched in genes associated to the MAPK pathway. Orientation-specific RT-PCR documented that intronic transcripts are expressed in sense, antisense or both orientations relative to protein-coding mRNAs. Differential expression of a subset of intronic lncRNAs (PPP3CB, MAP3K14 and DAPK1 loci) in metastatic samples was confirmed by Real-Time PCR. Conclusion Our findings reveal sets of intronic lncRNAs expressed in pancreatic tissues whose abundance is correlated to PDAC or metastasis, thus pointing to the potential relevance of this class of transcripts in biological processes related to malignant transformation and metastasis in pancreatic cancer.
Resumo:
Abstract Background The implication of post-transcriptional regulation by microRNAs in molecular mechanisms underlying cancer disease is well documented. However, their interference at the cellular level is not fully explored. Functional in vitro studies are fundamental for the comprehension of their role; nevertheless results are highly dependable on the adopted cellular model. Next generation small RNA transcriptomic sequencing data of a tumor cell line and keratinocytes derived from primary culture was generated in order to characterize the microRNA content of these systems, thus helping in their understanding. Both constitute cell models for functional studies of microRNAs in head and neck squamous cell carcinoma (HNSCC), a smoking-related cancer. Known microRNAs were quantified and analyzed in the context of gene regulation. New microRNAs were investigated using similarity and structural search, ab initio classification, and prediction of the location of mature microRNAs within would-be precursor sequences. Results were compared with small RNA transcriptomic sequences from HNSCC samples in order to access the applicability of these cell models for cancer phenotype comprehension and for novel molecule discovery. Results Ten miRNAs represented over 70% of the mature molecules present in each of the cell types. The most expressed molecules were miR-21, miR-24 and miR-205, Accordingly; miR-21 and miR-205 have been previously shown to play a role in epithelial cell biology. Although miR-21 has been implicated in cancer development, and evaluated as a biomarker in HNSCC progression, no significant expression differences were seen between cell types. We demonstrate that differentially expressed mature miRNAs target cell differentiation and apoptosis related biological processes, indicating that they might represent, with acceptable accuracy, the genetic context from which they derive. Most miRNAs identified in the cancer cell line and in keratinocytes were present in tumor samples and cancer-free samples, respectively, with miR-21, miR-24 and miR-205 still among the most prevalent molecules at all instances. Thirteen miRNA-like structures, containing reads identified by the deep sequencing, were predicted from putative miRNA precursor sequences. Strong evidences suggest that one of them could be a new miRNA. This molecule was mostly expressed in the tumor cell line and HNSCC samples indicating a possible biological function in cancer. Conclusions Critical biological features of cells must be fully understood before they can be chosen as models for functional studies. Expression levels of miRNAs relate to cell type and tissue context. This study provides insights on miRNA content of two cell models used for cancer research. Pathways commonly deregulated in HNSCC might be targeted by most expressed and also by differentially expressed miRNAs. Results indicate that the use of cell models for cancer research demands careful assessment of underlying molecular characteristics for proper data interpretation. Additionally, one new miRNA-like molecule with a potential role in cancer was identified in the cell lines and clinical samples.
Resumo:
Background: Malaria caused by Plasmodium vivax is an experimentally neglected severe disease with a substantial burden on human health. Because of technical limitations, little is known about the biology of this important human pathogen. Whole genome analysis methods on patient-derived material are thus likely to have a substantial impact on our understanding of P. vivax pathogenesis and epidemiology. For example, it will allow study of the evolution and population biology of the parasite, allow parasite transmission patterns to be characterized, and may facilitate the identification of new drug resistance genes. Because parasitemias are typically low and the parasite cannot be readily cultured, on-site leukocyte depletion of blood samples is typically needed to remove human DNA that may be 1000X more abundant than parasite DNA. These features have precluded the analysis of archived blood samples and require the presence of laboratories in close proximity to the collection of field samples for optimal pre-cryopreservation sample preparation. Results: Here we show that in-solution hybridization capture can be used to extract P. vivax DNA from human contaminating DNA in the laboratory without the need for on-site leukocyte filtration. Using a whole genome capture method, we were able to enrich P. vivax DNA from bulk genomic DNA from less than 0.5% to a median of 55% (range 20%-80%). This level of enrichment allows for efficient analysis of the samples by whole genome sequencing and does not introduce any gross biases into the data. With this method, we obtained greater than 5X coverage across 93% of the P. vivax genome for four P. vivax strains from Iquitos, Peru, which is similar to our results using leukocyte filtration (greater than 5X coverage across 96% of the genome). Conclusion: The whole genome capture technique will enable more efficient whole genome analysis of P. vivax from a larger geographic region and from valuable archived sample collections.
Resumo:
The research intended to analyze the adoption process of the green certification "Leadership in Energy and Environmental Design" (LEED) from the hotel sector establishments that has already adopted it. For its concretization it was proceeded a bibliographical research, secondary fact-gathering in journals, institutional sites and documentaries, and primary fact-gathering by means of semi structured interviews carried out with responsible people of the certified hotels and of the responsible entity of the certification in Brazil (Green Building Council Brazil). There were 21 interviewee, being 02 of the GBC Brazil and 19 of means of lodging (31% of the certified). For data analysis, it was utilized content analysis technique with the aid of ATLAS.ti software. The results permitted to identify the chronology of the processes of certification and the profile of the hotel categories that adopt the LEED program. Beyond that, the interviews enabled the discussion of the initial motivations for seeking the certification, as well the advantages and the obstacles perceived regarding its adoption.
Resumo:
Managed environments in the form of well watered and water stressed trials were performed to study the genetic basis of grain yield and stay green in sorghum with the objective of validating previously detected QTL. As variations in phenology and plant height may influence QTL detection for the target traits, QTL for flowering time and plant height were introduced as cofactors in QTL analyses for yield and stay green. All but one of the flowering time QTL were detected near yield and stay green QTL. Similar co-localization was observed for two plant height QTL. QTL analysis for yield, using flowering time/plant height cofactors, led to yield QTL on chromosomes 2, 3, 6, 8 and 10. For stay green, QTL on chromosomes 3, 4, 8 and 10 were not related to differences in flowering time/plant height. The physical positions for markers in QTL regions projected on the sorghum genome suggest that the previously detected plant height QTL, Sb-HT9-1, and Dw2, in addition to the maturity gene, Ma5, had a major confounding impact on the expression of yield and stay green QTL. Co-localization between an apparently novel stay green QTL and a yield QTL on chromosome 3 suggests there is potential for indirect selection based on stay green to improve drought tolerance in sorghum. Our QTL study was carried out with a moderately sized population and spanned a limited geographic range, but still the results strongly emphasize the necessity of corrections for phenology in QTL mapping for drought tolerance traits in sorghum.
Resumo:
Modern sugarcane cultivars are complex hybrids resulting from crosses among several Saccharum species. Traditional breeding methods have been employed extensively in different countries over the past decades to develop varieties with increased sucrose yield and resistance to pests and diseases. Conventional variety improvement, however, may be limited by the narrow pool of suitable genes. Thus, molecular genetics is seen as a promising tool to assist in the process of developing improved varieties. The SUCEST-FUN Project (http://sucest-fun.org) aims to associate function with sugarcane genes using a variety of tools, in particular those that enable the study of the sugarcane transcriptome. An extensive analysis has been conducted to characterise, phenotypically, sugarcane genotypes with regard to their sucrose content, biomass and drought responses. Through the analysis of different cultivars, genes associated with sucrose content, yield, lignin and drought have been identified. Currently, tools are being developed to determine signalling and regulatory networks in grasses, and to sequence the sugarcane genome, as well as to identify sugarcane promoters. This is being implemented through the SUCEST-FUN (http://sucest-fun.org) and GRASSIUS databases (http://grassius.org), the cloning of sugarcane promoters, the identification of cis-regulatory elements (CRE) using Chromatin Immunoprecipitation-sequencing (ChIP-Seq) and the generation of a comprehensive Signal Transduction and Transcription gene catalogue (SUCAST Catalogue).
Resumo:
The Epstein-Barr virus (EBV) is associated with a large spectrum of lymphoproliferative diseases. Traditional methods of EBV detection include the immunohistochemical identification of viral proteins and DNA probes to the viral genome in tumoral tissue. The present study explored the detection of the EBV genome, using the BALF5 gene, in the bone marrow or blood mononuclear cells of patients with diffuse large B-cell lymphomas (DLBCL) and related its presence to the clinical variables and risk factors. The results show that EBV detection in 21.5% of patients is not associated with age, gender, staging, B symptoms, international prognostic index scores or any analytical parameters, including lactate dehydrogenase (LDH) or beta-2 microglobulin (B2M). The majority of patients were treated with R-CHOP-like (rituximab. cyclophosphamide, doxorubicin, vincristine and prednisolone or an equivalent combination) and some with CHOP-like chemotherapy. Response rates [complete response (CR) + partial response (PR)] were not significantly different between EBV-negative and -positive cases, with 93.2 and 88.9%, respectively. The survival rate was also similar in the two groups, with 5-year overall survival (OS) rates of 64.3 and 76.7%, respectively. However, when analyzing the treatment groups separately there was a trend in EBV-positive patients for a worse prognosis in patients treated with CHOP-like regimens that was not identified in patients treated with R-CHOP-like regimens. We conclude that EBV detection in the bone marrow and blood mononuclear cells of DLBC patients has the same frequency of EBV detection on tumoral lymphoma tissue but is not associated with the risk factors, response rate and survival in patients treated mainly with immunochemotherapy plus rituximab. These results also suggest that the addition of rituximab to chemotherapy improves the prognosis associated with EBV detection in DLBCL.
Resumo:
Several extensions of the standard model predict the existence of new neutral spin-1 resonances associated with the electroweak symmetry breaking sector. Using the data from ATLAS (with integrated luminosity of L = 1.02 fb(-1)) and CMS (with integrated luminosity of L = 1.55 fb(-1)) on the production of W+W- pairs through the process pp --> l(+)l(-)' is not an element of(T), we place model independent bounds on these new vector resonances masses, couplings, and widths. Our analyses show that the present data exclude new neutral vector resonances with masses up to 1-2.3 TeV depending on their couplings and widths. We also demonstrate how to extend our analysis framework to different models with a specific example.
Resumo:
This study aimed to identify the CD24 and CD44 immunophenotypes within invasive ductal breast carcinoma (I DC) subgroups defined by immunohistochesmistry markers and to determine its influence on prognosis as well as its association with the expression of Ki-67, cytokeratins (CK5 and CK 18) and claudin-7. Immunohistochemical expression of CD44 and CD24 alone or in combination was investigated in 95 IDC cases arranged in a tissue microarray (TMA). The association with subgroups defined as luminal A and B; HER2 rich and triple negative, or with the other markers and prognosis was analyzed. CD44(+)/CD24(-) and CD44(-)/CD24(+) were respectively present in 8.4% and 16.8% of the tumors, a lack of both proteins was detected in 6.3%, while CD441(-)/CD24(+) was observed in 45.3% of the tumors. Although there was no significant correlation between subgroups and different phenotypes, the CD44(+)/CD24(-) phenotype was more common in the basal subgroups but absent in HER2 tumors, whereas luminal tumors are enriched in CD44(-)/CD24(+) and CD44(+)/CD24(+) cells. The frequency of CD44(+)/CD24(-) or CD44(-)/CD24(+) was not associated with clinical characteristics or biological markers. There was also no significant association of these phenotypes with the event free (DFS) and overall survival (OS). Single CD44(+) was evident in 57.9% of the tumors and was marginally associated to grading and not to any other tumor characteristics as well as OS and DFS. CD24(+) was positive in 74.7% of the tumors, showing a significant association with estrogen receptor, progesterone receptor and Ki-67 and a marginal association with CKI8 and claudin-7. Expression of claudin-7 and Ki-67 did not associate with the cancer subgroups, while a positive association between CK18 and the luminal subgroups was found (P=0.03). CK5, CK18 and Ki-67 expression had no influence in OS or DFS. Single CD24(+) (P=0.07) and claudin-7 positivity (P=0.05) were associated with reduced time of recurrence, suggesting a contribution of these markers to aggressiveness of breast cancer.