926 resultados para Human genome, CpG islands, Markov models, DNA walk
Resumo:
The publication of a draft of the human genome and of large collections of transcribed sequences has made it possible to study the complex relationship between the transcriptome and the genome. In the work presented here, we have focused on mapping mRNA 3' ends onto the genome by use of the raw data generated by the expressed sequence tag (EST) sequencing projects. We find that at least half of the human genes encode multiple transcripts whose polyadenylation is driven by multiple signals. The corresponding transcript 3' ends are spread over distances in the kilobase range. This finding has profound implications for our understanding of gene expression regulation and of the diversity of human transcripts, for the design of cDNA microarray probes, and for the interpretation of gene expression profiling experiments.
Resumo:
The paper discusses maintenance challenges of organisations with a huge number of devices and proposes the use of probabilistic models to assist monitoring and maintenance planning. The proposal assumes connectivity of instruments to report relevant features for monitoring. Also, the existence of enough historical registers with diagnosed breakdowns is required to make probabilistic models reliable and useful for predictive maintenance strategies based on them. Regular Markov models based on estimated failure and repair rates are proposed to calculate the availability of the instruments and Dynamic Bayesian Networks are proposed to model cause-effect relationships to trigger predictive maintenance services based on the influence between observed features and previously documented diagnostics
Resumo:
Background: HSTL is a rare entity characterized by an infiltration of bone marrow, spleen and liver tissues by neoplastic gammadelta (gd) -more rarely alphabeta (ab)- T cells. Its pathogenesis is poorly understood. Our purpose was to identify the molecular signature of HSTL and explore molecular pathways implicated in its pathogenesis.Methods: Gene expression profiling and array CGH analysis of 10 HSTL samples (7gd, 3ab), 1 HSTL cell line (DERL2), 2 normal gd samples together with 16 peripheral T-cell lymphoma not otherwise specified (PTCL,NOS) and 7 nasal NK/T cell lymphomas were performed.Results: By unsupervised analysis, ab and gdHSTL clustered together remarkably separated from other lymphoma entities. Compared to PTCL, NOS, HSTL overexpresed genes encoding NK-associated molecules, oncogenes (VAV3) and the Sphingosine-1-phosphatase receptor 5 involved in cell trafficking. Compared to normal gd cells, HSTL overexpressed genes encoding NK-cell and multi drug resistance-associated molecules, transcription factors (RHOB), oncogenes (MAFB, FOS, JUN, VAV3) and the tyrosine kinase SYK whereas genes encoding cytotoxic molecules and the tumor suppressor gene AIM1 were among the most downregulated. By immunohistochemistry, SYK was demonstrated on HSTL cells with expression of its phosphorylated form in DERL2 cells by Western blot. Functional studies using a SYK inhibitor revealed a dose dependent increase of apoptotic DERL2 cells suggesting that SYK could be a candidate target for pharmacologic inhibition. Downexpression of AIM1 was validated by qRT-PCR. Methylation analysis of DERL2 genomic DNA treated by bisulfite demonstrated highly methylated CpG islands of AIM1. Genomic profiles confirmed recurrent isochromosome 7q (n=6/9) without alterations at 9q22 and 6q21 containing SYK and AIM1 genes, respectively.Conclusion: The current study identifies a distinct molecular signature for HSTL and highlights oncogenic pathways which offer rationale for exploring new therapeutic options such as SYK inhibitors. It supports the view of gd and ab HSTL as a single entity.
Resumo:
PURPOSE: We have investigated the expression and regulation of 15-hydroxyprostaglandin dehydrogenase (15-PGDH) in gastric cancer. EXPERIMENTAL DESIGN: Clinical gastric adenocarcinoma samples were analyzed by immunohistochemistry and quantitative real-time PCR for protein and mRNA expression of 15-PGDH and for methylation status of 15-PGDH promoter. The effects of interleukin-1beta (IL-1beta) and epigenetic mechanisms on 15-PGDH regulation were assessed in gastric cancer cell lines. RESULTS: In a gastric cancer cell line with a very low 15-PGDH expression (TMK-1), the 15-PGDH promoter was methylated and treatment with a demethylating agent 5-aza-2'-deoxycytidine restored 15-PGDH expression. In a cell line with a relatively high basal level of 15-PGDH (MKN-28), IL-1beta repressed expression of 15-PGDH mRNA and protein. This effect of IL-1beta was at least in part attributed to inhibition of 15-PGDH promoter activity. SiRNA-mediated knockdown of 15-PGDH resulted in strong increase of prostaglandin E(2) production in MKN-28 cells and increased cell growth of these cells by 31% in anchorage-independent conditions. In clinical gastric adenocarcinoma specimens, 15-PGDH mRNA levels were 5-fold lower in gastric cancer samples when compared with paired nonneoplastic tissues (n = 26) and 15-PGDH protein was lost in 65% of gastric adenocarcinomas (n = 210). CONCLUSIONS: 15-PGDH is down-regulated in gastric cancer, which could potentially lead to accelerated tumor progression. Importantly, our data indicate that a proinflammatory cytokine linked to gastric carcinogenesis, IL-1beta, suppresses 15-PGDH expression at least partially by inhibiting promoter activity of the 15-PGDH gene.
Resumo:
This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.
Resumo:
The completion of the sequencing of the mouse genome promises to help predict human genes with greater accuracy. While current ab initio gene prediction programs are remarkably sensitive (i.e., they predict at least a fragment of most genes), their specificity is often low, predicting a large number of false-positive genes in the human genome. Sequence conservation at the protein level with the mouse genome can help eliminate some of those false positives. Here we describe SGP2, a gene prediction program that combines ab initio gene prediction with TBLASTX searches between two genome sequences to provide both sensitive and specific gene predictions. The accuracy of SGP2 when used to predict genes by comparing the human and mouse genomes is assessed on a number of data sets, including single-gene data sets, the highly curated human chromosome 22 predictions, and entire genome predictions from ENSEMBL. Results indicate that SGP2 outperforms purely ab initio gene prediction methods. Results also indicate that SGP2 works about as well with 3x shotgun data as it does with fully assembled genomes. SGP2 provides a high enough specificity that its predictions can be experimentally verified at a reasonable cost. SGP2 was used to generate a complete set of gene predictions on both the human and mouse by comparing the genomes of these two species. Our results suggest that another few thousand human and mouse genes currently not in ENSEMBL are worth verifying experimentally.
Resumo:
The vast majority of the biology of a newly sequenced genome is inferred from the set of encoded proteins. Predicting this set is therefore invariably the first step after the completion of the genome DNA sequence. Here we review the main computational pipelines used to generate the human reference protein-coding gene sets.
Resumo:
The distribution of transposable elements (TEs) in a genome reflects a balance between insertion rate and selection against new insertions. Understanding the distribution of TEs therefore provides insights into the forces shaping the organization of genomes. Past research has shown that TEs tend to accumulate in genomic regions with low gene density and low recombination rate. However, little is known about the factors modulating insertion rates across the genome and their evolutionary significance. One candidate factor is gene expression, which has been suggested to increase local insertion rate by rendering DNA more accessible. We test this hypothesis by comparing the TE density around germline- and soma-expressed genes in the euchromatin of Drosophila melanogaster. Because only insertions that occur in the germline are transmitted to the next generation, we predicted a higher density of TEs around germline-expressed genes than soma-expressed genes. We show that the rate of TE insertions is greater near germline- than soma-expressed genes. However, this effect is partly offset by stronger selection for genome compactness (against excess noncoding DNA) on germline-expressed genes. We also demonstrate that the local genome organization in clusters of coexpressed genes plays a fundamental role in the genomic distribution of TEs. Our analysis shows that-in addition to recombination rate-the distribution of TEs is shaped by the interaction of gene expression and genome organization. The important role of selection for compactness sheds a new light on the role of TEs in genome evolution. Instead of making genomes grow passively, TEs are controlled by the forces shaping genome compactness, most likely linked to the efficiency of gene expression or its complexity and possibly their interaction with mechanisms of TE silencing.
Resumo:
BACKGROUND: Conserved non-coding sequences in the human genome are approximately tenfold more abundant than known genes, and have been hypothesized to mark the locations of cis-regulatory elements. However, the global contribution of conserved non-coding sequences to the transcriptional regulation of human genes is currently unknown. Deeply conserved elements shared between humans and teleost fish predominantly flank genes active during morphogenesis and are enriched for positive transcriptional regulatory elements. However, such deeply conserved elements account for <1% of the conserved non-coding sequences in the human genome, which are predominantly mammalian. RESULTS: We explored the regulatory potential of a large sample of these 'common' conserved non-coding sequences using a variety of classic assays, including chromatin remodeling, and enhancer/repressor and promoter activity. When tested across diverse human model cell types, we find that the fraction of experimentally active conserved non-coding sequences within any given cell type is low (approximately 5%), and that this proportion increases only modestly when considered collectively across cell types. CONCLUSIONS: The results suggest that classic assays of cis-regulatory potential are unlikely to expose the functional potential of the substantial majority of mammalian conserved non-coding sequences in the human genome.
Resumo:
Microsatellites are important highly polymorphic genetic markers dispersed in the human genome. Using a panel of 22 (CA)n repeat microsatellite markers mapped to recurrent breakpoint cluster regions specifically involved in leukemia, we investigated 114 adult leukemias (25 acute lymphocytic leukemia [ALL], 32 acute myeloid leukemia [AML], 36 chronic lymphocytic leukemia [CLL], and 21 chronic myeloid leukemia [CML] in chronic phase) for somatic mutations at these loci. In each patient, DNA from fresh leukemia samples was analyzed alongside normal constitutive DNA from buccal epithelium. We detected loss of heterozygosity (LOH) in 81 of 114 patients (ALL 16/25, AML 25/32, CLL 30/36, CML 10/21). Deletions were most often seen in ALL at 11q23 and 19p13; in AML at 8q22 and 11q23; in CLL at 13q14.3, 11q13, and 11q23; and in CML at 3q26. Only six deletions were reported in 74 karyotypes analyzed, whereas in these same cases, 91 LOH events were detected by microsatellites. Of 26 leukemias with a normal karyotype, 16 nevertheless showed at least one LOH by microsatellite analysis. Replication errors were found in 10 of 114 patients (8.8%). Thus, microsatellite instability is rare in leukemia in contrast to many solid tumors. Our findings suggest that in adult leukemia, LOH may be an important genetic event in addition to typical chromosomal translocations. LOH may point to the existence of tumor suppressor genes involved in leukemogenesis to a degree that has hitherto been underestimated.
Resumo:
Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation and suggest the potential to use large samples to uncover variation among closely spaced populations. Here we characterize genetic variation in a sample of 3,000 European individuals genotyped at over half a million variable DNA sites in the human genome. Despite low average levels of genetic differentiation among Europeans, we find a close correspondence between genetic and geographic distances; indeed, a geographical map of Europe arises naturally as an efficient two-dimensional summary of genetic variation in Europeans. The results emphasize that when mapping the genetic basis of a disease phenotype, spurious associations can arise if genetic structure is not properly accounted for. In addition, the results are relevant to the prospects of genetic ancestry testing; an individual's DNA can be used to infer their geographic origin with surprising accuracy-often to within a few hundred kilometres.
Resumo:
Transposable elements, as major components of most eukaryotic organisms' genomes, define their structural organization and plasticity. They supply host genomes with functional elements, for example, binding sites of the pleiotropic master transcription factor p53 were identified in LINE1, Alu and LTR repeats in the human genome. Similarly, in this report we reveal the role of zebrafish (Danio rerio) EnSpmN6_DR non-autonomous DNA transposon in shaping the repertoire of the p53 target genes. The multiple copies of EnSpmN6_DR and their embedded p53 responsive elements drive in several instances p53-dependent transcriptional modulation of the adjacent gene, whose human orthologs were frequently previously annotated as p53 targets. These transposons define predominantly a set of target genes whose human orthologs contribute to neuronal morphogenesis, axonogenesis, synaptic transmission and the regulation of programmed cell death. Consistent with these biological functions the orthologs of the EnSpmN6_DR-colonized loci are enriched for genes expressed in the amygdala, the hippocampus and the brain cortex. Our data pinpoint a remarkable example of convergent evolution: the exaptation of lineage-specific transposons to shape p53-regulated neuronal morphogenesis-related pathways in both a hominid and a teleost fish.
Resumo:
In ecology, "disease tolerance" is defined as an evolutionary strategy of hosts against pathogens, characterized by reduced or absent pathogenesis despite high pathogen load. To our knowledge, tolerance has to date not been quantified and disentangled from host resistance to disease in any clinically relevant human infection. Using data from the Swiss HIV Cohort Study, we investigated if there is variation in tolerance to HIV in humans and if this variation is associated with polymorphisms in the human genome. In particular, we tested for associations between tolerance and alleles of the Human Leukocyte Antigen (HLA) genes, the CC chemokine receptor 5 (CCR5), the age at which individuals were infected, and their sex. We found that HLA-B alleles associated with better HIV control do not confer tolerance. The slower disease progression associated with these alleles can be fully attributed to the extent of viral load reduction in carriers. However, we observed that tolerance significantly varies across HLA-B genotypes with a relative standard deviation of 34%. Furthermore, we found that HLA-B homozygotes are less tolerant than heterozygotes. Lastly, tolerance was observed to decrease with age, resulting in a 1.7-fold difference in disease progression between 20 and 60-y-old individuals with the same viral load. Thus, disease tolerance is a feature of infection with HIV, and the identification of the mechanisms involved may pave the way to a better understanding of pathogenesis.
Resumo:
Although increasing evidence suggests that CTL are important to fight the development of some cancers, the frequency of detectable tumor-specific T cells is low in cancer patients, and these cells have generally poor functional capacities, compared with virus-specific CD8(+) T cells. The generation with a vaccine of potent CTL responses against tumor Ags therefore remains a major challenge. In the present study, ex vivo analyses of Melan-A-specific CD8(+) T cells following vaccination with Melan-A peptide and CpG oligodeoxynucleotides revealed the successful induction in the circulation of effective melanoma-specific T cells, i.e., with phenotypic and functional characteristics similar to those of CTL specific for immunodominant viral Ags. Nonetheless, the eventual impact on tumor development in vaccinated melanoma donors remained limited. The comprehensive study of vaccinated patient metastasis shows that vaccine-driven tumor-infiltrating lymphocytes, although activated, still differed in functional capacities compared with blood counterparts. This coincided with a significant increase of FoxP3(+) regulatory T cell activity within the tumor. The consistent induction of effective tumor-specific CD8(+) T cells in the circulation with a vaccine represents a major achievement; however, clinical benefit may not be achieved unless the tumor environment can be altered to enable CD8(+) T cell efficacy.