945 resultados para Expression Data


Relevância:

70.00% 70.00%

Publicador:

Resumo:

Mood stabilising drugs such as lithium (LiCl) and valproic acid (VPA) are the first line agents for treating conditions such as Bipolar disorder and Epilepsy. However, these drugs have potential developmental effects that are not fully understood. This study explores the use of a simple human neurosphere-based in vitro model to characterise the pharmacological and toxicological effects of LiCl and VPA using gene expression changes linked to phenotypic alterations in cells. Treatment with VPA and LiCl resulted in the differential expression of 331 and 164 genes respectively. In the subset of VPA targeted genes, 114 were downregulated whilst 217 genes were upregulated. In the subset of LiCl targeted genes, 73 were downregulated and 91 were upregulated. Gene ontology (GO) term enrichment analysis was used to highlight the most relevant GO terms associated with a given gene list following toxin exposure. In addition, in order to phenotypically anchor the gene expression data, changes in the heterogeneity of cell subtype populations and cell cycle phase were monitored using flow cytometry. Whilst LiCl exposure did not significantly alter the proportion of cells expressing markers for stem cells/undifferentiated cells (Oct4, SSEA4), neurons (Neurofilament M), astrocytes (GFAP) or cell cycle phase, the drug caused a 1.4-fold increase in total cell number. In contrast, exposure to VPA resulted in significant upregulation of Oct4, SSEA, Neurofilament M and GFAP with significant decreases in both G2/M phase cells and cell number. This neurosphere model might provide the basis of a human-based cellular approach for the regulatory exploration of developmental impact of potential toxic chemicals.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Constant technology advances have caused data explosion in recent years. Accord- ingly modern statistical and machine learning methods must be adapted to deal with complex and heterogeneous data types. This phenomenon is particularly true for an- alyzing biological data. For example DNA sequence data can be viewed as categorical variables with each nucleotide taking four different categories. The gene expression data, depending on the quantitative technology, could be continuous numbers or counts. With the advancement of high-throughput technology, the abundance of such data becomes unprecedentedly rich. Therefore efficient statistical approaches are crucial in this big data era.

Previous statistical methods for big data often aim to find low dimensional struc- tures in the observed data. For example in a factor analysis model a latent Gaussian distributed multivariate vector is assumed. With this assumption a factor model produces a low rank estimation of the covariance of the observed variables. Another example is the latent Dirichlet allocation model for documents. The mixture pro- portions of topics, represented by a Dirichlet distributed variable, is assumed. This dissertation proposes several novel extensions to the previous statistical methods that are developed to address challenges in big data. Those novel methods are applied in multiple real world applications including construction of condition specific gene co-expression networks, estimating shared topics among newsgroups, analysis of pro- moter sequences, analysis of political-economics risk data and estimating population structure from genotype data.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Multiple myeloma is characterized by genomic alterations frequently involving gains and losses of chromosomes. Single nucleotide polymorphism (SNP)-based mapping arrays allow the identification of copy number changes at the sub-megabase level and the identification of loss of heterozygosity (LOH) due to monosomy and uniparental disomy (UPD). We have found that SNP-based mapping array data and fluorescence in situ hybridization (FISH) copy number data correlated well, making the technique robust as a tool to investigate myeloma genomics. The most frequently identified alterations are located at 1p, 1q, 6q, 8p, 13, and 16q. LOH is found in these large regions and also in smaller regions throughout the genome with a median size of 1 Mb. We have identified that UPD is prevalent in myeloma and occurs through a number of mechanisms including mitotic nondisjunction and mitotic recombination. For the first time in myeloma, integration of mapping and expression data has allowed us to reduce the complexity of standard gene expression data and identify candidate genes important in both the transition from normal to monoclonal gammopathy of unknown significance (MGUS) to myeloma and in different subgroups within myeloma. We have documented these genes, providing a focus for further studies to identify and characterize those that are key in the pathogenesis of myeloma.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Background: Expressed Sequence Tags (ESTs) are in general used to gain a first insight into gene activities from a species of interest. Subsequently, and typically based on a combination of EST and genome sequences, microarray-based expression analyses are performed for a variety of conditions. In some cases, a multitude of EST and microarray experiments are conducted for one species, covering different tissues, cell states, and cell types. Under these circumstances, the challenge arises to combine results derived from the different expression profiling strategies, with the goal to uncover novel information on the basis of the integrated datasets. Findings: Using our new analysis tool, MediPlEx (MEDIcago truncatula multiPLe EXpression analysis), expression data from EST experiments, oligonucleotide microarrays and Affymetrix GeneChips® can be combined and analyzed, leading to a novel approach to integrated transcriptome analysis. We have validated our tool via the identification of a set of well-characterized AM-specific and AM-induced marker genes, identified by MediPlEx on the basis of in silico and experimental gene expression profiles from roots colonized with AM fungi. Conclusions: MediPlEx offers an integrated analysis pipeline for different sets of expression data generated for the model legume Medicago truncatula. As expected, in silico and experimental gene expression data that cover the same biological condition correlate well. The collection of differentially expressed genes identified via MediPlEx provides a starting point for functional studies in plant mutants.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Non-coding RNAs (ncRNAs) were recently given much higher attention due to technical advances in sequencing which expanded the characterization of transcriptomes in different organisms. ncRNAs have different lengths (22 nt to >1, 000 nt) and mechanisms of action that essentially comprise a sophisticated gene expression regulation network. Recent publication of schistosome genomes and transcriptomes has increased the description and characterization of a large number of parasite genes. Here we review the number of predicted genes and the coverage of genomic bases in face of the public ESTs dataset available, including a critical appraisal of the evidence and characterization of ncRNAs in schistosomes. We show expression data for ncRNAs in Schistosoma mansoni. We analyze three different microarray experiment datasets: (1) adult worms' large-scale expression measurements; (2) differentially expressed S. mansoni genes regulated by a human cytokine (TNF-α) in a parasite culture; and (3) a stage-specific expression of ncRNAs. All these data point to ncRNAs involved in different biological processes and physiological responses that suggest functionality of these new players in the parasite's biology. Exploring this world is a challenge for the scientists under a new molecular perspective of host-parasite interactions and parasite development.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Macro- and microarrays are well-established technologies to determine gene functions through repeated measurements of transcript abundance. We constructed a chicken skeletal muscle-associated array based on a muscle-specific EST database, which was used to generate a tissue expression dataset of similar to 4500 chicken genes across 5 adult tissues (skeletal muscle, heart, liver, brain, and skin). Only a small number of ESTs were sufficiently well characterized by BLAST searches to determine their probable cellular functions. Evidence of a particular tissue-characteristic expression can be considered an indication that the transcript is likely to be functionally significant. The skeletal muscle macroarray platform was first used to search for evidence of tissue-specific expression, focusing on the biological function of genes/transcripts, since gene expression profiles generated across tissues were found to be reliable and consistent. Hierarchical clustering analysis revealed consistent clustering among genes assigned to 'developmental growth', such as the ontology genes and germ layers. Accuracy of the expression data was supported by comparing information from known transcripts and tissue from which the transcript was derived with macroarray data. Hybridization assays resulted in consistent tissue expression profile, which will be useful to dissect tissue-regulatory networks and to predict functions of novel genes identified after extensive sequencing of the genomes of model organisms. Screening our skeletal-muscle platform using 5 chicken adult tissues allowed us identifying 43 'tissue-specific' transcripts, and 112 co-expressed uncharacterized transcripts with 62 putative motifs. This platform also represents an important tool for functional investigation of novel genes; to determine expression pattern according to developmental stages; to evaluate differences in muscular growth potential between chicken lines, and to identify tissue-specific genes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background -: Sucrose content is a highly desirable trait in sugarcane as the worldwide demand for cost-effective biofuels surges. Sugarcane cultivars differ in their capacity to accumulate sucrose and breeding programs routinely perform crosses to identify genotypes able to produce more sucrose. Sucrose content in the mature internodes reach around 20% of the culms dry weight. Genotypes in the populations reflect their genetic program and may display contrasting growth, development, and physiology, all of which affect carbohydrate metabolism. Few studies have profiled gene expression related to sugarcane's sugar content. The identification of signal transduction components and transcription factors that might regulate sugar accumulation is highly desirable if we are to improve this characteristic of sugarcane plants. Results -: We have evaluated thirty genotypes that have different Brix (sugar) levels and identified genes differentially expressed in internodes using cDNA microarrays. These genes were compared to existing gene expression data for sugarcane plants subjected to diverse stress and hormone treatments. The comparisons revealed a strong overlap between the drought and sucrose-content datasets and a limited overlap with ABA signaling. Genes associated with sucrose content were extensively validated by qRT-PCR, which highlighted several protein kinases and transcription factors that are likely to be regulators of sucrose accumulation. The data also indicate that aquaporins, as well as lignin biosynthesis and cell wall metabolism genes, are strongly related to sucrose accumulation. Moreover, sucrose-associated genes were shown to be directly responsive to short term sucrose stimuli, confirming their role in sugar-related pathways. Conclusion -: Gene expression analysis of sugarcane populations contrasting for sucrose content indicated a possible overlap with drought and cell wall metabolism processes and suggested signaling and transcriptional regulators to be used as molecular markers in breeding programs. Transgenic research is necessary to further clarify the role of the genes and define targets useful for sugarcane improvement programs based on transgenic plants.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: The post-genomic era has brought new challenges regarding the understanding of the organization and function of the human genome. Many of these challenges are centered on the meaning of differential gene regulation under distinct biological conditions and can be performed by analyzing the Multiple Differential Expression (MDE) of genes associated with normal and abnormal biological processes. Currently MDE analyses are limited to usual methods of differential expression initially designed for paired analysis. Results: We proposed a web platform named ProbFAST for MDE analysis which uses Bayesian inference to identify key genes that are intuitively prioritized by means of probabilities. A simulated study revealed that our method gives a better performance when compared to other approaches and when applied to public expression data, we demonstrated its flexibility to obtain relevant genes biologically associated with normal and abnormal biological processes. Conclusions: ProbFAST is a free accessible web-based application that enables MDE analysis on a global scale. It offers an efficient methodological approach for MDE analysis of a set of genes that are turned on and off related to functional information during the evolution of a tumor or tissue differentiation. ProbFAST server can be accessed at http://gdm.fmrp.usp.br/probfast.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: Claudin-4 (CLDN4) is one of several proteins that act as molecular mediators of embryo implantation. Recently, we examined immunolabeling of leukemia inhibitory factor (LIF) in the endometrial tissue of 52 IVF patients, and found that LIF staining intensity was strongly correlated with successful pregnancy initiation. In the same set of patients, we have now examined endometrial CLDN4 expression, to see how expression intensity may vary with LIF. We examined CLDN4 in the luteal phase of the menstrual cycle, immediately preceding IVF treatment. Our aim was to compare expression of LIF and CLDN4 in the luteal phase, and document these patterns as putative biomarkers for pregnancy. Methods: Endometrial tissue was collected from women undergoing IVF. Endometrial biopsies were obtained during the luteal phase preceding IVF, and were then used for tissue microarray (TMA) immunolabeling of CLDN4. Previously published LIF expression data were then combined with CLDN4 expression data, to determine CLDN4/LIF expression patterns. Associations between successful pregnancy after IVF and combined CLDN4/LIF expression patterns were evaluated. Results: Four patterns of immunolabeling were observed in the endometrial samples: 16% showed weak CLDN4 and strong LIF (CLDN4(-)/LIF(+)); 20% showed strong CLDN4 and strong LIF (LIF(+)/CLDN4(+)); 28% showed strong CLDN4 and weak LIF (CLDN4(+)/LIF(-)); and 36% showed weak CLDN4 and weak LIF (CLDN4(-)/LIF(-)). Successful implantation after IVF was associated with CLDN4(-)/LIF(+)(p = 0.003). Patients showing this endometrial CLDN4(-)/LIF(+) immunolabeling were also 6 times more likely to achieve pregnancy than patients with endometrial CLDN4(+)/LIF(-) immunolabeling (p = 0.007). Conclusion: The combined immunolabeling expression of CLDN4(-)/LIF(+) in endometrial tissue is a potential biomarker for predicting successful pregnancy in IVF candidates.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: Cancer shows a great diversity in its clinical behavior which cannot be easily predicted using the currently available clinical or pathological markers. The identification of pathways associated with lymph node metastasis (N+) and recurrent head and neck squamous cell carcinoma (HNSCC) may increase our understanding of the complex biology of this disease. Methods: Tumor samples were obtained from untreated HNSCC patients undergoing surgery. Patients were classified according to pathologic lymph node status (positive or negative) or tumor recurrence (recurrent or non-recurrent tumor) after treatment (surgery with neck dissection followed by radiotherapy). Using microarray gene expression, we screened tumor samples according to modules comprised by genes in the same pathway or functional category. Results: The most frequent alterations were the repression of modules in negative lymph node (N0) and in non-recurrent tumors rather than induction of modules in N+ or in recurrent tumors. N0 tumors showed repression of modules that contain cell survival genes and in non-recurrent tumors cell-cell signaling and extracellular region modules were repressed. Conclusions: The repression of modules that contain cell survival genes in N0 tumors reinforces the important role that apoptosis plays in the regulation of metastasis. In addition, because tumor samples used here were not microdissected, tumor gene expression data are represented together with the stroma, which may reveal signaling between the microenvironment and tumor cells. For instance, in non-recurrent tumors, extracellular region module was repressed, indicating that the stroma and tumor cells may have fewer interactions, which disable metastasis development. Finally, the genes highlighted in our analysis can be implicated in more than one pathway or characteristic, suggesting that therapeutic approaches to prevent tumor progression should target more than one gene or pathway, specially apoptosis and interactions between tumor cells and the stroma.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: There are several studies in the literature depicting measurement error in gene expression data and also, several others about regulatory network models. However, only a little fraction describes a combination of measurement error in mathematical regulatory networks and shows how to identify these networks under different rates of noise. Results: This article investigates the effects of measurement error on the estimation of the parameters in regulatory networks. Simulation studies indicate that, in both time series (dependent) and non-time series (independent) data, the measurement error strongly affects the estimated parameters of the regulatory network models, biasing them as predicted by the theory. Moreover, when testing the parameters of the regulatory network models, p-values computed by ignoring the measurement error are not reliable, since the rate of false positives are not controlled under the null hypothesis. In order to overcome these problems, we present an improved version of the Ordinary Least Square estimator in independent (regression models) and dependent (autoregressive models) data when the variables are subject to noises. Moreover, measurement error estimation procedures for microarrays are also described. Simulation results also show that both corrected methods perform better than the standard ones (i.e., ignoring measurement error). The proposed methodologies are illustrated using microarray data from lung cancer patients and mouse liver time series data. Conclusions: Measurement error dangerously affects the identification of regulatory network models, thus, they must be reduced or taken into account in order to avoid erroneous conclusions. This could be one of the reasons for high biological false positive rates identified in actual regulatory network models.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Phenylalanine hydroxylase (PAH) is the enzyme that converts phenylalanine to tyrosine as a rate-limiting step in phenylalanine catabolism and protein and neurotransmitter biosynthesis. Over 300 mutations have been identified in the gene encoding PAH that result in a deficient enzyme activity and lead to the disorders hyperphenylalaninaemia and phenylketonuria. The determination of the crystal structure of PAH now allows the determination of the structural basis of mutations resulting in PAH deficiency. We present an analysis of the structural basis of 120 mutations with a 'classified' biochemical phenotype and/or available in vitro expression data. We find that the mutations can be grouped into five structural categories, based on the distinct expected structural and functional effects of the mutations in each category. Missense mutations and small amino acid deletions are found in three categories:'active site mutations', 'dimer interface mutations', and 'domain structure mutations'. Nonsense mutations and splicing mutations form the category of 'proteins with truncations and large deletions'. The final category, 'fusion proteins', is caused by frameshift mutations. We show that the structural information helps formulate some rules that will help predict the likely effects of unclassified and newly discovered mutations: proteins with truncations and large deletions, fusion proteins and active site mutations generally cause severe phenotypes; domain structure mutations and dimer interface mutations spread over a range of phenotypes, but domain structure mutations in the catalytic domain are more likely to be severe than domain structure mutations in the regulatory domain or dimer interface mutations.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

To evaluate differential gene expression in penile tissue after treatment with the phosphodiesterase 5 (PDE5) inhibitor tadalafil, as of the three clinically available PDE5 inhibitors (sildenafil, tadalafil, and vardenafil) used for the treatment of erectile dysfunction (ED), tadalafil has a long half-life and low incidence of side-effects. In all, 32 adult rats were divided into two groups. The control group received 0.5 mL of drinking water alone, while the tadalafil group was treated with tadalafil at a dose of 0.27 mg/kg. At 4 h after treatment with water or tadalafil the rats were killed and the penile tissue was removed. The total RNA was isolated from the penile tissue from both groups and differentially expressed genes were identified by cDNA microarray analysis. To validate the expression data from the microarray analysis, quantitative real-time polymerase chain reaction (PCR) and immunohistochemistry were used. In all, 153 genes were differentially expressed between the control group and the tadalafil group. We validated the microarray results by quantitative PCR for the insulin-like growth factor binding protein 6 (IGFBP-6) gene and the neuronal calcium sensor 1 (NCS-1) gene, both of which were up-regulated in the tadalafil group, and for the natriuretic peptide receptor 1 (NPR-1) gene that was down-regulated in this group. Immunohistochemistry showed localization of the NCS-1 protein in sinusoid trabeculae of the corpus cavernosum in control and tadalafil-treated rats. There was differential expression in 153 genes after tadalafil treatment. Some of these genes such as IGFBP-6, NPR-1 and NCS-1, might result in new targets in the treatment of ED.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Diversas pesquisas apontam um contínuo desuso dos clíticos de 3ª pessoa (lhe, o, a, os, as e suas variantes) na modalidade oral do Português Brasileiro (PB), sendo substituídos pelo pronome reto, pelo objeto nulo ou pela repetição do sintagma a que eles fazem referência. Por outro lado, os clíticos de 3ª pessoa aparecem comumente em textos escritos veiculados pela mídia, bem como nos estilos mais formais da língua falada, o que faz com que esses elementos constem dos programas da disciplina Língua Portuguesa, nas escolas brasileiras. Dessa feita, esta pesquisa tem por objetivo descrever esse processo de aprendizagem do ponto de vista linguístico e social, procurando investigar como as crianças interpretam o uso dessas formas – e, por conseguinte, da variedade padrão da língua –, e se e/ou como a escola favorece a aprendizagem dessas formas. Para respondermos a essas perguntas, procedemos a uma pesquisa sociolinguística, que analisa os fatores linguísticos e sociais que poderiam influenciar essa aprendizagem. Para tanto, os clíticos de 3ª pessoa foram trabalhados gradualmente durante um ano letivo. Assim, o corpus desta pesquisa compõe-se de textos escritos livremente e de testes de compreensão – que visava verificar a compreensão dos clíticos presentes em textos infantis – e de desempenho – que consistia na substituição de expressões pelos clíticos adequados. Os dados foram colhidos em uma turma de 4º ano do Ensino Fundamental durante 10 meses letivos, numa escola pública de Belo Horizonte, Minas Gerais, cujos alunos pertencem a diferentes níveis socioeconômicos. Os resultados obtidos revelam que: i) o contexto em que o clítico é inserido contribui significativamente para a sua compreensão; ii) com relação aos fatores extralinguísticos, os alunos, independentemente de sua origem social e sexo/gênero, apresentam dificuldades em compreender os clíticos; entretanto, há uma leve tendência de um melhor aproveitamento quanto aos meninos da classe socioeconômica favorecida

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Many learning problems require handling high dimensional datasets with a relatively small number of instances. Learning algorithms are thus confronted with the curse of dimensionality, and need to address it in order to be effective. Examples of these types of data include the bag-of-words representation in text classification problems and gene expression data for tumor detection/classification. Usually, among the high number of features characterizing the instances, many may be irrelevant (or even detrimental) for the learning tasks. It is thus clear that there is a need for adequate techniques for feature representation, reduction, and selection, to improve both the classification accuracy and the memory requirements. In this paper, we propose combined unsupervised feature discretization and feature selection techniques, suitable for medium and high-dimensional datasets. The experimental results on several standard datasets, with both sparse and dense features, show the efficiency of the proposed techniques as well as improvements over previous related techniques.