5 resultados para EXPRESSION DATA

em Cochin University of Science


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Computational Biology is the research are that contributes to the analysis of biological data through the development of algorithms which will address significant research problems.The data from molecular biology includes DNA,RNA ,Protein and Gene expression data.Gene Expression Data provides the expression level of genes under different conditions.Gene expression is the process of transcribing the DNA sequence of a gene into mRNA sequences which in turn are later translated into proteins.The number of copies of mRNA produced is called the expression level of a gene.Gene expression data is organized in the form of a matrix. Rows in the matrix represent genes and columns in the matrix represent experimental conditions.Experimental conditions can be different tissue types or time points.Entries in the gene expression matrix are real values.Through the analysis of gene expression data it is possible to determine the behavioral patterns of genes such as similarity of their behavior,nature of their interaction,their respective contribution to the same pathways and so on. Similar expression patterns are exhibited by the genes participating in the same biological process.These patterns have immense relevance and application in bioinformatics and clinical research.Theses patterns are used in the medical domain for aid in more accurate diagnosis,prognosis,treatment planning.drug discovery and protein network analysis.To identify various patterns from gene expression data,data mining techniques are essential.Clustering is an important data mining technique for the analysis of gene expression data.To overcome the problems associated with clustering,biclustering is introduced.Biclustering refers to simultaneous clustering of both rows and columns of a data matrix. Clustering is a global whereas biclustering is a local model.Discovering local expression patterns is essential for identfying many genetic pathways that are not apparent otherwise.It is therefore necessary to move beyond the clustering paradigm towards developing approaches which are capable of discovering local patterns in gene expression data.A biclusters is a submatrix of the gene expression data matrix.The rows and columns in the submatrix need not be contiguous as in the gene expression data matrix.Biclusters are not disjoint.Computation of biclusters is costly because one will have to consider all the combinations of columans and rows in order to find out all the biclusters.The search space for the biclustering problem is 2 m+n where m and n are the number of genes and conditions respectively.Usually m+n is more than 3000.The biclustering problem is NP-hard.Biclustering is a powerful analytical tool for the biologist.The research reported in this thesis addresses the problem of biclustering.Ten algorithms are developed for the identification of coherent biclusters from gene expression data.All these algorithms are making use of a measure called mean squared residue to search for biclusters.The objective here is to identify the biclusters of maximum size with the mean squared residue lower than a given threshold. All these algorithms begin the search from tightly coregulated submatrices called the seeds.These seeds are generated by K-Means clustering algorithm.The algorithms developed can be classified as constraint based,greedy and metaheuristic.Constarint based algorithms uses one or more of the various constaints namely the MSR threshold and the MSR difference threshold.The greedy approach makes a locally optimal choice at each stage with the objective of finding the global optimum.In metaheuristic approaches particle Swarm Optimization(PSO) and variants of Greedy Randomized Adaptive Search Procedure(GRASP) are used for the identification of biclusters.These algorithms are implemented on the Yeast and Lymphoma datasets.Biologically relevant and statistically significant biclusters are identified by all these algorithms which are validated by Gene Ontology database.All these algorithms are compared with some other biclustering algorithms.Algorithms developed in this work overcome some of the problems associated with the already existing algorithms.With the help of some of the algorithms which are developed in this work biclusters with very high row variance,which is higher than the row variance of any other algorithm using mean squared residue, are identified from both Yeast and Lymphoma data sets.Such biclusters which make significant change in the expression level are highly relevant biologically.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Biclustering is simultaneous clustering of both rows and columns of a data matrix. A measure called Mean Squared Residue (MSR) is used to simultaneously evaluate the coherence of rows and columns within a submatrix. In this paper a novel algorithm is developed for biclustering gene expression data using the newly introduced concept of MSR difference threshold. In the first step high quality bicluster seeds are generated using K-Means clustering algorithm. Then more genes and conditions (node) are added to the bicluster. Before adding a node the MSR X of the bicluster is calculated. After adding the node again the MSR Y is calculated. The added node is deleted if Y minus X is greater than MSR difference threshold or if Y is greater than MSR threshold which depends on the dataset. The MSR difference threshold is different for gene list and condition list and it depends on the dataset also. Proper values should be identified through experimentation in order to obtain biclusters of high quality. The results obtained on bench mark dataset clearly indicate that this algorithm is better than many of the existing biclustering algorithms

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Microarray data analysis is one of data mining tool which is used to extract meaningful information hidden in biological data. One of the major focuses on microarray data analysis is the reconstruction of gene regulatory network that may be used to provide a broader understanding on the functioning of complex cellular systems. Since cancer is a genetic disease arising from the abnormal gene function, the identification of cancerous genes and the regulatory pathways they control will provide a better platform for understanding the tumor formation and development. The major focus of this thesis is to understand the regulation of genes responsible for the development of cancer, particularly colorectal cancer by analyzing the microarray expression data. In this thesis, four computational algorithms namely fuzzy logic algorithm, modified genetic algorithm, dynamic neural fuzzy network and Takagi Sugeno Kang-type recurrent neural fuzzy network are used to extract cancer specific gene regulatory network from plasma RNA dataset of colorectal cancer patients. Plasma RNA is highly attractive for cancer analysis since it requires a collection of small amount of blood and it can be obtained at any time in repetitive fashion allowing the analysis of disease progression and treatment response.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The present study demonstrate the functional alterations of the GABAA and GABAB receptors and the gene expression during the regeneration of pancreas following partial pancreatectomy. The role of these receptors in insulin secretion and pancreatic DNA synthesis using the specific agonists and antagonists also are studied in vitro. The alterations of GABAA and GABAR receptor function and gene expression in the brain stem, crebellum and hypothalamus play an important role in the sympathetic regulation of insulin secretion during pancreatic regeneration. Previous studies have given much information linking functional interaction between GABA and the peripheral nervous system. The involvement of specific receptor subtypes functional regulation during pancreatic regeneration has not given emphasis and research in this area seems to be scarce. We have observed a decreased GABA content, down regulation of GABAA receptors and an up regulation of GABAB receptors in the cerebral cortex, brain stem and hypothalamus. Real Time-PCR analysis confirmed the receptor data in the brain regions. These alterations in the GABAA and GABAB receptors of the brain are suggested to govern the regenerative response and growth regulation of the pancreas through sympathetic innervation. In addition, receptor binding studies and Real Time-PCR analysis revealed that during pancreatic regeneration GABAA receptors were down regulated and GABAB receptors were up regulated in pancreatic islets. This suggests an inhibitory role for GABAA receptors in islet cell proliferation i.e., the down regulation of this receptor facilitates proliferation. Insulin secretion study during 1 hour showed GABA has inhibited the insulin secretion in a dose dependent manner in normal and hyperglycaemic conditions. Bicuculline did not antagonize this effect. GABAA agonist, muscimol inhibited glucose stimulated insulin secretion from pancreatic islets except in the lowest concentration of 1O-9M in presence of 4mM glucose.Musclmol enhanced insulin secretion at 10-7 and 10-4M muscimol in presence of 20mM glucose- 4mM glucose represents normal and 20mM represent hyperglycaemic conditions. GABAB agonist, baclofen also inhibited glucose induced insulin secretion and enhanced at the concentration of 1O-5M at 4mM glucose and at 10-9M baclofen in presence of 20mM glucose. This shows a differential control of the GABAA and GABAB receptors over insulin release from the pancreatic islets. During 24 hours in vitro insulin secretion study it showed that low concentration of GABA has inhibited glucose stimulated insulin secretion from pancreatic islets. Muscimol, the GABAA agonist, inhibited the insulin secretion but, gave an enhanced secretion of insulin in presence of 4mM glucose at 10-7 , 10-5 and 1O-4M muscimol. But in presence of 20mM glucose muscimol significantly inhibited the insulin secretion. GABAB agonist, baclofen also inhibited glucose induced insulin secretion in presence of both 4mM and 20mM glucose. This shows the inhibitory role of GABA and its specific receptor subtypes over insulin synthesis from pancreatic bete-islets. In vitro DNA synthesis studies showed that activation of GABAA receptor by adding muscimol, a specific agonist, inhibited islet DNA synthesis. Also, the addition of baclofen, a specific agonist of GABAB receptor resulted in the stimulation of DNA synthesis.Thus the brain and pancreatic GABAA and GABAB receptor gene expression differentially regulates pancreatic insulin secretion and islet cell proliferation during pancreatic regeneration. This will have immense clinical significance in therapeutic applications in the management of Diabetes mellitus.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis entitled Reliability Modelling and Analysis in Discrete time Some Concepts and Models Useful in the Analysis of discrete life time data.The present study consists of five chapters. In Chapter II we take up the derivation of some general results useful in reliability modelling that involves two component mixtures. Expression for the failure rate, mean residual life and second moment of residual life of the mixture distributions in terms of the corresponding quantities in the component distributions are investigated. Some applications of these results are also pointed out. The role of the geometric,Waring and negative hypergeometric distributions as models of life lengths in the discrete time domain has been discussed already. While describing various reliability characteristics, it was found that they can be often considered as a class. The applicability of these models in single populations naturally extends to the case of populations composed of sub-populations making mixtures of these distributions worth investigating. Accordingly the general properties, various reliability characteristics and characterizations of these models are discussed in chapter III. Inference of parameters in mixture distribution is usually a difficult problem because the mass function of the mixture is a linear function of the component masses that makes manipulation of the likelihood equations, leastsquare function etc and the resulting computations.very difficult. We show that one of our characterizations help in inferring the parameters of the geometric mixture without involving computational hazards. As mentioned in the review of results in the previous sections, partial moments were not studied extensively in literature especially in the case of discrete distributions. Chapters IV and V deal with descending and ascending partial factorial moments. Apart from studying their properties, we prove characterizations of distributions by functional forms of partial moments and establish recurrence relations between successive moments for some well known families. It is further demonstrated that partial moments are equally efficient and convenient compared to many of the conventional tools to resolve practical problems in reliability modelling and analysis. The study concludes by indicating some new problems that surfaced during the course of the present investigation which could be the subject for a future work in this area.