5 resultados para l2 -mean-reversion
em Cochin University of Science
Resumo:
Computational Biology is the research are that contributes to the analysis of biological data through the development of algorithms which will address significant research problems.The data from molecular biology includes DNA,RNA ,Protein and Gene expression data.Gene Expression Data provides the expression level of genes under different conditions.Gene expression is the process of transcribing the DNA sequence of a gene into mRNA sequences which in turn are later translated into proteins.The number of copies of mRNA produced is called the expression level of a gene.Gene expression data is organized in the form of a matrix. Rows in the matrix represent genes and columns in the matrix represent experimental conditions.Experimental conditions can be different tissue types or time points.Entries in the gene expression matrix are real values.Through the analysis of gene expression data it is possible to determine the behavioral patterns of genes such as similarity of their behavior,nature of their interaction,their respective contribution to the same pathways and so on. Similar expression patterns are exhibited by the genes participating in the same biological process.These patterns have immense relevance and application in bioinformatics and clinical research.Theses patterns are used in the medical domain for aid in more accurate diagnosis,prognosis,treatment planning.drug discovery and protein network analysis.To identify various patterns from gene expression data,data mining techniques are essential.Clustering is an important data mining technique for the analysis of gene expression data.To overcome the problems associated with clustering,biclustering is introduced.Biclustering refers to simultaneous clustering of both rows and columns of a data matrix. Clustering is a global whereas biclustering is a local model.Discovering local expression patterns is essential for identfying many genetic pathways that are not apparent otherwise.It is therefore necessary to move beyond the clustering paradigm towards developing approaches which are capable of discovering local patterns in gene expression data.A biclusters is a submatrix of the gene expression data matrix.The rows and columns in the submatrix need not be contiguous as in the gene expression data matrix.Biclusters are not disjoint.Computation of biclusters is costly because one will have to consider all the combinations of columans and rows in order to find out all the biclusters.The search space for the biclustering problem is 2 m+n where m and n are the number of genes and conditions respectively.Usually m+n is more than 3000.The biclustering problem is NP-hard.Biclustering is a powerful analytical tool for the biologist.The research reported in this thesis addresses the problem of biclustering.Ten algorithms are developed for the identification of coherent biclusters from gene expression data.All these algorithms are making use of a measure called mean squared residue to search for biclusters.The objective here is to identify the biclusters of maximum size with the mean squared residue lower than a given threshold. All these algorithms begin the search from tightly coregulated submatrices called the seeds.These seeds are generated by K-Means clustering algorithm.The algorithms developed can be classified as constraint based,greedy and metaheuristic.Constarint based algorithms uses one or more of the various constaints namely the MSR threshold and the MSR difference threshold.The greedy approach makes a locally optimal choice at each stage with the objective of finding the global optimum.In metaheuristic approaches particle Swarm Optimization(PSO) and variants of Greedy Randomized Adaptive Search Procedure(GRASP) are used for the identification of biclusters.These algorithms are implemented on the Yeast and Lymphoma datasets.Biologically relevant and statistically significant biclusters are identified by all these algorithms which are validated by Gene Ontology database.All these algorithms are compared with some other biclustering algorithms.Algorithms developed in this work overcome some of the problems associated with the already existing algorithms.With the help of some of the algorithms which are developed in this work biclusters with very high row variance,which is higher than the row variance of any other algorithm using mean squared residue, are identified from both Yeast and Lymphoma data sets.Such biclusters which make significant change in the expression level are highly relevant biologically.
Resumo:
Vibrio are important during hatchery rearing. aquaculture phase and post-harvest quality of shrimps. Vibrio spp are of concern to shrimp farmers and hatchery operators because certain species can cause Vibriosis. Vibrio species are of concern to humans because certain species cause serious diseases.With the progress in aquaculture, intensive systems used for shrimp aquaculture create an artificial environment that increases bacterial growth. To maintain the productivity of such an intensive aquaculture, high inputs of fish protein have to be employed for feeding together with high levels of water exchange and the massive use of antibiotics/ probiotics / chemicals. It seems that the combination of these conditions favours the proliferation of vibrios and enhances their virulence and disease prevalence. The risk of a microbial infection is high, mainly at larval stages. The effect and severity are related to Vibrio species and dose, water, feed, shrimp quality and aquaculture management.Consumption of seafood can occasionally result in food-bome illnesses due to the proliferation of indigenous pathogens like Vibrio.Of the l2 pathogenic Vibrio species, 8 species are known to be directly food associated. Strict quality guidelines have been laid by the importing nations, for the food products that enter their markets. The microbiological quality requirement for export of frozen shrimp products is that V.cholerae, V.parahaemolyticus and V. vulnificus should be absent in 25g of the processed shrimp (Export Inspection Council of India, 1995). The mere presence of these pathogenic Vibrios is sufficient for the rejection of the exported product.The export rejections cause serious economic loss to the shrimp industry and might harm the brand image of the shrimp products from the countiy.There is a need for an independent study on the incidence of different pathogenic vibrios in shrimp aquaculture and investigate their biochemical characteristics to have a better understanding about the growth and survival of these organisms in the shrimp aquaculture niche. PCR based methods (conventional PCR, duplex PCR, multiplex-PCR and Real Time PCR) for the detection of the pathogenic Vibrios is important for rapid post-harvest quality assessment. Studies on the genetic heterogeneity among the specific pathogenic vibrio species isolated from shrimp aquaculture system provide; valuable information on the extent of genetic diversity of the pathogenic vibrios, the shrimp aquaculture system.So the present study was undertaken to study the incidence of pathogenic Vibrio spp. in Penaeus monodon shrimp hatcheries and aquaculture farms, to carry out biochemical investigations of the pathogenic Vibrio spp isolated from P. monodon hatchery and. aquaculture environments, to assess the effect of salt (NaCl) on the growth and enzymatic activities of pathogenic Vibrio spp., to study the effect of preservatives, and chemicals on the growth of pathogenic Vibrio spp. and to employ polymerase chain reaction (PCR) methods for the detection of pathogenic V ibrio spp.Samples of water (n=7) and post-larvae (n=7) were obtained from seven Penaeus monodon hatcheries and samples of water (n=5), sediment (n=5) and shrimp (n=5) were obtained from five P. monodon aquaculture farms located on the East Coast of lndia. The microbiological examination of water, sediment, post-larvae and shrimp samples was carried out employing standard methods and by using standard media.The higher bacterial loads were obtained in pond sediments which can be attributed to the accumulation of organic matter at the pond bottom which stimulated bacterial growth.Shrimp head. (4.78 x 105 +/- 3.0 x 104 cfu/g) had relatively higher bacterial load when compared to shrimp muscle 2.7 x 105 +/- 1.95 x 104 cfu/g). ln shrimp hatchery samples, the post-larvae (2.2 x 106 +/- 1.9 x 106 cfu/g) had higher bacterial load than water (5.6 x 103 +/- 3890 cfu/ml).The mean E.coli counts were higher in aquaculture pond sediment (204+/-13 cfu/g) and pond water (124+/-88 cfu/ml). Relatively lower Escherichia coli counts were obtained from shrimp samples (12+/-11 to 16+/-16.7 cfu/g). The presence of E.coli in aquaculture environment might have been from the source water. E.coli was not detected in hatchery waters and post-larvae.
Resumo:
In this paper, we study the relationship between the failure rate and the mean residual life of doubly truncated random variables. Accordingly, we develop characterizations for exponential, Pareto 11 and beta distributions. Further, we generalize the identities for fire Pearson and the exponential family of distributions given respectively in Nair and Sankaran (1991) and Consul (1995). Applications of these measures in file context of lengthbiased models are also explored
Resumo:
Globalization and liberalization, with the entry of many prominent foreign manufacturers, changed the automobile scenario in India, since early 1990’s. World Leaders in automobile manufacturing such as Ford, General Motors, Honda, Toyota, Suzuki, Hyundai, Renault, Mitsubishi, Benz, BMW, Volkswagen and Nissan set up their manufacturing units in India in joint venture with their Indian counterpart companies, by making use of the Foreign Direct Investment policy of the Government of India, These manufacturers started capturing the hearts of Indian car customers with their choice of technological and innovative product features, with quality and reliability. With the multiplicity of choices available to the Indian passenger car buyers, it drastically changed the way the car purchase scenario in India and particularly in the State of Kerala. This transformed the automobile scene from a sellers’ market to buyers’ market. Car customers started developing their own personal preferences and purchasing patterns, which were hitherto unknown in the Indian automobile segment. The main purpose of this paper is to come up with the identification of possible parameters and a framework development, that influence the consumer purchase behaviour patterns of passenger car owners in the State of Kerala, so that further research could be done, based on the framework and the identified parameters.
Resumo:
Adaptive filter is a primary method to filter Electrocardiogram (ECG), because it does not need the signal statistical characteristics. In this paper, an adaptive filtering technique for denoising the ECG based on Genetic Algorithm (GA) tuned Sign-Data Least Mean Square (SD-LMS) algorithm is proposed. This technique minimizes the mean-squared error between the primary input, which is a noisy ECG, and a reference input which can be either noise that is correlated in some way with the noise in the primary input or a signal that is correlated only with ECG in the primary input. Noise is used as the reference signal in this work. The algorithm was applied to the records from the MIT -BIH Arrhythmia database for removing the baseline wander and 60Hz power line interference. The proposed algorithm gave an average signal to noise ratio improvement of 10.75 dB for baseline wander and 24.26 dB for power line interference which is better than the previous reported works