Biblioteca Digital

9 resultados para Biology, Neuroscience|Biology, Genetics

em Collection Of Biostatistics Research Archive

Classification Using Generalized Partial Least Squares

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The advances in computational biology have made simultaneous monitoring of thousands of features possible. The high throughput technologies not only bring about a much richer information context in which to study various aspects of gene functions but they also present challenge of analyzing data with large number of covariates and few samples. As an integral part of machine learning, classification of samples into two or more categories is almost always of interest to scientists. In this paper, we address the question of classification in this setting by extending partial least squares (PLS), a popular dimension reduction tool in chemometrics, in the context of generalized linear regression based on a previous approach, Iteratively ReWeighted Partial Least Squares, i.e. IRWPLS (Marx, 1996). We compare our results with two-stage PLS (Nguyen and Rocke, 2002A; Nguyen and Rocke, 2002B) and other classifiers. We show that by phrasing the problem in a generalized linear model setting and by applying bias correction to the likelihood to avoid (quasi)separation, we often get lower classification error rates.

Veja mais

Differential Expression with the Bioconductor Project

Relevância:

90.00% 90.00%

Publicador:

Resumo:

A basic, yet challenging task in the analysis of microarray gene expression data is the identification of changes in gene expression that are associated with particular biological conditions. We discuss different approaches to this task and illustrate how they can be applied using software from the Bioconductor Project. A central problem is the high dimensionality of gene expression space, which prohibits a comprehensive statistical analysis without focusing on particular aspects of the joint distribution of the genes expression levels. Possible strategies are to do univariate gene-by-gene analysis, and to perform data-driven nonspecific filtering of genes before the actual statistical analysis. However, more focused strategies that make use of biologically relevant knowledge are more likely to increase our understanding of the data.

Veja mais

Estimation and Testing for the Effect of a Genetic Pathway on a Disease Outcome Using Logistic Kernel Machine Regression via Logistic Mixed Models

Relevância:

90.00% 90.00%

Publicador:

Veja mais

Powerful SNP Set Analysis for Case-Control Genome Wide Association Studies

Relevância:

90.00% 90.00%

Publicador:

Veja mais

Sparse Linear Discriminant Analysis for Simultaneous Testing for the Significance of a Gene Set/Pathway and Gene Selection

Relevância:

90.00% 90.00%

Publicador:

Veja mais

Survival Analysis with Large Dimensional Covariates: An Application in Microarray Studies

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Use of microarray technology often leads to high-dimensional and low- sample size data settings. Over the past several years, a variety of novel approaches have been proposed for variable selection in this context. However, only a small number of these have been adapted for time-to-event data where censoring is present. Among standard variable selection methods shown both to have good predictive accuracy and to be computationally efficient is the elastic net penalization approach. In this paper, adaptation of the elastic net approach is presented for variable selection both under the Cox proportional hazards model and under an accelerated failure time (AFT) model. Assessment of the two methods is conducted through simulation studies and through analysis of microarray data obtained from a set of patients with diffuse large B-cell lymphoma where time to survival is of interest. The approaches are shown to match or exceed the predictive performance of a Cox-based and an AFT-based variable selection method. The methods are moreover shown to be much more computationally efficient than their respective Cox- and AFT- based counterparts.

Veja mais

Assessment of a CGH-based Genetic Instability

Relevância:

90.00% 90.00%

Publicador:

Veja mais

Assessing Population Level Genetic Instability via Moving Average

Relevância:

90.00% 90.00%

Publicador:

Veja mais

A Powerful and Flexible Multilocus Association Test for Quantitative Traits

Relevância:

90.00% 90.00%

Publicador:

Veja mais

9 resultados para Biology, Neuroscience|Biology, Genetics

em Collection Of Biostatistics Research Archive

Filtro por publicador