2 results for Biology, Biostatistics|Statistics|Health Sciences, Public Health

in Digital Commons - Michigan Tech


Relevance:

100.00%

Abstract:

The developmental processes and functions of an organism are controlled by its genes and the proteins derived from them. Identifying key genes and reconstructing gene networks can provide a model for understanding the regulatory mechanisms behind the initiation and progression of biological processes or functional abnormalities (e.g., diseases) in living organisms. In this dissertation, I developed statistical methods to identify the genes and transcription factors (TFs) involved in biological processes, constructed their regulatory networks, and evaluated existing association methods to find robust methods for coexpression analyses. Two kinds of data sets were used: genotype data and gene expression microarray data. Based on these data sets, the dissertation has two major parts that together form six chapters. The first part develops association methods for rare variants using genotype data (chapters 4 and 5). The second part develops and/or evaluates statistical methods to identify genes and TFs involved in biological processes and constructs their regulatory networks using gene expression data (chapters 2, 3, and 6). For the first part, I developed two methods to detect the groupwise association of rare variants with given diseases or traits. The first method is based on kernel machine learning and can be applied to both quantitative and qualitative traits. Simulation results showed that the proposed method has higher power than the existing weighted sum (WS) method in most settings. The second method uses multiple phenotypes to select a few top significant genes. It then tests the association of each gene with each phenotype while controlling for population stratification by adjusting the data for ancestry using principal components. This method was applied to the GAW 17 data and identified several disease risk genes.
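The ancestry adjustment mentioned above can be illustrated with a minimal sketch. It is not the dissertation's implementation: it assumes a single leading principal component is enough, finds it by power iteration, and removes its linear effect from a phenotype by simple regression. The function names and the toy data are hypothetical.

```python
import math
import random


def center_columns(rows):
    """Subtract each column's mean so every column has mean zero."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    return [[x - m for x, m in zip(row, means)] for row in rows]


def top_pc_scores(genotypes, iterations=200, seed=0):
    """Per-individual scores on the first principal component.

    Uses power iteration on X^T X, where X is the column-centered
    genotype matrix (rows = individuals, columns = variants).
    """
    x = center_columns(genotypes)
    p = len(x[0])
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in range(p)]
    for _ in range(iterations):
        scores = [sum(a * b for a, b in zip(row, v)) for row in x]  # X v
        v = [sum(s * row[j] for s, row in zip(scores, x)) for j in range(p)]  # X^T (X v)
        norm = math.sqrt(sum(c * c for c in v))
        v = [c / norm for c in v]
    return [sum(a * b for a, b in zip(row, v)) for row in x]


def residualize(values, scores):
    """Remove the linear effect of `scores` from `values` (simple regression)."""
    n = len(values)
    mean_v = sum(values) / n
    mean_s = sum(scores) / n
    sxx = sum((s - mean_s) ** 2 for s in scores)
    sxy = sum((s - mean_s) * (v - mean_v) for s, v in zip(scores, values))
    slope = sxy / sxx
    return [(v - mean_v) - slope * (s - mean_s) for v, s in zip(values, scores)]


# Hypothetical toy data: minor-allele counts for 6 individuals at 4 variants,
# drawn from two subpopulations, plus a phenotype that tracks ancestry.
genotypes = [
    [2, 2, 1, 0],
    [2, 1, 2, 0],
    [1, 2, 2, 1],
    [0, 0, 1, 2],
    [0, 1, 0, 2],
    [1, 0, 0, 1],
]
phenotype = [5.1, 4.8, 5.3, 2.0, 2.2, 2.4]

ancestry = top_pc_scores(genotypes)
adjusted_phenotype = residualize(phenotype, ancestry)
```

In practice several leading components are usually included as covariates rather than one, and both the genotypes and the phenotypes are adjusted before the association test is run.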
For the second part, I worked on three problems. The first problem involved evaluating eight gene association methods. A comprehensive comparison of these methods, with further analysis, demonstrates their distinct and common performance characteristics. For the second problem, an algorithm named the bottom-up graphical Gaussian model was developed to identify the TFs that regulate pathway genes and to reconstruct their hierarchical regulatory networks. This algorithm produced highly significant results, and this is the first report of such hierarchical networks for these pathways. The third problem dealt with developing another algorithm, the top-down graphical Gaussian model, which identifies the network governed by a specific TF. The network produced by this algorithm was shown to be of very high accuracy.
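Graphical Gaussian models rest on partial correlations, which measure the association between two genes after conditioning on all the others. The sketch below shows only that generic building block, computed from the inverse of the sample covariance matrix. It does not reproduce the bottom-up or top-down algorithms, whose details the abstract does not give, and the function names and toy expression values are hypothetical.

```python
import math


def covariance_matrix(samples):
    """Sample covariance matrix; rows are samples, columns are genes."""
    n = len(samples)
    p = len(samples[0])
    means = [sum(col) / n for col in zip(*samples)]
    centered = [[x - m for x, m in zip(row, means)] for row in samples]
    return [
        [sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(p)]
        for i in range(p)
    ]


def invert(matrix):
    """Matrix inverse by Gauss-Jordan elimination with partial pivoting."""
    p = len(matrix)
    aug = [
        list(row) + [1.0 if i == j else 0.0 for j in range(p)]
        for i, row in enumerate(matrix)
    ]
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [x / scale for x in aug[col]]
        for r in range(p):
            if r != col:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[p:] for row in aug]


def partial_correlations(samples):
    """Partial correlation matrix: -omega_ij / sqrt(omega_ii * omega_jj),
    where omega is the inverse of the sample covariance matrix."""
    omega = invert(covariance_matrix(samples))
    p = len(omega)
    return [
        [
            1.0 if i == j else -omega[i][j] / math.sqrt(omega[i][i] * omega[j][j])
            for j in range(p)
        ]
        for i in range(p)
    ]


# Hypothetical toy data: one TF drives two genes; the noise terms are chosen
# to be uncorrelated with the TF and with each other.
tf = [1, 2, 3, 4, 5, 6, 7, 8]
noise_a = [1, -1, -1, 1, 1, -1, -1, 1]
noise_b = [1, -1, 1, -1, -1, 1, -1, 1]
expression = [
    [t, 2 * t + ea, 3 * t + eb] for t, ea, eb in zip(tf, noise_a, noise_b)
]

pcor = partial_correlations(expression)  # columns: TF, gene A, gene B
```

Here the two genes are strongly correlated marginally because they share a regulator, but their partial correlation given the TF vanishes, so a network built from partial correlations would keep the TF-to-gene edges and drop the gene-to-gene edge. With many more genes than samples the sample covariance matrix is singular, so real analyses need a regularized or low-order estimate instead of a direct inverse.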

Relevance:

100.00%

Abstract:

The Great Lakes watershed is home to over 40 million people, and the health of the Great Lakes ecosystem is vital to the overall economic, societal, and environmental health of the U.S. and Canada. However, environmental issues related to the lakes are sometimes overlooked. Policymakers and the public face the challenge of balancing economic benefits with the need to conserve and/or replenish regional natural resources to ensure long-term prosperity. From a literature review, nine critical stressors of ecological services were delineated: pollution and contamination, agricultural erosion, non-native species, degraded recreational resources, loss of wetlands habitat, climate change, risk of clean water shortage, vanishing sand dunes, and population overcrowding. This list was validated through a series of stakeholder discussions and focus groups in Grand Rapids. The focus groups were also used to examine awareness of, concern with, and willingness to expend resources on these stressors. Stressors that respondents have direct contact with tend to be rated as the most important. The focus group results show that concern about pollution and contamination is much higher than for any of the other stressors. The low level of response to climate change led to recommendations for outreach programs.