Gene Network Inference using Machine Learning and Graph Algorithms on Big Biomedical Data


Autoria(s): Wu, Migao
Contribuinte(s)

Yeung, Ka Yee

Data(s)

06/04/2016

06/04/2016

01/03/2016

Resumo

Thesis (Master's)--University of Washington, 2016-03

Gene networks capture the interactions between different biological entities. These gene networks have many applications in modern day biology. In particular, gene networks can help to shed light on the underlying mechanisms of diseases. Advances in biotechnology have led to the generation of different types of genome-wide data, profiling the activity levels across the entire genome. In this thesis, we generated informative and accurate gene networks by integrating multiple types of big biomedical data. Many algorithms have been proposed in the literature to infer gene networks from genome-wide data. However, it is non-trivial to distinguish direct edges between two nodes from indirect edges represented by a path connecting two nodes using these genome-wide data. In this thesis, I constructed compact and accurate gene networks by using an improved Bayesian Modeling Averaging based gene network inference algorithm which includes a post-processing step of removing indirect redundant edges. I applied this improved method to synthetic data in which the ground truth was already known and to real data in which external data sources were used to help assess and analyze the resulting gene networks. The assessment results were presented in two different forms, graphs and tables. In general, the results showed that the new gene network inference algorithm produced more accurate networks and the implementation is more efficient.

Formato

application/pdf

Identificador

Wu_washington_0250O_15486.pdf

http://hdl.handle.net/1773/35504

Idioma(s)

en_US

Palavras-Chave #Bayesian Modeling Averaging; Gene network inference algorithm; Gene networks; Genome-wide data; Synthetic data #Computer science #Bioinformatics #computing and software systems
Tipo

Thesis