985 resultados para gene networks
Resumo:
As advances in molecular biology continue to reveal additional layers of complexity in gene regulation, computational models need to incorporate additional features to explore the implications of new theories and hypotheses. It has recently been suggested that eukaryotic organisms owe their phenotypic complexity and diversity to the exploitation of small RNAs as signalling molecules. Previous models of genetic systems are, for several reasons, inadequate to investigate this theory. In this study, we present an artificial genome model of genetic regulatory networks based upon previous work by Torsten Reil, and demonstrate how this model generates networks with biologically plausible structural and dynamic properties. We also extend the model to explore the implications of incorporating regulation by small RNA molecules in a gene network. We demonstrate how, using these signals, highly connected networks can display dynamics that are more stable than expected given their level of connectivity.
Resumo:
Complex systems techniques provide a powerful tool to study the emergent properties of networks of interacting genes. In this study we extract models of genetic regulatory networks from an artificial genome, represented by a sequence of nucleotides, and analyse how variations in the connectivity and degree of inhibition of the extracted networks affects the resulting classes of behaviours. For low connectivity systems were found to be very stable. Only with higher connectivity was a significant occurrence of chaos found. Most interestingly, the peak in occurrence of chaos occurs perched on the edge of a phase transition in the occurrence of attractors.
Resumo:
Understanding a complex network's structure holds the key to understanding its function. The physics community has contributed a multitude of methods and analyses to this cross-disciplinary endeavor. Structural features exist on both the microscopic level, resulting from differences between single node properties, and the mesoscopic level resulting from properties shared by groups of nodes. Disentangling the determinants of network structure on these different scales has remained a major, and so far unsolved, challenge. Here we show how multiscale generative probabilistic exponential random graph models combined with efficient, distributive message-passing inference techniques can be used to achieve this separation of scales, leading to improved detection accuracy of latent classes as demonstrated on benchmark problems. It sheds new light on the statistical significance of motif-distributions in neural networks and improves the link-prediction accuracy as exemplified for gene-disease associations in the highly consequential Online Mendelian Inheritance in Man database. © 2011 Reichardt et al.
Resumo:
Background—The molecular mechanisms underlying similarities and differences between physiological and pathological left ventricular hypertrophy (LVH) are of intense interest. Most previous work involved targeted analysis of individual signaling pathways or screening of transcriptomic profiles. We developed a network biology approach using genomic and proteomic data to study the molecular patterns that distinguish pathological and physiological LVH. Methods and Results—A network-based analysis using graph theory methods was undertaken on 127 genome-wide expression arrays of in vivo murine LVH. This revealed phenotype-specific pathological and physiological gene coexpression networks. Despite >1650 common genes in the 2 networks, network structure is significantly different. This is largely because of rewiring of genes that are differentially coexpressed in the 2 networks; this novel concept of differential wiring was further validated experimentally. Functional analysis of the rewired network revealed several distinct cellular pathways and gene sets. Deeper exploration was undertaken by targeted proteomic analysis of mitochondrial, myofilament, and extracellular subproteomes in pathological LVH. A notable finding was that mRNA–protein correlation was greater at the cellular pathway level than for individual loci. Conclusions—This first combined gene network and proteomic analysis of LVH reveals novel insights into the integrated pathomechanisms that distinguish pathological versus physiological phenotypes. In particular, we identify differential gene wiring as a major distinguishing feature of these phenotypes. This approach provides a platform for the investigation of potentially novel pathways in LVH and offers a freely accessible protocol (http://sites.google.com/site/cardionetworks) for similar analyses in other cardiovascular diseases.
Resumo:
While molecular and cellular processes are often modeled as stochastic processes, such as Brownian motion, chemical reaction networks and gene regulatory networks, there are few attempts to program a molecular-scale process to physically implement stochastic processes. DNA has been used as a substrate for programming molecular interactions, but its applications are restricted to deterministic functions and unfavorable properties such as slow processing, thermal annealing, aqueous solvents and difficult readout limit them to proof-of-concept purposes. To date, whether there exists a molecular process that can be programmed to implement stochastic processes for practical applications remains unknown.
In this dissertation, a fully specified Resonance Energy Transfer (RET) network between chromophores is accurately fabricated via DNA self-assembly, and the exciton dynamics in the RET network physically implement a stochastic process, specifically a continuous-time Markov chain (CTMC), which has a direct mapping to the physical geometry of the chromophore network. Excited by a light source, a RET network generates random samples in the temporal domain in the form of fluorescence photons which can be detected by a photon detector. The intrinsic sampling distribution of a RET network is derived as a phase-type distribution configured by its CTMC model. The conclusion is that the exciton dynamics in a RET network implement a general and important class of stochastic processes that can be directly and accurately programmed and used for practical applications of photonics and optoelectronics. Different approaches to using RET networks exist with vast potential applications. As an entropy source that can directly generate samples from virtually arbitrary distributions, RET networks can benefit applications that rely on generating random samples such as 1) fluorescent taggants and 2) stochastic computing.
By using RET networks between chromophores to implement fluorescent taggants with temporally coded signatures, the taggant design is not constrained by resolvable dyes and has a significantly larger coding capacity than spectrally or lifetime coded fluorescent taggants. Meanwhile, the taggant detection process becomes highly efficient, and the Maximum Likelihood Estimation (MLE) based taggant identification guarantees high accuracy even with only a few hundred detected photons.
Meanwhile, RET-based sampling units (RSU) can be constructed to accelerate probabilistic algorithms for wide applications in machine learning and data analytics. Because probabilistic algorithms often rely on iteratively sampling from parameterized distributions, they can be inefficient in practice on the deterministic hardware traditional computers use, especially for high-dimensional and complex problems. As an efficient universal sampling unit, the proposed RSU can be integrated into a processor / GPU as specialized functional units or organized as a discrete accelerator to bring substantial speedups and power savings.
Resumo:
Valuable genetic variation for bean breeding programs is held within the common bean secondary gene pool which consists of Phaseolus albescens, P. coccineus, P. costaricensis, and P. dumosus. However, the use of close relatives for bean improvement is limited due to the lack of knowledge about genetic variation and genetic plasticity of many of these species. Characterisation and analysis of the genetic diversity is necessary among beans' wild relatives; in addition, conflicting phylogenies and relationships need to be understood and a hypothesis of a hybrid origin of P. dumosus needs to be tested. This thesis research was orientated to generate information about the patterns of relationships among the common bean secondary gene pool, with particular focus on the species Phaseolus dumosus. This species displays a set of characteristics of agronomic interest, not only for the direct improvement of common bean but also as a source of valuable genes for adaptation to climate change. Here I undertake the first comprehensive study of the genetic diversity of P. dumosus as ascertained from both nuclear and chloroplast genome markers. A germplasm collection of the ancestral forms of P. dumosus together with wild, landrace and cultivar representatives of all other species of the common bean secondary gene pool, were used to analyse genetic diversity, phylogenetic relationships and structure of P. dumosus. Data on molecular variation was generated from sequences of cpDNA loci accD-psaI spacer, trnT-trnL spacer, trnL intron and rps14-psaB spacer and from the nrDNA the ITS region. A whole genome DArT array was developed and used for the genotyping of P. dumosus and its closes relatives. 4208 polymorphic markers were generated in the DArT array and from those, 742 markers presented a call rate >95% and zero discordance. DArT markers revealed a moderate genetic polymorphism among P. dumosus samples (13% of polymorphic loci), while P. coccineus presented the highest level of polymorphism (88% of polymorphic loci). At the cpDNA one ancestral haplotype was detected among all samples of all species in the secondary genepool. The ITS region of P. dumosus revealed high homogeneity and polymorphism bias to P. coccineus genome. Phylogenetic reconstructions made with Maximum likelihood and Bayesian methods confirmed previously reported discrepancies among the nuclear and chloroplast genomes of P. dumosus. The outline of relationships by hybridization networks displayed a considerable number of interactions within and between species. This research provides compelling evidence that P. dumosus arose from hybridisation between P. vulgaris and P. coccineus and confirms that P. costaricensis has likely been involved in the genesis or backcrossing events (or both) in the history of P. dumosus. The classification of the specie P. persistentus was analysed based on cpDNA and ITS sequences, the results found this species to be highly related to P. vulgaris but not too similar to P. leptostachyus as previously proposed. This research demonstrates that wild types of the secondary genepool carry a significant genetic variation which makes this a valuable genetic resource for common bean improvement. The DArT array generated in this research is a valuable resource for breeding programs since it has the potential to be used in several approaches including genotyping, discovery of novel traits, mapping and marker-trait associations. Efforts should be made to search for potential populations of P. persistentus and to increase the collection of new populations of P. dumosus, P. albescens and P. costaricensis that may provide valuable traits for introgression into common bean and other Phaseolus crops.
Resumo:
Marine organisms have to cope with increasing CO2 partial pressures and decreasing pH in the oceans. We elucidated the impacts of an 8-week acclimation period to four seawater pCO2 treatments (39, 113, 243 and 405 Pa/385, 1,120, 2,400 and 4,000 µatm) on mantle gene expression patterns in the blue mussel Mytilus edulis from the Baltic Sea. Based on the M. edulis mantle tissue transcriptome, the expression of several genes involved in metabolism, calcification and stress responses was assessed in the outer (marginal and pallial zone) and the inner mantle tissues (central zone) using quantitative real-time PCR. The expression of genes involved in energy and protein metabolism (F-ATPase, hexokinase and elongation factor alpha) was strongly affected by acclimation to moderately elevated CO2 partial pressures. Expression of a chitinase, potentially important for the calcification process, was strongly depressed (maximum ninefold), correlating with a linear decrease in shell growth observed in the experimental animals. Interestingly, shell matrix protein candidate genes were less affected by CO2 in both tissues. A compensatory process toward enhanced shell protection is indicated by a massive increase in the expression of tyrosinase, a gene involved in periostracum formation (maximum 220-fold). Using correlation matrices and a force-directed layout network graph, we were able to uncover possible underlying regulatory networks and the connections between different pathways, thereby providing a molecular basis of observed changes in animal physiology in response to ocean acidification.
Resumo:
Background
It is generally acknowledged that a functional understanding of a biological system can only be obtained by an understanding of the collective of molecular interactions in form of biological networks. Protein networks are one particular network type of special importance, because proteins form the functional base units of every biological cell. On a mesoscopic level of protein networks, modules are of significant importance because these building blocks may be the next elementary functional level above individual proteins allowing to gain insight into fundamental organizational principles of biological cells.
Results
In this paper, we provide a comparative analysis of five popular and four novel module detection algorithms. We study these module prediction methods for simulated benchmark networks as well as 10 biological protein interaction networks (PINs). A particular focus of our analysis is placed on the biological meaning of the predicted modules by utilizing the Gene Ontology (GO) database as gold standard for the definition of biological processes. Furthermore, we investigate the robustness of the results by perturbing the PINs simulating in this way our incomplete knowledge of protein networks.
Conclusions
Overall, our study reveals that there is a large heterogeneity among the different module prediction algorithms if one zooms-in the biological level of biological processes in the form of GO terms and all methods are severely affected by a slight perturbation of the networks. However, we also find pathways that are enriched in multiple modules, which could provide important information about the hierarchical organization of the system
Resumo:
The overwhelming amount and unprecedented speed of publication in the biomedical domain make it difficult for life science researchers to acquire and maintain a broad view of the field and gather all information that would be relevant for their research. As a response to this problem, the BioNLP (Biomedical Natural Language Processing) community of researches has emerged and strives to assist life science researchers by developing modern natural language processing (NLP), information extraction (IE) and information retrieval (IR) methods that can be applied at large-scale, to scan the whole publicly available biomedical literature and extract and aggregate the information found within, while automatically normalizing the variability of natural language statements. Among different tasks, biomedical event extraction has received much attention within BioNLP community recently. Biomedical event extraction constitutes the identification of biological processes and interactions described in biomedical literature, and their representation as a set of recursive event structures. The 2009–2013 series of BioNLP Shared Tasks on Event Extraction have given raise to a number of event extraction systems, several of which have been applied at a large scale (the full set of PubMed abstracts and PubMed Central Open Access full text articles), leading to creation of massive biomedical event databases, each of which containing millions of events. Sinece top-ranking event extraction systems are based on machine-learning approach and are trained on the narrow-domain, carefully selected Shared Task training data, their performance drops when being faced with the topically highly varied PubMed and PubMed Central documents. Specifically, false-positive predictions by these systems lead to generation of incorrect biomolecular events which are spotted by the end-users. This thesis proposes a novel post-processing approach, utilizing a combination of supervised and unsupervised learning techniques, that can automatically identify and filter out a considerable proportion of incorrect events from large-scale event databases, thus increasing the general credibility of those databases. The second part of this thesis is dedicated to a system we developed for hypothesis generation from large-scale event databases, which is able to discover novel biomolecular interactions among genes/gene-products. We cast the hypothesis generation problem as a supervised network topology prediction, i.e predicting new edges in the network, as well as types and directions for these edges, utilizing a set of features that can be extracted from large biomedical event networks. Routine machine learning evaluation results, as well as manual evaluation results suggest that the problem is indeed learnable. This work won the Best Paper Award in The 5th International Symposium on Languages in Biology and Medicine (LBM 2013).
Resumo:
Plant reproduction depends on the concerted activation of many genes to ensure correct communication between pollen and pistil. Here, we queried the whole transcriptome of Arabidopsis (Arabidopsis thaliana) in order to identify genes with specific reproductive functions. We used the Affymetrix ATH1 whole genome array to profile wild-type unpollinated pistils and unfertilized ovules. By comparing the expression profile of pistils at 0.5, 3.5, and 8.0 h after pollination and applying a number of statistical and bioinformatics criteria, we found 1,373 genes differentially regulated during pollen-pistil interactions. Robust clustering analysis grouped these genes in 16 time-course clusters representing distinct patterns of regulation. Coregulation within each cluster suggests the presence of distinct genetic pathways, which might be under the control of specific transcriptional regulators. A total of 78% of the regulated genes were expressed initially in unpollinated pistil and/or ovules, 15% were initially detected in the pollen data sets as enriched or preferentially expressed, and 7% were induced upon pollination. Among those, we found a particular enrichment for unknown transcripts predicted to encode secreted proteins or representing signaling and cell wall-related proteins, which may function by remodeling the extracellular matrix or as extracellular signaling molecules. A strict regulatory control in various metabolic pathways suggests that fine-tuning of the biochemical and physiological cellular environment is crucial for reproductive success. Our study provides a unique and detailed temporal and spatial gene expression profile of in vivo pollen-pistil interactions, providing a framework to better understand the basis of the molecular mechanisms operating during the reproductive process in higher plants.
Resumo:
Dengue fever is one of the most important mosquito-borne diseases worldwide and is caused by infection with dengue virus (DENV). The disease is endemic in tropical and sub-tropical regions and has increased remarkably in the last few decades. At present, there is no antiviral or approved vaccine against the virus. Treatment of dengue patients is usually supportive, through oral or intravenous rehydration, or by blood transfusion for more severe dengue cases. Infection of DENV in humans and mosquitoes involves a complex interplay between the virus and host factors. This results in regulation of numerous intracellular processes, such as signal transduction and gene transcription which leads to progression of disease. To understand the mechanisms underlying the disease, the study of virus and host factors is therefore essential and could lead to the identification of human proteins modulating an essential step in the virus life cycle. Knowledge of these human proteins could lead to the discovery of potential new drug targets and disease control strategies in the future. Recent advances of high throughput screening technologies have provided researchers with molecular tools to carry out investigations on a large scale. Several studies have focused on determination of the host factors during DENV infection in human and mosquito cells. For instance, a genome-wide RNA interference (RNAi) screen has identified host factors that potentially play an important role in both DENV and West Nile virus replication (Krishnan et al. 2008). In the present study, a high-throughput yeast two-hybrid screen has been utilised in order to identify human factors interacting with DENV non-structural proteins. From the screen, 94 potential human interactors were identified. These include proteins involved in immune signalling regulation, potassium voltage-gated channels, transcriptional regulators, protein transporters and endoplasmic reticulum-associated proteins. Validation of fifteen of these human interactions revealed twelve of them strongly interacted with DENV proteins. Two proteins of particular interest were selected for further investigations of functional biological systems at the molecular level. These proteins, including a nuclear-associated protein BANP and a voltage-gated potassium channel Kv1.3, both have been identified through interaction with the DENV NS2A. BANP is known to be involved in NF-kB immune signalling pathway, whereas, Kv1.3 is known to play an important role in regulating passive flow of potassium ions upon changes in the cell transmembrane potential. This study also initiated a construction of an Aedes aegypti cDNA library for use with DENV proteins in Y2H screen. However, several issues were encountered during the study which made the library unsuitable for protein interaction analysis. In parallel, innate immune signalling was also optimised for downstream analysis. Overall, the work presented in this thesis, in particular the Y2H screen provides a number of human factors potentially targeted by DENV during infection. Nonetheless, more work is required to be done in order to validate these proteins and determine their functional properties, as well as testing them with infectious DENV to establish a biological significance. In the long term, data from this study will be useful for investigating potential human factors for development of antiviral strategies against dengue.
Resumo:
Hepatitis C virus is a positive-sense single-stranded RNA virus. The gene junction partitioning the viral glycoproteins E1 and E2 displays concurrent sequence evolution with the 3′-end of E1 highly conserved and the 5′-end of E2 highly heterogeneous. This gene junction is also believed to contain structured RNA elements, with a growing body of evidence suggesting that such structures can act as an additional level of viral replication and transcriptional control. We have previously used ultradeep pyrosequencing to analyze an amplicon library spanning the E1/E2 gene junction from a treatment naïve patient where samples were collected over 10 years of chronic HCV infection. During this timeframe maintenance of an in-frame insertion, recombination and humoral immune targeting of discrete virus sub-populations was reported. In the current study, we present evidence of epistatic evolution across the E1/E2 gene junction and observe the development of co-varying networks of codons set against a background of a complex virome with periodic shifts in population dominance. Overtime, the number of codons actively mutating decreases for all virus groupings. We identify strong synonymous co-variation between codon sites in a group of sequences harbouring a 3 bp in-frame insertion and propose that synonymous mutation acts to stabilize the RNA structural backbone.
Resumo:
Graphene and carbon nanotube nanocomposite (GCN) was synthesised and applied in gene transfection of pIRES plasmid conjugated with green fluorescent protein (GFP) in NIH-3T3 and NG97 cell lines. The tips of the multi-walled carbon nanotubes (MWCNTs) were exfoliated by oxygen plasma etching, which is also known to attach oxygen content groups on the MWCNT surfaces, changing their hydrophobicity. The nanocomposite was characterised by high resolution scanning electron microscopy; energy-dispersive X-ray, Fourier transform infrared and Raman spectroscopies, as well as zeta potential and particle size analyses using dynamic light scattering. BET adsorption isotherms showed the GCN to have an effective surface area of 38.5m(2)/g. The GCN and pIRES plasmid conjugated with the GFP gene, forming π-stacking when dispersed in water by magnetic stirring, resulting in a helical wrap. The measured zeta potential confirmed that the plasmid was connected to the nanocomposite. The NIH-3T3 and NG97 cell lines could phagocytize this wrap. The gene transfection was characterised by fluorescent protein produced in the cells and pictured by fluorescent microscopy. Before application, we studied GCN cell viability in NIH-3T3 and NG97 line cells using both MTT and Neutral Red uptake assays. Our results suggest that GCN has moderate stability behaviour as colloid solution and has great potential as a gene carrier agent in non-viral based therapy, with low cytotoxicity and good transfection efficiency.
Resumo:
For the first time, oxygen terminated cellulose carbon nanoparticles (CCN) was synthesised and applied in gene transfection of pIRES plasmid. The CCN was prepared from catalytic of polyaniline by chemical vapour deposition techniques. This plasmid contains one gene that encodes the green fluorescent protein (GFP) in eukaryotic cells, making them fluorescent. This new nanomaterial and pIRES plasmid formed π-stacking when dispersed in water by magnetic stirring. The frequencies shift in zeta potential confirmed the plasmid strongly connects to the nanomaterial. In vitro tests found that this conjugation was phagocytised by NG97, NIH-3T3 and A549 cell lines making them fluorescent, which was visualised by fluorescent microscopy. Before the transfection test, we studied CCN in cell viability. Both MTT and Neutral Red uptake tests were carried out using NG97, NIH-3T3 and A549 cell lines. Further, we use metabolomics to verify if small amounts of nanomaterial would be enough to cause some cellular damage in NG97 cells. We showed two mechanisms of action by CCN-DNA complex, producing an exogenous protein by the transfected cell and metabolomic changes that contributed by better understanding of glioblastoma, being the major finding of this work. Our results suggested that this nanomaterial has great potential as a gene carrier agent in non-viral based therapy, with low cytotoxicity, good transfection efficiency, and low cell damage in small amounts of nanomaterials in metabolomic tests.
Resumo:
Differential gene expression analysis by suppression subtractive hybridization with correlation to the metabolic pathways involved in chronic myeloid leukemia (CML) may provide a new insight into the pathogenesis of CML. Among the overexpressed genes found in CML at diagnosis are SEPT5, RUNX1, MIER1, KPNA6 and FLT3, while PAN3, TOB1 and ITCH were decreased when compared to healthy volunteers. Some genes were identified and involved in CML for the first time, including TOB1, which showed a low expression in patients with CML during tyrosine kinase inhibitor treatment with no complete cytogenetic response. In agreement, reduced expression of TOB1 was also observed in resistant patients with CML compared to responsive patients. This might be related to the deregulation of apoptosis and the signaling pathway leading to resistance. Most of the identified genes were related to the regulation of nuclear factor κB (NF-κB), AKT, interferon and interleukin-4 (IL-4) in healthy cells. The results of this study combined with literature data show specific gene pathways that might be explored as markers to assess the evolution and prognosis of CML as well as identify new therapeutic targets.