6 resultados para Bayesian Networks Elicitation GIS Integration

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo


Relevância:

100.00% 100.00%

Publicador:

Resumo:

A set of predictor variables is said to be intrinsically multivariate predictive (IMP) for a target variable if all properly contained subsets of the predictor set are poor predictors of the. target but the full set predicts the target with great accuracy. In a previous article, the main properties of IMP Boolean variables have been analytically described, including the introduction of the IMP score, a metric based on the coefficient of determination (CoD) as a measure of predictiveness with respect to the target variable. It was shown that the IMP score depends on four main properties: logic of connection, predictive power, covariance between predictors and marginal predictor probabilities (biases). This paper extends that work to a broader context, in an attempt to characterize properties of discrete Bayesian networks that contribute to the presence of variables (network nodes) with high IMP scores. We have found that there is a relationship between the IMP score of a node and its territory size, i.e., its position along a pathway with one source: nodes far from the source display larger IMP scores than those closer to the source, and longer pathways display larger maximum IMP scores. This appears to be a consequence of the fact that nodes with small territory have larger probability of having highly covariate predictors, which leads to smaller IMP scores. In addition, a larger number of XOR and NXOR predictive logic relationships has positive influence over the maximum IMP score found in the pathway. This work presents analytical results based on a simple structure network and an analysis involving random networks constructed by computational simulations. Finally, results from a real Bayesian network application are provided. (C) 2012 Elsevier Inc. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The aim of this work is to use GIS integration data to characterize sedimentary processes in a SubTropical lagoon environment. The study area was the Canan,ia Inlet estuary in the southeastern section of the Canan,ia Lagoon Estuarine System (CLES), state of So Paulo, Brazil (25A degrees 03'S/47A degrees 53'W). The area is formed by the confluence of two estuarine channels forming a bay-shaped water body locally called "Trapand, Bay". The region is surrounded by one of the most preserved tracts of Atlantic Rain Forest in Southwestern Brazil and presents well-developed mangroves and marshes. In this study a methodology was developed using integrated a GIS database based on bottom sediment parameters, geomorphological data, remote sensing images, Hidrodynamical Modeling data and geophysical parameters. The sediment grain size parameters and the bottom morphology of the lagoon were also used to develop models of net sediment transport pathways. It was possible to observe that the sediment transport vectors based on the grain size model had a good correlation with the transport model based on the bottom topography features and Hydrodynamic model, especially in areas with stronger energetic conditions, with a minor contribution of finer sediments. This relation is somewhat less evident near shallower banks and depositional features. In these regions the organic matter contents in the sediments was a good complementary tool for inferring the hydrodynamic and depositional conditions (i.e. primary productivity, sedimentation rates, sources, oxi-reduction rates).

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Fraud is a global problem that has required more attention due to an accentuated expansion of modern technology and communication. When statistical techniques are used to detect fraud, whether a fraud detection model is accurate enough in order to provide correct classification of the case as a fraudulent or legitimate is a critical factor. In this context, the concept of bootstrap aggregating (bagging) arises. The basic idea is to generate multiple classifiers by obtaining the predicted values from the adjusted models to several replicated datasets and then combining them into a single predictive classification in order to improve the classification accuracy. In this paper, for the first time, we aim to present a pioneer study of the performance of the discrete and continuous k-dependence probabilistic networks within the context of bagging predictors classification. Via a large simulation study and various real datasets, we discovered that the probabilistic networks are a strong modeling option with high predictive capacity and with a high increment using the bagging procedure when compared to traditional techniques. (C) 2012 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Semi-qualitative probabilistic networks (SQPNs) merge two important graphical model formalisms: Bayesian networks and qualitative probabilistic networks. They provade a very Complexity of inferences in polytree-shaped semi-qualitative probabilistic networks and qualitative probabilistic networks. They provide a very general modeling framework by allowing the combination of numeric and qualitative assessments over a discrete domain, and can be compactly encoded by exploiting the same factorization of joint probability distributions that are behind the bayesian networks. This paper explores the computational complexity of semi-qualitative probabilistic networks, and takes the polytree-shaped networks as its main target. We show that the inference problem is coNP-Complete for binary polytrees with multiple observed nodes. We also show that interferences can be performed in time linear in the number of nodes if there is a single observed node. Because our proof is construtive, we obtain an efficient linear time algorithm for SQPNs under such assumptions. To the best of our knowledge, this is the first exact polynominal-time algorithm for SQPn. Together these results provide a clear picture of the inferential complexity in polytree-shaped SQPNs.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: A current challenge in gene annotation is to define the gene function in the context of the network of relationships instead of using single genes. The inference of gene networks (GNs) has emerged as an approach to better understand the biology of the system and to study how several components of this network interact with each other and keep their functions stable. However, in general there is no sufficient data to accurately recover the GNs from their expression levels leading to the curse of dimensionality, in which the number of variables is higher than samples. One way to mitigate this problem is to integrate biological data instead of using only the expression profiles in the inference process. Nowadays, the use of several biological information in inference methods had a significant increase in order to better recover the connections between genes and reduce the false positives. What makes this strategy so interesting is the possibility of confirming the known connections through the included biological data, and the possibility of discovering new relationships between genes when observed the expression data. Although several works in data integration have increased the performance of the network inference methods, the real contribution of adding each type of biological information in the obtained improvement is not clear. Methods: We propose a methodology to include biological information into an inference algorithm in order to assess its prediction gain by using biological information and expression profile together. We also evaluated and compared the gain of adding four types of biological information: (a) protein-protein interaction, (b) Rosetta stone fusion proteins, (c) KEGG and (d) KEGG+GO. Results and conclusions: This work presents a first comparison of the gain in the use of prior biological information in the inference of GNs by considering the eukaryote (P. falciparum) organism. Our results indicates that information based on direct interaction can produce a higher improvement in the gain than data about a less specific relationship as GO or KEGG. Also, as expected, the results show that the use of biological information is a very important approach for the improvement of the inference. We also compared the gain in the inference of the global network and only the hubs. The results indicates that the use of biological information can improve the identification of the most connected proteins.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: schizophrenia's endophenotipic profile is not only generally complex, but often varies from case to case. The perspective of trying to define specific anatomic correlates of the syndrome has led to disappointing results. In that context, neurophysiologic hypotheses (e. g. glutamatergic hypothesis) and connectivity hypotheses became prominent. Nevertheless, despite their commitment to the principle of denying 'localist' views and approaching the syndrome's endophenotype from a whole brain perspective, efforts to integrate both have not flourished at this moment in time. Objectives: This paper aims to introduce a new etiological model that integrates the glutamatergic and the WM (WM) hypotheses of schizophrenia's etiology. This model proposes to serve as a framework in order to relate to patterns of brain abnormalities from the onset of the syndrome to stages of advanced chronification. Highlights: Neurotransmitter abnormalities forego noticeable WM abnormalities. The former, chiefly represented by NMDAR hypo-function and associated molecular cascades, is related to the first signs of cell loss. This process is both directly and indirectly integrated to the underpinning of WM structural abnormalities; not only is the excess of glutamate toxic to the WM, but its disruption is associated to the expression of known genetic risk factors (e. g., NRG-1). A second level of the model develops the idea that abnormal neurotransmission within specific neural populations ('motifs') impair particular cognitive abilities, while subsequent WM structural abnormalities impair the integration of brain functions and multimodality. As a result of this two-stage dynamic, the affected individual progresses from experiencing specific cognitive and psychological deficits, to a condition of cognitive and existential fragmentation, linked to hardly reversible decreases in psychosocial functioning.