91 results for vase, orange tree


Relevance: 20.00%

Abstract:

We present TANC, a tree-augmented naive (TAN) classifier based on imprecise probabilities. TANC models prior near-ignorance via the Extreme Imprecise Dirichlet Model (EDM). A first contribution of this paper is the experimental comparison between EDM and the global Imprecise Dirichlet Model (IDM) using the naive credal classifier (NCC), with the aim of showing that EDM is a sensible approximation of the global IDM. TANC is able to deal with missing data in a conservative manner by considering all possible completions (without assuming them to be missing-at-random), while avoiding an exponential increase in computational time. By experiments on real data sets, we show that TANC is more reliable than the Bayesian TAN and that it provides better performance compared to previous TANs based on imprecise probabilities. Yet, TANC is sometimes outperformed by NCC because the learned TAN structures are too complex; this calls for novel algorithms for learning the TAN structures, better suited for an imprecise probability classifier.

Relevance: 20.00%

Abstract:

Retrospective clinical datasets are often characterized by a relatively small sample size and many missing data. In this case, a common way of handling the missingness is to discard patients with missing covariates from the analysis, further reducing the sample size. Alternatively, if the mechanism that generated the missing data allows, incomplete data can be imputed on the basis of the observed data, avoiding the reduction of the sample size and allowing methods that require complete data to be applied later on. Moreover, methodologies for data imputation might depend on the particular purpose and might achieve better results by considering specific characteristics of the domain. The problem of missing data treatment is studied in the context of survival tree analysis for the estimation of a prognostic patient stratification. Survival tree methods usually address this problem by using surrogate splits, that is, splitting rules that use other variables yielding similar results to the original ones. Instead, our methodology consists in modeling the dependencies among the clinical variables with a Bayesian network, which is then used to perform data imputation, thus allowing the survival tree to be applied to the completed dataset. The Bayesian network is learned directly from the incomplete data using a structural expectation–maximization (EM) procedure in which the maximization step is performed with an exact anytime method, so that the only source of approximation is the EM formulation itself. On both simulated and real data, our proposed methodology usually outperformed several existing methods for data imputation, and the imputation so obtained improved the stratification estimated by the survival tree (especially with respect to using surrogate splits).

Relevance: 20.00%

Abstract:

In this paper we present TANC, a tree-augmented naive credal classifier based on imprecise probabilities; it models prior near-ignorance via the Extreme Imprecise Dirichlet Model (EDM) (Cano et al., 2007) and deals conservatively with missing data in the training set, without assuming them to be missing-at-random. The EDM is an approximation of the global Imprecise Dirichlet Model (IDM), which considerably simplifies the computation of upper and lower probabilities; yet, since it was introduced only recently, the quality of the approximation still needs to be verified. As a first contribution, we extensively compare the output of the naive credal classifier (one of the few cases in which the global IDM can be exactly implemented) when learned with the EDM and the global IDM; the output of the classifier appears to be identical in the vast majority of cases, thus supporting the adoption of the EDM in real classification problems. Then, by experiments we show that TANC is more reliable than the precise TAN (learned with uniform prior), and also that it provides better performance compared to a previous (Zaffalon, 2003) TAN model based on imprecise probabilities. TANC treats missing data by considering all possible completions of the training set, while avoiding an exponential increase in computational time; finally, we present some preliminary results with missing data.

Relevance: 20.00%

Abstract:

This paper strengthens the NP-hardness result for the (partial) maximum a posteriori (MAP) problem in Bayesian networks with topology of trees (every variable has at most one parent) and variable cardinality at most three. MAP is the problem of querying the most probable state configuration of some (not necessarily all) of the network variables given evidence. It is demonstrated that the problem remains hard even in such simplistic networks.
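To make the partial MAP query concrete, here is a minimal brute-force sketch on a hypothetical three-variable chain A -> B -> C (a tree in which every variable has at most one parent). The network, the variable names and all probability values are made up for illustration and do not come from the paper. The query maximizes over A and C while summing out B; enumeration like this grows exponentially with the number of variables, which is why it only serves as an illustration.

```python
from itertools import product

# Hypothetical chain A -> B -> C with binary variables.
# All numbers are illustrative assumptions, not taken from the paper.
p_a = {0: 0.6, 1: 0.4}
p_b_given_a = {0: {0: 0.7, 1: 0.3}, 1: {0: 0.2, 1: 0.8}}  # indexed [a][b]
p_c_given_b = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.4, 1: 0.6}}  # indexed [b][c]


def joint(a, b, c):
    """Joint probability P(A=a, B=b, C=c) of the chain."""
    return p_a[a] * p_b_given_a[a][b] * p_c_given_b[b][c]


def partial_map():
    """Score every (a, c) configuration by summing out B, then pick the best."""
    scores = {
        (a, c): sum(joint(a, b, c) for b in (0, 1))
        for a, c in product((0, 1), repeat=2)
    }
    best = max(scores, key=scores.get)
    return best, scores


best, scores = partial_map()
print(best, scores[best])
```

Here B is neither queried nor observed, which is what distinguishes partial MAP from finding the most probable configuration of all variables.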

Relevance: 20.00%

Abstract:

The nature of photon interaction and reaction pH can have significant impacts on semiconductor photocatalysis. This paper describes the effect of pH on the photonic efficiency of photocatalytic reactions in the aqueous phase using TiO2 catalysts. The reactor was irradiated using periodic illumination with UV-LEDs, with the illumination duty cycle (γ) controlled through a series of light and dark times (Ton/Toff). Photonic efficiencies for methyl orange degradation were found to be comparable at high γ irrespective of pH. At lower γ, pH effects on photonic efficiency were very distinct across acidic, neutral and alkaline pH, indicating an effect of complementary parameters. The results suggest that photonic efficiency is greatest as the illumination time, Ton, approaches the interfacial electron-transfer characteristic time (which is within the range of this study) or charge-carrier lifetimes (upon extrapolation), and also when electrostatic attraction between surface-trapped holes, {TiIVOH}ads+, and substrate molecules is strongest.
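As a small illustration of the quantities involved, the sketch below uses the conventional definition of duty cycle, γ = Ton / (Ton + Toff), and the resulting time-averaged intensity of a pulsed source. That the paper uses exactly this definition is an assumption; the function names and the example values are hypothetical.

```python
def duty_cycle(t_on, t_off):
    """Fraction of each period during which the light is on (assumed definition)."""
    if t_on < 0 or t_off < 0 or t_on + t_off == 0:
        raise ValueError("t_on and t_off must be non-negative and not both zero")
    return t_on / (t_on + t_off)


def average_intensity(peak_intensity, t_on, t_off):
    """Time-averaged intensity of a source pulsed between peak_intensity and zero."""
    return peak_intensity * duty_cycle(t_on, t_off)


# Hypothetical example: 1 ms on, 3 ms off, arbitrary intensity units.
print(duty_cycle(1.0, 3.0))
print(average_intensity(10.0, 1.0, 3.0))
```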

Relevance: 20.00%

Abstract:

The use of controlled periodic illumination with UV LEDs for enhancing the photonic efficiency of photocatalytic decomposition processes in water has been investigated using methyl orange as a model compound. The impact of the length of the light and dark time periods (T ON/T OFF times) on photodegradation and photonic efficiency using a UV LED-illuminated photoreactor has been studied. The results show an inverse dependency of the photonic efficiency on duty cycle and very little effect of the T ON or T OFF time periods, indicating no effect of rate-limiting steps through mass diffusion or adsorption/desorption in the reaction. For this reactor, the photonic efficiency under controlled periodic illumination (CPI) matches that of continuous illumination for the same average UV light intensities. Furthermore, under CPI conditions, the photonic efficiency is inversely related to the average UV light intensity in the reactor, in the millisecond time regime. This is the first study to investigate the effect of controlled periodic illumination using ultra band gap UV LED light sources in the photocatalytic destruction of dye compounds using titanium dioxide. The results not only enhance the understanding of the effect of periodic illumination on photocatalytic processes but also provide greater insight into the potential of these light sources in photocatalytic reactions.

Relevance: 20.00%

Abstract:

Quantum yields of the photocatalytic degradation of methyl orange under controlled periodic illumination (CPI) have been modelled using existing models. A modified Langmuir-Hinshelwood (L-H) rate equation was used to predict the degradation reaction rates of methyl orange at various duty cycles and a simple photocatalytic model was applied in modelling quantum yield enhancement of the photocatalytic process due to the CPI effect. A good agreement between the modelled and experimental data was observed for quantum yield modelling. The modified L-H model, however, did not accurately predict the photocatalytic decomposition of the dye under periodic illumination.
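For orientation, the sketch below evaluates the standard (unmodified) textbook Langmuir-Hinshelwood rate expression, r = k K C / (1 + K C). The abstract does not state the form of the modified equation, so this is not the authors' model; the parameter names and values are illustrative assumptions.

```python
def lh_rate(k, K, C):
    """Standard Langmuir-Hinshelwood rate r = k*K*C / (1 + K*C).

    k: apparent rate constant, K: adsorption equilibrium constant,
    C: substrate concentration (units are up to the caller).
    """
    if C < 0 or K < 0:
        raise ValueError("C and K must be non-negative")
    return k * K * C / (1.0 + K * C)


# Hypothetical values: the rate is roughly first order at low concentration
# and saturates towards k at high concentration.
for conc in (0.01, 1.0, 100.0):
    print(conc, lh_rate(k=2.0, K=1.0, C=conc))
```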

Relevance: 20.00%

Abstract:

The efficiency of solar-energy-conversion devices depends on the absorption region and intensity of the photon collectors. Organic chromophores, which have been widely stabilized on inorganic semiconductors for light trapping, are limited by the interface between the chromophore and semiconductor. Herein we report a novel orange zinc germanate (Zn-Ge-O) with a chromophore-like structure, by which the absorption region can be dramatically expanded. Structural characterizations and theoretical calculations together reveal that the origin of visible-light response can be attributed to the unusual metallic Ge-Ge bonds which act in a similar way to organic chromophores. Benefiting from the enhanced light harvest, the orange Zn-Ge-O demonstrates superior capacity for solar-driven hydrogen production.

Relevance: 20.00%

Abstract:

This work presents a new general purpose classifier named Averaged Extended Tree Augmented Naive Bayes (AETAN), which is based on combining the advantageous characteristics of Extended Tree Augmented Naive Bayes (ETAN) and Averaged One-Dependence Estimator (AODE) classifiers. We describe the main properties of the approach and algorithms for learning it, along with an analysis of its computational time complexity. Empirical results with numerous data sets indicate that the new approach is superior to ETAN and AODE in terms of both zero-one classification accuracy and log loss. It also compares favourably against weighted AODE and hidden Naive Bayes. The learning phase of the new approach is slower than that of its competitors, while the time complexity for the testing phase is similar. Such characteristics suggest that the new classifier is ideal in scenarios where online learning is not required.

Relevance: 20.00%

Abstract:

This work proposes an extended version of the well-known tree-augmented naive Bayes (TAN) classifier where the structure learning step is performed without requiring features to be connected to the class. Based on a modification of Edmonds' algorithm, our structure learning procedure explores a superset of the structures that are considered by TAN, yet achieves global optimality of the learning score function in a very efficient way (quadratic in the number of features, the same complexity as learning TANs). We enhance our procedure with a new score function that only takes into account arcs that are relevant to predict the class, as well as an optimization over the equivalent sample size during learning. These ideas may be useful for structure learning of Bayesian networks in general. A range of experiments shows that we obtain models with better prediction accuracy than naive Bayes and TAN, and comparable to the accuracy of the state-of-the-art classifier averaged one-dependence estimator (AODE). We release our implementation of ETAN so that it can be easily installed and run within Weka.

Relevance: 20.00%

Abstract:

Learning Bayesian networks with bounded tree-width has attracted much attention recently, because low tree-width allows exact inference to be performed efficiently. Some existing methods [12, 14] tackle the problem by using k-trees to learn the optimal Bayesian network with tree-width up to k. In this paper, we propose a sampling method to efficiently find representative k-trees by introducing an Informative score function to characterize the quality of a k-tree. The proposed algorithm can efficiently learn a Bayesian network with tree-width at most k. Experimental results indicate that our approach is comparable with exact methods, but is much more computationally efficient.
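As background on the objects being sampled, the sketch below builds a random k-tree by the usual recursive construction: start from a clique on k+1 vertices, then attach each new vertex to all members of an existing k-clique. This is only an illustration of what a k-tree is. It does not sample uniformly, it does not use the Informative score, and it is not the method proposed in the paper; the function name and parameters are hypothetical.

```python
import random
from itertools import combinations


def random_k_tree(n, k, seed=None):
    """Return the edge set of a randomly grown k-tree on vertices 0..n-1."""
    if k < 1 or n < k + 1:
        raise ValueError("need k >= 1 and n >= k + 1")
    rng = random.Random(seed)
    # Base case: a clique on the first k + 1 vertices.
    edges = set(combinations(range(k + 1), 2))
    k_cliques = [frozenset(c) for c in combinations(range(k + 1), k)]
    for v in range(k + 1, n):
        base = rng.choice(k_cliques)
        # Connect the new vertex to every member of the chosen k-clique.
        edges.update((u, v) for u in base)
        # Each new k-clique contains v and all but one vertex of the base.
        for u in base:
            k_cliques.append((base - {u}) | {v})
    return edges


edges = random_k_tree(n=6, k=2, seed=0)
print(len(edges), sorted(edges))
```

A k-tree has tree-width exactly k, so any network whose moral graph is a subgraph of a k-tree has tree-width at most k, which is why k-trees are convenient super-structures.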

Relevance: 20.00%

Abstract:

Bounding the tree-width of a Bayesian network can reduce the chance of overfitting, and allows exact inference to be performed efficiently. Several existing algorithms tackle the problem of learning bounded tree-width Bayesian networks by learning from k-trees as super-structures, but they do not scale to large domains and/or large tree-width. We propose a guided search algorithm to find k-trees with maximum Informative score, a measure of the quality of a k-tree in yielding good Bayesian networks. The algorithm achieves close to optimal performance compared to exact solutions in small domains, and can discover better networks than existing approximate methods in large domains. It also provides an optimal elimination order of variables that guarantees small complexity for later runs of exact inference. Comparisons with well-known approaches in terms of learning and inference accuracy illustrate its capabilities.