7 results for tree-augmented-Naive Bayes structure
in Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Abstract:
This work proposes and discusses an approach for inducing Bayesian classifiers that balances the tradeoff between the precise probability estimates produced by time-consuming unrestricted Bayesian networks and the computational efficiency of Naive Bayes (NB) classifiers. The proposed approach is based on the fundamental principles of heuristic search Bayesian network learning. The Markov Blanket concept, as well as a proposed "approximate Markov Blanket", is used to reduce the number of nodes that form the Bayesian network to be induced from data. Consequently, the usually high computational cost of heuristic search learning algorithms can be lessened, while Bayesian network structures better than NB can be achieved. The resulting algorithms, called DMBC (Dynamic Markov Blanket Classifier) and A-DMBC (Approximate DMBC), are empirically assessed in twelve domains that illustrate scenarios of particular interest. The obtained results are compared with NB and Tree Augmented Network (TAN) classifiers, and confirm that both proposed algorithms can provide good classification accuracies and better probability estimates than NB and TAN, while being more computationally efficient than the widely used K2 algorithm.
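As a reading aid for the Markov Blanket concept mentioned in the abstract above, the following is a minimal sketch and not the DMBC or A-DMBC algorithm. It assumes the network is already known and stored as a dictionary mapping each node to its set of parents; the function name `markov_blanket` and the toy network are hypothetical. The blanket of a node is taken as its parents, its children, and the other parents of its children.

```python
def markov_blanket(parents, target):
    """Return the Markov blanket of `target` in a DAG given as {node: set_of_parents}."""
    # Children are the nodes that list `target` among their parents.
    children = {node for node, ps in parents.items() if target in ps}
    # Spouses are the other parents of those children.
    spouses = {p for child in children for p in parents[child]}
    return (set(parents[target]) | children | spouses) - {target}


# Hypothetical toy network, not taken from the paper.
toy = {
    "A": set(),
    "B": set(),
    "C": {"A", "B"},
    "D": {"C"},
    "E": {"D", "F"},
    "F": set(),
}

blanket_of_d = markov_blanket(toy, "D")  # parents {C}, children {E}, spouse {F}
```

Restricting structure learning to the nodes in such a blanket around the class variable is what shrinks the search space; how the paper builds the blanket from data is not described in the abstract.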
Abstract:
The substitution of missing values, also called imputation, is an important data preparation task for many domains. Ideally, the substitution of missing values should not insert biases into the dataset. This aspect has usually been assessed by measures of the prediction capability of imputation methods. Such measures rely on simulating missing entries for some attributes whose values are actually known. These artificially missing values are imputed and then compared with the original values. Although this evaluation is useful, it does not allow the influence of imputed values on the ultimate modelling task (e.g. classification) to be inferred. We argue that imputation cannot be properly evaluated apart from the modelling task. Thus, alternative approaches are needed. This article elaborates on the influence of imputed values in classification. In particular, a practical procedure for estimating the inserted bias is described. As an additional contribution, we have used this procedure to empirically illustrate the performance of three imputation methods (majority, naive Bayes and Bayesian networks) on three datasets. Three classifiers (decision tree, naive Bayes and nearest neighbours) have been used as modelling tools in our experiments. The achieved results illustrate a variety of situations that can arise in data preparation practice.
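The conventional evaluation that the abstract above describes (hide known values, impute them, compare with the originals) can be sketched as follows. This is an illustration under stated assumptions, not the bias-estimation procedure proposed in the article: it uses only majority imputation on a single categorical column, and the names `majority_impute` and `simulated_missing_accuracy` are hypothetical.

```python
import random
from collections import Counter


def majority_impute(column):
    """Replace None entries with the most frequent observed value."""
    observed = [v for v in column if v is not None]
    mode = Counter(observed).most_common(1)[0][0]
    return [mode if v is None else v for v in column]


def simulated_missing_accuracy(column, missing_rate, seed=0):
    """Hide a fraction of known values, impute them, and score the matches."""
    rng = random.Random(seed)
    n_hide = max(1, int(len(column) * missing_rate))
    hidden = rng.sample(range(len(column)), n_hide)
    masked = list(column)
    for i in hidden:
        masked[i] = None  # simulate a missing entry whose true value is known
    imputed = majority_impute(masked)
    hits = sum(imputed[i] == column[i] for i in hidden)
    return hits / n_hide


# Hypothetical column, for illustration only.
colours = ["red"] * 10
score = simulated_missing_accuracy(colours, missing_rate=0.3)
```

A score of this kind measures only how well the hidden values are recovered; as the abstract argues, it says nothing about how the imputed values affect a downstream classifier.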
Abstract:
Royal palm tree peroxidase (RPTP) is a very stable enzyme with regard to acidity, temperature, H2O2, and organic solvents. Thus, RPTP is a promising candidate for developing H2O2-sensitive biosensors for diverse applications in industry and analytical chemistry. RPTP belongs to the family of class III secretory plant peroxidases, which includes horseradish peroxidase isozyme C and the soybean and peanut peroxidases. Here we report the X-ray structure of native RPTP isolated from royal palm tree (Roystonea regia), refined to a resolution of 1.85 Å. RPTP shares the overall folding pattern of the plant peroxidase superfamily, and it contains one heme group and two calcium-binding sites in similar locations. The three-dimensional structure of RPTP was solved for a hydroperoxide complex state, and it revealed a bound 2-(N-morpholino)ethanesulfonic acid (MES) molecule positioned at a putative substrate-binding secondary site. Nine N-glycosylation sites are clearly defined in the RPTP electron-density maps, revealing for the first time the conformations of the glycan chains of this highly glycosylated enzyme. Furthermore, statistical coupling analysis (SCA) of the plant peroxidase superfamily was performed. This sequence-based method identified a set of evolutionarily conserved sites that mapped to regions surrounding the heme prosthetic group. The SCA matrix also predicted a set of energetically coupled residues that are involved in maintaining the structural fold of plant peroxidases. The combination of crystallographic data and SCA provides information about the key structural elements that could help explain the unique stability of RPTP. (C) 2009 Elsevier Inc. All rights reserved.
Abstract:
Pilostyles species (Apodanthaceae) are endoparasites in stems of the plant family Fabaceae. The body comprises masses of parenchyma in the host bark and cortex, with sinkers (groups of twisted tracheal elements surrounded by parenchyma) that enter the secondary xylem of the host plant. Here we report for the first time the effects of Pilostyles parasitism on host secondary xylem. We obtained healthy and parasitized stems from Mimosa foliolosa, M. maguirei and M. setosa and compared vessel element length, fiber length, vessel diameter and vessel frequency, measured through digital imaging. Also, tree height and girth were compared between healthy and parasitized M. setosa. In parasitized plants, plant size, vessel diameter, vessel element length and fiber length are all smaller than in healthy plants, and vessel frequency is greater. These responses to parasitism are similar to those observed in stressed plants. Thus, hosts respond to the parasite by changing their wood micromorphology in favour of increased hydraulic safety.
Abstract:
Rudgea jasminoides (Rubiaceae) is a tropical tree species native to the Atlantic Forest in the south of Brazil. Previous studies with leaf cell walls of R. jasminoides showed a different proportion of cross-linked glycans compared to what is usually reported for eudicots. However, due to the difficulties of working with whole plant organs, cell suspensions of R. jasminoides, consisting predominantly of undifferentiated cells with mainly primary cell walls, were used to examine cell walls and extracellular soluble polysaccharides (EP) released into the culture medium. Sugar composition and linkage analysis showed homogalacturonans, xylogalacturonans and arabinogalactans to be the predominant EP. In the cell wall, homogalacturonans and arabinogalactans are the major pectins, and xyloglucans and xylans are the major cross-linking glycans. The presence of xylogalacturonans in the R. jasminoides cell cultures seems to be related to the occurrence of a homogeneous cell suspension with loosely attached cells. Although all alkali extractions from the cell walls yielded amounts of xyloglucan that exceed those of the xylans, the latter were found in a proportion higher than what has usually been reported for primary cell walls of most eudicots. The xyloglucan from cell walls of cell suspension cultures of R. jasminoides has low fucosylation levels and a high proportion of galactosyl residues, a branching pattern commonly found in storage cell-wall xyloglucans.
Abstract:
A variety of human-induced disturbances such as forest fragmentation and recovery after deforestation for pasture or agricultural activities have resulted in a complex landscape mosaic in the Una region of northeastern Brazil. Using a set of vegetation descriptors, we investigated the main structural changes observed in forest categories that comprise the major components of the regional landscape and searched for potential key descriptors that could be used to discriminate among different forest categories. We assessed the forest structure of five habitat categories defined as (1) interiors and (2) edges of large fragments of old-growth forest (>1000 ha), (3) interiors and (4) edges of small forest fragments (<100 ha), and (5) early secondary forests. The forest descriptors used here were: frequency of herbaceous lianas and woody climbers, number of standing dead trees, number of fallen trunks, litter depth, number of pioneer plants (early secondary and shade-intolerant species), vertical foliage stratification profile and distribution of trees in different diameter classes. Edges and interiors of forest fragments were significantly different only in the number of standing dead trees. Secondary forests and edges of fragments showed differences in litter depth, fallen trunks and number of pioneer trees, and secondary forests were significantly different from fragment interiors in the number of standing dead trees and the number of pioneer trees. Horizontal and vertical structure evaluated via ordination analysis showed that fragment interiors, compared to secondary forests, were characterized by a greater number of medium (25-35 cm) and large (35-50 cm) trees and smaller numbers of thin trees (5-10 cm). There was great heterogeneity at the edges of small and large fragments, as these sites were distributed along almost the entire gradient.
Most interiors of large and small fragments presented higher foliage densities at higher strata (15-20 m and 20-25 m height) and lower densities at 1-5 m. All secondary forests and some fragment edge sites showed the opposite tendency. A discriminant function highlighted differences among forest categories, with transects of large fragment interiors and secondary forests representing two extremes along a disturbance gradient determined by foliage structure (densities at 15-20 m and 20-25 m), and with the edges of both large and small fragments and the interiors of small fragments scattered across the gradient. The major underlying processes determining patterns of forest disturbance in the study region are discussed, highlighting the importance of forest fragments, regardless of their size, as forests recovering after clear-cutting show a markedly distinct structure, with profound implications for fauna movements. (C) 2009 Elsevier B.V. All rights reserved.
Abstract:
Augmented Lagrangian methods for large-scale optimization usually require efficient algorithms for minimization with box constraints. On the other hand, active-set box-constraint methods employ unconstrained optimization algorithms for minimization inside the faces of the box. Several approaches may be employed for computing internal search directions in the large-scale case. In this paper a minimal-memory quasi-Newton approach with secant preconditioners is proposed, taking into account the structure of Augmented Lagrangians that come from the popular Powell-Hestenes-Rockafellar scheme. A combined algorithm that uses the quasi-Newton formula or a truncated-Newton procedure, depending on the presence of active constraints in the penalty-Lagrangian function, is also suggested. Numerical experiments using the CUTE collection are presented.
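For orientation, the Powell-Hestenes-Rockafellar augmented Lagrangian named in the abstract above is commonly written as below. The notation is an assumption, since the abstract defines none: f is the objective, h_i are the equality constraints, g_j are the inequality constraints, λ and μ are multiplier estimates, and ρ > 0 is the penalty parameter.

```latex
% Problem (assumed form): minimize f(x) subject to h(x) = 0, g(x) <= 0, l <= x <= u
L_\rho(x, \lambda, \mu) = f(x) + \frac{\rho}{2} \left[
    \sum_{i=1}^{m} \left( h_i(x) + \frac{\lambda_i}{\rho} \right)^{2}
  + \sum_{j=1}^{p} \max\left\{ 0,\; g_j(x) + \frac{\mu_j}{\rho} \right\}^{2}
\right]
```

Each outer iteration of such a method minimizes this function in x subject only to the bounds l ≤ x ≤ u, which is the box-constrained subproblem the abstract refers to.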