974 results for Classification Tree Pruning
Abstract:
Considering the importance of water content for the conservation and storage of seeds, and the involvement of soluble carbohydrates and lipids in embryo development, a comparative study was carried out among the seeds of Inga vera (ingá) and Eugenia uniflora (pitanga), both classified as recalcitrant, and Caesalpinia echinata (brazilwood) and Erythrina speciosa (mulungu), considered orthodox seeds. Low concentrations of cyclitols (0.3-0.5%), raffinose family oligosaccharides (ca. 0.05%) and unsaturated fatty acids (0-19%) were found in the seeds of ingá and pitanga, while larger amounts of cyclitols (2-3%) and raffinose (4.6-13%) were found in brazilwood and mulungu, respectively. These results, in addition to higher proportions of unsaturated fatty acids (53-71%) in orthodox seeds, suggested that sugars and lipids played an important role in water movement, protecting the embryo cell membranes against injuries during dehydration.
Abstract:
The genus Callistomys belongs to the rodent family Echimyidae, subfamily Echimyinae, and its only living representative is Callistomys pictus, a rare and vulnerable endemic species of the state of Bahia, Brazil. Callistomys has been previously classified as Nelomys, Loncheres, Isothrix and Echimys. In this paper we present the karyotype of Callistomys pictus, including CBG and GTG-banding patterns and silver staining of the nucleolus organizer regions (Ag-NORs). Comments on Callistomys pictus morphological traits and a compilation of Echimyinae chromosomal data are also included. Our analyses revealed that Callistomys can be recognized both by its distinctive morphology and by its karyotype.
Abstract:
Saving our science from ourselves: the plight of biological classification. Biological classification (nomenclature, taxonomy, and systematics) is being sold short. The desire for new technologies and for faster and cheaper taxonomic descriptions, identifications, and revisions is symptomatic of a lack of appreciation and understanding of classification. The problem of gadget-driven science, a lack of best practice and the inability to accept classification as a descriptive and empirical science are discussed. The worst-case scenario is a future in which classifications are purely artificial and uninformative.
Abstract:
Due to the imprecise nature of biological experiments, biological data is often characterized by the presence of redundant and noisy data. This may be due to errors that occurred during data collection, such as contamination of laboratory samples. This is the case for gene expression data, where the equipment and tools currently used frequently produce noisy biological data. Machine Learning algorithms have been successfully used in gene expression data analysis. Although many Machine Learning algorithms can deal with noise, detecting and removing noisy instances from the training data set can help the induction of the target hypothesis. This paper evaluates the use of distance-based pre-processing techniques for noise detection in gene expression data classification problems. This evaluation analyzes the effectiveness of the techniques investigated in removing noisy data, measured by the accuracy obtained by different Machine Learning classifiers over the pre-processed data.
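The abstract does not name the distance-based techniques it evaluates. As an illustration only, the sketch below assumes an Edited-Nearest-Neighbour style filter, one common distance-based approach: an instance is treated as noisy when its label disagrees with the majority label of its k nearest neighbours. The function name `enn_filter` and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter

def enn_filter(X, y, k=3):
    """Return indices of instances kept by an ENN-style noise filter.

    Assumption (not from the abstract): an instance is dropped when its
    label differs from the majority label of its k nearest neighbours,
    using Euclidean distance.
    """
    kept = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        # Distances from instance i to every other instance, nearest first
        dists = sorted(
            (math.dist(xi, xj), j) for j, xj in enumerate(X) if j != i
        )
        neighbour_labels = [y[j] for _, j in dists[:k]]
        majority = Counter(neighbour_labels).most_common(1)[0][0]
        if majority == yi:
            kept.append(i)
    return kept

# Hypothetical toy data: two tight clusters plus one point labelled "b"
# that sits in the middle of the "a" cluster.
X = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),
     (5.0, 5.0), (5.1, 5.0), (5.0, 5.1), (5.1, 5.1),
     (0.05, 0.05)]
y = ["a", "a", "a", "a", "b", "b", "b", "b", "b"]
kept = enn_filter(X, y, k=3)
print(kept)
```

In this toy case only the last instance is filtered out; a classifier would then be trained on the remaining instances and its accuracy compared with training on the unfiltered set.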
Abstract:
PURPOSE: The main goal of this study was to develop and compare two different techniques for classification of specific types of corneal shapes when Zernike coefficients are used as inputs. A feed-forward artificial Neural Network (NN) and discriminant analysis (DA) techniques were used. METHODS: The inputs for both the NN and DA were the first 15 standard Zernike coefficients for 80 previously classified corneal elevation data files from an Eyesys System 2000 Videokeratograph (VK), installed at the Departamento de Oftalmologia of the Escola Paulista de Medicina, São Paulo. The NN had 5 output neurons which were associated with 5 typical corneal shapes: keratoconus, with-the-rule astigmatism, against-the-rule astigmatism, "regular" or "normal" shape and post-PRK. RESULTS: The NN and DA responses were statistically analyzed in terms of precision ([true positive+true negative]/total number of cases). Mean overall results for all cases for the NN and DA techniques were, respectively, 94% and 84.8%. CONCLUSION: Although we used a relatively small database, results obtained in the present study indicate that Zernike polynomials as descriptors of corneal shape may be a reliable parameter as input data for diagnostic automation of VK maps, using either NN or DA.
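The measure quoted in the RESULTS, ([true positive + true negative] / total number of cases), can be written out as a small function. This is a minimal sketch; the function name and the counts in the example are hypothetical and are not the study's data.

```python
def correct_fraction(tp, tn, fp, fn):
    """Fraction of correctly classified cases: (TP + TN) / total."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("at least one case is required")
    return (tp + tn) / total

# Hypothetical counts for one class in a set of 80 cases (illustration only)
rate = correct_fraction(tp=14, tn=61, fp=3, fn=2)
print(f"{rate:.4f}")
```

With several output classes, a measure of this kind is typically computed per class in a one-versus-rest fashion and then averaged; whether the study did so is not stated in the abstract.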
Abstract:
This paper describes a new food classification which assigns foodstuffs according to the extent and purpose of the industrial processing applied to them. Three main groups are defined: unprocessed or minimally processed foods (group 1), processed culinary and food industry ingredients (group 2), and ultra-processed food products (group 3). The use of this classification is illustrated by applying it to data collected in the Brazilian Household Budget Survey, which was conducted in 2002/2003 through a probabilistic sample of 48,470 Brazilian households. The average daily food availability was 1,792 kcal/person, with 42.5% from group 1 (mostly rice and beans and meat and milk), 37.5% from group 2 (mostly vegetable oils, sugar, and flours), and 20% from group 3 (mostly breads, biscuits, sweets, soft drinks, and sausages). The share of group 3 foods increased with income, and represented almost one third of all calories in higher income households. The impact of the replacement of group 1 foods and group 2 ingredients by group 3 products on the overall quality of the diet, eating patterns and health is discussed.
Abstract:
In 2000, an outbreak of sylvatic yellow fever possibly occurred in gallery forests of the Grande river in the Paraná basin in the northwestern region of São Paulo state. The aim of this study was to obtain information on the bionomics of Haemagogus and other mosquitoes inside tree holes in that area. Eighteen open tree holes were sampled for immature specimens. Adults were collected twice a month in the forest in Santa Albertina county from July 2000 to June 2001. The seasonal frequency of fourth instars was obtained by the Williams geometric mean (Mw), while the adult frequency was estimated either by hourly arithmetic or Williams' means. Cole's index was applied to evaluate larval inter-specific associations. Among the ten mosquito species identified, the most abundant was Aedes terrens Walker, followed by Sabethes tridentatus Cerqueira and Haemagogus janthinomys Dyar. Larval and adult abundance of these species was higher in summer than in winter. Although larval abundance of Hg. janthinomys peaked in the rainy season, correlation with rainfall was not significant. Six groups of larval associations were distinguished, one of which was the most positively stable. The Hg. janthinomys and Ae. terrens association was significant, and Limatus durhamii Theobald was the species with the most negative associations.
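The Williams geometric mean used for the larval counts is a modified geometric mean that tolerates zero counts: add 1 to each count, average the logarithms, take the antilogarithm and subtract 1. A minimal sketch follows; the function name and the sample counts are hypothetical.

```python
import math

def williams_mean(counts):
    """Williams geometric mean: exp(mean(ln(x + 1))) - 1."""
    if not counts:
        raise ValueError("at least one count is required")
    if any(c < 0 for c in counts):
        raise ValueError("counts must be non-negative")
    # log1p(x) = ln(x + 1); expm1(m) = exp(m) - 1
    mean_log = sum(math.log1p(c) for c in counts) / len(counts)
    return math.expm1(mean_log)

# Hypothetical counts of fourth instars from several tree holes
mw = williams_mean([0, 3, 12, 1, 0, 7])
print(round(mw, 2))
```

Compared with the arithmetic mean, this statistic damps the influence of occasional very large catches, which is why it is often preferred for skewed mosquito count data.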
Abstract:
The weevil subfamily Scolytinae includes beetles which may feed on the bark, trunk or roots of both live and dead trees and are sometimes considered forest and silvicultural pests. Less frequently, some species feed on seeds and may cause economic losses when associated with plant cultivars. Spermophthorus apuleiae Costa-Lima is a Neotropical Scolytinae formerly recorded as "associated" with seeds of Caesalpinia ferrea var. leiostachya Benth, a Brazilian tree popularly known in Portuguese as "pau-ferro". Hitherto, it was not clear whether these beetles actually feed on the seeds of that plant. In order to investigate the ability of S. apuleiae to feed on seeds of "pau-ferro", observations were made and colonies of these beetles were established. Neither in the field nor in captivity were the beetles observed feeding on the seeds. Even when beetles were exposed to seeds as the only source of food, they were incapable of boring into or eating the seeds and died. Our data therefore suggest that S. apuleiae is a frugivorous species which peculiarly does not eat seeds of "pau-ferro".
Abstract:
This work proposes a new approach using a committee machine of artificial neural networks to classify masses found in mammograms as benign or malignant. Three shape factors, three edge-sharpness measures, and 14 texture measures are used for the classification of 20 regions of interest (ROIs) related to malignant tumors and 37 ROIs related to benign masses. A group of multilayer perceptrons (MLPs) is employed as a committee machine of neural network classifiers. The classification results are reached by combining the responses of the individual classifiers. Experiments involving changes in the learning algorithm of the committee machine are conducted. The classification accuracy is evaluated using the area A(z) under the receiver operating characteristics (ROC) curve. The A(z) result for the committee machine is compared with the A(z) results obtained using MLPs and single-layer perceptrons (SLPs), as well as a linear discriminant analysis (LDA) classifier. Tests are carried out using the Student's t-distribution. The committee machine classifier outperforms the MLP, SLP, and LDA classifiers in the following cases: with the shape measure of spiculation index, the A(z) values of the four methods are, in order, 0.93, 0.84, 0.75, and 0.76; and with the edge-sharpness measure of acutance, the values are 0.79, 0.70, 0.69, and 0.74. Although the features with which improvement is obtained with the committee machines are not the same as those that provided the maximal value of A(z) (A(z) = 0.99 with some shape features, with or without the committee machine), they correspond to features that are not critically dependent on the accuracy of the boundaries of the masses, which is an important result. (c) 2008 SPIE and IS&T.
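The abstract says only that the committee result is reached "by combining the responses of the individual classifiers". The sketch below assumes the simplest combination rule, averaging each member's score per case, and computes the area under the ROC curve as the fraction of (malignant, benign) pairs ranked correctly, with ties counted as one half. The function names and scores are hypothetical; the paper's actual combination rule and A(z) estimation method may differ.

```python
def committee_scores(member_scores):
    """Average the per-case scores of all committee members (assumed rule)."""
    n_members = len(member_scores)
    return [sum(case) / n_members for case in zip(*member_scores)]

def roc_area(scores, labels):
    """Area under the ROC curve via pairwise comparison (1 = positive)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(
        1.0 if p > q else 0.5 if p == q else 0.0
        for p in pos for q in neg
    )
    return wins / (len(pos) * len(neg))

# Hypothetical scores from two members on four cases (1 = malignant)
labels = [1, 1, 0, 0]
member_1 = [0.9, 0.4, 0.5, 0.1]
member_2 = [0.3, 0.8, 0.2, 0.6]
combined = committee_scores([member_1, member_2])

print(roc_area(member_1, labels), roc_area(member_2, labels),
      roc_area(combined, labels))
```

In this toy case each member misranks one pair, but their errors fall on different cases, so the averaged scores rank every pair correctly. That is the general intuition behind committee machines, not a reproduction of the reported results.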
Abstract:
Previous studies pointed out that species richness and high density values within the Leguminosae in Brazilian forest fragments affected by fire could be due, at least partially, to the high incidence of root sprouting in this family. However, there are few studies of the factors that induce root sprouting in woody plants after disturbance. We investigated bud formation on root cuttings, considering a man-made disturbance that isolates the root from the shoot apical dominance, in three Leguminosae (Bauhinia forficata Link., Centrolobium tomentosum Guill. ex Benth, and Inga laurina (Sw.) Willd) and one Rutaceae (Esenbeckia febrifuga (St. Hil.) Juss. ex Mart.). All these species resprout frequently after fire. We also attempted to induce bud formation on root systems by removing the main trunk, or by girdling or sectioning the shallow lateral roots, of the forest tree species Esenbeckia febrifuga and Hymenaea courbaril L. We identified the origin of shoot primordia and their early development by fixing the samples in Karnovsky solution, dehydrating in an ethyl alcohol series and embedding in plastic resin. Serial sections were cut on a rotary microtome and stained with toluidine blue O. Permanent slides were mounted in synthetic resin. We observed different modes of bud origin on root cuttings: close to the vascular cambium (C. tomentosum), from the callus (B. forficata and E. febrifuga) and from the phloematic parenchyma proliferation (I. laurina). Fragments of B. forficata root bark were also capable of forming reparative buds from healing phellogen formed in callus on the bark's inner side. In the attempt at bud induction on root systems, Hymenaea courbaril did not respond to any of the induction tests, probably because of plant age. However, Esenbeckia febrifuga roots formed suckers when the main trunk was removed or when the roots were sectioned and isolated from the original plant.
We experimentally demonstrated the ability of four tree species to resprout from roots after disturbance. Our results suggest that the release of apical dominance enables root resprouting in the studied species. Rev. Biol. Trop. 57 (3): 789-800. Epub 2009 September 30.
Abstract:
Due to its relationship with other properties, wood density is the main wood quality parameter. Modern, accurate methods, such as X-ray densitometry, are applied to determine the spatial distribution of density in wood sections and to evaluate wood quality. The objectives of this study were to determine the influence of growing conditions on wood density variation and tree ring demarcation of gmelina trees from fast-growing plantations in Costa Rica. The wood density was determined by the X-ray densitometry method. Wood samples were cut from gmelina trees and were exposed to low X-rays. The radiographic films were developed and scanned using a 256 gray scale with 1000 dpi resolution, and the wood density was determined by CRAD and CERD software. The results showed that tree-ring boundaries were distinctly delimited in trees growing at sites with rainfall lower than 2510 mm/year. It was demonstrated that tree age, climatic conditions and plantation management affect wood density and its variability. The specific effect of these variables on wood density was quantified by the multiple regression method. It was determined that tree age explained 25.8% of the total variation of density and that 19.9% was caused by the climatic conditions where the tree was growing. Wood density was less affected by the intensity of forest management, which accounted for 5.9% of the total variation.
Abstract:
Aims. In this work, we describe the pipeline for the fast supervised classification of light curves observed by the CoRoT exoplanet CCDs. We present the classification results obtained for the first four measured fields, which represent a one-year in-orbit operation. Methods. The basis of the adopted supervised classification methodology has been described in detail in a previous paper, as is its application to the OGLE database. Here, we present the modifications of the algorithms and of the training set to optimize the performance when applied to the CoRoT data. Results. Classification results are presented for the observed fields IRa01, SRc01, LRc01, and LRa01 of the CoRoT mission. Statistics on the number of variables and the number of objects per class are given and typical light curves of high-probability candidates are shown. We also report on new stellar variability types discovered in the CoRoT data. The full classification results are publicly available.
Abstract:
This study was conducted in the Private Reserve Mata do Jambreiro (912 ha), located in the Iron Quadrangle, Minas Gerais, in the southeastern portion of the Espinhaço Range, which is predominantly covered by semideciduous seasonal montane forest. Three topographically and physiognomically similar areas located within a continuous forest fragment, 1.3 to 1.5 km apart, were sampled by the point-quadrat method. In each area, 30 points were marked. Individuals with a minimum perimeter at breast height (PBH) of 15 cm were sampled, totaling 111 species belonging to 40 families. The most representative family was Fabaceae, with 14.29% of the total number of species. Low floristic similarity (5.3% to 34.4%) was observed between the areas, pointing out the importance of the distribution of sample units in continuous fragments. The Shannon diversity index (H') was 4.22 and the Pielou equability (J) was 0.894. Soil analysis showed some differences in chemical composition between the three studied areas and was an important component for the interpretation of the floristic variation found. The low floristic similarity observed here for close areas justifies the requirement of more detailed inventories by Brazilian Environmental Agencies for the legal authorization procedures prior to the establishment of new enterprises. Also, the professionals who conduct rapid inventories, mainly the Environmental Consultants, should give more attention to this kind of floristic variation and to the methods used to inventory complex forests.
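The two diversity statistics reported above follow standard definitions: the Shannon index is H' = -sum(p_i * ln p_i) over species proportions p_i, and the Pielou equability is J = H' / ln(S), where S is the number of species. The sketch below assumes natural logarithms; the function names and abundance counts are hypothetical.

```python
import math

def shannon_index(abundances):
    """Shannon diversity H' = -sum(p_i * ln(p_i)) over species present."""
    counts = [n for n in abundances if n > 0]
    total = sum(counts)
    if total == 0:
        raise ValueError("at least one individual is required")
    return -sum((n / total) * math.log(n / total) for n in counts)

def pielou_equability(abundances):
    """Pielou J = H' / ln(S), with S the number of species present."""
    richness = sum(1 for n in abundances if n > 0)
    if richness < 2:
        raise ValueError("J is undefined for fewer than two species")
    return shannon_index(abundances) / math.log(richness)

# Hypothetical abundances of five species in one sample
sample = [30, 25, 20, 15, 10]
print(round(shannon_index(sample), 3), round(pielou_equability(sample), 3))
```

As a plausibility check on the reported figures, 4.22 / ln(111) is about 0.896, close to the stated J of 0.894; the small gap may come from rounding of H', which suggests the natural-log convention was used.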
Abstract:
An $(n, d)$-expander is a graph $G = (V, E)$ such that for every $X \subseteq V$ with $|X| \le 2n - 2$ we have $|\Gamma_G(X)| \ge (d + 1)|X|$. A tree $T$ is small if it has at most $n$ vertices and has maximum degree at most $d$. Friedman and Pippenger (1987) proved that any $(n, d)$-expander contains every small tree. However, their elegant proof does not seem to yield an efficient algorithm for obtaining the tree. In this paper, we give an alternative result that does admit a polynomial time algorithm for finding the immersion of any small tree in subgraphs $G$ of $(N, D, \lambda)$-graphs $\Lambda$, as long as $G$ contains a positive fraction of the edges of $\Lambda$ and $\lambda/D$ is small enough. In several applications of the Friedman-Pippenger theorem, including the ones in the original paper of those authors, the $(n, d)$-expander $G$ is a subgraph of an $(N, D, \lambda)$-graph as above. Therefore, our result suffices to provide efficient algorithms for such previously non-constructive applications. As an example, we discuss a recent result of Alon, Krivelevich, and Sudakov (2007) concerning embedding nearly spanning bounded degree trees, the proof of which makes use of the Friedman-Pippenger theorem. We shall also show a construction inspired by Wigderson-Zuckerman expander graphs for which any sufficiently dense subgraph contains all trees of sizes and maximum degrees achieving essentially optimal parameters. Our algorithmic approach is based on a reduction of the tree embedding problem to a certain on-line matching problem for bipartite graphs, solved by Aggarwal et al. (1996).
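The expander condition in the first sentence can be checked directly on a very small graph by enumerating every vertex subset of size at most 2n - 2. The sketch below is a brute-force illustration of the definition only, with exponential running time, and is unrelated to the paper's polynomial-time embedding algorithm. It assumes that the neighbourhood of X means the set of vertices adjacent to at least one vertex of X; the function name and example graph are hypothetical.

```python
from itertools import combinations

def is_expander(adj, n, d):
    """Brute-force check of the (n, d)-expander condition.

    adj maps each vertex to the set of its neighbours. The graph passes if
    every non-empty X with |X| <= 2n - 2 has |neighbourhood(X)| >= (d + 1)|X|.
    """
    vertices = list(adj)
    max_size = min(2 * n - 2, len(vertices))
    for size in range(1, max_size + 1):
        for subset in combinations(vertices, size):
            # Vertices adjacent to at least one vertex of the subset
            neighbourhood = set().union(*(adj[v] for v in subset))
            if len(neighbourhood) < (d + 1) * size:
                return False
    return True

# Example: the complete graph on 6 vertices
k6 = {v: {u for u in range(6) if u != v} for v in range(6)}
print(is_expander(k6, n=2, d=2), is_expander(k6, n=2, d=3))
```

For the complete graph on 6 vertices with n = 2, subsets of size up to 2 are checked: a pair of vertices has all 6 vertices as neighbours, which meets the bound 3 * 2 = 6 for d = 2 but not 4 * 2 = 8 for d = 3.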
Abstract:
Context tree models were introduced by Rissanen [25] as a parsimonious generalization of Markov models. Since then, they have been widely used in applied probability and statistics. The present paper investigates non-asymptotic properties of two popular procedures of context tree estimation: Rissanen's algorithm Context and penalized maximum likelihood. First showing how they are related, we prove finite horizon bounds for the probability of over- and under-estimation. Concerning overestimation, no boundedness or loss-of-memory conditions are required: the proof relies on new deviation inequalities for empirical probabilities, which are of independent interest. The under-estimation properties rely on classical hypotheses for processes of infinite memory. These results improve on and generalize the bounds obtained in Duarte et al. (2006) [12], Galves et al. (2008) [18], Galves and Leonardi (2008) [17], Leonardi (2010) [22], refining asymptotic results of Bühlmann and Wyner (1999) [4] and Csiszár and Talata (2006) [9]. (C) 2011 Elsevier B.V. All rights reserved.