64 results for Land cover classification
Abstract:
Saving our science from ourselves: the plight of biological classification. Biological classification (nomenclature, taxonomy, and systematics) is being sold short. The desire for new technologies and for faster and cheaper taxonomic descriptions, identifications, and revisions is symptomatic of a lack of appreciation and understanding of classification. The problems of gadget-driven science, a lack of best practice, and the inability to accept classification as a descriptive and empirical science are discussed. The worst-case scenario is a future in which classifications are purely artificial and uninformative.
Abstract:
Due to the imprecise nature of biological experiments, biological data are often characterized by the presence of redundant and noisy data. This may be due to errors that occurred during data collection, such as contamination of laboratory samples. This is the case for gene expression data, where the equipment and tools currently used frequently produce noisy biological data. Machine Learning algorithms have been successfully used in gene expression data analysis. Although many Machine Learning algorithms can deal with noise, detecting and removing noisy instances from the training data set can help the induction of the target hypothesis. This paper evaluates the use of distance-based pre-processing techniques for noise detection in gene expression data classification problems. The evaluation analyzes the effectiveness of the investigated techniques in removing noisy data, measured by the accuracy obtained by different Machine Learning classifiers over the pre-processed data.
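The abstract does not name the distance-based techniques it evaluates. As an illustration only, the sketch below assumes a filter in the spirit of edited nearest neighbours: an instance is flagged as noisy when its label disagrees with the majority label of its k nearest neighbours. The function names, the Euclidean distance, and the toy data are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def flag_noisy(samples, labels, k=3):
    """Return indices whose label disagrees with the majority label
    of their k nearest neighbours (an assumed, ENN-style criterion)."""
    noisy = []
    for i, (x, y) in enumerate(zip(samples, labels)):
        # Distances from sample i to every other sample, nearest first.
        neighbours = sorted(
            (euclidean(x, samples[j]), j)
            for j in range(len(samples))
            if j != i
        )[:k]
        votes = Counter(labels[j] for _, j in neighbours)
        majority_label, _ = votes.most_common(1)[0]
        if majority_label != y:
            noisy.append(i)
    return noisy


# Hypothetical toy data: two well-separated clusters, with one instance
# (index 3) carrying the label of the other cluster.
samples = [
    (0.0, 0.1), (0.1, 0.0), (0.2, 0.1), (0.1, 0.2),
    (5.0, 5.1), (5.1, 5.0), (5.2, 5.1), (5.0, 4.9),
]
labels = ["A", "A", "A", "B", "B", "B", "B", "B"]

noisy = flag_noisy(samples, labels, k=3)
clean = [(s, l) for i, (s, l) in enumerate(zip(samples, labels)) if i not in noisy]
```

A classifier would then be trained on `clean` rather than on the full set, which is the kind of comparison the abstract describes.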
Abstract:
PURPOSE: The main goal of this study was to develop and compare two different techniques for classifying specific types of corneal shapes when Zernike coefficients are used as inputs. A feed-forward artificial Neural Network (NN) and discriminant analysis (DA) were used. METHODS: The inputs for both the NN and DA were the first 15 standard Zernike coefficients for 80 previously classified corneal elevation data files from an Eyesys System 2000 Videokeratograph (VK), installed at the Departamento de Oftalmologia of the Escola Paulista de Medicina, São Paulo. The NN had 5 output neurons, which were associated with 5 typical corneal shapes: keratoconus, with-the-rule astigmatism, against-the-rule astigmatism, "regular" or "normal" shape, and post-PRK. RESULTS: The NN and DA responses were statistically analyzed in terms of precision ([true positive+true negative]/total number of cases). Mean overall results for all cases for the NN and DA techniques were, respectively, 94% and 84.8%. CONCLUSION: Although we used a relatively small database, the results obtained in the present study indicate that Zernike polynomials as descriptors of corneal shape may be a reliable input for diagnostic automation of VK maps, using either NN or DA.
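The measure quoted in the abstract, (true positive + true negative) / total number of cases, can be computed per class from a confusion matrix by treating each class one-vs-rest. This ratio is more commonly called accuracy. The sketch below is illustrative only: the 3-class matrix is hypothetical (the study used 5 classes and its counts are not given in the abstract).

```python
def one_vs_rest_ratio(confusion, cls):
    """(TP + TN) / total for class `cls`, where confusion[i][j] counts
    cases of true class i predicted as class j."""
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix is empty")
    tp = confusion[cls][cls]
    row_sum = sum(confusion[cls])                 # all cases truly in cls
    col_sum = sum(row[cls] for row in confusion)  # all cases predicted as cls
    tn = total - row_sum - col_sum + tp           # neither truly nor predicted cls
    return (tp + tn) / total


# Hypothetical confusion matrix for three classes (not data from the study).
confusion = [
    [8, 1, 1],
    [2, 7, 1],
    [0, 1, 9],
]
ratio_class0 = one_vs_rest_ratio(confusion, 0)
```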
Abstract:
We present a molecular phylogenetic analysis of caenophidian (advanced) snakes using sequences from two mitochondrial genes (12S and 16S rRNA) and one nuclear (c-mos) gene (1681 total base pairs), and with 131 terminal taxa sampled from throughout all major caenophidian lineages but focussing on Neotropical xenodontines. Direct optimization parsimony analysis resulted in a well-resolved phylogenetic tree, which corroborates some clades identified in previous analyses and suggests new hypotheses for the composition and relationships of others. The major salient points of our analysis are: (1) placement of Acrochordus, Xenodermatids, and Pareatids as successive outgroups to all remaining caenophidians (including viperids, elapids, atractaspidids, and all other "colubrid" groups); (2) within the latter group, viperids and homalopsids are successive sister clades to all remaining snakes; (3) the following monophyletic clades within crown group caenophidians: Afro-Asian psammophiids (including Mimophis from Madagascar), Elapidae (including hydrophiines but excluding Homoroselaps), Pseudoxyrhophiinae, Colubrinae, Natricinae, Dipsadinae, and Xenodontinae. Homoroselaps is associated with atractaspidids. Our analysis suggests some taxonomic changes within xenodontines, including new taxonomy for Alsophis elegans, Liophis amarali, and further taxonomic changes within Xenodontini and the West Indian radiation of xenodontines. Based on our molecular analysis, we present a revised classification for caenophidians and provide morphological diagnoses for many of the included clades; we also highlight groups where much more work is needed. We name as new two higher taxonomic clades within Caenophidia, one new subfamily within Dipsadidae, and, within Xenodontinae, five new tribes, six new genera, and two resurrected genera. We synonymize Xenoxybelis and Pseudablabes with Philodryas; Erythrolamprus with Liophis; and Lystrophis and Waglerophis with Xenodon.
Abstract:
OBJECTIVE: To assess the quality of data on hospital admissions due to external causes in São José dos Campos, São Paulo. METHOD: Admissions through the Unified Health System (Sistema Único de Saúde) for injuries resulting from external causes in the first half of 2003 at the Municipal Hospital, the referral center for trauma care in the municipality, were studied by comparing the data recorded in the Hospital Information System (Sistema de Informações Hospitalares) with the medical records of 990 admissions. Agreement for the variables relating to the victim, the admission, and the injury was assessed using the crude agreement rate and the Kappa coefficient. Injuries and external causes were coded according to the 10th revision of the International Classification of Diseases, chapters XIX and XX, respectively. RESULTS: The crude agreement rate was good for the variables relating to the victim and the admission, ranging from 89.0% to 99.2%. Agreement for injuries was excellent, except for injuries to the neck (k=0.73), multiple injuries (k=0.67), and fractures of the thorax (k=0.49). Agreement for external causes was excellent for transport accidents (k=0.90) and falls (k=0.83). Reliability was lower for assaults (k=0.50), undetermined causes (k=0.37), and complications of medical care (k=0.03). Agreement was excellent for transport accidents involving pedestrians, cyclists, and motorcyclists. CONCLUSION: Most of the study variables were of good quality at the level of aggregation analyzed. Some variables relating to the victim and some types of external causes need improvement in data quality. The hospital morbidity profile found confirmed transport accidents as an important external cause of hospital admission in the municipality.
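The two agreement measures named in the abstract, the crude agreement rate and the Kappa coefficient, can be sketched as follows. This is a minimal illustration of Cohen's kappa for two codings of the same cases; the variable names and the four example records are hypothetical and do not come from the study.

```python
from collections import Counter


def crude_agreement(codes_a, codes_b):
    """Fraction of cases on which the two sources assign the same code."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("need two non-empty codings of equal length")
    return sum(a == b for a, b in zip(codes_a, codes_b)) / len(codes_a)


def cohen_kappa(codes_a, codes_b):
    """Cohen's kappa: agreement corrected for agreement expected by chance."""
    n = len(codes_a)
    observed = crude_agreement(codes_a, codes_b)
    count_a = Counter(codes_a)
    count_b = Counter(codes_b)
    # Chance agreement from the marginal frequencies of each source.
    expected = sum(
        count_a[c] * count_b[c] for c in set(count_a) | set(count_b)
    ) / (n * n)
    if expected == 1:
        return 1.0  # both sources use a single identical code throughout
    return (observed - expected) / (1 - expected)


# Hypothetical example: cause coded in the information system vs. the medical record.
system_codes = ["fall", "fall", "transport", "transport"]
record_codes = ["fall", "fall", "transport", "fall"]

agreement = crude_agreement(system_codes, record_codes)
kappa = cohen_kappa(system_codes, record_codes)
```

In this toy case the crude agreement is higher than kappa, because part of the observed agreement would be expected by chance alone.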
Abstract:
This paper describes a new food classification which assigns foodstuffs according to the extent and purpose of the industrial processing applied to them. Three main groups are defined: unprocessed or minimally processed foods (group 1), processed culinary and food industry ingredients (group 2), and ultra-processed food products (group 3). The use of this classification is illustrated by applying it to data collected in the Brazilian Household Budget Survey, which was conducted in 2002/2003 through a probabilistic sample of 48,470 Brazilian households. The average daily food availability was 1,792 kcal/person, of which 42.5% came from group 1 (mostly rice and beans and meat and milk), 37.5% from group 2 (mostly vegetable oils, sugar, and flours), and 20% from group 3 (mostly breads, biscuits, sweets, soft drinks, and sausages). The share of group 3 foods increased with income, and represented almost one third of all calories in higher income households. The impact of the replacement of group 1 foods and group 2 ingredients by group 3 products on the overall quality of the diet, eating patterns and health is discussed.
Abstract:
Traffic accidents remain an important public health problem in Brazil. Objective: To analyze the characteristics of land transport accidents and their victims in the municipality of Cuiabá. Method: For the mortality study, data were obtained from the Mortality Information System (Sistema de Informações sobre Mortalidade) of the Ministry of Health, available on CD-ROM for the years 1980-2005; for the hospital morbidity study, data from the Hospital Information System (Sistemas de Informações Hospitalares) for the period 1998-2006 were used; and for the study of demand at urgent and emergency care units, a database built specifically for this purpose was used, covering the months of May to June 2005. The concepts and definitions established in the International Classification of Diseases, 10th revision, were adopted: transport accidents (categories V01-V99) and land transport accidents (V01-V89). Results: In all analyses, mortality and hospital morbidity rates were higher than the Brazilian average. Although mortality, hospital morbidity, and morbidity in the demand at urgent and emergency care units showed distinct features, young males stand out as the main victims; victims classified as "occupants" predominate in fatal accidents and "motorcyclists" in non-fatal ones. Conclusion: This study shows that Cuiabá is an area where land transport accidents should be treated as a priority because of their magnitude, in both mortality and morbidity, and it provides input for addressing them.
Abstract:
This work proposes a new approach using a committee machine of artificial neural networks to classify masses found in mammograms as benign or malignant. Three shape factors, three edge-sharpness measures, and 14 texture measures are used for the classification of 20 regions of interest (ROIs) related to malignant tumors and 37 ROIs related to benign masses. A group of multilayer perceptrons (MLPs) is employed as a committee machine of neural network classifiers. The classification results are reached by combining the responses of the individual classifiers. Experiments involving changes in the learning algorithm of the committee machine are conducted. The classification accuracy is evaluated using the area A(z) under the receiver operating characteristics (ROC) curve. The A(z) result for the committee machine is compared with the A(z) results obtained using MLPs and single-layer perceptrons (SLPs), as well as a linear discriminant analysis (LDA) classifier. Tests are carried out using the Student's t-distribution. The committee machine classifier outperforms the MLP, SLP, and LDA classifiers in the following cases: with the shape measure of spiculation index, the A(z) values of the four methods are, in order, 0.93, 0.84, 0.75, and 0.76; and with the edge-sharpness measure of acutance, the values are 0.79, 0.70, 0.69, and 0.74. Although the features with which improvement is obtained with the committee machines are not the same as those that provided the maximal value of A(z) (A(z) = 0.99 with some shape features, with or without the committee machine), they correspond to features that are not critically dependent on the accuracy of the boundaries of the masses, which is an important result. (c) 2008 SPIE and IS&T.
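The abstract says the committee combines the responses of its member classifiers and is scored by the area under the ROC curve, but it does not give the combination rule. The sketch below assumes simple averaging of member outputs and uses the empirical (pairwise) estimate of the area, which is not necessarily the estimator the authors used for A(z). All names and the example scores are hypothetical.

```python
def committee_score(member_outputs):
    """Combine member responses for one ROI by averaging (assumed rule)."""
    return sum(member_outputs) / len(member_outputs)


def roc_auc(scores, labels):
    """Empirical area under the ROC curve: the fraction of
    (malignant, benign) pairs ranked correctly, counting ties as half."""
    positives = [s for s, l in zip(scores, labels) if l == 1]
    negatives = [s for s, l in zip(scores, labels) if l == 0]
    if not positives or not negatives:
        raise ValueError("need at least one case of each class")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Hypothetical outputs of three member networks for four ROIs
# (1 = malignant, 0 = benign); not data from the study.
member_outputs_per_roi = [
    [0.9, 0.8, 0.7],
    [0.4, 0.6, 0.5],
    [0.3, 0.2, 0.4],
    [0.6, 0.7, 0.5],
]
labels = [1, 1, 0, 0]

scores = [committee_score(outputs) for outputs in member_outputs_per_roi]
auc = roc_auc(scores, labels)
```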
Abstract:
Background: In Brazil, 99% of malaria cases are concentrated in the Amazon, and malaria's spatial distribution is commonly associated with socio-environmental conditions on a fine landscape scale. In this study, the spatial patterns of malaria and its determinants were analysed in "Vale do Amanhecer", a rural settlement of the Brazilian agricultural reform programme in northern Mato Grosso state. Methods: In a fine-scaled, exploratory ecological study, geocoded notification forms corresponding to malaria cases from 2005 were compared with spectral indices, such as the Normalized Difference Vegetation Index (NDVI) and the third component of the Tasseled Cap Transformation (TC_3), and with thematic layers derived from the visual interpretation of multispectral TM-Landsat 5 imagery and the application of GIS distance operators. Results: Of a total of 336 malaria cases, 102 (30.36%) were caused by Plasmodium falciparum and 174 (51.79%) by Plasmodium vivax. Of all the cases, 37.6% (133 cases) were from residents of a single road. In total, 276 cases were reported for the southern part of the settlement, where the population density is higher, with notification rates higher than 10 cases per household. The local landscape mostly consists of open areas (38.79 km²). Training forest occupied 27.34 km² and midsize vegetation 7.01 km². Most domiciles with more than five notified malaria cases were located near areas with high NDVI values. Most domiciles (41.78%) and malaria cases (44.94%) were concentrated in areas with intermediate values of TC_3, a spectral index representing surface and vegetation humidity. Conclusions: Environmental factors and their alteration are associated with the occurrence and spatial distribution of malaria cases in rural settlements.
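For reference, NDVI is defined as (NIR - Red) / (NIR + Red); on Landsat 5 TM imagery the near-infrared and red bands are bands 4 and 3, respectively. The sketch below applies the formula to a small grid of reflectance values. The grids are hypothetical, and returning `None` for pixels where both bands are zero is an assumed convention for missing data, not something described in the abstract.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index for one pixel.
    Returns None when both bands are zero (treated here as no data)."""
    denominator = nir + red
    if denominator == 0:
        return None
    return (nir - red) / denominator


def ndvi_grid(nir_band, red_band):
    """Apply ndvi() pixel by pixel to two equally shaped 2-D grids."""
    return [
        [ndvi(n, r) for n, r in zip(nir_row, red_row)]
        for nir_row, red_row in zip(nir_band, red_band)
    ]


# Hypothetical reflectance values (TM band 4 = NIR, band 3 = red).
nir_band = [[0.5, 0.4], [0.3, 0.0]]
red_band = [[0.1, 0.2], [0.3, 0.0]]

result = ndvi_grid(nir_band, red_band)
```

Dense green vegetation reflects strongly in the near infrared and absorbs red light, so it yields values closer to 1, while bare or open areas sit nearer 0.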
Abstract:
Aims. In this work, we describe the pipeline for the fast supervised classification of light curves observed by the CoRoT exoplanet CCDs. We present the classification results obtained for the first four measured fields, which represent one year of in-orbit operation. Methods. The basis of the adopted supervised classification methodology has been described in detail in a previous paper, as has its application to the OGLE database. Here, we present the modifications of the algorithms and of the training set to optimize the performance when applied to the CoRoT data. Results. Classification results are presented for the observed fields IRa01, SRc01, LRc01, and LRa01 of the CoRoT mission. Statistics on the number of variables and the number of objects per class are given and typical light curves of high-probability candidates are shown. We also report on new stellar variability types discovered in the CoRoT data. The full classification results are publicly available.
Abstract:
We investigate a conjecture on the cover times of planar graphs by means of large Monte Carlo simulations. The conjecture states that the cover time $\tau(G_N)$ of a planar graph $G_N$ of $N$ vertices and maximal degree $d$ is lower bounded by $\tau(G_N) \geq C_d N (\ln N)^2$ with $C_d = (d/4\pi)\tan(\pi/d)$, with equality holding for some geometries. We tested this conjecture on the regular honeycomb ($d = 3$), regular square ($d = 4$), regular elongated triangular ($d = 5$), and regular triangular ($d = 6$) lattices, as well as on the nonregular Union Jack lattice ($d_{\min} = 4$, $d_{\max} = 8$). Indeed, the Monte Carlo data suggest that the rigorous lower bound may hold as an equality for most of these lattices, with an interesting issue in the case of the Union Jack lattice. The data for the honeycomb lattice, however, violate the bound with the conjectured constant. The empirical probability distribution function of the cover time for the square lattice is also briefly presented, since very little is known about cover time probability distribution functions in general.
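A small Monte Carlo experiment of the kind described can be sketched as follows. It measures how many steps a simple random walk needs to visit every site of a square lattice and evaluates the conjectured expression with the constant (d/4π) tan(π/d) for d = 4. Periodic boundary conditions, the lattice size, the number of runs, and all function names are assumptions made for illustration; the abstract does not specify them.

```python
import math
import random


def bound_constant(d):
    """The conjectured constant (d / 4*pi) * tan(pi / d) for maximal degree d."""
    return d / (4 * math.pi) * math.tan(math.pi / d)


def cover_time_square_lattice(side, rng):
    """Steps taken by a simple random walk to visit all side*side sites of a
    square lattice with periodic boundaries (assumed; side should be >= 3)."""
    n_sites = side * side
    x = y = 0
    visited = {(0, 0)}  # the starting site counts as visited
    steps = 0
    while len(visited) < n_sites:
        dx, dy = rng.choice(((1, 0), (-1, 0), (0, 1), (0, -1)))
        x = (x + dx) % side
        y = (y + dy) % side
        visited.add((x, y))
        steps += 1
    return steps


rng = random.Random(0)   # fixed seed so the run is repeatable
side = 16                # hypothetical, deliberately small lattice
n_sites = side * side
samples = [cover_time_square_lattice(side, rng) for _ in range(20)]
mean_cover_time = sum(samples) / len(samples)
conjectured_value = bound_constant(4) * n_sites * math.log(n_sites) ** 2
```

Comparing `mean_cover_time` with `conjectured_value` at such a small size is only indicative, since the conjecture concerns the large-N behaviour that the paper probes with much larger simulations.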
Abstract:
Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain sequences coming from two families of the Pfam database are significantly different. We model protein sequences as realizations of Variable Length Markov Chains (VLMC) and we use the context trees as a signature of each protein family. Our approach is based on a Kolmogorov-Smirnov-type goodness-of-fit test proposed by Balding et al. [Limit theorems for sequences of random trees (2008), DOI: 10.1007/s11749-008-0092-z]. The test statistic is a supremum over the space of trees of a function of the two samples; its computation grows, in principle, exponentially fast with the maximal number of nodes of the potential trees. We show how to transform this problem into a max-flow problem over a related graph, which can be solved using a Ford-Fulkerson algorithm in time polynomial in that number. We apply the test to 10 randomly chosen protein domain families from the seed of the Pfam-A database (high quality, manually curated families). The test shows that the distributions of context trees coming from different families are significantly different. We emphasize that this is a novel mathematical approach to validate the automatic clustering of sequences in any context. We also study the performance of the test via simulations on Galton-Watson related processes.
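The abstract reduces the test statistic to a max-flow problem solved with a Ford-Fulkerson algorithm, without describing the graph. The sketch below shows only the generic max-flow step, using the Edmonds-Karp variant (Ford-Fulkerson with breadth-first search for augmenting paths). The example network is hypothetical and is not the graph built from context trees in the paper.

```python
from collections import deque


def max_flow(capacity, source, sink):
    """Maximum flow from source to sink.
    `capacity` maps each node to a dict {neighbour: edge capacity}."""
    if source == sink:
        raise ValueError("source and sink must differ")
    # Residual graph: forward capacities plus zero-capacity reverse edges.
    residual = {source: {}, sink: {}}
    for u, neighbours in capacity.items():
        for v, cap in neighbours.items():
            residual.setdefault(u, {})
            residual.setdefault(v, {})
            residual[u][v] = residual[u].get(v, 0) + cap
            residual[v].setdefault(u, 0)

    total = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, cap in residual[u].items():
                if cap > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return total  # no augmenting path left

        # Find the bottleneck capacity along the path.
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]

        # Push the bottleneck flow along the path.
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        total += bottleneck


# Hypothetical network with integer capacities.
network = {
    "s": {"a": 3, "b": 2},
    "a": {"b": 1, "t": 2},
    "b": {"t": 3},
}
flow = max_flow(network, "s", "t")
```

Breadth-first search bounds the number of augmentations polynomially in the graph size, which matches the polynomial-time claim in the abstract in spirit, although the authors' exact variant is not stated.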
Abstract:
The problem of semialgebraic Lipschitz classification of quasihomogeneous polynomials on a Hölder triangle is studied. For this problem, the "moduli" are described completely in certain combinatorial terms.
Abstract:
No-tillage mulch-based (NTM) cropping systems have been widely adopted by farmers in the Brazilian savanna region (Cerrado biome). We hypothesized that this new type of management should have a profound impact on soil organic carbon (SOC) at regional scale and consequently on climate change mitigation. The objective of this study was thus to quantify the SOC storage potential of NTM in the oxisols of the Cerrado using a synchronic approach based on a chronosequence of fields with different numbers of years under NTM. The study consisted of three phases: (1) a farm/cropping system survey to identify the main types of NTM systems to be chosen for the chronosequence; (2) a field survey to identify a homogeneous set of situations for the chronosequence; and (3) the characterization of the chronosequence to assess the SOC storage potential. The main NTM system practiced by farmers is an annual succession of soybean (Glycine max) or maize (Zea mays) with another cereal crop. This cropping system covers 54% of the total cultivated area in the region. At the regional level, soil organic C concentrations from NTM fields were closely correlated with the clay + silt content of the soil (r² = 0.64). No significant correlation was observed (r² = 0.07), however, between these two variables when we only considered the fields with a clay + silt content in the 500-700 g kg⁻¹ range. The final chronosequence of NTM fields was therefore based on a subsample of eight fields within this textural range. The SOC stocks in the 0-30 cm topsoil layer of these selected fields varied between 4.2 and 6.7 kg C m⁻² and increased on average (r² = 0.97) by 0.19 kg C m⁻² year⁻¹. After 12 years of NTM management, SOC stocks were no longer significantly different from the stocks under natural Cerrado vegetation (p < 0.05), whereas a 23-year-old conventionally tilled and cropped field showed SOC stocks that were about 30% below this level. Confirming our hypotheses, this study clearly illustrated the high potential of NTM systems for increasing SOC storage under tropical conditions, and how a synchronic approach may be used to assess such modifications efficiently on farmers' fields, identifying and excluding undesirable sources of heterogeneity (management, soils and climate). (C) 2010 Elsevier B.V. All rights reserved.
Abstract:
Biogeochemistry is hosting this special thematic issue devoted to studies of land-water interactions, as part of the Large-scale Biosphere-Atmosphere Experiment in Amazônia (LBA). This compilation of papers covers a broad range of topics with a common theme of coupling land and water processes, across pristine and impacted systems. Findings highlighted that hydrologic flowpaths are clearly important across basin size and structure in determining how water and solutes reach streams. Land-use changes have pronounced impacts on flowpaths, and subsequently, on stream chemistry, from small streams to large rivers. Carbon is produced and transformed across a broad array of fluvial environments and wetlands. Surface waters are not only driven by, but provide feedback to, the atmosphere.