70 resultados para spatial clustering algorithms
Resumo:
Onion (Allium cepa) is one of the most cultivated and consumed vegetables in Brazil and its importance is due to the large laborforce involved. One of the main pests that affect this crop is the Onion Thrips (Thrips tabaci), but the spatial distribution of this insect, although important, has not been considered in crop management recommendations, experimental planning or sampling procedures. Our purpose here is to consider statistical tools to detect and model spatial patterns of the occurrence of the onion thrips. In order to characterize the spatial distribution pattern of the Onion Thrips a survey was carried out to record the number of insects in each development phase on onion plant leaves, on different dates and sample locations, in four rural properties with neighboring farms under different infestation levels and planting methods. The Mantel randomization test proved to be a useful tool to test for spatial correlation which, when detected, was described by a mixed spatial Poisson model with a geostatistical random component and parameters allowing for a characterization of the spatial pattern, as well as the production of prediction maps of susceptibility to levels of infestation throughout the area.
Resumo:
Geographic Data Warehouses (GDW) are one of the main technologies used in decision-making processes and spatial analysis, and the literature proposes several conceptual and logical data models for GDW. However, little effort has been focused on studying how spatial data redundancy affects SOLAP (Spatial On-Line Analytical Processing) query performance over GDW. In this paper, we investigate this issue. Firstly, we compare redundant and non-redundant GDW schemas and conclude that redundancy is related to high performance losses. We also analyze the issue of indexing, aiming at improving SOLAP query performance on a redundant GDW. Comparisons of the SB-index approach, the star-join aided by R-tree and the star-join aided by GiST indicate that the SB-index significantly improves the elapsed time in query processing from 25% up to 99% with regard to SOLAP queries defined over the spatial predicates of intersection, enclosure and containment and applied to roll-up and drill-down operations. We also investigate the impact of the increase in data volume on the performance. The increase did not impair the performance of the SB-index, which highly improved the elapsed time in query processing. Performance tests also show that the SB-index is far more compact than the star-join, requiring only a small fraction of at most 0.20% of the volume. Moreover, we propose a specific enhancement of the SB-index to deal with spatial data redundancy. This enhancement improved performance from 80 to 91% for redundant GDW schemas.
Resumo:
A definição das parcelas familiares em projetos de reforma agrária envolve questões técnicas e sociais. Essas questões estão associadas principalmente às diferentes aptidões agrícolas do solo nestes projetos. O objetivo deste trabalho foi apresentar método para realizar o processo de ordenamento territorial em assentamentos de reforma agrária empregando Algoritmo Genético (AG). O AG foi testado no Projeto de Assentamento Veredas, em Minas Gerais, e implementado com base no sistema de aptidão agrícola das terras.
Resumo:
OBJECTIVE: To estimate the spatial intensity of urban violence events using wavelet-based methods and emergency room data. METHODS: Information on victims attended at the emergency room of a public hospital in the city of São Paulo, Southeastern Brazil, from January 1, 2002 to January 11, 2003 were obtained from hospital records. The spatial distribution of 3,540 events was recorded and a uniform random procedure was used to allocate records with incomplete addresses. Point processes and wavelet analysis technique were used to estimate the spatial intensity, defined as the expected number of events by unit area. RESULTS: Of all georeferenced points, 59% were accidents and 40% were assaults. There is a non-homogeneous spatial distribution of the events with high concentration in two districts and three large avenues in the southern area of the city of São Paulo. CONCLUSIONS: Hospital records combined with methodological tools to estimate intensity of events are useful to study urban violence. The wavelet analysis is useful in the computation of the expected number of events and their respective confidence bands for any sub-region and, consequently, in the specification of risk estimates that could be used in decision-making processes for public policies.
Resumo:
In children with Duchenne muscular dystrophy, color vision losses have been related to dystrophin deletions downstream of exon 30, which affect a dystrophin isoform, Dp260, present in the retina. To further evaluate visual function in DMD children, we measured spatial, temporal, and chromatic red-green and blue-yellow contrast sensitivity in two groups of DMD children with gene deletion downstream and upstream of exon 30. Psychophysical spatial contrast sensitivity was measured for low, middle, and high spatial frequencies with achromatic gratings and for low and middle frequencies with red-green and blue-yellow chromatic gratings. Temporal contrast sensitivity was also measured with achromatic stimuli. A reduction in sensitivity at all spatial luminance contrasts was found for the DMD patients with deletion downstream of exon 30. Similar results were found for temporal luminance contrast sensitivity. Red-green chromatic contrast sensitivity was reduced in DMD children with deletion downstream of exon 30, whereas blue-yellow chromatic contrast sensitivity showed no significant differences. We conclude that visual function is impaired in DMD children. Furthermore, we report a genotype-phenotype relationship because the visual impairment occurred in children with deletion downstream but not upstream of exon 30, affecting the retinal isoform of dystrophin Dp260.
Resumo:
Structure of intertidal and subtidal benthic macrofauna in the northeastern region of Todos os Santos Bay (TSB), northeast Brazil, was investigated during a period of two years. Relationships with environmental parameters were studied through uni-and multivariate statistical analyses, and the main distributional patterns shown to be especially related to sediment type and content of organic fractions (Carbon, Nitrogen, Phosphorus), on both temporal and spatial scales. Polychaete annelids accounted for more than 70% of the total fauna and showed low densities, species richness and diversity, except for the area situated on the reef banks. These banks constitute a peculiar environment in relation to the rest of the region by having coarse sediments poor in organic matter and rich in biodetritic carbonates besides an abundant and diverse fauna. The intertidal region and the shallower area nearer to the oil refinery RLAM, with sediments composed mainly of fine sand, seem to constitute an unstable system with few highly dominant species, such as Armandia polyophthalma and Laeonereis acuta. In the other regions of TSB, where muddy bottoms predominated, densities and diversity were low, especially in the stations near the refinery. Here the lowest values of the biological indicators occurred together with the highest organic compound content. In addition, the nearest sites (stations 4 and 7) were sometimes azoic. The adjacent Caboto, considered as a control area at first, presented low density but intermediate values of species diversity, which indicates a less disturbed environment in relation to the pelitic infralittoral in front of the refinery. The results of the ordination analyses evidenced five homogeneous groups of stations (intertidal; reef banks; pelitic infralittoral; mixed sediments; Caboto) with different specific patterns, a fact which seems to be mainly related to granulometry and chemical sediment characteristics.
Resumo:
The aim of this study was to analyze the distribution and abundance of the fish fauna of Palmas bay on Anchieta Island in southeastern Brazil. Specimens were caught in the summer and winter of 1992, using an otter trawl at three locations in the bay. The specimens were caught in both the nighttime and daytime. Data on the water temperature and salinity were recorded for the characterization of the predominant water mass in the region, and sediment samples were taken for granulometric analysis. A total of 7 656 specimens (79 species), with a total weight of approximately 300 kg, were recorded. The most abundant species were Eucinostomus argenteus, Ctenosciaena gracilicirrhus, Haemulon steindachneri, Eucinostomus gula and Diapterus rhombeus, which together accounted for more than 73% of the sample. In general, the ecological indices showed no differences in the composition of species for the abiotic variables analyzed. The multivariate analysis showed that the variations in the distribution of the fish fauna were mainly associated with intra-annual differences in temperature and salinity, resulting from the presence of South Atlantic Central Water (SACW) in the area during the summer. The analysis also showed an association with the type of bottom and a lesser association with respect to the night/day periods.
Resumo:
Gene clustering is a useful exploratory technique to group together genes with similar expression levels under distinct cell cycle phases or distinct conditions. It helps the biologist to identify potentially meaningful relationships between genes. In this study, we propose a clustering method based on multivariate normal mixture models, where the number of clusters is predicted via sequential hypothesis tests: at each step, the method considers a mixture model of m components (m = 2 in the first step) and tests if in fact it should be m - 1. If the hypothesis is rejected, m is increased and a new test is carried out. The method continues (increasing m) until the hypothesis is accepted. The theoretical core of the method is the full Bayesian significance test, an intuitive Bayesian approach, which needs no model complexity penalization nor positive probabilities for sharp hypotheses. Numerical experiments were based on a cDNA microarray dataset consisting of expression levels of 205 genes belonging to four functional categories, for 10 distinct strains of Saccharomyces cerevisiae. To analyze the method's sensitivity to data dimension, we performed principal components analysis on the original dataset and predicted the number of classes using 2 to 10 principal components. Compared to Mclust (model-based clustering), our method shows more consistent results.
Resumo:
A susceptible-infective-recovered (SIR) epidemiological model based on probabilistic cellular automaton (PCA) is employed for simulating the temporal evolution of the registered cases of chickenpox in Arizona, USA, between 1994 and 2004. At each time step, every individual is in one of the states S, I, or R. The parameters of this model are the probabilities of each individual (each cell forming the PCA lattice ) passing from a state to another state. Here, the values of these probabilities are identified by using a genetic algorithm. If nonrealistic values are allowed to the parameters, the predictions present better agreement with the historical series than if they are forced to present realistic values. A discussion about how the size of the PCA lattice affects the quality of the model predictions is presented. Copyright (C) 2009 L. H. A. Monteiro et al.
Resumo:
Background: In Brazil, 99% of malaria cases are concentrated in the Amazon, and malaria's spatial distribution is commonly associated with socio-environmental conditions on a fine landscape scale. In this study, the spatial patterns of malaria and its determinants in a rural settlement of the Brazilian agricultural reform programme called ""Vale do Amanhecer"" in the northern Mato Grosso state were analysed. Methods: In a fine-scaled, exploratory ecological study, geocoded notification forms corresponding to malaria cases from 2005 were compared with spectral indices, such as the Normalized Difference Vegetation Index (NDVI) and the third component of the Tasseled Cap Transformation (TC_3) and thematic layers, derived from the visual interpretation of multispectral TM-Landsat 5 imagery and the application of GIS distance operators. Results: Of a total of 336 malaria cases, 102 (30.36%) were caused by Plasmodium falciparum and 174 (51.79%) by Plasmodium vivax. Of all the cases, 37.6% (133 cases) were from residents of a unique road. In total, 276 cases were reported for the southern part of the settlement, where the population density is higher, with notification rates higher than 10 cases per household. The local landscape mostly consists of open areas (38.79 km(2)). Training forest occupied 27.34 km(2) and midsize vegetation 7.01 km(2). Most domiciles with more than five notified malaria cases were located near areas with high NDVI values. Most domiciles (41.78%) and malaria cases (44.94%) were concentrated in areas with intermediate values of the TC_3, a spectral index representing surface and vegetation humidity. Conclusions: Environmental factors and their alteration are associated with the occurrence and spatial distribution of malaria cases in rural settlements.
Resumo:
We present K-band spectra of the near infrared counterparts to IRS 2E and IRS 2W which is associated with the ultracompact H II region W51d, both of them embedded sources in the Galactic compact H II region W51 IRS 2. The high spatial resolution observations were obtained with the laser guide star facility and Near-infrared Integral Field Spectrograph (NIFS) mounted at the Gemini-North observatory. The spectrum of the ionizing source of W51d shows the photospheric features N III ( 21155 angstrom) in emission and He II ( 21897 angstrom) in absorption which lead us to classify it as a young O3 type star. We detected CO overtone in emission at 23000 angstrom in the spectrum of IRS 2E, suggesting that it is a massive young object still surrounded by an accretion disk, probably transitioning from the hot core phase to an ultracompact H II region.
Resumo:
Context. We study galaxy evolution and spatial patterns in the surroundings of a sample of 2dF groups. Aims. Our aim is to find evidence of galaxy evolution and clustering out to 10 times the virial radius of the groups and so redefine their properties according to the spatial patterns in the fields and relate them to galaxy evolution. Methods. Group members and interlopers were redefined after the identification of gaps in the redshift distribution. We then used exploratory spatial statistics based on the the second moment of the Ripley function to probe the anisotropy in the galaxy distribution around the groups. Results. We found an important anticorrelation between anisotropy around groups and the fraction of early-type galaxies in these fields. Our results illustrate how the dynamical state of galaxy groups can be ascertained by the systematic study of their neighborhoods. This is an important achievement, since the correct estimate of the extent to which galaxies are affected by the group environment and follow large-scale filamentary structure is relevant to understanding the process of galaxy clustering and evolution in the Universe.
Resumo:
The VISTA near infrared survey of the Magellanic System (VMC) will provide deep YJK(s) photometry reaching stars in the oldest turn-off point throughout the Magellanic Clouds (MCs). As part of the preparation for the survey, we aim to access the accuracy in the star formation history (SFH) that can be expected from VMC data, in particular for the Large Magellanic Cloud (LMC). To this aim, we first simulate VMC images containing not only the LMC stellar populations but also the foreground Milky Way (MW) stars and background galaxies. The simulations cover the whole range of density of LMC field stars. We then perform aperture photometry over these simulated images, access the expected levels of photometric errors and incompleteness, and apply the classical technique of SFH-recovery based on the reconstruction of colour-magnitude diagrams (CMD) via the minimisation of a chi-squared-like statistics. We verify that the foreground MW stars are accurately recovered by the minimisation algorithms, whereas the background galaxies can be largely eliminated from the CMD analysis due to their particular colours and morphologies. We then evaluate the expected errors in the recovered star formation rate as a function of stellar age, SFR(t), starting from models with a known age-metallicity relation (AMR). It turns out that, for a given sky area, the random errors for ages older than similar to 0.4 Gyr seem to be independent of the crowding. This can be explained by a counterbalancing effect between the loss of stars from a decrease in the completeness and the gain of stars from an increase in the stellar density. For a spatial resolution of similar to 0.1 deg(2), the random errors in SFR(t) will be below 20% for this wide range of ages. On the other hand, due to the lower stellar statistics for stars younger than similar to 0.4 Gyr, the outer LMC regions will require larger areas to achieve the same level of accuracy in the SFR( t). If we consider the AMR as unknown, the SFH-recovery algorithm is able to accurately recover the input AMR, at the price of an increase of random errors in the SFR(t) by a factor of about 2.5. Experiments of SFH-recovery performed for varying distance modulus and reddening indicate that these parameters can be determined with (relative) accuracies of Delta(m-M)(0) similar to 0.02 mag and Delta E(B-V) similar to 0.01 mag, for each individual field over the LMC. The propagation of these errors in the SFR(t) implies systematic errors below 30%. This level of accuracy in the SFR(t) can reveal significant imprints in the dynamical evolution of this unique and nearby stellar system, as well as possible signatures of the past interaction between the MCs and the MW.
Resumo:
Genetic models of sex and caste determination in eusocial stingless bees suggest specific patterns of male, worker and gyne cell distribution in the brood comb. Conflict between queen and laying workers over male parentage and center-periphery gradients of conditions, such as food and temperature, could also contribute to non-random spatial configuration. We converted the positions of the hexagonal cells in a brood comb to Cartesian coordinates, labeled by sex or caste of the individuals inside. To detect and locate clustered patterns, the mapped brood combs were evaluated by indexes of dispersion (MMC, mean distance of cells of a given category from their centroid) and eccentricity (DMB, distance between this centroid and the overall brood comb centroid) that we developed. After randomizing the labels and recalculating the indexes, we calculated probabilities that the original values had been generated by chance. We created sets of binary brood combs in which males were aggregated, regularly or randomly distributed among females. These stylized maps were used to describe the power of MMC and DMB, and they were applied to evaluate the male distribution in the sampled Nannotrigona testaceicornis brood combs. MMC was very sensitive to slight deviations from a perfectly rounded clump; DMB detected any asymmetry in the location of these compact to fuzzy clusters. Six of the 82 brood combs of N. testaceicornis that we analyzed had more than nine males, distributed according to variations in spatial patterns, as indicated by the two indexes.
Resumo:
We propose and analyze two different Bayesian online algorithms for learning in discrete Hidden Markov Models and compare their performance with the already known Baldi-Chauvin Algorithm. Using the Kullback-Leibler divergence as a measure of generalization we draw learning curves in simplified situations for these algorithms and compare their performances.