913 resultados para Nearest Neighbor
Resumo:
The supervised pattern recognition methods K-Nearest Neighbors (KNN), stepwise discriminant analysis (SDA), and soft independent modelling of class analogy (SIMCA) were employed in this work with the aim to investigate the relationship between the molecular structure of 27 cannabinoid compounds and their analgesic activity. Previous analyses using two unsupervised pattern recognition methods (PCA-principal component analysis and HCA-hierarchical cluster analysis) were performed and five descriptors were selected as the most relevants for the analgesic activity of the compounds studied: R (3) (charge density on substituent at position C(3)), Q (1) (charge on atom C(1)), A (surface area), log P (logarithm of the partition coefficient) and MR (molecular refractivity). The supervised pattern recognition methods (SDA, KNN, and SIMCA) were employed in order to construct a reliable model that can be able to predict the analgesic activity of new cannabinoid compounds and to validate our previous study. The results obtained using the SDA, KNN, and SIMCA methods agree perfectly with our previous model. Comparing the SDA, KNN, and SIMCA results with the PCA and HCA ones we could notice that all multivariate statistical methods classified the cannabinoid compounds studied in three groups exactly in the same way: active, moderately active, and inactive.
Resumo:
Data mining is the process to identify valid, implicit, previously unknown, potentially useful and understandable information from large databases. It is an important step in the process of knowledge discovery in databases, (Olaru & Wehenkel, 1999). In a data mining process, input data can be structured, seme-structured, or unstructured. Data can be in text, categorical or numerical values. One of the important characteristics of data mining is its ability to deal data with large volume, distributed, time variant, noisy, and high dimensionality. A large number of data mining algorithms have been developed for different applications. For example, association rules mining can be useful for market basket problems, clustering algorithms can be used to discover trends in unsupervised learning problems, classification algorithms can be applied in decision-making problems, and sequential and time series mining algorithms can be used in predicting events, fault detection, and other supervised learning problems (Vapnik, 1999). Classification is among the most important tasks in the data mining, particularly for data mining applications into engineering fields. Together with regression, classification is mainly for predictive modelling. So far, there have been a number of classification algorithms in practice. According to (Sebastiani, 2002), the main classification algorithms can be categorized as: decision tree and rule based approach such as C4.5 (Quinlan, 1996); probability methods such as Bayesian classifier (Lewis, 1998); on-line methods such as Winnow (Littlestone, 1988) and CVFDT (Hulten 2001), neural networks methods (Rumelhart, Hinton & Wiliams, 1986); example-based methods such as k-nearest neighbors (Duda & Hart, 1973), and SVM (Cortes & Vapnik, 1995). Other important techniques for classification tasks include Associative Classification (Liu et al, 1998) and Ensemble Classification (Tumer, 1996).
Resumo:
The deep-sea pearleye, Scopelarchus michaelsarsi (Scopelarchidae) is a mesopelagic teleost with asymmetric or tubular eyes. The main retina subtends a large dorsal binocular field, while the accessory retina subtends a restricted monocular field of lateral visual space. Ocular specializations to increase the lateral visual field include an oblique pupil and a corneal lens pad. A detailed morphological and topographic study of the photoreceptors and retinal ganglion cells reveals seven specializations: a centronasal region of the main retina with ungrouped rod-like photoreceptors overlying a retinal tapetum; a region of high ganglion cell density (area centralis of 56.1x10(3) cells per mm(2)) in the centrolateral region of the main retina; a centrotemporal region of the main retina with grouped rod-like photoreceptors; a region (area giganto cellularis) of large (32.2+/-5.6 mu m(2)), alpha-like ganglion cells arranged in a regular array (nearest neighbour distance 53.5+/-9.3 mu m with a conformity ratio of 5.8) in the temporal main retina; an accessory retina with grouped rod-like photoreceptors; a nasotemporal band of a mixture of rod-and cone-like photoreceptors restricted to the ventral accessory retina; and a retinal diverticulum comprised of a ventral region of differentiated accessory retina located medial to the optic nerve head. Retrograde labelling from the optic nerve with DiI shows that approximately 14% of the cells in the ganglion cell layer of the main retina are displaced amacrine cells at 1.5 mm eccentricity. Cryosectioning of the tubular eye confirms Matthiessen's ratio (2.59), and calculations of the spatial resolving power suggests that the function of the area centralis (7.4 cycles per degree/8.1 minutes of are) and the cohort of temporal alpha-like ganglion cells (0.85 cycles per degree/70.6 minutes of are) in the main retina may be different. Low summation ratios in these various retinal zones suggests that each zone may mediate distinct visual tasks in a certain region of the visual field by optimizing sensitivity and/or resolving power.
Resumo:
Plants require roots to supply water, nutrients and oxygen for growth. The spatial distribution of roots in relation to the macropore structure of the soil in which they are growing influences how effective they are at accessing these resources. A method for quantifying root-macropore associations from horizontal soil sections is illustrated using two black vertisols from the Darling Downs, Queensland, Australia. Two-dimensional digital images were obtained of the macropore structure and root distribution for an area 55 x 55 mm at a resolution of 64 mu m. The spatial distribution of roots was quantified over a range of distances using the K-function. In all specimens, roots were shown to be clustered at short distances (1-10 mm) becoming more random at longer distances. Root location in relation to macropores was estimated using the function describing the distance of each root to the nearest macropore. From this function, a summary variable, termed the macropore sheath, was defined. The macropore sheath is the distance from macropores within which 80% of roots are located. Measured root locations were compared to random simulations of root distribution to establish if there was a preferential association between roots and macropores. More roots were found in and around macropores than expected at random.
Resumo:
A new species of the genus Gluconacetobacter, for which the name Gluconacetobacter sacchari sp. nov. is proposed, was isolated from the leaf sheath of sugar cane and from the pink sugar-cane mealy bug, Saccharicoccus sacchari, found on sugar cane growing in Queensland and northern New South Wales, Australia, The nearest phylogenetic relatives in the alpha-subclass of the Proteobacteria are Gluconacetobacter liquefaciens and Gluconacetobacter diazotrophicus, which have 98.8-99.3% and 97.9-98.5% 16S rDNA sequence similarity, respectively, to members of Gluconacetobacter sacchari. On the basis of the phylogenetic positioning of the strains, DNA reassociation studies, phenotypic tests and the presence of the Q10 ubiquinone, this new species was assigned to the genus Gluconacetobacter. No single phenotypic characteristic is unique to the species, but the species can be differentiated phenotypically from closely related members of the acetic acid bacteria by growth in the presence of 0.01% malachite green, growth on 30% glucose, an inability to fix nitrogen and an inability to grow with the L-amino acids asparagine, glycine, glutamine, threonine and tryptophan when D-mannitol was supplied as the sole carbon and energy source. The type strain of this species is strain SRI 1794(T) (= DSM 12717(T)).
Resumo:
Objective: To demonstrate the potential of GIS (geographic information system) technology and ARIA (Accessibility/Remoteness Index for Australia) as tools for medical workforce and health service planning in Australia. Design: ARIA is an index of remoteness derived by measuring road distance between populated localities and service centres. A continuous variable of remoteness from 0 to 12 is generated for any location in Australia. We created a GIS, with data on location of general practitioner services in non-metropolitan South Australia derived from the database of HUMPS (Rural Undergraduate Medical Placement System), and estimated, for the 1170 populated localities in South Australia, the accessibility/inaccessibility of the 109 identified GP services. Main outcome measures: Distance from populated locality to GP services. Results: Distance from populated locality to GP service ranged from 0 to 677 km (mean, 58 km). In all, 513 localities (43%) had a GP service within 20 km (for the majority this meant located within the town). However, for 173 populated localities (15%), the nearest GP service was more than 80 km away. There was a strong correlation between distance to GP service and ARIA value for each locality (0.69; P<0.05). Conclusions: GP services are relatively inaccessible to many rural South Australian communities. There is potential for GIS and for ARIA to contribute to rational medical workforce and health service planning. Adding measures of health need and more detailed data on types and extent of GP services provided will allow more sophisticated planning.
Resumo:
OBJECTIVE To describe heterogeneity of HIV prevalence among pregnant women in Hlabisa health district, South Africa and to correlate this with proximity of homestead to roads. METHODS HIV prevalence measured through anonymous surveillance among pregnant women and stratified by local village clinic. Polygons were created around each clinic, assuming women attend the clinic nearest their home. A geographical information system (GIS) calculated the mean distance from homesteads in each clinic catchment to nearest primary (1 degrees) and to nearest primary or secondary (2 degrees) road. RESULTS We found marked HIV heterogeneity by clinic catchment (range 19-31% (P < 0.001). A polygon plot demonstrated lower HIV prevalence in catchments remote from 1 degrees roads. Mean distance from homesteads to nearest 1 degrees or 2 degrees road varied by clinic catchment from 1623 to 7569 m. The mean distance from homesteads to a 1 degrees or 2 degrees road for each clinic catchment was strongly correlated with HIV prevalence (r = 0.66; P = 0.002). CONCLUSIONS The substantial HIV heterogeneity in this district is closely correlated with proximity to a 1 degrees or 2 degrees road. GIS is a powerful tool to demonstrate and to start to analyse this observation. Further research is needed to better understand this relationship both at ecological and individual levels, and to develop interventions to reduce the spread of HIV infection.
Resumo:
The effect of increasing population density on the formation of pits, their size and spatial distribution, and on levels of mortality was examined in the antlion Myrmeleon acer Walker. Antlions were kept at densities ranging from 0.4 to 12.8 individuals per 100 cm(2). The distribution of pits was regular or uniform across all densities, but antlions constructed proportionally fewer and smaller pits as density increased. Mortality through cannibalism was very low and only occurred at densities greater than five individuals per 100 cm(2). Antlions in artificially crowded situations frequently relocated their pits and when more space became available, individuals became more dispersed with time. Redistribution of this species results from active avoidance of other antlions and sand throwing associated with pit construction and maintenance, rather than any attempt to optimise prey capture per se.
Resumo:
We investigate the internal dynamics of two cellular automaton models with heterogeneous strength fields and differing nearest neighbour laws. One model is a crack-like automaton, transferring ail stress from a rupture zone to the surroundings. The other automaton is a partial stress drop automaton, transferring only a fraction of the stress within a rupture zone to the surroundings. To study evolution of stress, the mean spectral density. f(k(r)) of a stress deficit held is: examined prior to, and immediately following ruptures in both models. Both models display a power-law relationship between f(k(r)) and spatial wavenumber (k(r)) of the form f(k(r)) similar tok(r)(-beta). In the crack model, the evolution of stress deficit is consistent with cyclic approach to, and retreat from a critical state in which large events occur. The approach to criticality is driven by tectonic loading. Short-range stress transfer in the model does not affect the approach to criticality of broad regions in the model. The evolution of stress deficit in the partial stress drop model is consistent with small fluctuations about a mean state of high stress, behaviour indicative of a self-organised critical system. Despite statistics similar to natural earthquakes these simplified models lack a physical basis. physically motivated models of earthquakes also display dynamical complexity similar to that of a critical point system. Studies of dynamical complexity in physical models of earthquakes may lead to advancement towards a physical theory for earthquakes.
Resumo:
The rocky intertidal zone has the potential to be one of the harshest environments for free-spawning organisms, but empirical data on fertilization success are scarce. Here, I report on an intertidal, solitary ascidian, Pyura stolonifera, which was observed to spawn at low tide. At a scale likely to be most important to gametes (metres, duration of tide), approximately 30% of individuals in the population were spawning synchronously. Spawned gametes remained in a viscous matrix and this appeared to minimise their dilution. Fertilization success varied greatly among individuals (0 to 92%) and was related to the distance to the nearest neighbouring spawner. Occasional wave wash facilitated the movement of sperm between spawners. Fertilization success in some individuals was limited by the scarcity of sperm whilst the experimental addition of sperm did not increase success in others.
Resumo:
Ancestry informative markers (AIMs) are genetic loci with large frequency differences between the major ethnic groups and are very useful in admixture estimation. However, their frequencies are poorly known within South American indigenous populations, making it difficult to use them in admixture studies with Latin American populations, such as the trihybrid Brazilian population. To minimize this problem, the frequencies of the AIMs FY-null RB2300, LPL, AT3-1/1), Sb19.3, APO, and PV92 were determined via PCR and PCR-RFLP in four tribes from Brazilian Amazon (Tikuna, Kashinawa, Baniwa, and Kanamari), to evaluate their potential for discriminating indigenous populations from Europeans and Africans, as well as discriminating each tribe from the others. Although capable of differentiating tribes, as evidenced by the exact test of population differentiation, a neighbor-joining tree suggests that the AIMs are useless in obtaining reliable reconstructions of the biological relationships and evolutionary history that characterize the villages and tribes studied. The mean allele frequencies from these AIMs were very similar to those observed for North American natives. They discriminated Amerindians from Africans, but not from Europeans. On the other hand, the neighbor-joining dendrogram separated Africans and Europeans from Amerindians with a high statistical support (bootstrap = 0.989). The relatively low diversity (GST = 0.042) among North American natives and Amerindians from Brazilian Amazon agrees with the lack of intra-ethnic variation previously reported for these markers. Despite genetic drift effects, the mean allelic frequencies herein presented could be used as Amerindian parental frequencies in admixture estimates in urban Brazilian populations.
Resumo:
1. Cluster analysis of reference sites with similar biota is the initial step in creating River Invertebrate Prediction and Classification System (RIVPACS) and similar river bioassessment models such as Australian River Assessment System (AUSRIVAS). This paper describes and tests an alternative prediction method, Assessment by Nearest Neighbour Analysis (ANNA), based on the same philosophy as RIVPACS and AUSRIVAS but without the grouping step that some people view as artificial. 2. The steps in creating ANNA models are: (i) weighting the predictor variables using a multivariate approach analogous to principal axis correlations, (ii) calculating the weighted Euclidian distance from a test site to the reference sites based on the environmental predictors, (iii) predicting the faunal composition based on the nearest reference sites and (iv) calculating an observed/expected (O/E) analogous to RIVPACS/AUSRIVAS. 3. The paper compares AUSRIVAS and ANNA models on 17 datasets representing a variety of habitats and seasons. First, it examines each model's regressions for Observed versus Expected number of taxa, including the r(2), intercept and slope. Second, the two models' assessments of 79 test sites in New Zealand are compared. Third, the models are compared on test and presumed reference sites along a known trace metal gradient. Fourth, ANNA models are evaluated for western Australia, a geographically distinct region of Australia. The comparisons demonstrate that ANNA and AUSRIVAS are generally equivalent in performance, although ANNA turns out to be potentially more robust for the O versus E regressions and is potentially more accurate on the trace metal gradient sites. 4. The ANNA method is recommended for use in bioassessment of rivers, at least for corroborating the results of the well established AUSRIVAS- and RIVPACS-type models, if not to replace them.
Resumo:
Understanding the interfacial interactions and structure is important to better design and application of organic-inorganic nanohybrids. This paper presents our recent molecular dynamic studies on organoclays and polymer nanocomposites, including the layering behavior of organoclays, structural and dynamic properties of dioctadecyldimethyl ammoniums in organoclays, and interfacial interactions and structure of polyurethane nanocomposites. The results demonstrate that the layering behaviors of organoclays are closely related to the chain length of quaternary alkyl ammoniums and cation exchangeable capacity of clays. In addition to typical layered structures such as monolayer, bilayer and pseudo-trilayer, a pseudo-quadrilayer structure was also observed in organoclays modified with dioctadecyldimethyl ammoniums (DODDMA). In such a structure, alkyl chains do not lie flat within a single layer but interlace, and also jump to the next layer or even the next nearest layer. Moreover, the diffusion constants of nitrogen and methylene atoms increase with the temperature and methelene towards the tail groups. For polyurethane nanocomposite, the van der Waals interaction between apolar alkyl chains and soft segments of polyurethane predominates the interactions between organoclay and polyurethane. Different from most bulk polyurethane systems, there is no distinct phase-separated structure for the polyurethane.
Resumo:
The digenean originally designated Lepidapedon (Lepidapedon) ostorhinchi is redescribed from its type-host, Oplegnathus woodwardi [= Ostorhinchus conwaii], from the waters off Western Australia. The discovery of a uroproct indicates that the generic designation is wrong and the worm should be Paralepidapedon ostorhinchi (Korotaeva, 1974) n. comb. It is distinct from its nearest relative, P. hoplognathi (Yamaguti, 1938), in having: a prominent post-oral ring; a distinct oesophagus; short anterior diverticula on the caeca; a long external seminal vesicle, ensheathed in a membrane bound gland-cell mass; and less anteriorly extensive vitellarium.
Resumo:
We compared four strategies for inviting 91,456 women aged 50-69 years to one of six clinics for mammography screening and 40,142 men aged 60-79 years to one of 10 clinics for abdominal aortic aneurysm (AAA) screening. The strategies were invitation to the clinic nearest to the client and invitation to the clinic nearest to the client's area of residence defined by census small area, postcode and local government area. For each strategy we calculated the expected demand at each clinic and the travel distances for clients. We found that when women were allocated to mammography clinics on the basis of the local government area instead of their individual address, expected demand at one clinic increased by 60%, and 19% of clients were invited to attend a more remote clinic, entailing 99,000 km of additional travel. Similar results were obtained for men allocated to AAA clinics by their postcode of residence instead of their individual address: 55% difference in expected demand, 13% to a more remote clinic and 60,000 km of extra travel. Allocation on the basis of small areas did not show such great differences, except for travel distance, which was about 5% higher for each clinic type. We recommend that allocation of clients to screening clinics be made according to residential address, that assessment of the location of clinics be based on distances between residences and nearest clinic, but that planning new locations for clinics be aided with spatial analysis tools using small area demographic and social data. (C) 1997 Elsevier Science Ltd.