988 resultados para Scatter plot


Relevância:

70.00% 70.00%

Publicador:

Resumo:

Thesis (Master's)--University of Washington, 2016-01

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Over the last few decades, there has been a significant land cover (LC) change across the globe due to the increasing demand of the burgeoning population and urban sprawl. In order to take account of the change, there is a need for accurate and up-to-date LC maps. Mapping and monitoring of LC in India is being carried out at national level using multi-temporal IRS AWiFS data. Multispectral data such as IKONOS, Landsat-TM/ETM+, IRS-ICID LISS-III/IV, AWiFS and SPOT-5, etc. have adequate spatial resolution (similar to 1m to 56m) for LC mapping to generate 1:50,000 maps. However, for developing countries and those with large geographical extent, seasonal LC mapping is prohibitive with data from commercial sensors of limited spatial coverage. Superspectral data from the MODIS sensor are freely available, have better temporal (8 day composites) and spectral information. MODIS pixels typically contain a mixture of various LC types (due to coarse spatial resolution of 250, 500 and 1000 in), especially in more fragmented landscapes. In this context, linear spectral unmixing would be useful for mapping patchy land covers, such as those that characterise much of the Indian subcontinent. This work evaluates the existing unmixing technique for LC mapping using MODIS data, using end-members that are extracted through Pixel Purity Index (PPI), Scatter plot and N-dimensional visualisation. The abundance maps were generated for agriculture, built up, forest, plantations, waste land/others and water bodies. The assessment of the results using ground truth and a LISS-III classified map shows 86% overall accuracy, suggesting the potential for broad-scale applicability of the technique with superspectral data for natural resource planning and inventory applications. Index Terms-Remote sensing, digital

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Over the last few decades, there has been a significant land cover (LC) change across the globe due to the increasing demand of the burgeoning population and urban sprawl. In order to take account of the change, there is a need for accurate and up- to-date LC maps. Mapping and monitoring of LC in India is being carried out at national level using multi-temporal IRS AWiFS data. Multispectral data such as IKONOS, Landsat- TM/ETM+, IRS-1C/D LISS-III/IV, AWiFS and SPOT-5, etc. have adequate spatial resolution (~ 1m to 56m) for LC mapping to generate 1:50,000 maps. However, for developing countries and those with large geographical extent, seasonal LC mapping is prohibitive with data from commercial sensors of limited spatial coverage. Superspectral data from the MODIS sensor are freely available, have better temporal (8 day composites) and spectral information. MODIS pixels typically contain a mixture of various LC types (due to coarse spatial resolution of 250, 500 and 1000 m), especially in more fragmented landscapes. In this context, linear spectral unmixing would be useful for mapping patchy land covers, such as those that characterise much of the Indian subcontinent. This work evaluates the existing unmixing technique for LC mapping using MODIS data, using end- members that are extracted through Pixel Purity Index (PPI), Scatter plot and N-dimensional visualisation. The abundance maps were generated for agriculture, built up, forest, plantations, waste land/others and water bodies. The assessment of the results using ground truth and a LISS-III classified map shows 86% overall accuracy, suggesting the potential for broad-scale applicability of the technique with superspectral data for natural resource planning and inventory applications.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A radial basis function neural network was employed to model the abundance of cyanobacteria. The trained network could predict the populations of two bloom forming algal taxa with high accuracy, Nostocales spp. and Anabaena spp., in the River Darling, Australia. To elucidate the population dynamics for both Nostocales spp. and Anabaena spp., sensitivity analysis was performed with the following results. Total Kjeldahl nitrogen had a very strong influence on the abundance of the two algal taxa, electrical conductivity had a very strong negative relationship with the population of the two algal species, and flow was identified as one dominant factor influencing algal blooms after a scatter plot revealed that high flow could significantly reduce the algal biomass for both Nostocales spp. and Anabaena spp. Other variables such as turbidity, color, and pH were less important in determining the abundance and succession of the algal blooms.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The relationships of eight moss species of Dicranum in 31 sites in main ecological systems in the Changbai Mountain with environmental factors were studied by canonical correspondence analysis (CCA). The results showed that altitude, soil sand percentage, water percentage, acidity and canopy density were important environmental factors influencing the distribution of the species of Dicranum . The relationships between Dicranum elongatum Schleich. ex Schwaegr ., D.groenlandicum Brid. and altitude,between D.japonicum Mitt., D.scoparium Hedw. and canopy density,between D.polysetum Sw., D. undulatum Schrad. ex Brid. and soil acidity and water percentage,were positively correlative. The niche overlaps among the eight species of Dicranum were calculated. The minimal spanning tree of the eight species on the two-dimensional scatter plot were also drawn based on their niche overlaps, which clearly revealed the ecological similarities of eight species.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Behavioral inhibition model suggests the generation of anxiety is related with over-inhibition. For knowing about anxiety better, we used event-related potential (ERP) technique to explore the underlying mechanism of executive inhibition under the emotional distracter in high and low trait-anxious groups. Firstly, we set up the Chinese affective picture system (CAPS) as the stimuli of subsequent experiments. Secondly, we screened the high and low trait-anxious participants using the State-Trait Anxiety Inventory. In the first ERP study, a modified oddball paradigm was used with the positive, neutral and negative pictures as novel stimuli and the potentials evoked by three types pictures were analyzed. In the second ERP study, the same paradigm with higher task load was employed to examine the interaction of anxious level and emotion. Main results as follows: 1. CAPS consisted of 852 pictures was assessed via three dimensionalities, valence, arousal and dominance. The standard deviation of scores on valence and dominance was more than the standard deviation of scores on dimension of arousal. Scatter plot showed that the score distributing on the dimension of valence and arousal was wide in CAPS. 2. In both high and low trait-anxiety groups, the amplitudes of N2 and P3 of negative pictures were greater and smaller respectively as compared with neutral and positive pictures, which suggested all participants no matter what anxious level required more inhibition processing to negative information than others. 3. With increasing of task load, the P3 amplitudes of negative pictures in high anxious group were reduced relative to neutral pictures. In addition, in high anxious group, the P3 amplitudes of positive pictures had the same changes as those of negative ones. Whereas, the reduced P3 of positive pictures were not observed in low anxious group. The results showed the high anxious participants employed the same inhibitory strategy to the positive distracter as the negative distracter, which possibly the over-inhibition processing was involved in this group. 4. Dipole source analysis found cingulate may be involved in executive inhibition processing. In sum, as for the inhibition, high and low anxious group both is sensitive to negative information. However, in the high load situation, due to the shortness of cognitive resources, the high anxious individual represents the general sensitivity to all emotional information. These results gave the electrophysiological evidence for over-inhibition in high trait-anxiety group.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Our media is saturated with claims of ``facts'' made from data. Database research has in the past focused on how to answer queries, but has not devoted much attention to discerning more subtle qualities of the resulting claims, e.g., is a claim ``cherry-picking''? This paper proposes a Query Response Surface (QRS) based framework that models claims based on structured data as parameterized queries. A key insight is that we can learn a lot about a claim by perturbing its parameters and seeing how its conclusion changes. This framework lets us formulate and tackle practical fact-checking tasks --- reverse-engineering vague claims, and countering questionable claims --- as computational problems. Within the QRS based framework, we take one step further, and propose a problem along with efficient algorithms for finding high-quality claims of a given form from data, i.e. raising good questions, in the first place. This is achieved to using a limited number of high-valued claims to represent high-valued regions of the QRS. Besides the general purpose high-quality claim finding problem, lead-finding can be tailored towards specific claim quality measures, also defined within the QRS framework. An example of uniqueness-based lead-finding is presented for ``one-of-the-few'' claims, landing in interpretable high-quality claims, and an adjustable mechanism for ranking objects, e.g. NBA players, based on what claims can be made for them. Finally, we study the use of visualization as a powerful way of conveying results of a large number of claims. An efficient two stage sampling algorithm is proposed for generating input of 2d scatter plot with heatmap, evalutaing a limited amount of data, while preserving the two essential visual features, namely outliers and clusters. For all the problems, we present real-world examples and experiments that demonstrate the power of our model, efficiency of our algorithms, and usefulness of their results.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: There is growing interest in the potential utility of molecular diagnostics in improving the detection of life-threatening infection (sepsis). LightCycler® SeptiFast is a multipathogen probebased real-time PCR system targeting DNA sequences of bacteria and fungi present in blood samples within a few hours. We report here the protocol of the first systematic review of published clinical diagnostic accuracy studies of this technology when compared with blood culture in the setting of suspected sepsis. Methods/design: Data sources: the Cochrane Database of Systematic Reviews, the Database of Abstracts of Reviews of Effects (DARE), the Health Technology Assessment Database (HTA), the NHS Economic Evaluation Database (NHSEED), The Cochrane Library, MEDLINE, EMBASE, ISI Web of Science, BIOSIS Previews, MEDION and the Aggressive Research Intelligence Facility Database (ARIF). Study selection: diagnostic accuracy studies that compare the real-time PCR technology with standard culture results performed on a patient's blood sample during the management of sepsis. Data extraction: three reviewers, working independently, will determine the level of evidence, methodological quality and a standard data set relating to demographics and diagnostic accuracy metrics for each study. Statistical analysis/data synthesis: heterogeneity of studies will be investigated using a coupled forest plot of sensitivity and specificity and a scatter plot in Receiver Operator Characteristic (ROC) space. Bivariate model method will be used to estimate summary sensitivity and specificity. The authors will investigate reporting biases using funnel plots based on effective sample size and regression tests of asymmetry. Subgroup analyses are planned for adults, children and infection setting (hospital vs community) if sufficient data are uncovered. Dissemination: Recommendations will be made to the Department of Health (as part of an open-access HTA report) as to whether the real-time PCR technology has sufficient clinical diagnostic accuracy potential to move forward to efficacy testing during the provision of routine clinical care.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Purpose: There is an urgent need to develop diagnostic tests to improve the detection of pathogens causing life-threatening infection (sepsis). SeptiFast is a CE-marked multi-pathogen real-time PCR system capable of detecting DNA sequences of bacteria and fungi present in blood samples within a few hours. We report here a systematic review and meta-analysis of diagnostic accuracy studies of SeptiFast in the setting of suspected sepsis.

Methods: A comprehensive search strategy was developed to identify studies that compared SeptiFast with blood culture in suspected sepsis. Methodological quality was assessed using QUADAS. Heterogeneity of studies was investigated using a coupled forest plot of sensitivity and specificity and a scatter plot in receiver operator characteristic space. Bivariate model method was used to estimate summary sensitivity and specificity.

Results: From 41 phase III diagnostic accuracy studies, summary sensitivity and specificity for SeptiFast compared with blood culture were 0.68 (95 % CI 0.63–0.73) and 0.86 (95 % CI 0.84–0.89) respectively. Study quality was judged to be variable with important deficiencies overall in design and reporting that could impact on derived diagnostic accuracy metrics.

Conclusions: SeptiFast appears to have higher specificity than sensitivity, but deficiencies in study quality are likely to render this body of work unreliable. Based on the evidence presented here, it remains difficult to make firm recommendations about the likely clinical utility of SeptiFast in the setting of suspected sepsis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Knowledge discovery in databases is the non-trivial process of identifying valid, novel potentially useful and ultimately understandable patterns from data. The term Data mining refers to the process which does the exploratory analysis on the data and builds some model on the data. To infer patterns from data, data mining involves different approaches like association rule mining, classification techniques or clustering techniques. Among the many data mining techniques, clustering plays a major role, since it helps to group the related data for assessing properties and drawing conclusions. Most of the clustering algorithms act on a dataset with uniform format, since the similarity or dissimilarity between the data points is a significant factor in finding out the clusters. If a dataset consists of mixed attributes, i.e. a combination of numerical and categorical variables, a preferred approach is to convert different formats into a uniform format. The research study explores the various techniques to convert the mixed data sets to a numerical equivalent, so as to make it equipped for applying the statistical and similar algorithms. The results of clustering mixed category data after conversion to numeric data type have been demonstrated using a crime data set. The thesis also proposes an extension to the well known algorithm for handling mixed data types, to deal with data sets having only categorical data. The proposed conversion has been validated on a data set corresponding to breast cancer. Moreover, another issue with the clustering process is the visualization of output. Different geometric techniques like scatter plot, or projection plots are available, but none of the techniques display the result projecting the whole database but rather demonstrate attribute-pair wise analysis

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Efficient optic disc segmentation is an important task in automated retinal screening. For the same reason optic disc detection is fundamental for medical references and is important for the retinal image analysis application. The most difficult problem of optic disc extraction is to locate the region of interest. Moreover it is a time consuming task. This paper tries to overcome this barrier by presenting an automated method for optic disc boundary extraction using Fuzzy C Means combined with thresholding. The discs determined by the new method agree relatively well with those determined by the experts. The present method has been validated on a data set of 110 colour fundus images from DRION database, and has obtained promising results. The performance of the system is evaluated using the difference in horizontal and vertical diameters of the obtained disc boundary and that of the ground truth obtained from two expert ophthalmologists. For the 25 test images selected from the 110 colour fundus images, the Pearson correlation of the ground truth diameters with the detected diameters by the new method are 0.946 and 0.958 and, 0.94 and 0.974 respectively. From the scatter plot, it is shown that the ground truth and detected diameters have a high positive correlation. This computerized analysis of optic disc is very useful for the diagnosis of retinal diseases

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Impatiens noli-tangere is scarce in the UK and probably only native to the Lake District and Wales. It is the sole food plant for the endangered moth Eustroma reticulattum. Significant annual fluctuations in the size of I. noli-tangere populations endanger the continued presence of E. reticulatum in the UK. In this study, variation in population size was monitored across native populations of L noli-tangere in the English Lake District and Wales. In 1998, there was a crash in the population size of all metapopulations in the Lake District but not of those found in Wales. A molecular survey of the genetic affinities of samples in 1999 from both regions and a reference population from Switzerland was performed using AFLP and ISSR analyses. The consensus UPGMA dendrogram and a PCO scatter plot revealed clear differentiation between the populations of L noli-tangere in Wales and those in the Lake District. Most of the genetic variation in the UK (H-T= 0.064) was partitioned between (G(ST) = 0.455) rather than within (H-S = 0.034) regions, inferring little gene flow occurs between regions. There was similar bias towards differentiation between metapopulations in Wales, again consistent with low levels of interpopulation gene flow. This contrasts with far lower levels of differentiation in the Lake District which suggests modest rates of gene flow may occur between populations. It is concluded that in the event of local extinction of sites or populations, reintroductions should be restricted to samples collected from the same region. We then surveyed climatic variables to identify those most likely to cause local extinctions. Climatic correlates of population size were sought from two Lake District metapopulations situated close to a meteorological station. A combination of three climatic variables common to both sites explained 81-84% of the variation in plant number between 1990 and 2001. Projected trends for these climatic variables were used in a Monte Carlo simulation which suggested an increased risk of I. noli-tangere population crashes by 2050 at Coniston Water. but not at Derwentwater. Implications of these findings for practical conservation strategies are explored. (C) 2003 Elsevier Ltd. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We use sunspot group observations from the Royal Greenwich Observatory (RGO) to investigate the effects of intercalibrating data from observers with different visual acuities. The tests are made by counting the number of groups RB above a variable cut-off threshold of observed total whole-spot area (uncorrected for foreshortening) to simulate what a lower acuity observer would have seen. The synthesised annual means of RB are then re-scaled to the full observed RGO group number RA using a variety of regression techniques. It is found that a very high correlation between RA and RB (rAB > 0.98) does not prevent large errors in the intercalibration (for example sunspot maximum values can be over 30 % too large even for such levels of rAB). In generating the backbone sunspot number (RBB), Svalgaard and Schatten (2015, this issue) force regression fits to pass through the scatter plot origin which generates unreliable fits (the residuals do not form a normal distribution) and causes sunspot cycle amplitudes to be exaggerated in the intercalibrated data. It is demonstrated that the use of Quantile-Quantile (“Q  Q”) plots to test for a normal distribution is a useful indicator of erroneous and misleading regression fits. Ordinary least squares linear fits, not forced to pass through the origin, are sometimes reliable (although the optimum method used is shown to be different when matching peak and average sunspot group numbers). However, other fits are only reliable if non-linear regression is used. From these results it is entirely possible that the inflation of solar cycle amplitudes in the backbone group sunspot number as one goes back in time, relative to related solar-terrestrial parameters, is entirely caused by the use of inappropriate and non-robust regression techniques to calibrate the sunspot data.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Statistical analysis of data is crucial in cephalometric investigations. There are certainly excellent examples of good statistical practice in the field, but some articles published worldwide have carried out inappropriate analyses. Objective: The purpose of this study was to show that when the double records of each patient are traced on the same occasion, a control chart for differences between readings needs to be drawn, and limits of agreement and coefficients of repeatability must be calculated. Material and methods: Data from a well-known paper in Orthodontics were used for showing common statistical practices in cephalometric investigations and for proposing a new technique of analysis. Results: A scatter plot of the two radiograph readings and the two model readings with the respective regression lines are shown. Also, a control chart for the mean of the differences between radiograph readings was obtained and a coefficient of repeatability was calculated. Conclusions: A standard error assuming that mean differences are zero, which is referred to in Orthodontics and Facial Orthopedics as the Dahlberg error, can be calculated only for estimating precision if accuracy is already proven. When double readings are collected, limits of agreement and coefficients of repeatability must be calculated. A graph with differences of readings should be presented and outliers discussed.