121 resultados para Predicted Distribution Data
Resumo:
Advances in hardware and software technologies allow to capture streaming data. The area of Data Stream Mining (DSM) is concerned with the analysis of these vast amounts of data as it is generated in real-time. Data stream classification is one of the most important DSM techniques allowing to classify previously unseen data instances. Different to traditional classifiers for static data, data stream classifiers need to adapt to concept changes (concept drift) in the stream in real-time in order to reflect the most recent concept in the data as accurately as possible. A recent addition to the data stream classifier toolbox is eRules which induces and updates a set of expressive rules that can easily be interpreted by humans. However, like most rule-based data stream classifiers, eRules exhibits a poor computational performance when confronted with continuous attributes. In this work, we propose an approach to deal with continuous data effectively and accurately in rule-based classifiers by using the Gaussian distribution as heuristic for building rule terms on continuous attributes. We show on the example of eRules that incorporating our method for continuous attributes indeed speeds up the real-time rule induction process while maintaining a similar level of accuracy compared with the original eRules classifier. We termed this new version of eRules with our approach G-eRules.
Resumo:
Traditionally, the cusp has been described in terms of a time-stationary feature of the magnetosphere which allows access of magnetosheath-like plasma to low altitudes. Statistical surveys of data from low-altitude spacecraft have shown the average characteristics and position of the cusp. Recently, however, it has been suggested that the ionospheric footprint of flux transfer events (FTEs) may be identified as variations of the “cusp” on timescales of a few minutes. In this model, the cusp can vary in form between a steady-state feature in one limit and a series of discrete ionospheric FTE signatures in the other limit. If this time-dependent cusp scenario is correct, then the signatures of the transient reconnection events must be able, on average, to reproduce the statistical cusp occurrence previously determined from the satellite observations. In this paper, we predict the precipitation signatures which are associated with transient magnetopause reconnection, following recent observations of the dependence of dayside ionospheric convection on the orientation of the IMF. We then employ a simple model of the longitudinal motion of FTE signatures to show how such events can easily reproduce the local time distribution of cusp occurrence probabilities, as observed by low-altitude satellites. This is true even in the limit where the cusp is a series of discrete events. Furthermore, we investigate the existence of double cusp patches predicted by the simple model and show how these events may be identified in the data.
Resumo:
Combined observations by meridian-scanning photometers, all-sky auroral TV camera and the EISCAT radar permitted a detailed analysis of the temporal and spatial development of the midday auroral breakup phenomenon and the related ionospheric ion flow pattern within the 71°–75° invariant latitude radar field of view. The radar data revealed dominating northward and westward ion drifts, of magnitudes close to the corresponding velocities of the discrete, transient auroral forms, during the two different events reported here, characterized by IMF |BY/BZ| < 1 and > 2, respectively (IMF BZ between −8 and −3 nT and BY > 0). The spatial scales of the discrete optical events were ∼50 km in latitude by ∼500 km in longitude, and their lifetimes were less than 10 min. Electric potential enhancements with peak values in the 30–50 kV range are inferred along the discrete arc in the IMF |BY/BZ| < 1 case from the optical data and across the latitudinal extent of the radar field of view in the |BY/BZ| > 2 case. Joule heat dissipation rates in the maximum phase of the discrete structures of ∼ 100 ergs cm−2 s−1 (0.1 W m−2) are estimated from the photometer intensities and the ion drift data. These observations combined with the additional characteristics of the events, documented here and in several recent studies (i.e., their quasi-periodic nature, their motion pattern relative to the persistent cusp or cleft auroral arc, the strong relationship with the interplanetary magnetic field and the associated ion drift/E field events and ground magnetic signatures), are considered to be strong evidence in favour of a transient, intermittent reconnection process at the dayside magnetopause and associated energy and momentum transfer to the ionosphere in the polar cusp and cleft regions. The filamentary spatial structure and the spectral characteristics of the optical signature indicate associated localized ˜1-kV potential drops between the magnetopause and the ionosphere during the most intense auroral events. The duration of the events compares well with the predicted characteristic times of momentum transfer to the ionosphere associated with the flux transfer event-related current tubes. It is suggested that, after this 2–10 min interval, the sheath particles can no longer reach the ionosphere down the open flux tube, due to the subsequent super-Alfvénic flow along the magnetopause, conductivities are lower and much less momentum is extracted from the solar wind by the ionosphere. The recurrence time (3–15 min) and the local time distribution (∼0900–1500 MLT) of the dayside auroral breakup events, combined with the above information, indicate the important roles of transient magnetopause reconnection and the polar cusp and cleft regions in the transfer of momentum and energy between the solar wind and the magnetosphere.
Resumo:
Incoherent scatter data from non-thermal F-region ionospheric plasma are analysed, using theoretical spectra predicted by Raman et al. It is found that values of the semi-empirical drift parameter D∗, associated with deviations of the ion velocity distribution from a Maxwellian, and the plasma temperatures can be rigorously deduced (the results being independent of the path of iteration) if the angle between the line-of-sight and the geomagnetic field is larger than about 15–20 degrees. For small aspect angles, the deduced value of the average (or 3-D) ion temperature remains ambiguous and the analysis is restricted to the determination of the line-of-sight temperature because the theoretical spectrum is insensitive to non-thermal effects when the plasma is viewed along directions almost parallel to the magnetic field. This limitation is expected to apply to any realistic model of the ion velocity distribution, and its consequences are discussed. Fit strategies which allow for mixed ion composition are also considered. Examples of fits to data from various EISCAT observing programmes are presented.
Resumo:
Assessment is made of the effect of the assumed form for the ion velocity distribution function on estimates of three-dimensional ion temperature from one-dimensional observations. Incoherent scatter observations by the EISCAT radar at a variety of aspect angles are used to demonstrate features of ion temperature determination and to study the ion velocity distribution function. One form of the distribution function which has recently been widely used In the interpretation of EISCAT measurements, is found to be consistent with the data presented here, in that no deviation from a Maxwellian can be detected for observations along the magnetic field line and that the ion temperature and its anisotropy are accurately predicted. It is shown that theoretical predictions of the anisotropy by Monte Carlo computations are very accurate, the observed value being greater by only a few percent. It is also demonstrated for the case studied that errors of up to 93% are introduced into the ion temperature estimate if the anisotropy is neglected. Observations at an aspect angle of 54.7°, which are not subject to this error, have a much smaller uncertainty (less than 1%) due to the adopted form of the distribution of line-of-sight velocity.
Resumo:
Highly heterogeneous mountain snow distributions strongly affect soil moisture patterns; local ecology; and, ultimately, the timing, magnitude, and chemistry of stream runoff. Capturing these vital heterogeneities in a physically based distributed snow model requires appropriately scaled model structures. This work looks at how model scale—particularly the resolutions at which the forcing processes are represented—affects simulated snow distributions and melt. The research area is in the Reynolds Creek Experimental Watershed in southwestern Idaho. In this region, where there is a negative correlation between snow accumulation and melt rates, overall scale degradation pushed simulated melt to earlier in the season. The processes mainly responsible for snow distribution heterogeneity in this region—wind speed, wind-affected snow accumulations, thermal radiation, and solar radiation—were also independently rescaled to test process-specific spatiotemporal sensitivities. It was found that in order to accurately simulate snowmelt in this catchment, the snow cover needed to be resolved to 100 m. Wind and wind-affected precipitation—the primary influence on snow distribution—required similar resolution. Thermal radiation scaled with the vegetation structure (~100 m), while solar radiation was adequately modeled with 100–250-m resolution. Spatiotemporal sensitivities to model scale were found that allowed for further reductions in computational costs through the winter months with limited losses in accuracy. It was also shown that these modeling-based scale breaks could be associated with physiographic and vegetation structures to aid a priori modeling decisions.
Resumo:
In order to calculate unbiased microphysical and radiative quantities in the presence of a cloud, it is necessary to know not only the mean water content but also the distribution of this water content. This article describes a study of the in-cloud horizontal inhomogeneity of ice water content, based on CloudSat data. In particular, by focusing on the relations with variables that are already available in general circulation models (GCMs), a parametrization of inhomogeneity that is suitable for inclusion in GCM simulations is developed. Inhomogeneity is defined in terms of the fractional standard deviation (FSD), which is given by the standard deviation divided by the mean. The FSD of ice water content is found to increase with the horizontal scale over which it is calculated and also with the thickness of the layer. The connection to cloud fraction is more complicated; for small cloud fractions FSD increases as cloud fraction increases while FSD decreases sharply for overcast scenes. The relations to horizontal scale, layer thickness and cloud fraction are parametrized in a relatively simple equation. The performance of this parametrization is tested on an independent set of CloudSat data. The parametrization is shown to be a significant improvement on the assumption of a single-valued global FSD
Resumo:
Pasture-based ruminant production systems are common in certain areas of the world, but energy evaluation in grazing cattle is performed with equations developed, in their majority, with sheep or cattle fed total mixed rations. The aim of the current study was to develop predictions of metabolisable energy (ME) concentrations in fresh-cut grass offered to non-pregnant non-lactating cows at maintenance energy level, which may be more suitable for grazing cattle. Data were collected from three digestibility trials performed over consecutive grazing seasons. In order to cover a range of commercial conditions and data availability in pasture-based systems, thirty-eight equations for the prediction of energy concentrations and ratios were developed. An internal validation was performed for all equations and also for existing predictions of grass ME. Prediction error for ME using nutrient digestibility was lowest when gross energy (GE) or organic matter digestibilities were used as sole predictors, while the addition of grass nutrient contents reduced the difference between predicted and actual values, and explained more variation. Addition of N, GE and diethyl ether extract (EE) contents improved accuracy when digestible organic matter in DM was the primary predictor. When digestible energy was the primary explanatory variable, prediction error was relatively low, but addition of water-soluble carbohydrates, EE and acid-detergent fibre contents of grass decreased prediction error. Equations developed in the current study showed lower prediction errors when compared with those of existing equations, and may thus allow for an improved prediction of ME in practice, which is critical for the sustainability of pasture-based systems.
Resumo:
Sorace (2000, 2005) has claimed that while L2 learners can easily acquire properties of L2 narrow syntax they have significant difficulty with regard to interpretation and the discourse distribution of related properties, resulting in so-called residual optionality. However, there is no consensus as to what this difficulty indicates. Is it related to an insurmountable grammatical representational deficit (in the sense of representation deficit approaches; e.g. Beck 1998, Franceschina 2001, Hawkins 2005), is it due to cross-linguistic interference, or is it just a delay due to a greater complexity involved in the acquisition of interface-conditioned properties? In this article, I explore the L2 distribution of null and overt subject pronouns of English speaking learners of L2 Spanish. While intermediate learners clearly have knowledge of the syntax of Spanish null subjects, they do not have target-like pragmatic knowledge of their distribution with overt subjects. The present data demonstrate, however, that this difficulty is overcome at highly advanced stages of L2 development, thus suggesting that properties at the syntax-pragmatics interface are not destined for inevitable fossilization.
Resumo:
Species distribution models (SDM) are increasingly used to understand the factors that regulate variation in biodiversity patterns and to help plan conservation strategies. However, these models are rarely validated with independently collected data and it is unclear whether SDM performance is maintained across distinct habitats and for species with different functional traits. Highly mobile species, such as bees, can be particularly challenging to model. Here, we use independent sets of occurrence data collected systematically in several agricultural habitats to test how the predictive performance of SDMs for wild bee species depends on species traits, habitat type, and sampling technique. We used a species distribution modeling approach parametrized for the Netherlands, with presence records from 1990 to 2010 for 193 Dutch wild bees. For each species, we built a Maxent model based on 13 climate and landscape variables. We tested the predictive performance of the SDMs with independent datasets collected from orchards and arable fields across the Netherlands from 2010 to 2013, using transect surveys or pan traps. Model predictive performance depended on species traits and habitat type. Occurrence of bee species specialized in habitat and diet was better predicted than generalist bees. Predictions of habitat suitability were also more precise for habitats that are temporally more stable (orchards) than for habitats that suffer regular alterations (arable), particularly for small, solitary bees. As a conservation tool, SDMs are best suited to modeling rarer, specialist species than more generalist and will work best in long-term stable habitats. The variability of complex, short-term habitats is difficult to capture in such models and historical land use generally has low thematic resolution. To improve SDMs’ usefulness, models require explanatory variables and collection data that include detailed landscape characteristics, for example, variability of crops and flower availability. Additionally, testing SDMs with field surveys should involve multiple collection techniques.
Resumo:
Clustering methods are increasingly being applied to residential smart meter data, providing a number of important opportunities for distribution network operators (DNOs) to manage and plan the low voltage networks. Clustering has a number of potential advantages for DNOs including, identifying suitable candidates for demand response and improving energy profile modelling. However, due to the high stochasticity and irregularity of household level demand, detailed analytics are required to define appropriate attributes to cluster. In this paper we present in-depth analysis of customer smart meter data to better understand peak demand and major sources of variability in their behaviour. We find four key time periods in which the data should be analysed and use this to form relevant attributes for our clustering. We present a finite mixture model based clustering where we discover 10 distinct behaviour groups describing customers based on their demand and their variability. Finally, using an existing bootstrapping technique we show that the clustering is reliable. To the authors knowledge this is the first time in the power systems literature that the sample robustness of the clustering has been tested.
Resumo:
The responses of animals and plants to recent climate change vary greatly from species to species, but attempts to understand this variation have met with limited success. This has led to concerns that predictions of responses are inherently uncertain because of the complexity of interacting drivers and biotic interactions. However, we show for an exemplar group of 155 Lepidoptera species that about 60% of the variation among species in their abundance trends over the past four decades can be explained by species-specific exposure and sensitivity to climate change. Distribution changes were less well predicted, but nonetheless, up to 53% of the variation was explained. We found that species vary in their overall sensitivity to climate and respond to different components of the climate despite ostensibly experiencing the same climate changes. Hence, species have undergone different levels of population “forcing” (exposure), driving variation among species in their national-scale abundance and distribution trends. We conclude that variation in species’ responses to recent climate change may be more predictable than previously recognized.
Resumo:
We use sunspot group observations from the Royal Greenwich Observatory (RGO) to investigate the effects of intercalibrating data from observers with different visual acuities. The tests are made by counting the number of groups RB above a variable cut-off threshold of observed total whole-spot area (uncorrected for foreshortening) to simulate what a lower acuity observer would have seen. The synthesised annual means of RB are then re-scaled to the full observed RGO group number RA using a variety of regression techniques. It is found that a very high correlation between RA and RB (rAB > 0.98) does not prevent large errors in the intercalibration (for example sunspot maximum values can be over 30 % too large even for such levels of rAB). In generating the backbone sunspot number (RBB), Svalgaard and Schatten (2015, this issue) force regression fits to pass through the scatter plot origin which generates unreliable fits (the residuals do not form a normal distribution) and causes sunspot cycle amplitudes to be exaggerated in the intercalibrated data. It is demonstrated that the use of Quantile-Quantile (“Q Q”) plots to test for a normal distribution is a useful indicator of erroneous and misleading regression fits. Ordinary least squares linear fits, not forced to pass through the origin, are sometimes reliable (although the optimum method used is shown to be different when matching peak and average sunspot group numbers). However, other fits are only reliable if non-linear regression is used. From these results it is entirely possible that the inflation of solar cycle amplitudes in the backbone group sunspot number as one goes back in time, relative to related solar-terrestrial parameters, is entirely caused by the use of inappropriate and non-robust regression techniques to calibrate the sunspot data.
Resumo:
Blanket bog occupies approximately 6 % of the area of the UK today. The Holocene expansion of this hyperoceanic biome has previously been explained as a consequence of Neolithic forest clearance. However, the present distribution of blanket bog in Great Britain can be predicted accurately with a simple model (PeatStash) based on summer temperature and moisture index thresholds, and the same model correctly predicts the highly disjunct distribution of blanket bog worldwide. This finding suggests that climate, rather than land-use history, controls blanket-bog distribution in the UK and everywhere else. We set out to test this hypothesis for blanket bogs in the UK using bioclimate envelope modelling compared with a database of peat initiation age estimates. We used both pollen-based reconstructions and climate model simulations of climate changes between the mid-Holocene (6000 yr BP, 6 ka) and modern climate to drive PeatStash and predict areas of blanket bog. We compiled data on the timing of blanket-bog initiation, based on 228 age determinations at sites where peat directly overlies mineral soil. The model predicts large areas of northern Britain would have had blanket bog by 6000 yr BP, and the area suitable for peat growth extended to the south after this time. A similar pattern is shown by the basal peat ages and new blanket bog appeared over a larger area during the late Holocene, the greatest expansion being in Ireland, Wales and southwest England, as the model predicts. The expansion was driven by a summer cooling of about 2 °C, shown by both pollen-based reconstructions and climate models. The data show early Holocene (pre-Neolithic) blanket-bog initiation at over half of the sites in the core areas of Scotland, and northern England. The temporal patterns and concurrence of the bioclimate model predictions and initiation data suggest that climate change provides a parsimonious explanation for the early Holocene distribution and later expansion of blanket bogs in the UK, and it is not necessary to invoke anthropogenic activity as a driver of this major landscape change.
Resumo:
Blanket bog occupies approximately 6% of the area of the UK today. The Holocene expansion of this hyperoceanic biome has previously been explained as a consequence of Neolithic forest clearance. However, the present distribution of blanket bog in Great Britain can be predicted accurately with a simple model (PeatStash) based on summer temperature and moisture index thresholds, and the same model correctly predicts the highly disjunct distribution of blanket bog worldwide. This finding suggests that climate, rather than land-use history, controls blanket-bog distribution in the UK and everywhere else. We set out to test this hypothesis for blanket bogs in the UK using bioclimate envelope modelling compared with a database of peat initiation age estimates. We used both pollen-based reconstructions and climate model simulations of climate changes between the mid-Holocene (6000 yr BP, 6 ka) and modern climate to drive PeatStash and predict areas of blanket bog. We compiled data on the timing of blanketbog initiation, based on 228 age determinations at sites where peat directly overlies mineral soil. The model predicts that large areas of northern Britain would have had blanket bog by 6000 yr BP, and the area suitable for peat growth extended to the south after this time. A similar pattern is shown by the basal peat ages and new blanket bog appeared over a larger area during the late Holocene, the greatest expansion being in Ireland,Wales, and southwest England, as the model predicts. The expansion was driven by a summer cooling of about 2 °C, shown by both pollen-based reconstructions and climate models. The data show early Holocene (pre- Neolithic) blanket-bog initiation at over half of the sites in the core areas of Scotland and northern England. The temporal patterns and concurrence of the bioclimate model predictions and initiation data suggest that climate change provides a parsimonious explanation for the early Holocene distribution and later expansion of blanket bogs in the UK, and it is not necessary to invoke anthropogenic activity as a driver of this major landscape change.