15 resultados para Quantile regressions
em University of Queensland eSpace - Australia
Resumo:
Quantile computation has many applications including data mining and financial data analysis. It has been shown that an is an element of-approximate summary can be maintained so that, given a quantile query d (phi, is an element of), the data item at rank [phi N] may be approximately obtained within the rank error precision is an element of N over all N data items in a data stream or in a sliding window. However, scalable online processing of massive continuous quantile queries with different phi and is an element of poses a new challenge because the summary is continuously updated with new arrivals of data items. In this paper, first we aim to dramatically reduce the number of distinct query results by grouping a set of different queries into a cluster so that they can be processed virtually as a single query while the precision requirements from users can be retained. Second, we aim to minimize the total query processing costs. Efficient algorithms are developed to minimize the total number of times for reprocessing clusters and to produce the minimum number of clusters, respectively. The techniques are extended to maintain near-optimal clustering when queries are registered and removed in an arbitrary fashion against whole data streams or sliding windows. In addition to theoretical analysis, our performance study indicates that the proposed techniques are indeed scalable with respect to the number of input queries as well as the number of items and the item arrival rate in a data stream.
Resumo:
In many online applications, we need to maintain quantile statistics for a sliding window on a data stream. The sliding windows in natural form are defined as the most recent N data items. In this paper, we study the problem of estimating quantiles over other types of sliding windows. We present a uniform framework to process quantile queries for time constrained and filter based sliding windows. Our algorithm makes one pass on the data stream and maintains an E-approximate summary. It uses O((1)/(epsilon2) log(2) epsilonN) space where N is the number of data items in the window. We extend this framework to further process generalized constrained sliding window queries and proved that our technique is applicable for flexible window settings. Our performance study indicates that the space required in practice is much less than the given theoretical bound and the algorithm supports high speed data streams.
Resumo:
Purpose, An in vitro study was carried out to determine the iontophoretic permeability of local anesthetics through human epidermis. The relationship between physicochemical structure and the permeability of these solutes was then examined using an ionic mobility-pore model developed to define quantitative relationships. Methods. The iontophoretic permeability of both ester-type anesthetics (procaine, butacaine, tetracaine) and amide-type anesthetics (prilocaine, mepivacaine, lidocaine, bupivacaine, etidocaine, cinchocaine) were determined through excised human epidermis over 2 hrs using a constant d.c. current and Ag/AgCl electrodes. Individual ion mobilities were determined from conductivity measurements in aqueous solutions. Multiple stepwise regression was applied to interrelate the iontophoretic permeability of the solutes with their physical properties to examine the appropriateness of the ionic mobility-pore model and to determine the best predictor of iontophoretic permeability of the local anesthetics. Results. The logarithm of the iontophoretic permeability coefficient (log PCj,iont) for local anesthetics was directly related to the log ionic mobility and MW for the free volume form of the model when other conditions are held constant. Multiple linear regressions confirmed that log PCj,iont was best defined by ionic mobility (and its determinants: conductivity, pK(a) and MW) and MW. Conclusions. Our results suggest that of the properties studied, the best predictors of iontophoretic transport of local anesthetics are ionic mobility (or pK(a)) and molecular size. These predictions are consistent with the ionic mobility pore model determined by the mobility of ions in the aqueous solution, the total current, epidermal permselectivity and other factors as defined by the model.
The N-15 natural abundance (delta N-15) of ecosystem samples reflects measures of water availability
Resumo:
We assembled a globally-derived data set for site-averaged foliar delta(15)N, the delta(15)N of whole surface mineral soil and corresponding site factors (mean annual rainfall and temperature, latitude, altitude and soil pH). The delta(15)N of whole soil was related to all of the site variables (including foliar delta(15)N) except altitude and, when regressed on latitude and rainfall, provided the best model of these data, accounting for 49% of the variation in whole soil delta(15)N. As single linear regressions, site-averaged foliar delta(15)N was more strongly related to rainfall than was whole soil delta(15)N. A smaller data set showed similar, negative correlations between whole soil delta(15)N, site-averaged foliar delta(15)N and soil moisture variations during a single growing season. The negative correlation between water availability (measured here by rainfall and temperature) and soil or plant delta(15)N fails at the landscape scale, where wet spots are delta(15)N-enriched relative to their drier surroundings. Here we present global and seasonal data, postulate a proximate mechanism for the overall relationship between water availability and ecosystem delta(15)N and, newly, a mechanism accounting for the highly delta(15)N-depleted values found in the foliage and soils of many wet/cold ecosystems. These hypotheses are complemented by documentation of the present gaps in knowledge, suggesting lines of research which will provide new insights into terrestrial N-cycling. Our conclusions are consistent with those of Austin and Vitousek (1998) that foliar (and soil) delta(15)N appear to be related to the residence time of whole ecosystem N.
Resumo:
Serious infestations of Helicoverpa punctigera are experienced yearly in the eastern cropping regions of Australia. Regression analysis was used to determine whether the size of the first generation in spring (G(1)), which is comprised mostly of immigrants from inland Australia, was related to monthly rainfall in inland winter breeding areas. Data from two long series of light-trap catches at Narrabri in New South Wales (NSW) and Turretfield in South Australia (SA) were used in the analyses. The size of G1 at Narrabri in each year was significantly regressed on the amount of rainfall in western Queensland and NSW in May and June. The size of G1 at Turretfield each year was significantly regressed on the amount of rain in May, June and July in western Queensland and NSW and also in the desert of central Western Australia. Low r(2) values of the regressions suggest that rainfall data for more sites, as well as biological and other physical factors, such as temperature, evaporation, and prevailing wind systems, may need to be included to improve forecasts of the potential magnitude of the infestations in coastal cropping regions.
Late Quaternary cycles of mangrove development and decline on the north Australian continental shelf
Resumo:
Mangrove communities in the Australian tropics presently occur as narrow belts of vegetation in estuaries and on sheltered, muddy coasts. Palynological data from continental shelf and deep-sea cores indicate a long-term cyclical component of mangrove development and decline at a regional scale, which can be linked to specific phases of late Quaternary sealevel change. Extensive mangrove development, relative to today, occurs during periods of marine transgression, whereas very diminished mangrove occurs during marine regressions and during rarer periods of relative sea-level stability. Episodes of flourishing mangrove cannot be linked to phases of humid climate, as has been suggested in studies elsewhere. Rather, the cycle of expansion and decline of mangrove communities on a grand scale is explained in terms of contrasting physiographic settings characteristic of continental-shelf coasts during transgressive and regressive phases, in particular by the existence, or lack, of well-developed tidal estuaries. Copyright (C) 1999 John Wiley & Sons, Ltd.
Resumo:
This research tested the proposition that the effect of attachment security on safer-sex practice may be mediated by communication patterns. One hundred eighty-five undergraduate students completed questionnaire measures of attachment, assertiveness, and attitudes to communication about AIDS. Eight weeks later, they reported on their practice of safer sex in the period since the first testing session. Hierarchical regressions showed that at Step 1, anxiety about relationships (a measure of insecure attachment) was associated with less safer-sex practice, for all outcome measures. Attitudes to communication about AIDS added to the prediction of general reports of safer-sex practice: in line with the mediational model, anxiety about relationships became unimportant as a predictor when communication variables were included. Communication variables failed to add to the prediction of safer sex on the most recent encounter, and both anxiety about relationships and attitudes to communication about AIDS predicted condom use. Some gender differences in patterns of prediction were noted. The results are discussed in terms of attachment style and its links with the negotiation of sexual practice and relationship issues.
Resumo:
The use of long-term forecasts of pest pressure is central to better pest management. We relate the Southern Oscillation Index (SOI) and the Sea Surface Temperature (SST) to long-term light-trap catches of the two key moth pests of Australian agriculture, Helicoverpa punctigera (Wallengren) and H. armigera (Hubner), at Narrabri, New South Wales over 11 years, and for H. punctigera only at Turretfield, South Australia over 22 years. At Narrabri, the size of the first spring generation of both species was significantly correlated with the SOI in certain months, sometimes up to 15 months before the date of trapping. Differences in the SOI and SST between significant months were used to build composite variables in multiple regressions which gave fitted values of the trap catches to less than 25% of the observed values. The regressions suggested that useful forecasts of both species could be made 6-15 months ahead. The influence of the two weather variables on trap catches of H. punctigera at Turretfield were not as strong as at Narrabri, probably because the SOI was not as strongly related to rainfall in southern Australia as it is in eastern Australia. The best fits were again given by multiple regressions with SOI plus SST variables, to within 40% of the observed values. The reliability of both variables as predictors of moth numbers may be limited by the lack of stability in the SOI-rainfall correlation over the historical record. As no other data set is available to test the regressions, they can only be tested by future use. The use of long-term forecasts in pest management is discussed, and preliminary analyses of other long sets of insect numbers suggest that the Southern Oscillation Index may be a useful predictor of insect numbers in other parts of the world.
Resumo:
1. Cluster analysis of reference sites with similar biota is the initial step in creating River Invertebrate Prediction and Classification System (RIVPACS) and similar river bioassessment models such as Australian River Assessment System (AUSRIVAS). This paper describes and tests an alternative prediction method, Assessment by Nearest Neighbour Analysis (ANNA), based on the same philosophy as RIVPACS and AUSRIVAS but without the grouping step that some people view as artificial. 2. The steps in creating ANNA models are: (i) weighting the predictor variables using a multivariate approach analogous to principal axis correlations, (ii) calculating the weighted Euclidian distance from a test site to the reference sites based on the environmental predictors, (iii) predicting the faunal composition based on the nearest reference sites and (iv) calculating an observed/expected (O/E) analogous to RIVPACS/AUSRIVAS. 3. The paper compares AUSRIVAS and ANNA models on 17 datasets representing a variety of habitats and seasons. First, it examines each model's regressions for Observed versus Expected number of taxa, including the r(2), intercept and slope. Second, the two models' assessments of 79 test sites in New Zealand are compared. Third, the models are compared on test and presumed reference sites along a known trace metal gradient. Fourth, ANNA models are evaluated for western Australia, a geographically distinct region of Australia. The comparisons demonstrate that ANNA and AUSRIVAS are generally equivalent in performance, although ANNA turns out to be potentially more robust for the O versus E regressions and is potentially more accurate on the trace metal gradient sites. 4. The ANNA method is recommended for use in bioassessment of rivers, at least for corroborating the results of the well established AUSRIVAS- and RIVPACS-type models, if not to replace them.
Resumo:
This study examined the utility of self-efficacy as a predictor of social activity and mood control in multiple sclerosis (MS). Seventy-one subjects with MS were recruited from people attending an MS centre or from a mailing list and were examined on two occasions that were two months apart. Clinic patients were more disabled than patients who completed assessments by post, but they were of higher socioeconomic status and were less dysphoric; We attempted to predict self-reported performance of mood control and social activity at two months, from self-efficacy or performance on these tasks at pretest. Demographic variables, disorder status, disability, self-esteem and depression were also allowed to compete for entry into multiple regressions. Substantial stability in mood, performance and disability was observed over the two months. In both mood control and social activity, past performance was the strongest predictor of later performance, but self-efficacy also contributed significantly to the prediction. The disability level entered a prediction of social activity; but no other variables predicted either type of performance. A secondary analysis predicting self-esteem at two months also included self-efficacy for social activity, illustrating the contribution of perceived capability to later assessments of self-worth. The study provided support for self-efficacy as a predictor of later behavioural outcomes and self-esteem in multiple sclerosis. (C) 1997 Elsevier Science Ltd.
Resumo:
Short versions of four Eysenck personality scales had been included in questionnaires given to several adult samples from the Australian Twin Registry, comprising altogether some 5400 pairs. Means and regressions with age are compared for three samples at average ages of 23, 37, and 61 years, and for two samples of retested individuals, one tested twice at average ages of 29 and 37 years, and one tested three times at average ages of 45, 56, and 62 years, For both males and females the trends for Psychoticism (P), Extraversion (E), and Neuroticism (N) were generally downward with age, and for Lie (L), upward. However, in the longitudinal sample between ages 56 and 62 the trends for P, E, and I stopped or reversed, although N continued downward. Heritabilities were reasonably stable across age for P, E, and N, and the effects of shared environments negligible, but L showed some influence of shared environment as well as genes in all but the oldest age group. (C) 2001 Elsevier Science Ltd. All rights reserved.
Resumo:
Applying programming techniques to detailed data for 406 rice farms in 21 villages, for 1997, produces inefficiency measures, which differ substantially from the results of simple yield and unit cost measures. For the Boro (dry) season, mean technical efficiency was efficiency was 56.2 per cent and 69.4 per cent, allocative efficiency was 81.3 per cent, cost efficiency was 56.2 per cent and scale efficiency 94.9 per cent. The Aman (wet) season results are similar, but a few points lower. Allocative inefficiency is due to overuse of labour, suggesting population pressure, and of fertiliser, where recommended rates may warrant revision. Second-stage regressions show that large families are more inefficient, whereas farmers with better access to input markets, and those who do less off-farm work, tend to be more efficient. The information on the sources of inter-farm performance differentials could be used by the extension agents to help inefficient farmers. There is little excuse for such sub-optimal use of survey data, which are often collected at substantial costs.
Resumo:
It is becoming increasingly clear that species of smaller body size tend to be less vulnerable to contemporary extinction threats than larger species, but few studies have examined the mechanisms underlying this pattern. In this paper, data for the Australian terrestrial mammal fauna are used to ask whether higher reproductive output or smaller home ranges can explain the reduced extinction risk of smaller species. Extinct and endangered species do indeed have smaller litters and larger home ranges for their body size than expected under a null model. In multiple regressions, however, only litter size is a significant predictor of extinction risk once body size and phylogeny are controlled for. Larger litters contribute to fast population growth, and are probably part of the reason that smaller species are less extinction-prone. The effect of litter size varies between the mesic coastal regions and the and interior of Australia, indicating that the environment a species inhabits mediates the effect of biology on extinction risk. These results suggest that predicting extinction risk from biological traits is likely to be a complex task which must consider explicitly interactions between biology and environment.
Resumo:
The Caridina indistincta complex is a group of closely related atyid shrimps that inhabit coastal freshwater streams throughout north-eastern Australia. Using mitochondrial DNA sequence data (cytochrome oxidase 1, CO1), we (1) inferred the timing of speciation in the C. indistincta group and (2) examined the intraspecific phylogeographic patterns within the group. Assuming a shrimp-specific rate of CO1 evolution, the level of sequence divergence among species suggests that speciation took place during the Miocene epoch. Within one widespread mainland species, phylogeographic patterns suggest strong geographic 'regionalisation' of mtDNA lineages that are most likely of Pleistocene origin. By contrast, another species comprises two highly divergent mtDNA lineages that occur in sympatry. We suggest that although Pleistocene sea-level regressions appear important in generating population-level phylogeographic patterns, these events were largely unimportant in the formation of species in this group.