799 resultados para Utility-based performance measures
Resumo:
BACKGROUND AND PURPOSE We report on workflow and process-based performance measures and their effect on clinical outcome in Solitaire FR Thrombectomy for Acute Revascularization (STAR), a multicenter, prospective, single-arm study of Solitaire FR thrombectomy in large vessel anterior circulation stroke patients. METHODS Two hundred two patients were enrolled across 14 centers in Europe, Canada, and Australia. The following time intervals were measured: stroke onset to hospital arrival, hospital arrival to baseline imaging, baseline imaging to groin puncture, groin puncture to first stent deployment, and first stent deployment to reperfusion. Effects of time of day, general anesthesia use, and multimodal imaging on workflow were evaluated. Patient characteristics and workflow processes associated with prolonged interval times and good clinical outcome (90-day modified Rankin score, 0-2) were analyzed. RESULTS Median times were onset of stroke to hospital arrival, 123 minutes (interquartile range, 163 minutes); hospital arrival to thrombolysis in cerebral infarction (TICI) 2b/3 or final digital subtraction angiography, 133 minutes (interquartile range, 99 minutes); and baseline imaging to groin puncture, 86 minutes (interquartile range, 24 minutes). Time from baseline imaging to puncture was prolonged in patients receiving intravenous tissue-type plasminogen activator (32-minute mean delay) and when magnetic resonance-based imaging at baseline was used (18-minute mean delay). Extracranial carotid disease delayed puncture to first stent deployment time on average by 25 minutes. For each 1-hour increase in stroke onset to final digital subtraction angiography (or TICI 2b/3) time, odds of good clinical outcome decreased by 38%. CONCLUSIONS Interval times in the STAR study reflect current intra-arterial therapy for patients with acute ischemic stroke. Improving workflow metrics can further improve clinical outcome. CLINICAL TRIAL REGISTRATION: URL http://www.clinicaltrials.gov. Unique identifier: NCT01327989.
Resumo:
Includes bibliographical references (p. [107]).
Resumo:
Asset allocation decisions and value at risk calculations rely strongly on volatility estimates. Volatility measures such as rolling window, EWMA, GARCH and stochastic volatility are used in practice. GARCH and EWMA type models that incorporate the dynamic structure of volatility and are capable of forecasting future behavior of risk should perform better than constant, rolling window volatility models. For the same asset the model that is the ‘best’ according to some criterion can change from period to period. We use the reality check test∗ to verify if one model out-performs others over a class of re-sampled time-series data. The test is based on re-sampling the data using stationary bootstrapping. For each re-sample we check the ‘best’ model according to two criteria and analyze the distribution of the performance statistics. We compare constant volatility, EWMA and GARCH models using a quadratic utility function and a risk management measurement as comparison criteria. No model consistently out-performs the benchmark.
Resumo:
Poor posture control has been associated with an increased risk of falls and mobility disability among older adults. This study was conducted to assess the test-retest reliability and sensitivity to group differences regarding the time-limit (TLimit) of one-leg standing and selected balance parameters obtained with a force platform in older and young adults. A secondary purpose was to assess the relationship between TLimit and these balance parameters. Twenty-eight healthy older adults (age: 69±5years) and thirty young adults (age: 21±4years) participated in this study. Two one-leg stance tasks were performed: (1) three trials of 30s maximum and (2) one TLimit trial. The following balance parameters were computed: center of pressure area, RMS sway amplitude, and mean velocity and mean frequency in both the anterio-posterior and medio-lateral directions. All balance parameters obtained with the force platform as well as the TLimit variable were sensitive to differences in balance performance between older and young adults. The test-retest reliability of these measures was found to be acceptable (ICC: 0.40-0.85), with better ICC scores observed for mean velocity and mean frequency in the older group. Pearson correlations coefficients (r) between balance parameters and TLimit ranged from -0.16 to -0.54. These results add to the current literature that can be used in the development of measurement tools for evaluating balance in older and young adults. © 2013 Elsevier Ltd.
Resumo:
Since the seminal works of Markowitz (1952), Sharpe (1964), and Lintner (1965), numerous studies on portfolio selection and performance measure have been based upon the mean-variance framework. However, several researchers (e.g., Arditti (1967, and 1971), Samuelson (1970), and Rubinstein (1973)) argue that the higher moments cannot be neglected unless there is reason to believe that: (i) the asset returns are normally distributed and the investor's utility function is quadratic, or (ii) the empirical evidence demonstrates that higher moments are irrelevant to the investor's decision. Based on the same argument, this dissertation investigates the impact of higher moments of return distributions on three issues concerning the 14 international stock markets.^ First, the portfolio selection with skewness is determined using: the Polynomial Goal Programming in which investor preferences for skewness can be incorporated. The empirical findings suggest that the return distributions of international stock markets are not normally distributed, and that the incorporation of skewness into an investor's portfolio decision causes a major change in the construction of his optimal portfolio. The evidence also indicates that an investor will trade expected return of the portfolio for skewness. Moreover, when short sales are allowed, investors are better off as they attain higher expected return and skewness simultaneously.^ Second, the performance of international stock markets are evaluated using two types of performance measures: (i) the two-moment performance measures of Sharpe (1966), and Treynor (1965), and (ii) the higher-moment performance measures of Prakash and Bear (1986), and Stephens and Proffitt (1991). The empirical evidence indicates that higher moments of return distributions are significant and relevant to the investor's decision. Thus, the higher moment performance measures should be more appropriate to evaluate the performances of international stock markets. The evidence also indicates that various measures provide a vastly different performance ranking of the markets, albeit in the same direction.^ Finally, the inter-temporal stability of the international stock markets is investigated using the Parhizgari and Prakash (1989) algorithm for the Sen and Puri (1968) test which accounts for non-normality of return distributions. The empirical finding indicates that there is strong evidence to support the stability in international stock market movements. However, when the Anderson test which assumes normality of return distributions is employed, the stability in the correlation structure is rejected. This suggests that the non-normality of the return distribution is an important factor that cannot be ignored in the investigation of inter-temporal stability of international stock markets. ^
Resumo:
This paper will investigate the suitability of existing performance measures under the assumption of a clearly defined benchmark. A range of measures are examined including the Sortino Ratio, the Sharpe Selection ratio (SSR), the Student’s t-test and a decay rate measure. A simulation study is used to assess the power and bias of these measures based on variations in sample size and mean performance of two simulated funds. The Sortino Ratio is found to be the superior performance measure exhibiting more power and less bias than the SSR when the distribution of excess returns are skewed.
Resumo:
Performance evaluation of object tracking systems is typically performed after the data has been processed, by comparing tracking results to ground truth. Whilst this approach is fine when performing offline testing, it does not allow for real-time analysis of the systems performance, which may be of use for live systems to either automatically tune the system or report reliability. In this paper, we propose three metrics that can be used to dynamically asses the performance of an object tracking system. Outputs and results from various stages in the tracking system are used to obtain measures that indicate the performance of motion segmentation, object detection and object matching. The proposed dynamic metrics are shown to accurately indicate tracking errors when visually comparing metric results to tracking output, and are shown to display similar trends to the ETISEO metrics when comparing different tracking configurations.
Resumo:
Internationally, marine biodiversity conservation objectives are having an increasing influence on the management of commercial fisheries. While this is largely being implemented through Marine Protected Areas (MPAs) other management measures, such as market based instruments (MBIs), have proved to be effective at managing target species catch in fisheries and reducing environmental impacts in industries such as mining and tourism. Market-based management measures aim to mitigate the impacts of activities by better aligning the incentives their participants face with the objectives of management, changing their behavior as a consequence. In this paper, we review the potential of MBIs as management tools to mitigate undesirable environmental impacts associated with commercial fishing. Where they exist, examples of previous applications are described and the factors that influence their applicability and effectiveness are discussed. Several fishing methods and impacts are considered and suggest that whilst no single approach is most appropriate in all circumstances either replacing or complementing existing management arrangements with MBIs has the potential to improve environmental performance. This has a number of implications. From the environmental perspective they should enable levels of undesirable impacts such as damage to sensitive habitat or the bycatch of protected species of turtles, marine mammals, and seabirds to be reduced. The increased flexibility MBIs allow industry when developing solutions also has the potential to reduce costs to both the industry and managers, improving the cost-effectiveness of regulation as a result. Further, in the increasingly relevant case of MPAs the need for publicly funded compensation, often paid to industry when vessels are excluded from grounds, may also be significantly reduced if improved environmental performance makes it possible for some industry members to continue operating.
Resumo:
Recently there has been an increasing interest in the development of new methods using Pareto optimality to deal with multi-objective criteria (for example, accuracy and architectural complexity). Once one has learned a model based on their devised method, the problem is then how to compare it with the state of art. In machine learning, algorithms are typically evaluated by comparing their performance on different data sets by means of statistical tests. Unfortunately, the standard tests used for this purpose are not able to jointly consider performance measures. The aim of this paper is to resolve this issue by developing statistical procedures that are able to account for multiple competing measures at the same time. In particular, we develop two tests: a frequentist procedure based on the generalized likelihood-ratio test and a Bayesian procedure based on a multinomial-Dirichlet conjugate model. We further extend them by discovering conditional independences among measures to reduce the number of parameter of such models, as usually the number of studied cases is very reduced in such comparisons. Real data from a comparison among general purpose classifiers is used to show a practical application of our tests.
Resumo:
Many different performance measures have been developed to evaluate field predictions in meteorology. However, a researcher or practitioner encountering a new or unfamiliar measure may have difficulty in interpreting its results, which may lead to them avoiding new measures and relying on those that are familiar. In the context of evaluating forecasts of extreme events for hydrological applications, this article aims to promote the use of a range of performance measures. Some of the types of performance measures that are introduced in order to demonstrate a six-step approach to tackle a new measure. Using the example of the European Centre for Medium-Range Weather Forecasts (ECMWF) ensemble precipitation predictions for the Danube floods of July and August 2002, to show how to use new performance measures with this approach and the way to choose between different performance measures based on their suitability for the task at hand is shown. Copyright © 2008 Royal Meteorological Society
Resumo:
There is increasing recognition that agricultural landscapes meet multiple societal needs and demands beyond provision of economic and environmental goods and services. Accordingly, there have been significant calls for the inclusion of societal, amenity and cultural values in agri-environmental landscape indicators to assist policy makers in monitoring the wider impacts of land-based policies. However, capturing the amenity and cultural values that rural agrarian areas provide, by use of such indicators, presents significant challenges. The EU social awareness of landscape indicator represents a new class of generalized social indicator using a top-down methodology to capture the social dimensions of landscape without reference to the specific structural and cultural characteristics of individual landscapes. This paper reviews this indicator in the context of existing agri-environmental indicators and their differing design concepts. Using a stakeholder consultation approach in five case study regions, the potential and limitations of the indicator are evaluated, with a particular focus on its perceived meaning, utility and performance in the context of different user groups and at different geographical scales. This analysis supplements previous EU-wide assessments, through regional scale assessment of the limitations and potentialities of the indicator and the need for further data collection. The evaluation finds that the perceived meaning of the indicator does not vary with scale, but in common with all mapped indicators, the usefulness of the indicator, to different user groups, does change with scale of presentation. This indicator is viewed as most useful when presented at the scale of governance at which end users operate. The relevance of the different sub-components of the indicator are also found to vary across regions.
Resumo:
Sensitivity and specificity are measures that allow us to evaluate the performance of a diagnostic test. In practice, it is common to have situations where a proportion of selected individuals cannot have the real state of the disease verified, since the verification could be an invasive procedure, as occurs with biopsy. This happens, as a special case, in the diagnosis of prostate cancer, or in any other situation related to risks, that is, not practicable, nor ethical, or in situations with high cost. For this case, it is common to use diagnostic tests based only on the information of verified individuals. This procedure can lead to biased results or workup bias. In this paper, we introduce a Bayesian approach to estimate the sensitivity and the specificity for two diagnostic tests considering verified and unverified individuals, a result that generalizes the usual situation based on only one diagnostic test.
Resumo:
Current water management practices in South Florida have negatively impacted many species inhabiting Florida Bay. Variable and high salinity has been identified as a key stressor in these estuaries. The Comprehensive Everglades Restoration Plan (CERP) includes water redistribution projects that will restore natural freshwater flows to northeastern Florida Bay. My studies focused on the following central theme and hypotheses: Biological performance measures (i.e., growth, reproduction, survival), behavior (i.e., habitat preference and locomotor behavior) and diversity of estuarine fish will be controlled by changes in salinity and water quality that will occur as a result of the restoration of freshwater flow to the bay. A series of acute and subchronic physiological toxicity studies were conducted to determine the effects of salinity changes on the life stages (embryo/larval, juvenile, adult) and fecundity of four native estuarine fish (Cyprinodon variegatus, Floridichthys carpio, Poecilia latipinna, and Gambusia holbrooki). Fish were exposed to a range of salinity concentrations (freshwater to hypersaline) based on salinity profiles in the study areas. Growth (length, weight) and survival were measured. Salinity trials included both rapid and gradual change events. Results show negative effects of acute, abrupt salinity changes on fish survival, development and reproductive success as a result of salinity stress. Other studies targeted reproduction and critical embryo-larval/neonate development as key areas for detecting long-term population effects of salinity change in Florida Bay. Adults of C. variegates and P. latipinna were also examined for behavioral responses to pulsed salinity changes. These responses include changes in swimming performance, locomotor behavior and zone preference. Finally, an ecological risk assessment was conducted for adverse salinity conditions in northeastern Florida Bay. Using the U.S. EPA's framework, the risk to estuarine fish species diversity was assessed against regional salinity profiles from a 17-year database. Based on the risk assessment, target salinity profiles for these areas are recommended for managers.^
Resumo:
Current water management practices in South Florida have negatively impacted many species inhabiting Florida Bay. Variable and high salinity has been identified as a key stressor in these estuaries. The comprehensive Everglades Restoration Plan (CERP) includes water redistribution projects that will restore natural freshwater flows to northeastern Florida Bay. My studies focused on the following central theme and hypotheses: Biological performance measures (i.e., growth, reproduction, survival), behavior (i.e., habitat preference and locomotor behavior) and diversity of estuarine fish will be controlled by changes in salinity and water quality that will occur as a result of the restoration of freshwater flow to the bay. A series of acute and subchronic physiological toxicity studies were conducted to determine the effects of salinity changes on the life stages (embryo/larval, juvenile, adult) and fecundity of four native estuarine fish (Cyprinodon variegatus, Floridichthys carpio, Poecilia latipinna, and Gambusia holbrooki). Fishe were exposed to a range of salinity concentrations (freshwater to hypersaline) based on salinity profiles in the study areas. Growth (length, weight) and survival were measured. Salinity trials included both rapid and gradual change events. Results show negative effects of acute, abrupt salinity changes on fish survival, development and reproductive success as a result of salinity stress. Other studies targeted reproduction and critical embryo-larval/neonate development as key areas for detecting long-term population effects of salinity change in Florida Bay. Adults of C. variegatus and P. latipinna were also examined for behavioral responses to pulsed salinity changes. These responses include changes in swimming performance, locomotor behavior and zone preference. Finally, an ecological risk assessment was conducted for adverse salinity conditions in northeastern Florida Bay. Using the U.S. EPA's framework, the risk to estuarine fish species diversity was assessed against regional salinity profiles from a 17-year database. Based on the risk assessment, target salinity profiles for these areas are recommended for managers.