62 resultados para kernel estimates


Relevância:

20.00% 20.00%

Publicador:

Resumo:

In discrete choice experiments respondents are generally assumed to consider all of the attributes across each of the alternatives, and to choose their most preferred. However, results in this paper indicate that many respondents employ simplified lexicographic decision-making rules, whereby they have a ranking of the attributes, but their choice of an alternative is based solely on the level of their most important attribute(s). Not accounting for these simple decision-making heuristics introduces systemic errors and leads to biased point estimates, as they are a violation of the continuity axiom and a departure from the use of compensatory decision-making. In this paper the implications of lexicographic preferences are examined. In particular, using a mixed logit specification this paper investigates the sensitivity of individual-specific willingness to pay (WTP) estimates conditional on whether lexicographic decision-making rules are accounted for in the modelling of discrete choice responses. Empirical results are obtained from a discrete choice experiment that was carried out to address the value of a number of rural landscape attributes in Ireland

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes the methodology, results and limitations of the 2013 International Diabetes Federation (IDF) Atlas (6th edition) estimates of the worldwide numbers of prevalent cases of type 1 diabetes in children (<15 years). The majority of relevant information in the published literature is in the form of incidence rates derived from registers of newly diagnosed cases. Studies were graded on quality criteria and, if no information was available in the published literature, extrapolation was used to assign a country the rate from an adjacent country with similar characteristics. Prevalence rates were then derived from these incidence rates and applied to United Nations 2012 Revision population estimates for 2013 for each country to obtain estimates of the number of prevalent cases. Data availability was highest for the countries in Europe (76%) and lowest for the countries in sub-Saharan Africa (8%). The prevalence estimates indicate that there are almost 500,000 children aged under 15 years with type 1 diabetes worldwide, the largest numbers being in Europe (129,000) and North America (108,700). Countries with the highest estimated numbers of new cases annually were the United States (13,000), India (10,900) and Brazil (5000). Compared with the prevalence estimates made in previous editions of the IDF Diabetes Atlas, the numbers have increased in most of the IDF Regions, often reflecting the incidence rate increases that have been well-documented in many countries. Monogenic diabetes is increasingly being recognised among those with clinical features of type 1 or type 2 diabetes as genetic studies become available, but population-based data on incidence and prevalence show wide variation due to lack of standardisation in the studies. Similarly, studies on type 2 diabetes in childhood suggest increased incidence and prevalence in many countries, especially in Indigenous peoples and ethnic minorities, but detailed population-based studies remain limited.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, we consider the variable selection problem for a nonlinear non-parametric system. Two approaches are proposed, one top-down approach and one bottom-up approach. The top-down algorithm selects a variable by detecting if the corresponding partial derivative is zero or not at the point of interest. The algorithm is shown to have not only the parameter but also the set convergence. This is critical because the variable selection problem is binary, a variable is either selected or not selected. The bottom-up approach is based on the forward/backward stepwise selection which is designed to work if the data length is limited. Both approaches determine the most important variables locally and allow the unknown non-parametric nonlinear system to have different local dimensions at different points of interest. Further, two potential applications along with numerical simulations are provided to illustrate the usefulness of the proposed algorithms.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Selection bias in HIV prevalence estimates occurs if non-participation in testing is correlated with HIV status. Longitudinal data suggests that individuals who know or suspect they are HIV positive are less likely to participate in testing in HIV surveys, in which case methods to correct for missing data which are based on imputation and observed characteristics will produce biased results. Methods: The identity of the HIV survey interviewer is typically associated with HIV testing participation, but is unlikely to be correlated with HIV status. Interviewer identity can thus be used as a selection variable allowing estimation of Heckman-type selection models. These models produce asymptotically unbiased HIV prevalence estimates, even when non-participation is correlated with unobserved characteristics, such as knowledge of HIV status. We introduce a new random effects method to these selection models which overcomes non-convergence caused by collinearity, small sample bias, and incorrect inference in existing approaches. Our method is easy to implement in standard statistical software, and allows the construction of bootstrapped standard errors which adjust for the fact that the relationship between testing and HIV status is uncertain and needs to be estimated. Results: Using nationally representative data from the Demographic and Health Surveys, we illustrate our approach with new point estimates and confidence intervals (CI) for HIV prevalence among men in Ghana (2003) and Zambia (2007). In Ghana, we find little evidence of selection bias as our selection model gives an HIV prevalence estimate of 1.4% (95% CI 1.2% – 1.6%), compared to 1.6% among those with a valid HIV test. In Zambia, our selection model gives an HIV prevalence estimate of 16.3% (95% CI 11.0% - 18.4%), compared to 12.1% among those with a valid HIV test. Therefore, those who decline to test in Zambia are found to be more likely to be HIV positive. Conclusions: Our approach corrects for selection bias in HIV prevalence estimates, is possible to implement even when HIV prevalence or non-participation is very high or very low, and provides a practical solution to account for both sampling and parameter uncertainty in the estimation of confidence intervals. The wide confidence intervals estimated in an example with high HIV prevalence indicate that it is difficult to correct statistically for the bias that may occur when a large proportion of people refuse to test.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This study provides estimates of the macroeconomic impact of non-communicable diseases (NCDs) inChina and India for the period 2012–2030. Our estimates are derived using the World Health Organization’sEPIC model of economic growth, which focuses on the negative effects of NCDs on labor supply andcapital accumulation. We present results for the five main NCDs (cardiovascular disease, cancer, chronicrespiratory disease, diabetes, and mental health). Our undiscounted estimates indicate that the cost ofthe five main NCDs will total USD 23.03 trillion for China and USD 4.58 trillion for India (in 2010 USD).For both countries, the most costly domain is cardiovascular disease. Our analyses also reveal that thecosts are much larger in China than in India mainly because of China’s higher and steeper income trajectory,and to a lesser extent its older population. Rough calculations also indicate that WHO’s best buys foraddressing the challenge of NCDs are highly cost-beneficial

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Introduction: HIV testing is a cornerstone of efforts to combat the HIV epidemic, and testing conducted as part of surveillance provides invaluable data on the spread of infection and the effectiveness of campaigns to reduce the transmission of HIV. However, participation in HIV testing can be low, and if respondents systematically select not to be tested because they know or suspect they are HIV positive (and fear disclosure), standard approaches to deal with missing data will fail to remove selection bias. We implemented Heckman-type selection models, which can be used to adjust for missing data that are not missing at random, and established the extent of selection bias in a population-based HIV survey in an HIV hyperendemic community in rural South Africa.

Methods: We used data from a population-based HIV survey carried out in 2009 in rural KwaZulu-Natal, South Africa. In this survey, 5565 women (35%) and 2567 men (27%) provided blood for an HIV test. We accounted for missing data using interviewer identity as a selection variable which predicted consent to HIV testing but was unlikely to be independently associated with HIV status. Our approach involved using this selection variable to examine the HIV status of residents who would ordinarily refuse to test, except that they were allocated a persuasive interviewer. Our copula model allows for flexibility when modelling the dependence structure between HIV survey participation and HIV status.

Results: For women, our selection model generated an HIV prevalence estimate of 33% (95% CI 27–40) for all people eligible to consent to HIV testing in the survey. This estimate is higher than the estimate of 24% generated when only information from respondents who participated in testing is used in the analysis, and the estimate of 27% when imputation analysis is used to predict missing data on HIV status. For men, we found an HIV prevalence of 25% (95% CI 15–35) using the selection model, compared to 16% among those who participated in testing, and 18% estimated with imputation. We provide new confidence intervals that correct for the fact that the relationship between testing and HIV status is unknown and requires estimation.

Conclusions: We confirm the feasibility and value of adopting selection models to account for missing data in population-based HIV surveys and surveillance systems. Elements of survey design, such as interviewer identity, present the opportunity to adopt this approach in routine applications. Where non-participation is high, true confidence intervals are much wider than those generated by standard approaches to dealing with missing data suggest.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In many applications, and especially those where batch processes are involved, a target scalar output of interest is often dependent on one or more time series of data. With the exponential growth in data logging in modern industries such time series are increasingly available for statistical modeling in soft sensing applications. In order to exploit time series data for predictive modelling, it is necessary to summarise the information they contain as a set of features to use as model regressors. Typically this is done in an unsupervised fashion using simple techniques such as computing statistical moments, principal components or wavelet decompositions, often leading to significant information loss and hence suboptimal predictive models. In this paper, a functional learning paradigm is exploited in a supervised fashion to derive continuous, smooth estimates of time series data (yielding aggregated local information), while simultaneously estimating a continuous shape function yielding optimal predictions. The proposed Supervised Aggregative Feature Extraction (SAFE) methodology can be extended to support nonlinear predictive models by embedding the functional learning framework in a Reproducing Kernel Hilbert Spaces setting. SAFE has a number of attractive features including closed form solution and the ability to explicitly incorporate first and second order derivative information. Using simulation studies and a practical semiconductor manufacturing case study we highlight the strengths of the new methodology with respect to standard unsupervised feature extraction approaches.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, a novel and effective lip-based biometric identification approach with the Discrete Hidden Markov Model Kernel (DHMMK) is developed. Lips are described by shape features (both geometrical and sequential) on two different grid layouts: rectangular and polar. These features are then specifically modeled by a DHMMK, and learnt by a support vector machine classifier. Our experiments are carried out in a ten-fold cross validation fashion on three different datasets, GPDS-ULPGC Face Dataset, PIE Face Dataset and RaFD Face Dataset. Results show that our approach has achieved an average classification accuracy of 99.8%, 97.13%, and 98.10%, using only two training images per class, on these three datasets, respectively. Our comparative studies further show that the DHMMK achieved a 53% improvement against the baseline HMM approach. The comparative ROC curves also confirm the efficacy of the proposed lip contour based biometrics learned by DHMMK. We also show that the performance of linear and RBF SVM is comparable under the frame work of DHMMK.