163 resultados para missing values

em Queensland University of Technology - ePrints Archive


Relevância:

70.00% 70.00%

Publicador:

Resumo:

Obtaining attribute values of non-chosen alternatives in a revealed preference context is challenging because non-chosen alternative attributes are unobserved by choosers, chooser perceptions of attribute values may not reflect reality, existing methods for imputing these values suffer from shortcomings, and obtaining non-chosen attribute values is resource intensive. This paper presents a unique Bayesian (multiple) Imputation Multinomial Logit model that imputes unobserved travel times and distances of non-chosen travel modes based on random draws from the conditional posterior distribution of missing values. The calibrated Bayesian (multiple) Imputation Multinomial Logit model imputes non-chosen time and distance values that convincingly replicate observed choice behavior. Although network skims were used for calibration, more realistic data such as supplemental geographically referenced surveys or stated preference data may be preferred. The model is ideally suited for imputing variation in intrazonal non-chosen mode attributes and for assessing the marginal impacts of travel policies, programs, or prices within traffic analysis zones.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: The two-stage Total Laparoscopic Hysterectomy (TLH) versus Total Abdominal Hysterectomy (TAH) for stage I endometrial cancer (LACE) randomised controlled trial was initiated in 2005. The primary objective of stage 1 was to assess whether TLH results in equivalent or improved QoL up to 6 months after surgery compared to TAH. The primary objective of stage 2 was to test the hypothesis that disease-free survival at 4.5 years is equivalent for TLH and TAH. Results addressing the primary objective of stage 1 of the LACE trial are presented here. Methods: The first 361 LACE participants (TAH n= 142, TLH n=190) were enrolled in the QoL substudy at 19 centres across Australia, New Zealand and Hong Kong, and 332 completed the QoL analysis. Randomisation was performed centrally and independently from other study procedures via a computer generated, web-based system (providing concealment of the next assigned treatment) using stratified permuted blocks of 3 and 6, and assigned patients with histologically confirmed stage 1 endometrioid endometrial adenocarcinoma and ECOG performance status <2 to TLH or TAH stratified by histological grade and study centre. No blinding of patients or study personnel was attempted. QoL was measured at baseline, 1 and 4 weeks (early), and 3 and 6 months (late) after surgery using the Functional Assessment of Cancer Therapy-General (FACT-G) questionnaire. The primary endpoint was the difference between the groups in QoL change from baseline at early and late time points (a 5% difference was considered clinically significant). Analysis was performed according to the intention-to-treat principle using generalized estimating equations on differences from baseline for the early and late QoL recovery. The LACE trial is registered with clinicaltrials.gov (NCT00096408) and the Australian New Zealand Clinical Trials Registry (CTRN12606000261516). Patients for both stages of the trial have now been recruited and are being followed up for disease-specific outcomes. Findings: The proportion of missing values at the 5%, 10% 15% and 20% differences in the FACT-G scale was 6% (12/190) in the TLH and 14% (20/142) in the TAH group. There were 8/332 conversions (2.4%, 7 of which were from TLH to TAH). In the early phase of recovery, patients undergoing TLH reported significantly greater improvement of QoL from baseline compared to TAH in all subscales except the emotional and social well-being subscales. Improvements in QoL up to 6 months post-surgery continued to favour TLH except for the emotional and social well-being of the FACT and the visual analogue scale of the EuroQoL five dimensions (EuroQoL-VAS). Length of operating time was significantly longer in the TLH group (138±43 mins), than in the TAH group at (109±34 mins; p=0.001). While the proportion of intraoperative adverse events was similar between the treatment groups (TAH 8/142, 5.6%; TLH 14/190, 7.4%; p=0.55), postoperatively, twice as many patients in the TAH group experienced adverse events of CTC grade 3+ than in the TLH group (33/142, 23.2% and 22/190, 11.6%, respectively; p=0.004). Postoperative serious adverse events occurred more frequently in patients who had a TAH (27/142, 19.0%) than a TLH (15/190, 7.9%) (p=0.002). Interpretation: QoL improvements from baseline during early and later phases of recovery, and the adverse event profile significantly favour TLH compared to TAH for patients treated for Stage I endometrial cancer.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The purpose of this review is to integrate and summarize specific measurement topics (instrument and metric choice, validity, reliability, how many and what types of days, reactivity, and data treatment) appropriate to the study of youth physical activity. Research quality pedometers are necessary to aid interpretation of steps per day collected in a range of young populations under a variety of circumstances. Steps per day is the most appropriate metric choice, but steps per minute can be used to interpret time-in-intensity in specifically delimited time periods (e.g., physical education class). Reported intraclass correlations (ICC) have ranged from .65 over 2 days (although higher values also have been reported for 2 days) to .87 over 8 days (although higher values have been reported for fewer days). Reported ICCs are lower on weekend days (.59) versus weekdays (.75) and lower over vacation days (.69) versus school days (.74). There is no objective evidence of reactivity at this time. Data treatment includes (a) identifying and addressing missing values, (b) identifying outliers and reducing data appropriately if necessary, and (c) transforming the data as required in preparation for inferential analysis. As more pedometry studies in young populations are published, these preliminary methodological recommendations should be modified and refined.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In this paper, we present WebPut, a prototype system that adopts a novel web-based approach to the data imputation problem. Towards this, Webput utilizes the available information in an incomplete database in conjunction with the data consistency principle. Moreover, WebPut extends effective Information Extraction (IE) methods for the purpose of formulating web search queries that are capable of effectively retrieving missing values with high accuracy. WebPut employs a confidence-based scheme that efficiently leverages our suite of data imputation queries to automatically select the most effective imputation query for each missing value. A greedy iterative algorithm is also proposed to schedule the imputation order of the different missing values in a database, and in turn the issuing of their corresponding imputation queries, for improving the accuracy and efficiency of WebPut. Experiments based on several real-world data collections demonstrate that WebPut outperforms existing approaches.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Modern mobile computing devices are versatile, but bring the burden of constant settings adjustment according to the current conditions of the environment. While until today, this task has to be accomplished by the human user, the variety of sensors usually deployed in such a handset provides enough data for autonomous self-configuration by a learning, adaptive system. However, this data is not fully available at certain points in time, or can contain false values. Handling potentially incomplete sensor data to detect context changes without a semantic layer represents a scientific challenge which we address with our approach. A novel machine learning technique is presented - the Missing-Values-SOM - which solves this problem by predicting setting adjustments based on context information. Our method is centered around a self-organizing map, extending it to provide a means of handling missing values. We demonstrate the performance of our approach on mobile context snapshots, as well as on classical machine learning datasets.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In this paper, we present WebPut, a prototype system that adopts a novel web-based approach to the data imputation problem. Towards this, Webput utilizes the available information in an incomplete database in conjunction with the data consistency principle. Moreover, WebPut extends effective Information Extraction (IE) methods for the purpose of formulating web search queries that are capable of effectively retrieving missing values with high accuracy. WebPut employs a confidence-based scheme that efficiently leverages our suite of data imputation queries to automatically select the most effective imputation query for each missing value. A greedy iterative algorithm is proposed to schedule the imputation order of the different missing values in a database, and in turn the issuing of their corresponding imputation queries, for improving the accuracy and efficiency of WebPut. Moreover, several optimization techniques are also proposed to reduce the cost of estimating the confidence of imputation queries at both the tuple-level and the database-level. Experiments based on several real-world data collections demonstrate not only the effectiveness of WebPut compared to existing approaches, but also the efficiency of our proposed algorithms and optimization techniques.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This chapter investigates a variety of water quality assessment tools for reservoirs with balanced/unbalanced monitoring designs and focuses on providing informative water quality assessments to ensure decision-makers are able to make risk-informed management decisions about reservoir health. In particular, two water quality assessment methods are described: non-compliance (probability of the number of times the indicator exceeds the recommended guideline) and amplitude (degree of departure from the guideline). Strengths and weaknesses of current and alternative water quality methods will be discussed. The proposed methodology is particularly applicable to unbalanced designs with/without missing values and reflects the general conditions and is not swayed too heavily by the occasional extreme value (very high or very low quality). To investigate the issues in greater detail, we use as a case study, a reservoir within South-East Queensland (SEQ), Australia. The purpose here is to obtain an annual score that reflected the overall water quality, temporally, spatially and across water quality indicators for each reservoir.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In recent years, considerable research efforts have been directed to micro-array technologies and their role in providing simultaneous information on expression profiles for thousands of genes. These data, when subjected to clustering and classification procedures, can assist in identifying patterns and providing insight on biological processes. To understand the properties of complex gene expression datasets, graphical representations can be used. Intuitively, the data can be represented in terms of a bipartite graph, with weighted edges corresponding to gene-sample node couples in the dataset. Biologically meaningful subgraphs can be sought, but performance can be influenced both by the search algorithm, and, by the graph-weighting scheme and both merit rigorous investigation. In this paper, we focus on edge-weighting schemes for bipartite graphical representation of gene expression. Two novel methods are presented: the first is based on empirical evidence; the second on a geometric distribution. The schemes are compared for several real datasets, assessing efficiency of performance based on four essential properties: robustness to noise and missing values, discrimination, parameter influence on scheme efficiency and reusability. Recommendations and limitations are briefly discussed. Keywords: Edge-weighting; weighted graphs; gene expression; bi-clustering

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A new database called the World Resource Table is constructed in this study. Missing values are known to produce complications when constructing global databases. This study provides a solution for applying multiple imputation techniques and estimates the global environmental Kuznets curve (EKC) for CO2, SO2, PM10, and BOD. Policy implications for each type of emission are derived based on the results of the EKC using WRI. Finally, we predicted the future emissions trend and regional share of CO2 emissions. We found that East Asia and South Asia will be increasing their emissions share while other major CO2 emitters will still produce large shares of the total global emissions.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Objective Child maltreatment is a problem that has longer recognition in the northern hemisphere and in high-income countries. Recent work has highlighted the nearly universal nature of the problem in other countries but demonstrated the lack of comparability of studies because of the variations in definitions and measures used. The International Society for the Prevention of Child Abuse and Neglect has developed instrumentation that may be used with cross-cultural and cross-national benchmarking by local investigators. Design and sampling The instrument design began with a team of expert in Brisbane in 2004. A large bank of questions were subjected to two rounds of Delphi review to develop the fielded version of the instrument. Convenience samples included approximately 120 parent respondents with children under the age of 18 in each of six countries (697 total). Results This paper presents an instrument that measures parental behaviors directed at children and reports data from pilot work in 6 countries and 7 languages. Patterns of response revealed few missing values and distributions of responses that generally were similar in the six countries. Subscales performed well in terms of internal consistency with Cronbach's alpha in very good range (0.77–0.88) with the exception of the neglect and sex abuse subscales. Results varied by child age and gender in expected directions but with large variations among the samples. About 15% of children were shaken, 24% hit on the buttocks with an object, and 37% were spanked. Reports of choking and smothering were made by 2% of parents. Conclusion These pilot data demonstrate that the instrument is well tolerated and captures variations in, and potentially harmful forms of child discipline. Practice implications The ISPCAN Child Abuse Screening Tool – Parent Version (ICAST-P) has been developed as a survey instrument to be administered to parents for the assessment of child maltreatment in a multi-national and multi-cultural context. It was developed with broad input from international experts and subjected to Dephi review, translation, and pilot testing in six countries. The results of the Delphi study and pilot testing are presented. This study demonstrates that a single instrument can be used in a broad range of cultures and languages with low rates of missing data and moderate to high internal consistency.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Water quality data are often collected at different sites over time to improve water quality management. Water quality data usually exhibit the following characteristics: non-normal distribution, presence of outliers, missing values, values below detection limits (censored), and serial dependence. It is essential to apply appropriate statistical methodology when analyzing water quality data to draw valid conclusions and hence provide useful advice in water management. In this chapter, we will provide and demonstrate various statistical tools for analyzing such water quality data, and will also introduce how to use a statistical software R to analyze water quality data by various statistical methods. A dataset collected from the Susquehanna River Basin will be used to demonstrate various statistical methods provided in this chapter. The dataset can be downloaded from website http://www.srbc.net/programs/CBP/nutrientprogram.htm.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In an ever changing world the adults of the future will be faced with many challenges. To cope with these challenges it seems apparent that values education will need to become paramount within a child.s education. A considerable number of research studies have indicated that values education is a critical component within education (Lovat & Toomey, 2007b). Building on this research Lovat (2006) claimed that values education was the missing link in quality teaching The concept of quality teaching had risen to the fore within educational research literature in the late 20th century with the claim that it is the teacher who makes the difference in schooling (Hattie, 2004). Thus, if teachers make such a difference to student learning, achievement and well-being, then it must hold true that pre-service teacher education programmes are vital in ensuring the development of quality teachers for our schools. The gap that this current research programme addressed was to link the fields of values education, quality teaching and pre-service teacher education. This research programme aimed to determine the impact of a values-based pedagogy on the development of quality teaching dimensions within pre-service teacher education. The values-based pedagogy that was investigated in this research programme was Philosophy in the Classroom. The research programme adopted a nested case study design based on the constructivist-interpretative paradigm in examining a unit within a pre-service teacher education programme at a Queensland university. The methodology utilised was qualitative where the main source of data was via interviews. In total, 43 pre-service teachers participated in three studies in order to determine if their involvement in a unit where the focus was on introducing pre-service teachers to an explicit values-based pedagogy impacted on their knowledge, skills and confidence in terms of quality teaching dimensions. The research programme was divided into three separate studies in order to address the two research questions: 1. In what ways do pre-service teachers perceive they are being prepared to become quality teachers? 2. Is there a connection between an explicit values-based pedagogy in pre-service teacher education and the development of pre-service teachers. understanding of quality teaching? Study One provided insight into 21 pre-service teachers. understandings of quality teaching. These 21 participants had not engaged in an explicit values-based pedagogy. Study Two involved the interviewing of 22 pre-service teachers at two separate points in time . prior to exposure to a unit that employed a values-explicit pedagogy and post this subject.s lecture content delivery. Study Three reported on and analysed individual case studies of five pre-service teachers who had participated in Study Two Time 1 and Time 2, as well as a third time following their field experience where they had practice in teaching the values explicit pedagogy. The results of the research demonstrate that an explicit values-based pedagogy introduced into a teacher education programme has a positive impact on the development of pre-service teachers. understanding of quality teaching skills and knowledge. The teaching and practice of a values-based pedagogy positively impacted on pre-service teachers with increases of knowledge, skills and confidence demonstrated on the quality teaching dimensions of intellectual quality, a supportive classroom environment, recognition of difference, connectedness and values. These findings were reinforced through the comparison of pre-service teachers who had participated in the explicit values-based pedagogical approach, with a sample of pre-service teachers who had not engaged in this same values-based pedagogical approach. A solid values-based pedagogy and practice can and does enhance pre-service teachers. understanding of quality teaching. These findings surrounding the use of a values-based pedagogy in pre-service teacher education to enhance quality teaching knowledge and skills has contributed theoretically to the field of educational research, as well having practical implications for teacher education institutions and teacher educators.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objectives Demonstrate the application of decision trees – classification and regression trees (CARTs), and their cousins, boosted regression trees (BRTs) – to understand structure in missing data. Setting Data taken from employees at three different industry sites in Australia. Participants 7915 observations were included. Materials and Methods The approach was evaluated using an occupational health dataset comprising results of questionnaires, medical tests, and environmental monitoring. Statistical methods included standard statistical tests and the ‘rpart’ and ‘gbm’ packages for CART and BRT analyses, respectively, from the statistical software ‘R’. A simulation study was conducted to explore the capability of decision tree models in describing data with missingness artificially introduced. Results CART and BRT models were effective in highlighting a missingness structure in the data, related to the Type of data (medical or environmental), the site in which it was collected, the number of visits and the presence of extreme values. The simulation study revealed that CART models were able to identify variables and values responsible for inducing missingness. There was greater variation in variable importance for unstructured compared to structured missingness. Discussion Both CART and BRT models were effective in describing structural missingness in data. CART models may be preferred over BRT models for exploratory analysis of missing data, and selecting variables important for predicting missingness. BRT models can show how values of other variables influence missingness, which may prove useful for researchers. Conclusion Researchers are encouraged to use CART and BRT models to explore and understand missing data.

Relevância:

20.00% 20.00%

Publicador: