936 resultados para leave one out cross validation


Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents the first version of EmotiBlog, an annotation scheme for emotions in non-traditional textual genres such as blogs or forums. We collected a corpus composed by blog posts in three languages: English, Spanish and Italian and about three topics of interest. Subsequently, we annotated our collection and carried out the inter-annotator agreement and a ten-fold cross-validation evaluation, obtaining promising results. The main aim of this research is to provide a finer-grained annotation scheme and annotated data that are essential to perform evaluation focused on checking the quality of the created resources.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The assessment of attitudes toward school with the objective of identifying adolescents who may be at risk of underachievement has become an important area of research in educational psychology, although few specific tools for their evaluation have been designed to date. One of the instruments available is the School Attitude Assessment Survey-Revised (SAAS-R). Method: The objective of the current research is to test the construct validity and to analyze the psychometric properties of the Spanish version of the SAAS-R. Data were collected from 1,398 students attending different high schools. Students completed the SAAS-R along with measures of the g factor, and academic achievement was obtained from school records. Results: Confirmatory factor analysis, multivariate analysis of variance and analysis of variance tests supported the validity evidence. Conclusions: The results indicate that the Spanish version of the SAAS-R is a useful measure that contributes to identification of underachieving students. Lastly, the results obtained and their implications for education are discussed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: There is a recognized need to move from mortality to morbidity outcome predictions following traumatic injury. However, there are few morbidity outcome prediction scoring methods and these fail to incorporate important comorbidities or cofactors. This study aims to develop and evaluate a method that includes such variables. Methods: This was a consecutive case series registered in the Queensland Trauma Registry that consented to a prospective 12-month telephone conducted follow-up study. A multivariable statistical model was developed relating Trauma Registry data to trichotomized 12-month post-injury outcome (categories: no limitations, minor limitations and major limitations). Cross-validation techniques using successive single hold-out samples were then conducted to evaluate the model's predictive capabilities. Results: In total, 619 participated, with 337 (54%) experiencing no limitations, 101 (16%) experiencing minor limitations and 181 (29%) experiencing major limitations 12 months after injury. The final parsimonious multivariable statistical model included whether the injury was in the lower extremity body region, injury severity, age, length of hospital stay, pulse at admission and whether the participant was admitted to an intensive care unit. This model explained 21% of the variability in post-injury outcome. Predictively, 64% of those with no limitations, 18% of those with minor limitations and 37% of those with major limitations were correctly identified. Conclusion: Although carefully developed, this statistical model lacks the predictive power necessary for its use as a basis of a useful prognostic tool. Further research is required to identify variables other than those routinely used in the Trauma Registry to develop a model with the necessary predictive utility.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Published birthweight references in Australia do not fully take into account constitutional factors that influence birthweight and therefore may not provide an accurate reference to identify the infant with abnormal growth. Furthermore, studies in other regions that have derived adjusted (customised) birthweight references have applied untested assumptions in the statistical modelling. Aims: To validate the customised birthweight model and to produce a reference set of coefficients for estimating a customised birthweight that may be useful for maternity care in Australia and for future research. Methods: De-identified data were extracted from the clinical database for all births at the Mater Mother's Hospital, Brisbane, Australia, between January 1997 and June 2005. Births with missing data for the variables under study were excluded. In addition the following were excluded: multiple pregnancies, births less than 37 completed week's gestation, stillbirths, and major congenital abnormalities. Multivariate analysis was undertaken. A double cross-validation procedure was used to validate the model. Results: The study of 42 206 births demonstrated that, for statistical purposes, birthweight is normally distributed. Coefficients for the derivation of customised birthweight in an Australian population were developed and the statistical model is demonstrably robust. Conclusions: This study provides empirical data as to the robustness of the model to determine customised birthweight. Further research is required to define where normal physiology ends and pathology begins, and which segments of the population should be included in the construction of a customised birthweight standard.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Calculating the potentials on the heart’s epicardial surface from the body surface potentials constitutes one form of inverse problems in electrocardiography (ECG). Since these problems are ill-posed, one approach is to use zero-order Tikhonov regularization, where the squared norms of both the residual and the solution are minimized, with a relative weight determined by the regularization parameter. In this paper, we used three different methods to choose the regularization parameter in the inverse solutions of ECG. The three methods include the L-curve, the generalized cross validation (GCV) and the discrepancy principle (DP). Among them, the GCV method has received less attention in solutions to ECG inverse problems than the other methods. Since the DP approach needs knowledge of norm of noises, we used a model function to estimate the noise. The performance of various methods was compared using a concentric sphere model and a real geometry heart-torso model with a distribution of current dipoles placed inside the heart model as the source. Gaussian measurement noises were added to the body surface potentials. The results show that the three methods all produce good inverse solutions with little noise; but, as the noise increases, the DP approach produces better results than the L-curve and GCV methods, particularly in the real geometry model. Both the GCV and L-curve methods perform well in low to medium noise situations.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This project explored how consumers in emerging economies evaluate brand extension by using China as a case. Two separate but related studies were conducted, and university students were used as respondents in both the studies. Study one or replication study tested Aaker and Keller's brand extension model in China. Assuming similar methods to Aaker and Keller's, six well-recognised brands were chosen as parent brand and each was extended to three product categories. Totally, 469 respondents completed the survey questionnaire. As each was to evaluate six extensions, this made the cases 2814. The data was analysed using Optimal Least Square regression approach and "residual centred" approach respectively. The result confirmed most of the findings observed in developed countries. Specifically, consumer's attitude towards the extension is primarily driven by the brand affect, the fit between the two product categories, the difficulty of making the extension and moderated via the interactions between the brand affect and the fit variables. Study two refined and extended Aaker and Keller's model by adding new variables and making methodological adjustments. The same stimuli and data analysis techniques as those in the replication were employed. 252 respondents participated in the survey and each evaluated six extensions, making cases 1512. In addition to re-verifying the findings of the replication and providing cross validation to these findings, the extended study found that the image consistency between the parent brand and the extension, the competition intensity of the extension product market were important in determining the success of the extension. Further, consumer differed in evaluating durable extensions and non-durable extensions. The thesis detailed the two studies above, and discussed the findings and their implications by relating to branding literature, to the general situation of the emerging economies as well as the reality of China. It also presented the limitations of the research and the future research directions.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This research evaluates pattern recognition techniques on a subclass of big data where the dimensionality of the input space (p) is much larger than the number of observations (n). Specifically, we evaluate massive gene expression microarray cancer data where the ratio κ is less than one. We explore the statistical and computational challenges inherent in these high dimensional low sample size (HDLSS) problems and present statistical machine learning methods used to tackle and circumvent these difficulties. Regularization and kernel algorithms were explored in this research using seven datasets where κ < 1. These techniques require special attention to tuning necessitating several extensions of cross-validation to be investigated to support better predictive performance. While no single algorithm was universally the best predictor, the regularization technique produced lower test errors in five of the seven datasets studied.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background. In pre-school and primary education pupils differ in many abilities and competences (‘giftedness’). Yet mainstream educational practice seems rather homogeneous in providing age-based or grade-class subject matter approaches. Aims. To clarify whether pupils scoring initially at high ability level do develop and attain differently at school with respect to language and arithmetic compared with pupils displaying other initial ability levels. To investigate whether specific individual, family or educational variables co-vary with the attainment of these different types of pupils in school. Samples. Data from the large-scale PRIMA cohort study including a total of 8258 grade 2 and 4 pupils from 438 primary schools in The Netherlands. Methods. Secondary analyses were carried out to construct gain scores for both language and arithmetic proficiency and a number of behavioural, attitudinal, family and educational characteristics. The pupils were grouped into different ability categories (highly able; able; above average; average and below). Further analyses used Pearson correlations and analyses of variance both between and within ability categories. Cross-validation was done by introducing a cohort of younger pupils in pre-school and grouping both cohorts into decile groups based on initial ability in language and arithmetic. Results. Highly able pupils generally decreased in attainment in both language and arithmetic, whereas pupils in average and below average groups improved their language and arithmetic scores. Only with highly able pupils were some educational characteristics correlated with the pupils’ development in achievement, behaviour and attitudes. Conclusions. Pre-school and primary education should better match pupils’ differences in abilities and competences from their start in pre-school to improve their functioning, learning processes and outcomes. Recommendations for educational improvement strategies are presented in closing.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Purpose: The purpose of this study is to examine the effectiveness of the comply-or-explain principle in Sweden to determine if the flexible approach is functioning as in-tended. Research design: This paper scrutinizes the quality of the explanations with respect to the Swedish Corporate Governance Code. A quantitative research with a cross-sectional design has been performed and the data collection covers 241 companies listed on Nasdaq OMX Stockholm for the fiscal year of 2014. The secondary data has been gathered from corporate governance reports of the researched companies and analysed by using a tax-onomy of explanations. Findings: The report demonstrates that the comply-or-explain principle in Sweden is effective. A clear majority of the explanations, 71,8%, were deemed as informative, mean-ing that a large proportion of the Swedish firms are utilizing the flexible approach in an effective manner. However, one out of four explanations were classified as insufficient and we have thus provided recommendations in order for the code to become even more effective. Contribution: Our findings provide insights on how the comply-or-explain principle works in a country that is supposed to be a leading example of how the comply-or-explain approach should be implemented. This study should be of significance for policy makers considering that we have outlined how the principle works and provided recommenda-tions on how the Swedish Corporate Governance Code can be improved. Value: Our findings demonstrate that companies listed on Nasdaq OMX Stockholm pro-vide high quality explanations that can serve as an inspiration for companies listed in other countries. Furthermore, the results indicate that managers are likely to act within ethically desired norm. Considering the social implications, as Swedish firms are informative in terms of explanations, it minimizes the risk of firms acting dishonestly.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This work analyses a study on natural ventilation and its relation to the urban legislation versus the building types in an urban fraction of coastal area of Praia do Meio in the city of Natal/RN, approaching the type or types of land use most appropriate to this limited urban fraction. The objective of this study is to analyse the effects of the present legislation as well as the types of buildings in this area on the natural ventilation. This urban fraction was selected because it is one of the sites from where the wind flows into the city of Natal. This research is based on the hypothesis stating that the reduction on the porosity of the urban soil (decrease in the set back/boundary clearance), and an increase in the form (height of the buildings) rise the level of the ventilation gradient, consequently causing a reduction on the wind speed at the lowest part of the buildings. Three-dimensional computational models were used to produce the modes of occupation allowed in the urban fraction within the area under study. A Computational Fluid Dynamics (CFD) software was also used to analyse the modes of land occupation. Following simulation, a statistical assessment was carried out for validation of the hypothesis. It was concluded that the reduction in the soil porosity as a consequence of the rates that defined the minimum boundary clearance between the building and the boundary of the plot (and consequently the set back), as well as the increase in the building form (height of the buildings) caused a reduction in the wind speed, thus creating heat islands

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Ce mémoire s’intéresse à l’étude du critère de validation croisée pour le choix des modèles relatifs aux petits domaines. L’étude est limitée aux modèles de petits domaines au niveau des unités. Le modèle de base des petits domaines est introduit par Battese, Harter et Fuller en 1988. C’est un modèle de régression linéaire mixte avec une ordonnée à l’origine aléatoire. Il se compose d’un certain nombre de paramètres : le paramètre β de la partie fixe, la composante aléatoire et les variances relatives à l’erreur résiduelle. Le modèle de Battese et al. est utilisé pour prédire, lors d’une enquête, la moyenne d’une variable d’intérêt y dans chaque petit domaine en utilisant une variable auxiliaire administrative x connue sur toute la population. La méthode d’estimation consiste à utiliser une distribution normale, pour modéliser la composante résiduelle du modèle. La considération d’une dépendance résiduelle générale, c’est-à-dire autre que la loi normale donne une méthodologie plus flexible. Cette généralisation conduit à une nouvelle classe de modèles échangeables. En effet, la généralisation se situe au niveau de la modélisation de la dépendance résiduelle qui peut être soit normale (c’est le cas du modèle de Battese et al.) ou non-normale. L’objectif est de déterminer les paramètres propres aux petits domaines avec le plus de précision possible. Cet enjeu est lié au choix de la bonne dépendance résiduelle à utiliser dans le modèle. Le critère de validation croisée sera étudié à cet effet.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Between 1961-1971 vitamin D deficiency was recognized as a public health issue in the UK, because of the lack of effective sunlight and the population mix [1, 2]. In recent years, health care professionals have cited evidence suggesting a re-emergence of the vitamin D deficiency linked to a number of health consequences as a concern [3-6]. Evidence from observational studies has linked low vitamin D status with impairment in glucose homeostasis and immune dysfunction [7-9]. However, interventional studies, particularly those focused on paediatric populations, have been limited and inconsistent. There is a need for detailed studies, to clarify the therapeutic benefits of vitamin D in these important clinical areas. Objective: The aims of this PhD thesis were two-fold. Firstly, to perform preliminary work assessing the association between vitamin D deficiency and bone status, glucose homeostasis and immune function, and to explore any changes in these parameters following short term vitamin D3 replacement therapy. Secondly, to assess the effectiveness of an electronic surveillance system (ScotPSU) as a tool to determine the current incidence of hospital-based presentation of childhood vitamin D deficiency in Scotland. Methods: Active surveillance was performed for a period of two years as a part of an electronic web-based surveillance programme performed by the Scottish Paediatric Surveillance Unit (ScotPSU). The validity of the system was assessed by identifying cases with profound vitamin D deficiency (in Glasgow and Edinburgh) from the regional laboratory. All clinical details were checked against those identified using the surveillance system. Thirty-seven children aged 3 months to 10 years, who had been diagnosed with vitamin D deficiency, were recruited for the bone, glucose and immunity studies over a period of 24 months. Twenty-five samples were analysed for the glucose and bone studies; of these, 18 samples were further analysed for immune study. Treatment consisted of six weeks taking 5000 IU units cholecalciferol orally once a day. At baseline and after completion of treatment, 25 hydroxyvitamin D (25(OH)D), parathyroid hormone (PTH), alkaline phosphatase (ALP), collagen type 1 cross-linked C-telopeptide (CTX), osteocalcin (OCN), calcium, phosphate, insulin, glucose, homeostasis model assessment index, estimated insulin resistance (HOMA IR), glycated hemoglobin (HbA1c), sex hormone binding globulin (SHBG), lipids profiles, T helper 1 (Th1) cytokines (interleukin-2 ( IL-2), tumor necrosis factors-alpha (TNF-α), interferon-gamma (INF-γ)), T helper 2 (Th2) cytokines (interleukin-4 (IL-4), interleukin-5 (IL-5), interleukin-6 (IL-6)), T helper 17 (Th17) cytokine (interleukin-17 (IL-17)), Regulatory T (Treg) cytokine (interleukin-10 (IL-10)) and chemokines/cytokines, linked with Th1/Th2 subset balance and/or differentiation (interleukin-8 (IL-8), interleukin-12 (IL-12), eosinophil chemotactic protein ( EOTAXIN), macrophage inflammatory proteins-1beta (MIP-1β), interferon-gamma-induced protein-10 (IP-10), regulated on activation, normal T cell expressed and secreted (RANTES), monocyte chemoattractant protein-1(MCP-1)) were measured. Leukoocyte subset analysis was performed for T cells, B cells and T regulatory cells and a luminex assay was used to measure the cytokiens. Results: Between September 2009 and August 2011, 163 cases of vitamin D deficiency were brought to the attention of the ScotPSU, and the majority of cases (n = 82) were reported in Glasgow. The cross-validation checking in Glasgow and Edinburgh over a one-year period revealed only 3 (11%) cases of clearly symptomatic vitamin D deficiency, which had been missed by the ScotPSU survey in Glasgow. While 16 (67%) symptomatic cases had failed to be reported through the ScotPSU survey in Edinburgh. For the 23 children who are included in bone and glucose studies, 22 (96%) children had basal serum 25(OH)D in the deficiency range (< 50 nmol/l) and one (4%) child had serum 25(OH)D in the insufficiency range (51-75 nmol/l). Following vitamin D3 treatment, 2 (9%) children had final serum 25(OH)D lower than 50 nmol/l, 6 (26%) children had final serum 25(OH)D between >50-75 nmol/l, 12 (52%) children reached a final serum 25(OH)D >75-150 nmol/l and finally 3 (13%) exceeded the normal reference range with a final 25(OH)D >150 nmol/l. Markers for remodelling ALP and PTH had significantly decreased (p = 0.001 and <0.0001 for ALP and PTH respectively). In 17 patients for whom insulin and HOMA IR data were available and enrolled in glucose study, significant improvements in insulin resistance (p = 0.04) with a trend toward a reduction in serum insulin (p = 0.05) was observed. Of those 14 children who had their cytokines profile data analysed and enrolled in the immunity study, insulin and HOMA IR data were missed in one child. A significant increase in the main Th2 secreted cytokine IL-4 (p = 0.001) and a tendency for significant increases in other Th2 secreted cytokines IL-5 (p = 0.05) and IL-6 (p = 0.05) was observed following vitamin D3 supplementation. Conclusion: An electronic surveillance system can provide data for studying the epidemiology of vitamin D deficiency. However, it may underestimate the number of positive cases. Improving vitamin D status in vitamin D deficient otherwise healthy children significantly improved their vitamin D deficient status, and was associated with an improvement in bone profile, improvements in insulin resistance and an alteration in main Th2 secreting cytokines.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

En este trabajo se propone un nuevo sistema híbrido para el análisis de sentimientos en clase múltiple basado en el uso del diccionario General Inquirer (GI) y un enfoque jerárquico del clasificador Logistic Model Tree (LMT). Este nuevo sistema se compone de tres capas, la capa bipolar (BL) que consta de un LMT (LMT-1) para la clasificación de la polaridad de sentimientos, mientras que la segunda capa es la capa de la Intensidad (IL) y comprende dos LMTs (LMT-2 y LMT3) para detectar por separado tres intensidades de sentimientos positivos y tres intensidades de sentimientos negativos. Sólo en la fase de construcción, la capa de Agrupación (GL) se utiliza para agrupar las instancias positivas y negativas mediante el empleo de 2 k-means, respectivamente. En la fase de Pre-procesamiento, los textos son segmentados por palabras que son etiquetadas, reducidas a sus raíces y sometidas finalmente al diccionario GI con el objetivo de contar y etiquetar sólo los verbos, los sustantivos, los adjetivos y los adverbios con 24 marcadores que se utilizan luego para calcular los vectores de características. En la fase de Clasificación de Sentimientos, los vectores de características se introducen primero al LMT-1, a continuación, se agrupan en GL según la etiqueta de clase, después se etiquetan estos grupos de forma manual, y finalmente las instancias positivas son introducidas a LMT-2 y las instancias negativas a LMT-3. Los tres árboles están entrenados y evaluados usando las bases de datos Movie Review y SenTube con validación cruzada estratificada de 10-pliegues. LMT-1 produce un árbol de 48 hojas y 95 de tamaño, con 90,88% de exactitud, mientras que tanto LMT-2 y LMT-3 proporcionan dos árboles de una hoja y uno de tamaño, con 99,28% y 99,37% de exactitud,respectivamente. Los experimentos muestran que la metodología de clasificación jerárquica propuesta da un mejor rendimiento en comparación con otros enfoques prevalecientes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Current research on achievement goals acknowledges that students can manifest different goal patterns. This study aimed to adapt and validate a self-report scale to assess the goal orientations of Portuguese students. A total of 2675 (age range 9–24 years) Portuguese students completed the Goal Orientations Scale (GOS). Through a cross-validation procedure, confirmatory factor analysis and descriptive statistics supports the existence of four different goal orientations: task, self-enhancing, self-defeating and avoidance orientations. The reliability and the internal validity estimates confirm that the GOS is an adequate instrument in assessing student goal orientations.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper, we investigate output accuracy for a Discrete Event Simulation (DES) model and Agent Based Simulation (ABS) model. The purpose of this investigation is to find out which of these simulation techniques is the best one for modelling human reactive behaviour in the retail sector. In order to study the output accuracy in both models, we have carried out a validation experiment in which we compared the results from our simulation models to the performance of a real system. Our experiment was carried out using a large UK department store as a case study. We had to determine an efficient implementation of management policy in the store’s fitting room using DES and ABS. Overall, we have found that both simulation models were a good representation of the real system when modelling human reactive behaviour.