56 resultados para Datasets

em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain


Relevância:

10.00% 10.00%

Publicador:

Resumo:

The aim of this note is to complement some of the results appearing in Dolado et al. (2003) article “Publishing Performance in Economics: Spanish Rankings” Particularly we want to focus on three issues: the robustness of the results regardless of the time span considered, the evaluation of a researcher to the advance of the knowledge, and to what extent the choice of a particular database to download the results can affect the results. Differences are significant when we expand the time period considered. There are also small but significant differences if we combine datasets to derive the rankings.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The severely poor are very poor since their consumption is far below the absolute poverty line, and the chronically poor are very poor since their consumption persists for long periods below the absolute poverty line. A combination of chronic poverty and severe poverty (CSP) must represent the very worst instance of poverty. Yet the exercise in this paper of asking simple questions about CSP shows large research gaps. Quantified statements on CSP at the country level can be made for just 14 countries, and at the household level in just six countries. This data suggests a positive correlation between severe poverty and chronic poverty, both at the country level and the household level. Understanding the CSP relationship – whether it is strong, where it arises, what causes it – may improve our explanation of observed cross-country variation in the elasticity between macroeconomic growth and poverty reduction, and why within countries, some households take better advantage of opportunities afforded by macroeconomic growth. Some limited data suggests similarity in socioeconomic characteristics of the severe poor and the chronic poor in terms of location, household size, gender, education and economic sector of work. Of concern is that microlongitudinal datasets drop large proportions of their base year samples, and how this affects our understanding of CSP is not well evaluated. On causal mechanisms, evidence suggests that CSP may be caused by parental CSP (i.e. an intergenerational CSP cycle) and in households not previously poor, CSP may be caused by a morbidity cycle.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This work complements some of the results appearing in the article ?Publishing Performance in Economics: Spanish Rankings? by Dolado et al. . Specifically we focus on the robustness of the results regardless of the time span considered, the effect of the choice of a particular database on the final results, and the effects on changes in the unit of institutional measure (departments versus institutions as a whole). Differences are significant when we expand the time period considered. There are also significant but small differences if we combine datasets to derive the rankings. Finally, department rankings offer a more precise picture of the situation of the Spanish academics, although results do not differ substantially from those obtained when overall institutions are considered.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

El canvi climàtic del segle XXI és una realitat, hi ha moltes evidències científiques que indiquen que l’escalfament del sistema climàtic és inequívoc. Malgrat això, també hi ha moltes incerteses respecte els impactes que pot comportar aquest canvi climàtic global. L’objectiu d’aquest projecte és estudiar la possible evolució futura de tres variables climàtiques, que són el rang de la temperatura diürna a prop de la superfície (DTR), la temperatura mitjana a prop de la superfície (MT) i la precipitació mensual (PL_mes) i valorar l’exposició que poden experimentar diferents cobertes del sòl i diferents regions biogeogràfiques del continent europeu davant d’aquests possibles patrons de canvi. Per això s’han utilitzat Models Climàtics Globals que fan projeccions de variables climàtiques que permeten preveure el possible clima futur. Mitjançant l’aplicatiu informàtic Tetyn s’han extret els paràmetres climàtics dels conjunts de dades del Tyndall Centre for Climate Change Research, del futur (TYN SC) i del passat (CRU TS). Les variables obtingudes s’han processat amb eines de sistemes d’informació geogràfica (SIG) per obtenir els patrons de canvi de les variables a cada coberta del sòl. Els resultats obtinguts mostren que hi ha una gran variabilitat, que augmenta amb el temps, entre els diferents models climàtics i escenaris considerats, que posa de manifest la incertesa associada a la modelització climàtica, a la generació d’escenaris d’emissions i a la naturalesa dinàmica i no determinista del sistema climàtic. Però en general, mostren que les glaceres seran una de les cobertes més exposades al canvi climàtic, i la mediterrània, una de les regions més vulnerables

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A parts based model is a parametrization of an object class using a collection of landmarks following the object structure. The matching of parts based models is one of the problems where pairwise Conditional Random Fields have been successfully applied. The main reason of their effectiveness is tractable inference and learning due to the simplicity of involved graphs, usually trees. However, these models do not consider possible patterns of statistics among sets of landmarks, and thus they sufffer from using too myopic information. To overcome this limitation, we propoese a novel structure based on a hierarchical Conditional Random Fields, which we explain in the first part of this memory. We build a hierarchy of combinations of landmarks, where matching is performed taking into account the whole hierarchy. To preserve tractable inference we effectively sample the label set. We test our method on facial feature selection and human pose estimation on two challenging datasets: Buffy and MultiPIE. In the second part of this memory, we present a novel approach to multiple kernel combination that relies on stacked classification. This method can be used to evaluate the landmarks of the parts-based model approach. Our method is based on combining responses of a set of independent classifiers for each individual kernel. Unlike earlier approaches that linearly combine kernel responses, our approach uses them as inputs to another set of classifiers. We will show that we outperform state-of-the-art methods on most of the standard benchmark datasets.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

While the Internet has given educators access to a steady supply of Open Educational Resources, the educational rubrics commonly shared on the Web are generally in the form of static, non-semantic presentational documents or in the proprietary data structures of commercial content and learning management systems.With the advent of Semantic Web Standards, producers of online resources have a new framework to support the open exchange of software-readable datasets. Despite these advances, the state of the art of digital representation of rubrics as sharable documents has not progressed.This paper proposes an ontological model for digital rubrics. This model is built upon the Semantic Web Standards of the World Wide Web Consortium (W3C), principally the Resource Description Framework (RDF) and Web Ontology Language (OWL).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The log-ratio methodology makes available powerful tools for analyzing compositionaldata. Nevertheless, the use of this methodology is only possible for those data setswithout null values. Consequently, in those data sets where the zeros are present, aprevious treatment becomes necessary. Last advances in the treatment of compositionalzeros have been centered especially in the zeros of structural nature and in the roundedzeros. These tools do not contemplate the particular case of count compositional datasets with null values. In this work we deal with \count zeros" and we introduce atreatment based on a mixed Bayesian-multiplicative estimation. We use the Dirichletprobability distribution as a prior and we estimate the posterior probabilities. Then weapply a multiplicative modi¯cation for the non-zero values. We present a case studywhere this new methodology is applied.Key words: count data, multiplicative replacement, composition, log-ratio analysis

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The application of Discriminant function analysis (DFA) is not a new idea in the studyof tephrochrology. In this paper, DFA is applied to compositional datasets of twodifferent types of tephras from Mountain Ruapehu in New Zealand and MountainRainier in USA. The canonical variables from the analysis are further investigated witha statistical methodology of change-point problems in order to gain a betterunderstanding of the change in compositional pattern over time. Finally, a special caseof segmented regression has been proposed to model both the time of change and thechange in pattern. This model can be used to estimate the age for the unknown tephrasusing Bayesian statistical calibration

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In CoDaWork’05, we presented an application of discriminant function analysis (DFA) to 4 differentcompositional datasets and modelled the first canonical variable using a segmented regression modelsolely based on an observation about the scatter plots. In this paper, multiple linear regressions areapplied to different datasets to confirm the validity of our proposed model. In addition to dating theunknown tephras by calibration as discussed previously, another method of mapping the unknown tephrasinto samples of the reference set or missing samples in between consecutive reference samples isproposed. The application of these methodologies is demonstrated with both simulated and real datasets.This new proposed methodology provides an alternative, more acceptable approach for geologists as theirfocus is on mapping the unknown tephra with relevant eruptive events rather than estimating the age ofunknown tephra.Kew words: Tephrochronology; Segmented regression

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Given a set of images of scenes containing different object categories (e.g. grass, roads) our objective is to discover these objects in each image, and to use this object occurrences to perform a scene classification (e.g. beach scene, mountain scene). We achieve this by using a supervised learning algorithm able to learn with few images to facilitate the user task. We use a probabilistic model to recognise the objects and further we classify the scene based on their object occurrences. Experimental results are shown and evaluated to prove the validity of our proposal. Object recognition performance is compared to the approaches of He et al. (2004) and Marti et al. (2001) using their own datasets. Furthermore an unsupervised method is implemented in order to evaluate the advantages and disadvantages of our supervised classification approach versus an unsupervised one

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A recent finding of the structural VAR literature is that the response of hours worked to a technology shock depends on the assumption on the order of integration of the hours. In this work we relax this assumption, allowing for fractional integration and long memory in the process for hours and productivity. We find that the sign and magnitude of the estimated impulse responses of hours to a positive technology shock depend crucially on the assumptions applied to identify them. Responses estimated with short-run identification are positive and statistically significant in all datasets analyzed. Long-run identification results in negative often not statistically significant responses. We check validity of these assumptions with the Sims (1989) procedure, concluding that both types of assumptions are appropriate to recover the impulse responses of hours in a fractionally integrated VAR. However, the application of longrun identification results in a substantial increase of the sampling uncertainty. JEL Classification numbers: C22, E32. Keywords: technology shock, fractional integration, hours worked, structural VAR, identification

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Background: Systematic approaches for identifying proteins involved in different types of cancer are needed. Experimental techniques such as microarrays are being used to characterize cancer, but validating their results can be a laborious task. Computational approaches are used to prioritize between genes putatively involved in cancer, usually based on further analyzing experimental data. Results: We implemented a systematic method using the PIANA software that predicts cancer involvement of genes by integrating heterogeneous datasets. Specifically, we produced lists of genes likely to be involved in cancer by relying on: (i) protein-protein interactions; (ii) differential expression data; and (iii) structural and functional properties of cancer genes. The integrative approach that combines multiple sources of data obtained positive predictive values ranging from 23% (on a list of 811 genes) to 73% (on a list of 22 genes), outperforming the use of any of the data sources alone. We analyze a list of 20 cancer gene predictions, finding that most of them have been recently linked to cancer in literature. Conclusion: Our approach to identifying and prioritizing candidate cancer genes can be used to produce lists of genes likely to be involved in cancer. Our results suggest that differential expression studies yielding high numbers of candidate cancer genes can be filtered using protein interaction networks.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

An important problem in descriptive and prescriptive research in decision making is to identify regions of rationality, i.e., the areas for which heuristics are and are not effective. To map the contours of such regions, we derive probabilities that heuristics identify the best of m alternatives (m > 2) characterized by k attributes or cues (k > 1). The heuristics include a single variable (lexicographic), variations of elimination-by-aspects, equal weighting, hybrids of the preceding, and models exploiting dominance. We use twenty simulated and four empirical datasets for illustration. We further provide an overview by regressing heuristic performance on factors characterizing environments. Overall, sensible heuristics generally yield similar choices in many environments. However, selection of the appropriate heuristic can be important in some regions (e.g., if there is low inter-correlation among attributes/cues). Since our work assumes a hit or miss decision criterion, we conclude by outlining extensions for exploring the effects of different loss functions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

An important policy issue in recent years concerns the number of people claimingdisability benefits for reasons of incapacity for work. We distinguish between workdisability , which may have its roots in economic and social circumstances, and healthdisability which arises from clear diagnosed medical conditions. Although there is a linkbetween work and health disability, economic conditions, and in particular the businesscycle and variations in the risk of unemployment over time and across localities, mayplay an important part in explaining both the stock of disability benefit claimants andinflows to and outflow from that stock. We employ a variety of cross?country andcountry?specific household panel data sets, as well as administrative data, to testwhether disability benefit claims rise when unemployment is higher, and also toinvestigate the impact of unemployment rates on flows on and off the benefit rolls. Wefind strong evidence that local variations in unemployment have an importantexplanatory role for disability benefit receipt, with higher total enrolments, loweroutflows from rolls and, often, higher inflows into disability rolls in regions and periodsof above?average unemployment. Although general subjective measures of selfreporteddisability and longstanding illness are also positively associated withunemployment rates, inclusion of self?reported health measures does not eliminate thestatistical relationship between unemployment rates and disability benefit receipt;indeed including general measures of health often strengthens that underlyingrelationship. Intriguingly, we also find some evidence from the United Kingdom and theUnited States that the prevalence of self?reported objective specific indicators ofdisability are often pro?cyclical that is, the incidence of specific forms of disability arepro?cyclical whereas claims for disability benefits given specific health conditions arecounter?cyclical. Overall, the analysis suggests that, for a range of countries and datasets, levels of claims for disability benefits are not simply related to changes in theincidence of health disability in the population and are strongly influenced by prevailingeconomic conditions. We discuss the policy implications of these various findings.