21 resultados para Data mining and knowledge discovery

em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Trabajo de investigación que realiza un estudio clasificatorio de las asignaturas matriculadas en la carrera de Administración y Dirección de Empresas de la UOC en relación a su resultado. Se proponen diferentes métodos y modelos de comprensión del entorno en el que se realiza el estudio.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Marketing scholars have suggested a need for more empirical research on consumer response to malls, in order to have a better understanding of the variables that explain the behavior of the consumers. The segmentation methodology CHAID (Chi-square automatic interaction detection) was used in order to identify the profiles of consumers with regard to their activities at malls, on the basis of socio-demographic variables and behavioral variables (how and with whom they go to the malls). A sample of 790 subjects answered an online questionnaire. The CHAID analysis of the results was used to identify the profiles of consumers with regard to their activities at malls. In the set of variables analyzed the transport used in order to go shopping and the frequency of visits to centers are the main predictors of behavior in malls. The results provide guidelines for the development of effective strategies to attract consumers to malls and retain them there.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this project a research both in finding predictors via clustering techniques and in reviewing the Data Mining free software is achieved. The research is based in a case of study, from where additionally to the KDD free software used by the scientific community; a new free tool for pre-processing the data is presented. The predictors are intended for the e-learning domain as the data from where these predictors have to be inferred are student qualifications from different e-learning environments. Through our case of study not only clustering algorithms are tested but also additional goals are proposed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This report is an extension and partial update of de la Fuente and Ciccone (2002). It constructs estimates of the private and social rates of return on schooling for fourteen EU countries using microeconometric estimates of Mincerian wage equations, the results of cross-country growth regressions and OECD data on educational expenditures, tax rates and social benefits. The results are used to draw some tentative conclusions regarding the optimality of observed investment patterns and educational subsidy levels.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Consider a model with parameter phi, and an auxiliary model with parameter theta. Let phi be a randomly sampled from a given density over the known parameter space. Monte Carlo methods can be used to draw simulated data and compute the corresponding estimate of theta, say theta_tilde. A large set of tuples (phi, theta_tilde) can be generated in this manner. Nonparametric methods may be use to fit the function E(phi|theta_tilde=a), using these tuples. It is proposed to estimate phi using the fitted E(phi|theta_tilde=theta_hat), where theta_hat is the auxiliary estimate, using the real sample data. This is a consistent and asymptotically normally distributed estimator, under certain assumptions. Monte Carlo results for dynamic panel data and vector autoregressions show that this estimator can have very attractive small sample properties. Confidence intervals can be constructed using the quantiles of the phi for which theta_tilde is close to theta_hat. Such confidence intervals are found to have very accurate coverage.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The increasing volume of data describing humandisease processes and the growing complexity of understanding, managing, and sharing such data presents a huge challenge for clinicians and medical researchers. This paper presents the@neurIST system, which provides an infrastructure for biomedical research while aiding clinical care, by bringing together heterogeneous data and complex processing and computing services. Although @neurIST targets the investigation and treatment of cerebral aneurysms, the system’s architecture is generic enough that it could be adapted to the treatment of other diseases.Innovations in @neurIST include confining the patient data pertaining to aneurysms inside a single environment that offers cliniciansthe tools to analyze and interpret patient data and make use of knowledge-based guidance in planning their treatment. Medicalresearchers gain access to a critical mass of aneurysm related data due to the system’s ability to federate distributed informationsources. A semantically mediated grid infrastructure ensures that both clinicians and researchers are able to seamlessly access andwork on data that is distributed across multiple sites in a secure way in addition to providing computing resources on demand forperforming computationally intensive simulations for treatment planning and research.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objective of the PANACEA ICT-2007.2.2 EU project is to build a platform that automates the stages involved in the acquisition,production, updating and maintenance of the large language resources required by, among others, MT systems. The development of a Corpus Acquisition Component (CAC) for extracting monolingual and bilingual data from the web is one of the most innovative building blocks of PANACEA. The CAC, which is the first stage in the PANACEA pipeline for building Language Resources, adopts an efficient and distributed methodology to crawl for web documents with rich textual content in specific languages and predefined domains. The CAC includes modules that can acquire parallel data from sites with in-domain content available in more than one language. In order to extrinsically evaluate the CAC methodology, we have conducted several experiments that used crawled parallel corpora for the identification and extraction of parallel sentences using sentence alignment. The corpora were then successfully used for domain adaptation of Machine Translation Systems.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This document is a report prepared for the DG for Employment and Social Affairs of the European Commission. It surveys the available evidence on the contribution of investment in human capital to aggregate productivity growth and on its impact on wages and other labour outcomes at the individual level. It also draws some tentative policy conclusions for an average European country.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The application of compositional data analysis through log ratio trans-formations corresponds to a multinomial logit model for the shares themselves.This model is characterized by the property of Independence of Irrelevant Alter-natives (IIA). IIA states that the odds ratio in this case the ratio of shares is invariant to the addition or deletion of outcomes to the problem. It is exactlythis invariance of the ratio that underlies the commonly used zero replacementprocedure in compositional data analysis. In this paper we investigate using thenested logit model that does not embody IIA and an associated zero replacementprocedure and compare its performance with that of the more usual approach ofusing the multinomial logit model. Our comparisons exploit a data set that com-bines voting data by electoral division with corresponding census data for eachdivision for the 2001 Federal election in Australia

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The paper analyzes the effects of strategic behavior by an insider in a price discovery process, akin to an information tatonnement, in the presence of a competitive informed sector. Such processes are used in the preopening period of continuous trading systems in several exchanges. It is found that the insider manipulates the market using a contrarian strategy in order to neutralize the effect of the trades of competitive informed agents. Furthermore, consistently with the empirical evidence available, we find that information revelation accelerates close to the opening, that the market price does not converge to the fundamental value no matter how many rounds the tatonnement has, and that the expected trading volume displays a U-shaped pattern. We also find that a market with a larger competitive sector (smaller insider) has an improved informational efficiency and an increased trading volume. The insider provides a public good (a lower informativeness of the price) for the competitive informed sector.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

La infancia extranjera se escolariza en Cataluña en un programa de cambio de lengua del hogar a la escuela. Las investigaciones afirman que este alumnado tarda un mínimo de seis años en equiparar sus habilidades lingüístico-cognitivas con sus pares autóctonos, no así las habilidades conversacionales, las cuales se adquieren antes de los dos años de residencia. Sin embargo, no existen estudios sobre los efectos de la escolarización en el parvulario del alumnado alófono, así como de su lengua familiar, en relación con la adquisición de la lengua escolar. El artículo es un estudio comparativo de la adquisición del catalán de 567 autóctonos y 434 alófonos, al final del parvulario, en 50 escuelas de Cataluña que escolarizan a alumnado de origen extranjero. Las lenguas del alumnado autóctono son el catalán, el castellano y el bilingüismo catalán-castellano y las lenguas del alumnado alófono son el árabe, el soninké y el castellano. Los factores utilizados más relevantes han sido el nivel socioprofesional y educativo de las familias, el tiempo de residencia y el momento de escolarización del alumnado, el porcentaje de alumnado catalanohablante y de alumnado alófono en el aula y el contexto sociolingüístico del centro escolar. Los resultados muestran que el alumnado autóctono sabe más catalán que el alumnado alófono, pero las diferencias desaparecen respecto a algunos factores, de los cuales los más relevantes son los relacionados con las características del alumnado de las aulas. La lengua familiar del alumnado alófono no incide en sus resultados

Relevância:

100.00% 100.00%

Publicador:

Resumo:

[cat] Analitzem una economia amb dues característiques principals: la mobilitat dels treballadors implica transferència de coneixement i la productivitat de l’empresa augmenta amb l’intercanvi de coneixement. Cada empresa desenvolupa un tipus de coneixement que serà trasmès a la resta de la indústria mitjançant la mobilitat de treballadors. Estudiem dues estructures de mercat laboral i utilitzant un anàlisi comparatiu derivem les implicacions del model. Els resultats revelen com la mobilitat de treballadors depèn en la varietat i nivell del coneixement, la presència de costos de mobilitat, les institucions, la capacitat d’absorvir coneixement per part de les empreses i la mida de la indústria. Els resultats no depenen de l’estructura del mercat laboral.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

[cat] Analitzem una economia amb dues característiques principals: la mobilitat dels treballadors implica transferència de coneixement i la productivitat de l’empresa augmenta amb l’intercanvi de coneixement. Cada empresa desenvolupa un tipus de coneixement que serà trasmès a la resta de la indústria mitjançant la mobilitat de treballadors. Estudiem dues estructures de mercat laboral i utilitzant un anàlisi comparatiu derivem les implicacions del model. Els resultats revelen com la mobilitat de treballadors depèn en la varietat i nivell del coneixement, la presència de costos de mobilitat, les institucions, la capacitat d’absorvir coneixement per part de les empreses i la mida de la indústria. Els resultats no depenen de l’estructura del mercat laboral.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Powell Basin is a small oceanic basin located at the NE end of the Antarctic Peninsula developed during the Early Miocene and mostly surrounded by the continental crusts of the South Orkney Microcontinent, South Scotia Ridge and Antarctic Peninsula margins. Gravity data from the SCAN 97 cruise obtained with the R/V Hespérides and data from the Global Gravity Grid and Sea Floor Topography (GGSFT) database (Sandwell and Smith, 1997) are used to determine the 3D geometry of the crustal-mantle interface (CMI) by numerical inversion methods. Water layer contribution and sedimentary effects were eliminated from the Free Air anomaly to obtain the total anomaly. Sedimentary effects were obtained from the analysis of existing and new SCAN 97 multichannel seismic profiles (MCS). The regional anomaly was obtained after spectral and filtering processes. The smooth 3D geometry of the crustal mantle interface obtained after inversion of the regional anomaly shows an increase in the thickness of the crust towards the continental margins and a NW-SE oriented axis of symmetry coinciding with the position of an older oceanic spreading axis. This interface shows a moderate uplift towards the western part and depicts two main uplifts to the northern and eastern sectors.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This article describes a method for determining the polydispersity index Ip2=Mz/Mw of the molecular weight distribution (MWD) of linear polymeric materials from linear viscoelastic data. The method uses the Mellin transform of the relaxation modulus of a simple molecular rheological model. One of the main features of this technique is that it enables interesting MWD information to be obtained directly from dynamic shear experiments. It is not necessary to achieve the relaxation spectrum, so the ill-posed problem is avoided. Furthermore, a determinate shape of the continuous MWD does not have to be assumed in order to obtain the polydispersity index. The technique has been developed to deal with entangled linear polymers, whatever the form of the MWD is. The rheological information required to obtain the polydispersity index is the storage G′(ω) and loss G″(ω) moduli, extending from the terminal zone to the plateau region. The method provides a good agreement between the proposed theoretical approach and the experimental polydispersity indices of several linear polymers for a wide range of average molecular weights and polydispersity indices. It is also applicable to binary blends.