Digital Library

41 results for outlier detection, data mining, gpgpu, gpu computing, supercomputing

in Consorci de Serveis Universitaris de Catalunya (CSUC), Spain

Data mining and mall users profile

Relevance: 100.00% 100.00%

Abstract:

Marketing scholars have called for more empirical research on consumer response to malls in order to better understand the variables that explain consumer behavior. The segmentation methodology CHAID (chi-square automatic interaction detection) was used to identify consumer profiles with regard to their activities at malls, on the basis of socio-demographic and behavioral variables (how and with whom they go to the malls). A sample of 790 subjects answered an online questionnaire. Among the variables analyzed, the transport used to go shopping and the frequency of visits to centers are the main predictors of behavior in malls. The results provide guidelines for developing effective strategies to attract consumers to malls and retain them there.
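As a rough illustration of the idea behind CHAID, the sketch below computes the Pearson chi-square statistic for each candidate predictor against the response and picks the strongest one as the split variable. It is a simplified sketch only: full CHAID also merges predictor categories and compares adjusted p-values, which is omitted here. The toy data, the variable names (`transport`, `companion`, `activity`) and the function names are invented for the example and do not come from the study.

```python
from collections import Counter

def chi_square(rows, predictor, response):
    """Pearson chi-square statistic of the predictor x response cross-tabulation."""
    n = len(rows)
    pred_counts = Counter(r[predictor] for r in rows)
    resp_counts = Counter(r[response] for r in rows)
    cell_counts = Counter((r[predictor], r[response]) for r in rows)
    stat = 0.0
    for p, p_n in pred_counts.items():
        for q, q_n in resp_counts.items():
            expected = p_n * q_n / n          # count expected under independence
            observed = cell_counts.get((p, q), 0)
            stat += (observed - expected) ** 2 / expected
    return stat

def best_split(rows, predictors, response):
    """Pick the predictor with the largest chi-square statistic.

    Simplification: comparing raw statistics is only fair when the tables
    have the same degrees of freedom; real CHAID compares adjusted p-values.
    """
    return max(predictors, key=lambda p: chi_square(rows, p, response))

# Invented toy sample: transport is strongly tied to the activity,
# while the companion variable is independent of it.
rows = (
    [{"transport": "car", "companion": "alone", "activity": "shopping"}] * 30
    + [{"transport": "car", "companion": "family", "activity": "shopping"}] * 30
    + [{"transport": "bus", "companion": "alone", "activity": "leisure"}] * 20
    + [{"transport": "bus", "companion": "family", "activity": "leisure"}] * 20
)

split = best_split(rows, ["transport", "companion"], "activity")
```

On this toy sample the split variable is `transport`, because `companion` is distributed identically across both activities.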

A Data mining approach to indirect inference

Relevance: 100.00% 100.00%

Abstract:

Consider a model with parameter phi, and an auxiliary model with parameter theta. Let phi be randomly sampled from a given density over the known parameter space. Monte Carlo methods can be used to draw simulated data and compute the corresponding estimate of theta, say theta_tilde. A large set of tuples (phi, theta_tilde) can be generated in this manner. Nonparametric methods may be used to fit the function E(phi|theta_tilde=a), using these tuples. It is proposed to estimate phi using the fitted E(phi|theta_tilde=theta_hat), where theta_hat is the auxiliary estimate, using the real sample data. This is a consistent and asymptotically normally distributed estimator, under certain assumptions. Monte Carlo results for dynamic panel data and vector autoregressions show that this estimator can have very attractive small sample properties. Confidence intervals can be constructed using the quantiles of the phi for which theta_tilde is close to theta_hat. Such confidence intervals are found to have very accurate coverage.
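The procedure described in this abstract can be sketched in a few lines. The example below is an illustration under assumptions that are not in the abstract: the model is an AR(1) process, the auxiliary estimate is the OLS autoregressive slope, phi is drawn from a uniform density, and the nonparametric fit is a plain k-nearest-neighbour average. The "real" sample is itself simulated, purely as a stand-in.

```python
import random

def simulate_ar1(phi, n, rng):
    """Simulate n observations of y_t = phi * y_{t-1} + e_t, with e_t ~ N(0, 1)."""
    y, prev = [], 0.0
    for _ in range(n):
        prev = phi * prev + rng.gauss(0.0, 1.0)
        y.append(prev)
    return y

def auxiliary_estimate(y):
    """Auxiliary statistic theta: OLS slope of y_t on y_{t-1}, no intercept."""
    num = sum(a * b for a, b in zip(y[1:], y[:-1]))
    den = sum(b * b for b in y[:-1])
    return num / den

def nearest(tuples, a, k):
    """The k tuples (phi, theta_tilde) whose theta_tilde is closest to a."""
    return sorted(tuples, key=lambda t: abs(t[1] - a))[:k]

rng = random.Random(0)
n_obs, n_draws, k = 30, 2000, 50   # assumed sizes, chosen only for the sketch

# Draw phi from a density over the parameter space, simulate data,
# and record the tuple (phi, theta_tilde).
tuples = []
for _ in range(n_draws):
    phi = rng.uniform(0.0, 0.95)
    tuples.append((phi, auxiliary_estimate(simulate_ar1(phi, n_obs, rng))))

# Stand-in for the real sample and its auxiliary estimate theta_hat.
real_sample = simulate_ar1(0.6, n_obs, rng)
theta_hat = auxiliary_estimate(real_sample)

# Fitted E(phi | theta_tilde = theta_hat): average phi over the neighbours.
neighbours = nearest(tuples, theta_hat, k)
phi_hat = sum(phi for phi, _ in neighbours) / k

# Interval from the quantiles of phi among draws with theta_tilde near theta_hat.
phis = sorted(phi for phi, _ in neighbours)
lower, upper = phis[int(0.05 * k)], phis[int(0.95 * k) - 1]
```

A kernel or local-linear smoother could replace the nearest-neighbour average; the k-NN version is used here only because it is the shortest nonparametric fit to write down.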

Using free data mining software and clustering algorithms to find predictors from student qualifications

Relevance: 100.00% 100.00%

Abstract:

This project both investigates how to find predictors via clustering techniques and reviews free data mining software. The research is based on a case study in which, in addition to the free KDD software used by the scientific community, a new free tool for pre-processing the data is presented. The predictors are intended for the e-learning domain, as the data from which they are to be inferred are student qualifications from different e-learning environments. Through the case study, clustering algorithms are tested and additional goals are proposed.

Educational data mining and learning analytics: classification of A.D.E. enrolments at the UOC [Educational data mining and learning analytics : Clasificación de las matriculaciones de A.D.E. en la UOC]

Relevance: 100.00% 100.00%

Abstract:

Research work that carries out a classification study of the courses students enrol in within the Business Administration and Management (Administración y Dirección de Empresas) degree at the UOC, in relation to their outcome. Different methods and models are proposed for understanding the environment in which the study is carried out.

Educational Data Mining: closing the loop of the learning process in virtual environments [Educational Data Mining: cerrando el círculo del proceso de aprendizaje en entornos virtuales]

Relevance: 100.00% 100.00%

Abstract:

Development of methods to explore data from educational settings in order to better understand the learning process.

The EU-ADR Web Platform: delivering advanced pharmacovigilance tools

Relevance: 100.00% 100.00%

Abstract:

PURPOSE: Pharmacovigilance methods have advanced greatly during the last decades, making post-market drug assessment an essential drug evaluation component. These methods mainly rely on the use of spontaneous reporting systems and health information databases to collect expertise from huge amounts of real-world reports. The EU-ADR Web Platform was built to further facilitate accessing, monitoring and exploring these data, enabling an in-depth analysis of adverse drug reaction risks. METHODS: The EU-ADR Web Platform exploits the wealth of data collected within a large-scale European initiative, the EU-ADR project. Millions of electronic health records, provided by national health agencies, are mined for specific drug events, which are correlated with literature, protein and pathway data, resulting in a rich drug-event dataset. Next, advanced distributed computing methods are tailored to coordinate the execution of data-mining and statistical analysis tasks. This permits obtaining a ranked drug-event list, removing spurious entries and highlighting relationships with high risk potential. RESULTS: The EU-ADR Web Platform is an open workspace for the integrated analysis of pharmacovigilance datasets. Using this software, researchers can access a variety of tools provided by distinct partners in a single centralized environment. Besides performing standalone drug-event assessments, they can also control the pipeline for an improved batch analysis of custom datasets. Drug-event pairs can be substantiated and statistically analysed within the platform's innovative working environment. CONCLUSIONS: A pioneering workspace that helps in explaining the biological path of adverse drug reactions was developed within the EU-ADR project consortium. This tool, targeted at the pharmacovigilance community, is available online at https://bioinformatics.ua.pt/euadr/. Copyright © 2012 John Wiley & Sons, Ltd.

Destination image of Girona: an online text-mining approach

Relevance: 100.00% 100.00%

Abstract:

The main objective of this Master’s thesis is to discover more about Girona’s image as a tourism destination from the perspective of different agents and to study how it differs across promotion and opinions. To meet this objective, three components of Girona’s destination image are studied: the attribute-based component, the holistic component, and the affective component. Although much research has been done on tourism destination image, far less of it concerns Girona. Some studies have already focused on Girona as a tourist destination, but they used a different type of sample and different methodological steps. This study is new among destination studies in that it is based only on online textual data and follows a text-mining methodology, which allows relevant information to be extracted from texts. Once this information has been extracted, multivariate statistical analyses are carried out with the aim of discovering more about Girona’s tourism image.

Bioinformatics: cross-queries against remote biomedical databases [Bioinformática: consultas cruzadas a bases de datos biomédicas remotas]

Relevance: 100.00% 100.00%

Abstract:

This report details the steps and processes carried out to build an application that makes it possible to cross-reference genetic data using information held in remote databases. It presents an in-depth study of the content and structure of the remote NCBI and KEGG databases, documenting a data mining exercise aimed at extracting from them the information needed to develop the genetic data cross-referencing application. Finally, it sets out the programs, scripts and graphical environments that were implemented to build and then deploy the application, which provides the cross-referencing functionality that is the subject of this final degree project.

Introducing a company to knowledge extraction from data [Introducció d'una empresa a l'extracció del coneixement a partir d'unes dades]

Relevance: 100.00% 100.00%

Abstract:

Final degree project in the area of data mining whose aim is to implement a project of

Knowledge extraction from an operational database of amusement and gambling machines [Extracció de coneixement d'una base de dades d'explotació de màquines recreatives i d'atzar.]

Relevance: 100.00% 100.00%

Abstract:

The aim of this work was to show what management software applications can do with the data they work with. They simply display the data stored in the operational databases, either in detail or summarised and with totals. Knowledge can also be extracted from these data.

Implementation and training of a neural network classification model on the IGBADAT database for classifying basaltic rocks according to the classes of the traditional Yoder and Tiller classification system [Implementación y entrenamiento de un modelo clasificatorio de red neural sobre la base de datos IGBADAT para la clasificación de las rocas basálticas de acuerdo a las clases del sistema de clasificación tradicional de Yoder and Tiller.]

Relevance: 100.00% 100.00%

Abstract:

This work sets out to implement a data mining project in the area of igneous petrology, a speciality within classical geology.

Web aggregator [Agregador WEB]

Relevance: 100.00% 100.00%

Abstract:

The aim of the application is to facilitate and automate, as far as possible, the work of people who maintain product databases on the Internet. In future versions it could be extended to other uses (and must be prepared for this), such as updating a web page with third-party information in near real time.

Development of an XML-based protocol for a sensor network [Elaboració d'un protocol basat en XML per a una xarxa de sensors]

Relevance: 100.00% 100.00%

Abstract:

The progressive reduction in the size and cost of electronic devices, the drastic cut in their power consumption and the independence this gives them have recently increased the interest of the scientific and technological communities in wireless networks of small devices. Meanwhile, XML (eXtensible Markup Language) is an extensible metalanguage that has become a standard for exchanging structured information between different platforms. The aim of this work is to explore the possibilities offered by introducing XML into sensor networks, by developing a communication protocol based on this language, and to demonstrate transparency when changing platform. To do so, two devices with wireless communication capability are available, equipped with sensors for temperature, luminosity, Hall effect and battery charge level. The project will consist of two parts: a larger one devoted to developing the software for these devices, which obtains the readings from the different sensors and transmits them over the network using XML, and another that collects this information from the network, interprets it, stores it in a database and publishes it on a web page. The software for the sensor devices will be written in the nesC language within TinyOS, the operating system they run. The data exploitation part will be developed on Microsoft's .NET platform.
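To make the idea of an XML-based sensor message concrete, here is a small sketch of encoding one node's readings as XML and decoding them on the collecting side. The abstract does not give the actual message schema, so the element and attribute names (`reading`, `sensor`, `node`, `type`, `unit`), the units and the sample values are all hypothetical. Python is used only for illustration; the project itself used nesC on the devices and .NET on the collecting side.

```python
import xml.etree.ElementTree as ET

def encode_reading(node_id, readings):
    """Serialise one node's sensor readings as an XML message (hypothetical schema)."""
    root = ET.Element("reading", {"node": str(node_id)})
    for name, (value, unit) in readings.items():
        sensor = ET.SubElement(root, "sensor", {"type": name, "unit": unit})
        sensor.text = str(value)
    return ET.tostring(root, encoding="unicode")

def decode_reading(message):
    """Parse a message back into (node_id, {sensor type: value})."""
    root = ET.fromstring(message)
    values = {s.get("type"): float(s.text) for s in root.findall("sensor")}
    return int(root.get("node")), values

# The four sensor kinds named in the abstract, with invented values and units.
message = encode_reading(7, {
    "temperature": (21.5, "C"),
    "luminosity": (340.0, "lux"),
    "hall": (0.0, "mT"),
    "battery": (87.0, "%"),
})
node, values = decode_reading(message)
```

Because the message is plain text with self-describing tags, any platform with an XML parser can consume it, which is the platform transparency the abstract aims to demonstrate.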

TFC Data warehouses: construction and exploitation of a hydrological planning data warehouse [TFC Magatzems de dades: construcció i explotació d'un magatzem de dades de planificació hidrològica]

Relevance: 100.00% 100.00%

Abstract:

This TFC (final degree project) consists of creating a data warehouse that uses ETL processes to automate the collection of data on the state of the reservoirs of the Confederació Hidrogràfica Nord-Est. These data are then processed with PL/SQL so that they can be exploited with Business Intelligence tools.

Construction and exploitation of a hydrological planning data warehouse [Construcción y explotación de un almacén de datos de planificación hidrológica]

Relevance: 100.00% 100.00%

Abstract:

Construction and exploitation of a hydrological planning data warehouse for the Confederación Hidrográfica del Norte y Este.
