Biblioteca Digital

945 resultados para cluster analysis

Os determinantes da capacidade regional de inovação nas Regiões Periféricas da União Europeia

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Dissertação de Mestrado em Gestão de Empresas/MBA.

Veja mais

On clustering interval data with different scales of measures : experimental results

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This article is is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. Attribution-NonCommercial (CC BY-NC) license lets others remix, tweak, and build upon work non-commercially, and although the new works must also acknowledge & be non-commercial.

Veja mais

Probabilistic approach for comparing partitions

Relevância:

60.00% 60.00%

Publicador:

Resumo:

3rd SMTDA Conference Proceedings, 11-14 June 2014, Lisbon, Portugal.

Veja mais

Clustering of variables with a three-way approach for health sciences

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Veja mais

Diferenciais intermunicipais de condições de vida e saúde: construção de um indicador composto

Relevância:

60.00% 60.00%

Publicador:

Resumo:

OBJETIVO: Descrever um índice para reconhecimento das desigualdades de condições de vida e saúde e sua relação com o planejamento em saúde. MÉTODOS: Foram selecionadas variáveis e indicadores que refletissem os processos demográficos, econômicos, ambientais e de educação, bem como oferta e produção de serviços de saúde. Esses indicadores foram utilizados no escalonamento adimensional dos indicadores e agrupamento dos 5.507 municípios brasileiros. As fontes de dados foram o censo de 2000 e os sistemas de informações do Ministério da Saúde. Para análise dos dados foram aplicados os testes z-score e cluster analysis. Com base nesses testes foram definidos quatro grupos de municípios segundo condições de vida. RESULTADOS: Existe uma polarização entre o grupo de melhores condições de vida e saúde (grupo 1) e o de piores condições (grupo 4). O grupo 1 é caracterizado pelos municípios de maior porte populacional e no grupo 4 estão principalmente os menores municípios. Quanto à macrorregião do País, os municípios do grupo 1 concentram-se no Sul e Sudeste e no grupo 4 estão os municípios do Nordeste. CONCLUSÕES: Por incorporar dimensões da realidade como habitação, meio ambiente e saúde, o índice de condições de vida e saúde permitiu identificar municípios mais vulneráveis, embasando a definição de prioridades, critérios para financiamento e repasse de recursos de forma mais eqüitativa.

Veja mais

Turismo de nichos : uma análise ao papel dos seniores como participantes ativos

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Dissertação de Mestrado, Gestão do Turismo Internacional, 3 de Dezembro de 2015, Universidade dos Açores.

Veja mais

Caracterização e valorização do recreio balnear

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Dissertação de Mestrado, Ciências Económicas e Empresariais, 9 Dezembro de 2015 , Universidade dos Açores.

Veja mais

Entrepreneurship Education : The role of the Higher Education Institutions in the Entrepreneurial Attitudes of the Students

Relevância:

60.00% 60.00%

Publicador:

Resumo:

8th International Conference of Education, Research and Innovation. 18-20 November, 2015, Seville, Spain.

Veja mais

Clustering an interval data set : are the main partitions similar to a priori partition?

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Veja mais

Feature selection for clustering categorical data with an embedded modelling approach

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Research on the problem of feature selection for clustering continues to develop. This is a challenging task, mainly due to the absence of class labels to guide the search for relevant features. Categorical feature selection for clustering has rarely been addressed in the literature, with most of the proposed approaches having focused on numerical data. In this work, we propose an approach to simultaneously cluster categorical data and select a subset of relevant features. Our approach is based on a modification of a finite mixture model (of multinomial distributions), where a set of latent variables indicate the relevance of each feature. To estimate the model parameters, we implement a variant of the expectation-maximization algorithm that simultaneously selects the subset of relevant features, using a minimum message length criterion. The proposed approach compares favourably with two baseline methods: a filter based on an entropy measure and a wrapper based on mutual information. The results obtained on synthetic data illustrate the ability of the proposed expectation-maximization method to recover ground truth. An application to real data, referred to official statistics, shows its usefulness.

Veja mais

Categorical data clustering using a minimum message length criterion

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Research on cluster analysis for categorical data continues to develop, new clustering algorithms being proposed. However, in this context, the determination of the number of clusters is rarely addressed. We propose a new approach in which clustering and the estimation of the number of clusters is done simultaneously for categorical data. We assume that the data originate from a finite mixture of multinomial distributions and use a minimum message length criterion (MML) to select the number of clusters (Wallace and Bolton, 1986). For this purpose, we implement an EM-type algorithm (Silvestre et al., 2008) based on the (Figueiredo and Jain, 2002) approach. The novelty of the approach rests on the integration of the model estimation and selection of the number of clusters in a single algorithm, rather than selecting this number based on a set of pre-estimated candidate models. The performance of our approach is compared with the use of Bayesian Information Criterion (BIC) (Schwarz, 1978) and Integrated Completed Likelihood (ICL) (Biernacki et al., 2000) using synthetic data. The obtained results illustrate the capacity of the proposed algorithm to attain the true number of cluster while outperforming BIC and ICL since it is faster, which is especially relevant when dealing with large data sets.

Veja mais

Determining the number of clusters in categorical data

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Cluster analysis for categorical data has been an active area of research. A well-known problem in this area is the determination of the number of clusters, which is unknown and must be inferred from the data. In order to estimate the number of clusters, one often resorts to information criteria, such as BIC (Bayesian information criterion), MML (minimum message length, proposed by Wallace and Boulton, 1968), and ICL (integrated classification likelihood). In this work, we adopt the approach developed by Figueiredo and Jain (2002) for clustering continuous data. They use an MML criterion to select the number of clusters and a variant of the EM algorithm to estimate the model parameters. This EM variant seamlessly integrates model estimation and selection in a single algorithm. For clustering categorical data, we assume a finite mixture of multinomial distributions and implement a new EM algorithm, following a previous version (Silvestre et al., 2008). Results obtained with synthetic datasets are encouraging. The main advantage of the proposed approach, when compared to the above referred criteria, is the speed of execution, which is especially relevant when dealing with large data sets.

Veja mais

Clustering and selecting categorical features

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In data clustering, the problem of selecting the subset of most relevant features from the data has been an active research topic. Feature selection for clustering is a challenging task due to the absence of class labels for guiding the search for relevant features. Most methods proposed for this goal are focused on numerical data. In this work, we propose an approach for clustering and selecting categorical features simultaneously. We assume that the data originate from a finite mixture of multinomial distributions and implement an integrated expectation-maximization (EM) algorithm that estimates all the parameters of the model and selects the subset of relevant features simultaneously. The results obtained on synthetic data illustrate the performance of the proposed approach. An application to real data, referred to official statistics, shows its usefulness.

Veja mais

PAH air pollution at a Portuguese urban area: carcinogenic risks and sources identification

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This study aimed to characterize air pollution and the associated carcinogenic risks of polycyclic aromatic hydrocarbon (PAHs) at an urban site, to identify possible emission sources of PAHs using several statistical methodologies, and to analyze the influence of other air pollutants and meteorological variables on PAH concentrations.The air quality and meteorological data were collected in Oporto, the second largest city of Portugal. Eighteen PAHs (the 16 PAHs considered by United States Environment Protection Agency (USEPA) as priority pollutants, dibenzo[a,l]pyrene, and benzo[j]fluoranthene) were collected daily for 24 h in air (gas phase and in particles) during 40 consecutive days in November and December 2008 by constant low-flow samplers and using polytetrafluoroethylene (PTFE) membrane filters for particulate (PM10 and PM2.5 bound) PAHs and pre-cleaned polyurethane foam plugs for gaseous compounds. The other monitored air pollutants were SO2, PM10, NO2, CO, and O3; the meteorological variables were temperature, relative humidity, wind speed, total precipitation, and solar radiation. Benzo[a]pyrene reached a mean concentration of 2.02 ngm−3, surpassing the EU annual limit value. The target carcinogenic risks were equal than the health-based guideline level set by USEPA (10−6) at the studied site, with the cancer risks of eight PAHs reaching senior levels of 9.98×10−7 in PM10 and 1.06×10−6 in air. The applied statistical methods, correlation matrix, cluster analysis, and principal component analysis, were in agreement in the grouping of the PAHs. The groups were formed according to their chemical structure (number of rings), phase distribution, and emission sources. PAH diagnostic ratios were also calculated to evaluate the main emission sources. Diesel vehicular emissions were the major source of PAHs at the studied site. Besides that source, emissions from residential heating and oil refinery were identified to contribute to PAH levels at the respective area. Additionally, principal component regression indicated that SO2, NO2, PM10, CO, and solar radiation had positive correlation with PAHs concentrations, while O3, temperature, relative humidity, and wind speed were negatively correlated.

Veja mais

Dentistry and HIV/AIDS related stigma

Relevância:

60.00% 60.00%

Publicador:

Resumo:

OBJECTIVE To analyze HIV/AIDS positive individual’s perception and attitudes regarding dental services.METHODS One hundred and thirty-four subjects (30.0% of women and 70.0% of men) from Nuevo León, Mexico, took part in the study (2014). They filled out structured, analytical, self-administered, anonymous questionnaires. Besides the sociodemographic variables, the perception regarding public and private dental services and related professionals was evaluated, as well as the perceived stigma associated with HIV/AIDS, through a Likert-type scale. The statistical evaluation included a factorial and a non-hierarchical cluster analysis.RESULTS Social inequalities were found regarding the search for public and private dental professionals and services. Most subjects reported omitting their HIV serodiagnosis and agreed that dentists must be trained and qualified to treat patients with HIV/AIDS. The factorial analysis revealed two elements: experiences of stigma and discrimination in dental appointments and feelings of concern regarding the attitudes of professionals or their teams concerning patients’ HIV serodiagnosis. The cluster analysis identified three groups: users who have not experienced stigma or discrimination (85.0%); the ones who have not had those experiences, but feel somewhat concerned (12.7%); and the ones who underwent stigma and discrimination and feel concerned (2.3%).CONCLUSIONS We observed a low percentage of stigma and discrimination in dental appointments; however, most HIV/AIDS patients do not reveal their serodiagnosis to dentists out of fear of being rejected. Such fact implies a workplace hazard to dental professionals, but especially to the very own health of HIV/AIDS patients, as dentists will not be able to provide them a proper clinical and pharmaceutical treatment.

Veja mais

945 resultados para cluster analysis

Filtro por publicador