65 resultados para Data clustering. Fuzzy C-Means. Cluster centers initialization. Validation indices
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
There is a family of well-known external clustering validity indexes to measure the degree of compatibility or similarity between two hard partitions of a given data set, including partitions with different numbers of categories. A unified, fully equivalent set-theoretic formulation for an important class of such indexes was derived and extended to the fuzzy domain in a previous work by the author [Campello, R.J.G.B., 2007. A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment. Pattern Recognition Lett., 28, 833-841]. However, the proposed fuzzy set-theoretic formulation is not valid as a general approach for comparing two fuzzy partitions of data. Instead, it is an approach for comparing a fuzzy partition against a hard referential partition of the data into mutually disjoint categories. In this paper, generalized external indexes for comparing two data partitions with overlapping categories are introduced. These indexes can be used as general measures for comparing two partitions of the same data set into overlapping categories. An important issue that is seldom touched in the literature is also addressed in the paper, namely, how to compare two partitions of different subsamples of data. A number of pedagogical examples and three simulation experiments are presented and analyzed in details. A review of recent related work compiled from the literature is also provided. (c) 2010 Elsevier B.V. All rights reserved.
Resumo:
This paper presents the design and implementation of an embedded soft sensor, i. e., a generic and autonomous hardware module, which can be applied to many complex plants, wherein a certain variable cannot be directly measured. It is implemented based on a fuzzy identification algorithm called ""Limited Rules"", employed to model continuous nonlinear processes. The fuzzy model has a Takagi-Sugeno-Kang structure and the premise parameters are defined based on the Fuzzy C-Means (FCM) clustering algorithm. The firmware contains the soft sensor and it runs online, estimating the target variable from other available variables. Tests have been performed using a simulated pH neutralization plant. The results of the embedded soft sensor have been considered satisfactory. A complete embedded inferential control system is also presented, including a soft sensor and a PID controller. (c) 2007, ISA. Published by Elsevier Ltd. All rights reserved.
Resumo:
This paper is concerned with the computational efficiency of fuzzy clustering algorithms when the data set to be clustered is described by a proximity matrix only (relational data) and the number of clusters must be automatically estimated from such data. A fuzzy variant of an evolutionary algorithm for relational clustering is derived and compared against two systematic (pseudo-exhaustive) approaches that can also be used to automatically estimate the number of fuzzy clusters in relational data. An extensive collection of experiments involving 18 artificial and two real data sets is reported and analyzed. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
This paper tackles the problem of showing that evolutionary algorithms for fuzzy clustering can be more efficient than systematic (i.e. repetitive) approaches when the number of clusters in a data set is unknown. To do so, a fuzzy version of an Evolutionary Algorithm for Clustering (EAC) is introduced. A fuzzy cluster validity criterion and a fuzzy local search algorithm are used instead of their hard counterparts employed by EAC. Theoretical complexity analyses for both the systematic and evolutionary algorithms under interest are provided. Examples with computational experiments and statistical analyses are also presented.
Resumo:
One of the top ten most influential data mining algorithms, k-means, is known for being simple and scalable. However, it is sensitive to initialization of prototypes and requires that the number of clusters be specified in advance. This paper shows that evolutionary techniques conceived to guide the application of k-means can be more computationally efficient than systematic (i.e., repetitive) approaches that try to get around the above-mentioned drawbacks by repeatedly running the algorithm from different configurations for the number of clusters and initial positions of prototypes. To do so, a modified version of a (k-means based) fast evolutionary algorithm for clustering is employed. Theoretical complexity analyses for the systematic and evolutionary algorithms under interest are provided. Computational experiments and statistical analyses of the results are presented for artificial and text mining data sets. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
[Ru(3)O(CH(3)COO)(6)(pz)(CO)](6) is a cyclic hexamer species encompassing six triangular ruthenium cluster centers bridged by pyrazine ligands. The electronic communication among the cluster units strongly depends on their oxidation states, and has been successfully probed by means of cyclic voltammetry and UV-vis spectroelectrochemistry. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
In the current work, we studied the effect of the nonionic detergent dodecyloctaethyleneglycol, C(12)E(8), on the structure and oligomeric form of the Na,K-ATPase membrane enzyme (sodium-potassium pump) in aqueous suspension, by means of small-angle X-ray scattering (SAXS). Samples composed of 2 mg/mL of Na,K-ATPase, extracted from rabbit kidney medulla, in the presence of a small amount of C(12)E(8) (0.005 mg/mL) and in larger concentrations ranging from 2.7 to 27 mg/mL did not present catalytic activity. Under this condition, an oligomerization of the alpha subunits is expected. SAXS data were analyzed by means of a global fitting procedure supposing that the scattering is due to two independent contributions: one coming from the enzyme and the other one from C(12)E(8) micelles. In the small detergent content (0.005 mg/mL), the SAXS results evidenced that Na,K-ATPase is associated into aggregates larger than (alpha beta)(2) form. When 2.7 mg/mL of C(12)E(8) is added, the data analysis revealed the presence of alpha(4) aggregates in the solution and some free micelles. Increasing the detergent amount up to 27 mg/mL does not disturb the alpha(4) aggregate: just more micelles of the same size and shape are proportionally formed in solution. We believe that our results shed light on a better understanding of how nonionic detergents induce subunit dissociation and reassembling to minimize the exposure of hydrophobic residues to the aqueous solvent.
Resumo:
Clustering quality or validation indices allow the evaluation of the quality of clustering in order to support the selection of a specific partition or clustering structure in its natural unsupervised environment, where the real solution is unknown or not available. In this paper, we investigate the use of quality indices mostly based on the concepts of clusters` compactness and separation, for the evaluation of clustering results (partitions in particular). This work intends to offer a general perspective regarding the appropriate use of quality indices for the purpose of clustering evaluation. After presenting some commonly used indices, as well as indices recently proposed in the literature, key issues regarding the practical use of quality indices are addressed. A general methodological approach is presented which considers the identification of appropriate indices thresholds. This general approach is compared with the simple use of quality indices for evaluating a clustering solution.
Resumo:
Method. Participants were 18 years of age or older, who had been discharged from hospitalisation between 6 months and 1 year before the interview, or who underwent reconstructive surgery during the previous year, or who were under outpatient follow-up awaiting reconstructive surgery. Data were collected by means of semi-structured interviews. Results. Thirty-eight of the 44 participants (86.4%) reported some type of changes associated with the burn injury, the treatment, or both, regarding the following aspects: work, leisure, relationships, religious ties, educational activities and habits (smoking, using alcohol and drugs and dressing style). The data showed a statistically significant association between burns on at least one of the upper limbs (with or without hands) and changes in work. Conclusions. Some of the aspects mentioned by the participants, such as work and leisure activities, need to be further researched in order to improve our understanding of the impact that these changes causes in the person`s life.
Resumo:
Candidemia is associated with high morbidity and mortality resulting in significant increases in the length of patients` hospitalization and in healthcare costs. Critically ill patients are at particular risk for candidemia because of their debilitated condition and frequent need for invasive procedures. The aim of this study was to characterize the incidence and epidemiology of candidemia over a seven-year period in intensive care units (ICUs) and the use of fluconazole and caspofungin in a large university-affiliated hospital. All cases of candidemia were identified by surveillance, using the Centers for Diseases Control and Prevention criteria. Demographic variables, use of antifungal (fluconazole and caspofungin) and patient outcomes were evaluated. The 2 test for linear trend was employed to evaluate the distribution of Candida spp. and the use of fluconazole and caspofungin by defined daily dose (DDD) per 1,000 patients-days during the study period. One hundred and eight episodes of candidemia were identified. The overall incidence of candidemia (P=0.20) and incidence of non-Candida albicans Candida infections (P=0.32) remained stable over the study period and ranged from 0.3-0.9 episodes per 1,000 catheter-days and 0.39-0.83 episodes per 1,000 patients-days. However, the use of fluconazole and caspofungin increased significantly (P0.001). While there were no reports of the use of fluconazole for prophylaxis in 1999, its use for this purpose increased from 3% in 2000 to 7.0% (P=0.07) in 2006. C. albicans was the most frequent specie isolated and burns and cancer were the most frequent underlying conditions. The overall mortality was 76%. There was no difference between C. albicans and non-C. albicans Candida infections when the crude and 14-day mortality rates were compared. Our data demonstrated that C. albicans is still the most frequent species causing candidemia in our intensive care units. Our rates of candidemia are lower than those reported from the region and similar to American and European hospitals. Although the incidence of blood stream infections (BSI) and candidemia remained stable, the use of fluconazole and caspofungin increased significantly over the years included in this study but had no impact on the incidence of infections caused by non-C. albicans Candida species.
Resumo:
Objective: The purpose of the study was to investigate whether dentine irradiation with a pulsed CO(2) laser (10.6 mu m) emitting pulses of 10 ms is capable of reducing dentine calcium and phosphorus losses in an artificial caries model. Design: The 90 dentine slabs obtained from bovine teeth were randomly divided into six groups (n = 15): negative control group (GC); positive control group, treated with fluoride 1.23% (GF); and laser groups irradiated with 8 J/cm(2) (L8); irradiated as in L8 + fluoride 1.23% (L8F); irradiated with 11j/cm(2) (L11); irradiated as in L11 + fluoride 1.23% (L11F). After laser irradiation the samples were submitted to a pH-cycling model for 9 days. The calcium and phosphorous contents in the de- and remineralization solutions were measured by means of inductively coupled plasma optical emission spectrometer - ICP-OES. Additionally intra-pulpal temperature measurements were performed. The obtained data were analysed by means of ANOVA and Tukey`s test (alpha = 0.05). Results: In the demineralization solutions the groups L11F and GF presented significantly lower means of calcium and phosphorous losses than the control group; and in L11F means were significantly lower than in the fluoride group. Both irradiation parameters tested caused intrapulpal temperature increase below 2 degrees C. Conclusion: It can be concluded that under the conditions of this study, CO(2) laser irradiation (10.6 mu m) with 11J/cm(2) (540 mJ and 10 Hz) of fluoride treated dentine surfaces decreases the loss of calcium and phosphorous in the demineralization process and does not cause excessive temperature increase inside the pulp chamber. (C) 2010 Elsevier Ltd. All rights reserved.
Resumo:
Dentin adhesion procedure presents limitations, especially regarding to lifetime stability of formed hybrid layer. Alternative procedures have been studied in order to improve adhesion to dentin. OBJECTIVE: The aim of this study was to evaluate in vitro the influence of deproteinization or dentin tubular occlusion, as well as the combination of both techniques, on microtensile bond strength (µTBS) and marginal microleakage of composite resin restorations. MATERIAL AND METHODS: Extracted erupted human third molars were randomly divided into 4 groups. Dentin surfaces were treated with one of the following procedures: (A) 35% phosphoric acid gel (PA) + adhesive system (AS); (B) PA + 10% NaOCl + AS; (C) PA + oxalate + AS and (D) PA + oxalate + 10% NaOCl + AS. Bond strength data were analyzed statistically by two-way ANOVA and Tukey's test. The microleakage scores were analyzed using Kruskal-Wallis and Mann-Whitney non-parametric tests. Significance level was set at 0.05 for all analyses. RESULTS: µTBS data presented statistically lower values for groups D and B, ranking data as A>C>B>D. The use of oxalic acid resulted in microleakage reduction along the tooth/restoration interface, being significant when used alone. On the other hand, the use of 10% NaOCl alone or in combination with oxalic acid, resulted in increased microleakage. CONCLUSIONS: Dentin deproteinization with 10% NaOCl or in combination with oxalate significantly compromised both the adhesive bond strength and the microleakage at interface. Tubular occlusion prior to adhesive system application seems to be a useful technique to reduce marginal microleakage.
Resumo:
Diversos autores relatam que a consulta médica se associa a melhores resultados quando se adota como referencial o modelo centrado no paciente. OBJETIVO: Avaliar se os médicos ingressantes na residência de Pediatria realizam consultas ambulatoriais segundo pressupostos do modelo centrado no paciente. MÉTODO: Em 2007, no início de seu estágio de ambulatório, dez residentes foram selecionados aleatoriamente para serem filmados durante a realização de uma consulta. Adotando-se como referencial teórico pressupostos do modelo centrado no paciente, os dados foram analisados por meio de metodologia qualitativa, por meio da técnica exploratória, com três juízes independentes. RESULTADOS: A maioria dos residentes explora precocemente a primeira queixa referida pelos pais, assumindo-a como principal; não explora outras queixas; decide e faz orientações terapêuticas de modo não compartilhado; conversa pouco com as crianças; cria longos momentos de silêncio durante a consulta; não explica o exame físico e às vezes utiliza o prontuário como a principal fonte de informação. CONCLUSÃO: Os residentes realizam consultas sem a inclusão da perspectiva dos pais e, portanto, não atendem segundo pressupostos do modelo centrado no paciente.
Resumo:
A negligência e abandono constituem-se uma das formas mais frequentes de maus tratos. No entanto, seu conhecimento ainda está em processo de construção. Objetivo: analisar as características da negligência/abandono contra menores de 15 anos residentes em Londrina, PR, cujo evento foi notificado aos Conselhos Tutelares e serviços de atendimento, em 2006. Método: Estudo transversal e descritivo, cujos dados foram processados pelo programa EPI Info. Resultados: Foram obtidos 308 casos, cuja notificação se deu, principalmente, por profissionais de saúde (67,2 por cento). As vítimas do sexo feminino predominaram (72,7 por cento) e maiores coeficientes foram aos 4 anos (13,8 e 5,0 por 1.000 no sexo feminino e masculino, respectivamente). Os agressores foram mãe (69,5 por cento) e madrasta (22,2 por cento). As quesões da maternidade, ou seja, presença de filho não natural (32,8 por cento) e a pouca idade da mãe (20,8 por cento) foram as características mais associadas. As vítimas sofreram o abuso por 1 a 2 anos antes da notificação (62,7 por cento). Conclusões: O estudo contribui para ampliar o conhecimento acerca da negligência e abandono praticada contra menores. É preciso que os órgãos competentes trabalhem para a detecção precoce, a fim de possibilitar tratamento e acompanhamento adequados que possam reduzir as importantes sequelas decorrentes
Resumo:
Background: The inherent complexity of statistical methods and clinical phenomena compel researchers with diverse domains of expertise to work in interdisciplinary teams, where none of them have a complete knowledge in their counterpart's field. As a result, knowledge exchange may often be characterized by miscommunication leading to misinterpretation, ultimately resulting in errors in research and even clinical practice. Though communication has a central role in interdisciplinary collaboration and since miscommunication can have a negative impact on research processes, to the best of our knowledge, no study has yet explored how data analysis specialists and clinical researchers communicate over time. Methods/Principal Findings: We conducted qualitative analysis of encounters between clinical researchers and data analysis specialists (epidemiologist, clinical epidemiologist, and data mining specialist). These encounters were recorded and systematically analyzed using a grounded theory methodology for extraction of emerging themes, followed by data triangulation and analysis of negative cases for validation. A policy analysis was then performed using a system dynamics methodology looking for potential interventions to improve this process. Four major emerging themes were found. Definitions using lay language were frequently employed as a way to bridge the language gap between the specialties. Thought experiments presented a series of ""what if'' situations that helped clarify how the method or information from the other field would behave, if exposed to alternative situations, ultimately aiding in explaining their main objective. Metaphors and analogies were used to translate concepts across fields, from the unfamiliar to the familiar. Prolepsis was used to anticipate study outcomes, thus helping specialists understand the current context based on an understanding of their final goal. Conclusion/Significance: The communication between clinical researchers and data analysis specialists presents multiple challenges that can lead to errors.