15 resultados para agglomerative clustering
em Universidade do Minho
Resumo:
Usually, data warehousing populating processes are data-oriented workflows composed by dozens of granular tasks that are responsible for the integration of data coming from different data sources. Specific subset of these tasks can be grouped on a collection together with their relationships in order to form higher- level constructs. Increasing task granularity allows for the generalization of processes, simplifying their views and providing methods to carry out expertise to new applications. Well-proven practices can be used to describe general solutions that use basic skeletons configured and instantiated according to a set of specific integration requirements. Patterns can be applied to ETL processes aiming to simplify not only a possible conceptual representation but also to reduce the gap that often exists between two design perspectives. In this paper, we demonstrate the feasibility and effectiveness of an ETL pattern-based approach using task clustering, analyzing a real world ETL scenario through the definitions of two commonly used clusters of tasks: a data lookup cluster and a data conciliation and integration cluster.
Resumo:
When a pregnant woman is guided to a hospital for obstetrics purposes, many outcomes are possible, depending on her current conditions. An improved understanding of these conditions could provide a more direct medical approach by categorizing the different types of patients, enabling a faster response to risk situations, and therefore increasing the quality of services. In this case study, the characteristics of the patients admitted in the maternity care unit of Centro Hospitalar of Porto are acknowledged, allowing categorizing the patient women through clustering techniques. The main goal is to predict the patients’ route through the maternity care, adapting the services according to their conditions, providing the best clinical decisions and a cost-effective treatment to patients. The models developed presented very interesting results, being the best clustering evaluation index: 0.65. The evaluation of the clustering algorithms proved the viability of using clustering based data mining models to characterize pregnant patients, identifying which conditions can be used as an alert to prevent the occurrence of medical complications.
Resumo:
Lecture Notes in Computer Science, 9273
Resumo:
The present paper reports the precipitation process of Al3Sc structures in an aluminum scandium alloy, which has been simulated with a synchronous parallel kinetic Monte Carlo (spkMC) algorithm. The spkMC implementation is based on the vacancy diffusion mechanism. To filter the raw data generated by the spkMC simulations, the density-based clustering with noise (DBSCAN) method has been employed. spkMC and DBSCAN algorithms were implemented in the C language and using MPI library. The simulations were conducted in the SeARCH cluster located at the University of Minho. The Al3Sc precipitation was successfully simulated at the atomistic scale with the spkMC. DBSCAN proved to be a valuable aid to identify the precipitates by performing a cluster analysis of the simulation results. The achieved simulations results are in good agreement with those reported in the literature under sequential kinetic Monte Carlo simulations (kMC). The parallel implementation of kMC has provided a 4x speedup over the sequential version.
Resumo:
Propolis is a chemically complex biomass produced by honeybees (Apis mellifera) from plant resins added of salivary enzymes, beeswax, and pollen. The biological activities described for propolis were also identified for donor plants resin, but a big challenge for the standardization of the chemical composition and biological effects of propolis remains on a better understanding of the influence of seasonality on the chemical constituents of that raw material. Since propolis quality depends, among other variables, on the local flora which is strongly influenced by (a)biotic factors over the seasons, to unravel the harvest season effect on the propolis chemical profile is an issue of recognized importance. For that, fast, cheap, and robust analytical techniques seem to be the best choice for large scale quality control processes in the most demanding markets, e.g., human health applications. For that, UV-Visible (UV-Vis) scanning spectrophotometry of hydroalcoholic extracts (HE) of seventy-three propolis samples, collected over the seasons in 2014 (summer, spring, autumn, and winter) and 2015 (summer and autumn) in Southern Brazil was adopted. Further machine learning and chemometrics techniques were applied to the UV-Vis dataset aiming to gain insights as to the seasonality effect on the claimed chemical heterogeneity of propolis samples determined by changes in the flora of the geographic region under study. Descriptive and classification models were built following a chemometric approach, i.e. principal component analysis (PCA) and hierarchical clustering analysis (HCA) supported by scripts written in the R language. The UV-Vis profiles associated with chemometric analysis allowed identifying a typical pattern in propolis samples collected in the summer. Importantly, the discrimination based on PCA could be improved by using the dataset of the fingerprint region of phenolic compounds ( = 280-400m), suggesting that besides the biological activities of those secondary metabolites, they also play a relevant role for the discrimination and classification of that complex matrix through bioinformatics tools. Finally, a series of machine learning approaches, e.g., partial least square-discriminant analysis (PLS-DA), k-Nearest Neighbors (kNN), and Decision Trees showed to be complementary to PCA and HCA, allowing to obtain relevant information as to the sample discrimination.
Resumo:
Gold nanoparticles were dispersed in two different dielectric matrices, TiO2 and Al2O3, using magnetron sputtering and a post-deposition annealing treatment. The main goal of the present work was to study how the two different host dielectric matrices, and the resulting microstructure evolution (including both the nanoparticles and the host matrix itself) promoted by thermal annealing, influenced the physical properties of the films. In particular, the structure and morphology of the nanocomposites were correlated with the optical response of the thin films, namely their localized surface plasmon resonance (LSPR) characteristics. Furthermore, and in order to scan the future application of the two thin film system in different types of sensors (namely biological ones), their functional behaviour (hardness and Young's modulus change) was also evaluated. Despite the similar Au concentrations in both matrices (~ 11 at.%), very different microstructural features were observed, which were found to depend strongly on the annealing temperature. The main structural differences included: (i) the early crystallization of the TiO2 host matrix, while the Al2O3 one remained amorphous up to 800 °C; (ii) different grain size evolution behaviours with the annealing temperature, namely an almost linear increase for the Au:TiO2 system (from 3 to 11 nm), and the approximately constant values observed in the Au:Al2O3 system (4–5 nm). The results from the nanoparticle size distributions were also found to be quite sensitive to the surrounding matrix, suggesting different mechanisms for the nanoparticle growth (particle migration and coalescence dominating in TiO2 and Ostwald ripening in Al2O3). These different clustering behaviours induced different transmittance-LSPR responses and a good mechanical stability, which opens the possibility for future use of these nanocomposite thin film systems in some envisaged applications (e.g. LSPR-biosensors).
Resumo:
CdS nanoparticles (NPs) were synthesized using colloidal methods and incorporated within a diureasil hybrid matrix. The surface capping of the CdS NPs by 3-mercaptopropyltrimethoxysilane (MPTMS) and 3-aminopropyltrimethoxysilane (APTMS) organic ligands during the incorporation of the NPs within the hybrid matrix has been investigated. The matrix is based on poly(ethylene oxide)/poly(propylene oxide) chains grafted to a siliceous skeleton through urea bonds and was produced by sol–gel process. Both alkaline and acidic catalysis of the sol–gel reaction were used to evaluate the effect of each organic ligand on the optical properties of the CdS NPs. The hybrid materials were characterized by absorption, steady-state and time-resolved photoluminescence spectroscopy and High Resolution Transmission Electron Microscopy (HR-TEM). The preservation of the optical properties of the CdS NPs within the diureasil hybrids was dependent on the experimental conditions used. Both organic ligands (APTMS and MPTMS) demonstrated to be crucial in avoiding the increase of size distribution and clustering of the NPs within the hybrid matrix. The use of organic ligands was also shown to influence the level of interaction between the hybrid host and the CdS NPs. The CdS NPs showed large Stokes shifts and long average lifetimes, both in colloidal solution and in the xerogels, due to the origin of the PL emission in surface states. The CdS NPs capped with MPTMS have lower PL lifetimes compared to the other xerogel samples but still larger than the CdS NPs in the original colloidal solution. An increase in PL lifetimes of the NPs after their incorporation within the hybrid matrix is related to interaction between the NPs and the hybrid host matrix.
Resumo:
Noble metal powders containing gold and silver have been used for many centuries, providing different colours in the windows of the medieval cathedrals and in ancient Roman glasses. Nowadays, the interest in nanocomposite materials containing noble nanoparticles embedded in dielectric matrices is related with their potential use for a wide range of advanced technological applications. They have been proposed for environmental and biological sensing, tailoring colour of functional coatings, or for surface enhanced Raman spectroscopy. Most of these applications rely on the so-called localised surface plasmon resonance absorption, which is governed by the type of the noble metal nanoparticles, their distribution, size and shape and as well as of the dielectric characteristics of the host matrix. The aim of this work is to study the influence of the composition and thermal annealing on the morphological and structural changes of thin films composed of Ag metal clusters embedded in a dielectric TiO2 matrix. Since changes in size, shape and distribution of the clusters are fundamental parameters for tailoring the properties of plasmonic materials, a set of films with different Ag concentrations was prepared. The optical properties and the thermal behaviour of the films were correlated with the structural and morphological changes promoted by annealing. The films were deposited by DC magnetron sputtering and in order to promote the clustering of the Ag nanoparticles the as-deposited samples were subjected to an in-air annealing protocol. It was demonstrated that the clustering of metallic Ag affects the optical response spectrum and the thermal behaviour of the films.
Resumo:
Nanocomposite thin films consisting of a dielectric matrix, such as titanium oxide (TiO2), with embedded gold (Au) nanoparticles were prepared and will be analysed and discussed in detail in the present work. The evolution of morphological and structural features was studied for a wide range of Au concentrations and for annealing treatments in air, for temperatures ranging from 200 to 800 °C. Major findings revealed that for low Au atomic concentrations (at.%), there are only traces of clustering, and just for relatively high annealing temperatures, T ≥ 500 °C. Furthermore, the number of Au nanoparticles is extremely low, even for the highest annealing temperature, T = 800 °C. It is noteworthy that the TiO2 matrix also crystallizes in the anatase phase for annealing temperatures above 300 °C. For intermediate Au contents (5 at.% ≤ CAu ≤ 15 at.%), the formation of gold nanoclusters was much more evident, beginning at lower annealing temperatures (T ≥ 200 °C) with sizes ranging from 2 to 25 nm as the temperature increased. A change in the matrix crystallization from anatase to rutile was also observed in this intermediate range of compositions. For the highest Au concentrations (> 20 at.%), the films tended to form relatively larger clusters, with sizes above 20 nm (for T ≥ 400 °C). It is demonstrated that the structural and morphological characteristics of the films are strongly affected by the annealing temperature, as well as by the particular amounts, size and distribution of the Au nanoparticles dispersed in the TiO2 matrix.
Resumo:
Dissertação de mestrado integrado em Engenharia e Gestão de Sistemas de Informação
Resumo:
Programa Doutoral em Engenharia Eletrónica e de Computadores
Resumo:
Programa Doutoral em Líderes para as Indústrias Tecnológicas
Resumo:
Tese de Doutoramento em Ciências da Saúde.
Resumo:
The aim of this study was to determine if mycobacterial lineages affect infection risk, clustering, and disease progression among Mycobacterium tuberculosis cases in The Netherlands. Multivariate negative binomial regression models adjusted for patient-related factors and stratified by patient ethnicity were used to determine the association between phylogenetic lineages and infectivity (mean number of positive contacts around each patient) and clustering (as defined by number of secondary cases within 2 years after diagnosis of an index case sharing the same fingerprint) indices. An estimate of progression to disease by each risk factor was calculated as a bootstrapped risk ratio of the clustering index by the infectivity index. Compared to the Euro-American reference, Mycobacterium africanum showed significantly lower infectivity and clustering indices in the foreign-born population, while Mycobacterium bovis showed significantly lower infectivity and clustering indices in the native population. Significantly lower infectivity was also observed for the East African Indian lineage in the foreign-born population. Smear positivity was a significant risk factor for increased infectivity and increased clustering. Estimates of progression to disease were significantly associated with age, sputum-smear status, and behavioral risk factors, such as alcohol and intravenous drug abuse, but not with phylogenetic lineages. In conclusion, we found evidence of a bacteriological factor influencing indicators of a strain's transmissibility, namely, a decreased ability to infect and a lower clustering index in ancient phylogenetic lineages compared to their modern counterparts. Confirmation of these findings via follow-up studies using tuberculin skin test conversion data should have important implications on M. tuberculosis control efforts.
Resumo:
Tese de Doutoramento em Ciências da Saúde