Biblioteca Digital

945 resultados para Cluster-analysis

An R Library for Compositional Data Analysis in Archaeometry

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Compositional data naturally arises from the scientific analysis of the chemical composition of archaeological material such as ceramic and glass artefacts. Data of this type can be explored using a variety of techniques, from standard multivariate methods such as principal components analysis and cluster analysis, to methods based upon the use of log-ratios. The general aim is to identify groups of chemically similar artefacts that could potentially be used to answer questions of provenance. This paper will demonstrate work in progress on the development of a documented library of methods, implemented using the statistical package R, for the analysis of compositional data. R is an open source package that makes available very powerful statistical facilities at no cost. We aim to show how, with the aid of statistical software such as R, traditional exploratory multivariate analysis can easily be used alongside, or in combination with, specialist techniques of compositional data analysis. The library has been developed from a core of basic R functionality, together with purpose-written routines arising from our own research (for example that reported at CoDaWork'03). In addition, we have included other appropriate publicly available techniques and libraries that have been implemented in R by other authors. Available functions range from standard multivariate techniques through to various approaches to log-ratio analysis and zero replacement. We also discuss and demonstrate a small selection of relatively new techniques that have hitherto been little-used in archaeometric applications involving compositional data. The application of the library to the analysis of data arising in archaeometry will be demonstrated; results from different analyses will be compared; and the utility of the various methods discussed

Statistical analysis of eight surface ozone measurement series for various sites in Ireland

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Data from various stations having different measurement record periods between 1988 and 2007 are analyzed to investigate the surface ozone concentration, long-term trends, and seasonal changes in and around Ireland. Time series statistical analysis is performed on the monthly mean data using seasonal and trend decomposition procedures and the Box-Jenkins approach (autoregressive integrated moving average). In general, ozone concentrations in the Irish region are found to have a negative trend at all sites except at the coastal sites of Mace Head and Valentia. Data from the most polluted Dublin city site have shown a very strong negative trend of −0.33 ppb/yr with a 95% confidence limit of 0.17 ppb/yr (i.e., −0.33 ± 0.17) for the period 2002−2007, and for the site near the city of Cork, the trend is found to be −0.20 ± 0.11 ppb/yr over the same period. The negative trend for other sites is more pronounced when the data span is considered from around the year 2000 to 2007. Rural sites of Wexford and Monaghan have also shown a very strong negative trend of −0.99 ± 0.13 and −0.58 ± 0.12, respectively, for the period 2000−2007. Mace Head, a site that is representative of ozone changes in the air advected from the Atlantic to Europe in the marine planetary boundary layer, has shown a positive trend of about +0.16 ± 0.04 ppb per annum over the entire period 1988−2007, but this positive trend has reduced during recent years (e.g., in the period 2001−2007). Cluster analysis for back trajectories are performed for the stations having a long record of data, Mace Head and Lough Navar. For Mace Head, the northern and western clean air sectors have shown a similar positive trend (+0.17 ± 0.02 ppb/yr for the northern sector and +0.18 ± 0.02 ppb/yr for the western sector) for the whole period, but partial analysis for the clean western sector at Mace Head shows different trends during different time periods with a decrease in the positive trend since 1988 indicating a deceleration in the ozone trend for Atlantic air masses entering Europe.

Analysis of the bacterial communities present in lungs of patients with cystic fibrosis from American and British centers

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The aim of this study was to determine whether geographical differences impact the composition of bacterial communities present in the airways of cystic fibrosis (CF) patients attending CF centers in the United States or United Kingdom. Thirty-eight patients were matched on the basis of clinical parameters into 19 pairs comprised of one U.S. and one United Kingdom patient. Analysis was performed to determine what, if any, bacterial correlates could be identified. Two culture-independent strategies were used: terminal restriction fragment length polymorphism (T-RFLP) profiling and 16S rRNA clone sequencing. Overall, 73 different terminal restriction fragment lengths were detected, ranging from 2 to 10 for U.S. and 2 to 15 for United Kingdom patients. The statistical analysis of T-RFLP data indicated that patient pairing was successful and revealed substantial transatlantic similarities in the bacterial communities. A small number of bands was present in the vast majority of patients in both locations, indicating that these are species common to the CF lung. Clone sequence analysis also revealed that a number of species not traditionally associated with the CF lung were present in both sample groups. The species number per sample was similar, but differences in species presence were observed between sample groups. Cluster analysis revealed geographical differences in bacterial presence and relative species abundance. Overall, the U.S. samples showed tighter clustering with each other compared to that of United Kingdom samples, which may reflect the lower diversity detected in the U.S. sample group. The impact of cross-infection and biogeography is considered, and the implications for treating CF lung infections also are discussed.

Advanced multivariate analysis to assess remediation of hydrocarbons in soils

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Accurate monitoring of degradation levels in soils is essential in order to understand and achieve complete degradation of petroleum hydrocarbons in contaminated soils. We aimed to develop the use of multivariate methods for the monitoring of biodegradation of diesel in soils and to determine if diesel contaminated soils could be remediated to a chemical composition similar to that of an uncontaminated soil. An incubation experiment was set up with three contrasting soil types. Each soil was exposed to diesel at varying stages of degradation and then analysed for key hydrocarbons throughout 161 days of incubation. Hydrocarbon distributions were analysed by Principal Coordinate Analysis and similar samples grouped by cluster analysis. Variation and differences between samples were determined using permutational multivariate analysis of variance. It was found that all soils followed trajectories approaching the chemical composition of the unpolluted soil. Some contaminated soils were no longer significantly different to that of uncontaminated soil after 161 days of incubation. The use of cluster analysis allows the assignment of a percentage chemical similarity of a diesel contaminated soil to an uncontaminated soil sample. This will aid in the monitoring of hydrocarbon contaminated sites and the establishment of potential endpoints for successful remediation.

Under-reporting of energy intake is more prevalent in a healthy dietary pattern cluster

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The aim of the present study was to determine whether under-reporting rates vary between dietary pattern Clusters. Subjects were sixty-five Brazilian women. During 3 weeks, anthropometric data were collected. total energy expenditure (TEE) was determined by the doubly labelled water method and diet Was Measured. Energy intake (El) and the daily frequency of consumption per 1000 kJ of twenty-two food groups were obtained from a FFQ. These frequencies were entered into a Cluster analysis procedure in order to obtain dietary patterns. Under-reporters were defined Lis those who did not lose more than 1 kg of body weight during the study and presented EI:TEE less than 0.82. Three dietary pattern clusters were identified and named according to their most recurrent food groups: sweet foods (SW). starchy foods (ST) and health), (H). Subjects from the healthy cluster had the lowest mean EI:TEE (SW = 0.86, ST = 0.71 and H = 0.58: P = 0.003) and EI - TEE (SW = -0.49 MJ, ST = - 3.20 MJ and H = -5.09 MJ; P = 0.008). The proportion of Under-reporters was 45.2 (95 % CI 35.5, 55.0) % in the SW Cluster: 58.3 (95 % CI 48.6, 68.0) % in the ST Cluster and 70.0 (95 % CI 61.0, 79) % in the H cluster (P=0.34). Thus, in Brazilian women, Under-reporting of El is not uniformly distributed among, dietary pattern clusters and tends to be more severe among subjects from the healthy cluster. This cluster is more consistent with both dietary guidelines and with what lay individuals usually consider `healthy eating`.

Analysis of genotypic variation in genes associated with virulence in Aggregatibacter actinomycetemcomitans clinical isolates

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Background and Objective: Although certain serotypes of Aggregatibacter actinomycetemcomitans are associated more with aggressive periodontitis than are other serotypes, the correlation between distinct lineages and virulence traits in this species is poorly understood. This study aimed to evaluate the polymorphism of genes encoding putative virulence factors of clinical isolates, and to correlate these findings with A. actinomycetemcomitans serotypes, genotypes and periodontal status of the hosts. Material and Methods: Twenty-six clinical isolates from diverse geographic populations with different periodontal conditions were evaluated. Genotyping was performed using pulse-field gel electrophoresis. Polymorphisms in the genes encoding leukotoxin, Aae, ApaH and determinants for serotype-specific O polysaccharide were investigated. Results: The isolates were classified into serotypes a-f, and exhibited three apaH genotypes, five aae alleles and 25 macrorestriction profiles. Two serotype b isolates (7.7%), obtained from Brazilian patients with aggressive periodontitis, were associated with the highly leukotoxic genotype; these isolates showed identical fingerprint patterns and aae and apaH genotypes. Serotype c, obtained from various periodontal conditions, was the most prevalent among Brazilian isolates, and isolates were distributed in two aae alleles, but formed a genetically distinct group based on apaH analysis. Cluster analysis showed a close relationship between fingerprinting genotypes and serotypes/apaH genotypes, but not with aae genotypes. Conclusion: Apart from the deletion in the ltx promoter region, no disease-associated markers were identified. Non-JP2-like strains recovered from individuals with periodontal disease exhibited considerable genetic variation regarding aae/apaH genotypes, serotypes and XhoI DNA fingerprints.

Multi-objective clustering ensemble for gene expression data analysis

Relevância:

70.00% 70.00%

Publicador:

Resumo:

In this paper, we present an algorithm for cluster analysis that integrates aspects from cluster ensemble and multi-objective clustering. The algorithm is based on a Pareto-based multi-objective genetic algorithm, with a special crossover operator, which uses clustering validation measures as objective functions. The algorithm proposed can deal with data sets presenting different types of clusters, without the need of expertise in cluster analysis. its result is a concise set of partitions representing alternative trade-offs among the objective functions. We compare the results obtained with our algorithm, in the context of gene expression data sets, to those achieved with multi-objective Clustering with automatic K-determination (MOCK). the algorithm most closely related to ours. (C) 2009 Elsevier B.V. All rights reserved.

Multivariate analysis of fresh-cut carambola slices stored under different temperatures

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Ingredient classification according to the digestible amino acid profile: an exploratory analysis

Relevância:

70.00% 70.00%

Publicador:

Resumo:

This study aimed: 1) to classify ingredients according to the digestible amino acid (AA) profile; 2) to determine ingredients with AA profile closer to the ideal for broiler chickens; and 3) to compare digestible AA profiles from simulated diets with the ideal protein profile. The digestible AA levels of 30 ingredients were compiled from the literature and presented as percentages of lysine according to the ideal protein concept. Cluster and principal component analyses (exploratory analyses) were used to compose and describe groups of ingredients according to AA profiles. Four ingredient groups were identified by cluster analysis, and the classification of the ingredients within each of these groups was obtained from a principal component analysis, showing 11 classes of ingredients with similar digestible AA profiles. The ingredients with AA profiles closer to the ideal protein were meat and bone meal 45, fish meal 60 and wheat germ meal, all of them constituting Class 1; the ingredients from the other classes gradually diverged from the ideal protein. Soybean meal, which is the main protein source for poultry, showed good AA balance since it was included in Class 3. on the contrary, corn, which is the main energy source in poultry diets, was classified in Class 8. Dietary AA profiles were improved when corn and/or soybean meal were partially or totally replaced in the simulations by ingredients with better AA balance.

GC Fingerprints Coupled to Pattern-Recognition Multivariate SIMCA Chemometric Analysis for Brazilian Gasoline Quality Studies

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Screening Brazilian commercial gasoline quality by hydrogen nuclear magnetic resonance spectroscopic fingerprintings and pattern-recognition multivariate chemometric analysis

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The identification of gasoline adulteration by organic solvents is not an easy task, because compounds that constitute the solvents are already in gasoline composition. In this work, the combination of Hydrogen Nuclear Magnetic Resonance ((1)H NMR) spectroscopic fingerprintings with pattern-recognition multivariate Soft Independent Modeling of Class Analogy (SIMCA) chemometric analysis provides an original and alternative approach to screening Brazilian commercial gasoline quality in a Monitoring Program for Quality Control of Automotive Fuels. SIMCA was performed on spectroscopic fingerprints to classify the quality of representative commercial gasoline samples selected by Hierarchical Cluster Analysis (HCA) and collected over a 6-month period from different gas stations in the São Paulo state, Brazil. Following optimized the (1)H NMR-SIMCA algorithm, it was possible to correctly classify 92.0% of commercial gasoline samples, which is considered acceptable. The chemometric method is recommended for routine applications in Quality-Control Monitoring Programs, since its measurements are fast and can be easily automated. Also, police laboratories could employ this method for rapid screening analysis to discourage adulteration practices. (C) 2010 Elsevier B.V. All rights reserved.

Analysis of floristic composition and structure as an aid to monitoring protected areas of dense rain forest in southeastern Brazil

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Techniques of digital image analysis for histological quantification of melanin

Relevância:

70.00% 70.00%

Publicador:

Resumo:

A análise morfométrica da melanina tecidual pode subsidiar quantitativamente a pesquisa em discromias. Os autores demonstram três técnicas de análise de imagem digital que permitem a identificação dos pixels equivalentes à melanina na epiderme pela coloração de Fontana-Masson, possibilitando o cálculo da sua porcentagem nas diferentes camadas da epiderme, e discutem os principais elementos relacionados à análise e a necessidade de rigorosa padronização do processo.

Esterase patterns and phylogenetic relationships of species and strains included in the Drosophila buzzatii cluster

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Ten strains of two species in the Drosophila buzzatii cluster (D. serido and D. seriema) were examined as to esterase patterns using polyacrylamide gel electrophoresis. The migration rate of esterases, and their substrate specificity to alpha and beta naphthyl acetates, were analysed. Other esterase features such as inhibition behaviour, presence in males and females and location in the head, thorax or abdomen of flies, were also examined. The present data,together with results obtained by others for eight strains of D. koepferae, D. serido, D. seriema and D. buzzatii, show that 69 bands have been detected in the eighteen strains studied. This total number of bands was used for comparison of strains and species by similarity index, analysis of dependence and cluster analysis. The comparisons confirmed the existence of a high degree of similarity among D. seriema strains and among D. koepferae strains, but indicated differentiation among the D. serido strains. Two strains (D69R2 and D69R5) which differed from the others of the latter species, showed closer affinities with D. buzzatii, which indicates the need for further work on those strains classified as D. serido.

Estimates of genetic parameters, and cluster and principal components analyses of breeding values related to egg production traits in a White Leghorn population

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

«
1
2
...
5
6
7
8
9
10
11
...
62
63
»