13 resultados para exploratory data analysis

em Universidade do Minho


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Dissertação de mestrado integrado em Engenharia e Gestão de Sistemas de Informação

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tese de Doutoramento em Engenharia Civil.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Programa Doutoral em Matemática e Aplicações.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Recently, there has been a growing interest in the field of metabolomics, materialized by a remarkable growth in experimental techniques, available data and related biological applications. Indeed, techniques as Nuclear Magnetic Resonance, Gas or Liquid Chromatography, Mass Spectrometry, Infrared and UV-visible spectroscopies have provided extensive datasets that can help in tasks as biological and biomedical discovery, biotechnology and drug development. However, as it happens with other omics data, the analysis of metabolomics datasets provides multiple challenges, both in terms of methodologies and in the development of appropriate computational tools. Indeed, from the available software tools, none addresses the multiplicity of existing techniques and data analysis tasks. In this work, we make available a novel R package, named specmine, which provides a set of methods for metabolomics data analysis, including data loading in different formats, pre-processing, metabolite identification, univariate and multivariate data analysis, machine learning, and feature selection. Importantly, the implemented methods provide adequate support for the analysis of data from diverse experimental techniques, integrating a large set of functions from several R packages in a powerful, yet simple to use environment. The package, already available in CRAN, is accompanied by a web site where users can deposit datasets, scripts and analysis reports to be shared with the community, promoting the efficient sharing of metabolomics data analysis pipelines.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

As huge amounts of data become available in organizations and society, specific data analytics skills and techniques are needed to explore this data and extract from it useful patterns, tendencies, models or other useful knowledge, which could be used to support the decision-making process, to define new strategies or to understand what is happening in a specific field. Only with a deep understanding of a phenomenon it is possible to fight it. In this paper, a data-driven analytics approach is used for the analysis of the increasing incidence of fatalities by pneumonia in the Portuguese population, characterizing the disease and its incidence in terms of fatalities, knowledge that can be used to define appropriate strategies that can aim to reduce this phenomenon, which has increased more than 65% in a decade.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

BACKGROUND: Furniture companies can analyze their safety status using quantitative measures. However, the data needed are not always available and the number of accidents is under-reported. Safety climate scales may be an alternative. However, there are no validated Portuguese scales that account for the specific attributes of the furniture sector. OBJECTIVE: The current study aims to develop and validate an instrument that uses a multilevel structure to measure the safety climate of the Portuguese furniture industry. METHODS: The Safety Climate in Wood Industries (SCWI) model was developed and applied to the safety climate analysis using three different scales: organizational, group and individual. A multilevel exploratory factor analysis was performed to analyze the factorial structure. The studied companies’ safety conditions were also analyzed. RESULTS: Different factorial structures were found between and within levels. In general, the results show the presence of a group-level safety climate. The scores of safety climates are directly and positively related to companies’ safety conditions; the organizational scale is the one that best reflects the actual safety conditions. CONCLUSIONS: The SCWI instrument allows for the identification of different safety climates in groups that comprise the same furniture company and it seems to reflect those groups’ safety conditions. The study also demonstrates the need for a multilevel analysis of the studied instrument.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Tese de Doutoramento em Ciências da Educação (Área de Conhecimento: Educação ambiental e para a Sustentabilidade)

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Extreme value models are widely used in different areas. The Birnbaum–Saunders distribution is receiving considerable attention due to its physical arguments and its good properties. We propose a methodology based on extreme value Birnbaum–Saunders regression models, which includes model formulation, estimation, inference and checking. We further conduct a simulation study for evaluating its performance. A statistical analysis with real-world extreme value environmental data using the methodology is provided as illustration.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

High levels of marine salt deposition present in coastal areas have a relevant effect on road runoff characteristics. This study assesses this effect with the purpose of identifying the relationships between monitored water quality parameters and intrinsic site variables. To achieve this objective, an extensive monitoring program was conducted on a Portuguese coastal highway. The study included 30 rainfall events, in different weather, traffic, and salt deposition conditions. The evaluations of various water quality parameters were carried out in over 200 samples. In addition, the meteorological, hydrological, and traffic parameters were continuously measured. The salt deposition rates were determined by means of a wet candle device, which is an innovative feature of the monitoring program. The relation between road runoff pollutants and independent variables associated with weather, traffic, and salt deposition conditions was assessed. Significant correlations among pollutants were observed. A high salinity concentration and its influence on the road runoff were confirmed. Furthermore, the concentrations of the most relevant pollutants seemed to be very dependent on some meteorological variables, particularly the duration of the antecedent dry period prior to each rainfall event and the average wind speed.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This data article is referred to the research article entitled The role of ascorbate peroxidase, guaiacol peroxidase, and polysaccharides in cassava (Manihot esculenta Crantz) roots under postharvest physiological deterioration by Uarrota et al. (2015). Food Chemistry 197, Part A, 737746. The stress duo to PPD of cassava roots leads to the formation of ROS which are extremely harmful and accelerates cassava spoiling. To prevent or alleviate injuries from ROS, plants have evolved antioxidant systems that include non-enzymatic and enzymatic defence systems such as ascorbate peroxidase, guaiacol peroxidase and polysaccharides. In this data article can be found a dataset called newdata, in RData format, with 60 observations and 06 variables. The first 02 variables (Samples and Cultivars) and the last 04, spectrophotometric data of ascorbate peroxidase, guaiacol peroxidase, tocopherol, total proteins and arcsined data of cassava PPD scoring. For further interpretation and analysis in R software, a report is also provided. Means of all variables and standard deviations are also provided in the Supplementary tables (data.long3.RData, data.long4.RData and meansEnzymes.RData), raw data of PPD scoring without transformation (PPDmeans.RData) and days of storage (days.RData) are also provided for data analysis reproducibility in R software.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Dissertação de mestrado integrado em Engenharia e Gestão de Sistemas de Informação

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Dissertação de mestrado integrado em Engenharia e Gestão de Sistemas de Informação

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Here we focus on factor analysis from a best practices point of view, by investigating the factor structure of neuropsychological tests and using the results obtained to illustrate on choosing a reasonable solution. The sample (n=1051 individuals) was randomly divided into two groups: one for exploratory factor analysis (EFA) and principal component analysis (PCA), to investigate the number of factors underlying the neurocognitive variables; the second to test the "best fit" model via confirmatory factor analysis (CFA). For the exploratory step, three extraction (maximum likelihood, principal axis factoring and principal components) and two rotation (orthogonal and oblique) methods were used. The analysis methodology allowed exploring how different cognitive/psychological tests correlated/discriminated between dimensions, indicating that to capture latent structures in similar sample sizes and measures, with approximately normal data distribution, reflective models with oblimin rotation might prove the most adequate.