5 resultados para discovery driven analysis
em Universidade do Minho
Resumo:
NIPE - WP 02/2016
Resumo:
As huge amounts of data become available in organizations and society, specific data analytics skills and techniques are needed to explore this data and extract from it useful patterns, tendencies, models or other useful knowledge, which could be used to support the decision-making process, to define new strategies or to understand what is happening in a specific field. Only with a deep understanding of a phenomenon it is possible to fight it. In this paper, a data-driven analytics approach is used for the analysis of the increasing incidence of fatalities by pneumonia in the Portuguese population, characterizing the disease and its incidence in terms of fatalities, knowledge that can be used to define appropriate strategies that can aim to reduce this phenomenon, which has increased more than 65% in a decade.
Resumo:
Special issue guest editorial, June, 2015.
Resumo:
Existing data supports Portugal as the Western Europe country with highest HIV-1 subtype diversity. However, detailed phylogenetic studies of Portuguese HIV-1 epidemics are still scarce. Thus, our main goal was to analyze the phylodynamics of a local HIV-1 infection in the Portuguese region of Minho. Molecular epidemiological analysis was applied to data from 289 HIV-1 infected individuals followed in the reference Hospital of the province of Minho, Portugal, in which isolated viruses had been sequenced between 2000 and 2012. Viruses of the G (29.1%) and B (27.0%) subtypes were the most frequent, followed by recombinant forms (17.6%), C (14.5%), F1 (7.3%) and A1 (4.2%) subtypes. Multinomial logistic regression revealed that the odds of being infected with A1 and F1 subtype increased over the years when compared with B, G, C or recombinant viruses. As expected, polyphyletic patterns suggesting multiple and old introductions of subtypes B and G were found. However, transmission clusters of non-B and -G viruses among native individuals were also found with the dates of the most recent common ancestor estimated to the early 2000s. Our study supports that the HIV-1 subtype diversity in the Portuguese region of Minho is high and has been increasing in a manner that is apparently driven by factors other than immigration and international travel. Infections with A1 and F1 viruses in the region of Minho are becoming established and were mainly found in sexually transmitted clusters, reinforcing the need for more efficacious control measures targeting this infection route.
Resumo:
Recently, there has been a growing interest in the field of metabolomics, materialized by a remarkable growth in experimental techniques, available data and related biological applications. Indeed, techniques as Nuclear Magnetic Resonance, Gas or Liquid Chromatography, Mass Spectrometry, Infrared and UV-visible spectroscopies have provided extensive datasets that can help in tasks as biological and biomedical discovery, biotechnology and drug development. However, as it happens with other omics data, the analysis of metabolomics datasets provides multiple challenges, both in terms of methodologies and in the development of appropriate computational tools. Indeed, from the available software tools, none addresses the multiplicity of existing techniques and data analysis tasks. In this work, we make available a novel R package, named specmine, which provides a set of methods for metabolomics data analysis, including data loading in different formats, pre-processing, metabolite identification, univariate and multivariate data analysis, machine learning, and feature selection. Importantly, the implemented methods provide adequate support for the analysis of data from diverse experimental techniques, integrating a large set of functions from several R packages in a powerful, yet simple to use environment. The package, already available in CRAN, is accompanied by a web site where users can deposit datasets, scripts and analysis reports to be shared with the community, promoting the efficient sharing of metabolomics data analysis pipelines.