918 resultados para Image pre-processing
Resumo:
Recently, there has been a growing interest in the field of metabolomics, materialized by a remarkable growth in experimental techniques, available data and related biological applications. Indeed, techniques as Nuclear Magnetic Resonance, Gas or Liquid Chromatography, Mass Spectrometry, Infrared and UV-visible spectroscopies have provided extensive datasets that can help in tasks as biological and biomedical discovery, biotechnology and drug development. However, as it happens with other omics data, the analysis of metabolomics datasets provides multiple challenges, both in terms of methodologies and in the development of appropriate computational tools. Indeed, from the available software tools, none addresses the multiplicity of existing techniques and data analysis tasks. In this work, we make available a novel R package, named specmine, which provides a set of methods for metabolomics data analysis, including data loading in different formats, pre-processing, metabolite identification, univariate and multivariate data analysis, machine learning, and feature selection. Importantly, the implemented methods provide adequate support for the analysis of data from diverse experimental techniques, integrating a large set of functions from several R packages in a powerful, yet simple to use environment. The package, already available in CRAN, is accompanied by a web site where users can deposit datasets, scripts and analysis reports to be shared with the community, promoting the efficient sharing of metabolomics data analysis pipelines.
Resumo:
In this project a research both in finding predictors via clustering techniques and in reviewing the Data Mining free software is achieved. The research is based in a case of study, from where additionally to the KDD free software used by the scientific community; a new free tool for pre-processing the data is presented. The predictors are intended for the e-learning domain as the data from where these predictors have to be inferred are student qualifications from different e-learning environments. Through our case of study not only clustering algorithms are tested but also additional goals are proposed.
Resumo:
Realistic rendering animation is known to be an expensive processing task when physically-based global illumination methods are used in order to improve illumination details. This paper presents an acceleration technique to compute animations in radiosity environments. The technique is based on an interpolated approach that exploits temporal coherence in radiosity. A fast global Monte Carlo pre-processing step is introduced to the whole computation of the animated sequence to select important frames. These are fully computed and used as a base for the interpolation of all the sequence. The approach is completely view-independent. Once the illumination is computed, it can be visualized by any animated camera. Results present significant high speed-ups showing that the technique could be an interesting alternative to deterministic methods for computing non-interactive radiosity animations for moderately complex scenarios
Resumo:
1. The ecological niche is a fundamental biological concept. Modelling species' niches is central to numerous ecological applications, including predicting species invasions, identifying reservoirs for disease, nature reserve design and forecasting the effects of anthropogenic and natural climate change on species' ranges. 2. A computational analogue of Hutchinson's ecological niche concept (the multidimensional hyperspace of species' environmental requirements) is the support of the distribution of environments in which the species persist. Recently developed machine-learning algorithms can estimate the support of such high-dimensional distributions. We show how support vector machines can be used to map ecological niches using only observations of species presence to train distribution models for 106 species of woody plants and trees in a montane environment using up to nine environmental covariates. 3. We compared the accuracy of three methods that differ in their approaches to reducing model complexity. We tested models with independent observations of both species presence and species absence. We found that the simplest procedure, which uses all available variables and no pre-processing to reduce correlation, was best overall. Ecological niche models based on support vector machines are theoretically superior to models that rely on simulating pseudo-absence data and are comparable in empirical tests. 4. Synthesis and applications. Accurate species distribution models are crucial for effective environmental planning, management and conservation, and for unravelling the role of the environment in human health and welfare. Models based on distribution estimation rather than classification overcome theoretical and practical obstacles that pervade species distribution modelling. In particular, ecological niche models based on machine-learning algorithms for estimating the support of a statistical distribution provide a promising new approach to identifying species' potential distributions and to project changes in these distributions as a result of climate change, land use and landscape alteration.
Resumo:
In this paper the problem of intensity inhomogeneity athigh magnetic field on magnetic resonance images isaddressed. Specifically, rat brain images at 9.4Tacquired with a surface coil are bias corrected. Wepropose a low- pass frequency model that takes intoaccount not only background-object contours but alsoother important contours inside the image. Twopre-processing filters are proposed: first, to create avolume of interest without contours, and second, toextrapolate the image values of such masked area to thewhole image. Results are assessed quantitatively andvisually in comparison to standard low pass filterapproach, and they show as expected better accuracy inenhancing image intensity.
Resumo:
Compared to synthetic aperture radars (SARs), the angular resolution of microwave radiometers is quite poor. Traditionally, it has been limited by the physical size of the antenna. However, the angular resolution can be improved by means of aperture synthesis interferometric techniques. A narrow beam is synthesized during the image formation processing of the cross-correlations measured at zero-lag between pairs of signals collected by an array of antennas. The angular resolution is then determined by the maximum antenna spacing normalized to the wavelength (baseline). The next step in improving the angular resolution is the Doppler-Radiometer, somehow related to the super-synthesis radiometers and the Radiometer-SAR. This paper presents the concept of a three-antenna Doppler-Radiometer for 2D imaging. The performance of this instrument is evaluated in terms of angular/spatial resolution and radiometric sensitivity, and an L-band illustrative example is presented.
Resumo:
The increasing interest aroused by more advanced forecasting techniques, together with the requirement for more accurate forecasts of tourismdemand at the destination level due to the constant growth of world tourism, has lead us to evaluate the forecasting performance of neural modelling relative to that of time seriesmethods at a regional level. Seasonality and volatility are important features of tourism data, which makes it a particularly favourable context in which to compare the forecasting performance of linear models to that of nonlinear alternative approaches. Pre-processed official statistical data of overnight stays and tourist arrivals fromall the different countries of origin to Catalonia from 2001 to 2009 is used in the study. When comparing the forecasting accuracy of the different techniques for different time horizons, autoregressive integrated moving average models outperform self-exciting threshold autoregressions and artificial neural network models, especially for shorter horizons. These results suggest that the there is a trade-off between the degree of pre-processing and the accuracy of the forecasts obtained with neural networks, which are more suitable in the presence of nonlinearity in the data. In spite of the significant differences between countries, which can be explained by different patterns of consumer behaviour,we also find that forecasts of tourist arrivals aremore accurate than forecasts of overnight stays.
Resumo:
Universal Converter (UNICON) –projektin osana suunniteltiin sähkömoottorikäyttöjen ohjaukseen ja mittaukseen soveltuva digitaaliseen signaaliprosessoriin (DSP) pohjautuva sulautettu järjestelmä. Riittävän laskentatehon varmistamiseksi päädyttiin käyttämään moniprosessorijärjestelmää. Prosessorijärjestelmässä käytettävää DSP-piiriä valittaessa valintaperusteina olivat piirien tarjoama prosessointiteho ja moniprosessorituki. Analog Devices:n SHARC-sarjan DSP-piirit täyttivät parhaiten asetetut vaatimukset: Ne tarjoavat tehokkaan käskykannan lisäksi suuren sisäisen muistin ja sisäänrakennetun moniprosessorituen. Järjestelmän mittalaiteluonteisuudesta johtuen keskeinen suunnitteluparametri oli luoda nopeat tiedonsiirtoyhteydet mittausantureilta DSP-järjestelmään. Tämä toteutettiin käyttäen ohjelmointavia FPGA-logiikkapiirejä digitaalimuotoisen mittausdatan vastaanotossa ja esikäsittelyssä. Tiedonsiirtoyhteys PC-tietokoneelle toteutettiin käyttäen erityistä liityntäkorttia DSP-järjestelmän ja PC-tietokoneen välillä. Liityntäkortin päätehtävänä on puskuroida siirrettävä data. Järjestelyllä estetään PC-tietokoneen vaikutus DSP-järjestelmän toimintaan, jotta kyetään takaamaan järjestelmän reaaliaikainen toiminta kaikissa olosuhteissa.
Resumo:
In this thesis we study the field of opinion mining by giving a comprehensive review of the available research that has been done in this topic. Also using this available knowledge we present a case study of a multilevel opinion mining system for a student organization's sales management system. We describe the field of opinion mining by discussing its historical roots, its motivations and applications as well as the different scientific approaches that have been used to solve this challenging problem of mining opinions. To deal with this huge subfield of natural language processing, we first give an abstraction of the problem of opinion mining and describe the theoretical frameworks that are available for dealing with appraisal language. Then we discuss the relation between opinion mining and computational linguistics which is a crucial pre-processing step for the accuracy of the subsequent steps of opinion mining. The second part of our thesis deals with the semantics of opinions where we describe the different ways used to collect lists of opinion words as well as the methods and techniques available for extracting knowledge from opinions present in unstructured textual data. In the part about collecting lists of opinion words we describe manual, semi manual and automatic ways to do so and give a review of the available lists that are used as gold standards in opinion mining research. For the methods and techniques of opinion mining we divide the task into three levels that are the document, sentence and feature level. The techniques that are presented in the document and sentence level are divided into supervised and unsupervised approaches that are used to determine the subjectivity and polarity of texts and sentences at these levels of analysis. At the feature level we give a description of the techniques available for finding the opinion targets, the polarity of the opinions about these opinion targets and the opinion holders. Also at the feature level we discuss the various ways to summarize and visualize the results of this level of analysis. In the third part of our thesis we present a case study of a sales management system that uses free form text and that can benefit from an opinion mining system. Using the knowledge gathered in the review of this field we provide a theoretical multi level opinion mining system (MLOM) that can perform most of the tasks needed from an opinion mining system. Based on the previous research we give some hints that many of the laborious market research tasks that are done by the sales force, which uses this sales management system, can improve their insight about their partners and by that increase the quality of their sales services and their overall results.
Resumo:
Genetic algorithm was used for variable selection in simultaneous determination of mixtures of glucose, maltose and fructose by mid infrared spectroscopy. Different models, using partial least squares (PLS) and multiple linear regression (MLR) with and without data pre-processing, were used. Based on the results obtained, it was verified that a simpler model (multiple linear regression with variable selection by genetic algorithm) produces results comparable to more complex methods (partial least squares). The relative errors obtained for the best model was around 3% for the sugar determination, which is acceptable for this kind of determination.
Resumo:
The influence of pre-processing of arabica coffee beans on the composition of volatile precursors including sugars, chlorogenic acids, phenolics, proteins, aminoacids, trigonelline and fatty acids was assessed and correlated with volatiles formed during roasting. Reducing sugars and free aminoacids were highest for natural coffees whereas total sugars, chlorogenic acids and trigonelline were highest for washed coffees. The highest correlation was observed for total phenolics and volatile phenolics (R= 0.999). Experimental data were evaluated by Principal Components Analysis and results showed that washed coffees formed a distinct group in relation to semi-washed and natural coffees.
Resumo:
Tämän työn tavoitteena oli selvittää ja toteuttaa esikäsittelypiirin prototyyppi akustisen emission anturin signaalille. Toteutettu esikäsittelypiiri toimii yksipuoleisella käyttöjännitteellä. Työssä käydään läpi esikäsittelypiirin suunnitteluun liittyvät vaiheet laskelmien ja simulaatioiden muodossa. Lisäksi työssä esitetään mittaustulokset esikäsittelypiirin toiminnasta.
Resumo:
Työssä määritettiin luokan 2 eläinperäisistä sivutuotteista liikennekäyttöön tuotettujen biodieselin ja biometaanin elinkaaren aikaiset kasvihuonekaasupäästöt ja tuotantoprosessien energiankulutukset perustuen kirjallisuuslähteistä saatuihin lähtötietoihin. Tätä kautta tutkittiin yhdistelmäprosessia, jossa tuotetaan molempia polttoaineita ja selvitettiin onko tällaisella tuotantotavalla mahdollista vähentää päästöjä ja parantaa polttoaineiden tuotannon energiatehokkuutta. Kasvihuone-kaasupäästöjen laskentamenetelmä pohjautuu direktiivissä 2009/28/EY annettuun ohjeistukseen ja eri kasvihuonekaasupäästöjen karakterisointi IPCC:n sadan vuoden tarkastelumalliin. Käytännön laskenta suoritettiin standardien SFS-EN ISO 14040 ja 14044 määrittelemän elinkaariarviointiselvityksen muodossa. Työssä käytetyn laskentamenetelmän ja tarkasteluun valittujen tuotanto-teknologioiden perusteella lasketut tulokset osoittavat, että yhdistelmäprosessilla ei saavuteta suurempia päästövähenemiä eikä parempaa energiatehokkuutta kuin nykyisin käytössä olevilla tuotantotavoilla. Tulokset ovat kuitenkin hyvin herkkiä laskennassa tehtyjen oletusten ja käytettyjen lähtötietojen vaihtelulle sekä valittujen laskentamenetelmien muutoksille. Suurin päästöjä ja energiankulutusta aiheuttava yksittäinen tekijä on kaikissa tuotejärjestelmissä luokan 2 sivutuotteiden esikäsittelyssä vaadittavaan steri-lointiin tarvittavan lämmön tuotanto. Tutkituissa tuotejärjestelmissä lämpö tuotetaan kokonaan tai osittain fossiilisilla polttoaineilla. Kasvihuone-kaasupäästöjä olisi mahdollista alentaa merkittävästi siirtymällä lämmön tuotannossa kokonaan uusiutuviin polttoaineisiin. Sterilointi on lain edellyttämä käsittelytapa ja siksi energiankulutusta on vallitsevissa olosuhteissa hyvin vaikea pienentää merkittävästi.
Resumo:
Many research works have being carried out on analyzing grain storage facility costs; however a few of them had taken into account the analysis of factors associated to all pre-processing and storage steps. The objective of this work was to develop a decision support system for determining the grain storage facility costs and utilization fees in grain storage facilities. The data of a CONAB storage facility located in Ponta Grossa - PR, Brazil, was used as input of the system developed to analyze its specific characteristics, such as amount of product received and stored throughout the year, hourly capacity of drying, cleaning, and receiving, and dispatch. By applying the decision support system, it was observed that the reception and expedition costs were exponentially reduced as the turnover rate of the storage increased. The cleaning and drying costs increased linearly with grain initial moisture. The storage cost increased exponentially as the occupancy rate of the storage facility decreased.
Resumo:
OBJECTIVE: to evaluate the role of fibrillar extracellular matrix components in the pathogenesis of inguinal hernias. METHODS: samples of the transverse fascia and of the anterior sheath of the rectus abdominis muscle were collected from 40 men aged between 20 and 60 years with type II and IIIA Nyhus inguinal hernia and from 10 fresh male cadavers (controls) without hernia in the same age range. The staining technique was immunohistochemistry for collagen I, collagen III and elastic fibers; quantification of fibrillar components was performed with an image analysis processing software. RESULTS: no statistically significant differences were found in the amount of elastic fibers, collagen I and collagen III, and the ratio of collagen I / III among patients with inguinal hernia when compared with subjects without hernia. CONCLUSION: the amount of fibrillar extracellular matrix components did not change in patients with and without inguinal hernia.