956 resultados para Data sets
Resumo:
The uncertainty of any analytical determination depends on analysis and sampling. Uncertainty arising from sampling is usually not controlled and methods for its evaluation are still little known. Pierre Gy’s sampling theory is currently the most complete theory about samplingwhich also takes the design of the sampling equipment into account. Guides dealing with the practical issues of sampling also exist, published by international organizations such as EURACHEM, IUPAC (International Union of Pure and Applied Chemistry) and ISO (International Organization for Standardization). In this work Gy’s sampling theory was applied to several cases, including the analysis of chromite concentration estimated on SEM (Scanning Electron Microscope) images and estimation of the total uncertainty of a drug dissolution procedure. The results clearly show that Gy’s sampling theory can be utilized in both of the above-mentioned cases and that the uncertainties achieved are reliable. Variographic experiments introduced in Gy’s sampling theory are beneficially applied in analyzing the uncertainty of auto-correlated data sets such as industrial process data and environmental discharges. The periodic behaviour of these kinds of processes can be observed by variographic analysis as well as with fast Fourier transformation and auto-correlation functions. With variographic analysis, the uncertainties are estimated as a function of the sampling interval. This is advantageous when environmental data or process data are analyzed as it can be easily estimated how the sampling interval is affecting the overall uncertainty. If the sampling frequency is too high, unnecessary resources will be used. On the other hand, if a frequency is too low, the uncertainty of the determination may be unacceptably high. Variographic methods can also be utilized to estimate the uncertainty of spectral data produced by modern instruments. Since spectral data are multivariate, methods such as Principal Component Analysis (PCA) are needed when the data are analyzed. Optimization of a sampling plan increases the reliability of the analytical process which might at the end have beneficial effects on the economics of chemical analysis,
Resumo:
Imide compounds have shown biological activity. These compounds can be easily synthesized with good yields. The objective of this paper was the rational planning of imides and sulfonamides with antinociceptive activity using the 3D-QSAR/CoMFA approach. The studies were performed using two data sets. The first set consisted of 39 cyclic imides while the second set consisted of 39 imides and 15 sulfonamides. The 3D- QSAR/CoMFA models have shown that the steric effect is important for the antinociceptive activity of imide and sulphonamide compounds. Ten new compounds with improved potential antinociceptive activity have been proposed by de novo design leapfrog simulations.
Resumo:
Most metazoans rely on aerobic energy production, which is dependent on adequate oxygen supply. In the case of reduced oxygen supply (hypoxia), the most profound changes in gene expression are mediated by transcription factors named hypoxia-inducible factors (HIF alpha). These proteins are post-translationally regulated by prolyl-4-hydroxylase (PHD) enzymes that are direct “sensors” of cellular oxygen levels. This thesis examines the molecular evolution of metazoan HIF systems. In early metazoans the HIF system emerged from pre-existing PHD oxygen sensors and early bHLH-PAS transcription factors. In invertebrates our analysis revealed an unexpected diversity of PHD genes and HIF alpha sequence characteristics. An early branching vertebrate, the epaulette shark (Hemiscyllium ocellatum) was chosen for sequencing and hypoxia preconditioning studies of HIF alpha and PHD genes. As no quantitative PCR reference genes were available, this thesis includes the first study of reference genes in cartilaginous fish species. Applying multiple statistical analysis we also discoveredthat commonly used reference gene software may perform poorly with some data sets. Novel reference genes allowed accurate measurements of the mRNAlevels of the studied target genes. Cartilaginous fishes have three genomic duplicates of both HIF alpha and PHD genes like mammals and teleost fishes. Combining functional divergence and selection analyses it was possible to describe how sequence changes in both HIF alpha and PHD duplicates may have contributed to the differential oxygen sensitivityof HIF alphas. Additionally, novel teleost HIF-1 alpha sequences were produced and used to reveal the molecular evolution of HIF-1 alpha in this lineage rich with hypoxia tolerant species.
Resumo:
Although social capital and health have been extensively studied during the last decade, there are still open issues in current empirical research. These concern for instance the measurement of the concept in different contexts, as well as the association between different types of social capital and different dimensions of health. The present thesis addressed these questions. The general aim was to promote the understanding of social capital and health by investigating the oldest old and the two major language groups in Finland, Swedish- and Finnish-speakers. Another aim was to contribute to the discussion on methodological issues in social capital and health research. The present thesis investigated two empirical data sets, Umeå 85+ and Health 2000. The Umeå 85+ study was a cross-sectional study of 163 individuals aged 85, 90, and 95 or older, living in the municipality of Umeå, Sweden, in the year of 2000. The Health 2000 survey was a national study of 8,028 persons aged 30 or above carried out in Finland in 2000-2001. Different indicators of structural (e.g. social contacts) and cognitive (e.g. trust) social capital, as well as health indicators were used as variables in the analyses. The Umeå 85+ data set was analyzed with factor analysis, as well as univariate and multivariate analysis of variance. The Health 2000 data was analyzed with logistic regression techniques. The results showed that the Swedish-speakers in the Finnish data set Health 2000 had consistently higher prevalence of social capital compared to the Finnish-speakers even after controlling for central sociodemographic variables. The results further showed that even if the language group differences in health were small, the Swedishspeakers experienced in general better self-reported health compared with the Finnish-speakers. Common sociodemographic variables could not explain these observed differences in health. The results imply that social capital is often, but not always, associated with health. This was clearly seen in the Umeå 85+ data set where only one health indicator (depressive symptoms) was associated with structural social capital among the oldest old. The results based on the analysis of the Health 2000 survey demonstrated that the cognitive component of social capital was associated with self-rated health and psychological health rather than with participation in social activities and social contacts. In addition, social capital statistically reduced the health advantage especially for Swedish-speaking men, indicating that high prevalence of social capital may promote health. Finally, the present thesis also discussed the issue of methodological challenges faced with when analyzing social capital and health. It was suggested that certain components of social capital such as bonding and bridging social capital may be more relevant than structural and cognitive components when investigating social capital among the two language groups in Finland. The results concerning the oldest old indicated that the structural aspects of social capital probably reflect current living conditions, whereas cognitive social capital reflects attitudes and traits often acquired decades earlier. This is interpreted as an indication of the fact that structural and cognitive social capital are closely related yet empirically two distinctive concepts. Taken together, some components of social capital may be more relevant to study than others depending on which population group and age group is under study. The results also implied that the choice of cut-off point of dichotomization of selfrated health has an impact on the estimated effects of the explanatory variables. When the whole age interval, 35-64 years, was analyzed with logistic regression techniques the choice of cut-off point did not matter for the estimated effects of marital status and educational level. The results changed, however, when the age interval was divided into three shorter intervals. If self-rated health is explored using wide age intervals that do not account for age-dependent covariates there is a risk of drawing misleading conclusions. In conclusion, the results presented in the thesis suggest that the uneven distribution of social capital observed between the two language groups in Finland are of importance when trying to further understand health inequalities that exist between Swedish- and Finnish-speakers in Finland. Although social capital seemed to be relevant to the understanding of health among the oldest old, the meaning of social capital is probably different compared to a less vulnerable age group. This should be noticed in future empirical research. In the present thesis, it was shown that the relationship between social capital and health is complex and multidimensional. Different aspects of social capital seem to be important for different aspects of health. This reduces the possibility to generalize the results and to recommend general policy implementations in this area. An increased methodological awareness regarding social capital as well as health are called for in order to further understand the cfomplex association between them. However, based on the present data and findings social capital is associated with health. To understand individual health one must also consider social aspects of the individuals’ environment such as social capital.
Resumo:
Taking into account that the sampling intensity of soil attributes is a determining factor for applying of concepts of precision agriculture, this study aims to determine the spatial distribution pattern of soil attributes and corn yield at four soil sampling intensities and verify how sampling intensity affects cause-effect relationship between soil attributes and corn yield. A 100-referenced point sample grid was imposed on the experimental site. Thus, each sampling cell encompassed an area of 45 m² and was composed of five 10-m long crop rows, where referenced points were considered the center of the cell. Samples were taken from at 0 to 0.1 m and 0.1 to 0.2 m depths. Soil chemical attributes and clay content were evaluated. Sampling intensities were established by initial 100-point sampling, resulting data sets of 100; 75; 50 and 25 points. The data were submitted to descriptive statistical and geostatistics analyses. The best sampling intensity to know the spatial distribution pattern was dependent on the soil attribute being studied. The attributes P and K+ content showed higher spatial variability; while the clay content, Ca2+, Mg2+ and base saturation values (V) showed lesser spatial variability. The spatial distribution pattern of clay content and V at the 100-point sampling were the ones which best explained the spatial distribution pattern of corn yield.
Resumo:
The aim of this study was to compare the hydrographically conditioned digital elevation models (HCDEMs) generated from data of VNIR (Visible Near Infrared) sensor of ASTER (Advanced Spaceborne Thermal Emission and Reflection Radiometer), of SRTM (Shuttle Radar Topography Mission) and topographical maps from IBGE in a scale of 1:50,000, processed in the Geographical Information System (GIS), aiming the morphometric characterization of watersheds. It was taken as basis the Sub-basin of São Bartolomeu River, obtaining morphometric characteristics from HCDEMs. Root Mean Square Error (RMSE) and cross validation were the statistics indexes used to evaluate the quality of HCDEMs. The percentage differences in the morphometric parameters obtained from these three different data sets were less than 10%, except for the mean slope (21%). In general, it was observed a good agreement between HCDEMs generated from remote sensing data and IBGE maps. The result of HCDEM ASTER was slightly higher than that from HCDEM SRTM. The HCDEM ASTER was more accurate than the HCDEM SRTM in basins with high altitudes and rugged terrain, by presenting frequency altimetry nearest to HCDEM IBGE, considered standard in this study.
Resumo:
The purpose of this study is to examine how well risk parity works in terms of risk, return and diversification relative to more traditional minimum variance, 1/N and 60/40 portfolios. Risk parity portfolios were constituted of five risk sources; three common asset classes and two alternative beta investment strategies. The three common asset classes were equities, bonds and commodities, and the alternative beta investment strategies were carry trade and trend following. Risk parity portfolios were constructed using five different risk measures of which four were tail risk measures. The risk measures were standard deviation, Value-at-Risk, Expected Shortfall, modified Value-at-Risk and modified Expected Shortfall. We studied also how sensitive risk parity is to the choice of risk measure. The hypothesis is that risk parity portfolios provide better return with the same amount of risk and are better diversified than the benchmark portfolios. We used two data sets, monthly and weekly data. The monthly data was from the years 1989-2011 and the weekly data was from the years 2000-2011. Empirical studies showed that risk parity portfolios provide better diversification since the diversification is made at the risk level. Risk based portfolios provided superior return compared to the asset based portfolios. Using tail risk measures in risk parity portfolios do not necessarily provide better hedge from tail events than standard deviation.
Resumo:
Tässä diplomityössä jatkettiin Loviisan voimalaitoksen höyryturbiinien suorituskyvyn parannuspotentiaalien tutkimusta. Tavoitteena oli kehittää laitoksen höyryturbiinien suorituskyvyn käytönaikaisia on-line-mittauksia. Selvityksessä perehdyttiin norjalaisen IFE:n kehittämään stationääritilan TEMPOohjelmaan( The Thermal Performance Monitoring And Optimisation system), sen käyttöohjeisiin ja toimintaperiaatteisiin. Työssä esiteltiin laajasti tiedon yhteensovittamisen laskentateoriaa, johon TEMPOn toiminta perustuu. Työssä tarkasteltiin turbiinin todellista paisuntaprosessia, koska sen ymmärtäminen on tärkeässä osassa turbiinin suorituskyvyn valvonnassa. Tutkimuksessa esiteltiin myös turbiineille mahdollisia vikoja sekä niiden syntymisprosesseja. Työssä tarkasteltiin TEMPOn sovittamien tulostiedostojen analysointiohjelman toimivuutta havaitsemalla itse aiheutettuja poikkeamia todellisiin mittaustiedostoihin. Analysointiohjelmalla muodostettuja kuvaajia vertailtiin todellisen prosessin ajotilanteen kuvaajiin ja tarkasteltiin, kuinka poikkeamia on mahdollista havaita kuvaajien avulla. TEMPO-ohjelmalle löydettiin tutkimuksen edetessä kehittämisehdotuksia. Näillä muutoksilla ohjelma saadaan mallintamaan Loviisan voimalaitoksen turbiiniprosessia tarkemmin ja tuloksista saadaan hyödyllisempiä.
Resumo:
Chaotic dynamical systems exhibit trajectories in their phase space that converges to a strange attractor. The strangeness of the chaotic attractor is associated with its dimension in which instance it is described by a noninteger dimension. This contribution presents an overview of the main definitions of dimension discussing their evaluation from time series employing the correlation and the generalized dimension. The investigation is applied to the nonlinear pendulum where signals are generated by numerical integration of the mathematical model, selecting a single variable of the system as a time series. In order to simulate experimental data sets, a random noise is introduced in the time series. State space reconstruction and the determination of attractor dimensions are carried out regarding periodic and chaotic signals. Results obtained from time series analyses are compared with a reference value obtained from the analysis of mathematical model, estimating noise sensitivity. This procedure allows one to identify the best techniques to be applied in the analysis of experimental data.
Resumo:
Non-linear functional representation of the aerodynamic response provides a convenient mathematical model for motion-induced unsteady transonic aerodynamic loads response, that accounts for both complex non-linearities and time-history effects. A recent development, based on functional approximation theory, has established a novel functional form; namely, the multi-layer functional. For a large class of non-linear dynamic systems, such multi-layer functional representations can be realised via finite impulse response (FIR) neural networks. Identification of an appropriate FIR neural network model is facilitated by means of a supervised training process in which a limited sample of system input-output data sets is presented to the temporal neural network. The present work describes a procedure for the systematic identification of parameterised neural network models of motion-induced unsteady transonic aerodynamic loads response. The training process is based on a conventional genetic algorithm to optimise the network architecture, combined with a simplified random search algorithm to update weight and bias values. Application of the scheme to representative transonic aerodynamic loads response data for a bidimensional airfoil executing finite-amplitude motion in transonic flow is used to demonstrate the feasibility of the approach. The approach is shown to furnish a satisfactory generalisation property to different motion histories over a range of Mach numbers in the transonic regime.
Resumo:
Wind power is a low-carbon energy production form that reduces the dependence of society on fossil fuels. Finland has adopted wind energy production into its climate change mitigation policy, and that has lead to changes in legislation, guidelines, regional wind power areas allocation and establishing a feed-in tariff. Wind power production has indeed boosted in Finland after two decades of relatively slow growth, for instance from 2010 to 2011 wind energy production increased with 64 %, but there is still a long way to the national goal of 6 TWh by 2020. This thesis introduces a GIS-based decision-support methodology for the preliminary identification of suitable areas for wind energy production including estimation of their level of risk. The goal of this study was to define the least risky places for wind energy development within Kemiönsaari municipality in Southwest Finland. Spatial multicriteria decision analysis (SMCDA) has been used for searching suitable wind power areas along with many other location-allocation problems. SMCDA scrutinizes complex ill-structured decision problems in GIS environment using constraints and evaluation criteria, which are aggregated using weighted linear combination (WLC). Weights for the evaluation criteria were acquired using analytic hierarchy process (AHP) with nine expert interviews. Subsequently, feasible alternatives were ranked in order to provide a recommendation and finally, a sensitivity analysis was conducted for the determination of recommendation robustness. The first study aim was to scrutinize the suitability and necessity of existing data for this SMCDA study. Most of the available data sets were of sufficient resolution and quality. Input data necessity was evaluated qualitatively for each data set based on e.g. constraint coverage and attribute weights. Attribute quality was estimated mainly qualitatively by attribute comprehensiveness, operationality, measurability, completeness, decomposability, minimality and redundancy. The most significant quality issue was redundancy as interdependencies are not tolerated by WLC and AHP does not include measures to detect them. The third aim was to define the least risky areas for wind power development within the study area. The two highest ranking areas were Nordanå-Lövböle and Påvalsby followed by Helgeboda, Degerdal, Pungböle, Björkboda, and Östanå-Labböle. The fourth aim was to assess the recommendation reliability, and the top-ranking two areas proved robust whereas the other ones were more sensitive.
Resumo:
Benzodiazepines (BZD) and benzodiazepine related drugs (RD) are the most commonly used psychotropics among the aged. The use of other psychotropics taken concomitantly with BZD/ RD or their cognitive effects with BZD/RD have not been studied frequently. The aim of this academic thesis was to describe and analyse relationships between the use of BZD/RD alone or concomitantly with antipsychotics, antidepressants, opioids, antiepileptics, opioids and anticholinergics in the aged and their health. Especially, the relationships between long-term use of BZD/RD and cognitive decline were studied. Additionally, the effect of melatonin on BZD/RD withdrawal and the cognitive effects of BZD/RD withdrawal were studied. This study used multiple data sets: the first study (I) was based on clinical data containing aged patients (≥65 years; N=164) admitted to Pori City Hospital due to acute disease. The second data set (Studies II and III) was based on population-based data from the Lieto Study, a clinico-epidemiological longitudinal study carried out among the aged (≥65 years) in the municipality of Lieto. Follow-up data was formed by combining the cohort data collected in 1990-1991 (N=1283) and in 1998-1999 (N=1596) from those who participated in both cohorts (N=617). The third data set (Studies IV and V) was based on the Satauni Study’s data. This study was performed in the City of Pori in 2009-2010. In the RCT part of the Satauni Study, ninety-two long-term users of BZD/RD were withdrawn from their drugs using melatonin against placebo. The change of their cognitive abilities was measured during and after BZD/ RD withdrawal. BZD/RD use was related to worse cognitive and functional abilities, and their use may predict worse cognitive outcomes compared with BZD/RD non-users. Hypnotic use of BZD/RD could be withdrawn with psychosocial support in motivated participants, but melatonin did not improve the withdrawal results compared to those with placebo. Cognitive abilities in psychomotor tests did not show, or showed only modest, improvements for up to six months after BZD/RD withdrawal. This suggests that the cognitive effects of BZD/RD may be longlasting or permanent.
Resumo:
Suomalaisten ja saksalaisten arkikeskustelujen välillä on sekä yhtäläisyyksiä että eroja. Tässä saksalaisen filologian alaan kuuluvassa tutkimuksessa tarkastellaan yhtä keskeistä arkikeskustelun toimintoa, puhelinkeskustelun lopetusta, suomen- ja saksanpuhujien tuottamana. Aineistona on käytetty suomen- ja saksankielisten äidinkielisten puhujien tätä tutkimusta varten nauhoittamia henkilökohtaisia luonnollisia puhelinkeskusteluja. Aineistoon valikoitui 12 suomalaista ja 12 saksalaista puhelua. Nauhoitteiden käyttöön on saatu asianmukainen lupa kaikilta osapuolilta. Puhelut on litteroitu saksalaisella kielialueella vakiintuneen GAT-litterointisysteemin mukaan. Teoreettis-metodisena kehyksenä on kaksi tutkimusalaa, vuorovaikutuslingvistiikka ja kielten vertailu. Vuorovaikutuslingvistinen tarkastelu keskittyy havaintoihin vuorojen ja puheen sekvenssien rakenteesta. Vuorojen merkitysten tulkinnassa hyödynnetään systemaattisesti prosodian antamia vihjeitä. Tuloksena on yksittäisten lopetusten keskustelunanalyyttinen lähikuvaus, jonka pohjalta määritellään kulloisenkin lopetuksen sekvenssirakenne. Kaikki lopetukset olivat siltä osin yhteneväisiä, että niissä kaikissa havaittiin ainakin aloittava, tulevaan tapaamiseen viittaava sekä lopputervehdyksiin johtava sekvenssi. Sekvenssirakenteen variaatioiden pohjalta aineiston lopetukset voidaan kuitenkin jaotella ryhmiin. Sekä suomen- että saksankielisessä aineistossa havaittiin kolmentyyppisiä lopetuksia: kompakteja, komplekseja ja keskeytettyjä lopetuksia. Ryhmittely kolmeen tyyppiin on avuksi seuraavassa kuvausvaiheessa, jossa verrataan suomen- ja saksankielisiä lopetuksia toisiinsa. Samanaikaisesti kun tutkimus valottaa kohtia, joissa kaksi aineistosettiä yhtenevät ja eroavat, se myös esittää, mitkä vuorovaikutuksen tasot soveltuvat kieltenvälisen vertailun kohteiksi. Pohdintaa siitä, mitä vuorovaikutuksen tasoja kieltenväliseen vertailuun voidaan sisällyttää, onkin toistaiseksi esitetty verrattain vähän. Työ siis rakentaa siltaa vuorovaikutuslingvistisen ja kontrastiivisen kielitieteen välille.
Resumo:
A floristic and structural survey of a natural grassland community was conducted on Morro do Osso, a granitic hill in Porto Alegre, RS, Brazil. Structural data were surveyed in 39 one square meter plots placed over two major grassland areas. An accidental fire has occurred in one of the areas approximately one year prior to our survey, leading to further analysis of parameters differences between sites. The floristic list contains 282 species, whereas the structural survey has found 161 species. Families with highest accumulated importance values were Poaceae, Asteraceae and Fabaceae. The diversity and evenness indexes were 4.51 nats ind-1 and 0.86, respectively. Cluster analysis denoted two groups coinciding with the areas distinguished by the fire disturbance. A similarity analysis between our data and two other data sets from nearby granitic hills resulted in 28% to 35% similarity, with equivalent species-family distribution and many common dominant species, corroborating the concept of a continuous flora along the South Brazilian granitic hills.
Resumo:
The amount of biological data has grown exponentially in recent decades. Modern biotechnologies, such as microarrays and next-generation sequencing, are capable to produce massive amounts of biomedical data in a single experiment. As the amount of the data is rapidly growing there is an urgent need for reliable computational methods for analyzing and visualizing it. This thesis addresses this need by studying how to efficiently and reliably analyze and visualize high-dimensional data, especially that obtained from gene expression microarray experiments. First, we will study the ways to improve the quality of microarray data by replacing (imputing) the missing data entries with the estimated values for these entries. Missing value imputation is a method which is commonly used to make the original incomplete data complete, thus making it easier to be analyzed with statistical and computational methods. Our novel approach was to use curated external biological information as a guide for the missing value imputation. Secondly, we studied the effect of missing value imputation on the downstream data analysis methods like clustering. We compared multiple recent imputation algorithms against 8 publicly available microarray data sets. It was observed that the missing value imputation indeed is a rational way to improve the quality of biological data. The research revealed differences between the clustering results obtained with different imputation methods. On most data sets, the simple and fast k-NN imputation was good enough, but there were also needs for more advanced imputation methods, such as Bayesian Principal Component Algorithm (BPCA). Finally, we studied the visualization of biological network data. Biological interaction networks are examples of the outcome of multiple biological experiments such as using the gene microarray techniques. Such networks are typically very large and highly connected, thus there is a need for fast algorithms for producing visually pleasant layouts. A computationally efficient way to produce layouts of large biological interaction networks was developed. The algorithm uses multilevel optimization within the regular force directed graph layout algorithm.