Biblioteca Digital

957 results for Statistical approach

Gene Set Analysis for improving genetic association studies

Relevance: 60.00%

Abstract:

Introduction. Genetic epidemiology is focused on the study of the genetic causes that determine health and disease in populations. To achieve this goal, a common strategy is to explore differences in genetic variability between diseased and nondiseased individuals. The usual markers of genetic variability are single nucleotide polymorphisms (SNPs), which are changes in just one base in the genome. The usual statistical approach in genetic epidemiology studies is a marginal analysis, where each SNP is analyzed separately for association with the phenotype. Motivation. It has been observed that, for common diseases, single-SNP analysis is not very powerful for detecting causal genetic variants. In this work, we consider Gene Set Analysis (GSA) as an alternative to standard marginal association approaches. GSA aims to assess the overall association of a set of genetic variants with a phenotype and has the potential to detect subtle effects of variants in a gene or a pathway that might be missed when assessed individually. Objective. We present a new optimized implementation of a pair of gene set analysis methodologies for analyzing the individual evidence of SNPs in biological pathways. We perform a simulation study to explore the power of the proposed methodologies in a set of scenarios with different numbers of causal SNPs under different effect sizes. In addition, we compare the results with the usual single-SNP analysis method. Moreover, we show the advantage of using the proposed gene set approaches in the context of an Alzheimer's disease case-control study where we explore the Reelin signaling pathway.
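The idea of pooling per-SNP evidence into a single set-level test can be illustrated with a minimal sketch. The abstract does not name the two methodologies it implements, so the example below swaps in Fisher's combined probability test purely as an illustration; the function name is hypothetical, and the method assumes independent p-values, which SNPs in linkage disequilibrium generally violate.

```python
import math

def fisher_combined_pvalue(p_values):
    """Combine independent p-values with Fisher's method (illustrative only)."""
    k = len(p_values)
    if k == 0:
        raise ValueError("need at least one p-value")
    # Fisher statistic X = -2 * sum(ln p_i); under H0 it is chi-square with 2k df.
    half_stat = -sum(math.log(p) for p in p_values)  # this is X / 2
    # Closed-form chi-square survival function for an even number of df:
    # P(X > x) = exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total
```

With a single p-value the function returns that p-value unchanged; with several moderately small p-values the combined value can fall below each individual one, which is the kind of subtle joint effect a set-level analysis aims to pick up.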

Exploring Compositional Data with the CoDa-Dendrogram

Relevance: 60.00%

Abstract:

Within the special geometry of the simplex, the sample space of compositional data, compositional orthonormal coordinates allow the application of any multivariate statistical approach. The search for meaningful coordinates has suggested balances (between two groups of parts), based on a sequential binary partition of a D-part composition, and a representation in the form of a CoDa-dendrogram. Projected samples are represented in a dendrogram-like graph showing: (a) the way of grouping parts; (b) the explanatory role of subcompositions generated in the partition process; (c) the decomposition of the variance; (d) the center and quantiles of each balance. The representation is useful for the interpretation of balances and for describing the sample in a single diagram independently of the number of parts. Also, samples of two or more populations, as well as several samples from the same population, can be represented in the same graph, as long as they have the same parts registered. The approach is illustrated with an example of food consumption in Europe.
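A balance between two groups of parts can be computed directly from the standard definition: the log-ratio of the geometric means of the two groups, scaled by sqrt(r*s/(r+s)), where r and s are the group sizes. The sketch below is a minimal illustration of that formula; the function names and index-based interface are assumptions, not taken from the paper.

```python
import math

def geometric_mean(values):
    """Geometric mean of strictly positive values."""
    return math.exp(sum(math.log(v) for v in values) / len(values))

def balance(composition, numerator_idx, denominator_idx):
    """Balance (one orthonormal log-ratio coordinate) between two groups of parts."""
    r = len(numerator_idx)
    s = len(denominator_idx)
    g_num = geometric_mean([composition[i] for i in numerator_idx])
    g_den = geometric_mean([composition[i] for i in denominator_idx])
    # The normalizing constant gives the coordinate unit length in the simplex geometry.
    return math.sqrt(r * s / (r + s)) * math.log(g_num / g_den)
```

Because only ratios enter the formula, rescaling the whole composition (for example from proportions to percentages) leaves the balance unchanged.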

Measuring Intermediary Determinants of Early Childhood Health: A Composite Index Comparing Colombian Departments

Relevance: 60.00%

Abstract:

In recent years there has been growing interest in composite indicators as an efficient tool of analysis and a method of prioritizing policies. This paper presents a composite index of intermediary determinants of child health using a multivariate statistical approach. The index shows how specific determinants of child health vary across Colombian departments (administrative subdivisions). We used data collected from the 2010 Colombian Demographic and Health Survey (DHS) for 32 departments and the capital city, Bogotá. Adapting the conceptual framework of the Commission on Social Determinants of Health (CSDH), five dimensions related to child health are represented in the index: material circumstances, behavioural factors, psychosocial factors, biological factors and the health system. To generate the weights of the variables while taking into account the discrete nature of the data, principal component analysis (PCA) using polychoric correlations was employed in constructing the index. From this method, five principal components were selected. The index was estimated using a weighted average of the retained components. A hierarchical cluster analysis was also carried out. The results show that the biggest differences in intermediary determinants of child health are associated with health care before and during delivery.

Measuring early childhood health : a composite index comparing Colombian departments

Relevance: 60.00%

Abstract:

This paper presents a composite index of early childhood health using a multivariate statistical approach. The index shows how child health varies across Colombian departments (administrative subdivisions). In recent years there has been growing interest in composite indicators as an efficient analysis tool and a way of prioritizing policies. These indicators not only enable multi-dimensional phenomena to be simplified but also make it easier to measure, visualize, monitor and compare a country’s performance on particular issues. We used data collected from the Colombian Demographic and Health Survey (DHS) for 32 departments and the capital city, Bogotá, in 2005 and 2010. The variables included in the index provide a measure of three dimensions related to child health: health status, health determinants and the health system. To generate the weights of the variables while taking into account the discrete nature of the data, we employed principal component analysis (PCA) using polychoric correlations. From this method, five principal components were selected. The index was estimated using a weighted average of the retained components. A hierarchical cluster analysis was also carried out. We observed that the departments ranking in the lowest positions are located on the Colombian periphery. They are departments with low per capita incomes and critical social indicators. The results suggest that the regional disparities in child health may be associated with differences in parental characteristics, household conditions and economic development levels, which makes clear the importance of context in the study of child health in Colombia.

Estradiol and testosterone concentrations in follicular fluid as criteria to discriminate between mature and immature oocytes

Relevance: 60.00%

Abstract:

The objective of the present study was to examine the association between follicular fluid (FF) steroid concentration and oocyte maturity and fertilization rates. Seventeen infertile patients underwent ovulation induction with urinary human follicle-stimulating hormone, human menopausal gonadotropin and human chorionic gonadotropin (hCG). A total of 107 follicles were aspirated after hCG administration; the oocytes were analyzed for maturity and 81 of them were incubated and inseminated in vitro. Progesterone, estradiol (E2), estrone, androstenedione, and testosterone were measured in the FF. E2 and testosterone levels were significantly higher in FF containing immature oocytes (median = 618.2 and 16 ng/ml, respectively) than in FF containing mature oocytes (median = 368 and 5.7 ng/ml, respectively; P < 0.05). Progesterone, androstenedione and estrone levels were not significantly different between mature and immature oocytes. Applying the receiver-operating characteristic (ROC) curve to determine the best cut-off point for discriminating between mature and immature oocytes indicated levels of 505.8 ng/ml for E2 (81.0% sensitivity and 81.8% specificity) and of 10.4 ng/ml for testosterone (90.9% sensitivity and 82.4% specificity). Follicular diameter was negatively associated with E2 and testosterone levels in FF. There was a significant increase in progesterone/testosterone, progesterone/E2 and E2/testosterone ratios in FF containing mature oocytes, suggesting a reduction in conversion of C21 to C19, but not in aromatase activity. The overall fertility rate was 61% but there was no correlation between the steroid levels or their ratios and the fertilization rates. E2 and testosterone levels in FF may be used as a predictive parameter of oocyte maturity, but not of the in vitro fertilization rate.
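Choosing a cut-off from an ROC curve can be sketched as a scan over candidate thresholds. The abstract does not state which optimality criterion was used, so the example below assumes Youden's J (sensitivity + specificity - 1) as one common choice; the function name and the convention that higher values indicate the "positive" class (here, immature oocytes) are likewise assumptions.

```python
def best_cutoff(positive_values, negative_values):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    A measurement is classified as positive when value >= cutoff.
    """
    best = None
    # Every observed value is a candidate threshold.
    for cutoff in sorted(set(positive_values) | set(negative_values)):
        sensitivity = sum(v >= cutoff for v in positive_values) / len(positive_values)
        specificity = sum(v < cutoff for v in negative_values) / len(negative_values)
        j = sensitivity + specificity - 1
        # Strict comparison keeps the lowest cutoff among ties.
        if best is None or j > best[0]:
            best = (j, cutoff, sensitivity, specificity)
    return best[1:]
```

Other criteria, such as the point closest to the top-left corner of the ROC plot or a cost-weighted rule, can select a different threshold from the same data.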

Development and validation of a genotype 3 recombinant protein-based immunoassay for hepatitis E virus serology in swine

Relevance: 60.00%

Abstract:

Hepatitis E virus (HEV) is classified within the family Hepeviridae, genus Hepevirus. HEV genotype 3 (Gt3) infections are endemic in pigs in Western Europe and in North and South America and cause zoonotic infections in humans. Several serological assays to detect HEV antibodies in pigs have been developed, at first mainly based on HEV genotype 1 (Gt1) antigens. To develop a sensitive HEV Gt3 ELISA, a recombinant baculovirus expression product of HEV Gt3 open reading frame-2 was produced and coated onto polystyrene ELISA plates. After incubation of porcine sera, bound HEV antibodies were detected with anti-porcine anti-IgG and anti-IgM conjugates. For primary estimation of sensitivity and specificity of the assay, sets of sera were used from pigs experimentally infected with HEV Gt3. For further validation of the assay and to set the cutoff value, a batch of 1100 pig sera was used. All pig sera were tested using the developed HEV Gt3 assay and two other serologic assays based on HEV Gt1 antigens. Since there is no gold standard available for HEV antibody testing, further validation and a definite setting of the cutoff of the developed HEV Gt3 assay were performed using a statistical approach based on Bayes' theorem. The developed and validated HEV antibody assay showed effective detection of HEV-specific antibodies. This assay can contribute to an improved detection of HEV antibodies and enable more reliable estimates of the prevalence of HEV Gt3 in swine in different regions.

Exact Multivariate Tests of Asset Pricing Models with Stable Asymmetric Distributions

Relevance: 60.00%

Abstract:

In this paper, we propose exact inference procedures for asset pricing models that can be formulated in the framework of a multivariate linear regression (CAPM), allowing for stable error distributions. The normality assumption on the distribution of stock returns is usually rejected in empirical studies, due to excess kurtosis and asymmetry. To model such data, we propose a comprehensive statistical approach which allows for alternative (possibly asymmetric) heavy-tailed distributions without the use of large-sample approximations. The methods suggested are based on Monte Carlo test techniques. Goodness-of-fit tests are formally incorporated to ensure that the error distributions considered are empirically sustainable, from which exact confidence sets for the unknown tail area and asymmetry parameters of the stable error distribution are derived. Tests for the efficiency of the market portfolio (zero intercepts) which explicitly allow for the presence of (unknown) nuisance parameters in the stable error distribution are derived. The methods proposed are applied to monthly returns on 12 portfolios of the New York Stock Exchange over the period 1926-1995 (5-year subperiods). We find that stable, possibly skewed, distributions provide a statistically significant improvement in goodness-of-fit and lead to fewer rejections of the efficiency hypothesis.
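The core of a Monte Carlo test can be sketched in a few lines: simulate the test statistic under the null hypothesis and rank the observed value among the simulated ones. This is a generic illustration, not the paper's procedure; the function and parameter names are hypothetical, and `simulate_stat` stands in for whatever simulator draws the statistic under the null (for example, with stable errors).

```python
import random

def monte_carlo_pvalue(observed_stat, simulate_stat, n_replications=99, rng=None):
    """Monte Carlo p-value for a test that rejects for large statistic values.

    simulate_stat(rng) must return one draw of the statistic under the null.
    """
    rng = rng or random.Random(0)
    exceed = sum(simulate_stat(rng) >= observed_stat for _ in range(n_replications))
    # Counting the observed statistic as one of the N + 1 draws gives the
    # usual (exceed + 1) / (N + 1) form.
    return (exceed + 1) / (n_replications + 1)
```

For a continuous, pivotal statistic, rejecting when this p-value is at most alpha gives a test of exact level alpha whenever alpha * (N + 1) is an integer, for instance N = 99 with alpha = 0.05; this is what allows the approach to avoid large-sample approximations.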

Proposition de combinaisons optimales de contractions volontaires maximales isométriques pour la normalisation de 12 muscles de l'épaule

Relevance: 60.00%

Abstract:

To be representative of a level of muscular effort, the electromyographic (EMG) signal is expressed relative to a maximal activation value. Because the shoulder is a complex joint and muscle structure, no maximal voluntary isometric contraction (MVIC) proposed in the literature maximally activates a given shoulder muscle across a group of individuals. The objective of this thesis is to develop a statistical approach for determining the optimal MVICs that maximize the activation levels of a set of shoulder muscles. The EMG signal amplitude of 12 shoulder muscles was recorded in 16 subjects while they performed 15 MVICs. A first, systematic approach identified the 4 MVICs among the 15 that together maximize the activation levels of the 12 muscles simultaneously. These 4 contractions yielded activation levels higher than those of previous recommendations for 4 shoulder muscles. A second approach determined the minimum number of MVICs needed to produce an activation level that is not significantly different from the maximal activation values for the 16 subjects. For 12 shoulder muscles, a total of 9 MVICs are required to produce activation values that are representative of the maximal effort of all subjects. This thesis proposed two original approaches: the first maximized the activation levels that can be produced from a fixed number of MVICs, while the second identified the minimum number of MVICs needed to produce activation levels that are not significantly different from the maximal activation values. These two approaches led to recommendations on the MVICs required for EMG normalization, in order to reduce the risk of underestimating the maximal effort of a group of individuals.

Développement d’un modèle de classification probabiliste pour la cartographie du couvert nival dans les bassins versants d’Hydro-Québec à l’aide de données de micro-ondes passives

Relevance: 60.00%

Abstract:

Every day, decisions must be made about the amount of hydroelectricity produced in Quebec. These decisions rely on forecasts of water inflows to the watersheds, produced with hydrological models. These models take several factors into account, notably the presence or absence of snow on the ground. This information is essential during the spring melt for anticipating upcoming inflows, since between 30 and 40% of the flood volume can come from the melting snow cover. Forecasters therefore need to track the evolution of the snow cover daily in order to adjust their forecasts to the melt. Methods for mapping snow on the ground are currently used at the Institut de recherche d'Hydro-Québec (IREQ), but they have some shortcomings. The objective of this thesis is to use passive microwave remote sensing data (the brightness temperature gradient in vertical polarization, GTV) with a statistical approach in order to produce snow/no-snow maps and to quantify the classification uncertainty. To this end, the GTV was used to compute a daily snow probability through mixtures of normal distributions within a Bayesian framework. These probabilities were then modeled using linear regression on the logits, and snow cover maps were produced. The model results were validated qualitatively and quantitatively, and their integration at Hydro-Québec was then discussed.
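The probability step described above can be illustrated with a two-component normal mixture and Bayes' rule. This is a minimal sketch under stated assumptions: the function names are hypothetical, the mixture parameters are supplied by the caller rather than estimated, and nothing here reproduces the thesis's fitted model.

```python
import math

def normal_pdf(x, mean, sd):
    """Density of a normal distribution with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def snow_probability(x, prior_snow, snow_params, no_snow_params):
    """Posterior probability of snow given an observed gradient value x.

    snow_params and no_snow_params are (mean, sd) pairs for the two
    mixture components; prior_snow is the mixing weight of the snow class.
    """
    weighted_snow = prior_snow * normal_pdf(x, *snow_params)
    weighted_none = (1.0 - prior_snow) * normal_pdf(x, *no_snow_params)
    return weighted_snow / (weighted_snow + weighted_none)

def logit(p):
    """Log-odds transform, the scale on which the probabilities can be regressed."""
    return math.log(p / (1.0 - p))
```

A pixel can then be mapped as snow when the posterior exceeds a chosen threshold, and the posterior itself serves as a measure of classification uncertainty.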

Development of a POS Tagger for Malayalam: An Experience

Relevance: 60.00%

Abstract:

A part-of-speech tagger for Malayalam that uses a stochastic approach is proposed. The tagger makes use of word frequencies and bigram statistics from a corpus. Because no annotated corpus is available for Malayalam, a morphological analyzer is used to generate the tagged corpus. Although the experiments were performed on a very small corpus, the results show that the statistical approach works well with a highly agglutinative language like Malayalam.
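A stochastic tagger built from word frequencies and bigram statistics can be sketched as a small hidden Markov model decoded with the Viterbi algorithm. The abstract does not give the model details, so the following is an illustration under assumptions: bigrams are taken over tags, add-one smoothing is used, and all function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

def train(tagged_sentences):
    """Count word-given-tag frequencies and tag bigrams from a tagged corpus."""
    emit = defaultdict(Counter)   # tag -> word counts
    trans = defaultdict(Counter)  # previous tag -> next tag counts
    for sentence in tagged_sentences:
        prev = "<s>"  # sentence-start marker
        for word, tag in sentence:
            emit[tag][word] += 1
            trans[prev][tag] += 1
            prev = tag
    return emit, trans

def log_prob(counter, key, n_outcomes):
    """Add-one smoothed log probability of key under the given counts."""
    return math.log((counter[key] + 1) / (sum(counter.values()) + n_outcomes))

def tag(words, emit, trans):
    """Return the most probable tag sequence for words (Viterbi decoding)."""
    tags = list(emit)
    vocab_size = len({w for c in emit.values() for w in c}) + 1  # +1 for unseen words
    # scores[t] = (log probability of the best path ending in t, that path)
    scores = {"<s>": (0.0, [])}
    for word in words:
        new_scores = {}
        for cur in tags:
            def path_score(prev):
                return scores[prev][0] + log_prob(trans[prev], cur, len(tags))
            best_prev = max(scores, key=path_score)
            total = path_score(best_prev) + log_prob(emit[cur], word, vocab_size)
            new_scores[cur] = (total, scores[best_prev][1] + [cur])
        scores = new_scores
    return max(scores.values(), key=lambda s: s[0])[1]
```

For a highly agglutinative language, the emission model would typically operate on the output of the morphological analyzer rather than on raw surface forms, since most inflected word forms are rare or unseen in a small corpus.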

Core sediment biogeochemistry in specific zones of Cochin Estuarine System (CES)

Relevance: 60.00%

Abstract:

Geochemical composition is a set of data for predicting the climatic conditions existing in an ecosystem. Both surficial and core sediment geochemistry are helpful in monitoring, assessing and evaluating the marine environment. The aim of this research is to assess the relationships between the biogeochemical constituents in the Cochin Estuarine System (CES) and their modifications after a long period of anoxia, and to identify the various processes that control the sediment composition in this region, through a multivariate statistical approach. The study of present core sediment geochemistry therefore has a critical role in establishing a benchmark for their characterization. Sediment cores from four prominent zones of the CES were examined for various biogeochemical aspects. The results have served as rejuvenating records for the prediction of the core sediment status prevailing in the CES.

Geochemical metal fractionation profile of the core sediment in the Cochin estuarine system

Relevance: 60.00%

Abstract:

Geochemical composition is a set of data for predicting the climatic conditions existing in an ecosystem. Both surficial and core sediment geochemistry are helpful in monitoring, assessing and evaluating the marine environment. The aim of this research is to assess the relationships between the biogeochemical constituents in the Cochin Estuarine System (CES) and their modifications after a long period of anoxia, and to identify the various processes that control the sediment composition in this region, through a multivariate statistical approach. The study of present core sediment geochemistry therefore has a critical role in establishing a benchmark for their characterization. Sediment cores from four prominent zones of the CES were examined for various biogeochemical aspects. The results have served as rejuvenating records for the prediction of the core sediment status prevailing in the CES.

Object Recognition with Pictorial Structures

Relevance: 60.00%

Abstract:

This thesis presents a statistical framework for object recognition. The framework is motivated by the pictorial structure models introduced by Fischler and Elschlager nearly 30 years ago. The basic idea is to model an object by a collection of parts arranged in a deformable configuration. The appearance of each part is modeled separately, and the deformable configuration is represented by spring-like connections between pairs of parts. These models allow for qualitative descriptions of visual appearance, and are suitable for generic recognition problems. The problem of detecting an object in an image and the problem of learning an object model using training examples are naturally formulated under a statistical approach. We present efficient algorithms to solve these problems in our framework. We demonstrate our techniques by training models to represent faces and human bodies. The models are then used to locate the corresponding objects in novel images.

A Factor analysis of hidrochemical composition of Llobregat river basin

Relevance: 60.00%

Abstract:

Hydrogeological research usually includes some statistical studies devised to elucidate the mean background state, characterise relationships among different hydrochemical parameters, and show the influence of human activities. These goals are achieved either by means of a statistical approach or by mixing models between end-members. Compositional data analysis has proved to be effective with the first approach, but there is no commonly accepted solution to the end-member problem in a compositional framework. We present here a possible solution based on factor analysis of compositions, illustrated with a case study. We find two factors on the compositional bi-plot by fitting two non-centered orthogonal axes to the most representative variables. Each of these axes defines a subcomposition, grouping those variables that lie nearest to it. With each subcomposition a log-contrast is computed and rewritten as an equilibrium equation. These two factors can be interpreted as the isometric log-ratio (ilr) coordinates of three hidden components, which can be plotted in a ternary diagram. These hidden components might be interpreted as end-members. We have analysed 14 molarities at 31 sampling stations all along the Llobregat River and its tributaries, with monthly measurements over two years. We have obtained a bi-plot explaining 57% of the total variance, from which we have extracted two factors: factor G, reflecting geological background enhanced by potash mining; and factor A, essentially controlled by urban and/or farming wastewater. Graphical representation of these two factors allows us to identify three extreme samples, corresponding to pristine waters, potash mining influence and urban sewage influence. To confirm this, we have available analyses of diffuse and widespread point sources identified in the area: springs, potash mining leachates, sewage, and fertilisers. Each of these sources shows a clear link with one of the extreme samples, except fertilisers, owing to the heterogeneity of their composition. This approach is a useful tool to distinguish end-members and characterise them, an issue generally difficult to solve. It is worth noting that the end-member composition cannot be fully estimated but only characterised through log-ratio relationships among components. Moreover, the influence of each end-member in a given sample must be evaluated in relative terms with respect to the other samples. These limitations are intrinsic to the relative nature of compositional data.

El proceso de universalización de la Enseñanza Secundaria en España en la segunda mitad del siglo XX : una aproximación estadística

Relevance: 60.00%

Abstract:

Abstract taken from the publication.
