885 resultados para Bayesian ridge regression
Resumo:
Este estudo tem como objetivo analisar o desempenho de vários modelos econométricos ao prever Inflação . Iniciamos o trabalho utilizando como base de comparação para todos os modelos a tradicional curva de Phillips que usa a taxa de desemprego como variável explicativa para diferenças de preço. Dentre os modelos analisados temos univariados e bivariados, sendo estes últimos uma curva de Phillips alternativa já que apenas sustitui a variável desemprego por outra variável macroeconômica. Além destes modelos também comparamos o desempenho de previsão de modelos que usam como covariadas uma combinação das previsões dos modelos anteriores (univariados e bivariados). O resultado deste estudo aponta a combinação de modelos por "ridge regression" como uma técnica - dentre as analisadas para combinação de previsões - de menor erro de previsão sempre; sendo alcançado pela combinação da média em apenas um dos casos analisados. No entanto, a combinação de previsões não apresentou melhor resultado que algumas das covariadas testadas em modelos bivariados
Resumo:
OBJECTIVE: To estimate the pretest probability of Cushing's syndrome (CS) diagnosis by a Bayesian approach using intuitive clinical judgment. MATERIALS AND METHODS: Physicians were requested, in seven endocrinology meetings, to answer three questions: "Based on your personal expertise, after obtaining clinical history and physical examination, without using laboratorial tests, what is your probability of diagnosing Cushing's Syndrome?"; "For how long have you been practicing Endocrinology?"; and "Where do you work?". A Bayesian beta regression, using the WinBugs software was employed. RESULTS: We obtained 294 questionnaires. The mean pretest probability of CS diagnosis was 51.6% (95%CI: 48.7-54.3). The probability was directly related to experience in endocrinology, but not with the place of work. CONCLUSION: Pretest probability of CS diagnosis was estimated using a Bayesian methodology. Although pretest likelihood can be context-dependent, experience based on years of practice may help the practitioner to diagnosis CS. Arq Bras Endocrinol Metab. 2012;56(9):633-7
Resumo:
Ecosystems are faced with high rates of species loss which has consequences for their functions and services. To assess the effects of plant species diversity on the nitrogen (N) cycle, we developed a model for monthly mean nitrate (NO3-N) concentrations in soil solution in 0-30 cm mineral soil depth using plant species and functional group richness and functional composition as drivers and assessing the effects of conversion of arable land to grassland, spatially heterogeneous soil properties, and climate. We used monthly mean NO3-N concentrations from 62 plots of a grassland plant diversity experiment from 2003 to 2006. Plant species richness (1-60) and functional group composition (1-4 functional groups: legumes, grasses, non-leguminous tall herbs, non-leguminous small herbs) were manipulated in a factorial design. Plant community composition, time since conversion from arable land to grassland, soil texture, and climate data (precipitation, soil moisture, air and soil temperature) were used to develop one general Bayesian multiple regression model for the 62 plots to allow an in-depth evaluation using the experimental design. The model simulated NO3-N concentrations with an overall Bayesian coefficient of determination of 0.48. The temporal course of NO3-N concentrations was simulated differently well for the individual plots with a maximum plot-specific Nash-Sutcliffe Efficiency of 0.57. The model shows that NO3-N concentrations decrease with species richness, but this relation reverses if more than approx. 25 % of legume species are included in the mixture. Presence of legumes increases and presence of grasses decreases NO3-N concentrations compared to mixtures containing only small and tall herbs. Altogether, our model shows that there is a strong influence of plant community composition on NO3-N concentrations.
Resumo:
The main aim of this paper is to provide a tutorial on regression with Gaussian processes. We start from Bayesian linear regression, and show how by a change of viewpoint one can see this method as a Gaussian process predictor based on priors over functions, rather than on priors over parameters. This leads in to a more general discussion of Gaussian processes in section 4. Section 5 deals with further issues, including hierarchical modelling and the setting of the parameters that control the Gaussian process, the covariance functions for neural network models and the use of Gaussian processes in classification problems.
Resumo:
Multiple linear regression model plays a key role in statistical inference and it has extensive applications in business, environmental, physical and social sciences. Multicollinearity has been a considerable problem in multiple regression analysis. When the regressor variables are multicollinear, it becomes difficult to make precise statistical inferences about the regression coefficients. There are some statistical methods that can be used, which are discussed in this thesis are ridge regression, Liu, two parameter biased and LASSO estimators. Firstly, an analytical comparison on the basis of risk was made among ridge, Liu and LASSO estimators under orthonormal regression model. I found that LASSO dominates least squares, ridge and Liu estimators over a significant portion of the parameter space for large dimension. Secondly, a simulation study was conducted to compare performance of ridge, Liu and two parameter biased estimator by their mean squared error criterion. I found that two parameter biased estimator performs better than its corresponding ridge regression estimator. Overall, Liu estimator performs better than both ridge and two parameter biased estimator.
Resumo:
An important statistical development of the last 30 years has been the advance in regression analysis provided by generalized linear models (GLMs) and generalized additive models (GAMs). Here we introduce a series of papers prepared within the framework of an international workshop entitled: Advances in GLMs/GAMs modeling: from species distribution to environmental management, held in Riederalp, Switzerland, 6-11 August 2001.We first discuss some general uses of statistical models in ecology, as well as provide a short review of several key examples of the use of GLMs and GAMs in ecological modeling efforts. We next present an overview of GLMs and GAMs, and discuss some of their related statistics used for predictor selection, model diagnostics, and evaluation. Included is a discussion of several new approaches applicable to GLMs and GAMs, such as ridge regression, an alternative to stepwise selection of predictors, and methods for the identification of interactions by a combined use of regression trees and several other approaches. We close with an overview of the papers and how we feel they advance our understanding of their application to ecological modeling.
Resumo:
Objective: To assess the associations between obesity markers (BMI, waist circumference and %body fat) and inflammatory markers (interleukin-1β (IL-1β); interleukin-6 (IL-6); tumor necrosis factor-α (TNF-α) and high-sensitivity C-reactive protein (hs-CRP)). Methods: Population sample of 2,884 men and 3,201 women aged 35-75 years. Associations were assessed using ridge regression adjusting for age, leisure-time physical activity, and smoking. Results: No differences were found in IL-1β levels between participants with increased obesity markers and healthy counterparts; multivariate regression showed %body fat to be negatively associated with IL-1β. Participants with high %body fat or abdominal obesity had higher IL-6 levels, but no independent association between IL-6 levels and obesity markers was found on multivariate regression. Participants with abdominal obesity had higher TNF-α levels, and positive associations were found between TNF-α levels and waist circumference in men and between TNF-α levels and BMI in women. Obese participants had higher hs-CRP levels, and these differences persisted after multivariate adjustment; similarly, positive associations were found between hs-CRP levels and all obesity markers studied. Conclusion: Obesity markers are differentially associated with cytokine levels. %Body fat is negatively associated with IL-1β; BMI (in women) and waist circumference (in men) are associated with TNF-α; all obesity markers are positively associated with hs-CRP. Copyright © 2012 S. Karger GmbH, Freiburg.
Resumo:
Työn taustalla oli tavoite parantaa erään teollisuusprosessin toimintaa ja sen tuottoa mallintamalla reaktiovaiheen alussa tapahtuvan välituotteen muodostumisen reaktiokinetiikka sekä perinteisellä tavalla että implisiittisellä kalibroinnilla. Toisena tavoitteena oli selvittää, kuinka implisiittistä kalibrointia voidaan yleisemmin hyödyntää kemiantekniikassa. Implisiittinen kalibrointi on menetelmä, jolla voidaan ratkaista jonkin teoreettisen mallin parametrit suoraan epäsuorasta mittausdatasta (esimerkiksi spektreistä) lähes kokonaan ilman off-line analyysejä. Tämän työn kirjallisuusosassa on esitetty implisiittisen kalibroinnin toimintaperiaate sekä lyhyesti FTIR-spektrometrian perusteita. Työn kokeellisessa osassa on estimoitu tutkitun välituotteen muodostumisen kineettiset parametrit sekä tavanomaisella parametriestimoinnilla että implisiittisellä kalibroinnilla. Lisäksi kokeellisessa osassa on selvitetty lyhyesti tutkitun prosessin FTIR-spektrien lämpötilariippuvuuksia ja esitetty neljä mahdollista uutta sovelluskohdetta implisiittiselle kalibroinnille. Tavanomaisella parametriestimoinnilla saatiin estimoitua varsin yksiselitteiset arvot kineettisille parametreille. Myös mallin sovitus koedataan on hyvä kolmessa kokeessa viidestä. Parametriestimointi implisiittisellä kalibroinnilla onnistui lupaavasti vaikka tulokset eivät ole aivan niin hyviä kuin tavanomaisessa parametriestimoinnissa. Parhaat tulokset implisiittisessä kalibroinnissa saavutettiin suoralla kalibrointitavalla GRR (Generalized Ridge Regression)-kalibrointimenetelmää käyttämällä.
Resumo:
Understanding the basis on which recruiters form hirability impressions for a job applicant is a key issue in organizational psychology and can be addressed as a social computing problem. We approach the problem from a face-to-face, nonverbal perspective where behavioral feature extraction and inference are automated. This paper presents a computational framework for the automatic prediction of hirability. To this end, we collected an audio-visual dataset of real job interviews where candidates were applying for a marketing job. We automatically extracted audio and visual behavioral cues related to both the applicant and the interviewer. We then evaluated several regression methods for the prediction of hirability scores and showed the feasibility of conducting such a task, with ridge regression explaining 36.2% of the variance. Feature groups were analyzed, and two main groups of behavioral cues were predictive of hirability: applicant audio features and interviewer visual cues, showing the predictive validity of cues related not only to the applicant, but also to the interviewer. As a last step, we analyzed the predictive validity of psychometric questionnaires often used in the personnel selection process, and found that these questionnaires were unable to predict hirability, suggesting that hirability impressions were formed based on the interaction during the interview rather than on questionnaire data.
Resumo:
Patients with chronic pancreatitis may have abnormal gastrointestinal transit, but the factors underlying these abnormalities are poorly understood. Gastrointestinal transit was assessed, in 40 male outpatients with alcohol-related chronic pancreatitis and 18 controls, by scintigraphy after a liquid meal labeled with (99m)technetium-phytate. Blood and urinary glucose, fecal fat excretion, nutritional status, and cardiovascular autonomic function were determined in all patients. The influence of diabetes mellitus, malabsorption, malnutrition, and autonomic neuropathy on abnormal gastrointestinal transit was assessed by univariate analysis and Bayesian multiple regression analysis. Accelerated gastrointestinal transit was found in 11 patients who showed abnormally rapid arrival of the meal marker to the cecum. Univariate and Bayesian analysis showed that diabetes mellitus and autonomic neuropathy had significant influences on rapid transit, which was not associated with either malabsorption or malnutrition. In conclusion, rapid gastrointestinal transit in patients with alcohol-related chronic pancreatitis is related to diabetes mellitus and autonomic neuropathy.
Resumo:
This thesis develops and evaluates statistical methods for different types of genetic analyses, including quantitative trait loci (QTL) analysis, genome-wide association study (GWAS), and genomic evaluation. The main contribution of the thesis is to provide novel insights in modeling genetic variance, especially via random effects models. In variance component QTL analysis, a full likelihood model accounting for uncertainty in the identity-by-descent (IBD) matrix was developed. It was found to be able to correctly adjust the bias in genetic variance component estimation and gain power in QTL mapping in terms of precision. Double hierarchical generalized linear models, and a non-iterative simplified version, were implemented and applied to fit data of an entire genome. These whole genome models were shown to have good performance in both QTL mapping and genomic prediction. A re-analysis of a publicly available GWAS data set identified significant loci in Arabidopsis that control phenotypic variance instead of mean, which validated the idea of variance-controlling genes. The works in the thesis are accompanied by R packages available online, including a general statistical tool for fitting random effects models (hglm), an efficient generalized ridge regression for high-dimensional data (bigRR), a double-layer mixed model for genomic data analysis (iQTL), a stochastic IBD matrix calculator (MCIBD), a computational interface for QTL mapping (qtl.outbred), and a GWAS analysis tool for mapping variance-controlling loci (vGWAS).
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The study of function approximation is motivated by the human limitation and inability to register and manipulate with exact precision the behavior variations of the physical nature of a phenomenon. These variations are referred to as signals or signal functions. Many real world problem can be formulated as function approximation problems and from the viewpoint of artificial neural networks these can be seen as the problem of searching for a mapping that establishes a relationship from an input space to an output space through a process of network learning. Several paradigms of artificial neural networks (ANN) exist. Here we will be investigated a comparative of the ANN study of RBF with radial Polynomial Power of Sigmoids (PPS) in function approximation problems. Radial PPS are functions generated by linear combination of powers of sigmoids functions. The main objective of this paper is to show the advantages of the use of the radial PPS functions in relationship traditional RBF, through adaptive training and ridge regression techniques.
Resumo:
Os objetivos neste trabalho foram comparar estimativas de parâmetros genéticos obtidas por meio de dois modelos - um contendo apenas efeitos aditivos e de dominância e outro que incluiu os efeitos aditivo-conjunto (complementaridade) e epistático - e testar alternativas de critérios objetivos para determinação do coeficiente lambda na aplicação da regressão de cumeeira. Os resultados obtidos revelaram que a escolha de um critério para determinação do coeficiente lambda em regressão de cumeeira depende não apenas do conjunto de dados e do modelo utilizado, mas, sobretudo, de um conhecimento prévio acerca do fenômeno estudado e do significado prático e da interpretação dos parâmetros encontrados. Pelo uso de modelos mais completos para avaliação de efeitos genéticos em bovinos de corte, pode-se identificar a contribuição dos efeitos aditivo-conjunto e epistático, que encontram-se embutidos no efeito de heterose estimado por modelos mais simples. A regressão de cumeeira é uma ferramenta que viabiliza a obtenção dessas estimativas mesmo na presença de forte multicolinearidade.
Resumo:
Objetivou-se obter estimativas de efeitos genéticos aditivos e não-aditivos para as características pré e pós-desmama de animais Hereford x Nelore por meio de análises de regressão linear múltipla, com e sem o uso da técnica de regressão de cumeeira. Avaliaram-se as características ganho médio diário do nascimento à desmama, conformação, precocidade e musculatura à desmama, ganho médio diário da desmama ao sobreano, conformação, precocidade e musculatura ao sobreano e perímetro escrotal ajustado para idade e para idade e peso. Os resultados obtidos sem o uso da técnica indicaram valores acentuados dos fatores de inflação da variância. Para melhor interpretar os efeitos estimados, foram preditos os desempenhos de cinco gerações na formação do Braford ½ em relação à raça Hereford, partindo de vacas da raça Nelore. Os animais da geração F1 apresentaram alto desempenho, em razão do benefício máximo da heterose direta e do efeito aditivo materno. A manifestação completa da epistasia direta reduziu significativamente os desempenhos dos animais da geração F2. Para as características de desmama, os animais da geração F3 mostraram desempenhos menores, em virtude do efeito epistático materno máximo, uma vez que suas mães eram da geração F2. Os valores destas características estabilizaram na geração F4, próximos aos valores apresentados pela raça Nelore. Os desempenhos das gerações F3 e F4 para as características pós-desmama e os valores das estabilizações foram próximos ou superiores aos obtidos na geração F2.