838 resultados para linear mixed-effects models
Resumo:
When an automobile passes over a bridge dynamic effects are produced in vehicle and structure. In addition, the bridge itself moves when exposed to the wind inducing dynamic effects on the vehicle that have to be considered. The main objective of this work is to understand the influence of the different parameters concerning the vehicle, the bridge, the road roughness or the wind in the comfort and safety of the vehicles when crossing bridges. Non linear finite element models are used for structures and multibody dynamic models are employed for vehicles. The interaction between the vehicle and the bridge is considered by contact methods. Road roughness is described by the power spectral density (PSD) proposed by the ISO 8608. To consider that the profiles under right and left wheels are different but not independent, the hypotheses of homogeneity and isotropy are assumed. To generate the wind velocity history along the road the Sandia method is employed. The global problem is solved by means of the finite element method. First the methodology for modelling the interaction is verified in a benchmark. Following, the case of a vehicle running along a rigid road and subjected to the action of the turbulent wind is analyzed and the road roughness is incorporated in a following step. Finally the flexibility of the bridge is added to the model by making the vehicle run over the structure. The application of this methodology will allow to understand the influence of the different parameters in the comfort and safety of road vehicles crossing wind exposed bridges. Those results will help to recommend measures to make the traffic over bridges more reliable without affecting the structural integrity of the viaduct
Resumo:
Los bosques húmedos de montaña se encuentran reconocidos como uno de los ecosistemas más amenazados en el mundo, llegando inclusive a ser considerado como un “hotspot” por su alta diversidad y endemismo. La acelerada pérdida de cobertura vegetal de estos bosques ha ocasionado que, en la actualidad, se encuentren restringidos a una pequeña fracción de su área de distribución histórica. Pese a esto, los estudios realizados sobre cual es efecto de la deforestación, fragmentación, cambios de uso de suelo y su efecto en las comunidades de plantas presentes en este tipo de vegetación aún son muy escuetos, en comparación a los realizados con sus similares amazónicos. En este trabajo, el cual se encuentra dividido en seis capítulos, abordaremos los siguientes objetivos: a) Comprender cuál es la dinámica que han seguido los diferentes tipos de bosques montanos andinos de la cuenca del Rio Zamora, Sur de Ecuador durante entre 1976 y 2002. b) Proveer de evidencia de las tasas de deforestación y fragmentación de todos los tipos diferentes de bosques montanos andinos presentes en la cuenca del Rio Zamora, Sur de Ecuador entre 1976 y 2002. c) Determinar qué factores inducen a la fragmentación de bosques de montaña en la cuenca alta del río Zamora entre 1976 y 2002. d) Determinar cuáles son y cómo afectan los factores ambientales y socioeconómicos a la dinámica de la deforestación y regeneración (pérdida y recuperación del hábitat) sufrida por los bosques de montaña dentro de la zona de estudio y e) Determinar si la deforestación y fragmentación actúan sobre la diversidad y estructura de las comunidades de tres tipos de organismos (comunidades de árboles, comunidades de líquenes epífitos y comunidades de hepáticas epífitas). Este estudio se centró en el cuenca alta del río Zamora, localizada al sur de Ecuador entre las coordenadas 3º 00´ 53” a 4º 20´ 24.65” de latitud sur y 79º 49´58” a 78º 35´ 38” de longitud oeste, que cubre alrededor de 4300 km2 de territorio situado entre las capitales de las provincias de Loja y Zamora-Chinchipe. Con objeto de predecir la dinámica futura de la deforestación en la región de Loja y cómo se verán afectados los diferentes tipos de hábitat, así como para detectar los factores que más influyen en dicha dinámica, se han construido modelos basados en la historia de la deforestación derivados de fotografías aéreas e imágenes satelitales de tres fechas (1976, 1989 y 2002). La cuantificación de la deforestación se realizó mediante la tasa de interés compuesto y para la caracterización de la configuración espacial de los fragmentos de bosque nativo se calcularon índices de paisaje los cuales fueron calculados utilizando el programa Fragstats 3.3. Se ha clasificado el recubrimiento del terreno en forestal y no forestal y se ha modelado su evolución temporal con Modelos Lineales Generalizados Mixtos (GLMM), empleando como variables explicativas tanto variables ambientales espacialmente explícitas (altitud, orientación, pendiente, etc) como antrópicas (distancia a zonas urbanizadas, deforestadas, caminos, entre otras). Para medir el efecto de la deforestación sobre las comunidades modelo (de árboles, líquenes y hepáticas) se monitorearon 11 fragmentos de vegetación de distinto tamaño: dos fragmentos de más de cien hectáreas, tres fragmentos de entre diez y noventa ha y seis fragmentos de menos de diez hectáreas. En ellos se instalaron un total de 38 transectos y 113 cuadrantes de 20 x 20 m a distancias que se alejaban progresivamente del borde en 10, 40 y 80 m. Nuestros resultados muestran una tasa media anual de deforestación del 1,16% para todo el período de estudio, que el tipo de vegetación que más alta tasa de destrucción ha sufrido, es el páramo herbáceo, con un 2,45% anual. El análisis de los patrones de fragmentación determinó un aumento en 2002 de más del doble de fragmentos presentes en 1976, lo cual se repite en el análisis del índice de densidad promedio. El índice de proximidad media entre fragmentos muestra una reducción progresiva de la continuidad de las áreas forestadas. Si bien las formas de los fragmentos se han mantenido bastante similares a lo largo del período de estudio, la conectividad entre estos ha disminuido en un 84%. Por otro lado, de nuestros análisis se desprende que las zonas con mayor probabilidad de deforestarse son aquellas que están cercanas a zonas previamente deforestadas; la cercanía a las vías también influye significativamente en la deforestación, causando un efecto directo en la composición y estructura de las comunidades estudiadas, que en el caso de los árboles viene mediado por el tamaño del fragmento y en el caso del componente epífito (hepáticas y líquenes), viene mediado tanto por el tamaño del fragmento como por la distancia al borde del mismo. Se concluye la posibilidad de que, de mantenerse esta tendencia, este tipo de bosques desaparecerá en corto tiempo y los servicios ecosistémicos que prestan, se verán seriamente comprometidos. ABSTRACT Mountain rainforests are recognized as one of the most threatened ecosystems in the world, and have even come to be considered as a “hotspot” due to their high degree of diversity and endemism. The accelerated loss of plant cover of these forests has caused them to be restricted today to a small fraction of their area of historic distribution. In spite of this, studies done on the effect of deforestation, fragmentation, changes in soil use and their effect on the plant communities present in this type of vegetation are very brief compared to those done on their analogues in the Amazon region. In this study, which is divided into six chapters, we will address the following objectives: a) To understand what the dynamic followed by the different types of Andean mountain forests in the Zamora River watershed of southern Ecuador has been between 1976 and 2002. b) To provide evidence of the rates of deforestation and fragmentation of all the different types of Andean mountain forests existing in the upper watershed of the Zamora River between 1976 and 2002. c) To determine the factors that induces fragmentation of all different types of Andean mountain forests existing in the upper watershed of the Zamora River between 1976 and 2002. d) To determine what the environmental and anthropogenic factors are driving the dynamic of deforestation and regeneration (loss and recuperation of the habitat) suffered by the mountain forests in the area of the study and e) To determine if the deforestation and fragmentation act upon the diversity and structure of three model communities: trees, epiphytic lichens and epiphytic liverworts. This study is centered on the upper Zamora River watershed, located in southern Ecuador between 3º 00´ 53” and 4º 20´ 24.65 south latitude and 79º 49´ 58” to 78º 35´ 38” west longitude, and covers around 4,300 km2 of territory located between Loja and Zamora-Chinchipe provinces. For the purpose of predicting the future dynamic of deforestation in the Loja region and how different types of habitats will be affected, as well as detecting the environmental and socioeconomic factors that influence landscape dynamics, models were constructed based on deforestation history, derived from aerial photographs and satellite images for three dates (1976, 1989 and 2002). Quantifying the deforestation was done using the compound interest rate; to characterize the spatial configuration of fragments of native forest, landscape indices were calculated with Fragstats 3.3 program. Land cover was classified as forested and not forested and its evolution over time was modeled with Generalized Linear Mixed Models (GLMM), using spatially explicit environmental variables (altitude, orientation, slope, etc.) as well as anthropic variables (distance to urbanized, deforested areas and roads, among others) as explanatory variables. To measure the effects of fragmentation on three types of model communities (forest trees and epiphytic lichen and liverworts), 11 vegetation fragments of different sizes were monitored: two fragments of more than one hundred hectares, three fragments of between ten and ninety ha and six fragments of fewer than ten hectares . In these fragments, a total of 38 transects and 113 20 x 20 m quadrats were installed at distances that progressively moved away from the edge of the fragment by 10, 40 and 80 m. Our results show an average annual rate of deforestation of 1.16% for the entire period of the study, and that the type of vegetation that suffered the highest rate of destruction was grassy paramo, with an annual rate of 2.45%. The analysis of fragmentation patterns determined the number of fragments in 2002 more than doubled the number of fragments present in 1976, and the same occurred for the average density index. The variation of the average proximity index among fragments showed a progressive reduction of the continuity of forested areas. Although fragment shapes have remained quite similar over the period of the study, connectivity among them has diminished by 84%. On the other hand, it emerged from our analysis that the areas of greatest probability of deforestation were those that are close to previously deforested areas; proximity to roads also significantly favored the deforestation causing a direct effect on the composition of our model communities, that in the case of forest trees is determined by the size of the fragment, and in the case of the epiphyte communities (liverworts and lichens), is determined, by the size of the fragment as well as the distance to edge. A subject under discussion is the possibility that if this tendency continues, this type of forest will disappear in a short time, and the ecological services it provides, will be seriously endangered.
Resumo:
There has been a recent burst of activity in the atmosphere/ocean sciences community in utilizing stable linear Langevin stochastic models for the unresolved degree of freedom in stochastic climate prediction. Here several idealized models for stochastic climate modeling are introduced and analyzed through unambiguous mathematical theory. This analysis demonstrates the potential need for more sophisticated models beyond stable linear Langevin equations. The new phenomena include the emergence of both unstable linear Langevin stochastic models for the climate mean and the need to incorporate both suitable nonlinear effects and multiplicative noise in stochastic models under appropriate circumstances. The strategy for stochastic climate modeling that emerges from this analysis is illustrated on an idealized example involving truncated barotropic flow on a beta-plane with topography and a mean flow. In this example, the effect of the original 57 degrees of freedom is well represented by a theoretically predicted stochastic model with only 3 degrees of freedom.
Resumo:
O objetivo dessa pesquisa foi avaliar aspectos genéticos que relacionados à produção in vitro de embriões na raça Guzerá. O primeiro estudo focou na estimação de (co) variâncias genéticas e fenotípicas em características relacionadas a produção de embriões e na detecção de possível associação com a idade ao primeiro parto (AFC). Foi detectada baixa e média herdabilidade para características relacionadas à produção de oócitos e embriões. Houve fraca associação genética entre características ligadas a reprodução artificial e a idade ao primeiro parto. O segundo estudo avaliou tendências genéticas e de endogamia em uma população Guzerá no Brasil. Doadoras e embriões produzidos in vitro foram considerados como duas subpopulações de forma a realizar comparações acerca das diferenças de variação anual genética e do coeficiente de endogamia. A tendência anual do coeficiente de endogamia (F) foi superior para a população geral, sendo detectado efeito quadrático. No entanto, a média de F para a sub- população de embriões foi maior do que na população geral e das doadoras. Foi observado ganho genético anual superior para a idade ao primeiro parto e para a produção de leite (305 dias) entre embriões produzidos in vitro do que entre doadoras ou entre a população geral. O terceiro estudo examinou os efeitos do coeficiente de endogamia da doadora, do reprodutor (usado na fertilização in vitro) e dos embriões sobre resultados de produção in vitro de embriões na raça Guzerá. Foi detectado efeito da endogamia da doadora e dos embriões sobre as características estudadas. O quarto (e último) estudo foi elaborado para comparar a adequação de modelos mistos lineares e generalizados sob método de Máxima Verossimilhança Restrita (REML) e sua adequação a variáveis discretas. Quatro modelos hierárquicos assumindo diferentes distribuições para dados de contagem encontrados no banco. Inferência foi realizada com base em diagnósticos de resíduo e comparação de razões entre componentes de variância para os modelos em cada variável. Modelos Poisson superaram tanto o modelo linear (com e sem transformação da variável) quanto binomial negativo à qualidade do ajuste e capacidade preditiva, apesar de claras diferenças observadas na distribuição das variáveis. Entre os modelos testados, a pior qualidade de ajuste foi obtida para o modelo linear mediante transformação logarítmica (Log10 X +1) da variável resposta.
Resumo:
Ce mémoire a été effectué dans le cadre d'une étude pour le Ministère des Transports.
Resumo:
Ce mémoire a été effectué dans le cadre d'une étude pour le Ministère des Transports.
Resumo:
The modelling of inpatient length of stay (LOS) has important implications in health care studies. Finite mixture distributions are usually used to model the heterogeneous LOS distribution, due to a certain proportion of patients sustaining-a longer stay. However, the morbidity data are collected from hospitals, observations clustered within the same hospital are often correlated. The generalized linear mixed model approach is adopted to accommodate the inherent correlation via unobservable random effects. An EM algorithm is developed to obtain residual maximum quasi-likelihood estimation. The proposed hierarchical mixture regression approach enables the identification and assessment of factors influencing the long-stay proportion and the LOS for the long-stay patient subgroup. A neonatal LOS data set is used for illustration, (C) 2003 Elsevier Science Ltd. All rights reserved.
Resumo:
The paper investigates a Bayesian hierarchical model for the analysis of categorical longitudinal data from a large social survey of immigrants to Australia. Data for each subject are observed on three separate occasions, or waves, of the survey. One of the features of the data set is that observations for some variables are missing for at least one wave. A model for the employment status of immigrants is developed by introducing, at the first stage of a hierarchical model, a multinomial model for the response and then subsequent terms are introduced to explain wave and subject effects. To estimate the model, we use the Gibbs sampler, which allows missing data for both the response and the explanatory variables to be imputed at each iteration of the algorithm, given some appropriate prior distributions. After accounting for significant covariate effects in the model, results show that the relative probability of remaining unemployed diminished with time following arrival in Australia.
Resumo:
Many variables that are of interest in social science research are nominal variables with two or more categories, such as employment status, occupation, political preference, or self-reported health status. With longitudinal survey data it is possible to analyse the transitions of individuals between different employment states or occupations (for example). In the statistical literature, models for analysing categorical dependent variables with repeated observations belong to the family of models known as generalized linear mixed models (GLMMs). The specific GLMM for a dependent variable with three or more categories is the multinomial logit random effects model. For these models, the marginal distribution of the response does not have a closed form solution and hence numerical integration must be used to obtain maximum likelihood estimates for the model parameters. Techniques for implementing the numerical integration are available but are computationally intensive requiring a large amount of computer processing time that increases with the number of clusters (or individuals) in the data and are not always readily accessible to the practitioner in standard software. For the purposes of analysing categorical response data from a longitudinal social survey, there is clearly a need to evaluate the existing procedures for estimating multinomial logit random effects model in terms of accuracy, efficiency and computing time. The computational time will have significant implications as to the preferred approach by researchers. In this paper we evaluate statistical software procedures that utilise adaptive Gaussian quadrature and MCMC methods, with specific application to modeling employment status of women using a GLMM, over three waves of the HILDA survey.
Resumo:
Background: Oral itraconazole (ITRA) is used for the treatment of allergic bronchopulmonary aspergillosis in patients with cystic fibrosis (CF) because of its antifungal activity against Aspergillus species. ITRA has an active hydroxy-metabolite (OH-ITRA) which has similar antifungal activity. ITRA is a highly lipophilic drug which is available in two different oral formulations, a capsule and an oral solution. It is reported that the oral solution has a 60% higher relative bioavailability. The influence of altered gastric physiology associated with CF on the pharmacokinetics (PK) of ITRA and its metabolite has not been previously evaluated. Objectives: 1) To estimate the population (pop) PK parameters for ITRA and its active metabolite OH-ITRA including relative bioavailability of the parent after administration of the parent by both capsule and solution and 2) to assess the performance of the optimal design. Methods: The study was a cross-over design in which 30 patients received the capsule on the first occasion and 3 days later the solution formulation. The design was constrained to have a maximum of 4 blood samples per occasion for estimation of the popPK of both ITRA and OH-ITRA. The sampling times for the population model were optimized previously using POPT v.2.0.[1] POPT is a series of applications that run under MATLAB and provide an evaluation of the information matrix for a nonlinear mixed effects model given a particular design. In addition it can be used to optimize the design based on evaluation of the determinant of the information matrix. The model details for the design were based on prior information obtained from the literature, which suggested that ITRA may have either linear or non-linear elimination. The optimal sampling times were evaluated to provide information for both competing models for the parent and metabolite and for both capsule and solution simultaneously. Blood samples were assayed by validated HPLC.[2] PopPK modelling was performed using FOCE with interaction under NONMEM, version 5 (level 1.1; GloboMax LLC, Hanover, MD, USA). The PK of ITRA and OH‑ITRA was modelled simultaneously using ADVAN 5. Subsequently three methods were assessed for modelling concentrations less than the LOD (limit of detection). These methods (corresponding to methods 5, 6 & 4 from Beal[3], respectively) were (a) where all values less than LOD were assigned to half of LOD, (b) where the closest missing value that is less than LOD was assigned to half the LOD and all previous (if during absorption) or subsequent (if during elimination) missing samples were deleted, and (c) where the contribution of the expectation of each missing concentration to the likelihood is estimated. The LOD was 0.04 mg/L. The final model evaluation was performed via bootstrap with re-sampling and a visual predictive check. The optimal design and the sampling windows of the study were evaluated for execution errors and for agreement between the observed and predicted standard errors. Dosing regimens were simulated for the capsules and the oral solution to assess their ability to achieve ITRA target trough concentration (Cmin,ss of 0.5-2 mg/L) or a combined Cmin,ss for ITRA and OH-ITRA above 1.5mg/L. Results and Discussion: A total of 241 blood samples were collected and analysed, 94% of them were taken within the defined optimal sampling windows, of which 31% where taken within 5 min of the exact optimal times. Forty six per cent of the ITRA values and 28% of the OH-ITRA values were below LOD. The entire profile after administration of the capsule for five patients was below LOD and therefore the data from this occasion was omitted from estimation. A 2-compartment model with 1st order absorption and elimination best described ITRA PK, with 1st order metabolism of the parent to OH-ITRA. For ITRA the clearance (ClItra/F) was 31.5 L/h; apparent volumes of central and peripheral compartments were 56.7 L and 2090 L, respectively. Absorption rate constants for capsule (kacap) and solution (kasol) were 0.0315 h-1 and 0.125 h-1, respectively. Comparative bioavailability of the capsule was 0.82. There was no evidence of nonlinearity in the popPK of ITRA. No screened covariate significantly improved the fit to the data. The results of the parameter estimates from the final model were comparable between the different methods for accounting for missing data, (M4,5,6)[3] and provided similar parameter estimates. The prospective application of an optimal design was found to be successful. Due to the sampling windows, most of the samples could be collected within the daily hospital routine, but still at times that were near optimal for estimating the popPK parameters. The final model was one of the potential competing models considered in the original design. The asymptotic standard errors provided by NONMEM for the final model and empirical values from bootstrap were similar in magnitude to those predicted from the Fisher Information matrix associated with the D-optimal design. Simulations from the final model showed that the current dosing regimen of 200 mg twice daily (bd) would provide a target Cmin,ss (0.5-2 mg/L) for only 35% of patients when administered as the solution and 31% when administered as capsules. The optimal dosing schedule was 500mg bd for both formulations. The target success for this dosing regimen was 87% for the solution with an NNT=4 compared to capsules. This means, for every 4 patients treated with the solution one additional patient will achieve a target success compared to capsule but at an additional cost of AUD $220 per day. The therapeutic target however is still doubtful and potential risks of these dosing schedules need to be assessed on an individual basis. Conclusion: A model was developed which described the popPK of ITRA and its main active metabolite OH-ITRA in adult CF after administration of both capsule and solution. The relative bioavailability of ITRA from the capsule was 82% that of the solution, but considerably more variable. To incorporate missing data, using the simple Beal method 5 (using half LOD for all samples below LOD) provided comparable results to the more complex but theoretically better Beal method 4 (integration method). The optimal sparse design performed well for estimation of model parameters and provided a good fit to the data.
Resumo:
Differences in lipid metabolism associate with age-related disease development and lifespan. Inflammation is a common link between metabolic dysregulation and aging. Saturated fatty acids (FAs) initiate pro-inflammatory signalling from many cells including monocytes; however, no existing studies have quantified age-associated changes in individual FAs in relation to inflammatory phenotype. Therefore, we have determined the plasma concentrations of distinct FAs by gas chromatography in 26 healthy younger individuals (age < 30 years) and 21 healthy FA individuals (age > 50 years). Linear mixed models were used to explore the association between circulating FAs, age and cytokines. We showed that plasma saturated, poly- and mono-unsaturated FAs increase with age. Circulating TNF-α and IL-6 concentrations increased with age, whereas IL-10 and TGF-β1 concentrations decreased. Oxidation of MitoSOX Red was higher in leucocytes from FA adults, and plasma oxidized glutathione concentrations were higher. There was significant colinearity between plasma saturated FAs, indicative of their metabolic relationships. Higher levels of the saturated FAs C18:0 and C24:0 were associated with lower TGF-β1 concentrations, and higher C16:0 were associated with higher TNF-α concentrations. We further examined effects of the aging FA profile on monocyte polarization and metabolism in THP1 monocytes. Monocytes preincubated with C16:0 increased secretion of pro-inflammatory cytokines in response to phorbol myristate acetate-induced differentiation through ceramide-dependent inhibition of PPARγ activity. Conversely, C18:1 primed a pro-resolving macrophage which was PPARγ dependent and ceramide dependent and which required oxidative phosphorylation. These data suggest that a midlife adult FA profile impairs the switch from proinflammatory to lower energy, requiring anti-inflammatory macrophages through metabolic reprogramming.
Resumo:
Differences in lipid metabolism associate with age-related disease development and lifespan. Inflammation is a common link between metabolic dysregulation and aging. Saturated fatty acids (FAs) initiate pro-inflammatory signalling from many cells including monocytes; however, no existing studies have quantified age-associated changes in individual FAs in relation to inflammatory phenotype. Therefore, we have determined the plasma concentrations of distinct FAs by gas chromatography in 26 healthy younger individuals (age < 30 years) and 21 healthy FA individuals (age > 50 years). Linear mixed models were used to explore the association between circulating FAs, age and cytokines. We showed that plasma saturated, poly- and mono-unsaturated FAs increase with age. Circulating TNF-α and IL-6 concentrations increased with age, whereas IL-10 and TGF-β1 concentrations decreased. Oxidation of MitoSOX Red was higher in leucocytes from FA adults, and plasma oxidized glutathione concentrations were higher. There was significant colinearity between plasma saturated FAs, indicative of their metabolic relationships. Higher levels of the saturated FAs C18:0 and C24:0 were associated with lower TGF-β1 concentrations, and higher C16:0 were associated with higher TNF-α concentrations. We further examined effects of the aging FA profile on monocyte polarization and metabolism in THP1 monocytes. Monocytes preincubated with C16:0 increased secretion of pro-inflammatory cytokines in response to phorbol myristate acetate-induced differentiation through ceramide-dependent inhibition of PPARγ activity. Conversely, C18:1 primed a pro-resolving macrophage which was PPARγ dependent and ceramide dependent and which required oxidative phosphorylation. These data suggest that a midlife adult FA profile impairs the switch from proinflammatory to lower energy, requiring anti-inflammatory macrophages through metabolic reprogramming.
Resumo:
This paper is based on the novel use of a very high fidelity decimation filter chain for Electrocardiogram (ECG) signal acquisition and data conversion. The multiplier-free and multi-stage structure of the proposed filters lower the power dissipation while minimizing the circuit area which are crucial design constraints to the wireless noninvasive wearable health monitoring products due to the scarce operational resources in their electronic implementation. The decimation ratio of the presented filter is 128, working in tandem with a 1-bit 3rd order Sigma Delta (ΣΔ) modulator which achieves 0.04 dB passband ripples and -74 dB stopband attenuation. The work reported here investigates the non-linear phase effects of the proposed decimation filters on the ECG signal by carrying out a comparative study after phase correction. It concludes that the enhanced phase linearity is not crucial for ECG acquisition and data conversion applications since the signal distortion of the acquired signal, due to phase non-linearity, is insignificant for both original and phase compensated filters. To the best of the authors’ knowledge, being free of signal distortion is essential as this might lead to misdiagnosis as stated in the state of the art. This article demonstrates that with their minimal power consumption and minimal signal distortion features, the proposed decimation filters can effectively be employed in biosignal data processing units.
Resumo:
Thesis (Master's)--University of Washington, 2016-08
Resumo:
We analyze a real data set pertaining to reindeer fecal pellet-group counts obtained from a survey conducted in a forest area in northern Sweden. In the data set, over 70% of counts are zeros, and there is high spatial correlation. We use conditionally autoregressive random effects for modeling of spatial correlation in a Poisson generalized linear mixed model (GLMM), quasi-Poisson hierarchical generalized linear model (HGLM), zero-inflated Poisson (ZIP), and hurdle models. The quasi-Poisson HGLM allows for both under- and overdispersion with excessive zeros, while the ZIP and hurdle models allow only for overdispersion. In analyzing the real data set, we see that the quasi-Poisson HGLMs can perform better than the other commonly used models, for example, ordinary Poisson HGLMs, spatial ZIP, and spatial hurdle models, and that the underdispersed Poisson HGLMs with spatial correlation fit the reindeer data best. We develop R codes for fitting these models using a unified algorithm for the HGLMs. Spatial count response with an extremely high proportion of zeros, and underdispersion can be successfully modeled using the quasi-Poisson HGLM with spatial random effects.