910 resultados para Negative Binomial Regression Model (NBRM)


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The identification of subjects at high risk for Alzheimer’s disease is important for prognosis and early intervention. We investigated the polygenic architecture of Alzheimer’s disease and the accuracy of Alzheimer’s disease prediction models, including and excluding the polygenic component in the model. This study used genotype data from the powerful dataset comprising 17 008 cases and 37 154 controls obtained from the International Genomics of Alzheimer’s Project (IGAP). Polygenic score analysis tested whether the alleles identified to associate with disease in one sample set were significantly enriched in the cases relative to the controls in an independent sample. The disease prediction accuracy was investigated in a subset of the IGAP data, a sample of 3049 cases and 1554 controls (for whom APOE genotype data were available) by means of sensitivity, specificity, area under the receiver operating characteristic curve (AUC) and positive and negative predictive values. We observed significant evidence for a polygenic component enriched in Alzheimer’s disease (P = 4.9 × 10−26). This enrichment remained significant after APOE and other genome-wide associated regions were excluded (P = 3.4 × 10−19). The best prediction accuracy AUC = 78.2% (95% confidence interval 77–80%) was achieved by a logistic regression model with APOE, the polygenic score, sex and age as predictors. In conclusion, Alzheimer’s disease has a significant polygenic component, which has predictive utility for Alzheimer’s disease risk and could be a valuable research tool complementing experimental designs, including preventative clinical trials, stem cell selection and high/low risk clinical studies. In modelling a range of sample disease prevalences, we found that polygenic scores almost doubles case prediction from chance with increased prediction at polygenic extremes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Temporal replicate counts are often aggregated to improve model fit by reducing zero-inflation and count variability, and in the case of migration counts collected hourly throughout a migration, allows one to ignore nonindependence. However, aggregation can represent a loss of potentially useful information on the hourly or seasonal distribution of counts, which might impact our ability to estimate reliable trends. We simulated 20-year hourly raptor migration count datasets with known rate of change to test the effect of aggregating hourly counts to daily or annual totals on our ability to recover known trend. We simulated data for three types of species, to test whether results varied with species abundance or migration strategy: a commonly detected species, e.g., Northern Harrier, Circus cyaneus; a rarely detected species, e.g., Peregrine Falcon, Falco peregrinus; and a species typically counted in large aggregations with overdispersed counts, e.g., Broad-winged Hawk, Buteo platypterus. We compared accuracy and precision of estimated trends across species and count types (hourly/daily/annual) using hierarchical models that assumed a Poisson, negative binomial (NB) or zero-inflated negative binomial (ZINB) count distribution. We found little benefit of modeling zero-inflation or of modeling the hourly distribution of migration counts. For the rare species, trends analyzed using daily totals and an NB or ZINB data distribution resulted in a higher probability of detecting an accurate and precise trend. In contrast, trends of the common and overdispersed species benefited from aggregation to annual totals, and for the overdispersed species in particular, trends estimating using annual totals were more precise, and resulted in lower probabilities of estimating a trend (1) in the wrong direction, or (2) with credible intervals that excluded the true trend, as compared with hourly and daily counts.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Statistical association between a single nucleotide polymorphism (SNP) genotype and a quantitative trait in genome-wide association studies is usually assessed using a linear regression model, or, in the case of non-normally distributed trait values, using the Kruskal-Wallis test. While linear regression models assume an additive mode of inheritance via equi-distant genotype scores, Kruskal-Wallis test merely tests global differences in trait values associated with the three genotype groups. Both approaches thus exhibit suboptimal power when the underlying inheritance mode is dominant or recessive. Furthermore, these tests do not perform well in the common situations when only a few trait values are available in a rare genotype category (disbalance), or when the values associated with the three genotype categories exhibit unequal variance (variance heterogeneity). We propose a maximum test based on Marcus-type multiple contrast test for relative effect sizes. This test allows model-specific testing of either dominant, additive or recessive mode of inheritance, and it is robust against variance heterogeneity. We show how to obtain mode-specific simultaneous confidence intervals for the relative effect sizes to aid in interpreting the biological relevance of the results. Further, we discuss the use of a related all-pairwise comparisons contrast test with range preserving confidence intervals as an alternative to Kruskal-Wallis heterogeneity test. We applied the proposed maximum test to the Bogalusa Heart Study dataset, and gained a remarkable increase in the power to detect association, particularly for rare genotypes. Our simulation study also demonstrated that the proposed non-parametric tests control family-wise error rate in the presence of non-normality and variance heterogeneity contrary to the standard parametric approaches. We provide a publicly available R library nparcomp that can be used to estimate simultaneous confidence intervals or compatible multiplicity-adjusted p-values associated with the proposed maximum test.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

O presente trabalho pretende analisar a divulgação do risco nos relatórios anuais das empresas portuguesas não financeiras, com valores cotados em bolsa. No momento em que vivemos, com toda esta instabilidade, os investidores e outros stakeholders estão cada vez menos confiantes e mais exigentes. Assim, relatar informação sobre risco, começa a ser um dos meios utilizados pelas empresas para transmitir confiança e viabilidade ao exterior. Contudo, será que uma empresa que divulga sobre risco é uma empresa que se encontra totalmente sã, e que não oculta nem ofusca qualquer tipo de informação? O objetivo deste trabalho passará por apurar, se de alguma forma, os gestores se fazem valer das estratégias de impression management para ocultar ou, ofuscar os stakeholders na divulgação de informações sobre risco. Para o desenvolvimento desta investigação tivemos por base as empresas cotadas na Euronext Lisbon, para as quais foi efetuada uma análise de conteúdo do Relatório de Gestão, do Anexo e do Relatório do Governo das Sociedades, nos anos de 2007, 2010 e 2013. Aos dados recolhidos aplicou-se o modelo de regressão OLS, confirmando a hipótese do índice de compreensibilidade estar associado positivamente com a dimensão da empresa. Dos resultados obtidos concluiu-se ainda a existência de uma associação negativa entre o índice de legibilidade e a dimensão e o setor de atividade.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We introduce a new class of integer-valued self-exciting threshold models, which is based on the binomial autoregressive model of order one as introduced by McKenzie (Water Resour Bull 21:645–650, 1985. doi:10.1111/j.1752-1688.1985. tb05379.x). Basic probabilistic and statistical properties of this class of models are discussed. Moreover, parameter estimation and forecasting are addressed. Finally, the performance of these models is illustrated through a simulation study and an empirical application to a set of measle cases in Germany.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Classical regression analysis can be used to model time series. However, the assumption that model parameters are constant over time is not necessarily adapted to the data. In phytoplankton ecology, the relevance of time-varying parameter values has been shown using a dynamic linear regression model (DLRM). DLRMs, belonging to the class of Bayesian dynamic models, assume the existence of a non-observable time series of model parameters, which are estimated on-line, i.e. after each observation. The aim of this paper was to show how DLRM results could be used to explain variation of a time series of phytoplankton abundance. We applied DLRM to daily concentrations of Dinophysis cf. acuminata, determined in Antifer harbour (French coast of the English Channel), along with physical and chemical covariates (e.g. wind velocity, nutrient concentrations). A single model was built using 1989 and 1990 data, and then applied separately to each year. Equivalent static regression models were investigated for the purpose of comparison. Results showed that most of the Dinophysis cf. acuminata concentration variability was explained by the configuration of the sampling site, the wind regime and tide residual flow. Moreover, the relationships of these factors with the concentration of the microalga varied with time, a fact that could not be detected with static regression. Application of dynamic models to phytoplankton time series, especially in a monitoring context, is discussed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Objectives: To study the relationship between severity of injury of the lower limb and severity of injury of the head, thoracic, and abdominal regions in frontal-impact road traffic collisions. Methods: Consecutive hospitalised trauma patients who were involved in a frontal road traffic collision were prospectively studied over 18 months. Patients with at least one Abbreviated Injury Scale (AIS) ≥3 or AIS 2 injuries within two AIS body regions were included. Patients were divided into two groups depending on the severity of injury to the head, chest or abdomen. Low severity group had an AIS < 2 and high severity group had an AIS ≥ 2. Backward likelihood logistic regression models were used to define significant factors affecting the severity of head, chest or abdominal injuries. Results: Eighty-five patients were studied. The backward likelihood logistic regression model defining independent factors affecting severity of head injuries was highly significant (p=0.01, nagelkerke r square = 0.1) severity of lower limb injuries was the only significant factor (p=0.013) having a negative correlation with head injury (Odds ratio of 0.64 (95% CI: 0.45-0.91). Conclusion: Occupants who sustain a greater severity of injury to the lower limb in a frontal-impact collision are likely to be spared from a greater severity of head injury.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper, we aim at contributing to the new field of research that intends to bring up-to-date the tools and statistics currently used to look to the current reality given by Global Value Chains (GVC) in international trade and Foreign Direct Investment (FDI). Namely, we make use of the most recent data published by the World Input-Output Database to suggest indicators to measure the participation and net gains of countries by being a part of GVC; and use those indicators in a pooled-regression model to estimate determinants of FDI stocks in Organization for Economic Co-operation and Development (OECD)-member countries. We conclude that one of the measures proposed proves to be statistically significant in explaining the bilateral stock of FDI in OECD countries, meaning that the higher the transnational income generated between two given countries by GVC, taken as a proxy to the participation of those countries in GVC, the higher one could expect the FDI entering those countries to be. The regression also shows the negative impact of the global financial crisis that started in 2009 in the world’s bilateral FDI stocks and, additionally, the particular and significant role played by the People’s Republic of China in determining these stocks.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tourist accommodation expenditure is a widely investigated topic as it represents a major contribution to the total tourist expenditure. The identification of the determinant factors is commonly based on supply-driven applications while little research has been made on important travel characteristics. This paper proposes a demand-driven analysis of tourist accommodation price by focusing on data generated from room bookings. The investigation focuses on modeling the relationship between key travel characteristics and the price paid to book the accommodation. To accommodate the distributional characteristics of the expenditure variable, the analysis is based on the estimation of a quantile regression model. The findings support the econometric approach used and enable the elaboration of relevant managerial implications.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This dissertation focused on the longitudinal analysis of business start-ups using three waves of data from the Kauffman Firm Survey. The first essay used the data from years 2004-2008, and examined the simultaneous relationship between a firm’s capital structure, human resource policies, and its impact on the level of innovation. The firm leverage was calculated as, debt divided by total financial resources. Index of employee well-being was determined by a set of nine dichotomous questions asked in the survey. A negative binomial fixed effects model was used to analyze the effect of employee well-being and leverage on the count data of patents and copyrights, which were used as a proxy for innovation. The paper demonstrated that employee well-being positively affects the firm's innovation, while a higher leverage ratio had a negative impact on the innovation. No significant relation was found between leverage and employee well-being. The second essay used the data from years 2004-2009, and inquired whether a higher entrepreneurial speed of learning is desirable, and whether there is a linkage between the speed of learning and growth rate of the firm. The change in the speed of learning was measured using a pooled OLS estimator in repeated cross-sections. There was evidence of a declining speed of learning over time, and it was concluded that a higher speed of learning is not necessarily a good thing, because speed of learning is contingent on the entrepreneur's initial knowledge, and the precision of the signals he receives from the market. Also, there was no reason to expect speed of learning to be related to the growth of the firm in one direction over another. The third essay used the data from years 2004-2010, and determined the timing of diversification activities by the business start-ups. It captured when a start-up diversified for the first time, and explored the association between an early diversification strategy adopted by a firm, and its survival rate. A semi-parametric Cox proportional hazard model was used to examine the survival pattern. The results demonstrated that firms diversifying at an early stage in their lives show a higher survival rate; however, this effect fades over time.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The interactions between host individual, host population, and environmental factors modulate parasite abundance in a given host population. Since adult exophilic ticks are highly aggregated in red deer (Cervus elaphus) and this ungulate exhibits significant sexual size dimorphism, life history traits and segregation, we hypothesized that tick parasitism on males and hinds would be differentially influenced by each of these factors. To test the hypothesis, ticks from 306 red deer-182 males and 124 females-were collected during 7 years in a red deer population in south-central Spain. By using generalized linear models, with a negative binomial error distribution and a logarithmic link function, we modeled tick abundance on deer with 20 potential predictors. Three models were developed: one for red deer males, another for hinds, and one combining data for males and females and including "sex" as factor. Our rationale was that if tick burdens on males and hinds relate to the explanatory factors in a differential way, it is not possible to precisely and accurately predict the tick burden on one sex using the model fitted on the other sex, or with the model that combines data from both sexes. Our results showed that deer males were the primary target for ticks, the weight of each factor differed between sexes, and each sex specific model was not able to accurately predict burdens on the animals of the other sex. That is, results support for sex-biased differences. The higher weight of host individual and population factors in the model for males show that intrinsic deer factors more strongly explain tick burden than environmental host-seeking tick abundance. In contrast, environmental variables predominated in the models explaining tick burdens in hinds.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In twenty years almost one in four Canadians will be over the age of 65. How successfully these people age will influence their quality of life and contribute to their physical health. Illness and disease are frequent components of aging; however, ‘successful aging’ research normally excludes people with illness. Older people living with illness, even life threatening illness, often self-report a good quality of life and continue to experience psychological well-being and a significant engagement in social life. This dissertation uses a three manuscript approach to examine successful aging among people with illness. The first manuscript employed a scoping review to examine the models used in recent successful aging research, compiling the most frequently used constructs which included: engagement, optimism and/or positive attitude, resilience, spirituality and/or religiosity, self-efficacy and/or self-esteem, and gerotranscendence. The second manuscript utilized data gathered via interviews (online or in person) with people over the age of 65 years living with illness. The majority of these participants reported success in aging; only resilience was predictive in the binomial regression analysis. The third manuscript examined the role of social determinants of health on successful aging. The analysis revealed that disengagement from community-activities showed a significant association with higher self-reported successful aging. The best fitting model for predicting rate of successful aging with illness was a linear combination of participants’ ageism score and community activity score, while controlling for gender and age. When considered together, the results from these three manuscripts suggest that successful aging can be experienced by older adults aging with illness. And that, among these older adults, resilience, community interaction and ageism may all play a part in determining the extent to which aging is experienced as successful. Recommendations include the suggestion that we embrace the idea that people with illness can self-define as successful agers. Further, since some of the associated constructs (e.g. resilience) can be fostered, successful aging could be bolstered by education or programs to build skills along with the usual treatment modalities for the illnesses that co-exist.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study focuses on multiple linear regression models relating six climate indices (temperature humidity THI, environmental stress ESI, equivalent temperature index ETI, heat load HLI, modified HLI (HLI new), and respiratory rate predictor RRP) with three main components of cow’s milk (yield, fat, and protein) for cows in Iran. The least absolute shrinkage selection operator (LASSO) and the Akaike information criterion (AIC) techniques are applied to select the best model for milk predictands with the smallest number of climate predictors. Uncertainty estimation is employed by applying bootstrapping through resampling. Cross validation is used to avoid over-fitting. Climatic parameters are calculated from the NASA-MERRA global atmospheric reanalysis. Milk data for the months from April to September, 2002 to 2010 are used. The best linear regression models are found in spring between milk yield as the predictand and THI, ESI, ETI, HLI, and RRP as predictors with p-value < 0.001 and R2 (0.50, 0.49) respectively. In summer, milk yield with independent variables of THI, ETI, and ESI show the highest relation (p-value < 0.001) with R2 (0.69). For fat and protein the results are only marginal. This method is suggested for the impact studies of climate variability/change on agriculture and food science fields when short-time series or data with large uncertainty are available.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Logistic regression is a statistical tool widely used for predicting species’ potential distributions starting from presence/absence data and a set of independent variables. However, logistic regression equations compute probability values based not only on the values of the predictor variables but also on the relative proportion of presences and absences in the dataset, which does not adequately describe the environmental favourability for or against species presence. A few strategies have been used to circumvent this, but they usually imply an alteration of the original data or the discarding of potentially valuable information. We propose a way to obtain from logistic regression an environmental favourability function whose results are not affected by an uneven proportion of presences and absences. We tested the method on the distribution of virtual species in an imaginary territory. The favourability models yielded similar values regardless of the variation in the presence/absence ratio. We also illustrate with the example of the Pyrenean desman’s (Galemys pyrenaicus) distribution in Spain. The favourability model yielded more realistic potential distribution maps than the logistic regression model. Favourability values can be regarded as the degree of membership of the fuzzy set of sites whose environmental conditions are favourable to the species, which enables applying the rules of fuzzy logic to distribution modelling. They also allow for direct comparisons between models for species with different presence/absence ratios in the study area. This makes themmore useful to estimate the conservation value of areas, to design ecological corridors, or to select appropriate areas for species reintroductions.