856 results for Random regression models


Relevance: 90.00%

Abstract:

We consider quantile regression models and investigate the induced smoothing method for obtaining the covariance matrix of the regression parameter estimates. We show that the difference between the smoothed and unsmoothed estimating functions in quantile regression is negligible. Detailed yet simple computational algorithms for calculating the asymptotic covariance are provided. Extensive simulation studies indicate that the proposed method performs very well. We also illustrate the algorithm by analyzing the rainfall–runoff data from Murray Upland, Australia.
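As a rough sketch of what "induced smoothing" usually means in this setting, the display below assumes the standard construction in which the non-smooth estimating function is averaged over a Gaussian perturbation of the parameter. The symbols $\tau$ (quantile level), $x_i$, $y_i$, $\Gamma$ (a working covariance matrix), $Z$ and $\sigma_i$ are notation introduced here for illustration; the abstract itself does not give formulas, so this should not be read as the authors' exact formulation.

```latex
% Unsmoothed estimating function for the tau-th regression quantile
S_n(\beta) = \sum_{i=1}^{n} x_i \left\{ \tau - I\!\left( y_i - x_i^{\top}\beta \le 0 \right) \right\}

% Induced smoothing: average over beta + Gamma^{1/2} Z with Z ~ N(0, I_p)
\tilde S_n(\beta) = E_Z\!\left[ S_n\!\left( \beta + \Gamma^{1/2} Z \right) \right]
  = \sum_{i=1}^{n} x_i \left\{ \tau - \Phi\!\left( \frac{x_i^{\top}\beta - y_i}{\sigma_i} \right) \right\},
\qquad \sigma_i = \sqrt{ x_i^{\top} \Gamma \, x_i }
```

Under these assumptions the indicator is replaced by a normal distribution function, so $\tilde S_n$ is differentiable in $\beta$ and its derivative can feed a sandwich-type covariance estimate without estimating the error density directly.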

Relevance: 90.00%

Abstract:

The need for a house rental model in Townsville, Australia is addressed. Models developed for predicting house rental levels are described. An analytical model is built upon a priori selected variables and parameters of rental levels. Regression models are generated to provide a comparison to the analytical model. Issues in model development and performance evaluation are discussed. A comparison of the models indicates that the analytical model performs better than the regression models.

Relevance: 90.00%

Abstract:

This thesis developed semi-parametric regression models for estimating the spatio-temporal distribution of outdoor airborne ultrafine particle number concentration (PNC). The models developed incorporate multivariate penalised splines and random walks and autoregressive errors in order to estimate non-linear functions of space, time and other covariates. The models were applied to data from the "Ultrafine Particles from Traffic Emissions and Child" project in Brisbane, Australia, and to longitudinal measurements of air quality in Helsinki, Finland. The spline and random walk aspects of the models reveal how the daily trend in PNC changes over the year in Helsinki and the similarities and differences in the daily and weekly trends across multiple primary schools in Brisbane. Midday peaks in PNC in Brisbane locations are attributed to new particle formation events at the Port of Brisbane and Brisbane Airport.

Relevance: 90.00%

Abstract:

This paper presents an event-based failure model to predict the number of failures that occur in water distribution assets. Often, such models have been based on analysis of historical failure data combined with pipe characteristics and environmental conditions. In this paper weather data have been added to the model to take into account the commonly observed seasonal variation of the failure rate. The theoretical basis of existing logistic regression models is briefly described in this paper, along with the refinements made to the model for inclusion of seasonal variation of weather. The performance of these refinements is tested using data from two Australian water authorities.

Relevance: 90.00%

Abstract:

Pedestrian crashes are one of the major road safety problems in developing countries, representing about 40% of total fatal crashes in low-income countries. Despite the fact that many pedestrian crashes in these countries occur at unsignalized intersections such as roundabouts, studies focussing on this issue are limited, which represents a critical research gap. The objective of this study is to develop safety performance functions for pedestrian crashes at modern roundabouts to identify significant roadway geometric, traffic and land use characteristics related to pedestrian safety. To establish the relationship between pedestrian crashes and various causal factors, detailed data including various forms of exposure, geometric and traffic characteristics, and spatial factors such as proximity to schools and proximity to drinking establishments were collected from a sample of 22 modern roundabouts in Addis Ababa, Ethiopia, representing about 56% of such roundabouts in Addis Ababa. To account for spatial correlation resulting from multiple observations at a roundabout, both the random effect Poisson (REP) and random effect Negative Binomial (RENB) regression models were estimated and compared. Model goodness of fit statistics reveal a marginally superior fit of the REP model compared to the RENB model of pedestrian crashes at roundabouts. Pedestrian crossing volume and the product of traffic volumes along major and minor roads had significant and positive associations with pedestrian crashes at roundabouts. The presence of a public transport (bus/taxi) terminal beside a roundabout is associated with increased pedestrian crashes. While the maximum gradient of an approach road is negatively associated with pedestrian safety, the provision of a raised median along an approach appears to increase pedestrian safety at roundabouts. Remedial measures are identified for combating pedestrian safety problems at roundabouts in the context of a developing country.

Relevance: 90.00%

Abstract:

Vertebral fracture risk is a heritable complex trait. The aim of this study was to identify genetic susceptibility factors for osteoporotic vertebral fractures applying a genome-wide association study (GWAS) approach. The GWAS discovery was based on the Rotterdam Study, a population-based study of elderly Dutch individuals aged >55 years, comprising 329 cases and 2666 controls with radiographic scoring (McCloskey-Kanis) and genetic data. Replication of one top-associated SNP was pursued by de-novo genotyping of 15 independent studies across Europe, the United States, and Australia and one Asian study. Radiographic vertebral fracture assessment was performed using McCloskey-Kanis or Genant semi-quantitative definitions. SNPs were analyzed in relation to vertebral fracture using logistic regression models corrected for age and sex. Fixed effects inverse variance and Han-Eskin alternative random effects meta-analyses were applied. Genome-wide significance was set at p<5×10⁻⁸. In the discovery, a SNP (rs11645938) on chromosome 16q24 was associated with the risk for vertebral fractures at p=4.6×10⁻⁸. However, the association was not significant across 5720 cases and 21,791 controls from 14 studies. Fixed-effects meta-analysis summary estimate was 1.06 (95% CI: 0.98-1.14; p=0.17), displaying a high degree of heterogeneity (I²=57%; Qhet p=0.0006). Under the Han-Eskin alternative random effects model the summary effect was significant (p=0.0005). The SNP maps to a region previously found associated with lumbar spine bone mineral density (LS-BMD) in two large meta-analyses from the GEFOS consortium. A false positive association in the GWAS discovery cannot be excluded; yet the low-powered discovery and replication settings (appropriate to identify risk effect size >1.25) may still be consistent with an effect size <1.10, more of the type expected in complex traits. Larger effort in studies with standardized phenotype definitions is needed to confirm or reject the involvement of this locus on the risk for vertebral fractures.
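The fixed-effects inverse-variance pooling and the I² heterogeneity statistic mentioned in this abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the study's analysis: the function name `fixed_effect_meta` and the per-study log odds ratios and standard errors below are hypothetical, and the Han-Eskin random effects model is not implemented here.

```python
import math


def fixed_effect_meta(estimates, std_errors):
    """Inverse-variance fixed-effect pooling of per-study estimates.

    Returns (pooled estimate, pooled standard error, Cochran's Q, I^2 in %).
    Estimates are assumed to be on an additive scale, e.g. log odds ratios.
    """
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in std_errors]
    total_w = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_w
    pooled_se = math.sqrt(1.0 / total_w)

    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, estimates))
    df = len(estimates) - 1

    # I^2: share of total variation attributed to between-study heterogeneity.
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    return pooled, pooled_se, q, i2


# Hypothetical per-study log odds ratios and standard errors (illustration only).
log_or = [0.30, 0.05, -0.10, 0.12]
se = [0.10, 0.08, 0.12, 0.09]

pooled, pooled_se, q, i2 = fixed_effect_meta(log_or, se)
ci_low = math.exp(pooled - 1.96 * pooled_se)
ci_high = math.exp(pooled + 1.96 * pooled_se)
print(f"pooled OR = {math.exp(pooled):.2f} (95% CI {ci_low:.2f}-{ci_high:.2f})")
print(f"Q = {q:.2f}, I2 = {i2:.0f}%")
```

A large I² signals that the studies disagree more than sampling error alone would explain, which is the situation in which a fixed-effects summary and a random-effects summary can lead to different conclusions.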

Relevance: 90.00%

Abstract:

Objective: Association between ankylosing spondylitis (AS) and two genes, ERAP1 and IL23R, has recently been reported in North American and British populations. The population attributable risk fraction for ERAP1 in this study was 25%, and for IL23R, 9%. Confirmation of these findings for ERAP1 in other ethnic groups has not yet been demonstrated. We sought to test the association between single nucleotide polymorphisms (SNPs) in these genes and susceptibility to AS among a Portuguese population. We also investigated the role of these genes in clinical manifestations of AS, including age of symptom onset, the Bath Ankylosing Spondylitis Disease Activity, Metrology and Functional Indices, and the modified Stoke Ankylosing Spondylitis Spinal Score. Methods: The study was conducted on 358 AS cases and 285 ethnically matched Portuguese healthy controls. AS was defined according to the modified New York Criteria. Genotyping of IL23R and ERAP1 allelic variants was carried out with TaqMan allelic discrimination assays. Association analysis was performed using the Cochran-Armitage and linear regression tests of genotypes as implemented in PLINK for dichotomous and quantitative variables respectively. A meta-analysis for Portuguese and previously published Spanish IL23R data was performed using the StatsDirect® Statistical tools, by fixed and random effects models. Results: A total of 14 nsSNP markers (8 for IL23R, 5 for ERAP1, 1 for LN-PEP) were analysed. Three markers (2 for IL23R and 1 for ERAP1) showed significant single-locus disease associations, confirming the association of these genes with AS in the Portuguese population. The most strongly associated SNP in IL23R was rs1004819 (OR=1.4, p=0.0049), and in ERAP1 was rs30187 (OR=1.26, p=0.035). The population attributable risk fractions in the Portuguese population for these SNPs are 11% and 9.7% respectively. No association was seen with any SNP in LN-PEP, which flanks ERAP1 and was associated with AS in the British population. No association was seen with clinical manifestations of AS. Conclusions: These results show that IL23R and ERAP1 genes are also associated with susceptibility to AS in the Portuguese population, and that they contribute a significant proportion of the population risk for this disease.

Relevance: 90.00%

Abstract:

Summary: High bone mineral density on routine dual energy X-ray absorptiometry (DXA) may indicate an underlying skeletal dysplasia. Two hundred fifty-eight individuals with unexplained high bone mass (HBM), 236 relatives (41% with HBM) and 58 spouses were studied. Cases could not float, had mandible enlargement, extra bone, broad frames, larger shoe sizes and increased body mass index (BMI). HBM cases may harbour an underlying genetic disorder. Introduction: High bone mineral density is a sporadic incidental finding on routine DXA scanning of apparently asymptomatic individuals. Such individuals may have an underlying skeletal dysplasia, as seen in LRP5 mutations. We aimed to characterize unexplained HBM and determine the potential for an underlying skeletal dysplasia. Methods: Two hundred fifty-eight individuals with unexplained HBM (defined as L1 Z-score ≥ +3.2 plus total hip Z-score ≥ +1.2, or total hip Z-score ≥ +3.2) were recruited from 15 UK centres, by screening 335,115 DXA scans. Unexplained HBM affected 0.181% of DXA scans. Next, 236 relatives were recruited, of whom 94 (41%) had HBM (defined as L1 Z-score + total hip Z-score ≥ +3.2). Fifty-eight spouses were also recruited together with the unaffected relatives as controls. Phenotypes of cases and controls, obtained from clinical assessment, were compared using random-effects linear and logistic regression models, clustered by family, adjusted for confounders, including age and sex. Results: Individuals with unexplained HBM had an excess of sinking when swimming (7.11 [3.65, 13.84], p < 0.001; adjusted odds ratio with 95% confidence interval shown), mandible enlargement (4.16 [2.34, 7.39], p < 0.001), extra bone at tendon/ligament insertions (2.07 [1.13, 3.78], p = 0.018) and broad frame (3.55 [2.12, 5.95], p < 0.001). HBM cases also had a larger shoe size (mean difference 0.4 [0.1, 0.7] UK sizes, p = 0.009) and increased BMI (mean difference 2.2 [1.3, 3.1] kg/m², p < 0.001). Conclusion: Individuals with unexplained HBM have an excess of clinical characteristics associated with skeletal dysplasia and their relatives are commonly affected, suggesting many may harbour an underlying genetic disorder affecting bone mass.

Relevance: 90.00%

Abstract:

Ordinal qualitative data are often collected for phenotypical measurements in plant pathology and other biological sciences. Statistical methods, such as t tests or analysis of variance, are usually used to analyze ordinal data when comparing two groups or multiple groups. However, the underlying assumptions such as normality and homogeneous variances are often violated for qualitative data. To this end, we investigated an alternative methodology, rank regression, for analyzing the ordinal data. The rank-based methods are essentially based on pairwise comparisons and, therefore, can deal with qualitative data naturally. They require neither a normality assumption nor data transformation. Apart from robustness against outliers and high efficiency, rank regression can also incorporate covariate effects in the same way as ordinary regression. By reanalyzing a data set from a wheat Fusarium crown rot study, we illustrated the use of the rank regression methodology and demonstrated that the rank regression models appear to be more appropriate and sensible for analyzing nonnormal data and data with outliers.
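To make the "pairwise comparisons" idea concrete, here is a minimal sketch of Wilcoxon-type rank regression with a single covariate. It assumes the common formulation in which the slope minimises the sum of absolute pairwise residual differences; for one covariate that minimiser is the weighted median of pairwise slopes, with weights |x_j - x_i|. The function names and the toy data are hypothetical and are not taken from the study.

```python
from itertools import combinations
from statistics import median


def weighted_median(values, weights):
    """Smallest value at which the cumulative weight reaches half the total."""
    pairs = sorted(zip(values, weights))
    half = sum(weights) / 2.0
    running = 0.0
    for value, weight in pairs:
        running += weight
        if running >= half:
            return value
    raise ValueError("no usable pairs")


def wilcoxon_rank_fit(x, y):
    """Rank-based fit of y = intercept + slope * x for one covariate.

    The slope minimises sum_{i<j} |(y_j - y_i) - slope * (x_j - x_i)|,
    i.e. it is a weighted median of pairwise slopes.
    """
    slopes, weights = [], []
    for i, j in combinations(range(len(x)), 2):
        dx = x[j] - x[i]
        if dx != 0:  # pairs with equal x carry no information on the slope
            slopes.append((y[j] - y[i]) / dx)
            weights.append(abs(dx))
    slope = weighted_median(slopes, weights)
    # The pairwise criterion does not identify the intercept;
    # the median of the residuals is one conventional choice.
    intercept = median(yi - slope * xi for xi, yi in zip(x, y))
    return slope, intercept


# Toy data on the line y = 1 + 2x, with one gross outlier in the last point.
x = [0, 1, 2, 3, 4, 5]
y = [1, 3, 5, 7, 9, 100]
slope, intercept = wilcoxon_rank_fit(x, y)
print(slope, intercept)
```

In this toy example the single outlier does not move the fitted slope away from 2, whereas an ordinary least-squares fit would be pulled strongly towards it.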

Relevance: 90.00%

Abstract:

With a growing population and fast urbanization in Australia, it is a challenging task to maintain our water quality. It is essential to develop an appropriate statistical methodology for analyzing water quality data in order to draw valid conclusions and hence provide useful advice in water management. This paper develops robust rank-based procedures for analyzing nonnormally distributed data collected over time at different sites. To take account of temporal correlations of the observations within sites, we consider the optimally combined estimating functions proposed by Wang and Zhu (Biometrika, 93:459-464, 2006), which lead to more efficient parameter estimation. Furthermore, we apply the induced smoothing method to reduce the computational burden. Smoothing leads to easy calculation of the parameter estimates and their variance-covariance matrix. Analysis of water quality data on Total Iron and Total Cyanophytes shows the differences between the traditional generalized linear mixed models and rank regression models. Our analysis also demonstrates the advantages of the rank regression models for analyzing nonnormal data.

Relevance: 90.00%

Abstract:

Environmental data usually include measurements, such as water quality data, which fall below detection limits because of limitations of the instruments or of certain analytical methods used. The fact that some responses are not detected needs to be properly taken into account in statistical analysis of such data. However, it is well known that it is challenging to analyze a data set with detection limits, and we often have to rely on the traditional parametric methods or simple imputation methods. Distributional assumptions can lead to biased inference, and justification of distributions is often not possible when the data are correlated and there is a large proportion of data below detection limits. The extent of bias is usually unknown. To draw valid conclusions and hence provide useful advice for environmental management authorities, it is essential to develop and apply an appropriate statistical methodology. This paper proposes rank-based procedures for analyzing non-normally distributed data collected at different sites over a period of time in the presence of multiple detection limits. To take account of temporal correlations within each site, we propose an optimal linear combination of estimating functions and apply the induced smoothing method to reduce the computational burden. Finally, we apply the proposed method to the water quality data collected in the Susquehanna River Basin in the United States of America, which clearly demonstrates the advantages of the rank regression models.

Relevance: 90.00%

Abstract:

We consider rank-based regression models for clustered data analysis. A weighted Wilcoxon rank method is proposed to take account of within-cluster correlations and varying cluster sizes. The asymptotic normality of the resulting estimators is established. A method to estimate the covariance of the estimators is also given, which can bypass estimation of the density function. Simulation studies are carried out to compare different estimators for a number of scenarios on the correlation structure, presence/absence of outliers and different correlation values. The proposed methods appear to perform well; in particular, the one incorporating the correlation in the weighting achieves the highest efficiency and robustness against misspecification of correlation structure and outliers. A real example is provided for illustration.

Relevance: 90.00%

Abstract:

We consider rank-based regression models for repeated measures. To account for possible within-subject correlations, we decompose the total ranks into between- and within-subject ranks and obtain two different estimators based on between- and within-subject ranks. A simple perturbation method is then introduced to generate bootstrap replicates of the estimating functions and the parameter estimates. This provides a convenient way of combining the corresponding two types of estimating functions for more efficient estimation.

Relevance: 90.00%

Abstract:

Statistical methods are often used to analyse commercial catch and effort data to provide standardised fishing effort and/or a relative index of fish abundance for input into stock assessment models. Achieving reliable results has proved difficult in Australia's Northern Prawn Fishery (NPF), due to a combination of such factors as the biological characteristics of the animals, some aspects of the fleet dynamics, and the changes in fishing technology. For this set of data, we compared four modelling approaches (linear models, mixed models, generalised estimating equations, and generalised linear models) with respect to the outcomes of the standardised fishing effort or the relative index of abundance. We also varied the number and form of vessel covariates in the models. Within a subset of data from this fishery, modelling correlation structures did not alter the conclusions from simpler statistical models. The random-effects models also yielded similar results. This is because the estimators are all consistent even if the correlation structure is mis-specified, and the data set is very large. However, the standard errors from different models differed, suggesting that different methods have different statistical efficiency. We suggest that there is value in modelling the variance function and the correlation structure, to make valid and efficient statistical inferences and gain insight into the data. We found that fishing power was separable from the indices of prawn abundance only when we offset the impact of vessel characteristics at assumed values from external sources. This may be due to the large degree of confounding within the data, and the extreme temporal changes in certain aspects of individual vessels, the fleet and the fleet dynamics.

Relevance: 90.00%

Abstract:

Partial least squares regression models on NIR spectra are often optimised (for wavelength range, mathematical pretreatment and outlier elimination) in terms of calibration terms of validation performance with reference to totally independent populations.