16 resultados para Linear Mixed Integer Multicriteria Optimization
em Dalarna University College Electronic Archive
                                
Resumo:
This paper presents a two-step pseudo likelihood estimation technique for generalized linear mixed models with the random effects being correlated between groups. The core idea is to deal with the intractable integrals in the likelihood function by multivariate Taylor's approximation. The accuracy of the estimation technique is assessed in a Monte-Carlo study. An application of it with a binary response variable is presented using a real data set on credit defaults from two Swedish banks. Thanks to the use of two-step estimation technique, the proposed algorithm outperforms conventional pseudo likelihood algorithms in terms of computational time.
                                
Resumo:
This paper presents the techniques of likelihood prediction for the generalized linear mixed models. Methods of likelihood prediction is explained through a series of examples; from a classical one to more complicated ones. The examples show, in simple cases, that the likelihood prediction (LP) coincides with already known best frequentist practice such as the best linear unbiased predictor. The paper outlines a way to deal with the covariate uncertainty while producing predictive inference. Using a Poisson error-in-variable generalized linear model, it has been shown that in complicated cases LP produces better results than already know methods.
                                
Resumo:
Generalized linear mixed models are flexible tools for modeling non-normal data and are useful for accommodating overdispersion in Poisson regression models with random effects. Their main difficulty resides in the parameter estimation because there is no analytic solution for the maximization of the marginal likelihood. Many methods have been proposed for this purpose and many of them are implemented in software packages. The purpose of this study is to compare the performance of three different statistical principles - marginal likelihood, extended likelihood, Bayesian analysis-via simulation studies. Real data on contact wrestling are used for illustration.
                                
Resumo:
We present the hglm package for fitting hierarchical generalized linear models. It can be used for linear mixed models and generalized linear mixed models with random effects for a variety of links and a variety of distributions for both the outcomes and the random effects. Fixed effects can also be fitted in the dispersion part of the model.
                                
Resumo:
Background: The sensitivity to microenvironmental changes varies among animals and may be under genetic control. It is essential to take this element into account when aiming at breeding robust farm animals. Here, linear mixed models with genetic effects in the residual variance part of the model can be used. Such models have previously been fitted using EM and MCMC algorithms. Results: We propose the use of double hierarchical generalized linear models (DHGLM), where the squared residuals are assumed to be gamma distributed and the residual variance is fitted using a generalized linear model. The algorithm iterates between two sets of mixed model equations, one on the level of observations and one on the level of variances. The method was validated using simulations and also by re-analyzing a data set on pig litter size that was previously analyzed using a Bayesian approach. The pig litter size data contained 10,060 records from 4,149 sows. The DHGLM was implemented using the ASReml software and the algorithm converged within three minutes on a Linux server. The estimates were similar to those previously obtained using Bayesian methodology, especially the variance components in the residual variance part of the model. Conclusions: We have shown that variance components in the residual variance part of a linear mixed model can be estimated using a DHGLM approach. The method enables analyses of animal models with large numbers of observations. An important future development of the DHGLM methodology is to include the genetic correlation between the random effects in the mean and residual variance parts of the model as a parameter of the DHGLM.
                                
Resumo:
Objective To investigate if a home environment test battery can be used to measure effects of Parkinson’s disease (PD) treatment intervention and disease progression. Background Seventy-seven patients diagnosed with advanced PD were recruited in an open longitudinal 36-month study at 10 clinics in Sweden and Norway; 40 of them were treated with levodopa-carbidopa intestinal gel (LCIG) and 37 patients were candidates for switching from oral PD treatment to LCIG. They utilized a mobile device test battery, consisting of self-assessments of symptoms and objective measures of motor function through a set of fine motor tests (tapping and spiral drawings), in their homes. Both the LCIG-naïve and LCIG-non-naïve patients used the test battery four times per day during week-long test periods. Methods Assessments The LCIG-naïve patients used the test battery at baseline (before LCIG), month 0 (first visit; at least 3 months after intraduodenal LCIG), and thereafter quarterly for the first year and biannually for the second and third years. The LCIG-non-naïve patients used the test battery from the first visit, i.e. month 0. Out of the 77 patients, only 65 utilized the test battery; 35 were LCIG-non-naïve and 30 LCIG-naïve. In 20 of the LCIG-naïve patients, assessments with the test battery were available during oral treatment and at least one test period after having started infusion treatment. Three LCIG-naïve patients did not use the test battery at baseline but had at least one test period of assessments thereafter. Hence, n=23 in the LCIG-naïve group. In total, symptom assessments in the full sample (including both patient groups) were collected during 379 test periods and 10079 test occasions. For 369 of these test periods, clinical assessments including UPDRS and PDQ-39 were performed in afternoons at the start of the test periods. The repeated measurements of the test battery were processed and summarized into scores representing patients’ symptom severities over a test period, using statistical methods. Six conceptual dimensions were defined; four subjectively-reported: ‘walking’, ‘satisfied’, ‘dyskinesia’, and ‘off’ and two objectively-measured: ‘tapping’ and ‘spiral’. In addition, an ‘overall test score’ (OTS) was defined to represent the global health condition of the patient during a test period. Statistical methods Change in the test battery scores over time, that is at baseline and follow-up test periods, was assessed with linear mixed-effects models with patient ID as a random effect and test period as a fixed effect of interest. The within-patient variability of OTS was assessed using intra-class correlation coefficient (ICC), for the two patient groups. Correlations between clinical rating scores and test battery scores were assessed using Spearman’s rank correlations (rho). Results In LCIG-naïve patients, mean OTS compared to baseline was significantly improved from the first test period on LCIG treatment until month 24. However, there were no significant changes in mean OTS scores of LCIG-non-naïve patients, except for worse mean OTS at month 36 (p<0.01, n=16). The mean scores of all subjectively-reported dimensions improved significantly throughout the course of the study, except ‘walking’ at month 36 (p=0.41, n=4). However, there were no significant differences in mean scores of objectively-measured dimensions between baseline and other test periods, except improved ‘tapping’ at month 6 and month 36, and ‘spiral’ at month 3 (p<0.05). The LCIG-naïve patients had a higher within-subject variability in their OTS scores (ICC=0.67) compared to LCIG-non-naïve patients (ICC=0.71). The OTS correlated adequately with total UPDRS (rho=0.59) and total PDQ-39 (rho=0.59). Conclusions In this 3-year follow-up study of advanced PD patients treated with LCIG we found that it is possible to monitor PD progression over time using a home environment test battery. The significant improvements in the mean OTS scores indicate that the test battery is able to measure functional improvement with LCIG sustained over at least 24 months.
                                
Resumo:
This thesis develops and evaluates statistical methods for different types of genetic analyses, including quantitative trait loci (QTL) analysis, genome-wide association study (GWAS), and genomic evaluation. The main contribution of the thesis is to provide novel insights in modeling genetic variance, especially via random effects models. In variance component QTL analysis, a full likelihood model accounting for uncertainty in the identity-by-descent (IBD) matrix was developed. It was found to be able to correctly adjust the bias in genetic variance component estimation and gain power in QTL mapping in terms of precision. Double hierarchical generalized linear models, and a non-iterative simplified version, were implemented and applied to fit data of an entire genome. These whole genome models were shown to have good performance in both QTL mapping and genomic prediction. A re-analysis of a publicly available GWAS data set identified significant loci in Arabidopsis that control phenotypic variance instead of mean, which validated the idea of variance-controlling genes. The works in the thesis are accompanied by R packages available online, including a general statistical tool for fitting random effects models (hglm), an efficient generalized ridge regression for high-dimensional data (bigRR), a double-layer mixed model for genomic data analysis (iQTL), a stochastic IBD matrix calculator (MCIBD), a computational interface for QTL mapping (qtl.outbred), and a GWAS analysis tool for mapping variance-controlling loci (vGWAS).
                                
Resumo:
OBJECTIVES: To develop a method for objective assessment of fine motor timing variability in Parkinson’s disease (PD) patients, using digital spiral data gathered by a touch screen device. BACKGROUND: A retrospective analysis was conducted on data from 105 subjects including65 patients with advanced PD (group A), 15 intermediate patients experiencing motor fluctuations (group I), 15 early stage patients (group S), and 10 healthy elderly subjects (HE) were examined. The subjects were asked to perform repeated upper limb motor tasks by tracing a pre-drawn Archimedes spiral as shown on the screen of the device. The spiral tracing test was performed using an ergonomic pen stylus, using dominant hand. The test was repeated three times per test occasion and the subjects were instructed to complete it within 10 seconds. Digital spiral data including stylus position (x-ycoordinates) and timestamps (milliseconds) were collected and used in subsequent analysis. The total number of observations with the test battery were as follows: Swedish group (n=10079), Italian I group (n=822), Italian S group (n = 811), and HE (n=299). METHODS: The raw spiral data were processed with three data processing methods. To quantify motor timing variability during spiral drawing tasks Approximate Entropy (APEN) method was applied on digitized spiral data. APEN is designed to capture the amount of irregularity or complexity in time series. APEN requires determination of two parameters, namely, the window size and similarity measure. In our work and after experimentation, window size was set to 4 and similarity measure to 0.2 (20% of the standard deviation of the time series). The final score obtained by APEN was normalized by total drawing completion time and used in subsequent analysis. The score generated by this method is hence on denoted APEN. In addition, two more methods were applied on digital spiral data and their scores were used in subsequent analysis. The first method was based on Digital Wavelet Transform and Principal Component Analysis and generated a score representing spiral drawing impairment. The score generated by this method is hence on denoted WAV. The second method was based on standard deviation of frequency filtered drawing velocity. The score generated by this method is hence on denoted SDDV. Linear mixed-effects (LME) models were used to evaluate mean differences of the spiral scores of the three methods across the four subject groups. Test-retest reliability of the three scores was assessed after taking mean of the three possible correlations (Spearman’s rank coefficients) between the three test trials. Internal consistency of the methods was assessed by calculating correlations between their scores. RESULTS: When comparing mean spiral scores between the four subject groups, the APEN scores were different between HE subjects and three patient groups (P=0.626 for S group with 9.9% mean value difference, P=0.089 for I group with 30.2%, and P=0.0019 for A group with 44.1%). However, there were no significant differences in mean scores of the other two methods, except for the WAV between the HE and A groups (P<0.001). WAV and SDDV were highly and significantly correlated to each other with a coefficient of 0.69. However, APEN was not correlated to neither WAV nor SDDV with coefficients of 0.11 and 0.12, respectively. Test-retest reliability coefficients of the three scores were as follows: APEN (0.9), WAV(0.83) and SD-DV (0.55). CONCLUSIONS: The results show that the digital spiral analysis-based objective APEN measure is able to significantly differentiate the healthy subjects from patients at advanced level. In contrast to the other two methods (WAV and SDDV) that are designed to quantify dyskinesias (over-medications), this method can be useful for characterizing Off symptoms in PD. The APEN was not correlated to none of the other two methods indicating that it measures a different construct of upper limb motor function in PD patients than WAV and SDDV. The APEN also had a better test-retest reliability indicating that it is more stable and consistent over time than WAV and SDDV.
                                
Resumo:
BACKGROUND: Misoprostol is established for the treatment of incomplete abortion but has not been systematically assessed when provided by midwives at district level in a low-resource setting. We investigated the effectiveness and safety of midwives diagnosing and treating incomplete abortion with misoprostol, compared with physicians. METHODS: We did a multicentre randomised controlled equivalence trial at district level at six facilities in Uganda. Eligibility criteria were women with signs of incomplete abortion. We randomly allocated women with first-trimester incomplete abortion to clinical assessment and treatment with misoprostol either by a physician or a midwife. The randomisation (1:1) was done in blocks of 12 and was stratified for study site. Primary outcome was complete abortion not needing surgical intervention within 14-28 days after initial treatment. The study was not masked. Analysis of the primary outcome was done on the per-protocol population with a generalised linear-mixed effects model. The predefined equivalence range was -4% to 4%. The trial was registered at ClinicalTrials.gov, number NCT01844024. FINDINGS: From April 30, 2013, to July 21, 2014, 1108 women were assessed for eligibility. 1010 women were randomly assigned to each group (506 to midwife group and 504 to physician group). 955 women (472 in the midwife group and 483 in the physician group) were included in the per-protocol analysis. 452 (95·8%) of women in the midwife group had complete abortion and 467 (96·7%) in the physician group. The model-based risk difference for midwife versus physician group was -0·8% (95% CI -2·9 to 1·4), falling within the predefined equivalence range (-4% to 4%). The overall proportion of women with incomplete abortion was 3·8% (36/955), similarly distributed between the two groups (4·2% [20/472] in the midwife group, 3·3% [16/483] in the physician group). No serious adverse events were recorded. INTERPRETATION: Diagnosis and treatment of incomplete abortion with misoprostol by midwives is equally safe and effective as when provided by physicians, in a low-resource setting. Scaling up midwives' involvement in treatment of incomplete abortion with misoprostol at district level would increase access to safe post-abortion care. FUNDING: The Swedish Research Council, Karolinska Institutet, and Dalarna University.
                                
Resumo:
OBJECTIVE: This study aimed to assess women´s acceptability of diagnosis and treatment of incomplete abortion with misoprostol by midwives, compared with physicians. METHODS: This was an analysis of secondary outcomes from a multi-centre randomized controlled equivalence trial at district level in Uganda. Women with first trimester incomplete abortion were randomly allocated to clinical assessment and treatment with misoprostol by a physician or a midwife. The randomisation (1:1) was done in blocks of 12 and stratified for health care facility. Acceptability was measured in expectations and satisfaction at a follow up visit 14-28 days following treatment. Analysis of women's overall acceptability was done using a generalized linear mixed-effects model with an equivalence range of -4% to 4%. The study was not masked. The trial is registered at ClinicalTrials.org, NCT 01844024. RESULTS: From April 2013 to June 2014, 1108 women were assessed for eligibility of which 1010 were randomized (506 to midwife and 504 to physician). 953 women were successfully followed up and included in the acceptability analysis. 95% (904) of the participants found the treatment satisfactory and overall acceptability was found to be equivalent between the two study groups. Treatment failure, not feeling calm and safe following treatment, experiencing severe abdominal pain or heavy bleeding following treatment, were significantly associated with non-satisfaction. No serious adverse events were recorded. CONCLUSIONS: Treatment of incomplete abortion with misoprostol by midwives and physician was highly, and equally, acceptable to women. TRIAL REGISTRATION: ClinicalTrials.gov NCT01844024.
                                
Resumo:
BACKGROUND: Annually, 2.8 million neonatal deaths occur worldwide, despite the fact that three-quarters of them could be prevented if available evidence-based interventions were used. Facilitation of community groups has been recognized as a promising method to translate knowledge into practice. In northern Vietnam, the Neonatal Health - Knowledge Into Practice trial evaluated facilitation of community groups (2008-2011) and succeeded in reducing the neonatal mortality rate (adjusted odds ratio, 0.51; 95 % confidence interval 0.30-0.89). The aim of this paper is to report on the process (implementation and mechanism of impact) of this intervention. METHODS: Process data were excerpted from diary information from meetings with facilitators and intervention groups, and from supervisor records of monthly meetings with facilitators. Data were analyzed using descriptive statistics. An evaluation including attributes and skills of facilitators (e.g., group management, communication, and commitment) was performed at the end of the intervention using a six-item instrument. Odds ratios were analyzed, adjusted for cluster randomization using general linear mixed models. RESULTS: To ensure eight active facilitators over 3 years, 11 Women's Union representatives were recruited and trained. Of the 44 intervention groups, composed of health staff and commune stakeholders, 43 completed their activities until the end of the study. In total, 95 % (n = 1508) of the intended monthly meetings with an intervention group and a facilitator were conducted. The overall attendance of intervention group members was 86 %. The groups identified 32 unique problems and implemented 39 unique actions. The identified problems targeted health issues concerning both women and neonates. Actions implemented were mainly communication activities. Communes supported by a group with a facilitator who was rated high on attributes and skills (n = 27) had lower odds of neonatal mortality (odds ratio, 0.37; 95 % confidence interval, 0.19-0.73) than control communes (n = 46). CONCLUSIONS: This evaluation identified several factors that might have influenced the outcomes of the trial: continuity of intervention groups' work, adequate attributes and skills of facilitators, and targeting problems along a continuum of care. Such factors are important to consider in scaling-up efforts.
                                
Resumo:
1. Genomewide association studies (GWAS) enable detailed dissections of the genetic basis for organisms' ability to adapt to a changing environment. In long-term studies of natural populations, individuals are often marked at one point in their life and then repeatedly recaptured. It is therefore essential that a method for GWAS includes the process of repeated sampling. In a GWAS, the effects of thousands of single-nucleotide polymorphisms (SNPs) need to be fitted and any model development is constrained by the computational requirements. A method is therefore required that can fit a highly hierarchical model and at the same time is computationally fast enough to be useful. 2. Our method fits fixed SNP effects in a linear mixed model that can include both random polygenic effects and permanent environmental effects. In this way, the model can correct for population structure and model repeated measures. The covariance structure of the linear mixed model is first estimated and subsequently used in a generalized least squares setting to fit the SNP effects. The method was evaluated in a simulation study based on observed genotypes from a long-term study of collared flycatchers in Sweden. 3. The method we present here was successful in estimating permanent environmental effects from simulated repeated measures data. Additionally, we found that especially for variable phenotypes having large variation between years, the repeated measurements model has a substantial increase in power compared to a model using average phenotypes as a response. 4. The method is available in the R package RepeatABEL. It increases the power in GWAS having repeated measures, especially for long-term studies of natural populations, and the R implementation is expected to facilitate modelling of longitudinal data for studies of both animal and human populations.
                                
Resumo:
We analyze a real data set pertaining to reindeer fecal pellet-group counts obtained from a survey conducted in a forest area in northern Sweden. In the data set, over 70% of counts are zeros, and there is high spatial correlation. We use conditionally autoregressive random effects for modeling of spatial correlation in a Poisson generalized linear mixed model (GLMM), quasi-Poisson hierarchical generalized linear model (HGLM), zero-inflated Poisson (ZIP), and hurdle models. The quasi-Poisson HGLM allows for both under- and overdispersion with excessive zeros, while the ZIP and hurdle models allow only for overdispersion. In analyzing the real data set, we see that the quasi-Poisson HGLMs can perform better than the other commonly used models, for example, ordinary Poisson HGLMs, spatial ZIP, and spatial hurdle models, and that the underdispersed Poisson HGLMs with spatial correlation fit the reindeer data best. We develop R codes for fitting these models using a unified algorithm for the HGLMs. Spatial count response with an extremely high proportion of zeros, and underdispersion can be successfully modeled using the quasi-Poisson HGLM with spatial random effects.
                                
Resumo:
Maintenance of transport infrastructure assets is widely advocated as the key in minimizing current and future costs of the transportation network. While effective maintenance decisions are often a result of engineering skills and practical knowledge, efficient decisions must also account for the net result over an asset's life-cycle. One essential aspect in the long term perspective of transport infrastructure maintenance is to proactively estimate maintenance needs. In dealing with immediate maintenance actions, support tools that can prioritize potential maintenance candidates are important to obtain an efficient maintenance strategy. This dissertation consists of five individual research papers presenting a microdata analysis approach to transport infrastructure maintenance. Microdata analysis is a multidisciplinary field in which large quantities of data is collected, analyzed, and interpreted to improve decision-making. Increased access to transport infrastructure data enables a deeper understanding of causal effects and a possibility to make predictions of future outcomes. The microdata analysis approach covers the complete process from data collection to actual decisions and is therefore well suited for the task of improving efficiency in transport infrastructure maintenance. Statistical modeling was the selected analysis method in this dissertation and provided solutions to the different problems presented in each of the five papers. In Paper I, a time-to-event model was used to estimate remaining road pavement lifetimes in Sweden. In Paper II, an extension of the model in Paper I assessed the impact of latent variables on road lifetimes; displaying the sections in a road network that are weaker due to e.g. subsoil conditions or undetected heavy traffic. The study in Paper III incorporated a probabilistic parametric distribution as a representation of road lifetimes into an equation for the marginal cost of road wear. Differentiated road wear marginal costs for heavy and light vehicles are an important information basis for decisions regarding vehicle miles traveled (VMT) taxation policies. In Paper IV, a distribution based clustering method was used to distinguish between road segments that are deteriorating and road segments that have a stationary road condition. Within railway networks, temporary speed restrictions are often imposed because of maintenance and must be addressed in order to keep punctuality. The study in Paper V evaluated the empirical effect on running time of speed restrictions on a Norwegian railway line using a generalized linear mixed model.
                                
Resumo:
The subgradient optimization method is a simple and flexible linear programming iterative algorithm. It is much simpler than Newton's method and can be applied to a wider variety of problems. It also converges when the objective function is non-differentiable. Since an efficient algorithm will not only produce a good solution but also take less computing time, we always prefer a simpler algorithm with high quality. In this study a series of step size parameters in the subgradient equation is studied. The performance is compared for a general piecewise function and a specific p-median problem. We examine how the quality of solution changes by setting five forms of step size parameter.
 
                    