15 results for Imputation déterministe
in BORIS: Bern Open Repository and Information System - Bern - Switzerland
Abstract:
Sequence analysis and optimal matching are useful heuristic tools for the descriptive analysis of heterogeneous individual pathways such as educational careers, job sequences or patterns of family formation. However, to date it remains unclear how to handle the inevitable problems caused by missing values with regard to such analysis. Multiple Imputation (MI) offers a possible solution for this problem but it has not been tested in the context of sequence analysis. Against this background, we contribute to the literature by assessing the potential of MI in the context of sequence analyses using an empirical example. Methodologically, we draw upon the work of Brendan Halpin and extend it to additional types of missing value patterns. Our empirical case is a sequence analysis of panel data with substantial attrition that examines the typical patterns and the persistence of sex segregation in school-to-work transitions in Switzerland. The preliminary results indicate that MI is a valuable methodology for handling missing values due to panel mortality in the context of sequence analysis. MI is especially useful in facilitating a sound interpretation of the resulting sequence types.
Abstract:
BACKGROUND A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide association studies (GWAS) and genomic predictions. METHODS We obtained genotypes for 54 602 SNPs (single nucleotide polymorphisms) in 1077 Franches-Montagnes (FM) horses and Illumina paired-end whole-genome sequencing data for 30 FM horses and 14 Warmblood horses. After variant calling, the sequence-derived SNP genotypes (~13 million SNPs) were used for genotype imputation with the software programs Beagle, Impute2 and FImpute. RESULTS The mean imputation accuracy of FM horses using Impute2 was 92.0%. Imputation accuracy using Beagle and FImpute was 74.3% and 77.2%, respectively. In addition, for Impute2 we determined the imputation accuracy of all individual horses in the validation population, which ranged from 85.7% to 99.8%. The subsequent inclusion of Warmblood sequence data further increased the correlation between true and imputed genotypes for most horses, especially for horses with a high level of admixture. The final imputation accuracy of the horses ranged from 91.2% to 99.5%. CONCLUSIONS Using Impute2, the imputation accuracy was higher than 91% for all horses in the validation population, which indicates that direct imputation of 50k SNP-chip data to sequence level genotypes is feasible in the FM population. The individual imputation accuracy depended mainly on the applied software and the level of admixture.
Abstract:
OBJECTIVE: To explore the feasibility and psychometric properties of a self-administered version of the 24-item Geriatric Pain Measure (GPM-24-SA). DESIGN: Secondary analysis of baseline data from the Prevention in Older People-Assessment in Generalists' practices trial, an international multi-center study of a health-risk appraisal system. PARTICIPANTS: One thousand seventy-two community dwelling nondisabled older adults self-reporting pain from London, UK; Hamburg, Germany; and Solothurn, Switzerland. OUTCOME MEASURES: GPM-24-SA as part of a multidimensional Health Risk Appraisal Questionnaire including self-reported demographic and health-related information. RESULTS: Among the 1,072 subjects, 655 had complete GPM-24-SA data, 404 had
Abstract:
BACKGROUND Low-grade gliomas (LGGs) are rare brain neoplasms, with survival spanning up to a few decades. Thus, accurate evaluations on how biomarkers impact survival among patients with LGG require long-term studies on samples prospectively collected over a long period. METHODS The 210 adult LGGs collected in our databank were screened for IDH1 and IDH2 mutations (IDHmut), MGMT gene promoter methylation (MGMTmet), 1p/19q loss of heterozygosity (1p19qloh), and nuclear TP53 immunopositivity (TP53pos). Multivariate survival analyses with multiple imputation of missing data were performed using either histopathology or molecular markers. Both models were compared using Akaike's information criterion (AIC). The molecular model was reduced by stepwise model selection to filter out the most critical predictors. A third model was generated to assess for various marker combinations. RESULTS Molecular parameters were better survival predictors than histology (ΔAIC = 12.5, P< .001). Forty-five percent of studied patients died. MGMTmet was positively associated with IDHmut (P< .001). In the molecular model with marker combinations, IDHmut/MGMTmet combined status had a favorable impact on overall survival, compared with IDHwt (hazard ratio [HR] = 0.33, P< .01), and even more so the triple combination, IDHmut/MGMTmet/1p19qloh (HR = 0.18, P< .001). Furthermore, IDHmut/MGMTmet/TP53pos triple combination was a significant risk factor for malignant transformation (HR = 2.75, P< .05). CONCLUSION By integrating networks of activated molecular glioma pathways, the model based on genotype better predicts prognosis than histology and, therefore, provides a more reliable tool for standardizing future treatment strategies.
Abstract:
INTRODUCTION The objective of this trial was to compare the survival rates of mandibular lingual retainers bonded with either chemically cured or light-cured adhesive after orthodontic treatment. METHODS Patients having undergone orthodontic treatment at a private orthodontic office were randomly allocated to fixed retainers placed with chemically cured composite or light-cured composite. Eligibility criteria included no active caries, restorations, or fractures on the mandibular anterior teeth, and adequate oral hygiene. The main outcome was any type of first-time lingual retainer breakage; pattern of failure (adapted adhesive remnant index scores) was a secondary outcome. Randomization was accomplished with random permuted blocks of 20 patients with allocation concealed in sequentially numbered, opaque, sealed envelopes. Blinding was applicable for outcome assessment only. Patients were reviewed at 1, 3, and 6 months and then every 6 months after placement of the retainer until completion of the study. Data were analyzed using survival analysis including Cox regression; sensitivity analysis was carried out after data imputation for subjects lost to follow-up. RESULTS Two hundred twenty patients (median age, 16 years; interquartile range, 2; range, 12-47 years) were randomized in a 1:1 ratio to either chemical or light curing. Baseline characteristics were similar between groups, the median follow-up period was 2.19 years (range, 0.003-3.64 years), and 16 patients were lost to follow-up. At a minimum follow-up of 2 years, 47 of 110 (42.7%) and 55 of 110 (50.0%) retainers had some type of failure with chemically cured and light-cured adhesive, respectively (log-rank test, P = 0.35). Data were analyzed on an intention-to-treat basis, and the hazard ratio (HR) was 1.15 (95% confidence interval [CI], 0.88-1.70; P = 0.47). There was weak evidence that age is a significant predictor for lingual retainer failures (HR, 0.96; 95% CI, 0.93-1.00; P = 0.08). 
Adhesive remnant index scoring was possible for only 66 of the 102 (64.7%) failures and did not differ between composites (Fisher exact test, P = 0.16). No serious harm was observed other than gingivitis associated with plaque accumulation. CONCLUSIONS The results of this study indicated no evidence that survival of mandibular lingual retainers differs between chemically and light-cured adhesives. The overall failure rate was 46.4%; however, this included any type of failure, which may have exaggerated the overall failure rate.
Abstract:
Loss to follow-up (LTFU) is a common problem in many epidemiological studies. In antiretroviral treatment (ART) programs for patients with human immunodeficiency virus (HIV), mortality estimates can be biased if the LTFU mechanism is non-ignorable, that is, mortality differs between lost and retained patients. In this setting, routine procedures for handling missing data may lead to biased estimates. To appropriately deal with non-ignorable LTFU, explicit modeling of the missing data mechanism is needed. This can be based on additional outcome ascertainment for a sample of patients LTFU, for example, through linkage to national registries or through survey-based methods. In this paper, we demonstrate how this additional information can be used to construct estimators based on inverse probability weights (IPW) or multiple imputation. We use simulations to contrast the performance of the proposed estimators with methods widely used in HIV cohort research for dealing with missing data. The practical implications of our approach are illustrated using South African ART data, which are partially linkable to South African national vital registration data. Our results demonstrate that while IPWs and proper imputation procedures can be easily constructed from additional outcome ascertainment to obtain valid overall estimates, neglecting non-ignorable LTFU can result in substantial bias. We believe the proposed estimators are readily applicable to a growing number of studies where LTFU is appreciable, but additional outcome data are available through linkage or surveys of patients LTFU. Copyright © 2013 John Wiley & Sons, Ltd.
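The inverse probability weighting idea described in this abstract can be illustrated with a minimal sketch. It assumes a deliberately simplified setting: mortality is treated as a plain proportion rather than a time-to-event outcome, every traced patient receives the same weight, and all counts are hypothetical rather than taken from the South African data. The function name `ipw_mortality` is likewise an illustrative choice, not the authors' estimator.

```python
def ipw_mortality(retained_n, retained_deaths, ltfu_n, traced_n, traced_deaths):
    """Weighted mortality proportion using outcomes traced for a sample of LTFU patients.

    Retained patients get weight 1. Each traced LTFU patient gets weight
    ltfu_n / traced_n, so the traced sample stands in for all patients lost.
    """
    if traced_n <= 0 or traced_n > ltfu_n:
        raise ValueError("traced_n must be between 1 and ltfu_n")
    weight = ltfu_n / traced_n
    weighted_deaths = retained_deaths + weight * traced_deaths
    weighted_total = retained_n + weight * traced_n
    return weighted_deaths / weighted_total


# Hypothetical cohort: 800 retained (40 deaths), 200 lost to follow-up,
# of whom 50 were traced through registry linkage (20 deaths).
naive_estimate = 40 / 800  # ignores patients lost to follow-up
ipw_estimate = ipw_mortality(800, 40, 200, 50, 20)
print(f"naive: {naive_estimate:.3f}, IPW: {ipw_estimate:.3f}")
```

With these made-up counts the naive estimate understates mortality because patients lost to follow-up die more often than retained patients, which is the non-ignorable mechanism the abstract describes.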
Abstract:
BACKGROUND: Prognostic models for children starting antiretroviral therapy (ART) in Africa are lacking. We developed models to estimate the probability of death during the first year receiving ART in Southern Africa. METHODS: We analyzed data from children ≤10 years old who started ART in Malawi, South Africa, Zambia or Zimbabwe from 2004-2010. Children lost to follow-up or transferred were excluded. The primary outcome was all-cause mortality in the first year of ART. We used Weibull survival models to construct two prognostic models: one with CD4%, age, WHO clinical stage, weight-for-age z-score (WAZ) and anemia and one without CD4%, because it is not routinely measured in many programs. We used multiple imputation to account for missing data. RESULTS: Among 12655 children, 877 (6.9%) died in the first year of ART. 1780 children were lost to follow-up/transferred and excluded from main analyses; 10875 children were included. With the CD4% model probability of death at 1 year ranged from 1.8% (95% CI: 1.5-2.3) in children 5-10 years with CD4% ≥10%, WHO stage I/II, WAZ ≥-2 and without severe anemia to 46.3% (95% CI: 38.2-55.2) in children <1 year with CD4% <5%, stage III/IV, WAZ< -3 and severe anemia. The corresponding range for the model without CD4% was 2.2% (95% CI: 1.8-2.7) to 33.4% (95% CI: 28.2-39.3). Agreement between predicted and observed mortality was good (C-statistics=0.753 and 0.745 for models with and without CD4% respectively). CONCLUSION: These models may be useful to counsel children/caregivers, for program planning and to assess program outcomes after allowing for differences in patient disease severity characteristics.
Abstract:
Long-term measurements of CO2 flux can be obtained using the eddy covariance technique, but these datasets are affected by gaps which hinder the estimation of robust long-term means and annual ecosystem exchanges. We compare results obtained using three gap-fill techniques: multiple regression (MR), multiple imputation (MI), and artificial neural networks (ANNs), applied to a one-year dataset of hourly CO2 flux measurements collected in Lutjewad, over a flat agricultural area near the Wadden Sea dike in the north of the Netherlands. The dataset was separated into two subsets: a learning set and a validation set. The performance of the gap-filling techniques was analysed by calculating statistical criteria: coefficient of determination (R²), root mean square error (RMSE), mean absolute error (MAE), maximum absolute error (MaxAE), and mean square bias (MSB). The gap-fill accuracy is seasonally dependent, with better results in cold seasons. The highest accuracy is obtained using the ANN technique, which is also less sensitive to environmental/seasonal conditions. We argue that filling gaps directly on measured CO2 fluxes is more advantageous than the common method of filling gaps on calculated net ecosystem change, because ANN is an empirical method and smaller scatter is expected when gap filling is applied directly to measurements.
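Several of the validation criteria named in this abstract can be computed as in the following sketch. It assumes the coefficient of determination is defined as 1 - SS_res / SS_tot (the paper may instead use a squared correlation), and it omits the mean square bias because the abstract does not define it. The function name and the sample values are hypothetical.

```python
import math


def gap_fill_metrics(observed, predicted):
    """Compare gap-filled values against held-out observations."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(observed)
    errors = [p - o for o, p in zip(observed, predicted)]
    mean_obs = sum(observed) / n
    ss_res = sum(e * e for e in errors)                   # residual sum of squares
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)   # total sum of squares
    return {
        "r2": 1.0 - ss_res / ss_tot,
        "rmse": math.sqrt(ss_res / n),
        "mae": sum(abs(e) for e in errors) / n,
        "max_ae": max(abs(e) for e in errors),
    }


# Hypothetical validation subset: measured flux values and gap-filled estimates.
metrics = gap_fill_metrics([1.0, 2.0, 3.0, 4.0], [1.5, 2.0, 2.5, 4.0])
print(metrics)
```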
Abstract:
Human genetic variation contributes to differences in susceptibility to HIV-1 infection. To search for novel host resistance factors, we performed a genome-wide association study (GWAS) in hemophilia patients highly exposed to potentially contaminated factor VIII infusions. Individuals with hemophilia A and a documented history of factor VIII infusions before the introduction of viral inactivation procedures (1979-1984) were recruited from 36 hemophilia treatment centers (HTCs), and their genome-wide genetic variants were compared with those from matched HIV-infected individuals. Homozygous carriers of known CCR5 resistance mutations were excluded. Single nucleotide polymorphisms (SNPs) and inferred copy number variants (CNVs) were tested using logistic regression. In addition, we performed a pathway enrichment analysis, a heritability analysis, and a search for epistatic interactions with CCR5 Δ32 heterozygosity. A total of 560 HIV-uninfected cases were recruited: 36 (6.4%) were homozygous for CCR5 Δ32 or m303. After quality control and SNP imputation, we tested 1 081 435 SNPs and 3686 CNVs for association with HIV-1 serostatus in 431 cases and 765 HIV-infected controls. No SNP or CNV reached genome-wide significance. The additional analyses did not reveal any strong genetic effect. Highly exposed, yet uninfected hemophiliacs form an ideal study group to investigate host resistance factors. Using a genome-wide approach, we did not detect any significant associations between SNPs and HIV-1 susceptibility, indicating that common genetic variants of major effect are unlikely to explain the observed resistance phenotype in this population.
Abstract:
BACKGROUND The CD4 cell count or percent (CD4%) at the start of combination antiretroviral therapy (cART) is an important prognostic factor in children starting therapy and an important indicator of program performance. We describe trends and determinants of CD4 measures at cART initiation in children from low-, middle-, and high-income countries. METHODS We included children aged <16 years from clinics participating in a collaborative study spanning sub-Saharan Africa, Asia, Latin America, and the United States. Missing CD4 values at cART start were estimated through multiple imputation. Severe immunodeficiency was defined according to World Health Organization criteria. Analyses used generalized additive mixed models adjusted for age, country, and calendar year. RESULTS A total of 34,706 children from 9 low-income, 6 lower middle-income, 4 upper middle-income countries, and 1 high-income country (United States) were included; 20,624 children (59%) had severe immunodeficiency. In low-income countries, the estimated prevalence of children starting cART with severe immunodeficiency declined from 76% in 2004 to 63% in 2010. Corresponding figures for lower middle-income countries were from 77% to 66% and for upper middle-income countries from 75% to 58%. In the United States, the percentage decreased from 42% to 19% during the period 1996 to 2006. In low- and middle-income countries, infants and children aged 12-15 years had the highest prevalence of severe immunodeficiency at cART initiation. CONCLUSIONS Despite progress in most low- and middle-income countries, many children continue to start cART with severe immunodeficiency. Early diagnosis and treatment of HIV-infected children to prevent morbidity and mortality associated with immunodeficiency must remain a global public health priority.
Abstract:
Background Aerosolized vaccine can be used as a needle-free method of immunization against measles, a disease that remains a major cause of illness and death. Data on the immunogenicity of aerosolized vaccine against measles in children are inconsistent. Methods We conducted an open-label noninferiority trial involving children 9.0 to 11.9 months of age in India who were eligible to receive a first dose of measles vaccine. Children were randomly assigned to receive a single dose of vaccine by means of either aerosol inhalation or a subcutaneous injection. The primary end points were seropositivity for antibodies against measles and adverse events 91 days after vaccination. The noninferiority margin was 5 percentage points. Results A total of 1001 children were assigned to receive aerosolized vaccine, and 1003 children were assigned to receive subcutaneous vaccine; 1956 of all the children (97.6%) were followed to day 91, but outcome data were missing for 331 children because of thawed specimens. In the per-protocol population, data on 1560 of 2004 children (77.8%) could be evaluated. At day 91, a total of 662 of 775 children (85.4%; 95% confidence interval [CI], 82.5 to 88.0) in the aerosol group, as compared with 743 of 785 children (94.6%; 95% CI, 92.7 to 96.1) in the subcutaneous group, were seropositive, a difference of -9.2 percentage points (95% CI, -12.2 to -6.3). Findings were similar in the full-analysis set (673 of 788 children in the aerosol group [85.4%] and 754 of 796 children in the subcutaneous group [94.7%] were seropositive at day 91, a difference of -9.3 percentage points [95% CI, -12.3 to -6.4]) and after multiple imputation of missing results. No serious adverse events were attributable to measles vaccination. Adverse-event profiles were similar in the two groups. 
Conclusions Aerosolized vaccine against measles was immunogenic, but, at the prespecified margin, the aerosolized vaccine was inferior to the subcutaneous vaccine with respect to the rate of seropositivity. (Funded by the Bill and Melinda Gates Foundation; Measles Aerosol Vaccine Project Clinical Trials Registry-India number, CTRI/2009/091/000673.)
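The per-protocol comparison reported in this abstract can be reproduced approximately with a short sketch. It assumes a simple Wald interval for the difference between two proportions; the abstract does not state which interval method the trial used, so agreement with the published figures should be read as a consistency check only. The function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def risk_difference_ci(x1, n1, x2, n2, level=0.95):
    """Difference in proportions (group 1 minus group 2) with a Wald interval."""
    p1, p2 = x1 / n1, x2 / n2
    diff = p1 - p2
    se = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return diff, diff - z * se, diff + z * se


# Per-protocol counts from the abstract: aerosol 662/775, subcutaneous 743/785.
diff, lower, upper = risk_difference_ci(662, 775, 743, 785)
margin = -0.05  # noninferiority margin of 5 percentage points
noninferior = lower > margin
print(f"difference: {diff * 100:.1f} pp, "
      f"95% CI {lower * 100:.1f} to {upper * 100:.1f}, "
      f"noninferior: {noninferior}")
```

Noninferiority would require the lower confidence limit to lie above the margin of -5 percentage points. Here the whole interval lies below it, which matches the conclusion that the aerosolized vaccine was inferior at the prespecified margin.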
Abstract:
PURPOSE The implementation of genomic-based medicine is hindered by unresolved questions regarding data privacy and delivery of interpreted results to health-care practitioners. We used DNA-based prediction of HIV-related outcomes as a model to explore critical issues in clinical genomics. METHODS We genotyped 4,149 markers in HIV-positive individuals. Variants allowed for prediction of 17 traits relevant to HIV medical care, inference of patient ancestry, and imputation of human leukocyte antigen (HLA) types. Genetic data were processed under a privacy-preserving framework using homomorphic encryption, and clinical reports describing potentially actionable results were delivered to health-care providers. RESULTS A total of 230 patients were included in the study. We demonstrated the feasibility of encrypting a large number of genetic markers, inferring patient ancestry, computing monogenic and polygenic trait risks, and reporting results under privacy-preserving conditions. The average execution time of a multimarker test on encrypted data was 865 ms on a standard computer. The proportion of tests returning potentially actionable genetic results ranged from 0 to 54%. CONCLUSIONS The model of implementation presented herein informs on strategies to deliver genomic test results for clinical care. Data encryption to ensure privacy helps to build patient trust, a key requirement on the road to genomic-based medicine. Genet Med advance online publication 14 January 2016. Genetics in Medicine (2016); doi:10.1038/gim.2015.167.
Abstract:
Missing outcome data are common in clinical trials and despite a well-designed study protocol, some of the randomized participants may leave the trial early without providing any or all of the data, or may be excluded after randomization. Premature discontinuation causes loss of information, potentially resulting in attrition bias leading to problems during interpretation of trial findings. The causes of information loss in a trial, known as mechanisms of missingness, may influence the credibility of the trial results. Analysis of trials with missing outcome data should ideally be handled with intention to treat (ITT) rather than per protocol (PP) analysis. However, true ITT analysis requires appropriate assumptions and imputation of missing data. Using a worked example from a published dental study, we highlight the key issues associated with missing outcome data in clinical trials, describe the most recognized approaches to handling missing outcome data, and explain the principles of ITT and PP analysis.
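When multiple imputation is the chosen approach to missing outcome data, the analysis is run on each imputed dataset and the results are combined with Rubin's rules. The sketch below shows that pooling step only. The abstract does not say that the worked dental example used it, and the estimates and variances are hypothetical values chosen for illustration.

```python
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool per-imputation estimates and their variances with Rubin's rules."""
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching variances")
    pooled = mean(estimates)          # pooled point estimate
    within = mean(variances)          # average within-imputation variance
    between = variance(estimates)     # between-imputation variance (m - 1 denominator)
    total = within + (1 + 1 / m) * between
    return pooled, total


# Hypothetical treatment-effect estimates from three imputed datasets.
pooled_estimate, total_variance = pool_rubin(
    [0.10, 0.12, 0.14], [0.0004, 0.0004, 0.0004]
)
print(f"pooled estimate: {pooled_estimate:.3f}, total variance: {total_variance:.6f}")
```

The between-imputation term is what distinguishes this from single imputation: it carries the extra uncertainty due to the missing values into the final standard error.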
Abstract:
IMPORTANCE Obesity is a risk factor for deep vein thrombosis of the leg and pulmonary embolism. To date, however, whether obesity is associated with adult cerebral venous thrombosis (CVT) has not been assessed. OBJECTIVE To assess whether obesity is a risk factor for CVT. DESIGN, SETTING, AND PARTICIPANTS A case-control study was performed in consecutive adult patients with CVT admitted from July 1, 2006 (Amsterdam), and October 1, 2009 (Berne), through December 31, 2014, to the Academic Medical Center in Amsterdam, the Netherlands, or Inselspital University Hospital in Berne, Switzerland. The control group was composed of individuals from the control population of the Multiple Environmental and Genetic Assessment of Risk Factors for Venous Thrombosis study, which was a large Dutch case-control study performed from March 1, 1999, to September 31, 2004, and in which risk factors for deep vein thrombosis and pulmonary embolism were assessed. Data analysis was performed from January 2 to July 12, 2015. MAIN OUTCOMES AND MEASURES Obesity was determined by body mass index (BMI). A BMI of 30 or greater was considered to indicate obesity, and a BMI of 25 to 29.99 was considered to indicate overweight. A multiple imputation procedure was used for missing data. We adjusted for sex, age, history of cancer, ethnicity, smoking status, and oral contraceptive use. Individuals with normal weight (BMI <25) were the reference category. RESULTS The study included 186 cases and 6134 controls. Cases were younger (median age, 40 vs 48 years), more often female (133 [71.5%] vs 3220 [52.5%]), more often used oral contraceptives (97 [72.9%] vs 758 [23.5%] of women), and more frequently had a history of cancer (17 [9.1%] vs 235 [3.8%]) compared with controls. Obesity (BMI ≥30) was associated with an increased risk of CVT (adjusted odds ratio [OR], 2.63; 95% CI, 1.53-4.54). 
Stratification by sex revealed a strong association between CVT and obesity in women (adjusted OR, 3.50; 95% CI, 2.00-6.14) but not in men (adjusted OR, 1.16; 95% CI, 0.25-5.30). Further stratification revealed that, in women who used oral contraceptives, overweight and obesity were associated with an increased risk of CVT in a dose-dependent manner (BMI 25.0-29.9: adjusted OR, 11.87; 95% CI, 5.94-23.74; BMI ≥30: adjusted OR, 29.26; 95% CI, 13.47-63.60). No association was found in women who did not use oral contraceptives. CONCLUSIONS AND RELEVANCE Obesity is a strong risk factor for CVT in women who use oral contraceptives.
Abstract:
Background. The impact of human genetic background on low-trauma fracture (LTF) risk has not been evaluated in the context of human immunodeficiency virus (HIV) and clinical LTF risk factors. Methods. In the general population, 6 common single-nucleotide polymorphisms (SNPs) associate with LTF through genome-wide association study. Using genome-wide SNP arrays and imputation, we genotyped these SNPs in HIV-positive, white Swiss HIV Cohort Study participants. We included 103 individuals with a first, physician-validated LTF and 206 controls matched on gender, whose duration of observation and whose antiretroviral therapy start dates were similar using incidence density sampling. Analyses of nongenetic LTF risk factors were based on 158 cases and 788 controls. Results. A genetic risk score built from the 6 LTF-associated SNPs did not associate with LTF risk, in both models including and not including parental hip fracture history. The contribution of clinical LTF risk factors was limited in our dataset. Conclusions. Genetic LTF markers with a modest effect size in the general population do not improve fracture prediction in persons with HIV, in whom clinical LTF risk factors are prevalent in both cases and controls.