41 results for Small sample asymptotics
Abstract:
Little evidence exists on in utero exposure to chemotherapy for pregnancy-associated cancers that could help determine the long-term effects on offspring. This study is a systematic review of long-term follow-up studies, seeking evidence of adverse outcomes in exposed offspring. To be eligible for this systematic review, studies had to have a median follow-up of at least 24 months for the resulting child. PubMed, Medline, and Scopus were searched, and all eligible articles were included regardless of publication date. The search yielded six articles meeting the eligibility requirements. The findings of these studies suggest that there is not enough evidence to determine the risk of chemotherapy to the offspring. Exposed children in the reviewed papers did have some medical conditions, but the rate and type did not differ from those in the non-exposed population. However, a limitation of this literature review is the very small number of publications on this important topic, and the scarcity of studies is itself an important result of the review. More research and long-term follow-up studies must be conducted to address this issue.
Abstract:
This thesis project is motivated by the problem of using observational data to draw inferences about causal relationships in observational epidemiology when controlled randomization is not feasible. The instrumental variable (IV) method is one statistical tool for overcoming this problem. A Mendelian randomization study uses genetic variants as IVs in a genetic association study. In this thesis, the IV method, along with standard logistic and linear regression models, is used to investigate the causal association between the risk of pancreatic cancer and circulating levels of soluble receptor for advanced glycation end-products (sRAGE). Higher levels of serum sRAGE were found to be associated with a lower risk of pancreatic cancer in a previous observational study (255 cases and 485 controls). However, such a novel association may be biased by unknown confounding factors. In a case-control study, we aimed to use the IV approach to confirm or refute this observation in the subset of study subjects for whom genotyping data were available (178 cases and 177 controls). A two-stage IV analysis using generalized method of moments-structural mean models (GMM-SMM) was conducted and the relative risk (RR) was calculated. In the first-stage analysis, we found that the single nucleotide polymorphism (SNP) rs2070600 of the receptor for advanced glycation end-products (AGER) gene meets all three general assumptions for a genetic IV in examining the causal association between sRAGE and risk of pancreatic cancer. The variant allele of SNP rs2070600 of the AGER gene was associated with lower levels of sRAGE, and it was associated neither with risk of pancreatic cancer nor with the confounding factors. It was a potentially strong IV (F statistic = 29.2). However, in the second-stage analysis, the GMM-SMM model failed to converge because of non-concavity, probably owing to the small sample size.
Therefore, the IV analysis could not support the causality of the association between serum sRAGE levels and risk of pancreatic cancer. Nevertheless, these analyses suggest that rs2070600 is a potentially good genetic IV for testing the causality between sRAGE levels and the risk of pancreatic cancer. A larger sample size is required to conduct a credible IV analysis.
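The first-stage check of instrument strength reported above (F statistic = 29.2) can be illustrated with a minimal sketch. It assumes a single instrument and a continuous exposure; the genotype coding, allele frequency, and effect size below are hypothetical, and the code does not reproduce the GMM-SMM second stage used in the thesis.

```python
import random


def first_stage_f(z, x):
    """F statistic for the regression of exposure x on one instrument z.

    With a single regressor plus intercept, F = r^2 * (n - 2) / (1 - r^2),
    where r is the sample correlation between z and x.
    """
    n = len(z)
    mean_z, mean_x = sum(z) / n, sum(x) / n
    szz = sum((a - mean_z) ** 2 for a in z)
    sxx = sum((b - mean_x) ** 2 for b in x)
    szx = sum((a - mean_z) * (b - mean_x) for a, b in zip(z, x))
    r2 = szx ** 2 / (szz * sxx)
    return r2 * (n - 2) / (1 - r2)


# Hypothetical data: 355 subjects (the genotyped subset size in the abstract),
# genotype coded as 0/1/2 copies of the variant allele, and an assumed
# negative effect of the variant allele on the exposure.
rng = random.Random(0)
maf = 0.2  # assumed minor allele frequency, for illustration only
z = [sum(rng.random() < maf for _ in range(2)) for _ in range(355)]
x = [1.0 - 0.4 * g + rng.gauss(0.0, 1.0) for g in z]
f_sim = first_stage_f(z, x)
```

A first-stage F above roughly 10 is a common rule of thumb for an instrument that is not weak; the value obtained here depends entirely on the assumed effect size.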
Abstract:
The purpose of this study is to investigate the effects of predictor variable correlations and patterns of missingness with dichotomous and/or continuous data in small samples when missing data are multiply imputed. Missing predictor data are multiply imputed under three different multivariate models: the multivariate normal model for continuous data, the multinomial model for dichotomous data, and the general location model for mixed dichotomous and continuous data. After multiple imputation, Type I error rates of the regression coefficients obtained with logistic regression analysis are estimated under various conditions of correlation structure, sample size, type of data, and pattern of missing data. The distributional properties of the average mean, variance, and correlations among the predictor variables are assessed after the multiple imputation process. For continuous predictor data under the multivariate normal model, Type I error rates are generally within the nominal values with samples of size n = 100. Smaller samples of size n = 50 resulted in more conservative estimates (i.e., lower than the nominal value). Correlation and variance estimates of the original data are retained after multiple imputation with less than 50% missing continuous predictor data. For dichotomous predictor data under the multinomial model, Type I error rates are generally conservative, which is due in part to the sparseness of the data. With this model, the correlation structure of the predictor variables is not well retained in multiply imputed data from small samples with more than 50% missing data. For mixed continuous and dichotomous predictor data, the results are similar to those found under the multivariate normal model for continuous data and under the multinomial model for dichotomous data.
With all data types, a fully observed variable included with variables subject to missingness in the multiple imputation process and subsequent statistical analysis produced liberal (larger than nominal) Type I error rates under a specific pattern of missing data. It is suggested that future studies focus on the effects of multiple imputation in multivariate settings with more realistic data characteristics and a variety of multivariate analyses, assessing both Type I error and power.
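After multiple imputation, the coefficient estimates from the completed datasets are typically combined with Rubin's rules before a test is carried out. The following is a minimal sketch of that pooling step, assuming each of m imputed datasets yields one coefficient estimate and its squared standard error; the function name and the numbers are hypothetical and are not taken from the study.

```python
import statistics


def pool_rubin(estimates, variances):
    """Pool one coefficient across m imputed datasets with Rubin's rules.

    estimates: coefficient estimate from each completed dataset
    variances: squared standard error from each completed dataset
    Returns the pooled estimate and its total variance.
    """
    m = len(estimates)
    pooled = statistics.mean(estimates)
    within = statistics.mean(variances)        # average within-imputation variance
    between = statistics.variance(estimates)   # between-imputation variance (m - 1 divisor)
    total = within + (1 + 1 / m) * between
    return pooled, total


# Hypothetical logistic regression coefficients from m = 3 imputations.
pooled, total = pool_rubin([0.5, 0.7, 0.6], [0.04, 0.05, 0.06])
```

The between-imputation term is what carries the extra uncertainty due to missing data, so a higher proportion of missing values generally widens the pooled standard error.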
Abstract:
Few studies have investigated causal pathways linking psychosocial factors to each other and to screening mammography. Conflicting hypotheses exist in the theoretical literature regarding the role and importance of subjective norms, that is, a person's perceived social pressure to perform the behavior and his or her motivation to comply. The Theory of Reasoned Action (TRA) hypothesizes that subjective norms directly affect intention, while the Transtheoretical Model (TTM) hypothesizes that attitudes mediate the influence of subjective norms on stage of change. No one has examined which hypothesis best predicts the effect of subjective norms on mammography intention and stage of change. Two statistical methods are available for testing mediation, sequential regression analysis (SRA) and latent variable structural equation modeling (LVSEM); however, software to apply LVSEM to dichotomous variables like intention has only recently become available. No one has compared the methods to determine whether they yield similar results for dichotomous variables. Study objectives were to: (1) determine whether the effect of subjective norms on mammography intention and stage of change is mediated by pros and cons; and (2) compare mediation results from the SRA and LVSEM approaches when the outcome is dichotomous. We conducted a secondary analysis of data from a national sample of women veterans enrolled in Project H.O.M.E. (Healthy Outlook on the Mammography Experience), a behavioral intervention trial. Results showed that the TTM model described the causal pathways better than the TRA model; however, we found support for only one of the TTM causal mechanisms. Cons was the sole mediator. The effect of subjective norms on intention and stage of change mediated by cons was very small. These findings suggest that interventionists should focus their efforts on reducing negative attitudes toward mammography when resources are limited.
Both the SRA and LVSEM methods provided evidence for complete mediation, and the direction, magnitude, and standard errors of the parameter estimates were very similar. Because SRA parameter estimates were not biased toward the null, we can probably assume negligible measurement error in the independent and mediator variables. Simulation studies are needed to further our understanding of how these two methods perform under different data conditions.
Abstract:
This study explored the health, education, social assets, needs, attitudes, and behaviors of residents of Ferrocarril #4, a small urban community in Tamaulipas, Mexico. A collaborative Participatory Action Research approach was used to emphasize community involvement. Triangulation was used to ensure validity; qualitative methods included key informant in-depth interviews, participant observation, and participatory discussion groups with women and men. A personal interview was conducted with a probability sample of women. The median age of interviewees was 37 years. The majority were married or had a partner. Over half of respondents had completed grades 6-9. Employed women (25%) earned a median weekly salary equivalent to ∼56 USD. Women with health insurance (67.7%) were covered mainly through Social Security and Seguro Popular. One in five reported bad health. Barriers to care were primarily money and transportation. To improve health care, women wanted a full-service clinic in or close to the community and affordable health care. Socially, 28% of respondents had no close friends in the community and most did not participate in beneficial community activities. Many women did not socialize with others, and help from neighbors was situational. Primary school teachers lacked parental support, which interfered with classroom efforts. Healthy community discussion groups focused on personal and environmental hygiene and safety. Valuable assets exist in the community. To date, collaborative efforts have resulted in a school first aid station, a weekly school nurse visit, posting of emergency contact phone numbers in the school and community center, and development of a student health information form.
Abstract:
Health departments, research institutions, policy-makers, and healthcare providers are often interested in knowing the health status of their clients or constituents. Without the financial or administrative resources to go out into the community and conduct health assessments directly, these entities frequently rely on data from population-based surveys to supply the information they need. Unfortunately, these surveys are ill-equipped for the job because of sample size and privacy concerns. Small area estimation (SAE) techniques have excellent potential in such circumstances, but have been underutilized in public health owing to a lack of awareness of the methods and of confidence in applying them. The goal of this research is to make model-based SAE accessible to a broad readership using clear, example-based learning. Specifically, we applied the principles of multilevel, unit-level SAE to describe the geographic distribution of HPV vaccine coverage among females aged 11-26 in Texas. Multilevel (3 level: individual, county, public health region) random-intercept logit models of HPV vaccination (receipt of ≥ 1 dose of Gardasil®) were fit to data from the 2008 Behavioral Risk Factor Surveillance System (outcome and level 1 covariates) and a number of secondary sources (group-level covariates). Sampling weights were scaled (level 1) or constructed (levels 2 and 3), and incorporated at every level. Using the regression coefficients (and standard errors) from the final models, I simulated 10,000 datasets for each regression coefficient from the normal distribution and applied them to the logit model to estimate HPV vaccine coverage in each county and respective demographic subgroup. For simplicity, I only provide coverage estimates (and 95% confidence intervals) for counties. County-level coverage among females aged 11-17 varied from 6.8% to 29.0%. For females aged 18-26, coverage varied from 1.9% to 23.8%.
Aggregated to the state level, these values translate to indirect state estimates of 15.5% and 11.4%, respectively, both of which fall within the confidence intervals for the direct estimates of HPV vaccine coverage in Texas (Females 11-17: 17.7%, 95% CI: 13.6, 21.9; Females 18-26: 12.0%, 95% CI: 6.2, 17.7). Small area estimation has great potential for informing policy, program development and evaluation, and the provision of health services. Harnessing the flexibility of multilevel, unit-level SAE to estimate HPV vaccine coverage among females aged 11-26 in Texas counties, I have provided (1) practical guidance on how to conceptualize and conduct model-based SAE, (2) a robust framework that can be applied to other health outcomes or geographic levels of aggregation, and (3) HPV vaccine coverage data that may inform the development of health education programs, the provision of health services, the planning of additional research studies, and the creation of local health policies.
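The simulation step described in this abstract (drawing each regression coefficient from a normal distribution and applying the draws to the logit model) can be sketched roughly as follows. This is an illustration under stated assumptions, not the author's code: the coefficients, standard errors, and covariate values are hypothetical, coefficients are drawn independently (ignoring any covariance between them), and sampling weights and random intercepts are omitted.

```python
import math
import random
import statistics


def inv_logit(eta):
    """Map a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-eta))


def simulate_coverage(coefs, ses, covariates, n_draws=10_000, seed=0):
    """Simulate coverage for one area from coefficient estimates and SEs.

    Each coefficient is drawn from Normal(estimate, SE); every draw of the
    linear predictor is converted to a probability. Returns the mean and
    the 2.5th and 97.5th percentiles of the simulated probabilities.
    """
    rng = random.Random(seed)
    draws = []
    for _ in range(n_draws):
        eta = sum(rng.gauss(b, se) * x for b, se, x in zip(coefs, ses, covariates))
        draws.append(inv_logit(eta))
    draws.sort()
    lower = draws[int(0.025 * n_draws)]
    upper = draws[int(0.975 * n_draws) - 1]
    return statistics.mean(draws), lower, upper


# Hypothetical inputs: intercept plus one county-level covariate.
coefs = [-1.8, 0.4]
ses = [0.15, 0.10]
county = [1.0, 0.5]  # leading 1.0 multiplies the intercept
est, lo, hi = simulate_coverage(coefs, ses, county)
```

Taking percentiles of the simulated probabilities, rather than transforming a symmetric interval on the probability scale, keeps the interval inside 0 to 1, which matters when coverage is low.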
Abstract:
Sizes and power of selected two-sample tests of the equality of survival distributions are compared by simulation for small samples from unequally, randomly censored exponential distributions. The tests investigated include parametric tests (F, Score, Likelihood, Asymptotic), logrank tests (Mantel, Peto-Peto), and Wilcoxon-type tests (Gehan, Prentice). Equal-sized samples, n = 8, 16, 32 with 1000 (size) and 500 (power) simulation trials, are compared for 16 combinations of the censoring proportions 0%, 20%, 40%, and 60%. For n = 8 and 16, the Asymptotic, Peto-Peto, and Wilcoxon tests perform at nominal 5% size expectations, but the F, Score, and Mantel tests exceeded 5% size confidence limits for 1/3 of the censoring combinations. For n = 32, all tests showed proper size, with the Peto-Peto test most conservative in the presence of unequal censoring. Powers of all tests are compared for exponential hazard ratios of 1.4 and 2.0. There is little difference in power characteristics among the tests within each class considered. The Mantel test showed 90% to 95% power efficiency relative to parametric tests. Wilcoxon-type tests have the lowest relative power but are robust to differential censoring patterns. A modified Peto-Peto test shows power comparable to the Mantel test. For n = 32, a specific Weibull-exponential comparison of crossing survival curves suggests that the relative powers of logrank and Wilcoxon-type tests depend on the scale parameter of the Weibull distribution. Wilcoxon-type tests appear more powerful than logrank tests in the case of late-crossing survival curves and less powerful for early-crossing survival curves. Guidelines for the appropriate selection of two-sample tests are given.
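A simulation of this kind can be sketched for one of the tests, the Mantel logrank test. The sketch below is an illustration only: the abstract does not state the censoring mechanism, so independent exponential censoring tuned to the target censoring proportion is assumed, the function names are hypothetical, and the test is referred to the asymptotic chi-square critical value.

```python
import random


def logrank_chi2(times1, events1, times2, events2):
    """Mantel logrank chi-square statistic (1 df) for two samples."""
    data = [(t, e, 0) for t, e in zip(times1, events1)]
    data += [(t, e, 1) for t, e in zip(times2, events2)]
    obs_minus_exp, var = 0.0, 0.0
    for t in sorted({u for u, e, _ in data if e}):
        n1 = sum(1 for u, _, g in data if u >= t and g == 0)  # at risk, group 1
        n2 = sum(1 for u, _, g in data if u >= t and g == 1)  # at risk, group 2
        d1 = sum(1 for u, e, g in data if u == t and e and g == 0)
        d2 = sum(1 for u, e, g in data if u == t and e and g == 1)
        n, d = n1 + n2, d1 + d2
        obs_minus_exp += d1 - d * n1 / n
        if n > 1:
            var += d * (n1 / n) * (n2 / n) * (n - d) / (n - 1)
    return obs_minus_exp ** 2 / var if var > 0 else 0.0


def censored_exponential(rng, n, hazard, censor_prop):
    """Exponential survival times with (assumed) exponential censoring."""
    times, events = [], []
    for _ in range(n):
        t = rng.expovariate(hazard)
        if censor_prop > 0:
            # Censoring rate chosen so that P(censored) equals censor_prop.
            c = rng.expovariate(hazard * censor_prop / (1 - censor_prop))
            times.append(min(t, c))
            events.append(t <= c)
        else:
            times.append(t)
            events.append(True)
    return times, events


def rejection_rate(n, hazard_ratio, cens1, cens2, trials=1000, seed=1):
    """Share of trials in which the logrank test rejects at the 5% level."""
    rng = random.Random(seed)
    critical = 3.841  # 95th percentile of chi-square with 1 df
    rejections = 0
    for _ in range(trials):
        a = censored_exponential(rng, n, 1.0, cens1)
        b = censored_exponential(rng, n, hazard_ratio, cens2)
        if logrank_chi2(*a, *b) > critical:
            rejections += 1
    return rejections / trials


# Empirical size under the null (hazard ratio 1) with unequal censoring.
null_size = rejection_rate(n=16, hazard_ratio=1.0, cens1=0.2, cens2=0.4, trials=200)
```

Setting `hazard_ratio` to 1.4 or 2.0 turns the same function into a power estimate for the alternatives considered in the abstract.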
Abstract:
"Technology assessment is a comprehensive form of policy research that examines the short- and long-term social consequences of the application or use of technology" (US Congress 1967). This study explored a research methodology appropriate for technology assessment (TA) within the health industry. The case studied was utilization of external Small-Volume Infusion Pumps (SVIP) at a cancer treatment and research center. Primary and secondary data were collected in three project phases. In Phase I, hospital prescription records (N = 14,979) represented SVIP adoption and utilization for the years 1982-1984. The Candidate Adoption-Use (CA-U) diffusion paradigm developed for this study was germane. Compared to classic and unorthodox curves, CA-U more accurately simulated empiric experience. The hospital SVIP 1983-1984 trends denoted assurance in prescribing chemotherapy and concomitant balloon SVIP efficacy and efficiency. Abandonment of battery pumps was predicted while exponential demand for balloon SVIP was forecast for 1985-1987. In Phase II, patients using SVIP (N = 117) were prospectively surveyed from July to October 1984; the data represented a single episode of therapy. The questionnaire and indices, specifically designed to measure the impact of SVIP, evinced face validity. Compeer group data were from pre-SVIP case reviews rather than from an inpatient sample. Statistically significant results indicated that outpatients using SVIP interacted socially more than inpatients using the alternative technology. Additionally, the hospital's education program effectively taught clients to discriminate between self-care and professional SVIP services. In these contexts, there was sufficient evidence that the alternative technology restricted patients' activity whereas SVIP permitted patients to function more independently and in a social lifestyle, thus adding quality to life.
In Phase III, diffusion forecast and patient survey findings were combined with direct observation of clinic services to profile some economic dimensions of SVIP. These three project phases provide a foundation for executing: (1) cost-effectiveness analysis of external versus internal infusors, (2) institutional resource allocation, and (3) technology deployment to epidemiologically significant communities. The models and methods tested in this research of clinical technology assessment are innovative and do assess biotechnology.
Abstract:
Lung cancer is the leading cause of cancer-related mortality in the US. Emerging evidence has shown that host genetic factors can interact with environmental exposures to influence patient susceptibility to the disease as well as clinical outcomes, such as survival and recurrence. We aimed to identify genetic prognostic markers for non-small cell lung cancer (NSCLC), a major (85%) subtype of lung cancer, and also in other subgroups. With the fast evolution of genotyping technology, genetic association studies have moved from the candidate gene approach, to the pathway-based approach, to the genome-wide association study (GWAS). Even in the era of GWAS, the pathway-based approach has its own advantages for studying cancer clinical outcomes: it is cost-effective, requires a smaller sample size than GWAS, and makes it easier to identify a validation population and explore gene-gene interactions. In the current study, we adopted a pathway-based approach focusing on two critical pathways, the miRNA and inflammation pathways. MicroRNAs (miRNAs) post-transcriptionally regulate around 30% of human genes. Polymorphisms within miRNA processing pathways and binding sites may influence patients' prognosis through altered gene regulation. Inflammation plays an important role in cancer initiation and progression, and has also been shown to affect patients' clinical outcomes. We first evaluated 240 single nucleotide polymorphisms (SNPs) in miRNA biogenesis genes and predicted binding sites in NSCLC patients to determine associations with clinical outcomes in early-stage (stage I and II) and late-stage (stage III and IV) lung cancer patients, respectively. First, in 535 early-stage patients, after correcting for multiple comparisons, FZD4:rs713065 (hazard ratio [HR]: 0.46, 95% confidence interval [CI]: 0.32-0.65) showed a significant inverse association with survival in early-stage surgery-only patients.
SP1:rs17695156 (HR: 2.22, 95% CI: 1.44-3.41) and DROSHA:rs6886834 (HR: 6.38, 95% CI: 2.49-16.31) conferred increased risk of progression in the all-patients and surgery-only populations, respectively. FAS:rs2234978 was significantly associated with improved survival in all patients (HR: 0.59, 95% CI: 0.44-0.77) and in the surgery plus chemotherapy population (HR: 0.19, 95% CI: 0.07-0.46). Functional genomics analysis demonstrated that this variant creates a miR-651 binding site resulting in altered miRNA regulation of FAS, providing biological plausibility for the observed association. We then analyzed these associations in 598 late-stage patients. After multiple comparison corrections, no SNPs remained significant in the late-stage group, while the top SNP NAT1:rs15561 (HR=1.98, 96%CI=1.32-2.94) conferred a significantly increased risk of death in the chemotherapy subgroup. To test the hypothesis that genetic variants in the inflammation-related pathways may be associated with survival in NSCLC patients, we first conducted a three-stage study. In the discovery phase, we investigated a comprehensive panel of 11,930 inflammation-related SNPs in three independent lung cancer populations. A missense SNP (rs2071554) in HLA-DOB was significantly associated with poor survival in the discovery population (HR: 1.46, 95% CI: 1.02-2.09), the internal validation population (HR: 1.51, 95% CI: 1.02-2.25), and the external validation population (HR: 1.52, 95% CI: 1.01-2.29). Rs2900420 in KLRK1 was significantly associated with a reduced risk of death in the discovery (HR: 0.76, 95% CI: 0.60-0.96) and internal validation (HR: 0.77, 95% CI: 0.61-0.99) populations, and the association reached borderline significance in the external validation population (HR: 0.80, 95% CI: 0.63-1.02). We also evaluated these inflammation-related SNPs in NSCLC patients who were never smokers. Lung cancer in never smokers has been increasingly recognized as a distinct disease from that in ever-smokers.
A two-stage study was performed using a discovery population from MD Anderson (411 patients) and a validation population from Mayo Clinic (311 patients). Three SNPs (IL17RA:rs879576, BMP8A:rs698141, and STK:rs290229) that were significantly associated with survival were validated (pCD74:rs1056400 and CD38:rs10805347) were borderline significant (p=0.08) in the Mayo Clinic population. In the combined analysis, IL17RA:rs879576 resulted in a 40% reduction in the risk of death (p = 4.1 × 10⁻⁵ [p = 0.61, heterogeneity test]). We also validated, in the Mayo Clinic population, a survival tree created in the MD Anderson population. In conclusion, our results provide strong evidence that genetic variations in the specific pathways examined (miRNA and inflammation pathways) influence clinical outcomes in NSCLC patients, and, with further functional studies, the novel loci have the potential to be translated into clinical use.
Abstract:
Multi-center clinical trials are very common in the development of new drugs and devices. One concern in such trials is the effect on the overall result of individual investigational sites enrolling small numbers of patients. Can the presence of small centers cause an ineffective treatment to appear effective when treatment-by-center interaction is not statistically significant? In this research, simulations are used to study the effect that centers enrolling few patients may have on the analysis of clinical trial data. A multi-center clinical trial with 20 sites is simulated to investigate the effect of a new treatment in comparison to a placebo treatment. Twelve of these 20 investigational sites are considered small, each enrolling fewer than four patients per treatment group. Three clinical trials are simulated with sample sizes of 100, 170, and 300. The simulated data are generated with various characteristics, one in which treatment should be considered effective and another in which treatment is not effective. Qualitative interactions are also produced within the small sites to further investigate the effect of small centers under various conditions. Standard analysis of variance methods and the "sometimes-pool" testing procedure are applied to the simulated data. One model investigates treatment and center effects and treatment-by-center interaction. Another model investigates treatment effect alone. These analyses are used to determine the power to detect treatment-by-center interactions and the probability of type I error. We find it is difficult to detect treatment-by-center interactions when only a few investigational sites enrolling a limited number of patients participate in the interaction. However, we find no increased risk of type I error in these situations.
In a pooled analysis, when the treatment is not effective, the probability of finding a significant treatment effect in the absence of significant treatment-by-center interaction is well within standard limits of type I error.
Abstract:
Microsatellite instability (MSI) is a hallmark of the mutator phenotype associated with Hereditary Non-Polyposis Colon Cancer (HNPCC). The MSI-High (MSI-H) HNPCC population has been well characterized, but the microsatellite low and stable (MSI-L/MSS) HNPCC population is much less understood. We hypothesize there are significant levels of MSI in HNPCC DNA classified as MSI-L/MSS, but no single variant allele makes up a sufficient population in the tumor DNA to be detected by standard analysis. Finding variants would suggest there is a mutator phenotype for the MSI-L/MSS HNPCC population that is distinct from that of the MSI-H HNPCC population. This study quantified and compared MSI in HNPCC patients previously shown to be MSI-H or MSI-L/MSS and in an older MSI-H sporadic colorectal cancer patient. Small-pool Polymerase Chain Reactions (SP-PCRs) were conducted in which the DNA from each sample and control was diluted into multiple pools, each containing approximately a single genome equivalent. At least 100 alleles per sample were studied at six microsatellite loci. Mutant fragments were identified, quantified, and compared using Poisson statistics. Most of the variants were small deletions or insertions, with more mutants being deletions, as has been previously described in yeast and transgenic mice. SP-PCR, in which most of the pools contained three or fewer fragments, enabled identification of variants too infrequent to be detected by large-pool PCR. Mutant fragments in positive control MSI-H tumor samples ranged from 0.26 to 0.68 in at least 4 of the 6 loci tested and were consistent with their MSI-H status. In the so-called MSS tumors and constitutive tissues (normal colon tissue and PBLs) of all the HNPCC patients, low but significant levels of MSI were seen in at least two of the loci studied.
This phenomenon was not seen in the sporadic MSI constitutive tissues or the normal controls, and it suggests haploinsufficiency, gain-of-function, or a dominant-negative basis for the instability in HNPCC patients carrying germline mutations in tumor suppressor genes. A different frequency and spectrum of mutant fragments suggests a different genetic basis (other than a major mutation in MLH1 or MSH2) for disease in MSI-L and MSS HNPCC patients.