12 resultados para Log-normal distribution

em DigitalCommons@The Texas Medical Center


Relevância:

90.00% 90.00%

Publicador:

Resumo:

A variety of occupational hazards are indigenous to academic and research institutions, ranging from traditional life safety concerns, such as fire safety and fall protection, to specialized occupational hygiene issues such as exposure to carcinogenic chemicals, radiation sources, and infectious microorganisms. Institutional health and safety programs are constantly challenged to establish and maintain adequate protective measures for this wide array of hazards. A unique subset of academic and research institutions are classified as historically Black universities which provide educational opportunities primarily to minority populations. State funded minority schools receive less resources than their non-minority counterparts, resulting in a reduced ability to provide certain programs and services. Comprehensive health and safety services for these institutions may be one of the services compromised, resulting in uncontrolled exposures to various workplace hazards. Such a result would also be contrary to the national health status objectives to improve preventive health care measures for minority populations.^ To determine if differences exist, a cross-sectional survey was performed to evaluate the relative status of health and safety programs present within minority and non-minority state-funded academic and research institutions. Data were obtained from direct mail questionnaires, supplemented by data from publicly available sources. Parameters for comparison included reported numbers of full and part-time health and safety staff, reported OSHA 200 log (or equivalent) values, and reported workers compensation experience modifiers. The relative impact of institutional minority status, institution size, and OSHA regulatory environment, was also assessed. Additional health and safety program descriptors were solicited in an attempt to develop a preliminary profile of the hazards present in this unique work setting.^ Survey forms were distributed to 24 minority and 51 non-minority institutions. A total of 72% of the questionnaires were returned, with 58% of the minority and 78% of the non-minority institutions participating. The mean number of reported full-time health and safety staff for the responding minority institutions was determined to be 1.14, compared to 3.12 for the responding non-minority institutions. Data distribution variances were stabilized using log-normal transformations, and although subsequent analysis indicated statistically significant differences, the differences were found to be predicted by institution size only, and not by minority status or OSHA regulatory environment. Similar results were noted for estimated full-time equivalent health and safety staffing levels. Significant differences were not noted between reported OSHA 200 log (or equivalent) data, and a lack of information provided on workers compensation experience modifiers prevented comparisons on insurance premium expenditures. Other health and safety program descriptive information obtained served to validate the study's presupposition that the inclusion criteria would encompass those organizations with occupational risks from all four major hazard categories. Worker medical surveillance programs appeared to exist at most institutions, but the specific tests completed were not readily identifiable.^ The results of this study serve as a preliminary description of the health and safety programs for a unique set of workplaces have not been previously investigated. Numerous opportunities for further research are noted, including efforts to quantify the relative amount of each hazard present, the further definition of the programs reported to be in place, determination of other means to measure health outcomes on campuses, and comparisons among other culturally diverse workplaces. ^

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Environmental data sets of pollutant concentrations in air, water, and soil frequently include unquantified sample values reported only as being below the analytical method detection limit. These values, referred to as censored values, should be considered in the estimation of distribution parameters as each represents some value of pollutant concentration between zero and the detection limit. Most of the currently accepted methods for estimating the population parameters of environmental data sets containing censored values rely upon the assumption of an underlying normal (or transformed normal) distribution. This assumption can result in unacceptable levels of error in parameter estimation due to the unbounded left tail of the normal distribution. With the beta distribution, which is bounded by the same range of a distribution of concentrations, $\rm\lbrack0\le x\le1\rbrack,$ parameter estimation errors resulting from improper distribution bounds are avoided. This work developed a method that uses the beta distribution to estimate population parameters from censored environmental data sets and evaluated its performance in comparison to currently accepted methods that rely upon an underlying normal (or transformed normal) distribution. Data sets were generated assuming typical values encountered in environmental pollutant evaluation for mean, standard deviation, and number of variates. For each set of model values, data sets were generated assuming that the data was distributed either normally, lognormally, or according to a beta distribution. For varying levels of censoring, two established methods of parameter estimation, regression on normal ordered statistics, and regression on lognormal ordered statistics, were used to estimate the known mean and standard deviation of each data set. The method developed for this study, employing a beta distribution assumption, was also used to estimate parameters and the relative accuracy of all three methods were compared. For data sets of all three distribution types, and for censoring levels up to 50%, the performance of the new method equaled, if not exceeded, the performance of the two established methods. Because of its robustness in parameter estimation regardless of distribution type or censoring level, the method employing the beta distribution should be considered for full development in estimating parameters for censored environmental data sets. ^

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Interaction effect is an important scientific interest for many areas of research. Common approach for investigating the interaction effect of two continuous covariates on a response variable is through a cross-product term in multiple linear regression. In epidemiological studies, the two-way analysis of variance (ANOVA) type of method has also been utilized to examine the interaction effect by replacing the continuous covariates with their discretized levels. However, the implications of model assumptions of either approach have not been examined and the statistical validation has only focused on the general method, not specifically for the interaction effect.^ In this dissertation, we investigated the validity of both approaches based on the mathematical assumptions for non-skewed data. We showed that linear regression may not be an appropriate model when the interaction effect exists because it implies a highly skewed distribution for the response variable. We also showed that the normality and constant variance assumptions required by ANOVA are not satisfied in the model where the continuous covariates are replaced with their discretized levels. Therefore, naïve application of ANOVA method may lead to an incorrect conclusion. ^ Given the problems identified above, we proposed a novel method modifying from the traditional ANOVA approach to rigorously evaluate the interaction effect. The analytical expression of the interaction effect was derived based on the conditional distribution of the response variable given the discretized continuous covariates. A testing procedure that combines the p-values from each level of the discretized covariates was developed to test the overall significance of the interaction effect. According to the simulation study, the proposed method is more powerful then the least squares regression and the ANOVA method in detecting the interaction effect when data comes from a trivariate normal distribution. The proposed method was applied to a dataset from the National Institute of Neurological Disorders and Stroke (NINDS) tissue plasminogen activator (t-PA) stroke trial, and baseline age-by-weight interaction effect was found significant in predicting the change from baseline in NIHSS at Month-3 among patients received t-PA therapy.^

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Health departments, research institutions, policy-makers, and healthcare providers are often interested in knowing the health status of their clients/constituents. Without the resources, financially or administratively, to go out into the community and conduct health assessments directly, these entities frequently rely on data from population-based surveys to supply the information they need. Unfortunately, these surveys are ill-equipped for the job due to sample size and privacy concerns. Small area estimation (SAE) techniques have excellent potential in such circumstances, but have been underutilized in public health due to lack of awareness and confidence in applying its methods. The goal of this research is to make model-based SAE accessible to a broad readership using clear, example-based learning. Specifically, we applied the principles of multilevel, unit-level SAE to describe the geographic distribution of HPV vaccine coverage among females aged 11-26 in Texas.^ Multilevel (3 level: individual, county, public health region) random-intercept logit models of HPV vaccination (receipt of ≥ 1 dose Gardasil® ) were fit to data from the 2008 Behavioral Risk Factor Surveillance System (outcome and level 1 covariates) and a number of secondary sources (group-level covariates). Sampling weights were scaled (level 1) or constructed (levels 2 & 3), and incorporated at every level. Using the regression coefficients (and standard errors) from the final models, I simulated 10,000 datasets for each regression coefficient from the normal distribution and applied them to the logit model to estimate HPV vaccine coverage in each county and respective demographic subgroup. For simplicity, I only provide coverage estimates (and 95% confidence intervals) for counties.^ County-level coverage among females aged 11-17 varied from 6.8-29.0%. For females aged 18-26, coverage varied from 1.9%-23.8%. Aggregated to the state level, these values translate to indirect state estimates of 15.5% and 11.4%, respectively; both of which fall within the confidence intervals for the direct estimates of HPV vaccine coverage in Texas (Females 11-17: 17.7%, 95% CI: 13.6, 21.9; Females 18-26: 12.0%, 95% CI: 6.2, 17.7).^ Small area estimation has great potential for informing policy, program development and evaluation, and the provision of health services. Harnessing the flexibility of multilevel, unit-level SAE to estimate HPV vaccine coverage among females aged 11-26 in Texas counties, I have provided (1) practical guidance on how to conceptualize and conduct modelbased SAE, (2) a robust framework that can be applied to other health outcomes or geographic levels of aggregation, and (3) HPV vaccine coverage data that may inform the development of health education programs, the provision of health services, the planning of additional research studies, and the creation of local health policies.^

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The efficacy of waste stabilization lagoons for the treatment of five priority pollutants and two widely used commercial compounds was evaluated in laboratory model ponds. Three ponds were designed to simulate a primary anaerobic lagoon, a secondary facultative lagoon, and a tertiary aerobic lagoon. Biodegradation, volatilization, and sorption losses were quantified for bis(2-chloroethyl) ether, benzene, toluene, naphthalene, phenanthrene, ethylene glycol, and ethylene glycol monoethyl ether. A statistical model using a log normal transformation indicated biodegradation of bis(2-chloroethyl) ether followed first-order kinetics. Additionally, multiple regression analysis indicated biochemical oxygen demand was the water quality variable most highly correlated with bis(2-chloroethyl) ether effluent concentration. ^

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Maximizing data quality may be especially difficult in trauma-related clinical research. Strategies are needed to improve data quality and assess the impact of data quality on clinical predictive models. This study had two objectives. The first was to compare missing data between two multi-center trauma transfusion studies: a retrospective study (RS) using medical chart data with minimal data quality review and the PRospective Observational Multi-center Major Trauma Transfusion (PROMMTT) study with standardized quality assurance. The second objective was to assess the impact of missing data on clinical prediction algorithms by evaluating blood transfusion prediction models using PROMMTT data. RS (2005-06) and PROMMTT (2009-10) investigated trauma patients receiving ≥ 1 unit of red blood cells (RBC) from ten Level I trauma centers. Missing data were compared for 33 variables collected in both studies using mixed effects logistic regression (including random intercepts for study site). Massive transfusion (MT) patients received ≥ 10 RBC units within 24h of admission. Correct classification percentages for three MT prediction models were evaluated using complete case analysis and multiple imputation based on the multivariate normal distribution. A sensitivity analysis for missing data was conducted to estimate the upper and lower bounds of correct classification using assumptions about missing data under best and worst case scenarios. Most variables (17/33=52%) had <1% missing data in RS and PROMMTT. Of the remaining variables, 50% demonstrated less missingness in PROMMTT, 25% had less missingness in RS, and 25% were similar between studies. Missing percentages for MT prediction variables in PROMMTT ranged from 2.2% (heart rate) to 45% (respiratory rate). For variables missing >1%, study site was associated with missingness (all p≤0.021). Survival time predicted missingness for 50% of RS and 60% of PROMMTT variables. MT models complete case proportions ranged from 41% to 88%. Complete case analysis and multiple imputation demonstrated similar correct classification results. Sensitivity analysis upper-lower bound ranges for the three MT models were 59-63%, 36-46%, and 46-58%. Prospective collection of ten-fold more variables with data quality assurance reduced overall missing data. Study site and patient survival were associated with missingness, suggesting that data were not missing completely at random, and complete case analysis may lead to biased results. Evaluating clinical prediction model accuracy may be misleading in the presence of missing data, especially with many predictor variables. The proposed sensitivity analysis estimating correct classification under upper (best case scenario)/lower (worst case scenario) bounds may be more informative than multiple imputation, which provided results similar to complete case analysis.^

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Nuclear morphometry (NM) uses image analysis to measure features of the cell nucleus which are classified as: bulk properties, shape or form, and DNA distribution. Studies have used these measurements as diagnostic and prognostic indicators of disease with inconclusive results. The distributional properties of these variables have not been systematically investigated although much of the medical data exhibit nonnormal distributions. Measurements are done on several hundred cells per patient so summary measurements reflecting the underlying distribution are needed.^ Distributional characteristics of 34 NM variables from prostate cancer cells were investigated using graphical and analytical techniques. Cells per sample ranged from 52 to 458. A small sample of patients with benign prostatic hyperplasia (BPH), representing non-cancer cells, was used for general comparison with the cancer cells.^ Data transformations such as log, square root and 1/x did not yield normality as measured by the Shapiro-Wilks test for normality. A modulus transformation, used for distributions having abnormal kurtosis values, also did not produce normality.^ Kernel density histograms of the 34 variables exhibited non-normality and 18 variables also exhibited bimodality. A bimodality coefficient was calculated and 3 variables: DNA concentration, shape and elongation, showed the strongest evidence of bimodality and were studied further.^ Two analytical approaches were used to obtain a summary measure for each variable for each patient: cluster analysis to determine significant clusters and a mixture model analysis using a two component model having a Gaussian distribution with equal variances. The mixture component parameters were used to bootstrap the log likelihood ratio to determine the significant number of components, 1 or 2. These summary measures were used as predictors of disease severity in several proportional odds logistic regression models. The disease severity scale had 5 levels and was constructed of 3 components: extracapsulary penetration (ECP), lymph node involvement (LN+) and seminal vesicle involvement (SV+) which represent surrogate measures of prognosis. The summary measures were not strong predictors of disease severity. There was some indication from the mixture model results that there were changes in mean levels and proportions of the components in the lower severity levels. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

OBJECTIVE: To characterize PubMed usage over a typical day and compare it to previous studies of user behavior on Web search engines. DESIGN: We performed a lexical and semantic analysis of 2,689,166 queries issued on PubMed over 24 consecutive hours on a typical day. MEASUREMENTS: We measured the number of queries, number of distinct users, queries per user, terms per query, common terms, Boolean operator use, common phrases, result set size, MeSH categories, used semantic measurements to group queries into sessions, and studied the addition and removal of terms from consecutive queries to gauge search strategies. RESULTS: The size of the result sets from a sample of queries showed a bimodal distribution, with peaks at approximately 3 and 100 results, suggesting that a large group of queries was tightly focused and another was broad. Like Web search engine sessions, most PubMed sessions consisted of a single query. However, PubMed queries contained more terms. CONCLUSION: PubMed's usage profile should be considered when educating users, building user interfaces, and developing future biomedical information retrieval systems.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

OPN is a secreted phosphate containing protein which is expressed by osteoblasts and a variety of other cells in vivo. Data from in vitro studies has accumulated which relates OPN to cellular transformation. We hypothesize that OPN expression is associated with neoplastic disease in humans as suggested by cell culture models. The overall objective of the current study was to determine the tissue distribution of OPN in human malignancy and to determine whether or not a correlation exists between OPN serum levels and malignancy. At the inception of this project, no study had been made demonstrating the relevance of OPN expression with naturally occurring neoplastic disease in humans. To date, few studies have reported OPN distribution in human neoplasia and are limited by either the number of specimens analyzed or the technique used in analysis. In this dissertation study, OPN was purified from human milk and $\alpha$-OPN antiserum developed and characterized. Following antibody development, the distribution and prevalence of OPN in human oral squamous cell carcinoma and human prostate carcinoma was evaluated using immunohistochemical localization. OPN immunolocalization was found in a high percentage of oral epithelial dysplasia and oral squamous cell carcinoma in humans. One oral squamous cell carcinoma cells line, UMSCC-1, was found to express OPN mRNA using Northern blotting. OPN localized to a high percentage of primary prostate adenocarcinomas. OPN localized to 52% of androgen dependent cases and 100% of androgen independent cases. Androgen dependent cell lines such as LNCap and NbE showed minimal OPN mRNA expression while the androgen independent lines C4-2 and PC3 produced ample OPN mRNA. An OPN sandwich assay was developed and used to determine the serum level of OPN in normal males, patients with BPH (benign prostate hypertrophy), and patients with prostate carcinoma. No statistically significant difference was found in OPN serum levels among the three groups. However, a trend of increasing OPN in the serum was noted in patients with BPH and prostate cancer. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Previous studies of normal children have linked body fat but not body fat distribution (BFD), to higher blood pressures, lipids, and insulin resistance (Berenson et al., 1988) BFD is a well-established risk factor for cardiovascular disease in adults (Björntorp, 1988). This study investigates the relation of BFD and serum lipids at baseline in children from Project HeartBeat!, a study of the growth and development of cardiovascular risk factors in 678 children in three cohorts measured initially at ages 8, 11, and 14 years. Initially, two of four indices of BFD were significantly related to the lipids: ratio of upper to lower body skinfolds (ln US:LS) and conicity (C Index). A factor analysis reduced the information in the serum lipids to two vectors: (1) total cholesterol + LDL-cholesterol and (2) HDL-cholesterol − triglycerides, which together accounted for 85% of the lipid variation. Using each serum lipid and vector as separate dependent variables, linear and quadratic regression models were constructed to examine the predictive ability of the two BFD variables, controlling for total body fat, gender, ethnicity (Black, non-Black) and maturation. Linear models provided an acceptable fit. Percent body fat (%BF) was a significant predictor in each and every lipid model, independent of age, maturation, or ethnicity (p ≤ 0.05). No BFD variable entered the equation for total or LDL-cholesterol, although there was a significant maturity by BFD interaction for LDL (ln US:LS was a significant predictor in more mature individuals). Both %BF and BFD (by way of Conicity) were significant predictors of HDL-cholesterol and triglycerides (p ≤ 0.01). All models were statistically significant at a high level (p ≤ 0.01), but adjusted R 2's for all models were low (0.05–0.15). Body fat distribution is a significant predictor of lipids in normal children, but secondarily to %BF, and for LDL-cholesterol in particular, the relation is dependent on maturity status. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The determination of size as well as power of a test is a vital part of a Clinical Trial Design. This research focuses on the simulation of clinical trial data with time-to-event as the primary outcome. It investigates the impact of different recruitment patterns, and time dependent hazard structures on size and power of the log-rank test. A non-homogeneous Poisson process is used to simulate entry times according to the different accrual patterns. A Weibull distribution is employed to simulate survival times according to the different hazard structures. The current study utilizes simulation methods to evaluate the effect of different recruitment patterns on size and power estimates of the log-rank test. The size of the log-rank test is estimated by simulating survival times with identical hazard rates between the treatment and the control arm of the study resulting in a hazard ratio of one. Powers of the log-rank test at specific values of hazard ratio (≠1) are estimated by simulating survival times with different, but proportional hazard rates for the two arms of the study. Different shapes (constant, decreasing, or increasing) of the hazard function of the Weibull distribution are also considered to assess the effect of hazard structure on the size and power of the log-rank test. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: The physical characteristic of protons is that they deliver most of their radiation dose to the target volume and deliver no dose to the normal tissue distal to the tumor. Previously, numerous studies have shown unique advantages of proton therapy over intensity-modulated radiation therapy (IMRT) in conforming dose to the tumor and sparing dose to the surrounding normal tissues and the critical structures in many clinical sites. However, proton therapy is known to be more sensitive to treatment uncertainties such as inter- and intra-fractional variations in patient anatomy. To date, no study has clearly demonstrated the effectiveness of proton therapy compared with the conventional IMRT under the consideration of both respiratory motion and tumor shrinkage in non-small cell lung cancer (NSCLC) patients. Purpose: This thesis investigated two questions for establishing a clinically relevant comparison of the two different modalities (IMRT and proton therapy). The first question was whether or not there are any differences in tumor shrinkage between patients randomized to IMRT versus passively scattered proton therapy (PSPT). Tumor shrinkage is considered a standard measure of radiation therapy response that has been widely used to gauge a short-term progression of radiation therapy. The second question was whether or not there are any differences between the planned dose and 5D dose under the influence of inter- and intra-fractional variations in the patient anatomy for both modalities. Methods: A total of 45 patients (25 IMRT patients and 20 PSPT patients) were used to quantify the tumor shrinkage in terms of the change of the primary gross tumor volume (GTVp). All patients were randomized to receive either IMRT or PSPT for NSCLC. Treatment planning goals were identical for both groups. All patients received 5 to 8 weekly repeated 4-dimensional computed tomography (4DCT) scans during the course of radiation treatments. The original GTVp contours were propagated to T50 of weekly 4DCT images using deformable image registration and their absolute volumes were measured. Statistical analysis was performed to compare the distribution of tumor shrinkage between the two population groups. In order to investigate the difference between the planned dose and the 5D dose with consideration of both breathing motion and anatomical change, we re-calculated new dose distributions at every phase of the breathing cycle for all available weekly 4DCT data sets which resulted 50 to 80 individual dose calculations for each of the 7 patients presented in this thesis. The newly calculated dose distributions were then deformed and accumulated to T50 of the planning 4DCT for comparison with the planned dose distribution. Results: At the end of the treatment, both IMRT and PSPT groups showed mean tumor volume reductions of 23.6% ( 19.2%) and 20.9% ( 17.0 %) respectively. Moreover, the mean difference in tumor shrinkage between two groups is 3% along with the corresponding 95% confidence interval, [-8%, 14%]. The rate of tumor shrinkage was highly correlated with the initial tumor volume size. For the planning dose and 5D dose comparison study, all 7 patients showed a mean difference of 1 % in terms of target coverage for both IMRT and PSPT treatment plans. Conclusions: The results of the tumor shrinkage investigation showed no statistically significant difference in tumor shrinkage between the IMRT and PSPT patients, and the tumor shrinkage between the two modalities is similar based on the 95% confidence interval. From the pilot study of comparing the planned dose with the 5D dose, we found the difference to be only 1%. Overall impression of the two modalities in terms of treatment response as measured by the tumor shrinkage and 5D dose under the influence of anatomical change that were designed under the same protocol (i.e. randomized trial) showed similar result.