2 resultados para Linear mixed models
em DRUM (Digital Repository at the University of Maryland)
Resumo:
This dissertation proposes statistical methods to formulate, estimate and apply complex transportation models. Two main problems are part of the analyses conducted and presented in this dissertation. The first method solves an econometric problem and is concerned with the joint estimation of models that contain both discrete and continuous decision variables. The use of ordered models along with a regression is proposed and their effectiveness is evaluated with respect to unordered models. Procedure to calculate and optimize the log-likelihood functions of both discrete-continuous approaches are derived, and difficulties associated with the estimation of unordered models explained. Numerical approximation methods based on the Genz algortithm are implemented in order to solve the multidimensional integral associated with the unordered modeling structure. The problems deriving from the lack of smoothness of the probit model around the maximum of the log-likelihood function, which makes the optimization and the calculation of standard deviations very difficult, are carefully analyzed. A methodology to perform out-of-sample validation in the context of a joint model is proposed. Comprehensive numerical experiments have been conducted on both simulated and real data. In particular, the discrete-continuous models are estimated and applied to vehicle ownership and use models on data extracted from the 2009 National Household Travel Survey. The second part of this work offers a comprehensive statistical analysis of free-flow speed distribution; the method is applied to data collected on a sample of roads in Italy. A linear mixed model that includes speed quantiles in its predictors is estimated. Results show that there is no road effect in the analysis of free-flow speeds, which is particularly important for model transferability. A very general framework to predict random effects with few observations and incomplete access to model covariates is formulated and applied to predict the distribution of free-flow speed quantiles. The speed distribution of most road sections is successfully predicted; jack-knife estimates are calculated and used to explain why some sections are poorly predicted. Eventually, this work contributes to the literature in transportation modeling by proposing econometric model formulations for discrete-continuous variables, more efficient methods for the calculation of multivariate normal probabilities, and random effects models for free-flow speed estimation that takes into account the survey design. All methods are rigorously validated on both real and simulated data.
Resumo:
The burden of chronic diseases such as cancer is increasing in low and middle income countries around the globe. Nepal, one of the world’s poorest countries, is no exception to this trend, with lung cancer as the leading causes of cancer deaths. Despite this, limited data is available on the environmental and behavioral risk factors that contribute to the lung cancer etiology in Nepal. The objectives of this dissertation are to: 1) investigate the ethnic differences in consumption of local tobacco products and their role in lung cancer risk in Nepal; 2) evaluate urinary metabolite of 1,3-butadiene as a biomarker of exposure to combustion related household air pollution (CRHAP); 3) investigate the association between CRHAP exposure and lung cancer risk using urinary metabolite of 1,3-butadiene as a biomarker of exposure; 4) investigate the association between CRHAP exposure and lung cancer risk using questionnaire based measure of exposure. Lung cancer cases (n=606) and frequency matched controls (N=606) were recruited from B.P. Koirala Memorial Cancer Hospital. We obtained biological samples and information on lifestyles including cooking habits and type of fuels used. We used liquid chromatograph tandem mass spectrometer (LC-MS/MS) to quantify urinary metabolites of 1,3-butadiene in urine samples. We employed a combination of logistic and linear regression models to detect any exposure-disease associations while controlling for known confounding variables. Overall, we found that ethnic groups in Nepal use different tobacco products that have different differing cancer potency -we observed the highest odds ratios for the traditional tobacco products. The biomarker analysis showed strong evidence that monohydroxybutyl mercapturic acid is associated with biomass fuel use among participants. However, we did not find significant association between urinary MHMBA and lung cancer risk. When we used questionnaire based measure of exposure to household air pollution, we observed significant, dose-response associations between CRHAP exposure and lung cancer risk, particularly among never-smokers. Our results show that important role of local tobacco products in lung cancer risk in Nepal. Furthermore, we demonstrate that CRHAP exposure is a risk factor for lung cancer risk, independent of tobacco smoking.