24 resultados para DNA Sequence, Hidden Markov Model, Bayesian Model, Sensitive Analysis, Markov Chain Monte Carlo
Resumo:
The discrete-time Markov chain is commonly used in describing changes of health states for chronic diseases in a longitudinal study. Statistical inferences on comparing treatment effects or on finding determinants of disease progression usually require estimation of transition probabilities. In many situations when the outcome data have some missing observations or the variable of interest (called a latent variable) can not be measured directly, the estimation of transition probabilities becomes more complicated. In the latter case, a surrogate variable that is easier to access and can gauge the characteristics of the latent one is usually used for data analysis. ^ This dissertation research proposes methods to analyze longitudinal data (1) that have categorical outcome with missing observations or (2) that use complete or incomplete surrogate observations to analyze the categorical latent outcome. For (1), different missing mechanisms were considered for empirical studies using methods that include EM algorithm, Monte Carlo EM and a procedure that is not a data augmentation method. For (2), the hidden Markov model with the forward-backward procedure was applied for parameter estimation. This method was also extended to cover the computation of standard errors. The proposed methods were demonstrated by the Schizophrenia example. The relevance of public health, the strength and limitations, and possible future research were also discussed. ^
Resumo:
As the requirements for health care hospitalization have become more demanding, so has the discharge planning process become a more important part of the health services system. A thorough understanding of hospital discharge planning can, then, contribute to our understanding of the health services system. This study involved the development of a process model of discharge planning from hospitals. Model building involved the identification of factors used by discharge planners to develop aftercare plans, and the specification of the roles of these factors in the development of the discharge plan. The factors in the model were concatenated in 16 discrete decision sequences, each of which produced an aftercare plan.^ The sample for this study comprised 407 inpatients admitted to the M. D. Anderson Hospital and Tumor Institution at Houston, Texas, who were discharged to any site within Texas during a 15 day period. Allogeneic bone marrow donors were excluded from the sample. The factors considered in the development of discharge plans were recorded by discharge planners and were used to develop the model. Data analysis consisted of sorting the discharge plans using the plan development factors until for some combination and sequence of factors all patients were discharged to a single site. The arrangement of factors that led to that aftercare plan became a decision sequence in the model.^ The model constructs the same discharge plans as those developed by hospital staff for every patient in the study. Tests of the validity of the model should be extended to other patients at the MDAH, to other cancer hospitals, and to other inpatient services. Revisions of the model based on these tests should be of value in the management of discharge planning services and in the design and development of comprehensive community health services.^
Resumo:
In regression analysis, covariate measurement error occurs in many applications. The error-prone covariates are often referred to as latent variables. In this proposed study, we extended the study of Chan et al. (2008) on recovering latent slope in a simple regression model to that in a multiple regression model. We presented an approach that applied the Monte Carlo method in the Bayesian framework to the parametric regression model with the measurement error in an explanatory variable. The proposed estimator applied the conditional expectation of latent slope given the observed outcome and surrogate variables in the multiple regression models. A simulation study was presented showing that the method produces estimator that is efficient in the multiple regression model, especially when the measurement error variance of surrogate variable is large.^
Resumo:
A three-dimensional model has been proposed that uses Monte Carlo and fast Fourier transform convolution techniques to calculate the dose distribution from a fast neutron beam. This method transports scattered neutrons and photons in the forward, lateral, and backward directions and protons, electrons, and positrons in the forward and lateral directions by convolving energy spread kernels with initial interaction available energy distributions. The primary neutron and photon spectrums have been derived from narrow beam attenuation measurements. The positions and strengths of the effective primary neutron, scattered neutron, and photon sources have been derived from dual ion chamber measurements. The size of the effective primary neutron source has been measured using a copper activation technique. Heterogeneous tissue calculations require a weighted sum of two convolutions for each component since the kernels must be invariant for FFT convolution. Comparisons between calculations and measurements were performed for several water and heterogeneous phantom geometries. ^
Resumo:
The myogenin gene encodes an evolutionarily conserved basic helix-loop-helix transcription factor that regulates the expression of skeletal muscle-specific genes and its homozygous deletion results in mice who die of respiratory failure at birth. The histology of skeletal muscle in the myogenin null mice is reminiscent of that found in some severe congenital myopathy patients, many of whom also die of respiratory complications and provides the rationale that an aberrant human myogenin (myf4) coding region could be associated with some congenital myopathy conditions.^ With PCR, we found similarly sized amplimers for the three exons of the myogenin gene in 37 patient and 40 control samples. In contrast to the GeneBank sequence for human myogenin, we report several differences in flanking and coding regions plus an additional 659 and 498 bps in the first and second introns, respectively, in all patients and controls. We also find a novel (CA)-dinucleotide repeat in the second intron. No causative mutations were detected in the myogenin coding regions of genomic DNA from patients with severe congenital myopathy.^ Severe congenital myopathies in humans are often associated with respiratory complications and pulmonary hypoplasia. We have employed the myogenin null mouse, which lacks normal development of skeletal muscle fibers as a genetically defined severe congenital myopathy mouse model to evaluate the effect of absent fetal breathing movement on pulmonary development.^ Significant differences are observed at embryonic days E14, E17 and E20 of lung:body weight, total DNA and histologically, suggesting that the myogenin null lungs are hypoplastic. RT-PCR, in-situ immunofluorescence and EM reveal pneumocyte type II differentiation in both null and wild lungs as early as E14. However, at E14, myogenin null lungs have decreased BrdU incorporation while E17 through term, augmented cell death is detected in the myogenin null lungs, not seen in wild littermates. Absent mechanical forces appear to impair normal growth, but not maturation, of the developing lungs in myogenin null mouse.^ These investigations provide the basis for delineating the DNA sequence of the myogenin gene and and highlight the importance of skeletal muscle development in utero for normal lung organogenesis. My observation of no mutations within the coding regions of the human myogenin gene in DNA from patients with severe congenital myopathy do not support any association with this condition. ^
Resumo:
This paper reports a comparison of three modeling strategies for the analysis of hospital mortality in a sample of general medicine inpatients in a Department of Veterans Affairs medical center. Logistic regression, a Markov chain model, and longitudinal logistic regression were evaluated on predictive performance as measured by the c-index and on accuracy of expected numbers of deaths compared to observed. The logistic regression used patient information collected at admission; the Markov model was comprised of two absorbing states for discharge and death and three transient states reflecting increasing severity of illness as measured by laboratory data collected during the hospital stay; longitudinal regression employed Generalized Estimating Equations (GEE) to model covariance structure for the repeated binary outcome. Results showed that the logistic regression predicted hospital mortality as well as the alternative methods but was limited in scope of application. The Markov chain provides insights into how day to day changes of illness severity lead to discharge or death. The longitudinal logistic regression showed that increasing illness trajectory is associated with hospital mortality. The conclusion is reached that for standard applications in modeling hospital mortality, logistic regression is adequate, but for new challenges facing health services research today, alternative methods are equally predictive, practical, and can provide new insights. ^
VERIFICATION OF DNA PREDICTED PROTEIN SEQUENCES BY ENZYME HYDROLYSIS AND MASS SPECTROMETRIC ANALYSIS
Resumo:
The focus of this thesis lies in the development of a sensitive method for the analysis of protein primary structure which can be easily used to confirm the DNA sequence of a protein's gene and determine the modifications which are made after translation. This technique involves the use of dipeptidyl aminopeptidase (DAP) and dipeptidyl carboxypeptidase (DCP) to hydrolyze the protein and the mass spectrometric analysis of the dipeptide products.^ Dipeptidyl carboxypeptidase was purified from human lung tissue and characterized with respect to its proteolytic activity. The results showed that the enzyme has a relatively unrestricted specificity, making it useful for the analysis of the C-terminal of proteins. Most of the dipeptide products were identified using gas chromatography/mass spectrometry (GC/MS). In order to analyze the peptides not hydrolyzed by DCP and DAP, as well as the dipeptides not identified by GC/MS, a FAB ion source was installed on a quadrupole mass spectrometer and its performance evaluated with a variety of compounds.^ Using these techniques, the sequences of the N-terminal and C-terminal regions and seven fragments of bacteriophage P22 tail protein have been verified. All of the dipeptides identified in these analysis were in the same DNA reading frame, thus ruling out the possibility of a single base being inserted or deleted from the DNA sequence. The verification of small sequences throughout the protein sequence also indicates that no large portions of the protein have been removed after translation. ^
Resumo:
The human cytochrome P450 3A (CYP3A) subfamily is responsible for most of the metabolism of therapeutic drugs; however, an adequate in vivo model has yet to be discovered. This study begins with an investigation of a controversial topic surrounding the human CYP3As--estrogen regulation. A novel approach to this topic was used by defining expression in the estrogen-responsive endometrium. This study shows that estrogen down-regulates CYP3A4 expression in the endometrium. On the other hand, analogous studies showed an increase in CYP3A expression as age increases in liver tissue. Following the discussion of estrogen regulation, is an investigation of the cross-species relationships among all of the CYP3As was completed. The study compares isoforms from piscines, avians, rodents, canines, ovines, bovines, and primates. Using the traditional phylogenetic analyses and employing a novel approach using exon and intron lengths, the results show that only another primate could be the best animal model for analysis of the regulation of the expression of the human CYP3As. This analysis also demonstrated that the chimpanzee seems to be the best available human model. Moreover, the study showed the presence and similarities of one additional isoform in the chimpanzee genome that is absent in humans. Based on these results, initial characterization of the chimpanzee CYP3A subfamily was begun. While the human genome contains four isoforms--CYP3A4, CYP3A5, CYP3A7, and CYP3A43--the chimpanzee genome has five, the four previously mentioned and CYP3A67. Both species express CYP3A4, CYP3A5, and CYP3A43, but humans express CYP3A7 while chimpanzees express CYP3A67. In humans, CYP3A4 is expressed at higher levels than the other isoforms, but some chimpanzee individuals express CYP3A67 at higher levels than CYP3A4. Such a difference is expected to alter significantly the total CYP3A metabolism. On the other hand, any study considering individual isoforms would still constitute a valid method of study for the human CYP3A4, CYP3A5, and CYP3A43 isoforms. ^
Resumo:
With continuous new improvements in brachytherapy source designs and techniques, method of 3D dosimetry for treatment dose verifications would better ensure accurate patient radiotherapy treatment. This study was aimed to first evaluate the 3D dose distributions of the low-dose rate (LDR) Amersham 6711 OncoseedTM using PRESAGE® dosimeters to establish PRESAGE® as a suitable brachytherapy dosimeter. The new AgX100 125I seed model (Theragenics Corporation) was then characterized using PRESAGE® following the TG-43 protocol. PRESAGE® dosimeters are solid, polyurethane-based, 3D dosimeters doped with radiochromic leuco dyes that produce a linear optical density response to radiation dose. For this project, the radiochromic response in PRESAGE® was captured using optical-CT scanning (632 nm) and the final 3D dose matrix was reconstructed using the MATLAB software. An Amersham 6711 seed with an air-kerma strength of approximately 9 U was used to irradiate two dosimeters to 2 Gy and 11 Gy at 1 cm to evaluate dose rates in the r=1 cm to r=5 cm region. The dosimetry parameters were compared to the values published in the updated AAPM Report No. 51 (TG-43U1). An AgX100 seed with an air-kerma strength of about 6 U was used to irradiate two dosimeters to 3.6 Gy and 12.5 Gy at 1 cm. The dosimetry parameters for the AgX100 were compared to the values measured from previous Monte-Carlo and experimental studies. In general, the measured dose rate constant, anisotropy function, and radial dose function for the Amersham 6711 showed agreements better than 5% compared to consensus values in the r=1 to r=3 cm region. The dose rates and radial dose functions measured for the AgX100 agreed with the MCNPX and TLD-measured values within 3% in the r=1 to r=3 cm region. The measured anisotropy function in PRESAGE® showed relative differences of up to 9% with the MCNPX calculated values. It was determined that post-irradiation optical density change over several days was non-linear in different dose regions, and therefore the dose values in the r=4 to r=5 cm regions had higher uncertainty due to this effect. This study demonstrated that within the radial distance of 3 cm, brachytherapy dosimetry in PRESAGE® can be accurate within 5% as long as irradiation times are within 48 hours.