13 resultados para Analisi statistica, Multiple Testing Correction, sviluppo Libreria
em DigitalCommons@The Texas Medical Center
Resumo:
Hypertension (HT) is mediated by the interaction of many genetic and environmental factors. Previous genome-wide linkage analysis studies have found many loci that show linkage to HT or blood pressure (BP) regulation, but the results were generally inconsistent. Gene by environment interaction is among the reasons that potentially explain these inconsistencies between studies. Here we investigate influences of gene by smoking (GxS) interaction on HT and BP in European American (EA), African American (AA) and Mexican American (MA) families from the GENOA study. A variance component-based method was utilized to perform genome-wide linkage analysis of systolic blood pressure (SBP), diastolic blood pressure (DBP), and HT status, as well as bivariate analysis for SBP and DBP for smokers, non-smokers, and combined groups. The most significant results were found for SBP in MA. The strongest signal was for chromosome 17q24 (LOD = 4.2), increased to (LOD = 4.7) in bivariate analysis but there was no evidence of GxS interaction at this locus (p = 0.48). Two signals were identified only in one group: on chromosome 15q26.2 (LOD = 3.37) in non-smokers and chromosome 7q21.11 (LOD = 1.4) in smokers, both of which had strong evidence for GxS interaction (p = 0.00039 and 0.009 respectively). There were also two other signals, one on chromosome 20q12 (LOD = 2.45) in smokers, which became much higher in the combined sample (LOD = 3.53), and one on chromosome 6p22.2 (LOD = 2.06) in non-smokers. Neither peak had very strong evidence for GxS interaction (p = 0.08 and 0.06 respectively). A fine mapping association study was performed using 200 SNPs in 30 genes located under the linkage signals on chromosomes 15 and 17. Under the chromosome 15 peak, the association analysis identified 6 SNPs accounting for a 7 mmHg increase in SBP in MA non-smokers. For the chromosome 17 linkage peak, the association analysis identified 3 SNPs accounting for a 6 mmHg increase in SBP in MA. However, none of these SNPs was significant after correcting for multiple testing, and accounting for them in the linkage analysis produced very small reductions in the linkage signal. ^ The linkage analysis of BP traits considering the smoking status produced very interesting signals for SBP in the MA population. The fine mapping association analysis gave some insight into the contribution of some SNPs to two of the identified signals, but since these SNPs did not remain significant after multiple testing correction and did not explain the linkage peaks, more work is needed to confirm these exploratory results and identify the culprit variations under these linkage peaks. ^
Resumo:
In population studies, most current methods focus on identifying one outcome-related SNP at a time by testing for differences of genotype frequencies between disease and healthy groups or among different population groups. However, testing a great number of SNPs simultaneously has a problem of multiple testing and will give false-positive results. Although, this problem can be effectively dealt with through several approaches such as Bonferroni correction, permutation testing and false discovery rates, patterns of the joint effects by several genes, each with weak effect, might not be able to be determined. With the availability of high-throughput genotyping technology, searching for multiple scattered SNPs over the whole genome and modeling their joint effect on the target variable has become possible. Exhaustive search of all SNP subsets is computationally infeasible for millions of SNPs in a genome-wide study. Several effective feature selection methods combined with classification functions have been proposed to search for an optimal SNP subset among big data sets where the number of feature SNPs far exceeds the number of observations. ^ In this study, we take two steps to achieve the goal. First we selected 1000 SNPs through an effective filter method and then we performed a feature selection wrapped around a classifier to identify an optimal SNP subset for predicting disease. And also we developed a novel classification method-sequential information bottleneck method wrapped inside different search algorithms to identify an optimal subset of SNPs for classifying the outcome variable. This new method was compared with the classical linear discriminant analysis in terms of classification performance. Finally, we performed chi-square test to look at the relationship between each SNP and disease from another point of view. ^ In general, our results show that filtering features using harmononic mean of sensitivity and specificity(HMSS) through linear discriminant analysis (LDA) is better than using LDA training accuracy or mutual information in our study. Our results also demonstrate that exhaustive search of a small subset with one SNP, two SNPs or 3 SNP subset based on best 100 composite 2-SNPs can find an optimal subset and further inclusion of more SNPs through heuristic algorithm doesn't always increase the performance of SNP subsets. Although sequential forward floating selection can be applied to prevent from the nesting effect of forward selection, it does not always out-perform the latter due to overfitting from observing more complex subset states. ^ Our results also indicate that HMSS as a criterion to evaluate the classification ability of a function can be used in imbalanced data without modifying the original dataset as against classification accuracy. Our four studies suggest that Sequential Information Bottleneck(sIB), a new unsupervised technique, can be adopted to predict the outcome and its ability to detect the target status is superior to the traditional LDA in the study. ^ From our results we can see that the best test probability-HMSS for predicting CVD, stroke,CAD and psoriasis through sIB is 0.59406, 0.641815, 0.645315 and 0.678658, respectively. In terms of group prediction accuracy, the highest test accuracy of sIB for diagnosing a normal status among controls can reach 0.708999, 0.863216, 0.639918 and 0.850275 respectively in the four studies if the test accuracy among cases is required to be not less than 0.4. On the other hand, the highest test accuracy of sIB for diagnosing a disease among cases can reach 0.748644, 0.789916, 0.705701 and 0.749436 respectively in the four studies if the test accuracy among controls is required to be at least 0.4. ^ A further genome-wide association study through Chi square test shows that there are no significant SNPs detected at the cut-off level 9.09451E-08 in the Framingham heart study of CVD. Study results in WTCCC can only detect two significant SNPs that are associated with CAD. In the genome-wide study of psoriasis most of top 20 SNP markers with impressive classification accuracy are also significantly associated with the disease through chi-square test at the cut-off value 1.11E-07. ^ Although our classification methods can achieve high accuracy in the study, complete descriptions of those classification results(95% confidence interval or statistical test of differences) require more cost-effective methods or efficient computing system, both of which can't be accomplished currently in our genome-wide study. We should also note that the purpose of this study is to identify subsets of SNPs with high prediction ability and those SNPs with good discriminant power are not necessary to be causal markers for the disease.^
Resumo:
Most studies of differential gene-expressions have been conducted between two given conditions. The two-condition experimental (TCE) approach is simple in that all genes detected display a common differential expression pattern responsive to a common two-condition difference. Therefore, the genes that are differentially expressed under the other conditions other than the given two conditions are undetectable with the TCE approach. In order to address the problem, we propose a new approach called multiple-condition experiment (MCE) without replication and develop corresponding statistical methods including inference of pairs of conditions for genes, new t-statistics, and a generalized multiple-testing method for any multiple-testing procedure via a control parameter C. We applied these statistical methods to analyze our real MCE data from breast cancer cell lines and found that 85 percent of gene-expression variations were caused by genotypic effects and genotype-ANAX1 overexpression interactions, which agrees well with our expected results. We also applied our methods to the adenoma dataset of Notterman et al. and identified 93 differentially expressed genes that could not be found in TCE. The MCE approach is a conceptual breakthrough in many aspects: (a) many conditions of interests can be conducted simultaneously; (b) study of association between differential expressions of genes and conditions becomes easy; (c) it can provide more precise information for molecular classification and diagnosis of tumors; (d) it can save lot of experimental resources and time for investigators.^
Resumo:
This study investigated the characteristics of a clinic that affect how satisfied survivors of childhood cancer are with their medical care. Questionnaire and interview data from the Passport for Care: Texas Implementation project collected between January 2011 to April 2012 were analyzed. Eleven clinics in Texas participated. Questionnaire respondents were childhood cancer survivor patients who had been off therapy for at least 2 years, or their parents. Interview respondents were clinical providers or research staff at the participating clinics. The outcomes evaluated were answers to a single question on satisfaction with care and a composite Percent Satisfaction Score created from seven other questionnaire items that were correlated (Spearman Rho >0.3) with the question on satisfaction. The following characteristics were also evaluated: sex, age, race, education, and type of cancer. The following clinic indicators were evaluated: type of clinic (general vs. dedicated cancer survivor clinics), number of providers, number of survivors, ratio of survivors/providers, distribution of handouts, distribution of treatment summaries, and use of Children's Oncology Group (COG) guidelines. ^ The only demographic characteristic that affected satisfaction was race. A Kruskal-Wallis test showed a statistically significant difference (Chi-square 6.129, 2 d.f., p = 0.0467). To analyze this further, Wilcoxon Rank Sum test of pairings of the three groups were performed. A Bonferroni correction for multiple testing was applied, with p = 0.017 indicating significance at alpha = 0.05. There was no significant difference between the White and Hispanic groups or between the Hispanic and "Other" groups. For the White and "Other" groups there was a significant difference for the satisfaction item (p = 0.0123) but not for the Percent Satisfaction Score (p = 0.0289). These results suggest that race may influence satisfaction and should be evaluated further in future studies. ^ None of the clinic indicators affected the Percent Satisfaction Score. Going to a clinic that distributed patient information handouts (Wilcoxon Rank Sum p = 0.048) and going to a clinic with >=100 survivors (Wilcoxon Rank Sum p = 0.021) were associated with increased satisfaction. The population of childhood cancer survivors is a growing group of individuals with special health needs. In the future survivors will likely seek medical care in a variety of clinical settings, so it is important to investigate features to improve patient satisfaction with clinical care.^
Resumo:
Children who experience early pubertal development have an increased risk of developing cancer (breast, ovarian, and testicular), osteoporosis, insulin resistance, and obesity as adults. Early pubertal development has been associated with depression, aggressiveness, and increased sexual prowess. Possible explanations for the decline in age of pubertal onset include genetics, exposure to environmental toxins, better nutrition, and a reduction in childhood infections. In this study we (1) evaluated the association between 415 single nucleotide polymorphisms (SNPs) from hormonal pathways and early puberty, defined as menarche prior to age 12 in females and Tanner Stage 2 development prior to age 11 in males, and (2) measured endocrine hormone trajectories (estradiol, testosterone, and DHEAS) in relation to age, race, and Tanner Stage in a cohort of children from Project HeartBeat! At the end of the 4-year study, 193 females had onset of menarche and 121 males had pubertal staging at age 11. African American females had a younger mean age at menarche than Non-Hispanic White females. African American females and males had a lower mean age at each pubertal stage (1-5) than Non-Hispanic White females and males. African American females had higher mean BMI measures at each pubertal stage than Non-Hispanic White females. Of the 415 SNPs evaluated in females, 22 SNPs were associated with early menarche, when adjusted for race ( p<0.05), but none remained significant after adjusting for multiple testing by False Discovery Rate (p<0.00017). In males, 17 SNPs were associated with early pubertal development when adjusted for race (p<0.05), but none remained significant when adjusted for multiple testing (p<0.00017). ^ There were 4955 hormone measurements taken during the 4-year study period from 632 African American and Non-Hispanic White males and females. On average, African American females started and ended the pubertal process at a younger age than Non-Hispanic White females. The mean age of Tanner Stage 2 breast development in African American and Non-Hispanic White females was 9.7 (S.D.=0.8) and 10.2 (S.D.=1.1) years, respectively. There was a significant difference by race in mean age for each pubertal stage, except Tanner Stage 1 for pubic hair development. Both Estradiol and DHEAS levels in females varied significantly with age, but not by race. Estradiol and DHEAS levels increased from Tanner Stage 1 to Tanner Stage 5.^ African American males had a lower mean age at each Tanner Stage of development than Non-Hispanic White males. The mean age of Tanner Stage 2 genital development in African American and Non-Hispanic White males was 10.5 (S.D.=1.1) and 10.8 (S.D.=1.1) years, respectively, but this difference was not significant (p=0.11). Testosterone levels varied significantly with age and race. Non-Hispanic White males had higher levels of testosterone than African American males from Tanner Stage 1-4. Testosterone levels increased for both races from Tanner Stage 1 to Tanner Stage 5. Testosterone levels had the steepest increase from ages 11-15 for both races. DHEAS levels in males varied significantly with age, but not by race. DHEAS levels had the steepest increase from ages 14-17. ^ In conclusion, African American males and females experience pubertal onset at a younger age than Non-Hispanic White males and females, but in this study, we could not find a specific gene that explained the observed variation in age of pubertal onset. Future studies with larger study populations may provide a better understanding of the contribution of genes in early pubertal onset.^
Resumo:
PURPOSE: To review our clinical experience and determine if there are appropriate signs and symptoms to consider POLG sequencing prior to valproic acid (VPA) dosing in patients with seizures. METHODS: Four patients who developed VPA-induced hepatotoxicity were examined for POLG sequence variations. A subsequent chart review was used to describe clinical course prior to and after VPA dosing. RESULTS: Four patients of multiple different ethnicities, age 3-18 years, developed VPA-induced hepatotoxicity. All were given VPA due to intractable partial seizures. Three of the patients had developed epilepsia partialis continua. The time from VPA exposure to liver failure was between 2 and 3 months. Liver failure was reversible in one patient. Molecular studies revealed homozygous p.R597W or p.A467T mutations in two patients. The other two patients showed compound heterozygous mutations, p.A467T/p.Q68X and p.L83P/p.G888S. Clinical findings and POLG mutations were diagnostic of Alpers-Huttenlocher syndrome. CONCLUSION: Our cases underscore several important findings: POLG mutations have been observed in every ethnic group studied to date; early predominance of epileptiform discharges over the occipital region is common in POLG-induced epilepsy; the EEG and MRI findings varying between patients and stages of the disease; and VPA dosing at any stage of Alpers-Huttenlocher syndrome can precipitate liver failure. Our data support an emerging proposal that POLG gene testing should be considered in any child or adolescent who presents or develops intractable seizures with or without status epilepticus or epilepsia partialis continua, particularly when there is a history of psychomotor regression.
Resumo:
A graphing method was developed and tested to estimate gestational ages pre-and postnatally in a consistent manner for epidemiological research and clinical purposes on feti/infants of women with few consistent prenatal estimators of gestational age. Each patient's available data was plotted on a single page graph to give a comprehensive overview of that patient. A hierarchical classification of gestational age determination was then applied in a systematic manner, and reasonable gestational age estimates were produced. The method was tested for validity and reliability on 50 women who had known dates for their last menstrual period or dates of conception, and multiple ultrasound examinations and other gestational age estimating measures. The feasibility of the procedure was then tested on 1223 low income women with few gestational age estimators. The graphing method proved to have high inter- and intrarater reliability. It was quick, easy to use, inexpensive, and did not require special equipment. The graphing method estimate of gestational age for each infant was tested against the last menstrual period gestational age estimate using paired t-Tests, F tests and the Kolmogorov-Smirnov test of similar populations, producing a 98 percent probability or better that the means and data populations were the same. Less than 5 percent of the infants' gestational ages were misclassified using the graphing method, much lower than the amount of misclassification produced by ultrasound or neonatal examination estimates. ^
Resumo:
Retinal detachment is a common ophthalmologic procedure, and outcome is typically measured by a single factor-improvement in visual acuity. Health related functional outcome testing, which quantifies patient's self-reported perception of impairment, can be integrated with objective clinical findings. Based on the patient's self-assessed lifestyle impairment, the physician and patient together can make an informed decision on the treatment that is most likely to benefit the patient. ^ A functional outcome test (the Houston Vision Assessment Test-Retina; HVAT-Retina) was developed and validated in patients with multiple retinal detachments in the same eye. The HVAT-Retina divides an estimated total impairment into subcomponents: contribution of visual disability (potentially correctable by retinal detachment surgery) and nonvisual physical disabilities (co-morbidities not affected by retinal detachment surgery. ^ Seventy-six patients participated in this prospective multicenter study. Seven patients were excluded from the analysis because they were not certain of their answers. Cronbach's alpha coefficient was 0.91 for presurgery HVAT-Retina and 0.94 post-surgery. The item-to-total correlation ranged from 0.50 to 0.88. Visual impairment score improved by 9 points from pre-surgery (p = 0.0003). Physical impairment score also improved from pre-surgery (p = 0.0002). ^ In conclusion, the results of this study demonstrate that the instrument is reliable and valid in patients presenting with recurrent retinal detachments. The HVAT-Retina is a simple instrument and does not burden the patient or the health professional in terms of time or cost. It may be self-administrated, not requiring an interviewer. Because the HVAT-Retina was designed to demonstrate outcomes perceivable by the patient, it has the potential to guide the decision making process between patient and physician. ^
Resumo:
The PROPELLER (Periodically Rotated Overlapping Parallel Lines with Enhanced Reconstruction) magnetic resonance imaging (MRI) technique has inherent advantages over other fast imaging methods, including robust motion correction, reduced image distortion, and resistance to off-resonance effects. These features make PROPELLER highly desirable for T2*-sensitive imaging, high-resolution diffusion imaging, and many other applications. However, PROPELLER has been predominantly implemented as a fast spin-echo (FSE) technique, which is insensitive to T2* contrast, and requires time-inefficient signal averaging to achieve adequate signal-to-noise ratio (SNR) for many applications. These issues presently constrain the potential clinical utility of FSE-based PROPELLER. ^ In this research, our aim was to extend and enhance the potential applications of PROPELLER MRI by developing a novel multiple gradient echo PROPELLER (MGREP) technique that can overcome the aforementioned limitations. The MGREP pulse sequence was designed to acquire multiple gradient-echo images simultaneously, without any increase in total scan time or RF energy deposition relative to FSE-based PROPELLER. A new parameter was also introduced for direct user-control over gradient echo spacing, to allow variable sensitivity to T2* contrast. In parallel to pulse sequence development, an improved algorithm for motion correction was also developed and evaluated against the established method through extensive simulations. The potential advantages of MGREP over FSE-based PROPELLER were illustrated via three specific applications: (1) quantitative T2* measurement, (2) time-efficient signal averaging, and (3) high-resolution diffusion imaging. Relative to the FSE-PROPELLER method, the MGREP sequence was found to yield quantitative T2* values, increase SNR by ∼40% without any increase in acquisition time or RF energy deposition, and noticeably improve image quality in high-resolution diffusion maps. In addition, the new motion algorithm was found to improve the performance considerably in motion-artifact reduction. ^ Overall, this work demonstrated a number of enhancements and extensions to existing PROPELLER techniques. The new technical capabilities of PROPELLER imaging, developed in this thesis research, are expected to serve as the foundation for further expanding the scope of PROPELLER applications. ^
Resumo:
Objective. To explore (1) the association between "club drug" use and unprotected anal intercourse (UAI) and (2) the association between binge drug use and UAI among HIV seronegative men who have sex with men (MSM) seeking HIV/STD testing at a local clinic in Houston. ^ Study design. A sub-sample of 297 HIV seronegative MSM from a cross-sectional study of drug and sexual behavior in Houston was conducted in 2006. Patients who were seeking HIV/STD testing at a local MSM-identified STD clinic were recruited for an anonymous computer-assisted interview. Analysis of identified secondary data consisted of self-reported information about demographic characteristics, use of drugs, and sexual behaviors. ^ Results. With new and casual sex partners, there was a strong and statistically significant association between use of "club drugs" and UAI. No association between binge drug use and UAI was evident. Men aware of HIV seropositivity or unaware of the HIV serostatus of their primary partner were less likely to report UAI. ^ Conclusion. These data suggest that in the Houston area, HIV-negative MSM club drug users, particularly multiple drug users, are at higher risk of UAI than comparable MSMs who do not use club drugs. Episode-level data regarding binge use of these and other drugs, and UAI should be collected in future studies to explore their relationship. The 'new partner' category should be added to sex partner types to measure sex and drug use behaviors in future studies.^ Keywords. HIV-negative MSM; club drugs; unprotected anal intercourse; binge drug use. ^
Resumo:
Herbicides are used to control the growth of weeds along highways, power lines, and many other urban locations. Exposure to herbicides has been linked to adverse health outcomes. This study was initiated to pretest for the presence of herbicides in multiple water sources near intersections in a corridor in the Northwest Harris County (specifically in the Highway 6/FM 1960, North Freeway 45, US 290 and S 99 corridor). Roadside water and tap water samples were collected and analyzed for herbicides using the established Environmental Protection Agency (EPA) Method 515.4: "Determination of Chlorinated Acids in Drinking Water by Liquid-Liquid Micro-extraction, Derivatization, and Fast Gas Chromatography with Electron Capture Detection." A standard operating procedure (adapted from the US EPA Method 515.4) was developed for subsequent, larger studies of environmental fate of herbicides and non-occupational exposure risks. Preliminary testing of 16 water samples was performed to pretest the existence of trace herbicides; all concentrations that were greater than the minimum reporting limits of each analyte are reported with a 99 percent confidence. This study failed to find concentrations above the limits of detection of the method in any of the samples collected on June 15, 2008. However, this does not indicate that the waters around the NW Harris County are free of herbicides and metabolites. A larger and repeated sampling in the region would be necessary to make that claim. ^
Resumo:
The difficulty of detecting differential gene expression in microarray data has existed for many years. Several correction procedures try to avoid the family-wise error rate in multiple comparison process, including the Bonferroni and Sidak single-step p-value adjustments, Holm's step-down correction method, and Benjamini and Hochberg's false discovery rate (FDR) correction procedure. Each multiple comparison technique has its advantages and weaknesses. We studied each multiple comparison method through numerical studies (simulations) and applied the methods to the real exploratory DNA microarray data, which detect of molecular signatures in papillary thyroid cancer (PTC) patients. According to our results of simulation studies, Benjamini and Hochberg step-up FDR controlling procedure is the best process among these multiple comparison methods and we discovered 1277 potential biomarkers among 54675 probe sets after applying the Benjamini and Hochberg's method to PTC microarray data.^
Resumo:
Multiple Endocrine Neoplasia type 1 (MEN1) is a hereditary cancer syndrome characterized by tumors of the endocrine system. Tumors most commonly develop in the parathyroid glands, pituitary gland, and the gastro-entero pancreatic tract. MEN1 is a highly penetrant condition and age of onset is variable. Most patients are diagnosed in early adulthood; however, rare cases of MEN1 present in early childhood. Expert consensus opinion is that predictive genetic testing should be offered at age 5 years, however there are no evidence-based studies that clearly establish that predictive genetic testing at this age would be beneficial since most symptoms do not present until later in life. This study was designed to explore attitudes about the most appropriate age for predictive genetic testing from individuals at risk of having a child with MEN1. Participants who had an MEN1 mutation were invited to complete a survey and were asked to invite their spouses to participate as well. The survey included several validated measures designed to assess participants’ attitudes about predictive testing in minors. Fifty-eight affected participants and twenty-two spouses/partners completed the survey. Most participants felt that MEN1 genetic testing was appropriate in healthy minors. Younger age and increased knowledge of MEN1 genetics and inheritance predicted genetic testing at a younger age. Additionally, participants who saw more positive than negative general outcomes from genetic testing were more likely to favor genetic testing at younger ages. Overall, participants felt genetic testing should be offered at a younger age than most adult onset conditions and most felt the appropriate time for testing was when a child could understand and participate in the testing process. Psychological concerns seemed to be the primary focus of participants who favored later ages for genetic testing, while medical benefits were more commonly cited for younger age. This exploratory study has implications for counseling patients whose children are at risk of developing MEN1 and illustrates issues that are important to patients and their spouses when considering testing in children.