24 resultados para LOD (Linked Open Data)
em DigitalCommons@The Texas Medical Center
Resumo:
Brain tumor is one of the most aggressive types of cancer in humans, with an estimated median survival time of 12 months and only 4% of the patients surviving more than 5 years after disease diagnosis. Until recently, brain tumor prognosis has been based only on clinical information such as tumor grade and patient age, but there are reports indicating that molecular profiling of gliomas can reveal subgroups of patients with distinct survival rates. We hypothesize that coupling molecular profiling of brain tumors with clinical information might improve predictions of patient survival time and, consequently, better guide future treatment decisions. In order to evaluate this hypothesis, the general goal of this research is to build models for survival prediction of glioma patients using DNA molecular profiles (U133 Affymetrix gene expression microarrays) along with clinical information. First, a predictive Random Forest model is built for binary outcomes (i.e. short vs. long-term survival) and a small subset of genes whose expression values can be used to predict survival time is selected. Following, a new statistical methodology is developed for predicting time-to-death outcomes using Bayesian ensemble trees. Due to a large heterogeneity observed within prognostic classes obtained by the Random Forest model, prediction can be improved by relating time-to-death with gene expression profile directly. We propose a Bayesian ensemble model for survival prediction which is appropriate for high-dimensional data such as gene expression data. Our approach is based on the ensemble "sum-of-trees" model which is flexible to incorporate additive and interaction effects between genes. We specify a fully Bayesian hierarchical approach and illustrate our methodology for the CPH, Weibull, and AFT survival models. We overcome the lack of conjugacy using a latent variable formulation to model the covariate effects which decreases computation time for model fitting. Also, our proposed models provides a model-free way to select important predictive prognostic markers based on controlling false discovery rates. We compare the performance of our methods with baseline reference survival methods and apply our methodology to an unpublished data set of brain tumor survival times and gene expression data, selecting genes potentially related to the development of the disease under study. A closing discussion compares results obtained by Random Forest and Bayesian ensemble methods under the biological/clinical perspectives and highlights the statistical advantages and disadvantages of the new methodology in the context of DNA microarray data analysis.
Resumo:
The current state of health and biomedicine includes an enormity of heterogeneous data ‘silos’, collected for different purposes and represented differently, that are presently impossible to share or analyze in toto. The greatest challenge for large-scale and meaningful analyses of health-related data is to achieve a uniform data representation for data extracted from heterogeneous source representations. Based upon an analysis and categorization of heterogeneities, a process for achieving comparable data content by using a uniform terminological representation is developed. This process addresses the types of representational heterogeneities that commonly arise in healthcare data integration problems. Specifically, this process uses a reference terminology, and associated "maps" to transform heterogeneous data to a standard representation for comparability and secondary use. The capture of quality and precision of the “maps” between local terms and reference terminology concepts enhances the meaning of the aggregated data, empowering end users with better-informed queries for subsequent analyses. A data integration case study in the domain of pediatric asthma illustrates the development and use of a reference terminology for creating comparable data from heterogeneous source representations. The contribution of this research is a generalized process for the integration of data from heterogeneous source representations, and this process can be applied and extended to other problems where heterogeneous data needs to be merged.
Resumo:
A three-point linkage group comprised of loci coding for adenosine deaminase (ADA), glucose-6-phosphate dehydrogenase (G6PDH), and 6-phospho-gluconate dehydrogenase (6PGD) is described in fish of the genus Xiphophorus (Poeciliidae). The alleles at loci in this group were shown to assort independently from the alleles at three other loci--isocitrate dehydrogenase 1 and 2, and glyceraldehyde-3-phosphate dehydrogenase 1. Alleles at the latter three loci also assort independently from each other. Data were obtained by observing the segregation of electrophoretically variant alleles in reciprocal backcross hybrids derived from crosses between either X. helleri guentheri or X. h. strigatus and X. maculatus. The linkage component of chi2 was significant (less than 0.01) in all crosses, indicating that the linkage group is conserved in all populations of both species of Xiphophorus examined. While data from X. h. guentheri backcrosses indicate the linkage relationship ADA--6%--G6PDH--24%--6PGD, and ADA--29%--6PGD (30% when corrected for double crossovers), data from backcrosses involving strigatus, while supporting the same gene order, yielded significantly different recombination frequencies. The likelihood of the difference being due to an inversion could not be separated from the possibility of a sex effect on recombination in the present data. The linkage of 6PGD and G6PDH has been shown to exist in species of at least three classes of vertebrates, indicating the possibility of evolutionary conservation of this linkage.
Resumo:
People often use tools to search for information. In order to improve the quality of an information search, it is important to understand how internal information, which is stored in user’s mind, and external information, represented by the interface of tools interact with each other. How information is distributed between internal and external representations significantly affects information search performance. However, few studies have examined the relationship between types of interface and types of search task in the context of information search. For a distributed information search task, how data are distributed, represented, and formatted significantly affects the user search performance in terms of response time and accuracy. Guided by UFuRT (User, Function, Representation, Task), a human-centered process, I propose a search model, task taxonomy. The model defines its relationship with other existing information models. The taxonomy clarifies the legitimate operations for each type of search task of relation data. Based on the model and taxonomy, I have also developed prototypes of interface for the search tasks of relational data. These prototypes were used for experiments. The experiments described in this study are of a within-subject design with a sample of 24 participants recruited from the graduate schools located in the Texas Medical Center. Participants performed one-dimensional nominal search tasks over nominal, ordinal, and ratio displays, and searched one-dimensional nominal, ordinal, interval, and ratio tasks over table and graph displays. Participants also performed the same task and display combination for twodimensional searches. Distributed cognition theory has been adopted as a theoretical framework for analyzing and predicting the search performance of relational data. It has been shown that the representation dimensions and data scales, as well as the search task types, are main factors in determining search efficiency and effectiveness. In particular, the more external representations used, the better search task performance, and the results suggest the ideal search performance occurs when the question type and corresponding data scale representation match. The implications of the study lie in contributing to the effective design of search interface for relational data, especially laboratory results, which are often used in healthcare activities.
Resumo:
High-throughput assays, such as yeast two-hybrid system, have generated a huge amount of protein-protein interaction (PPI) data in the past decade. This tremendously increases the need for developing reliable methods to systematically and automatically suggest protein functions and relationships between them. With the available PPI data, it is now possible to study the functions and relationships in the context of a large-scale network. To data, several network-based schemes have been provided to effectively annotate protein functions on a large scale. However, due to those inherent noises in high-throughput data generation, new methods and algorithms should be developed to increase the reliability of functional annotations. Previous work in a yeast PPI network (Samanta and Liang, 2003) has shown that the local connection topology, particularly for two proteins sharing an unusually large number of neighbors, can predict functional associations between proteins, and hence suggest their functions. One advantage of the work is that their algorithm is not sensitive to noises (false positives) in high-throughput PPI data. In this study, we improved their prediction scheme by developing a new algorithm and new methods which we applied on a human PPI network to make a genome-wide functional inference. We used the new algorithm to measure and reduce the influence of hub proteins on detecting functionally associated proteins. We used the annotations of the Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) as independent and unbiased benchmarks to evaluate our algorithms and methods within the human PPI network. We showed that, compared with the previous work from Samanta and Liang, our algorithm and methods developed in this study improved the overall quality of functional inferences for human proteins. By applying the algorithms to the human PPI network, we obtained 4,233 significant functional associations among 1,754 proteins. Further comparisons of their KEGG and GO annotations allowed us to assign 466 KEGG pathway annotations to 274 proteins and 123 GO annotations to 114 proteins with estimated false discovery rates of <21% for KEGG and <30% for GO. We clustered 1,729 proteins by their functional associations and made pathway analysis to identify several subclusters that are highly enriched in certain signaling pathways. Particularly, we performed a detailed analysis on a subcluster enriched in the transforming growth factor β signaling pathway (P<10-50) which is important in cell proliferation and tumorigenesis. Analysis of another four subclusters also suggested potential new players in six signaling pathways worthy of further experimental investigations. Our study gives clear insight into the common neighbor-based prediction scheme and provides a reliable method for large-scale functional annotations in this post-genomic era.
Resumo:
Band 4.1B is a cytoskeletal adaptor protein that regulates various cellular behavior; however, the mechanisms by which Band 4.1B contributes to intracellular signaling are unclear. This project addresses in vivo and in vitro functions for Band 4.1B in integrin-mediated cell adhesion and signaling. Band 4.1B has been shown to bind to β8 integrin, although cooperative functions of these two proteins have not been determined. Here, functional links between β8 integrin and Band 4.1B were investigated using gene knockout strategies. Ablation of β8 integrin and Band 4.1B genes resulted in impaired cardiac morphogenesis, leading to embryonic lethality by E11.5. These embryos displayed malformation of the outflow tract that was likely linked to abnormal regulation of cardiac neural crest migration. These data indicate the importance of cooperative signaling between β8 integrin and Band 4.1B in cardiac development. The involvement of Band 4.1B in integrin-mediated cell adhesion and signaling was further demonstrated by studying its functional roles in vitro. Band 4.1B is highly expressed in the brain, but its signaling in astrocytes is not understood. Here, Band 4.1B was shown to promote cell spreading likely by interacting with β1 integrin via its band 4.1, ezrin, radixin, and moesin (FERM) domain in cell adhesions. In astrocytes, both Band 4.1B and β1 integrin were expressed in cell-ECM contact sites during early cell spreading. Exogenous expression of Band 4.1B, especially its FERM domain, enhanced cell spreading on fibronectin, an ECM ligand for β1 integrin. However, the increased cell spreading was prohibited by blocking β1 integrin. These findings suggest that Band 4.1B is crucial for early adhesion assembly and/or signaling that are mediated by β1 integrin. Collectively, this study was the first to establish Band 4.1B as a modulator of integrin-mediated adhesion and signaling.
Resumo:
Defects in apical-basal cell polarity and abnormal expression of cell polarity determinants are linked to human cancer. Loss of polarity is highly correlated with malignancy. In Drosophila, perturbation of apical-basal polarity, including overexpressing the apical determinant Crumbs, can lead to uncontrolled tissue growth. Cells mutant for the basolateral determinant scribble overproliferate and can form neoplastic tumors. Interestingly, scribble mutant clones that arise in wild-type tissues are eliminated and therefore do not manifest their tumorigenic potential. However, the mechanisms by which cell polarity coordinates with growth control pathways in developing organs to achieve appropriate organ size remain obscure. To investigate the function of apical determinants in growth regulation, I investigated the mechanism by which the apical determinant Crumbs affects growth in Drosophila imaginal discs. I found that crumbs gain and loss of function cause overgrowth and induction of Hippo target genes. In addition, Crumbs is required for the proper localization of Expanded, an upstream component of the Hippo pathway. Furthermore, we uncoupled the cell polarity and growth control function of Crb through structure-functional analysis. Taken together, our data identify a role of Crb in growth regulation specifically through modulation of the Hippo pathway. To further explore the role of polarity in growth control, I investigated how cells mutant for basolateral determinants are eliminated by using patches of cells mutant for scribble (scribble mutant clones) as a model system. We found that competitive cell-cell interactions eliminate tumorigenic scribble cells by modulation of the Hippo pathway. The regulation of Hippo signaling is required and sufficient to restrain the tumorous growth of scribble mutant cells. Artificially increasing the relative fitness of scribble mutant cells unleashes their tumorigenic potential. Therefore, we have identified a novel tumor-suppression mechanism that depends on signaling between normal and tumorigenic cells. These data identify evasion of cell competition as a critical step toward malignancy and illustrate a role for wild-type tissue in eliminating abnormal cells and preventing the formation of tumors.
Resumo:
Intensity modulated radiation therapy (IMRT) is a technique that delivers a highly conformal dose distribution to a target volume while attempting to maximally spare the surrounding normal tissues. IMRT is a common treatment modality used for treating head and neck (H&N) cancers, and the presence of many critical structures in this region requires accurate treatment delivery. The Radiological Physics Center (RPC) acts as both a remote and on-site quality assurance agency that credentials institutions participating in clinical trials. To date, about 30% of all IMRT participants have failed the RPC’s remote audit using the IMRT H&N phantom. The purpose of this project is to evaluate possible causes of H&N IMRT delivery errors observed by the RPC, specifically IMRT treatment plan complexity and the use of improper dosimetry data from machines that were thought to be matched but in reality were not. Eight H&N IMRT plans with a range of complexity defined by total MU (1460-3466), number of segments (54-225), and modulation complexity scores (MCS) (0.181-0.609) were created in Pinnacle v.8m. These plans were delivered to the RPC’s H&N phantom on a single Varian Clinac. One of the IMRT plans (1851 MU, 88 segments, and MCS=0.469) was equivalent to the median H&N plan from 130 previous RPC H&N phantom irradiations. This average IMRT plan was also delivered on four matched Varian Clinac machines and the dose distribution calculated using a different 6MV beam model. Radiochromic film and TLD within the phantom were used to analyze the dose profiles and absolute doses, respectively. The measured and calculated were compared to evaluate the dosimetric accuracy. All deliveries met the RPC acceptance criteria of ±7% absolute dose difference and 4 mm distance-to-agreement (DTA). Additionally, gamma index analysis was performed for all deliveries using a ±7%/4mm and ±5%/3mm criteria. Increasing the treatment plan complexity by varying the MU, number of segments, or varying the MCS resulted in no clear trend toward an increase in dosimetric error determined by the absolute dose difference, DTA, or gamma index. Varying the delivery machines as well as the beam model (use of a Clinac 6EX 6MV beam model vs. Clinac 21EX 6MV model), also did not show any clear trend towards an increased dosimetric error using the same criteria indicated above.
Resumo:
The human endogenous retrovirus K (HERV-K) env gene encodes envelope protein comprising surface (SU) and transmembrane (TM) domains. Having shown the exclusive expression of SU in human breast cancer and the stimulation of SU-specific immune responses in patients with breast cancer, our research here confirmed and extended the data by investigating the expression of HERV-K TM envelope domain and the induction of specific immune responses against TM in breast cancer patients. We found HERV-K TM mRNA and protein expression only in human breast cancer cells but not in normal controls. The specific immune responses against TM domain were induced in mice determined by enzyme-linked immunosorbent assay (ELISA) and IFN-γ enzyme-linked immunosorbent spot (ELISPOT) assay. Furthermore, ELISA detected higher titers of anti-HERV-K TM Env IgG antibodies in sera of breast cancer patients. In addition, the magnitude of the anti-HERV TM B cell response was correlated with the disease stage. Peripheral blood mononuclear cells (PBMCs) before and after in vitro stimulation (IVS) with HERV-K TM from patients with breast cancer as well as healthy controls were tested for T cell responses against HERV-K TM domain by ELISPOT assay. Breast cancer patients (n=21) had stronger HERV-K TM-specific cellular responses than healthy controls (n=12) (P < 0.05). These findings suggest, for the first time, that HERV-K TM expression was enhanced in human breast cancer cells and was able to induce specific B cell and T cell immune responses in breast cancer patients. This study provides support for HERV-K TM as a promising source of antigen for anti-tumor immunotherapy, prevention, diagnosis, and prognosis.
Resumo:
Background. Increased incidence of cancer is documented in immunosuppressed transplant patients. Likewise, as survival increases for persons infected with the Human Immunodeficiency Virus (HIV), we expect their incidence of cancer to increase. The objective of this study was to examine the current gender specific spectrum of cancer in an HIV infected cohort (especially malignancies not currently associated with Acquired Immunodeficiency Syndrome (AIDS)) in relation to the general population.^ Methods. Cancer incidence data was collected for residents of Harris County, Texas who were diagnosed with a malignancy between 1975 and 1994. This data was linked to HIV/AIDS registry data to identify malignancies in an HIV infected cohort of 14,986 persons. A standardized incidence ratio (SIR) analysis was used to compare incidence of cancer in this cohort to that in the general population. Risk factors such as mode of HIV infection, age, race and gender, were evaluated for contribution to the development of cancer within the HIV cohort, using Cox regression techniques.^ Findings. Of those in the HIV infected cohort, 2289 persons (15%) were identified as having one or more malignancies. The linkage identified 29.5% of these malignancies (males 28.7% females 60.9%). HIV infected men and women had incidences of cancer that were 16.7 (16.1, 17.3) and 2.9 (2.3, 3.7) times that expected for the general population of Harris County, Texas, adjusting for age. Significant SIR's were observed for the AIDS-defining malignancies of Kaposi's sarcoma, non-Hodgkin's lymphoma, primary lymphoma of the brain and cancer of the cervix. Additionally, significant SIR's for non-melanotic skin cancer in males, 6.9 (4.8, 9.5) and colon cancer in females, 4.0 (1.1, 10.2) were detected. Among the HIV infected cohort, race/ethnicity of White (relative risk 2.4 with 95% confidence intervals 2.0, 2.8) or Spanish Surname, 2.2 (1.9, 2.7) and an infection route of male to male sex, with, 3.0 (1.9, 4.9) or without, 3.4 (2.1, 5.5) intravenous drug use, increased the risk of having a diagnosis of an incident cancer.^ Interpretation. There appears to be an increased risk of developing cancer if infected with the HIV. In addition to the malignancies routinely associated with HIV infection, there appears to be an increased risk of being diagnosed with non-melanotic skin cancer in males and colon cancer in females. ^
Resumo:
Lodestar, a Drosophila maternal-effect gene, is essential for proper chromosome segregation during embryonic mitosis. Mutations in lodestar cause chromatin bridging in anaphase, preventing the sister chromatids from fully separating and leaving chromatin tangled at the metaphase plate. Drosophila lodestar protein was originally identified, in purified fractions of Drosophila Kc cell nuclear extracts, by its ability to suppress the generation of long RNA polymerase II transcripts. The human homolog of this protein (hLodestar) was cloned and studied in comparison to the Drosophila lodestar activities. The results of these studies show, similar to the Drosophila protein, hLodestar has dsDNA-dependent ATPase and transcription termination activity in vitro. hLodestar has also been shown to release RNA polymerase I and II stalled at a cyclobutane thymine dimer. Lodestar belongs to the SNF2 family of proteins, which are members of the DExH/D helicase super-family. The SNF2 family of proteins are believed to play a critical role in altering protein-DNA interactions in a variety of cellular contexts. We have recently isolated a human cDNA (hLodestar) that shares significant homology to the Drosophila lodestar gene. The 4.6 kb clone contains an open reading frame of 1162 amino acids, and shares 55% similarity and 46% identity to the Drosophila Lodestar protein sequence. Our studies looking for hLodestar interacting proteins revealed an association with CDC5L in the yeast two-hybrid system and co-immunoprecipitation experiments. CDC5L has been well documented to be a component of the spliceosome. Our data suggests hLodestar is involved in splicing through in vitro assembly and splicing reactions, in addition to its association with spliceosomes purified from HeLa nuclear extract. Although many other members of the DExH/D helicase super-family have been linked to splicing, this is the first SNF2 family member to be implicated in the splicing reaction. ^
Resumo:
Material Safety Data Sheets (MSDSs) are an integral component of occupational hazard communication systems. These documents are used to disseminate hazard information to workers on chemical substances. The primary purpose of this study was to investigate the comprehensibility of MSDSs by workers at an international level. ^ A total of 117 employees of a multi-national petrochemical company participated; thirty-nine (39) each in the United States, Canada and the United Kingdom. Overall participation rate of those approached to participate was 82%. These countries were selected as they each utilize one of the three major existing hazard communication systems for fixed workplaces. The systems are comprised of the Occupational Safety and Health Administration's Hazard Communication Standard in the United States, the Workplace Hazardous Materials Information System (WHMIS) in Canada, and the compilation of several European Union directives addressing classification, labeling of substances and preparations, and MSDSs in Europe. ^ A pretest posttest randomized study design was used, with the posttest being comparable to an open book test. The results of this research indicated that only about two-thirds of the information on the MSDSs was comprehended by the workers with a significant difference identified among study participants based on country comparisons. This data was fairly consistent with the results of previous MSDS comprehensibility studies conducted in the United States. There was no significant difference in the comprehension level among study participants when taking into account the international hazard communication standard that the MSDS complied with. Marginally, age, education level and experience level did not have a significant impact on the comprehension level. ^ Participants did find MSDSs to be satisfactory in providing the information needed to protect them regardless of their views on the readability and formatting of MSDSs. The health-related information was the least comprehended as less than half of it was comprehended on the basis of the responses. The findings from this research suggest that there is much work needed yet to make MSDSs more comprehensible on a global basis, particularly regarding health-related information. ^
Resumo:
Coronary artery disease (CAD) is a multifactorial disease process involving behavioral, inflammatory, clinical, thrombotic, and genetic components. Previous epidemiologic studies focused on identifying behavioral and demographic risk factors of CAD, but none focused on platelets. Current platelet literature lacks the known effects of platelet function and platelet receptor polymorphisms on CAD. This case-control analysis addressed these issues by analyzing data collected for a previous study. Cases were individuals who had undergone CABG and thus had been diagnosed with CAD, while the controls were volunteers presumed to be CAD free. The platelet function variables analyzed included fibrinogen Von Willebrand Factor activity (VWF), shear-induced platelet aggregation (SIPA), sCD40L, and mean platelet volume; and the platelet polymorphisms studied included PIA, α2 807, Ko, Kozak, and VNTR. Univariate analysis found fibrinogen, VWF, SIPA, and PIA to be independent risk factors of CAD. Logistic regression was used to build a predictive model for CAD using the platelet function and platelet polymorphism data adjusted for age, sex, race, and current smoking status. A model containing only platelet polymorphisms and their respective receptor densities, found polymorphisms within GPIbα to be associated with CAD, yielding an 86% (95% C.I. 0.97–3.55) increased risk with the presence of at least 1 polymorphism in Ko, Kozak, or VNTR. Another model included both platelet function and platelet polymorphism data. Fibrinogen, the receptor density of GPIbα, and the polymorphism in GPIa-IIa (α2 807) were all associated with CAD with odds ratios of 1.10, 1.04, and 2.30 for fibrinogen (10mg/dl increase), GPIbα receptors (1 MFI increase), and GPIa-IIa, respectively. In addition, risk estimates and 99% confidence intervals adjusted for race were calculated to determine if the presence of a platelet receptor polymorphism was associated with CAD. The results were as follows: PIA (1.64, 0.74–3.65); α2 807 (1.35, 0.77–2.37); Ko (1.71, 0.70–4.16); Kozak (1.17, 0.54–2.52); and VNTR (1.24, 0.52–2.91). Although not statistically significant, all platelet polymorphisms were associated with an increased risk for CAD. These exploratory findings indicate that platelets do appear to have a role in atherosclerosis and that anti-platelet drugs targeting GPI-IIa and GPIbα may be better treatment candidates for individuals with CAD. ^
Resumo:
The objectives of this study were to identify and measure the average outcomes of the Open Door Mission's nine-month community-based substance abuse treatment program, identify predictors of successful outcomes, and make recommendations to the Open Door Mission for improving its treatment program.^ The Mission's program is exclusive to adult men who have limited financial resources: most of which were homeless or dependent on parents or other family members for basic living needs. Many, but not all, of these men are either chemically dependent or have a history of substance abuse.^ This study tracked a cohort of the Mission's graduates throughout this one-year study and identified various indicators of success at short-term intervals, which may be predictive of longer-term outcomes. We tracked various levels of 12-step program involvement, as well as other social and spiritual activities, such as church affiliation and recovery support.^ Twenty-four of the 66 subjects, or 36% met the Mission's requirements for success. Specific to this success criteria; Fifty-four, or 82% reported affiliation with a home church; Twenty-six, or 39% reported full-time employment; Sixty-one, or 92% did not report or were not identified as having any post-treatment arrests or incarceration, and; Forty, or 61% reported continuous abstinence from both drugs and alcohol.^ Five research-based hypotheses were developed and tested. The primary analysis tool was the web-based non-parametric dependency modeling tool, B-Course, which revealed some strong associations with certain variables, and helped the researchers generate and test several data-driven hypotheses. Full-time employment is the greatest predictor of abstinence: 95% of those who reported full time employment also reported continuous post-treatment abstinence, while 50% of those working part-time were abstinent and 29% of those with no employment were abstinent. Working with a 12-step sponsor, attending aftercare, and service with others were identified as predictors of abstinence.^ This study demonstrates that associations with abstinence and the ODM success criteria are not simply based on one social or behavioral factor. Rather, these relationships are interdependent, and show that abstinence is achieved and maintained through a combination of several 12-step recovery activities. This study used a simple assessment methodology, which demonstrated strong associations across variables and outcomes, which have practical applicability to the Open Door Mission for improving its treatment program. By leveraging the predictive capability of the various success determination methodologies discussed and developed throughout this study, we can identify accurate outcomes with both validity and reliability. This assessment instrument can also be used as an intervention that, if operationalized to the Mission’s clients during the primary treatment program, may measurably improve the effectiveness and outcomes of the Open Door Mission.^
Resumo:
Among Mexican Americans, the second largest minority group in the United States, the prevalence of gallbladder disease is markedly elevated. Previous data from both genetic admixture and family studies indicate that there is a genetic component to the occurrence of gallbladder disease in Mexican Americans. However, prior to this thesis no formal genetic analysis of gallbladder disease had been carried out nor had any contributing genes been identified.^ The results of complex segregation analysis in a sample of 232 Mexican American pedigrees documented the existence of a major gene having two alleles with age- and gender-specific effects influencing the occurrence of gallbladder disease. The estimated frequency of the allele increasing susceptibility was 0.39. The lifetime probabilities that an individual will be affected by gallbladder disease were 1.0, 0.54, and 0.00 for females of genotypes "AA", "Aa", and "aa", respectively, and 0.68, 0.30, and 0.00 for males, respectively. This analysis provided the first conclusive evidence for the existence of a common single gene having a large effect on the occurrence of gallbladder disease.^ Human cholesterol 7$\alpha$-hydroxylase is the rate-limiting enzyme in bile acid synthesis. The results of an association study in both a random sample and a matched case/control sample showed that there is a significant association between cholesterol 7$\alpha$-hydroxylase gene variation and the occurrence of gallbladder disease in Mexican Americans males but not in females. These data have implicated a specific gene, 7$\alpha$-hydroxylase, in the etiology of gallbladder disease in this population.^ Finally, I asked whether the inferred major gene from complex segregation analysis is genetically linked to the cholesterol 7$\alpha$-hydroxylase gene. Three pedigrees predicted to be informative for linkage analysis by virtue of supporting the major gene hypothesis and having parents with informative genotypes and multiple offspring were selected for this linkage analysis. In each of these pedigrees, the recombination fractions maximized at 0 with a positive, albeit low, LOD score. The results of this linkage analysis provide preliminary and suggestive evidence that the cholesterol 7$\alpha$-hydroxylase gene and the inferred gallbladder disease susceptibility gene are genetically linked. ^