3 resultados para JOINT COMPOSITE INTERVAL MAPPING

em DigitalCommons@The Texas Medical Center


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Thoracic aortic aneurysms leading to aortic dissections (TAAD) are a major cause of morbidity and mortality in the United States. TAAD is a complication of some known genetic disorders, such as Marfan syndrome and Turner syndrome, but the majority of familial cases are not due to a known genetic syndrome. Previous studies by our group have established that nonsyndromic, familial TAAD is inherited in an autosomal dominant manner with decreased penetrance and variable expression. Using one large family with multiple members with TAAD for the genome wide scan, a major locus for familial TAAD was mapped to 5q13–14 (TAAD1). Nine out of 15 families studied were linked to this locus, establishing that TAAD1 was a major locus, and that there was genetic heterogeneity for the condition. Mapping of TAAD2 locus was accomplished using a single large family with multiple members with TAAD not linked to known loci of aneurysm formation. This established a second novel locus for familial TAAD on 3p24–25 (LOD score of 4.3), termed the TAAD2 locus. Two putative loci with suggestive LOD scores were mapped on 4q and 12q through a genome scan carried out using three families. TAAD phenotype in 12 families did not segregate with known loci, indicating further genetic heterogeneity. An STS-tagged BAC based contig was constructed for 7.8Mb and 25Mb critical interval of TAAD1 and TAAD2 respectively and characterized to identify the defective gene. The hypothesis that the defective genes responsible for the TAAD1 and TAAD2 encoded extracellular matrix (ECM) proteins, the major components of the elastic fiber system in the aortic media was tested. Four genes encoding ECM proteins, versican, thrombospondin-3, CRTL1, on TAAD1 and FBLN2 at TAAD2 were sequenced, but no disease-causing mutations were identified. Studies to identify the defective gene are initiated through the positional candidate gene approach using combination of bioinformatics and expression studies. The identification of the TAAD susceptibility genes will allow for presymptomatic diagnosis of individuals at risk for this life threatening disease. The identification of the molecular defects that contribute to TAAD will also further our understanding of the proteins that provide structural integrity to the aortic wall. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Renal cell carcinoma (RCC) is the most common malignant tumor of the kidney. Characterization of RCC tumors indicates that the most frequent genetic event associated with the initiation of tumor formation involves a loss of heterozygosity or cytogenetic aberration on the short arm of human chromosome 3. A tumor suppressor locus Nonpapillary Renal Carcinoma-1 (NRC-1, OMIM ID 604442) has been previously mapped to a 5–7 cM region on chromosome 3p12 and shown to induce rapid tumor cell death in vivo, as demonstrated by functional complementation experiments. ^ To identify the gene that accounts for the tumor suppressor activities of NRC-1, fine-scale physical mapping was conducted with a novel real-time quantitative PCR based method developed in this study. As a result, NRC-1 was mapped within a 4.6-Mb region defined by two unique sequences within UniGene clusters Hs.41407 and Hs.371835 (78,545Kb–83,172Kb in the NCBI build 31 physical map). The involvement of a putative tumor suppressor gene Robo1/Dutt1 was excluded as a candidate for NRC-1. Furthermore, a transcript map containing eleven candidate genes was established for the 4.6-Mb region. Analyses of gene expression patterns with real-time quantitative RT-PCR assays showed that one of the eleven candidate genes in the interval (TSGc28) is down-regulated in 15 out of 20 tumor samples compared with matched normal samples. Three exons of this gene have been identified by RACE experiments, although additional exon(s) seem to exist. Further gene characterization and functional studies are required to confirm the gene as a true tumor suppressor gene. ^ To study the cellular functions of NRC-1, gene expression profiles of three tumor suppressive microcell hybrids, each containing a functional copy of NRC-1, were compared with those of the corresponding parental tumor cell lines using 16K oligonucleotide microarrays. Differentially expressed genes were identified. Analyses based on the Gene Ontology showed that introduction of NRC-1 into tumor cell lines activates genes in multiple cellular pathways, including cell cycle, signal transduction, cytokines and stress response. NRC-1 is likely to induce cell growth arrest indirectly through WEE1. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In population studies, most current methods focus on identifying one outcome-related SNP at a time by testing for differences of genotype frequencies between disease and healthy groups or among different population groups. However, testing a great number of SNPs simultaneously has a problem of multiple testing and will give false-positive results. Although, this problem can be effectively dealt with through several approaches such as Bonferroni correction, permutation testing and false discovery rates, patterns of the joint effects by several genes, each with weak effect, might not be able to be determined. With the availability of high-throughput genotyping technology, searching for multiple scattered SNPs over the whole genome and modeling their joint effect on the target variable has become possible. Exhaustive search of all SNP subsets is computationally infeasible for millions of SNPs in a genome-wide study. Several effective feature selection methods combined with classification functions have been proposed to search for an optimal SNP subset among big data sets where the number of feature SNPs far exceeds the number of observations. ^ In this study, we take two steps to achieve the goal. First we selected 1000 SNPs through an effective filter method and then we performed a feature selection wrapped around a classifier to identify an optimal SNP subset for predicting disease. And also we developed a novel classification method-sequential information bottleneck method wrapped inside different search algorithms to identify an optimal subset of SNPs for classifying the outcome variable. This new method was compared with the classical linear discriminant analysis in terms of classification performance. Finally, we performed chi-square test to look at the relationship between each SNP and disease from another point of view. ^ In general, our results show that filtering features using harmononic mean of sensitivity and specificity(HMSS) through linear discriminant analysis (LDA) is better than using LDA training accuracy or mutual information in our study. Our results also demonstrate that exhaustive search of a small subset with one SNP, two SNPs or 3 SNP subset based on best 100 composite 2-SNPs can find an optimal subset and further inclusion of more SNPs through heuristic algorithm doesn't always increase the performance of SNP subsets. Although sequential forward floating selection can be applied to prevent from the nesting effect of forward selection, it does not always out-perform the latter due to overfitting from observing more complex subset states. ^ Our results also indicate that HMSS as a criterion to evaluate the classification ability of a function can be used in imbalanced data without modifying the original dataset as against classification accuracy. Our four studies suggest that Sequential Information Bottleneck(sIB), a new unsupervised technique, can be adopted to predict the outcome and its ability to detect the target status is superior to the traditional LDA in the study. ^ From our results we can see that the best test probability-HMSS for predicting CVD, stroke,CAD and psoriasis through sIB is 0.59406, 0.641815, 0.645315 and 0.678658, respectively. In terms of group prediction accuracy, the highest test accuracy of sIB for diagnosing a normal status among controls can reach 0.708999, 0.863216, 0.639918 and 0.850275 respectively in the four studies if the test accuracy among cases is required to be not less than 0.4. On the other hand, the highest test accuracy of sIB for diagnosing a disease among cases can reach 0.748644, 0.789916, 0.705701 and 0.749436 respectively in the four studies if the test accuracy among controls is required to be at least 0.4. ^ A further genome-wide association study through Chi square test shows that there are no significant SNPs detected at the cut-off level 9.09451E-08 in the Framingham heart study of CVD. Study results in WTCCC can only detect two significant SNPs that are associated with CAD. In the genome-wide study of psoriasis most of top 20 SNP markers with impressive classification accuracy are also significantly associated with the disease through chi-square test at the cut-off value 1.11E-07. ^ Although our classification methods can achieve high accuracy in the study, complete descriptions of those classification results(95% confidence interval or statistical test of differences) require more cost-effective methods or efficient computing system, both of which can't be accomplished currently in our genome-wide study. We should also note that the purpose of this study is to identify subsets of SNPs with high prediction ability and those SNPs with good discriminant power are not necessary to be causal markers for the disease.^