939 resultados para Copy number variations
Resumo:
Microarrays have a wide range of applications in the biomedical field. From the beginning, arrays have mostly been utilized in cancer research, including classification of tumors into different subgroups and identification of clinical associations. In the microarray format, a collection of small features, such as different oligonucleotides, is attached to a solid support. The advantage of microarray technology is the ability to simultaneously measure changes in the levels of multiple biomolecules. Because many diseases, including cancer, are complex, involving an interplay between various genes and environmental factors, the detection of only a single marker molecule is usually insufficient for determining disease status. Thus, a technique that simultaneously collects information on multiple molecules allows better insights into a complex disease. Since microarrays can be custom-manufactured or obtained from a number of commercial providers, understanding data quality and comparability between different platforms is important to enable the use of the technology to areas beyond basic research. When standardized, integrated array data could ultimately help to offer a complete profile of the disease, illuminating mechanisms and genes behind disorders as well as facilitating disease diagnostics. In the first part of this work, we aimed to elucidate the comparability of gene expression measurements from different oligonucleotide and cDNA microarray platforms. We compared three different gene expression microarrays; one was a commercial oligonucleotide microarray and the others commercial and custom-made cDNA microarrays. The filtered gene expression data from the commercial platforms correlated better across experiments (r=0.78-0.86) than the expression data between the custom-made and either of the two commercial platforms (r=0.62-0.76). Although the results from different platforms correlated reasonably well, combining and comparing the measurements were not straightforward. The clone errors on the custom-made array and annotation and technical differences between the platforms introduced variability in the data. In conclusion, the different gene expression microarray platforms provided results sufficiently concordant for the research setting, but the variability represents a challenge for developing diagnostic applications for the microarrays. In the second part of the work, we performed an integrated high-resolution microarray analysis of gene copy number and expression in 38 laryngeal and oral tongue squamous cell carcinoma cell lines and primary tumors. Our aim was to pinpoint genes for which expression was impacted by changes in copy number. The data revealed that especially amplifications had a clear impact on gene expression. Across the genome, 14-32% of genes in the highly amplified regions (copy number ratio >2.5) had associated overexpression. The impact of decreased copy number on gene underexpression was less clear. Using statistical analysis across the samples, we systematically identified hundreds of genes for which an increased copy number was associated with increased expression. For example, our data implied that FADD and PPFIA1 were frequently overexpressed at the 11q13 amplicon in HNSCC. The 11q13 amplicon, including known oncogenes such as CCND1 and CTTN, is well-characterized in different type of cancers, but the roles of FADD and PPFIA1 remain obscure. Taken together, the integrated microarray analysis revealed a number of known as well as novel target genes in altered regions in HNSCC. The identified genes provide a basis for functional validation and may eventually lead to the identification of novel candidates for targeted therapy in HNSCC.
Resumo:
Certain recent models of sex determination in mammals, Drosophila melanogaster, Caenorhabditis elegans, and snakes are examined in the light of the hypothesis that the relevant genetic regulatory mechanisms are similar and interrelated. The proposed key element in each of these instances is a noncoding DNA sequence, which serves as a high-affinity binding site for a repressor-like molecule regulating the activity of a major "sex-determining" gene. On this basis it is argued that, in several eukaryotes, (i) certain DNA sequences that are sex-determining are noncoding, in the sense that they are not the structural genes of a sex-determining protein; (ii) in some species these noncoding sequences are present in one sex and absent in the other, while in others their copy number or accessibility to regulatory molecules is significantly unequal between the two sexes; and (iii) this inequality determines whether the embryo develops into a male or a female.
Resumo:
Over the past years, much research on sarcomas based on low-resolution cytogenetic and molecular cytogenetic methods has been published, leading to the identification of genetic abnormalities partially underlying the tumourigenesis. Continued progress in the identification of genetic events such as copy number aberrations relies upon adapting the rapidly evolving high-resolution microarray technology, which will eventually provide novel insights into sarcoma biology, and targets for both diagnostics and drug development. The aim of this Thesis was to characterize DNA copy number changes that are involved in the pathogenesis of soft tissue leiomyosarcoma (LMS), dermatofibrosarcoma protuberans (DFSP), osteosarcoma (OS), malignant fibrous histiocytoma (MFH), and uterine leiomyosarcoma (ULMS) by applying fine resolution array comparative genomic hybridization (aCGH) technology. Both low- and high-grade LMS tumours showed distinct copy number patterns, in addition to sharing two minimal common regions of gains and losses. Small aberrations were detected by aCGH, which were beyond the resolution of chromosomal comparative genomic hybridization (cCGH). DFSP tumours analysed by aCGH showed gains in 17q, 22q, and 21 additional gained regions, but only one region (22q) with copy number loss. Recurrent amplicons identified in OS by aCGH were 12q11-q15, 8q, 6p12-p21, and 17p. Amplicons 12q and 17p were further characterized in detail. The amplicon at 17p was characterized by aCGH in low- and high-grade LMS, OS, and MFH. In all but one case this amplicon, with minimal common regions of gains at 17p11-p12, started with the distal loss of 17p13-pter. OS and high-grade LMS were grouped together as they showed a complex pattern of copy number gains and amplifications at 17p, whereas MFH and low-grade LMS showed a continuous pattern of copy number gains and amplification at 17p. In addition to the commonly gained and lost regions identified in ULMS by aCGH, various biological processes affected by these copy number changes were also indicated by pathway analysis. The three novel findings obtained in this work were: characterization of amplicon 17p in low- and high-grade LMS and MFH, profiles of DNA copy number changes in LMS, and detection of various pathways affected by copy number changes in ULMS. These studies have not been undertaken previously by aCGH technology, thus this Thesis adds new information regarding DNA copy number changes in sarcomas. In conclusion, the aCGH technique used in this Thesis has provided new insights into the genetics of sarcomas by detecting the precise regions affected by copy number changes and some potential candidate target genes within those regions, which had not been uncovered by previously applied low resolution techniques.
Resumo:
Hereditary Leiomyomatosis and Renal Cell Cancer (HLRCC) is a hereditary tumour predisposition syndrome. Its phenotype includes benign cutaneous and uterine leiomyomas (CLM, ULM) with high penetrance and rarer renal cell cancer (RCC), most commonly of papillary type 2 subtype. Over 130 HLRCC families have been identified world-wide but the RCC phenotype seems to concentrate in families from Finland and North America for unknown reasons. HLRCC is caused by heterozygous germline mutations in the fumarate hydratase (FH) gene. FH encodes the enzyme fumarase from mitochondrial citric acid cycle. Fumarase enzyme activity or type or site of the FH mutation are unassociated with disease phenotype. The strongest evidence for tumourigenesis mechanism in HLRCC supports a hypoxia inducible factor driven process called pseudohypoxia resulting from accumulation of the fumarase substrate fumarate. In this study, to assess the importance of gene- or exon-level deletions or amplifications of FH in patients with HLRCC-associated phenotypes, multiplex ligation-dependent probe amplification (MLPA) method was used. One novel FH mutation, deletion of exon 1, was found in a Swedish male patient with an evident HLRCC phenotype with CLM, RCC, and a family history of ULM and RCC. Six other patients with CLM and 12 patients with only RCC or uterine leiomyosarcoma (ULMS) remained FH mutation-negative. These results suggest that copy number aberrations of FH or its exons are an infrequent cause of HLRCC and that only co-occurrence of benign tumour types justifies FH-mutation screening in RCC or ULMS patients. Determination of the genomic profile of 11 HLRCC-associated RCCs from Finnish patients was performed by array comparative genomic hybridization. The most common copy number aberrations were gains of 2, 7, and 17 and losses of 13q12.3-q21.1, 14, 18, and X. When compared to aberrations of sporadic papillary RCCs, HLRCC-associated RCCs harboured a distinct DNA copy number profile and lacked many of the changes characterizing the sporadic RCCs. The findings suggest a divergent molecular pathway for tumourigenesis of papillary RCCs in HLRCC. In order to find a genetic modifier of RCC risk in HLRCC, genome-wide linkage and identical by descent (IBD) analysis studies were performed in Finnish HLRCC families with microsatellite marker mapping and SNP-array platforms. The linkage analysis identified only one locus of interest, the FH gene locus in 1q43, but no mutations were found in the genes of the region. IBD analysis yielded no convincing haplotypes shared by RCC patients. Although these results do not exclude the existence of a genetic modifier for RCC risk in HLRCC, they emphasize the role of FH mutations in the malignant tumourigenesis of HLRCC. To study the benign tumours in HLRCC, genome-wide DNA copy number and gene expression profiles of sporadic and HLRCC ULMs were defined with modern SNP- and gene-expression array platforms. The gene expression array suggests novel genes involved in FH-deficient ULM tumourigenesis and novel genes with putative roles in propagation of sporadic ULM. Both the gene expression and copy number profiles of HLRCC ULMs differed from those of sporadic ULMs indicating distinct molecular basis of the FH-deficient HLRCC tumours.
Resumo:
Chromosomal alterations in leukemia have been shown to have prognostic and predictive significance and are also important minimal residual disease (MRD) markers in the follow-up of leukemia patients. Although specific oncogenes and tumor suppressors have been discovered in some of the chromosomal alterations, the role and target genes of many alterations in leukemia remain unknown. In addition, a number of leukemia patients have a normal karyotype by standard cytogenetics, but have variability in clinical course and are often molecularly heterogeneous. Cytogenetic methods traditionally used in leukemia analysis and diagnostics; G-banding, various fluorescence in situ hybridization (FISH) techniques, and chromosomal comparative genomic hybridization (cCGH), have enormously increased knowledge about the leukemia genome, but have limitations in resolution or in genomic coverage. In the last decade, the development of microarray comparative genomic hybridization (array-CGH, aCGH) for DNA copy number analysis and the SNP microarray (SNP-array) method for simultaneous copy number and loss of heterozygosity (LOH) analysis has enabled investigation of chromosomal and gene alterations genome-wide with high resolution and high throughput. In these studies, genetic alterations were analyzed in acute myeloid leukemia (AML) and chronic lymphocytic leukemia (CLL). The aim was to screen and characterize genomic alterations that could play role in leukemia pathogenesis by using aCGH and SNP-arrays. One of the most important goals was to screen cryptic alterations in karyotypically normal leukemia patients. In addition, chromosomal changes were evaluated to narrow the target regions, to find new markers, and to obtain tumor suppressor and oncogene candidates. The work presented here shows the capability of aCGH to detect submicroscopic copy number alterations in leukemia, with information about breakpoints and genes involved in the alterations, and that genome-wide microarray analyses with aCGH and SNP-array are advantageous methods in the research and diagnosis of leukemia. The most important findings were the cryptic changes detected with aCGH in karyotypically normal AML and CLL, characterization of amplified genes in 11q marker chromosomes, detection of deletion-based mechanisms of MLL-ARHGEF12 fusion gene formation, and detection of LOH without copy number alteration in karyotypically normal AML. These alterations harbor candidate oncogenes and tumor suppressors for further studies.
Resumo:
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between body mass index and approximately 2.8 million SNPs in up to 123,865 individuals with targeted follow up of 42 SNPs in up to 125,931 additional individuals. We confirmed 14 known obesity susceptibility loci and identified 18 new loci associated with body mass index (P < 5 x 10(-)(8)), one of which includes a copy number variant near GPRC5B. Some loci (at MC4R, POMC, SH2B1 and BDNF) map near key hypothalamic regulators of energy balance, and one of these loci is near GIPR, an incretin receptor. Furthermore, genes in other newly associated loci may provide new insights into human body weight regulation.
Resumo:
A multiplex real-time PCR was developed for the detection and differentiation of two closely related bovine herpesviruses 1 (BoHV-1) and 5 (BoHV-5). The multiplex real-time PCR combines a duplex real-time PCR that targets the DNA polymerase gene of BoHV-1 and BoHV-5 and a real-time PCR targeting mitochondrial DNA, as a house-keeping gene, described previously by Cawthraw et al. (2009). The assay correctly identified 22 BoHV-1 and six BoHV-5 isolates from the Biosecurity Sciences Laboratory virus collection. BoHV-1 and BoHV-5 were also correctly identified when incorporated in spiked semen and brain tissue samples. The detection limits of the duplex assay were 10 copies of BoHV-1 and 45 copies of BoHV-5. The multiplex real-time PCR had reaction efficiencies of 1.04 for BoHV-1 and 1.08 for BoHV-5. Standard curves relating Ct value to template copy number had correlation coefficients of 0.989 for BoHV-1 and 0.978 for BoHV-5. The assay specificity was demonstrated by testing bacterial and viral DNA from pathogens commonly isolated from bovine respiratory and reproductive tracts. The validated multiplex real-time PCR was used to detect and differentiate BoHV-1 and BoHV-5 in bovine clinical samples with known histories.
Resumo:
Hereditary nonpolyposis colorectal cancer (HNPCC) is the most common known clearly hereditary cause of colorectal and endometrial cancer (CRC and EC). Dominantly inherited mutations in one of the known mismatch repair (MMR) genes predispose to HNPCC. Defective MMR leads to an accumulation of mutations especially in repeat tracts, presenting microsatellite instability. HNPCC is clinically a very heterogeneous disease. The age at onset varies and the target tissue may vary. In addition, families that fulfill the diagnostic criteria for HNPCC but fail to show any predisposing mutation in MMR genes exist. Our aim was to evaluate the genetic background of familial CRC and EC. We performed comprehensive molecular and DNA copy number analyses of CRCs fulfilling the diagnostic criteria for HNPCC. We studied the role of five pathways (MMR, Wnt, p53, CIN, PI3K/AKT) and divided the tumors into two groups, one with MMR gene germline mutations and the other without. We observed that MMR proficient familial CRC consist of two molecularly distinct groups that differ from MMR deficient tumors. Group A shows paucity of common molecular and chromosomal alterations characteristic of colorectal carcinogenesis. Group B shows molecular features similar to classical microsatellite stable tumors with gross chromosomal alterations. Our finding of a unique tumor profile in group A suggests the involvement of novel predisposing genes and pathways in colorectal cancer cohorts not linked to MMR gene defects. We investigated the genetic background of familial ECs. Among 22 families with clustering of EC, two (9%) were due to MMR gene germline mutations. The remaining familial site-specific ECs are largely comparable with HNPCC associated ECs, the main difference between these groups being MMR proficiency vs. deficiency. We studied the role of PI3K/AKT pathway in familial ECs as well and observed that PIK3CA amplifications are characteristic of familial site-specific EC without MMR gene germline mutations. Most of the high-level amplifications occurred in tumors with stable microsatellites, suggesting that these tumors are more likely associated with chromosomal rather than microsatellite instability and MMR defect. The existence of site-specific endometrial carcinoma as a separate entity remains equivocal until predisposing genes are identified. It is possible that no single highly penetrant gene for this proposed syndrome exists, it may, for example be due to a combination of multiple low penetrance genes. Despite advances in deciphering the molecular genetic background of HNPCC, it is poorly understood why certain organs are more susceptible than others to cancer development. We found that important determinants of the HNPCC tumor spectrum are, in addition to different predisposing germline mutations, organ specific target genes and different instability profiles, loss of heterozygosity at MLH1 locus, and MLH1 promoter methylation. This study provided more precise molecular classification of families with CRC and EC. Our observations on familial CRC and EC are likely to have broader significance that extends to sporadic CRC and EC as well.
Resumo:
Background: The Ewing sarcoma family of tumors (ESFT) are rare but highly malignant neoplasms that occur mainly in bone or but also in soft tissue. ESFT affects patients typically in their second decade of life, whereby children and adolescents bear the heaviest incidence burden. Despite recent advances in the clinical management of ESFT patients, their prognosis and survival are still disappointingly poor, especially in cases with metastasis. No targeted therapy for ESFT patients is currently available. Moreover, based merely on current clinical and biological characteristics, accurate classification of ESFT patients often fails at the time of diagnosis. Therefore, there is a constant need for novel molecular biomarkers to be applied in tandem with conventional parameters to further intensify ESFT risk-stratification and treatment selection, and ultimately to develop novel targeted therapies. In this context, a greater understanding of the genetics and immune characteristics of ESFT is needed. Aims: This study sought to open novel insights into gene copy number changes and gene expression in ESFT and, further, to enlighten the role of inflammation in ESFT. For this purpose, microarrays were used to provide gene-level information on a genomewide scale. In addition, this study focused on screening of 9p21.3 deletion sizes and frequencies in ESFT and, in another pediatric cancer, acute lymphocytic leukemia (ALL), in order to define more exact criteria for highrisk patient selection and to provide data for developing a more reliable diagnostic method to detect CDKN2A deletions. Results: In study I, 20 novel ESFT-associated suppressor genes and oncogenes were pinpointed using combined array CGH and expression analysis. In addition, interesting chromosomal rearrangements were identified: (1) Duplication of derivative chromosome der(22)(11;22) was detected in three ESFT patients. This duplication included the EWSR1-FLI1 fusion gene leading to increase in its copy number; (2) Cryptic amplifications on chromosomes 20 and 22 were detected, suggesting a novel translocation between chromosomes 20 and 22, which most probably produces a fusion between EWSR1 and NFATC2. In study II, bioinformatic analysis of ESFT expression profiles showed that inflammatory gene activation is detectable in ESFT patient samples and that the activation is characterized by macrophage gene expression. Most interestingly, ESFT patient samples were shown to express certain inflammatory genes that were prognostically significant. High local expression of C5 and JAK1 at the tumor site was shown to associate with favorable clinical outcome, whereas high local expression of IL8 was shown to be detrimental. Studies III and IV showed that the smallest overlapping region of deletion in 9p21.3 includes CDKN2A in all cases and that the length of this region is 12.2 kb in both Ewing sarcoma and ALL. Furthermore, our results showed that the most widely used commercial CDKN2A FISH probe creates false negative results in the narrowest microdeletion cases (<190 kb). Therefore, more accurate methods should be developed for the detection of deletions in the CDKN2A locus. Conclusions: This study provides novel insights into the genetic changes involved in the biology of ESFT, in the interaction between ESFT cells and immune system, and in the inactivation of CDKN2A. Novel ESFT biomarker genes identified in this study serve as a useful resource for future studies and in developing novel therapeutic strategies to improve the survival of patients with ESFT.
Resumo:
The von Hippel-lindau (VHL) disease is a dominantly inherited neoplastic disorder which predisposes patients to multiple tumours including capillary haemangioblastomas (CHBs), pheochromocytomas (PCCs), renal cell carcinomas (RCCs). CHBs are the most common manifestations of VHL disease, occurring sporadically or as a manifestation of VHL disease. Inactivation of the VHL gene at 3p25-26 is believed to cause both familial and sporadic VHL-associated tumours and germ-line mutation of the VHL gene have been detected in 100% of the CHBs studied. However, a limited number of sporadic CHBs, PCCs display VHL inactivation. Other molecular alterations involved in tumourigenesis of sporadic CHBs, PCCs remain largely unknown. The purpose of the present work was to search for genetic alterations, or other mechanisms of inactivation, in addition to the VHL gene, that may be important in the development of VHL-associated tumours. Though less satisfactory than cure, prevention and early detection are the most promising and feasible means reducing cancer morbidity and mortality. This work is based on the view that increasing knowledge about the molecular events underlying tumour development will eventually aid in early detection and lead to improved treatment. We evaluated a large set of VHL-associated patients, searched for a clinical and radiologic signs of the disease. We succesfully performed a germ-line mutation analysis and characterised three patient groups, VHL, suspect VHL and sporadic, a germ-line mutation analysis revealed a 50% mutation rate only in the VHL groups, no sporadic or suspect cases displayed any mutation. We also utilized comparative genomic hybridization (CGH) to screen for DNA copy number changes in both sporadic and VHL-associated CHB. Our analysis revealed (27%) DNA copy number losses. The most common finding was loss of chromosomal arm 6q, seen in (23%) cases, No differences were noted between VHL-associated and sporadic tumours. Furthermore a loss of heterozygosity (LOH) study on chromosome 3p and 6q was done with the purpose to determine allele losses not observable by CGH, and to uncover the location of putative tumour suppressor genes important in CHB and PCC tumourigenesis. We identified loss of chromosome 6q and a minimal deleted area at 6q23-24 in CHBs. We also showed LOH at 6q23-24 in PCCs and identified the ZAC1 (6q24-25) as a candidate gene, ZAC1 is a maternally imprinted tumour suppressor gene with anti proliferative properties. To study further the role of ZAC inactivation in CHBs, we investigated LOH, promoter hypermethylation and expression status of the ZAC1 gene in mainly sporadic CHBs. Our LOH analysis revealed that the majority of the tumours with allele loss. The gene promoter methylation analysis similarly detected predominance of the methylated ZAC sequence in almost all tumours. Immunohistochemistry exhibited a strongly reduced expression of ZAC in stromal cells of all CHBs studied. Our current results indicate that the absence of the unmethylated, ZAC1 promoter sequence was highly concurrent with LOH for the ZAC1 region or 6q loss. This observation together with lack of ZAC expression, points to preferential loss of the non imprinted, expressed ZAC allele in CHB, in summary, our series of studies reveal a new chromosomal region 6q, emphasizes the importance of ZAC1 gene in the development of CHB and PCC, particularly in non-VHL associated cases.
Resumo:
Malignant mesothelioma (MM) is a rare, usually incurable, disease mainly caused by former exposure to asbestos. Even though MM has a strong etiological link, genetic factors may play a role, since not all cases can be linked to former asbestos exposure. This thesis focuses on lung diseases, mainly malignant mesothelioma (MM), and idiopathic pulmonary fibrosis (IPF), which resembles asbestosis. The specific asbestos-related pathways associated with malignant as well as non-malignant lung diseases, still need to be clarified. Since most patients diagnosed with MM or asbestosis/fibrosis have a dismal prognosis and few therapeutic options are available, early diagnosis and better understanding of the disease pathogenesis are of the utmost importance. The first objective of this thesis was to identify asbestos specific differentially expressed genes. This was approached by using high-resolution gene expression arrays, and three different human lung cell lines, as well as with three different bioinformatics approaches. Since the first study aimed to elucidate potential early changes, the second study was used to screen DNA copy number changes in MM tumour samples. This was performed using genome wide microarrays for identification of DNA copy number changes characterstic for MM. Study III focused on the role of gremlin in the regulation of bone morphogenetic protein (BMPs) in IPF. Further studies were conducted in asbestos-exposed cell cultures as well as in an asbestos-induced mouse model. Furthermore, GATA-6 was studied in MM and metastatic pleural adenocarcinoma. The GATA transcription factors are important during embryonic development, but their role in cancer is still unclear. GATA-6 is a co-factor/target of thyroid transcription factor 1 (TTF-1), which is used in differential diagnostics of pleural MM and adenocarcinoma. Bioinformatics probed the genes and biological processes ordered in terms of significance, clusters, and highly enriched chromosomal regions. The study revealed several already identified targets, produced new ideas about genes which are central for asbestos exposure, as well as provided supplementary data for researchers to check their own novel findings or ideas. The analysis revealed DNA copy number changes characteristic for MM tumors. The most common regions of loss were detected in 1p, 3p, 6q, 9p, 13, 14, and 22, and gains at 17q. The histological features in asbestosis and IPF are very similar, wherefore IPF can be studied in asbestos models. The BMP antagonist gremlin was up-regulated by asbestos exposure in human epithelial cell lines, which was also observed in Study I. The transforming growth factor (TGF) -β and BMP expression and signaling activities were measured from murine and human fibrotic lungs. BMP-7 signaling was down-regulated in response to up-regulation of gremlin, and restoration of BMP-7 signaling prevented progression of fibrosis in mice. Therefore, the study suggests that the restoration of BMP-7 signaling in fibrotic lung could potentially aid in the treatment of IPF patients. Study IV revealed that GATA-6 was strongly expressed in the majority of the MM cases, and correlated statistically significant with longer survival in subgroups of MM.
Resumo:
Helicobacter pylori infection is a risk factor for gastric cancer, which is a major health issue worldwide. Gastric cancer has a poor prognosis due to the unnoticeable progression of the disease and surgery is the only available treatment in gastric cancer. Therefore, gastric cancer patients would greatly benefit from identifying biomarker genes that would improve diagnostic and prognostic prediction and provide targets for molecular therapies. DNA copy number amplifications are the hallmarks of cancers in various anatomical locations. Mechanisms of amplification predict that DNA double-strand breaks occur at the margins of the amplified region. The first objective of this thesis was to identify the genes that were differentially expressed in H. pylori infection as well as the transcription factors and signal transduction pathways that were associated with the gene expression changes. The second objective was to identify putative biomarker genes in gastric cancer with correlated expression and copy number, and the last objective was to characterize cancers based on DNA copy number amplifications. DNA microarrays, an in vitro model and real-time polymerase chain reaction were used to measure gene expression changes in H. pylori infected AGS cells. In order to identify the transcription factors and signal transduction pathways that were activated after H. pylori infection, gene expression profiling data from the H. pylori experiments and a bioinformatics approach accompanied by experimental validation were used. Genome-wide expression and copy number microarray analysis of clinical gastric cancer samples and immunohistochemistry on tissue microarray were used to identify putative gastric cancer genes. Data mining and machine learning techniques were applied to study amplifications in a cross-section of cancers. FOS and various stress response genes were regulated by H. pylori infection. H. pylori regulated genes were enriched in the chromosomal regions that are frequently changed in gastric cancer, suggesting that molecular pathways of gastric cancer and premalignant H. pylori infection that induces gastritis are interconnected. 16 transcription factors were identified as being associated with H. pylori infection induced changes in gene expression. NF-κB transcription factor and p50 and p65 subunits were verified using elecrophoretic mobility shift assays. ERBB2 and other genes located in 17q12- q21 were found to be up-regulated in association with copy number amplification in gastric cancer. Cancers with similar cell type and origin clustered together based on the genomic localization of the amplifications. Cancer genes and large genes were co-localized with amplified regions and fragile sites, telomeres, centromeres and light chromosome bands were enriched at the amplification boundaries. H. pylori activated transcription factors and signal transduction pathways function in cellular mechanisms that might be capable of promoting carcinogenesis of the stomach. Intestinal and diffuse type gastric cancers showed distinct molecular genetic profiles. Integration of gene expression and copy number microarray data allowed the identification of genes that might be involved in gastric carcinogenesis and have clinical relevance. Gene amplifications were demonstrated to be non-random genomic instabilities. Cell lineage, properties of precursor stem cells, tissue microenvironment and genomic map localization of specific oncogenes define the site specificity of DNA amplifications, whereas labile genomic features define the structures of amplicons. These conclusions suggest that the definition of genomic changes in cancer is based on the interplay between the cancer cell and the tumor microenvironment.
Resumo:
The evolutionary function of X chromosome inactivation is thought to be dosage compensation. However, there is, at present, little evidence to suggest that most X chromosome-linked genes require such compensation. Another view--that X chromosome inactivation may be related to sex determination--is examined here. Consider a hypothetical DNA sequence regulating a major structural gene concerned with the determination of maleness. If this regulatory sequence occurs in both X and Y chromosomes and if its copy number in the Y chromosome is significantly greater than in the X chromosome, then the male-determining properties of the Y chromosome could be attributed to this higher copy number. On the other hand, if the Y chromosome has the same copy number of this sequence as the X chromosome, it is difficult to see how determination of two sexes would occur under such circumstances because XX and XY genomes would then be indistinguishable in this regard. Such a situation seems to occur in the human species with respect to the banded krait minor satellite, a repetitious DNA sequence associated with sex determination. This apparent difficulty may be resolved if X chromosome inactivation renders regulatory as well as structural genes nonfunctional and thereby brings about a significant reduction in the effective copy number of X chromosome-linked DNA sequences concerned with sex determination. It is suggested that X chromosome inactivation brings about, in this manner, a critical inequality between XX and XY embryos and that sex determination in humans is a consequence of this inequality. An analogous situation appears to exist in certain insects in which inactivation of a haploid set of chromosomes (and presumably, therefore, a 50% reduction in the effective copy number of most genes) is associated with maleness. If this line of reasoning is correct, it would suggest that sex determination may be the primary function of X chromosome inactivation.
Resumo:
Large-scale chromosome rearrangements such as copy number variants (CNVs) and inversions encompass a considerable proportion of the genetic variation between human individuals. In a number of cases, they have been closely linked with various inheritable diseases. Single-nucleotide polymorphisms (SNPs) are another large part of the genetic variance between individuals. They are also typically abundant and their measuring is straightforward and cheap. This thesis presents computational means of using SNPs to detect the presence of inversions and deletions, a particular variety of CNVs. Technically, the inversion-detection algorithm detects the suppressed recombination rate between inverted and non-inverted haplotype populations whereas the deletion-detection algorithm uses the EM-algorithm to estimate the haplotype frequencies of a window with and without a deletion haplotype. As a contribution to population biology, a coalescent simulator for simulating inversion polymorphisms has been developed. Coalescent simulation is a backward-in-time method of modelling population ancestry. Technically, the simulator also models multiple crossovers by using the Counting model as the chiasma interference model. Finally, this thesis includes an experimental section. The aforementioned methods were tested on synthetic data to evaluate their power and specificity. They were also applied to the HapMap Phase II and Phase III data sets, yielding a number of candidates for previously unknown inversions, deletions and also correctly detecting known such rearrangements.
Resumo:
This thesis presents a highly sensitive genome wide search method for recessive mutations. The method is suitable for distantly related samples that are divided into phenotype positives and negatives. High throughput genotype arrays are used to identify and compare homozygous regions between the cohorts. The method is demonstrated by comparing colorectal cancer patients against unaffected references. The objective is to find homozygous regions and alleles that are more common in cancer patients. We have designed and implemented software tools to automate the data analysis from genotypes to lists of candidate genes and to their properties. The programs have been designed in respect to a pipeline architecture that allows their integration to other programs such as biological databases and copy number analysis tools. The integration of the tools is crucial as the genome wide analysis of the cohort differences produces many candidate regions not related to the studied phenotype. CohortComparator is a genotype comparison tool that detects homozygous regions and compares their loci and allele constitutions between two sets of samples. The data is visualised in chromosome specific graphs illustrating the homozygous regions and alleles of each sample. The genomic regions that may harbour recessive mutations are emphasised with different colours and a scoring scheme is given for these regions. The detection of homozygous regions, cohort comparisons and result annotations are all subjected to presumptions many of which have been parameterized in our programs. The effect of these parameters and the suitable scope of the methods have been evaluated. Samples with different resolutions can be balanced with the genotype estimates of their haplotypes and they can be used within the same study.