37 resultados para association rule mining

em Helda - Digital Repository of University of Helsinki


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Gene mapping is a systematic search for genes that affect observable characteristics of an organism. In this thesis we offer computational tools to improve the efficiency of (disease) gene-mapping efforts. In the first part of the thesis we propose an efficient simulation procedure for generating realistic genetical data from isolated populations. Simulated data is useful for evaluating hypothesised gene-mapping study designs and computational analysis tools. As an example of such evaluation, we demonstrate how a population-based study design can be a powerful alternative to traditional family-based designs in association-based gene-mapping projects. In the second part of the thesis we consider a prioritisation of a (typically large) set of putative disease-associated genes acquired from an initial gene-mapping analysis. Prioritisation is necessary to be able to focus on the most promising candidates. We show how to harness the current biomedical knowledge for the prioritisation task by integrating various publicly available biological databases into a weighted biological graph. We then demonstrate how to find and evaluate connections between entities, such as genes and diseases, from this unified schema by graph mining techniques. Finally, in the last part of the thesis, we define the concept of reliable subgraph and the corresponding subgraph extraction problem. Reliable subgraphs concisely describe strong and independent connections between two given vertices in a random graph, and hence they are especially useful for visualising such connections. We propose novel algorithms for extracting reliable subgraphs from large random graphs. The efficiency and scalability of the proposed graph mining methods are backed by extensive experiments on real data. While our application focus is in genetics, the concepts and algorithms can be applied to other domains as well. We demonstrate this generality by considering coauthor graphs in addition to biological graphs in the experiments.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Tämä pro gradu -tutkielma käsittelee kielisidonnaisen huumorin kääntämistä. Ennen itse kääntämisen käsittelyä täytyy kuitenkin määritellä mitä tarkoitetaan käsitteellä kielisidonnainen huumori . Aluksi käsitellään huumoria ja sen ominaisuuksia sekä huumorin ja kulttuuristen, fysiologisten ja sosiaalisten tekijöiden suhdetta. Huumori syntyy kun tietyt odotukset rikotaan, eli toimitaan joidenkin normien vastaisesti. Huumorin eräitä perusperiaatteita ovat ristiriitaisuus ja yhteensopimattomuus. Kielisidonnaisessa huumorissa ristiriitaisuus ja yhteensopimattomuus on havaittavissa kielen tasolla: kieltä käytetään tuottamaan moniselitteinen tai -mielinen koominen ilmaus. Sanaleikki, jossa käytetään yhtä sanaa jolla on yksi tai useampi merkitys, on tyypillinen esimerkki kielisidonnaisesta huumorista. Mutta kielisidonnainen huumori ei rajoitu pelkästään tämänkaltaisiin sanaleikkeihin (engl. pun), vaan kattaa käsitteenä laajemman valikoiman erilaisia kielisidonnaisen huumorin muotoja, esimerkiksi monimerkityksiset nimet, idiomaattisilla ilmauksilla tuotettu huumori, akrostikonit, kirjoituksen konventioita rikkomalla tuotettu huumori jne. Kielisidonnainen huumori on tutkielmassa luokiteltu ja määritelty omaksi huumorin alalajikseen. Kielisidonnaisen huumorin kielellinen monimerkityksisyys tekee sen kääntämisestä vaikeampaa kuin sellaisen tekstin, jossa kielen tasolla ei ilmene monimerkityksisyyttä. Tästä syystä kielisidonnainen huumori tarvitsee erilaisen käännösstrategian kuin esimerkiksi tieteellinen teksti. Seuraavaksi käydään aluksi läpi joitakin käännösteorian keskeisiä käsitteitä ja niiden suhdetta ja vaikutuksia kielisidonnaisen huumorin kääntämiseen. Sitten kuvataan kielisidonnaisen huumorin käännösprosessi, joka jakautuu kolmeen osaan: tunnistaminen, analyysi ja kääntäminen. Näiden kolmen pohjalta laaditaan kuuden eri käännösstrategian ryhmä. Kuusi eri päästrategiaa ovat käännössidonnaisen huumorikategorian säilyttäminen, kirjaimellinen käännös, muun tyylikeinon käyttäminen, kompensaatio, poisjättö ja toimitukselliset keinot. Strategiat käydään läpi deskriptiivisesti ja niiden käyttöä valaistaan esimerkkien avulla. Osa päästrategioista jakautuu alastrategioihin, jotka kuvaavat tarkemmin, minkälaisin keinoin lähtökielen kielisidonnainen käännösongelma voidaan siirtää kohdekieleen. Strategiat pyritään kuvaamaan siten, että ne voisivat olla avuksi käännettäessä minkä tahansa kieliparin välillä. Vaikka kuvatut käännösstrategiat käydään läpi deskriptiivisesti, on pyrkii tutkielma myös olemaan avuksi käytännön tilanteissa kielisidonnaista huumoria käännettäessä. Tätä varten on tutkielman lopussa annettu kuvaus yhden kielisidonnaisen huumoriongelman kääntämisprosessista. Yhdistämällä teoria käytäntöön kuvataan käännösprosessiesimerkissä yhden kielisidonnaisen huumoriongelman analyysi-ja kääntämisvaiheet. Tuloksena on viisi erilaista versiota samasta lähtötekstin käännösongelmasta. Tutkielma siis ensinnäkin määrittelee, mitä ja minkälaista on kielisidonnainen huumori sekä luokittelee sen. Toisekseen tutkielma kuvaa sen käännösprosessin ja määrittelee eri käännösstrategiat. Lisäksi esimerkin avulla esitellään eri käännösvaihtoehtoja. Avainsanat: kääntäminen, huumori, sanaleikki, kielisidonnainen

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This thesis focused on the roles of parenting styles (including parental norm-breaking attitudes and parents perceptions of players coach-athlete relationship), players achievement strategies, perceptions of coaching behaviours, coaches own perceptions, and perceptions of team leaders, in explaining player satisfaction and team cohesion. Five studies based on the same data provided by the Finnish Ice Hockey Association (FIHA) were carried out. The sample sizes were as follows: players, 1,018; parents, 979; coaches, 35; and team leaders, 57. Questionnaires and self-report questionnaires were used to collect data. The results revealed that: (1) family-parenting styles provided a basis for adolescents achievement strategies and influenced whether the sons played fairly or engaged in rule breaking. Democratic parenting was associated with the adolescents high level of mastery-oriented behaviour, low level of task-irrelevant behaviour, and low level of norm-breaking behaviour. The adaptive achievement strategies enhanced player satisfaction. (2) Democratic parenting styles influenced parents perception of the coach-athlete relationship, which was further associated with a coaching style that also influenced how the child experienced his own cohesion within the team. (3) The adolescents tended to reflect the similarity in leadership behaviours between home and sport from both democratic and autocratic backgrounds. In particular, the compensating combination of non-demanding styles at home and a high level of support by a positive coach was associated with high team cohesion. (4) The stress factor changed the dynamics of the parenting behaviour. (5) Players ratings, coaches ratings, and the ratings of team leaders all differed upon coaching behaviours and team cohesion. Only the players ratings were associated with cohesion high with positive coaching and low with autocratic coaching. The developmental age and the long-lasting membership on an ice hockey team made positive coaching acceptable for all players. Sixteen year-olds from all families rated high team cohesion with positive coaching. Parenting styles were associated with adolescent ice hockey players achievement strategies, norm breaking, and satisfaction. The combination of parenting and coaching was associated with cohesion rated by players. The staff s experiences of coaching and its effects on cohesion differed from the players experiences. These results contribute to understanding links between parenting styles, achievement strategies, norm breaking, and satisfaction, as well as parenting styles, coaching behaviour and cohesion. This work has importance for parents, coaches, sport organizations, and teachers.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The success of entering work life, young people s psychological resources and self-reported well-being were studied in a longitudinal setting from a life-span developmental-contextual perspective in early adulthood. The aim was to analyse how psychosocial characteristics in early childhood and adolescence predict successful entrance into work life, how this is associated with well-being, and to assess the level of psychological resources such as dispositional optimism, personal meaning of work and coping in early adulthood. The role of these and social support, in the relationship between regional factors (such as place of residence and migration), self-reported health and life satisfaction was studied. The association between a specific coping strategy, i.e. eating and drinking in a stressful situation and eating habits, was studied to demonstrate how coping is associated with health behaviour. Multivariate methods, including binary logistic regression analyses and ANOVA, were used for statistical analyses. The subjects were members of the Northern Finland 1966 Birth Cohort, which consists of all women and men born in 1966 in the two northernmost provinces of Finland (n= 12,058). The most recent follow-up, at the age of 31 years when 11,637 subjects were alive, took place in 1997-1998. The results show, first, that social resources in the childhood family and adolescence school achievement predict entrance into the labour market. Secondly, psychosocial resources were found to mediate the relationship between migration from rural to urban areas, and subjective well-being. Thirdly, psychological resources at entrance into the labour market were found to develop from early infancy on. They are, however, influenced later by work history. Fourthly, stress-related eating and drinking, as a way of coping, was found to be directly associated with unhealthy eating habits and alcohol use. Gender differences were found in psychosocial resources predicting, and being associated with success in entering the labour market. For men, the role of attitudinal and psychological factors seems to be especially important in entrance into work life and in the development of psychological resources. For women, academic attainment was more important for successfully entering work life, and lack of emotional social support was a risk factor for stress-related eating only among women. Stress-related eating and drinking habits were predicted by a long history of unemployment as well as a low level of education among both genders, but not excluding an academic degree among men. The results emphasize the role of childhood psychosocial factors in preventing long-term unemployment and in enhancing psychological well-being in early adulthood. Success in entering work life, in terms of continuous work history, plays a crucial role for well-being and the amount of psychological resources in early adulthood. The results emphasize the crucial role of enhancing psychological resources for promoting positive health behaviour and diminishing regional differences in subjective well-being.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Clinical trials have shown that weight reduction with lifestyles can delay or prevent diabetes and reduce blood pressure. An appropriate definition of obesity using anthropometric measures is useful in predicting diabetes and hypertension at the population level. However, there is debate on which of the measures of obesity is best or most strongly associated with diabetes and hypertension and on what are the optimal cut-off values for body mass index (BMI) and waist circumference (WC) in this regard. The aims of the study were 1) to compare the strength of the association for undiagnosed or newly diagnosed diabetes (or hypertension) with anthropometric measures of obesity in people of Asian origin, 2) to detect ethnic differences in the association of undiagnosed diabetes with obesity, 3) to identify ethnic- and sex-specific change point values of BMI and WC for changes in the prevalence of diabetes and 4) to evaluate the ethnic-specific WC cutoff values proposed by the International Diabetes Federation (IDF) in 2005 for central obesity. The study population comprised 28 435 men and 35 198 women, ≥ 25 years of age, from 39 cohorts participating in the DECODA and DECODE studies, including 5 Asian Indian (n = 13 537), 3 Mauritian Indian (n = 4505) and Mauritian Creole (n = 1075), 8 Chinese (n =10 801), 1 Filipino (n = 3841), 7 Japanese (n = 7934), 1 Mongolian (n = 1991), and 14 European (n = 20 979) studies. The prevalence of diabetes, hypertension and central obesity was estimated, using descriptive statistics, and the differences were determined with the χ2 test. The odds ratios (ORs) or  coefficients (from the logistic model) and hazard ratios (HRs, from the Cox model to interval censored data) for BMI, WC, waist-to-hip ratio (WHR), and waist-to-stature ratio (WSR) were estimated for diabetes and hypertension. The differences between BMI and WC, WHR or WSR were compared, applying paired homogeneity tests (Wald statistics with 1 df). Hierarchical three-level Bayesian change point analysis, adjusting for age, was applied to identify the most likely cut-off/change point values for BMI and WC in association with previously undiagnosed diabetes. The ORs for diabetes in men (women) with BMI, WC, WHR and WSR were 1.52 (1.59), 1.54 (1.70), 1.53 (1.50) and 1.62 (1.70), respectively and the corresponding ORs for hypertension were 1.68 (1.55), 1.66 (1.51), 1.45 (1.28) and 1.63 (1.50). For diabetes the OR for BMI did not differ from that for WC or WHR, but was lower than that for WSR (p = 0.001) in men while in women the ORs were higher for WC and WSR than for BMI (both p < 0.05). Hypertension was more strongly associated with BMI than with WHR in men (p < 0.001) and most strongly with BMI than with WHR (p < 0.001), WSR (p < 0.01) and WC (p < 0.05) in women. The HRs for incidence of diabetes and hypertension did not differ between BMI and the other three central obesity measures in Mauritian Indians and Mauritian Creoles during follow-ups of 5, 6 and 11 years. The prevalence of diabetes was highest in Asian Indians, lowest in Europeans and intermediate in others, given the same BMI or WC category. The  coefficients for diabetes in BMI (kg/m2) were (men/women): 0.34/0.28, 0.41/0.43, 0.42/0.61, 0.36/0.59 and 0.33/0.49 for Asian Indian, Chinese, Japanese, Mauritian Indian and European (overall homogeneity test: p > 0.05 in men and p < 0.001 in women). Similar results were obtained in WC (cm). Asian Indian women had lower  coefficients than women of other ethnicities. The change points for BMI were 29.5, 25.6, 24.0, 24.0 and 21.5 in men and 29.4, 25.2, 24.9, 25.3 and 22.5 (kg/m2) in women of European, Chinese, Mauritian Indian, Japanese, and Asian Indian descent. The change points for WC were 100, 85, 79 and 82 cm in men and 91, 82, 82 and 76 cm in women of European, Chinese, Mauritian Indian, and Asian Indian. The prevalence of central obesity using the 2005 IDF definition was higher in Japanese men but lower in Japanese women than in their Asian counterparts. The prevalence of central obesity was 52 times higher in Japanese men but 0.8 times lower in Japanese women compared to the National Cholesterol Education Programme definition. The findings suggest that both BMI and WC predicted diabetes and hypertension equally well in all ethnic groups. At the same BMI or WC level, the prevalence of diabetes was highest in Asian Indians, lowest in Europeans and intermediate in others. Ethnic- and sex-specific change points of BMI and WC should be considered in setting diagnostic criteria for obesity to detect undiagnosed or newly diagnosed diabetes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Type 1 diabetes is a disease where the insulin-producing beta cells of the pancreas are destroyed by an autoimmune mechanism. The incidence of type 1 diabetes, as well as the incidence of the diabetic kidney complication, diabetic nephropathy, are increasing worldwide. Nephrin is a crucial molecule for the filtration function of the kidney. It localises in the podocyte foot processes partially forming the interpodocyte final sieve of the filtration barrier, the slit diaphragm. The expression of nephrin is altered in diabetic nephropathy. Recently, nephrin was found from the beta cells of the pancreas as well, which makes this molecule interesting in the context of type 1 diabetes and especially in diabetic nephropathy. In this thesis work, the expression of other podocyte molecules in the beta cells of the pancreas, in addition to nephrin, were deciphered. It was also hypothesised that patients with type 1 diabetes may develop autoantibodies against novel beta cell molecules comparably to the formation of autoantibodies to GAD, IA-2 and insulin. The possible association of such novel autoantibodies with the pathogenesis of diabetic nephropathy was also assessed. Furthermore, expression of nephrin in lymphoid tissues has been suggested, and this issue was more thoroughly deciphered here. The expression of nephrin in the human lymphoid tissues, and a set of podocyte molecules in the human, mouse and rat pancreas at the gene and protein level were studied by polymerase chain reaction (PCR) -based methods and immunochemical methods. To detect autoantibodies to novel beta cell molecules, specific radioimmunoprecipitation assays were developed. These assays were used to screen a follow-up material of 66 patients with type 1 diabetes and a patient material of 150 diabetic patients with signs of diabetic nephropathy. Nephrin expression was detected in the lymphoid follicle germinal centres, specifically in the follicular dendritic cells. In addition to the previously reported expression of nephrin in the pancreas, expression of the podocyte molecules, densin, filtrin, FAT and alpha-actinin-4 were detected in the beta cells. Circulating antibodies to nephrin, densin and filtrin were discovered in a subset of patients with type 1 diabetes. However, no association of these autoantibodies with the pathogenesis of diabetic nephropathy was detected. In conclusion, the expression of five podocyte molecules in the beta cells of the pancreas suggests some molecular similarities between the two cell types. The novel autoantibodies against shared molecules of the kidney podocytes and the pancreatic beta cells appear to be part of the common autoimmune mechanism in patients with type 1 diabetes. No data suggested that the autoantibodies would be active participants of the kidney injury detected in diabetic nephropathy.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this thesis, two separate single nucleotide polymorphism (SNP) genotyping techniques were set up at the Finnish Genome Center, pooled genotyping was evaluated as a screening method for large-scale association studies, and finally, the former approaches were used to identify genetic factors predisposing to two distinct complex diseases by utilizing large epidemiological cohorts and also taking environmental factors into account. The first genotyping platform was based on traditional but improved restriction-fragment-length-polymorphism (RFLP) utilizing 384-microtiter well plates, multiplexing, small reaction volumes (5 µl), and automated genotype calling. We participated in the development of the second genotyping method, based on single nucleotide primer extension (SNuPeTM by Amersham Biosciences), by carrying out the alpha- and beta tests for the chemistry and the allele-calling software. Both techniques proved to be accurate, reliable, and suitable for projects with thousands of samples and tens of markers. Pooled genotyping (genotyping of pooled instead of individual DNA samples) was evaluated with Sequenom s MassArray MALDI-TOF, in addition to SNuPeTM and PCR-RFLP techniques. We used MassArray mainly as a point of comparison, because it is known to be well suited for pooled genotyping. All three methods were shown to be accurate, the standard deviations between measurements being 0.017 for the MassArray, 0.022 for the PCR-RFLP, and 0.026 for the SNuPeTM. The largest source of error in the process of pooled genotyping was shown to be the volumetric error, i.e., the preparation of pools. We also demonstrated that it would have been possible to narrow down the genetic locus underlying congenital chloride diarrhea (CLD), an autosomal recessive disorder, by using the pooling technique instead of genotyping individual samples. Although the approach seems to be well suited for traditional case-control studies, it is difficult to apply if any kind of stratification based on environmental factors is needed. Therefore we chose to continue with individual genotyping in the following association studies. Samples in the two separate large epidemiological cohorts were genotyped with the PCR-RFLP and SNuPeTM techniques. The first of these association studies concerned various pregnancy complications among 100,000 consecutive pregnancies in Finland, of which we genotyped 2292 patients and controls, in addition to a population sample of 644 blood donors, with 7 polymorphisms in the potentially thrombotic genes. In this thesis, the analysis of a sub-study of pregnancy-related venous thromboses was included. We showed that the impact of factor V Leiden polymorphism on pregnancy-related venous thrombosis, but not the other tested polymorphisms, was fairly large (odds ratio 11.6; 95% CI 3.6-33.6), and increased multiplicatively when combined with other risk factors such as obesity or advanced age. Owing to our study design, we were also able to estimate the risks at the population level. The second epidemiological cohort was the Helsinki Birth Cohort of men and women who were born during 1924-1933 in Helsinki. The aim was to identify genetic factors that might modify the well known link between small birth size and adult metabolic diseases, such as type 2 diabetes and impaired glucose tolerance. Among ~500 individuals with detailed birth measurements and current metabolic profile, we found that an insertion/deletion polymorphism of the angiotensin converting enzyme (ACE) gene was associated with the duration of gestation, and weight and length at birth. Interestingly, the ACE insertion allele was also associated with higher indices of insulin secretion (p=0.0004) in adult life, but only among individuals who were born small (those among the lowest third of birth weight). Likewise, low birth weight was associated with higher indices of insulin secretion (p=0.003), but only among carriers of the ACE insertion allele. The association with birth measurements was also found with a common haplotype of the glucocorticoid receptor (GR) gene. Furthermore, the association between short length at birth and adult impaired glucose tolerance was confined to carriers of this haplotype (p=0.007). These associations exemplify the interaction between environmental factors and genotype, which, possibly due to altered gene expression, predisposes to complex metabolic diseases. Indeed, we showed that the common GR gene haplotype associated with reduced mRNA expression in thymus of three individuals (p=0.0002).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Latent transforming growth factor-beta (TGF-beta) binding proteins (LTBPs) -1, -3 and -4 are ECM components whose major function is to augment the secretion and matrix targeting of TGF-beta, a multipotent cytokine. LTBP-2 does not bind small latent TGF-beta but has suggested functions as a structural protein in ECM microfibrils. In the current work we focused on analyzing possible adhesive functions of LTBP-2 as well as on characterizing the kinetics and regulation of LTBP-2 secretion and ECM deposition. We also explored the role of TGF-beta binding LTBPs in endothelial cells activated to mimic angiogenesis as well as in malignant mesothelioma. We found that, unlike most adherent cells, several melanoma cell lines efficiently adhered to purified recombinant LTBP-2. Further characterization revealed that the adhesion was mediated by alpha3beta1 and alpha6beta1 integrins. Heparin also inhibited the melanoma cell adhesion suggesting a role for heparan sulphate proteoglycans. LTBP-2 was also identified as a haptotactic substrate for melanoma cell migration. We used cultured human embryonic lung fibroblasts to analyze the temporal and spatial association of LTBP-2 into ECM. By We found that LTBP-2 was efficiently assembled to the ECM only in confluent cultures following the deposition of fibronectin (FN) and fibrillin-1. In early, subconfluent cultures it remained primarily in soluble form after secretion. LTBP-2 colocalized transiently with FN and fibrillin-1. Silencing of fibrillin-1 expression by lentiviral shRNAs profoundly disrupted the deposition of LTBP-2 indicating that the ECM association of LTBP-2 depends on a pre-formed fibrillin-1 network. Considering the established role of TGF-beta as a regulator of angiogenesis we induced morphological activation of endothelial cells by phorbol 12-myristate 13-acetate (PMA) and followed the fate of LTBP-1 in the endothelial ECM. This resulted in profound proteolytic processing of LTBP-1 and release of latent TGF-beta complexes from the ECM. The processing was coupled with increased activation of MT-MMPs and specific upregulation of MT1-MMP. The major role of MT1-MMP in the proteolysis of LTBP-1 was confirmed by suppressing the expression with lentivirally induced short-hairpin RNAs as well as by various metalloproteinases inhibitors. TGF-beta can promote tumorigenesis of malignant mesothelioma (MM), which is an aggressive tumor of the pleura with poor prognosis. TGF-beta activity was analyzed in a panel of MM tumors by immunohistochemical staining of phosphorylated Smad-2 (P-Smad2). The tumor cells were strongly positive for P-Smad2 whereas LTBP-1 immunoreactivity was abundant in the stroma, and there was a negative correlation between LTBP-1 and P-Smad2 staining. In addition, the high P-Smad2 immunoreactivity correlated with shorter survival of patients. mRNA analysis revealed that TGF-beta1 was the most highly expressed isoform in both normal human pleura and MM tissue. LTBP-1 and LTBP-3 were both abundantly expressed. LTBP-1 was the predominant isoform in established MM cell lines whereas the expression of LTBP-3 was high in control cells. Suppression of LTBP-3 expression by siRNAs resulted in increased TGF-beta activity in MM cell lines accompanied by decreased proliferation. Our results suggest that decreased expression of LTBP-3 in MM could alter the targeting of TGF-beta to the ECM and lead to its increased activation. The current work emphasizes the coordinated process of the assembly and appropriate targeting of LTBPs with distinct adhesive or cytokine harboring properties into the ECM. The hierarchical assembly may have implications in the modulation of signaling events during morphogenesis and tissue remodeling.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Helicobacter pylori infection is a risk factor for gastric cancer, which is a major health issue worldwide. Gastric cancer has a poor prognosis due to the unnoticeable progression of the disease and surgery is the only available treatment in gastric cancer. Therefore, gastric cancer patients would greatly benefit from identifying biomarker genes that would improve diagnostic and prognostic prediction and provide targets for molecular therapies. DNA copy number amplifications are the hallmarks of cancers in various anatomical locations. Mechanisms of amplification predict that DNA double-strand breaks occur at the margins of the amplified region. The first objective of this thesis was to identify the genes that were differentially expressed in H. pylori infection as well as the transcription factors and signal transduction pathways that were associated with the gene expression changes. The second objective was to identify putative biomarker genes in gastric cancer with correlated expression and copy number, and the last objective was to characterize cancers based on DNA copy number amplifications. DNA microarrays, an in vitro model and real-time polymerase chain reaction were used to measure gene expression changes in H. pylori infected AGS cells. In order to identify the transcription factors and signal transduction pathways that were activated after H. pylori infection, gene expression profiling data from the H. pylori experiments and a bioinformatics approach accompanied by experimental validation were used. Genome-wide expression and copy number microarray analysis of clinical gastric cancer samples and immunohistochemistry on tissue microarray were used to identify putative gastric cancer genes. Data mining and machine learning techniques were applied to study amplifications in a cross-section of cancers. FOS and various stress response genes were regulated by H. pylori infection. H. pylori regulated genes were enriched in the chromosomal regions that are frequently changed in gastric cancer, suggesting that molecular pathways of gastric cancer and premalignant H. pylori infection that induces gastritis are interconnected. 16 transcription factors were identified as being associated with H. pylori infection induced changes in gene expression. NF-κB transcription factor and p50 and p65 subunits were verified using elecrophoretic mobility shift assays. ERBB2 and other genes located in 17q12- q21 were found to be up-regulated in association with copy number amplification in gastric cancer. Cancers with similar cell type and origin clustered together based on the genomic localization of the amplifications. Cancer genes and large genes were co-localized with amplified regions and fragile sites, telomeres, centromeres and light chromosome bands were enriched at the amplification boundaries. H. pylori activated transcription factors and signal transduction pathways function in cellular mechanisms that might be capable of promoting carcinogenesis of the stomach. Intestinal and diffuse type gastric cancers showed distinct molecular genetic profiles. Integration of gene expression and copy number microarray data allowed the identification of genes that might be involved in gastric carcinogenesis and have clinical relevance. Gene amplifications were demonstrated to be non-random genomic instabilities. Cell lineage, properties of precursor stem cells, tissue microenvironment and genomic map localization of specific oncogenes define the site specificity of DNA amplifications, whereas labile genomic features define the structures of amplicons. These conclusions suggest that the definition of genomic changes in cancer is based on the interplay between the cancer cell and the tumor microenvironment.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Precipitation-induced runoff and leaching from milled peat mining mires by peat types: a comparative method for estimating the loading of water bodies during peat production. This research project in environmental geology has arisen out of an observed need to be able to predict more accurately the loading of watercourses with detrimental organic substances and nutrients from already existing and planned peat production areas, since the authorities capacity for insisting on such predictions covering the whole duration of peat production in connection with evaluations of environmental impact is at present highly limited. National and international decisions regarding monitoring of the condition of watercourses and their improvement and restoration require more sophisticated evaluation methods in order to be able to forecast watercourse loading and its environmental impacts at the stage of land-use planning and preparations for peat production.The present project thus set out from the premise that it would be possible on the basis of existing mire and peat data properties to construct estimates for the typical loading from production mires over the whole duration of their exploitation. Finland has some 10 million hectares of peatland, accounting for almost a third of its total area. Macroclimatic conditions have varied in the course of the Holocene growth and development of this peatland, and with them the habitats of the peat-forming plants. Temperatures and moisture conditions have played a significant role in determining the dominant species of mire plants growing there at any particular time, the resulting mire types and the accumulation and deposition of plant remains to form the peat. The above climatic, environmental and mire development factors, together with ditching, have contributed, and continue to contribute, to the existence of peat horizons that differ in their physical and chemical properties, leading to differences in material transport between peatlands in a natural state and mires that have been ditched or prepared for forestry and peat production. Watercourse loading from the ditching of mires or their use for peat production can have detrimental effects on river and lake environments and their recreational use, especially where oxygen-consuming organic solids and soluble organic substances and nutrients are concerned. It has not previously been possible, however, to estimate in advance the watercourse loading likely to arise from ditching and peat production on the basis of the characteristics of the peat in a mire, although earlier observations have indicated that watercourse loading from peat production can vary greatly and it has been suggested that differences in peat properties may be of significance in this. Sprinkling is used here in combination with simulations of conditions in a milled peat production area to determine the influence of the physical and chemical properties of milled peats in production mires on surface runoff into the drainage ditches and the concentrations of material in the runoff water. Sprinkling and extraction experiments were carried out on 25 samples of milled Carex (C) and Sphagnum (S) peat of humification grades H 2.5 8.5 with moisture content in the range 23.4 89% on commencement of the first sprinkling, which was followed by a second sprinkling 24 hours later. The water retention capacity of the peat was best, and surface runoff lowest, with Sphagnum and Carex peat samples of humification grades H 2.5 6 in the moisture content class 56 75%. On account of the hydrophobicity of dry peat, runoff increased in a fairly regular manner with drying of the sample from 55% to 24 30%. Runoff from the samples with an original moisture content over 55% increased by 63% in the second round of sprinkling relative to the first, as they had practically reached saturation point on the first occasion, while those with an original moisture content below 55% retained their high runoff in the second round, due to continued hydrophobicity. The well-humified samples (H 6.5 8.5) with a moisture content over 80% showed a low water retention capacity and high runoff in both rounds of sprinkling. Loading of the runoff water with suspended solids, total phosphorus and total nitrogen, and also the chemical oxygen demand (CODMn O2), varied greatly in the sprinkling experiment, depending on the peat type and degree of humification, but concentrations of the same substances in the two sprinklings were closely or moderately closely correlated and these correlations were significant. The concentrations of suspended solids in the runoff water observed in the simulations of a peat production area and the direct surface runoff from it into the drainage ditch system in response to rain (sprinkling intensity 1.27 mm/min) varied c. 60-fold between the degrees of humification in the case of the Carex peats and c. 150-fold for the Sphagnum peats, while chemical oxygen demand varied c. 30-fold and c. 50-fold, respectively, total phosphorus c. 60-fold and c. 66-fold, total nitrogen c. 65-fold and c. 195-fold and ammonium nitrogen c. 90-fold and c. 30-fold. The increases in concentrations in the runoff water were very closely correlated with increases in humification of the peat. The correlations of the concentrations measured in extraction experiments (48 h) with peat type and degree of humification corresponded to those observed in the sprinkler experiments. The resulting figures for the surface runoff from a peat production area into the drainage ditches simulated by means of sprinkling and material concentrations in the runoff water were combined with statistics on the mean extent of daily rainfall (0 67 mm) during the frost-free period of the year (May October) over an observation period of 30 years to yield typical annual loading figures (kg/ha) for suspended solids (SS), chemical oxygen demand of organic matter (CODmn O2), total phosphorus (tot. P) and total nitrogen (tot. N) entering the ditches with respect to milled Carex (C) and Sphagnum (S) peats of humification grades H 2.5 8.5. In order to calculate the loading of drainage ditches from a milled peat production mire with the aid of these annual comparative values (in kg/ha), information is required on the properties of the intended production mire and its peat. Once data are available on the area of the mire, its peat depth, peat types and their degrees of humification, dry matter content, calorific value and corresponding energy content, it is possible to produce mutually comparable estimates for individual mires with respect to the annual loading of the drainage ditch system and the surrounding watercourse for the whole service life of the production area, the duration of this service life, determinations of energy content and the amount of loading per unit of energy generated (kg/MWh). In the 8 mires in the Köyhäjoki basin, Central Ostrobothnia, taken as an example, the loading of suspended solids (SS) in the drainage ditch networks calculated on the basis of the typical values obtained here and existing mire and peat data and expressed per unit of energy generated varied between the mires and horizons in the range 0.9 16.5 kg/MWh. One of the aims of this work was to develop means of making better use of existing mire and peat data and the results of corings and other field investigations. In this respect combination of the typical loading values (kg/ha) obtained here for S, SC, CS and C peats and the various degrees of humification (H 2.5 8.5) with the above mire and peat data by means of a computer program for the acquisition and handling of such data would enable all the information currently available and that deposited in the system in the future to be used for defining watercourse loading estimates for mires and comparing them with the corresponding estimates of energy content. The intention behind this work has been to respond to the challenge facing the energy generation industry to find larger peat production areas that exert less loading on the environment and to that facing the environmental authorities to improve the means available for estimating watercourse loading from peat production and its environmental impacts in advance. The results conform well to the initial hypothesis and to the goals laid down for the research and should enable watercourse loading from existing and planned peat production to be evaluated better in the future and the resulting impacts to be taken into account when planning land use and energy generation. The advance loading information available in this way would be of value in the selection of individual peat production areas, the planning of their exploitation, the introduction of water protection measures and the planning of loading inspections, in order to achieve controlled peat production that pays due attention to environmental considerations.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The focus of this study is on statistical analysis of categorical responses, where the response values are dependent of each other. The most typical example of this kind of dependence is when repeated responses have been obtained from the same study unit. For example, in Paper I, the response of interest is the pneumococcal nasopharengyal carriage (yes/no) on 329 children. For each child, the carriage is measured nine times during the first 18 months of life, and thus repeated respones on each child cannot be assumed independent of each other. In the case of the above example, the interest typically lies in the carriage prevalence, and whether different risk factors affect the prevalence. Regression analysis is the established method for studying the effects of risk factors. In order to make correct inferences from the regression model, the associations between repeated responses need to be taken into account. The analysis of repeated categorical responses typically focus on regression modelling. However, further insights can also be gained by investigating the structure of the association. The central theme in this study is on the development of joint regression and association models. The analysis of repeated, or otherwise clustered, categorical responses is computationally difficult. Likelihood-based inference is often feasible only when the number of repeated responses for each study unit is small. In Paper IV, an algorithm is presented, which substantially facilitates maximum likelihood fitting, especially when the number of repeated responses increase. In addition, a notable result arising from this work is the freely available software for likelihood-based estimation of clustered categorical responses.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Telecommunications network management is based on huge amounts of data that are continuously collected from elements and devices from all around the network. The data is monitored and analysed to provide information for decision making in all operation functions. Knowledge discovery and data mining methods can support fast-pace decision making in network operations. In this thesis, I analyse decision making on different levels of network operations. I identify the requirements decision-making sets for knowledge discovery and data mining tools and methods, and I study resources that are available to them. I then propose two methods for augmenting and applying frequent sets to support everyday decision making. The proposed methods are Comprehensive Log Compression for log data summarisation and Queryable Log Compression for semantic compression of log data. Finally I suggest a model for a continuous knowledge discovery process and outline how it can be implemented and integrated to the existing network operations infrastructure.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Segmentation is a data mining technique yielding simplified representations of sequences of ordered points. A sequence is divided into some number of homogeneous blocks, and all points within a segment are described by a single value. The focus in this thesis is on piecewise-constant segments, where the most likely description for each segment and the most likely segmentation into some number of blocks can be computed efficiently. Representing sequences as segmentations is useful in, e.g., storage and indexing tasks in sequence databases, and segmentation can be used as a tool in learning about the structure of a given sequence. The discussion in this thesis begins with basic questions related to segmentation analysis, such as choosing the number of segments, and evaluating the obtained segmentations. Standard model selection techniques are shown to perform well for the sequence segmentation task. Segmentation evaluation is proposed with respect to a known segmentation structure. Applying segmentation on certain features of a sequence is shown to yield segmentations that are significantly close to the known underlying structure. Two extensions to the basic segmentation framework are introduced: unimodal segmentation and basis segmentation. The former is concerned with segmentations where the segment descriptions first increase and then decrease, and the latter with the interplay between different dimensions and segments in the sequence. These problems are formally defined and algorithms for solving them are provided and analyzed. Practical applications for segmentation techniques include time series and data stream analysis, text analysis, and biological sequence analysis. In this thesis segmentation applications are demonstrated in analyzing genomic sequences.