6 resultados para complex data
em Helda - Digital Repository of University of Helsinki
Resumo:
Bacteria play an important role in many ecological systems. The molecular characterization of bacteria using either cultivation-dependent or cultivation-independent methods reveals the large scale of bacterial diversity in natural communities, and the vastness of subpopulations within a species or genus. Understanding how bacterial diversity varies across different environments and also within populations should provide insights into many important questions of bacterial evolution and population dynamics. This thesis presents novel statistical methods for analyzing bacterial diversity using widely employed molecular fingerprinting techniques. The first objective of this thesis was to develop Bayesian clustering models to identify bacterial population structures. Bacterial isolates were identified using multilous sequence typing (MLST), and Bayesian clustering models were used to explore the evolutionary relationships among isolates. Our method involves the inference of genetic population structures via an unsupervised clustering framework where the dependence between loci is represented using graphical models. The population dynamics that generate such a population stratification were investigated using a stochastic model, in which homologous recombination between subpopulations can be quantified within a gene flow network. The second part of the thesis focuses on cluster analysis of community compositional data produced by two different cultivation-independent analyses: terminal restriction fragment length polymorphism (T-RFLP) analysis, and fatty acid methyl ester (FAME) analysis. The cluster analysis aims to group bacterial communities that are similar in composition, which is an important step for understanding the overall influences of environmental and ecological perturbations on bacterial diversity. A common feature of T-RFLP and FAME data is zero-inflation, which indicates that the observation of a zero value is much more frequent than would be expected, for example, from a Poisson distribution in the discrete case, or a Gaussian distribution in the continuous case. We provided two strategies for modeling zero-inflation in the clustering framework, which were validated by both synthetic and empirical complex data sets. We show in the thesis that our model that takes into account dependencies between loci in MLST data can produce better clustering results than those methods which assume independent loci. Furthermore, computer algorithms that are efficient in analyzing large scale data were adopted for meeting the increasing computational need. Our method that detects homologous recombination in subpopulations may provide a theoretical criterion for defining bacterial species. The clustering of bacterial community data include T-RFLP and FAME provides an initial effort for discovering the evolutionary dynamics that structure and maintain bacterial diversity in the natural environment.
Resumo:
In genetic epidemiology, population-based disease registries are commonly used to collect genotype or other risk factor information concerning affected subjects and their relatives. This work presents two new approaches for the statistical inference of ascertained data: a conditional and full likelihood approaches for the disease with variable age at onset phenotype using familial data obtained from population-based registry of incident cases. The aim is to obtain statistically reliable estimates of the general population parameters. The statistical analysis of familial data with variable age at onset becomes more complicated when some of the study subjects are non-susceptible, that is to say these subjects never get the disease. A statistical model for a variable age at onset with long-term survivors is proposed for studies of familial aggregation, using latent variable approach, as well as for prospective studies of genetic association studies with candidate genes. In addition, we explore the possibility of a genetic explanation of the observed increase in the incidence of Type 1 diabetes (T1D) in Finland in recent decades and the hypothesis of non-Mendelian transmission of T1D associated genes. Both classical and Bayesian statistical inference were used in the modelling and estimation. Despite the fact that this work contains five studies with different statistical models, they all concern data obtained from nationwide registries of T1D and genetics of T1D. In the analyses of T1D data, non-Mendelian transmission of T1D susceptibility alleles was not observed. In addition, non-Mendelian transmission of T1D susceptibility genes did not make a plausible explanation for the increase in T1D incidence in Finland. Instead, the Human Leucocyte Antigen associations with T1D were confirmed in the population-based analysis, which combines T1D registry information, reference sample of healthy subjects and birth cohort information of the Finnish population. Finally, a substantial familial variation in the susceptibility of T1D nephropathy was observed. The presented studies show the benefits of sophisticated statistical modelling to explore risk factors for complex diseases.
Resumo:
The kidney filtration barrier consists of fenestrated endothelial cell layer, glomerular basement membrane and slit diaphragm (SD), the specialized junction between glomerular viscelar epithelial cells (podocytes). Podocyte injury is associated with the development of proteinuria, and if not reversed the injury will lead to permanent deterioration of the glomerular filter. The early events are characterized by disruption of the integrity of the SD, but the molecular pathways involved are not fully understood. Congenital nephrotic syndrome of the Finnish type (CNF) is caused by mutations in NPHS1, the gene encoding the SD protein nephrin. Lack of nephrin results in loss of the SD and massive proteinuria beginning before birth. Furthermore, nephrin expression is decreased in acquired human kidney diseases including diabetic nephropathy. This highlights the importance of nephrin and consequently SD in regulating the kidney filtration function. However, the precise molecular mechanism of how nephrin is involved in the formation of the SD is unknown. This thesis work aimed at clarifying the role of nephrin and its interaction partners in the formation of the SD. The purpose was to identify novel proteins that associate with nephrin in order to define the essential molecular complex required for the establishment of the SD. The aim was also to decipher the role of novel nephrin interacting proteins in podocytes. Nephrin binds to nephrin-like proteins Neph1 and Neph2, and to adherens junction protein P-cadherin. These interactions have been suggested to play a role in the formation of the SD. In this thesis work, we identified densin as a novel interaction partner for nephrin. Densin was localized to the SD and it was shown to bind to adherens junction protein beta-catenin. Furthermore, densin was shown to behave in a similar fashion as adherens junction proteins in cell-cell contacts. These results indicate that densin may play a role in cell adhesion and, therefore, may contribute to the formation of the SD together with nephrin and adherens junction proteins. Nephrin was also shown to bind to Neph3, which has been previously localized to the SD. Neph3 and Neph1 were shown to induce cell adhesion alone, whereas nephrin needed to trans-interact with Neph1 or Neph3 from the opposite cell surface in order to make cell-cell contacts. This was associated with the decreased tyrosine phosphorylation of nephrin. These data extend the current knowledge of the molecular composition of the nephrin protein complex at the SD and also provide novel insights of how the SD may be formed. This thesis work also showed that densin was up-regulated in the podocytes of CNF patients. Neph3 was up-regulated in nephrin deficient mouse kidneys, which share similar podocyte alterations and lack of the SD as observed in CNF patients podocytes. These data suggest that densin and Neph3 may have a role in the formation of morphological alterations in podocytes detected in CNF patients. Furthermore, this thesis work showed that deletion of beta-catenin specifically from adult mouse podocytes protected the mice from the development of adriamycin-induced podocyte injury and proteinuria compared to wild-type mice. These results show that beta-catenin play a role in the adriamycin induced podocyte injury. Podocyte injury is a hallmark in many kidney diseases and the changes observed in the podocytes of CNF patient share characteristics with injured podocytes observed in chronic kidney diseases. Therefore, the results obtained in this thesis work suggest that densin, Neph3 and beta-catenin participate in the molecular pathways which result in morphological alterations commonly detected in injured podocytes in kidney diseases.
Resumo:
Cells of every living organism on our planet − bacterium, plant or animal − are organized in such a way that despite differences in structure and function they utilize the same metabolic energy represented by electrochemical proton gradient across a membrane. This gradient of protons is generated by the series of membrane bound multisubunit proteins, Complex I, II, III and IV, organized in so-called respiratory or electron transport chain. In the eukaryotic cell it locates in the inner mitochondrial membrane while in the bacterial cell it locates in the cytoplasmic membrane. The function of the respiratory chain is to accept electrons from NADH and ubiquinol and transfer them to oxygen resulting in the formation of water. The free energy released upon these redox reactions is converted by respiratory enzymes into an electrochemical proton gradient, which is used for synthesis of ATP as well as for many other energy dependent processes. This thesis is focused on studies of the first member of the respiratory chain − NADH:ubiquinone oxidoreductase or Complex I. This enzyme has a boot-shape structure with hydrophilic and hydrophobic domains, the former of which has all redox groups of the protein, the flavin and eight to nine iron-sulfur clusters. Complex I serves as a proton pump coupling transfer of two electrons from NADH to ubiquinone to the translocation of four protons across the membrane. So far the mechanism of energy transduction by Complex I is unknown. In the present study we applied a set of different methods to study the electron and proton transfer reactions in Complex I from Escherichia coli. The main achievement was the experiment that showed that the electron transfer through the hydrophilic domain of Complex I is unlikely to be coupled to proton transfer directly or to conformational changes in the protein. In this work for the first time properties of all redox centers of Complex I were characterized in the intact purified bacterial enzyme. We also probed the role of several conserved amino acid residues in the electron transfer of Complex I. Finally, we found that highly conserved amino acid residues in several membrane subunits form a common pattern with a very prominent feature – the presence of a few lysines within the membrane. Based on the experimental data, we suggested a tentative principle which may govern the redox-coupled proton pumping in Complex I.
Resumo:
The study of soil microbiota and their activities is central to the understanding of many ecosystem processes such as decomposition and nutrient cycling. The collection of microbiological data from soils generally involves several sequential steps of sampling, pretreatment and laboratory measurements. The reliability of results is dependent on reliable methods in every step. The aim of this thesis was to critically evaluate some central methods and procedures used in soil microbiological studies in order to increase our understanding of the factors that affect the measurement results and to provide guidance and new approaches for the design of experiments. The thesis focuses on four major themes: 1) soil microbiological heterogeneity and sampling, 2) storage of soil samples, 3) DNA extraction from soil, and 4) quantification of specific microbial groups by the most-probable-number (MPN) procedure. Soil heterogeneity and sampling are discussed as a single theme because knowledge on spatial (horizontal and vertical) and temporal variation is crucial when designing sampling procedures. Comparison of adjacent forest, meadow and cropped field plots showed that land use has a strong impact on the degree of horizontal variation of soil enzyme activities and bacterial community structure. However, regardless of the land use, the variation of microbiological characteristics appeared not to have predictable spatial structure at 0.5-10 m. Temporal and soil depth-related patterns were studied in relation to plant growth in cropped soil. The results showed that most enzyme activities and microbial biomass have a clear decreasing trend in the top 40 cm soil profile and a temporal pattern during the growing season. A new procedure for sampling of soil microbiological characteristics based on stratified sampling and pre-characterisation of samples was developed. A practical example demonstrated the potential of the new procedure to reduce the analysis efforts involved in laborious microbiological measurements without loss of precision. The investigation of storage of soil samples revealed that freezing (-20 °C) of small sample aliquots retains the activity of hydrolytic enzymes and the structure of the bacterial community in different soil matrices relatively well whereas air-drying cannot be recommended as a storage method for soil microbiological properties due to large reductions in activity. Freezing below -70 °C was the preferred method of storage for samples with high organic matter content. Comparison of different direct DNA extraction methods showed that the cell lysis treatment has a strong impact on the molecular size of DNA obtained and on the bacterial community structure detected. An improved MPN method for the enumeration of soil naphthalene degraders was introduced as an alternative to more complex MPN protocols or the DNA-based quantification approach. The main advantage of the new method is the simple protocol and the possibility to analyse a large number of samples and replicates simultaneously.
Resumo:
The purpose of this master´s thesis is to analyze how NATO Secretary General Anders Fogh Rasmussen is trying to justify the existence of the military alliance through the use of security arguments. I am puzzled by the question: why does NATO still exist – what is NATO’s raison d'être. The New Strategic Concept (2010) forms the base for his argumentation. This thesis focuses on the security argumentation of NATO which is examined by analyzing the speeches the Secretary General. The theoretical framework of this study is based on constructivist approach to international security examining the linguistic process of securitization. Issues become securitized after Anders Fogh Rasmussen names them as threats. This thesis focuses on the securitization process relating to NATO and analyses what issues Rasmussen raises to the security agenda. Research data consists of the speeches by Anders Fogh Rasmussen. They are analyzed through J.L. Austin’s speech act taxonomy and Chaïm Perelman’s argumentation theories. The thesis will concentrate on the formulation and articulation of these threats which are considered and coined as “new threats” in contemporary international relations. I am conducting this research through the use of securitization theory. This study illustrates that the threats are constructed by NATO’s member-states in unison, but the resolutions are sounded through Rasmussen’s official speeches and transcripts. . Based on the analysis it can be concluded that Rasmussen is giving reasons for the existence of NATO. This takes place by making use of speech acts and different rhetorical techniques. The results of the analysis indicate that NATO remains an essential organization for the West and the rest of the world according to the Secretary General.