9 resultados para complete linkage clustering

em Helda - Digital Repository of University of Helsinki


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Acquiring sufficient information on the genetic variation, genetic differentiation, and the ecological and genetic relationships among individuals and populations are essential for establishing guidelines on conservation and utilization of the genetic resources of a species, and more particularly when biotic and abiotic stresses are considered. The aim of this study was to assess the extent and pattern of genetic variation in date palm (Phoenix dacttylifera L) cultivars; the genetic diversity and structure in its populations occurring over geographical ranges; the variation in economically and botanically important traits of it and the variation in its drought adaptive traits, in conservation and utilization context. In this study, the genetic diversity and relationships among selected cultivars from Sudan and Morocco were assessed using microsatellite markers. Microsatellite markers were also used to investigate the genetic diversity within and among populations collected from different geographic locations in Sudan. In a separate investigation, fruits of cultivars selected from Sudan, involved morphological and chemical characterization, and morphological and DNA polymorphism of the mother trees were also investigated. Morphological and photosynthetic adjustments to water stress were studied in the five most important date palm cultivars in Sudan, namely, Gondaila, Barakawi, Bitamoda, Khateeb and Laggai; and the mechanism enhancing photosynthetic gas exchange in date palm under water stress was also investigated. Results showed a significant (p < 0.001, t-test) differentiation between Sudan and Morocco groups of cultivars. However, the major feature of all tested cultivars was the complete lack of clustering and the absence of cultivars representing specific clones. The results indicated high genetic as well as compositional and morphological diversity among cultivars; while, compositional and morphological traits were found to be characteristic features that strongly differentiate cultivars as well as phenotypes. High genetic diversity was observed also in different populations. Slight but significant (p < 0.01, AMOVA) divergence was observed for soft and dry types; however, the genetic divergence among populations was relatively weak. The results showed a complex genetic relationships between some of the tested populations especially when isolation by distance was considered. The results of the study also revealed that date palm cultivars and phenotypes possess specific direct or interaction effects due to water availability on a range of morphological and physiological traits. Soft and dry phenotypes responded differently to different levels of water stress, while the dry phenotype was more sensitive and conservative. The results indicated that date palm has high fixation capacity to photosynthetic CO2 supply with interaction effect to water availability, which can be considered as advantageous when coping with stresses that may arise with climate change. In conclusion, although a large amount of diversity exists among date palm germplasm, the findings in this study show that the role of biological nature of the tree, isolation by distance and environmental effects on structuring date palm genome was highly influenced by human impacts. Identity of date palm cultivars as developed and manipulated by date palm growers, in the absence of scientific breeding programmes, may continue to mainly depend on tree morphology and fruit characters. The pattern of genetic differentiation may cover specific morphological and physiological traits that contribute to adaptive mechanisms in each phenotype. These traits can be considered for further studies related to drought adaptation in date palm.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The multiplier ideals of an ideal in a regular local ring form a family of ideals parametrized by non-negative rational numbers. As the rational number increases the corresponding multiplier ideal remains unchanged until at some point it gets strictly smaller. A rational number where this kind of diminishing occurs is called a jumping number of the ideal. In this manuscript we shall give an explicit formula for the jumping numbers of a simple complete ideal in a two dimensional regular local ring. In particular, we obtain a formula for the jumping numbers of an analytically irreducible plane curve. We then show that the jumping numbers determine the equisingularity class of the curve.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The Minimum Description Length (MDL) principle is a general, well-founded theoretical formalization of statistical modeling. The most important notion of MDL is the stochastic complexity, which can be interpreted as the shortest description length of a given sample of data relative to a model class. The exact definition of the stochastic complexity has gone through several evolutionary steps. The latest instantation is based on the so-called Normalized Maximum Likelihood (NML) distribution which has been shown to possess several important theoretical properties. However, the applications of this modern version of the MDL have been quite rare because of computational complexity problems, i.e., for discrete data, the definition of NML involves an exponential sum, and in the case of continuous data, a multi-dimensional integral usually infeasible to evaluate or even approximate accurately. In this doctoral dissertation, we present mathematical techniques for computing NML efficiently for some model families involving discrete data. We also show how these techniques can be used to apply MDL in two practical applications: histogram density estimation and clustering of multi-dimensional data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Online content services can greatly benefit from personalisation features that enable delivery of content that is suited to each user's specific interests. This thesis presents a system that applies text analysis and user modeling techniques in an online news service for the purpose of personalisation and user interest analysis. The system creates a detailed thematic profile for each content item and observes user's actions towards content items to learn user's preferences. A handcrafted taxonomy of concepts, or ontology, is used in profile formation to extract relevant concepts from the text. User preference learning is automatic and there is no need for explicit preference settings or ratings from the user. Learned user profiles are segmented into interest groups using clustering techniques with the objective of providing a source of information for the service provider. Some theoretical background for chosen techniques is presented while the main focus is in finding practical solutions to some of the current information needs, which are not optimally served with traditional techniques.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The first part of this work investigates the molecular epidemiology of a human enterovirus (HEV), echovirus 30 (E-30). This project is part of a series of studies performed in our research team analyzing the molecular epidemiology of HEV-B viruses. A total of 129 virus strains had been isolated in different parts of Europe. The sequence analysis was performed in three different genomic regions: 420 nucleotides (nt) in the VP4/VP2 capsid protein coding region, the entire VP1 capsid protein coding gene of 876 nt, and 150 nt in the VP1/2A junction region. The analysis revealed a succession of dominant sublineages within a major genotype. The temporally earlier genotypes had been replaced by a genetically homogenous lineage that has been circulating in Europe since the late 1970s. The same genotype was found by other research groups in North America and Australia. Globally, other cocirculating genetic lineages also exist. The prevalence of a dominant genotype makes E-30 different from other previously studied HEVs, such as polioviruses and coxsackieviruses B4 and B5, for which several coexisting genetic lineages have been reported. The second part of this work deals with molecular epidemiology of human rhinoviruses (HRVs). A total of 61 field isolates were studied in the 420-nt stretch in the capsid coding region of VP4/VP2. The isolates were collected from children under two years of age in Tampere, Finland. Sequences from the clinical isolates clustered in the two previously known phylogenetic clades. Seasonal clustering was found. Also, several distinct serotype-like clusters were found to co-circulate during the same epidemic season. Reappearance of a cluster after disappearing for a season was observed. The molecular epidemiology of the analyzed strains turned out to be complex, and we decided to continue our studies of HRV. Only five previously published complete genome sequences of HRV prototype strains were available for analysis. Therefore, all designated HRV prototype strains (n=102) were sequenced in the VP4/VP2 region, and the possibility of genetic typing of HRV was evaluated. Seventy-six of the 102 prototype strains clustered in HRV genetic group A (HRV-A) and 25 in group B (HRV-B). Serotype 87 clustered separately from other HRVs with HEV species D. The field strains of HRV represented as many as 19 different genotypes, as judged with an approximate demarcation of a 20% nt difference in the VP4/VP2 region. The interserotypic differences of HRV were generally similar to those reported between different HEV serotypes (i.e. about 20%), but smaller differences, less than 10%, were also observed. Because some HRV serotypes are genetically so closely related, we suggest that the genetic typing be performed using the criterion "the closest prototype strain". This study is the first systematic genetic characterization of all known HRV prototype strains, providing a further taxonomic proposal for classification of HRV. We proposed to divide the genus Human rhinoviruses into HRV-A and HRV-B. The final part of the work comprises a phylogenetic analysis of a subset (48) of HRV prototype strains and field isolates (12) in the nonstructural part of the genome coding for the RNA-dependent RNA polymerase (3D). The proposed division of the HRV strains in the species HRV-A and HRV-B was also supported by 3D region. HRV-B clustered closer to HEV species B, C, and also to polioviruses than to HRV-A. Intraspecies variation within both HRV-A and HRV-B was greater in the 3D coding region than in the VP4/VP2 coding region, in contrast to HEV. Moreover, the diversity of HRV in 3D exceeded that of HEV. One group of HRV-A, designated HRV-A', formed a separate cluster outside other HRV-A in the 3D region. It formed a cluster also in the capsid region, but located within HRV-A. This may reflect a different evolutionary history of distinct genomic regions among HRV-A. Furthermore, the tree topology within HRV-A in the 3D region differed from that in the VP4/VP2, suggesting possible recombination events in the evolution of the strains. No conflicting phylogenies were observed in any of the 12 field isolates. Possible recombination was further studied using the Similarity and Bootscanning analyses of the complete genome sequences of HRV available in public databases. Evidence for recombination among HRV-A was found, as HRV2 and HRV39 showed higher similarity in the nonstructural part of the genome. Whether HRV2 and HRV39 strains - and perhaps also some other HRV-A strains not yet completely sequenced - are recombinants remains to be determined.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Bipolar disorder (BP) is a complex psychiatric disorder characterized by episodes of mania and depression. BP affects approximately 1% of the world’s population and shows no difference in lifetime prevalence between males and females. BP arises from complex interactions among genetic, developmental and environmental factors, and it is likely that several predisposing genes are involved in BP. The genetic background of BP is still poorly understood, although intensive and long-lasting research has identified several chromosomal regions and genes involved in susceptibility to BP. This thesis work aims to identify the genetic variants that influence bipolar disorder in the Finnish population by candidate gene and genome-wide linkage analyses in families with many BP cases. In addition to diagnosis-based phenotypes, neuropsychological traits that can be seen as potential endophenotypes or intermediate traits for BP were analyzed. In the first part of the thesis, we examined the role of the allelic variants of the TSNAX/DISC1 gene cluster to psychotic and bipolar spectrum disorders and found association of distinct allelic haplotypes with these two groups of disorders. The haplotype at the 5’ end of the Disrupted-in-Schizophrenia-1 gene (DISC1) was over-transmitted to males with psychotic disorder (p = 0.008; for an extended haplotype p = 0.0007 with both genders), whereas haplotypes at the 3’ end of DISC1 associated with bipolar spectrum disorder (p = 0.0002; for an extended haplotype p = 0.0001). The variants of these haplotypes also showed association with different cognitive traits. The haplotypes at the 5’ end associated with perseverations and auditory attention, while the variants at the 3’ end associated with several cognitive traits including verbal fluency and psychomotor processing speed. Second, in our complete set of BP families with 723 individuals we studied six functional candidate genes from three distinct signalling systems: serotonin-related genes (SLC6A4 and TPH2), BDNF -related genes (BDNF, CREB1 and NTRK2) and one gene related to the inflammation and cytokine system (P2RX7). We replicated association of the functional variant Val66Met of BDNF with BP and better performance in retention. The variants at the 5’ end of SLC6A4 also showed some evidence of association among males (p = 0.004), but the widely studied functional variants did not yield any significant results. A protective four-variant haplotype on P2RX7 showed evidence of association with BP and executive functions: semantic and phonemic fluency (p = 0.006 and p = 0.0003, respectively). Third, we analyzed 23 bipolar families originating from the North-Eastern region of Finland. A genome-wide scan was performed using the 6K single nucleotide polymorphism (SNP) array. We identified susceptibility loci at chromosomes 7q31 with a LOD score of 3.20 and at 9p13.1 with a LOD score of 4.02. We followed up both linkage findings in the complete set of 179 Finnish bipolar families. The finding on chromosome 9p13 was supported (maximum LOD score of 3.02), but the susceptibility gene itself remains unclarified. In the fourth part of the thesis, we wanted to test the role of the allelic variants that have associated with bipolar disorder in recent genome-wide association studies (GWAS). We could confirm findings for the DFNB31, SORCS2, SCL39A3, and DGKH genes. The best signal in this study comes from DFNB31, which remained significant after multiple testing corrections. Two variants of SORCS2 were allelic replications and presented the same signal as the haplotype analysis. However, no association was detected with the PALB2 gene, which was the most significantly associated region in the previous GWAS. Our results indicate that BP is heterogeneous and its genetic background may accordingly vary in different populations. In order to fully understand the allelic heterogeneity that underlies common diseases such as BP, complete genome sequencing for many individuals with and without the disease is required. Identification of the specific risk variants will help us better understand the pathophysiology underlying BP and will lead to the development of treatments with specific biochemical targets. In addition, it will further facilitate the identification of environmental factors that alter risk, which will potentially provide improved occupational, social and psychological advice for individuals with high risk of BP.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper investigates the clustering pattern in the Finnish stock market. Using trading volume and time as factors capturing the clustering pattern in the market, the Keim and Madhavan (1996) and the Engle and Russell (1998) model provide the framework for the analysis. The descriptive and the parametric analysis provide evidences that an important determinant of the famous U-shape pattern in the market is the rate of information arrivals as measured by large trading volumes and durations at the market open and close. Precisely, 1) the larger the trading volume, the greater the impact on prices both in the short and the long run, thus prices will differ across quantities. 2) Large trading volume is a non-linear function of price changes in the long run. 3) Arrival times are positively autocorrelated, indicating a clustering pattern and 4) Information arrivals as approximated by durations are negatively related to trading flow.