52 resultados para K-Means clustering

em Université de Lausanne, Switzerland


Relevância:

100.00% 100.00%

Publicador:

Resumo:

*This study reconstructs the phylogeography of Aegilops geniculata, an allotetraploid relative of wheat, to discuss the impact of past climate changes and recent human activities (e.g. the early expansion of agriculture) on the genetic diversity of ruderal plant species. *We combined chloroplast DNA (cpDNA) sequencing, analysed using statistical parsimony network, with nonhierarchical K-means clustering of amplified fragment length polymorphism (AFLP) genotyping, to unravel patterns of genetic structure across the native range of Ae. geniculata. The AFLP dataset was further explored by measurement of the regional genetic diversity and the detection of isolation by distance patterns. *Both cpDNA and AFLP suggest an eastern Mediterranean origin of Ae. geniculata. Two lineages have spread independently over northern and southern Mediterranean areas. Northern populations show low genetic diversity but strong phylogeographical structure among the main peninsulas, indicating a major influence of glacial cycles. By contrast, low genetic structuring and a high genetic diversity are detected in southern Mediterranean populations. Finally, we highlight human-mediated dispersal resulting in substantial introgression between resident and migrant populations. *We have shown that the evolutionary trajectories of ruderal plants can be similar to those of wild species, but are interfered by human activities, promoting range expansions through increased long-distance dispersal and the creation of suitable habitats.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

OBJECTIVE: In contrast to conventional (CONV) neuromuscular electrical stimulation (NMES), the use of "wide-pulse, high-frequencies" (WPHF) can generate higher forces than expected by the direct activation of motor axons alone. We aimed at investigating the occurrence, magnitude, variability and underlying neuromuscular mechanisms of these "Extra Forces" (EF). METHODS: Electrically-evoked isometric plantar flexion force was recorded in 42 healthy subjects. Additionally, twitch potentiation, H-reflex and M-wave responses were assessed in 13 participants. CONV (25Hz, 0.05ms) and WPHF (100Hz, 1ms) NMES consisted of five stimulation trains (20s on-90s off). RESULTS: K-means clustering analysis disclosed a responder rate of almost 60%. Within this group of responders, force significantly increased from 4% to 16% of the maximal voluntary contraction force and H-reflexes were depressed after WPHF NMES. In contrast, non-responders showed neither EF nor H-reflex depression. Twitch potentiation and resting EMG data were similar between groups. Interestingly, a large inter- and intrasubject variability of EF was observed. CONCLUSION: The responder percentage was overestimated in previous studies. SIGNIFICANCE: This study proposes a novel methodological framework for unraveling the neurophysiological mechanisms involved in EF and provides further evidence for a central contribution to EF in responders.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Conventional (CONV) neuromuscular electrical stimulation (NMES) (i.e., short pulse duration, low frequencies) induces a higher energetic response as compared to voluntary contractions (VOL). In contrast, wide-pulse, high-frequency (WPHF) NMES might elicit-at least in some subjects (i.e., responders)-a different motor unit recruitment compared to CONV that resembles the physiological muscle activation pattern of VOL. We therefore hypothesized that for these responder subjects, the metabolic demand of WPHF would be lower than CONV and comparable to VOL. 18 healthy subjects performed isometric plantar flexions at 10% of their maximal voluntary contraction force for CONV (25 Hz, 0.05 ms), WPHF (100 Hz, 1 ms) and VOL protocols. For each protocol, force time integral (FTI) was quantified and subjects were classified as responders and non-responders to WPHF based on k-means clustering analysis. Furthermore, a fatigue index based on FTI loss at the end of each protocol compared with the beginning of the protocol was calculated. Phosphocreatine depletion (ΔPCr) was assessed using 31P magnetic resonance spectroscopy. Responders developed four times higher FTI's during WPHF (99 ± 37 ×103 N.s) than non-responders (26 ± 12 ×103 N.s). For both responders and non-responders, CONV was metabolically more demanding than VOL when ΔPCr was expressed relative to the FTI. Only for the responder group, the ∆PCr/FTI ratio of WPHF (0.74 ± 0.19 M/N.s) was significantly lower compared to CONV (1.48 ± 0.46 M/N.s) but similar to VOL (0.65 ± 0.21 M/N.s). Moreover, the fatigue index was not different between WPHF (-16%) and CONV (-25%) for the responders. WPHF could therefore be considered as the less demanding NMES modality-at least in this subgroup of subjects-by possibly exhibiting a muscle activation pattern similar to VOL contractions.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Abstract: To cluster textual sequence types (discourse types/modes) in French texts, K-means algorithm with high-dimensional embeddings and fuzzy clustering algorithm were applied on clauses whose POS (part-ofspeech) n-gram profiles were previously extracted. Uni-, bi- and trigrams were used on four 19th century French short stories by Maupassant. For high-dimensional embeddings, power transformations on the chi-squared distances between clauses were explored. Preliminary results show that highdimensional embeddings improve the quality of clustering, contrasting the use of bi and trigrams whose performance is disappointing, possibly because of feature space sparsity.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The in situ hybridization Allen Mouse Brain Atlas was mined for proteases expressed in the somatosensory cerebral cortex. Among the 480 genes coding for protease/peptidases, only four were found enriched in cortical interneurons: Reln coding for reelin; Adamts8 and Adamts15 belonging to the class of metzincin proteases involved in reshaping the perineuronal net (PNN) and Mme encoding for Neprilysin, the enzyme degrading amyloid β-peptides. The pattern of expression of metalloproteases (MPs) was analyzed by single-cell reverse transcriptase multiplex PCR after patch clamp and was compared with the expression of 10 canonical interneurons markers and 12 additional genes from the Allen Atlas. Clustering of these genes by K-means algorithm displays five distinct clusters. Among these five clusters, two fast-spiking interneuron clusters expressing the calcium-binding protein Pvalb were identified, one co-expressing Pvalb with Sst (PV-Sst) and another co-expressing Pvalb with three metallopeptidases Adamts8, Adamts15 and Mme (PV-MP). By using Wisteria floribunda agglutinin, a specific marker for PNN, PV-MP interneurons were found surrounded by PNN, whereas the ones expressing Sst, PV-Sst, were not.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

General clustering deals with weighted objects and fuzzy memberships. We investigate the group- or object-aggregation-invariance properties possessed by the relevant functionals (effective number of groups or objects, centroids, dispersion, mutual object-group information, etc.). The classical squared Euclidean case can be generalized to non-Euclidean distances, as well as to non-linear transformations of the memberships, yielding the c-means clustering algorithm as well as two presumably new procedures, the convex and pairwise convex clustering. Cluster stability and aggregation-invariance of the optimal memberships associated to the various clustering schemes are examined as well.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

PURPOSE: According to estimations around 230 people die as a result of radon exposure in Switzerland. This public health concern makes reliable indoor radon prediction and mapping methods necessary in order to improve risk communication to the public. The aim of this study was to develop an automated method to classify lithological units according to their radon characteristics and to develop mapping and predictive tools in order to improve local radon prediction. METHOD: About 240 000 indoor radon concentration (IRC) measurements in about 150 000 buildings were available for our analysis. The automated classification of lithological units was based on k-medoids clustering via pair-wise Kolmogorov distances between IRC distributions of lithological units. For IRC mapping and prediction we used random forests and Bayesian additive regression trees (BART). RESULTS: The automated classification groups lithological units well in terms of their IRC characteristics. Especially the IRC differences in metamorphic rocks like gneiss are well revealed by this method. The maps produced by random forests soundly represent the regional difference of IRCs in Switzerland and improve the spatial detail compared to existing approaches. We could explain 33% of the variations in IRC data with random forests. Additionally, the influence of a variable evaluated by random forests shows that building characteristics are less important predictors for IRCs than spatial/geological influences. BART could explain 29% of IRC variability and produced maps that indicate the prediction uncertainty. CONCLUSION: Ensemble regression trees are a powerful tool to model and understand the multidimensional influences on IRCs. Automatic clustering of lithological units complements this method by facilitating the interpretation of radon properties of rock types. This study provides an important element for radon risk communication. Future approaches should consider taking into account further variables like soil gas radon measurements as well as more detailed geological information.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Axée dans un premier temps sur le formalisme et les méthodes, cette thèse est construite sur trois concepts formalisés: une table de contingence, une matrice de dissimilarités euclidiennes et une matrice d'échange. À partir de ces derniers, plusieurs méthodes d'Analyse des données ou d'apprentissage automatique sont exprimées et développées: l'analyse factorielle des correspondances (AFC), vue comme un cas particulier du multidimensional scaling; la classification supervisée, ou non, combinée aux transformations de Schoenberg; et les indices d'autocorrélation et d'autocorrélation croisée, adaptés à des analyses multivariées et permettant de considérer diverses familles de voisinages. Ces méthodes débouchent dans un second temps sur une pratique de l'analyse exploratoire de différentes données textuelles et musicales. Pour les données textuelles, on s'intéresse à la classification automatique en types de discours de propositions énoncées, en se basant sur les catégories morphosyntaxiques (CMS) qu'elles contiennent. Bien que le lien statistique entre les CMS et les types de discours soit confirmé, les résultats de la classification obtenus avec la méthode K- means, combinée à une transformation de Schoenberg, ainsi qu'avec une variante floue de l'algorithme K-means, sont plus difficiles à interpréter. On traite aussi de la classification supervisée multi-étiquette en actes de dialogue de tours de parole, en se basant à nouveau sur les CMS qu'ils contiennent, mais aussi sur les lemmes et le sens des verbes. Les résultats obtenus par l'intermédiaire de l'analyse discriminante combinée à une transformation de Schoenberg sont prometteurs. Finalement, on examine l'autocorrélation textuelle, sous l'angle des similarités entre diverses positions d'un texte, pensé comme une séquence d'unités. En particulier, le phénomène d'alternance de la longueur des mots dans un texte est observé pour des voisinages d'empan variable. On étudie aussi les similarités en fonction de l'apparition, ou non, de certaines parties du discours, ainsi que les similarités sémantiques des diverses positions d'un texte. Concernant les données musicales, on propose une représentation d'une partition musicale sous forme d'une table de contingence. On commence par utiliser l'AFC et l'indice d'autocorrélation pour découvrir les structures existant dans chaque partition. Ensuite, on opère le même type d'approche sur les différentes voix d'une partition, grâce à l'analyse des correspondances multiples, dans une variante floue, et à l'indice d'autocorrélation croisée. Qu'il s'agisse de la partition complète ou des différentes voix qu'elle contient, des structures répétées sont effectivement détectées, à condition qu'elles ne soient pas transposées. Finalement, on propose de classer automatiquement vingt partitions de quatre compositeurs différents, chacune représentée par une table de contingence, par l'intermédiaire d'un indice mesurant la similarité de deux configurations. Les résultats ainsi obtenus permettent de regrouper avec succès la plupart des oeuvres selon leur compositeur.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

It is estimated that around 230 people die each year due to radon (222Rn) exposure in Switzerland. 222Rn occurs mainly in closed environments like buildings and originates primarily from the subjacent ground. Therefore it depends strongly on geology and shows substantial regional variations. Correct identification of these regional variations would lead to substantial reduction of 222Rn exposure of the population based on appropriate construction of new and mitigation of already existing buildings. Prediction of indoor 222Rn concentrations (IRC) and identification of 222Rn prone areas is however difficult since IRC depend on a variety of different variables like building characteristics, meteorology, geology and anthropogenic factors. The present work aims at the development of predictive models and the understanding of IRC in Switzerland, taking into account a maximum of information in order to minimize the prediction uncertainty. The predictive maps will be used as a decision-support tool for 222Rn risk management. The construction of these models is based on different data-driven statistical methods, in combination with geographical information systems (GIS). In a first phase we performed univariate analysis of IRC for different variables, namely the detector type, building category, foundation, year of construction, the average outdoor temperature during measurement, altitude and lithology. All variables showed significant associations to IRC. Buildings constructed after 1900 showed significantly lower IRC compared to earlier constructions. We observed a further drop of IRC after 1970. In addition to that, we found an association of IRC with altitude. With regard to lithology, we observed the lowest IRC in sedimentary rocks (excluding carbonates) and sediments and the highest IRC in the Jura carbonates and igneous rock. The IRC data was systematically analyzed for potential bias due to spatially unbalanced sampling of measurements. In order to facilitate the modeling and the interpretation of the influence of geology on IRC, we developed an algorithm based on k-medoids clustering which permits to define coherent geological classes in terms of IRC. We performed a soil gas 222Rn concentration (SRC) measurement campaign in order to determine the predictive power of SRC with respect to IRC. We found that the use of SRC is limited for IRC prediction. The second part of the project was dedicated to predictive mapping of IRC using models which take into account the multidimensionality of the process of 222Rn entry into buildings. We used kernel regression and ensemble regression tree for this purpose. We could explain up to 33% of the variance of the log transformed IRC all over Switzerland. This is a good performance compared to former attempts of IRC modeling in Switzerland. As predictor variables we considered geographical coordinates, altitude, outdoor temperature, building type, foundation, year of construction and detector type. Ensemble regression trees like random forests allow to determine the role of each IRC predictor in a multidimensional setting. We found spatial information like geology, altitude and coordinates to have stronger influences on IRC than building related variables like foundation type, building type and year of construction. Based on kernel estimation we developed an approach to determine the local probability of IRC to exceed 300 Bq/m3. In addition to that we developed a confidence index in order to provide an estimate of uncertainty of the map. All methods allow an easy creation of tailor-made maps for different building characteristics. Our work is an essential step towards a 222Rn risk assessment which accounts at the same time for different architectural situations as well as geological and geographical conditions. For the communication of 222Rn hazard to the population we recommend to make use of the probability map based on kernel estimation. The communication of 222Rn hazard could for example be implemented via a web interface where the users specify the characteristics and coordinates of their home in order to obtain the probability to be above a given IRC with a corresponding index of confidence. Taking into account the health effects of 222Rn, our results have the potential to substantially improve the estimation of the effective dose from 222Rn delivered to the Swiss population.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The long term goal of this research is to develop a program able to produce an automatic segmentation and categorization of textual sequences into discourse types. In this preliminary contribution, we present the construction of an algorithm which takes a segmented text as input and attempts to produce a categorization of sequences, such as narrative, argumentative, descriptive and so on. Also, this work aims at investigating a possible convergence between the typological approach developed in particular in the field of text and discourse analysis in French by Adam (2008) and Bronckart (1997) and unsupervised statistical learning.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Positron emission tomography with [18F] fluorodeoxyglucose (FDG-PET) plays a well-established role in assisting early detection of frontotemporal lobar degeneration (FTLD). Here, we examined the impact of intensity normalization to different reference areas on accuracy of FDG-PET to discriminate between patients with mild FTLD and healthy elderly subjects. FDG-PET was conducted at two centers using different acquisition protocols: 41 FTLD patients and 42 controls were studied at center 1, 11 FTLD patients and 13 controls were studied at center 2. All PET images were intensity normalized to the cerebellum, primary sensorimotor cortex (SMC), cerebral global mean (CGM), and a reference cluster with most preserved FDG uptake in the aforementioned patients group of center 1. Metabolic deficits in the patient group at center 1 appeared 1.5, 3.6, and 4.6 times greater in spatial extent, when tracer uptake was normalized to the reference cluster rather than to the cerebellum, SMC, and CGM, respectively. Logistic regression analyses based on normalized values from FTLD-typical regions showed that at center 1, cerebellar, SMC, CGM, and cluster normalizations differentiated patients from controls with accuracies of 86%, 76%, 75% and 90%, respectively. A similar order of effects was found at center 2. Cluster normalization leads to a significant increase of statistical power in detecting early FTLD-associated metabolic deficits. The established FTLD-specific cluster can be used to improve detection of FTLD on a single case basis at independent centers - a decisive step towards early diagnosis and prediction of FTLD syndromes enabling specific therapies in the future.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

OBJECTIVE: We investigated whether the INTERMED, a generic instrument for assessing biopsychosocial case complexity and direct care, identifies organ transplant patients at risk of unfavourable post-transplant development by comparing it to the Transplant Evaluation Rating Scale (TERS), the established measure for pretransplant psychosocial evaluation. METHOD: One hundred nineteen kidney, liver, and heart transplant candidates were evaluated using the INTERMED, TERS, SF-36, EuroQol, Montgomery-Åsberg Depression Rating Scale (MADRS), and Hospital Anxiety & Depression Scale (HADS). RESULTS: We found significant relationships between the INTERMED and the TERS scores. The INTERMED highly correlated with the HADS,MADRS, and mental and physical health scores of the SF-36 Health Survey. CONCLUSIONS: The results demonstrate the validity and usefulness of the INTERMED instrument for pretransplant evaluation. Furthermore, our findings demonstrate the different qualities of INTERMED and TERS in clinical practice. The advantages of the psychiatric focus of the TERS and the biopsychosocial perspective of the INTERMED are discussed in the context of current literature on integrated care.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Polochic and Motagua faults define the active plate boundary between the North American and Caribbean plates in central Guatemala. A splay of the Polochic Fault traverses the rapidly growing city of San Miguel Uspantan that is periodically affected by destructive earthquakes. This fault splay was located using a 2D electrical resistivity tomography (ERT) survey that also characterized the fault damage zone and evaluated the thickness and nature of recent deposits upon which most of the city is built. ERT images show the fault as a similar to 50 m wide, near-vertical low-resistivity anomaly, bounded within a few meters by high resistivity anomalies. Forward modeling reproduces the key aspects of the observed electrical resistivity data with remarkable fidelity thus defining the overall location, geometry, and internal structure of the fault zone as well as the affected lithologies. Our results indicate that the city is constructed on a similar to 20 m thick surficial layer consisting of poorly consolidated, highly porous, water-logged pumice. This soft layer is likely to amplify seismic waves and to liquefy upon moderate to strong ground shaking. The electrical conductivity as well as the major element chemistry of the groundwater provides evidence to suggest that the local aquifer might, at least in part, be fed by water rising along the fault. Therefore, the potential threat posed by this fault splay may not be limited to its seismic activity per se, but could be compounded its potential propensity to enhance seismic site effects by injecting water into the soft surficial sediments. The results of this study provide the basis for a rigorous analysis of seismic hazard and sustainable development of San Miguel Uspantan and illustrate the potential of ERT surveying for paleoseismic studies.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

PURPOSE: To objectively characterize different heart tissues from functional and viability images provided by composite-strain-encoding (C-SENC) MRI. MATERIALS AND METHODS: C-SENC is a new MRI technique for simultaneously acquiring cardiac functional and viability images. In this work, an unsupervised multi-stage fuzzy clustering method is proposed to identify different heart tissues in the C-SENC images. The method is based on sequential application of the fuzzy c-means (FCM) and iterative self-organizing data (ISODATA) clustering algorithms. The proposed method is tested on simulated heart images and on images from nine patients with and without myocardial infarction (MI). The resulting clustered images are compared with MRI delayed-enhancement (DE) viability images for determining MI. Also, Bland-Altman analysis is conducted between the two methods. RESULTS: Normal myocardium, infarcted myocardium, and blood are correctly identified using the proposed method. The clustered images correctly identified 90 +/- 4% of the pixels defined as infarct in the DE images. In addition, 89 +/- 5% of the pixels defined as infarct in the clustered images were also defined as infarct in DE images. The Bland-Altman results show no bias between the two methods in identifying MI. CONCLUSION: The proposed technique allows for objectively identifying divergent heart tissues, which would be potentially important for clinical decision-making in patients with MI.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present a new framework for large-scale data clustering. The main idea is to modify functional dimensionality reduction techniques to directly optimize over discrete labels using stochastic gradient descent. Compared to methods like spectral clustering our approach solves a single optimization problem, rather than an ad-hoc two-stage optimization approach, does not require a matrix inversion, can easily encode prior knowledge in the set of implementable functions, and does not have an ?out-of-sample? problem. Experimental results on both artificial and real-world datasets show the usefulness of our approach.