20 resultados para K-Means Cluster
em Université de Lausanne, Switzerland
Resumo:
Abstract: To cluster textual sequence types (discourse types/modes) in French texts, K-means algorithm with high-dimensional embeddings and fuzzy clustering algorithm were applied on clauses whose POS (part-ofspeech) n-gram profiles were previously extracted. Uni-, bi- and trigrams were used on four 19th century French short stories by Maupassant. For high-dimensional embeddings, power transformations on the chi-squared distances between clauses were explored. Preliminary results show that highdimensional embeddings improve the quality of clustering, contrasting the use of bi and trigrams whose performance is disappointing, possibly because of feature space sparsity.
Resumo:
The in situ hybridization Allen Mouse Brain Atlas was mined for proteases expressed in the somatosensory cerebral cortex. Among the 480 genes coding for protease/peptidases, only four were found enriched in cortical interneurons: Reln coding for reelin; Adamts8 and Adamts15 belonging to the class of metzincin proteases involved in reshaping the perineuronal net (PNN) and Mme encoding for Neprilysin, the enzyme degrading amyloid β-peptides. The pattern of expression of metalloproteases (MPs) was analyzed by single-cell reverse transcriptase multiplex PCR after patch clamp and was compared with the expression of 10 canonical interneurons markers and 12 additional genes from the Allen Atlas. Clustering of these genes by K-means algorithm displays five distinct clusters. Among these five clusters, two fast-spiking interneuron clusters expressing the calcium-binding protein Pvalb were identified, one co-expressing Pvalb with Sst (PV-Sst) and another co-expressing Pvalb with three metallopeptidases Adamts8, Adamts15 and Mme (PV-MP). By using Wisteria floribunda agglutinin, a specific marker for PNN, PV-MP interneurons were found surrounded by PNN, whereas the ones expressing Sst, PV-Sst, were not.
Resumo:
*This study reconstructs the phylogeography of Aegilops geniculata, an allotetraploid relative of wheat, to discuss the impact of past climate changes and recent human activities (e.g. the early expansion of agriculture) on the genetic diversity of ruderal plant species. *We combined chloroplast DNA (cpDNA) sequencing, analysed using statistical parsimony network, with nonhierarchical K-means clustering of amplified fragment length polymorphism (AFLP) genotyping, to unravel patterns of genetic structure across the native range of Ae. geniculata. The AFLP dataset was further explored by measurement of the regional genetic diversity and the detection of isolation by distance patterns. *Both cpDNA and AFLP suggest an eastern Mediterranean origin of Ae. geniculata. Two lineages have spread independently over northern and southern Mediterranean areas. Northern populations show low genetic diversity but strong phylogeographical structure among the main peninsulas, indicating a major influence of glacial cycles. By contrast, low genetic structuring and a high genetic diversity are detected in southern Mediterranean populations. Finally, we highlight human-mediated dispersal resulting in substantial introgression between resident and migrant populations. *We have shown that the evolutionary trajectories of ruderal plants can be similar to those of wild species, but are interfered by human activities, promoting range expansions through increased long-distance dispersal and the creation of suitable habitats.
Resumo:
Axée dans un premier temps sur le formalisme et les méthodes, cette thèse est construite sur trois concepts formalisés: une table de contingence, une matrice de dissimilarités euclidiennes et une matrice d'échange. À partir de ces derniers, plusieurs méthodes d'Analyse des données ou d'apprentissage automatique sont exprimées et développées: l'analyse factorielle des correspondances (AFC), vue comme un cas particulier du multidimensional scaling; la classification supervisée, ou non, combinée aux transformations de Schoenberg; et les indices d'autocorrélation et d'autocorrélation croisée, adaptés à des analyses multivariées et permettant de considérer diverses familles de voisinages. Ces méthodes débouchent dans un second temps sur une pratique de l'analyse exploratoire de différentes données textuelles et musicales. Pour les données textuelles, on s'intéresse à la classification automatique en types de discours de propositions énoncées, en se basant sur les catégories morphosyntaxiques (CMS) qu'elles contiennent. Bien que le lien statistique entre les CMS et les types de discours soit confirmé, les résultats de la classification obtenus avec la méthode K- means, combinée à une transformation de Schoenberg, ainsi qu'avec une variante floue de l'algorithme K-means, sont plus difficiles à interpréter. On traite aussi de la classification supervisée multi-étiquette en actes de dialogue de tours de parole, en se basant à nouveau sur les CMS qu'ils contiennent, mais aussi sur les lemmes et le sens des verbes. Les résultats obtenus par l'intermédiaire de l'analyse discriminante combinée à une transformation de Schoenberg sont prometteurs. Finalement, on examine l'autocorrélation textuelle, sous l'angle des similarités entre diverses positions d'un texte, pensé comme une séquence d'unités. En particulier, le phénomène d'alternance de la longueur des mots dans un texte est observé pour des voisinages d'empan variable. On étudie aussi les similarités en fonction de l'apparition, ou non, de certaines parties du discours, ainsi que les similarités sémantiques des diverses positions d'un texte. Concernant les données musicales, on propose une représentation d'une partition musicale sous forme d'une table de contingence. On commence par utiliser l'AFC et l'indice d'autocorrélation pour découvrir les structures existant dans chaque partition. Ensuite, on opère le même type d'approche sur les différentes voix d'une partition, grâce à l'analyse des correspondances multiples, dans une variante floue, et à l'indice d'autocorrélation croisée. Qu'il s'agisse de la partition complète ou des différentes voix qu'elle contient, des structures répétées sont effectivement détectées, à condition qu'elles ne soient pas transposées. Finalement, on propose de classer automatiquement vingt partitions de quatre compositeurs différents, chacune représentée par une table de contingence, par l'intermédiaire d'un indice mesurant la similarité de deux configurations. Les résultats ainsi obtenus permettent de regrouper avec succès la plupart des oeuvres selon leur compositeur.
Resumo:
OBJECTIVE: In contrast to conventional (CONV) neuromuscular electrical stimulation (NMES), the use of "wide-pulse, high-frequencies" (WPHF) can generate higher forces than expected by the direct activation of motor axons alone. We aimed at investigating the occurrence, magnitude, variability and underlying neuromuscular mechanisms of these "Extra Forces" (EF). METHODS: Electrically-evoked isometric plantar flexion force was recorded in 42 healthy subjects. Additionally, twitch potentiation, H-reflex and M-wave responses were assessed in 13 participants. CONV (25Hz, 0.05ms) and WPHF (100Hz, 1ms) NMES consisted of five stimulation trains (20s on-90s off). RESULTS: K-means clustering analysis disclosed a responder rate of almost 60%. Within this group of responders, force significantly increased from 4% to 16% of the maximal voluntary contraction force and H-reflexes were depressed after WPHF NMES. In contrast, non-responders showed neither EF nor H-reflex depression. Twitch potentiation and resting EMG data were similar between groups. Interestingly, a large inter- and intrasubject variability of EF was observed. CONCLUSION: The responder percentage was overestimated in previous studies. SIGNIFICANCE: This study proposes a novel methodological framework for unraveling the neurophysiological mechanisms involved in EF and provides further evidence for a central contribution to EF in responders.
Resumo:
Conventional (CONV) neuromuscular electrical stimulation (NMES) (i.e., short pulse duration, low frequencies) induces a higher energetic response as compared to voluntary contractions (VOL). In contrast, wide-pulse, high-frequency (WPHF) NMES might elicit-at least in some subjects (i.e., responders)-a different motor unit recruitment compared to CONV that resembles the physiological muscle activation pattern of VOL. We therefore hypothesized that for these responder subjects, the metabolic demand of WPHF would be lower than CONV and comparable to VOL. 18 healthy subjects performed isometric plantar flexions at 10% of their maximal voluntary contraction force for CONV (25 Hz, 0.05 ms), WPHF (100 Hz, 1 ms) and VOL protocols. For each protocol, force time integral (FTI) was quantified and subjects were classified as responders and non-responders to WPHF based on k-means clustering analysis. Furthermore, a fatigue index based on FTI loss at the end of each protocol compared with the beginning of the protocol was calculated. Phosphocreatine depletion (ΔPCr) was assessed using 31P magnetic resonance spectroscopy. Responders developed four times higher FTI's during WPHF (99 ± 37 ×103 N.s) than non-responders (26 ± 12 ×103 N.s). For both responders and non-responders, CONV was metabolically more demanding than VOL when ΔPCr was expressed relative to the FTI. Only for the responder group, the ∆PCr/FTI ratio of WPHF (0.74 ± 0.19 M/N.s) was significantly lower compared to CONV (1.48 ± 0.46 M/N.s) but similar to VOL (0.65 ± 0.21 M/N.s). Moreover, the fatigue index was not different between WPHF (-16%) and CONV (-25%) for the responders. WPHF could therefore be considered as the less demanding NMES modality-at least in this subgroup of subjects-by possibly exhibiting a muscle activation pattern similar to VOL contractions.
Resumo:
Positron emission tomography with [18F] fluorodeoxyglucose (FDG-PET) plays a well-established role in assisting early detection of frontotemporal lobar degeneration (FTLD). Here, we examined the impact of intensity normalization to different reference areas on accuracy of FDG-PET to discriminate between patients with mild FTLD and healthy elderly subjects. FDG-PET was conducted at two centers using different acquisition protocols: 41 FTLD patients and 42 controls were studied at center 1, 11 FTLD patients and 13 controls were studied at center 2. All PET images were intensity normalized to the cerebellum, primary sensorimotor cortex (SMC), cerebral global mean (CGM), and a reference cluster with most preserved FDG uptake in the aforementioned patients group of center 1. Metabolic deficits in the patient group at center 1 appeared 1.5, 3.6, and 4.6 times greater in spatial extent, when tracer uptake was normalized to the reference cluster rather than to the cerebellum, SMC, and CGM, respectively. Logistic regression analyses based on normalized values from FTLD-typical regions showed that at center 1, cerebellar, SMC, CGM, and cluster normalizations differentiated patients from controls with accuracies of 86%, 76%, 75% and 90%, respectively. A similar order of effects was found at center 2. Cluster normalization leads to a significant increase of statistical power in detecting early FTLD-associated metabolic deficits. The established FTLD-specific cluster can be used to improve detection of FTLD on a single case basis at independent centers - a decisive step towards early diagnosis and prediction of FTLD syndromes enabling specific therapies in the future.
Resumo:
This is the third edition of the compendium. It documents the status of important projects on nanomaterial toxicity and exposure monitoring, integrated risk management, research infrstructure and coordination and support activities. The compendium is not intended to be a guidance document for human health and environmental safety management of nanotechnologies, as such guidance documents already exist and are widely available. Neither is the compendium intended to be a medium for the publication of scientific papers and research results, as this task is covered by scientific conferences and the reviewed press. The compendium aims to bring researchers closer together and show them the potential for synergy in their work. It is a means to establish links and communication between them during the actual research phase and well before the publication of their results. It thus focuses on the communication of projects' strategic aims, extensively covers specific work objectives and the methods used in research, and documents human capacities and available laboratory infrastructure. As such, the compendium supports collaboration on common goals and the joint elaboration of future plans, whilst compromising neither the potential for scientific publication, nor intellectual property rights. [Auteurs]
Resumo:
OBJECTIVES: To document biopsychosocial profiles of patients with rheumatoid arthritis (RA) by means of the INTERMED and to correlate the results with conventional methods of disease assessment and health care utilization. METHODS: Patients with RA (n = 75) were evaluated with the INTERMED, an instrument for assessing case complexity and care needs. Based on their INTERMED scores, patients were compared with regard to severity of illness, functional status, and health care utilization. RESULTS: In cluster analysis, a 2-cluster solution emerged, with about half of the patients characterized as complex. Complex patients scoring especially high in the psychosocial domain of the INTERMED were disabled significantly more often and took more psychotropic drugs. Although the 2 patient groups did not differ in severity of illness and functional status, complex patients rated their illness as more severe on subjective measures and on most items of the Medical Outcomes Study Short Form 36. Complex patients showed increased health care utilization despite a similar biologic profile. CONCLUSIONS: The INTERMED identified complex patients with increased health care utilization, provided meaningful and comprehensive patient information, and proved to be easy to implement and advantageous compared with conventional methods of disease assessment. Intervention studies will have to demonstrate whether management strategies based on INTERMED profiles can improve treatment response and outcome of complex patients.
Resumo:
OBJECTIVES: To assess the effectiveness of implementing guidelines, coupled with individual feedback, on antibiotic prescribing behaviour of primary care physicians in Switzerland. METHODS: One hundred and forty general practices from a representative Swiss sentinel network of primary care physicians participated in this cluster-randomized prospective intervention study. The intervention consisted of providing guidelines on treatment of respiratory tract infections (RTIs) and uncomplicated lower urinary tract infections (UTIs), coupled with sustained, regular feedback on individual antibiotic prescription behaviour during 2 years. The main aims were: (i) to increase the percentage of prescriptions of penicillins for all RTIs treated with antibiotics; (ii) to increase the percentage of trimethoprim/sulfamethoxazole prescriptions for all uncomplicated lower UTIs treated with antibiotics; (iii) to decrease the percentage of quinolone prescriptions for all cases of exacerbated COPD (eCOPD) treated with antibiotics; and (iv) to decrease the proportion of sinusitis and other upper RTIs treated with antibiotics. The study was registered at ClinicalTrials.gov (NCT01358916). RESULTS: While the percentage of antibiotics prescribed for sinusitis or other upper RTIs and the percentage of quinolones prescribed for eCOPD did not differ between the intervention group and the control group, there was a significant increase in the percentage of prescriptions of penicillins for all RTIs treated with antibiotics [57% versus 49%, OR=1.42 (95% CI 1.08-1.89), P=0.01] and in the percentage of trimethoprim/sulfamethoxazole prescriptions for all uncomplicated lower UTIs treated with antibiotics [35% versus 19%, OR=2.16 (95% CI 1.19-3.91), P=0.01] in the intervention group. CONCLUSIONS: In our setting, implementing guidelines, coupled with sustained individual feedback, was not able to reduce the proportion of sinusitis and other upper RTIs treated with antibiotics, but increased the use of recommended antibiotics for RTIs and UTIs, as defined by the guidelines.
Resumo:
Forest fire sequences can be modelled as a stochastic point process where events are characterized by their spatial locations and occurrence in time. Cluster analysis permits the detection of the space/time pattern distribution of forest fires. These analyses are useful to assist fire-managers in identifying risk areas, implementing preventive measures and conducting strategies for an efficient distribution of the firefighting resources. This paper aims to identify hot spots in forest fire sequences by means of the space-time scan statistics permutation model (STSSP) and a geographical information system (GIS) for data and results visualization. The scan statistical methodology uses a scanning window, which moves across space and time, detecting local excesses of events in specific areas over a certain period of time. Finally, the statistical significance of each cluster is evaluated through Monte Carlo hypothesis testing. The case study is the forest fires registered by the Forest Service in Canton Ticino (Switzerland) from 1969 to 2008. This dataset consists of geo-referenced single events including the location of the ignition points and additional information. The data were aggregated into three sub-periods (considering important preventive legal dispositions) and two main ignition-causes (lightning and anthropogenic causes). Results revealed that forest fire events in Ticino are mainly clustered in the southern region where most of the population is settled. Our analysis uncovered local hot spots arising from extemporaneous arson activities. Results regarding the naturally-caused fires (lightning fires) disclosed two clusters detected in the northern mountainous area.
Resumo:
OBJECTIVE: We investigated whether the INTERMED, a generic instrument for assessing biopsychosocial case complexity and direct care, identifies organ transplant patients at risk of unfavourable post-transplant development by comparing it to the Transplant Evaluation Rating Scale (TERS), the established measure for pretransplant psychosocial evaluation. METHOD: One hundred nineteen kidney, liver, and heart transplant candidates were evaluated using the INTERMED, TERS, SF-36, EuroQol, Montgomery-Åsberg Depression Rating Scale (MADRS), and Hospital Anxiety & Depression Scale (HADS). RESULTS: We found significant relationships between the INTERMED and the TERS scores. The INTERMED highly correlated with the HADS,MADRS, and mental and physical health scores of the SF-36 Health Survey. CONCLUSIONS: The results demonstrate the validity and usefulness of the INTERMED instrument for pretransplant evaluation. Furthermore, our findings demonstrate the different qualities of INTERMED and TERS in clinical practice. The advantages of the psychiatric focus of the TERS and the biopsychosocial perspective of the INTERMED are discussed in the context of current literature on integrated care.
Resumo:
The Polochic and Motagua faults define the active plate boundary between the North American and Caribbean plates in central Guatemala. A splay of the Polochic Fault traverses the rapidly growing city of San Miguel Uspantan that is periodically affected by destructive earthquakes. This fault splay was located using a 2D electrical resistivity tomography (ERT) survey that also characterized the fault damage zone and evaluated the thickness and nature of recent deposits upon which most of the city is built. ERT images show the fault as a similar to 50 m wide, near-vertical low-resistivity anomaly, bounded within a few meters by high resistivity anomalies. Forward modeling reproduces the key aspects of the observed electrical resistivity data with remarkable fidelity thus defining the overall location, geometry, and internal structure of the fault zone as well as the affected lithologies. Our results indicate that the city is constructed on a similar to 20 m thick surficial layer consisting of poorly consolidated, highly porous, water-logged pumice. This soft layer is likely to amplify seismic waves and to liquefy upon moderate to strong ground shaking. The electrical conductivity as well as the major element chemistry of the groundwater provides evidence to suggest that the local aquifer might, at least in part, be fed by water rising along the fault. Therefore, the potential threat posed by this fault splay may not be limited to its seismic activity per se, but could be compounded its potential propensity to enhance seismic site effects by injecting water into the soft surficial sediments. The results of this study provide the basis for a rigorous analysis of seismic hazard and sustainable development of San Miguel Uspantan and illustrate the potential of ERT surveying for paleoseismic studies.
Resumo:
A semisupervised support vector machine is presented for the classification of remote sensing images. The method exploits the wealth of unlabeled samples for regularizing the training kernel representation locally by means of cluster kernels. The method learns a suitable kernel directly from the image and thus avoids assuming a priori signal relations by using a predefined kernel structure. Good results are obtained in image classification examples when few labeled samples are available. The method scales almost linearly with the number of unlabeled samples and provides out-of-sample predictions.
Resumo:
This is the fourth edition of the Nanosafety Cluster compendium. It documents the status of important projects on nanomaterial toxicity and exposure monitoring, integrated risk management, research infrastructure and coordination and support activities. The compendium is not intended to be a guidance document for human health and environmental safety management of nanotechnologies, as such guidance documents already exist and are widely available. Neither is the compendium intended to be a medium for the publication of scientific papers and research results, as this task is covered by scientific conferences and the peer reviewed press. The compendium aims to bring researchers closer together and show them the potential for synergy in their work. It is a means to establish links and communication between them during the actual research phase and well before the publication of their results. It thus focuses on the communication of projects' strategic aims, extensively covers specific work objectives and the methods used in research, and documents human capacities and available laboratory infrastructure. As such, the compendium supports collaboration on common goals and the joint elaboration of future plans, whilst compromising neither the potential for scientific publication, nor intellectual property rights.