156 resultados para Structure mining
em Université de Lausanne, Switzerland
Resumo:
Data mining can be defined as the extraction of previously unknown and potentially useful information from large datasets. The main principle is to devise computer programs that run through databases and automatically seek deterministic patterns. It is applied in different fields of application, e.g., remote sensing, biometry, speech recognition, but has seldom been applied to forensic case data. The intrinsic difficulty related to the use of such data lies in its heterogeneity, which comes from the many different sources of information. The aim of this study is to highlight potential uses of pattern recognition that would provide relevant results from a criminal intelligence point of view. The role of data mining within a global crime analysis methodology is to detect all types of structures in a dataset. Once filtered and interpreted, those structures can point to previously unseen criminal activities. The interpretation of patterns for intelligence purposes is the final stage of the process. It allows the researcher to validate the whole methodology and to refine each step if necessary. An application to cutting agents found in illicit drug seizures was performed. A combinatorial approach was done, using the presence and the absence of products. Methods coming from the graph theory field were used to extract patterns in data constituted by links between products and place and date of seizure. A data mining process completed using graphing techniques is called ``graph mining''. Patterns were detected that had to be interpreted and compared with preliminary knowledge to establish their relevancy. The illicit drug profiling process is actually an intelligence process that uses preliminary illicit drug classes to classify new samples. Methods proposed in this study could be used \textit{a priori} to compare structures from preliminary and post-detection patterns. This new knowledge of a repeated structure may provide valuable complementary information to profiling and become a source of intelligence.
Resumo:
La présente étude est à la fois une évaluation du processus de la mise en oeuvre et des impacts de la police de proximité dans les cinq plus grandes zones urbaines de Suisse - Bâle, Berne, Genève, Lausanne et Zurich. La police de proximité (community policing) est à la fois une philosophie et une stratégie organisationnelle qui favorise un partenariat renouvelé entre la police et les communautés locales dans le but de résoudre les problèmes relatifs à la sécurité et à l'ordre public. L'évaluation de processus a analysé des données relatives aux réformes internes de la police qui ont été obtenues par l'intermédiaire d'entretiens semi-structurés avec des administrateurs clés des cinq départements de police, ainsi que dans des documents écrits de la police et d'autres sources publiques. L'évaluation des impacts, quant à elle, s'est basée sur des variables contextuelles telles que des statistiques policières et des données de recensement, ainsi que sur des indicateurs d'impacts construit à partir des données du Swiss Crime Survey (SCS) relatives au sentiment d'insécurité, à la perception du désordre public et à la satisfaction de la population à l'égard de la police. Le SCS est un sondage régulier qui a permis d'interroger des habitants des cinq grandes zones urbaines à plusieurs reprises depuis le milieu des années 1980. L'évaluation de processus a abouti à un « Calendrier des activités » visant à créer des données de panel permettant de mesurer les progrès réalisés dans la mise en oeuvre de la police de proximité à l'aide d'une grille d'évaluation à six dimensions à des intervalles de cinq ans entre 1990 et 2010. L'évaluation des impacts, effectuée ex post facto, a utilisé un concept de recherche non-expérimental (observational design) dans le but d'analyser les impacts de différents modèles de police de proximité dans des zones comparables à travers les cinq villes étudiées. Les quartiers urbains, délimités par zone de code postal, ont ainsi été regroupés par l'intermédiaire d'une typologie réalisée à l'aide d'algorithmes d'apprentissage automatique (machine learning). Des algorithmes supervisés et non supervisés ont été utilisés sur les données à haute dimensionnalité relatives à la criminalité, à la structure socio-économique et démographique et au cadre bâti dans le but de regrouper les quartiers urbains les plus similaires dans des clusters. D'abord, les cartes auto-organisatrices (self-organizing maps) ont été utilisées dans le but de réduire la variance intra-cluster des variables contextuelles et de maximiser simultanément la variance inter-cluster des réponses au sondage. Ensuite, l'algorithme des forêts d'arbres décisionnels (random forests) a permis à la fois d'évaluer la pertinence de la typologie de quartier élaborée et de sélectionner les variables contextuelles clés afin de construire un modèle parcimonieux faisant un minimum d'erreurs de classification. Enfin, pour l'analyse des impacts, la méthode des appariements des coefficients de propension (propensity score matching) a été utilisée pour équilibrer les échantillons prétest-posttest en termes d'âge, de sexe et de niveau d'éducation des répondants au sein de chaque type de quartier ainsi identifié dans chacune des villes, avant d'effectuer un test statistique de la différence observée dans les indicateurs d'impacts. De plus, tous les résultats statistiquement significatifs ont été soumis à une analyse de sensibilité (sensitivity analysis) afin d'évaluer leur robustesse face à un biais potentiel dû à des covariables non observées. L'étude relève qu'au cours des quinze dernières années, les cinq services de police ont entamé des réformes majeures de leur organisation ainsi que de leurs stratégies opérationnelles et qu'ils ont noué des partenariats stratégiques afin de mettre en oeuvre la police de proximité. La typologie de quartier développée a abouti à une réduction de la variance intra-cluster des variables contextuelles et permet d'expliquer une partie significative de la variance inter-cluster des indicateurs d'impacts avant la mise en oeuvre du traitement. Ceci semble suggérer que les méthodes de géocomputation aident à équilibrer les covariables observées et donc à réduire les menaces relatives à la validité interne d'un concept de recherche non-expérimental. Enfin, l'analyse des impacts a révélé que le sentiment d'insécurité a diminué de manière significative pendant la période 2000-2005 dans les quartiers se trouvant à l'intérieur et autour des centres-villes de Berne et de Zurich. Ces améliorations sont assez robustes face à des biais dus à des covariables inobservées et covarient dans le temps et l'espace avec la mise en oeuvre de la police de proximité. L'hypothèse alternative envisageant que les diminutions observées dans le sentiment d'insécurité soient, partiellement, un résultat des interventions policières de proximité semble donc être aussi plausible que l'hypothèse nulle considérant l'absence absolue d'effet. Ceci, même si le concept de recherche non-expérimental mis en oeuvre ne peut pas complètement exclure la sélection et la régression à la moyenne comme explications alternatives. The current research project is both a process and impact evaluation of community policing in Switzerland's five major urban areas - Basel, Bern, Geneva, Lausanne, and Zurich. Community policing is both a philosophy and an organizational strategy that promotes a renewed partnership between the police and the community to solve problems of crime and disorder. The process evaluation data on police internal reforms were obtained through semi-structured interviews with key administrators from the five police departments as well as from police internal documents and additional public sources. The impact evaluation uses official crime records and census statistics as contextual variables as well as Swiss Crime Survey (SCS) data on fear of crime, perceptions of disorder, and public attitudes towards the police as outcome measures. The SCS is a standing survey instrument that has polled residents of the five urban areas repeatedly since the mid-1980s. The process evaluation produced a "Calendar of Action" to create panel data to measure community policing implementation progress over six evaluative dimensions in intervals of five years between 1990 and 2010. The impact evaluation, carried out ex post facto, uses an observational design that analyzes the impact of the different community policing models between matched comparison areas across the five cities. Using ZIP code districts as proxies for urban neighborhoods, geospatial data mining algorithms serve to develop a neighborhood typology in order to match the comparison areas. To this end, both unsupervised and supervised algorithms are used to analyze high-dimensional data on crime, the socio-economic and demographic structure, and the built environment in order to classify urban neighborhoods into clusters of similar type. In a first step, self-organizing maps serve as tools to develop a clustering algorithm that reduces the within-cluster variance in the contextual variables and simultaneously maximizes the between-cluster variance in survey responses. The random forests algorithm then serves to assess the appropriateness of the resulting neighborhood typology and to select the key contextual variables in order to build a parsimonious model that makes a minimum of classification errors. Finally, for the impact analysis, propensity score matching methods are used to match the survey respondents of the pretest and posttest samples on age, gender, and their level of education for each neighborhood type identified within each city, before conducting a statistical test of the observed difference in the outcome measures. Moreover, all significant results were subjected to a sensitivity analysis to assess the robustness of these findings in the face of potential bias due to some unobserved covariates. The study finds that over the last fifteen years, all five police departments have undertaken major reforms of their internal organization and operating strategies and forged strategic partnerships in order to implement community policing. The resulting neighborhood typology reduced the within-cluster variance of the contextual variables and accounted for a significant share of the between-cluster variance in the outcome measures prior to treatment, suggesting that geocomputational methods help to balance the observed covariates and hence to reduce threats to the internal validity of an observational design. Finally, the impact analysis revealed that fear of crime dropped significantly over the 2000-2005 period in the neighborhoods in and around the urban centers of Bern and Zurich. These improvements are fairly robust in the face of bias due to some unobserved covariate and covary temporally and spatially with the implementation of community policing. The alternative hypothesis that the observed reductions in fear of crime were at least in part a result of community policing interventions thus appears at least as plausible as the null hypothesis of absolutely no effect, even if the observational design cannot completely rule out selection and regression to the mean as alternative explanations.
Resumo:
Target identification for tractography studies requires solid anatomical knowledge validated by an extensive literature review across species for each seed structure to be studied. Manual literature review to identify targets for a given seed region is tedious and potentially subjective. Therefore, complementary approaches would be useful. We propose to use text-mining models to automatically suggest potential targets from the neuroscientific literature, full-text articles and abstracts, so that they can be used for anatomical connection studies and more specifically for tractography. We applied text-mining models to three structures: two well-studied structures, since validated deep brain stimulation targets, the internal globus pallidus and the subthalamic nucleus and, the nucleus accumbens, an exploratory target for treating psychiatric disorders. We performed a systematic review of the literature to document the projections of the three selected structures and compared it with the targets proposed by text-mining models, both in rat and primate (including human). We ran probabilistic tractography on the nucleus accumbens and compared the output with the results of the text-mining models and literature review. Overall, text-mining the literature could find three times as many targets as two man-weeks of curation could. The overall efficiency of the text-mining against literature review in our study was 98% recall (at 36% precision), meaning that over all the targets for the three selected seeds, only one target has been missed by text-mining. We demonstrate that connectivity for a structure of interest can be extracted from a very large amount of publications and abstracts. We believe this tool will be useful in helping the neuroscience community to facilitate connectivity studies of particular brain regions. The text mining tools used for the study are part of the HBP Neuroinformatics Platform, publicly available at http://connectivity-brainer.rhcloud.com/.
Resumo:
The alpha1b-adrenergic receptor (AR) is a member of the large superfamily of seven transmembrane domain (TMD) G protein-coupled receptors (GPCR). Combining site-directed mutagenesis of the alpha1b-AR with computational simulations of receptor dynamics, we have explored the conformational changes underlying the process of receptor activation, i.e. the transition between the inactive and active states. Our findings suggest that the structural constraint stabilizing the alpha1b-AR in the inactive form is a network of H-bonding interactions amongst conserved residues forming a polar pocket and R143 of the DRY sequence at the end of TMDIII. We have recently reported that point mutations of D142, of the DRY sequence and of A293 in the distal portion of the third intracellular loop resulted in ligand-independent (constitutive) activation of the alpha1b-AR. These constitutively activating mutations could induce perturbations resulting in the shift of R143 out of the polar pocket. The main role of R143 may be to mediate receptor activation by triggering the exposure of several basic amino acids of the intracellular loops towards the G protein. Our investigation has been extended also to the biochemical events involved in the desensitization process of alpha1b-AR. Our results indicate that immediately following agonist-induced activation, the alpha1b-AR can undergo rapid agonist-induced phosphorylation and desensitization. Different members of the G protein coupled receptor kinase family can play a role in agonist-induced regulation of the alpha1b-AR. In addition, constitutively active alpha1b-AR mutants display different phosphorylation and internalization features. The future goal is to further elucidate the molecular mechanism underlying the complex equilibrium between activation and inactivation of the alpha1b-AR and its regulation by pharmacological substances. These findings can help to elucidate the mechanism of action of various agents displaying properties of agonists or inverse agonists at the adrenergic system.
Resumo:
Every spring, workers of the Argentine Ant Linepithema humile kill a large proportion of queens within their nests, Although this behaviour inflicts a high energetic cost oil the colonies, its biological significance has remained elusive so far. An earlier study showed that the probability of a queen being executed is not related to her weight, fecundity, or age. Here we test the hypothesis that workers collectively eliminate queens to which they are less related, thereby increasing their inclusive fitness. We found no evidence for this hypothesis. Workers of a nest were on average not significantly less related to executed queens than to surviving ones. Moreover, a population genetic analysis revealed that workers were not genetically differentiated between nests. This means that workers of a given nest are equally related to any queen in the population and that there can be no increase in average worker-queen relatedness by selective elimination of queens. Finally, our genetic analyses also showed that, in contrast to workers, queens were significantly genetically differentiated between nests and that there was significant isolation by distance for queens.
Covariation between colony social structure and immune defences of workers in the ant Formica selysi
Resumo:
Several ant species vary in the number of queens per colony, yet the causes and consequences of this variation remain poorly understood. In previous experiments, we found that Formica selysi workers originating from multiple-queen (=polygyne) colonies had a lower resistance to a fungal pathogen than workers originating from single-queen (=monogyne) colonies. In contrast, group diversity improved disease resistance in experimental colonies. This discrepancy between field and experimental colonies suggested that variation in social structure in the field had antagonistic effects on worker resistance, possibly through a down-regulation of the immune system balancing the positive effect of genetic diversity. Here, we examined if workers originating from field colonies with alternative social structure differed in three major components of their immune system. We found that workers from polygyne colonies had a lower bacterial growth inhibitory activity than workers from monogyne colonies. In contrast, workers from the two types of colonies did not differ significantly in bacterial cell wall lytic activity and prophenoloxidase activity. Overall, the presence of multiple queens in a colony correlated with a slight reduction in one inducible component of the immune system of individual workers. This reduced level of immune defence might explain the lower resistance of workers originating from polygyne colonies despite the positive effect of genetic diversity. More generally, these results indicate that social changes at the group level can modulate individual immune defences.
Resumo:
Pizgrischite, (Cu,Fe)Cu14PbBi17S35, is a new mineral species named after the type locality, Piz Grisch Mountain, Val Ferrera, Graubunden, Switzerland. This sulfosalt occurs as thin, striated, metallic lead-grey blades measuring up to I cm in length, embedded in quartz and associated with tetrahedrite, chalcopyrite, pyrite, sphalerite, emplectite and derivatives of the aikinite-bismuthinite series. In plane-polarized light, the new species is brownish grey with no perceptible pleochroism; under crossed nicols in oil immersion, it presents a weak anisotropy with dark brown tints. Minimum and maximum reflectance values (in %) in air are: 40.7-42.15 (470 nm), 41.2-43.1 (546 nm), 41.2-43.35 (589 nm) and 40.7-43.3 (650 nm). Cleavage is perfect along 001 I and well developed on {010}. Abundant polysynthetic twinning is observed on (010). The mean micro-indentation hardness is 190 kg/mm(2) (Mohs hardness 3.3), and the calculated density is 6.58 g/cm(3). Electron-microprobe analyses yield (wt%; mean result of seven analyses): Cu 16.48, Pb 2.10, Fe 0.77, Bi 60.70, Sb 0.35, S 19.16, Se 0.04, total 99.60. The resulting empirical chemical formula is (Cu15.24Fe0.80Pb0.60)(Sigma 16.64)(Bi17.07Sb0.17)(Sigma 17.24)(S35.09Se0.03)(Sigma 35.12), in accordance with the formula derived from the single-crystal refinement of the structure, (Cu,Fe)Cu14PbBi17S35. Pizgrischite is monoclinic, space group C2/m, with the following unit-cell parameters: a 35.054(2), b3.91123(I), c43.192(2) angstrom, beta 96.713(4)degrees, V5881.24 angstrom(3), Z=4. The strongest seven X-ray powder-diffraction lines [d in angstrom (I)(hkl)] are: 5.364(40)((6) over bar 04), 4.080(50)((8) over bar 05), 3.120(40)(118), 3.104(68)((3) over bar 18), 2.759(53) ((9) over bar 11),2.752(44)(910) and 1.956(100)(020). The crystal structure is an expanded monoclinic derivative of kupcikite. Pizgrischite belongs to the cuprobismutite series of bismuth sulfosalts but, sensu stricto, it is not a homologue of cuprobismutite. At the type locality. pizarischite is the result of the Alpine metamorphism under greenschist-facies conditions of pre-Tertiary hydrothermal Cu-Bi mineralization.
Resumo:
In the framework of health services research sponsored by the Swiss National Science Foundation, a research was undertaken of the activity of the large majority of the public health nurses working in the Swiss cantons of Vaud and Fribourg (total population 700,000). During one week, 130 nurses gathered, with a specially devised instrument, data on 4165 patient visits. Studying the duration of the contacts, one has distinguished contact duration per se (DC), duration of the travel time preceding the contact (DD), and total duration in relation with the contact (DTC-addition of the first two). It was noted that the three durations increased significantly with patient age (as regard travel time, this is explained by the higher proportion of home visits in higher age groups, as compared with visits at a health center). Examined according to location of the visit, contact duration per se (without travel) is higher for visits at home and in nursing homes than for those taking place at a health center. Looked at in respect to the care given (technical care, or basic nursing care, or both simultaneously), our data show that the provision of basic nursing care (alone or with technical care) doubles contact duration (from 20 to 42-45'). The analyses according to patient age shows that, at an advanced age (beyond 80 years particularly), there is an important increase of the visits where both types of care are given. However, contact duration per se shows a significant raise with age only for the group "technical care only"; it can be demonstrated that this is due to the fact that older patients require more complex technical acts (e.g., bladder care, as compared with simpler acts such as injection). A model of the relationships between patient age and contact duration is proposed: it is because of the increase in the proportions of home visits, of visits including basic nursing care, and of more complex technical acts that older persons require more of the working time of public health nurses.
Resumo:
The introduction of culture-independent molecular screening techniques, especially based on 16S rRNA gene sequences, has allowed microbiologists to examine a facet of microbial diversity not necessarily reflected by the results of culturing studies. The bacterial community structure was studied for a pesticide-contaminated site that was subsequently remediated using an efficient degradative strain Arthrobacter protophormiae RKJ100. The efficiency of the bioremediation process was assessed by monitoring the depletion of the pollutant, and the effect of addition of an exogenous strain on the existing soil community structure was determined using molecular techniques. The 16S rRNA gene pool amplified from the soil metagenome was cloned and restriction fragment length polymorphism studies revealed 46 different phylotypes on the basis of similar banding patterns. Sequencing of representative clones of each phylotype showed that the community structure of the pesticide-contaminated soil was mainly constituted by Proteobacteria and Actinomycetes. Terminal restriction fragment length polymorphism analysis showed only nonsignificant changes in community structure during the process of bioremediation. Immobilized cells of strain RKJ100 enhanced pollutant degradation but seemed to have no detectable effects on the existing bacterial community structure.
Resumo:
A cryo-electron microscopy study of supercoiled DNA molecules freely suspended in cryo-vitrified buffer was combined with Monte Carlo simulations and gel electrophoretic analysis to investigate the role of intersegmental electrostatic repulsion in determining the shape of supercoiled DNA molecules. It is demonstrated here that a decrease of DNA-DNA repulsion by increasing concentrations of counterions causes a higher fraction of the linking number deficit to be partitioned into writhe. When counterions reach concentrations likely to be present under in vivo conditions, naturally supercoiled plasmids adopt a tightly interwound conformation. In these tightly supercoiled DNA molecules the opposing segments of interwound superhelix seem to directly contact each other. This form of supercoiling, where two DNA helices interact laterally, may represent an important functional state of DNA. In the particular case of supercoiled minicircles (178 bp) the delta Lk = -2 topoisomers undergo a sharp structural transition from almost planar circles in low salt buffers to strongly writhed "figure-eight" conformations in buffers containing neutralizing concentrations of counterions. Possible implications of this observed structural transition in DNA are discussed.
Resumo:
The RuvA and RuvB proteins of Escherichia coli, which are induced in response to DNA damage, are important in the formation of heteroduplex DNA during genetic recombination and related recombinational repair processes. In vitro studies show that RuvA binds Holiday junctions and acts as a specificity factor that targets the RuvB ATPase, a hexameric ring protein, to the junction. Together, RuvA and RuvB promote branch migration, an ATP-dependent reaction that increases the length of the heteroduplex DNA. Electron microscopic visualization of RuvAB now provides a new insight into the mechanism of this process. We observe the formation of a tripartite protein complex in which RuvA binds the crossover and is sandwiched between two hexameric rings of RuvB. The Holliday junction within this complex adopts a square-planar structure. We propose a molecular model for branch migration, a unique feature of which is the role played by the two oppositely oriented RuvB ring motors.