948 resultados para Cryptography Statistical methods
Resumo:
Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies.
Resumo:
Spatial analysis and social network analysis typically take into consideration social processes in specific contexts of geographical or network space. The research in political science increasingly strives to model heterogeneity and spatial dependence. To better understand and geographically model the relationship between “non-political” events, streaming data from social networks, and political climate was the primary objective of the current study. Geographic information systems (GIS) are useful tools in the organization and analysis of streaming data from social networks. In this study, geographical and statistical analysis were combined in order to define the temporal and spatial nature of the data eminating from the popular social network Twitter during the 2014 FIFA World Cup. The study spans the entire globe because Twitter’s geotagging function, the fundamental data that makes this study possible, is not limited to a geographic area. By examining the public reactions to an inherenlty non-political event, this study serves to illuminate broader questions about social behavior and spatial dependence. From a practical perspective, the analyses demonstrate how the discussion of political topics fluсtuate according to football matches. Tableau and Rapidminer, in addition to a set basic statistical methods, were applied to find patterns in the social behavior in space and time in different geographic regions. It was found some insight into the relationship between an ostensibly non-political event – the World Cup - and public opinion transmitted by social media. The methodology could serve as a prototype for future studies and guide policy makers in governmental and non-governmental organizations in gauging the public opinion in certain geographic locations.
Resumo:
RESUMO: Introdução: As benzodiazepinas são os fármacos ansiolíticos e hipnóticos mais utilizados. O elevado consumo destes fármacos tem representado uma preocupação devido aos efeitos secundários do seu uso prolongado e dependência. Portugal tem a maior utilização de benzodiazepinas na Europa. Este estudo pretende analisar a alteração do padrão de prescrição de benzodiazepinas após uma intervenção com clínicos gerais. Métodos: A intervenção consistiu numa sessão educacional a um grupo de clínicos gerais. Foi comparado o padrão de prescrição de benzodiazepinas dos médicos intervencionados com o de um grupo de médicos não intervencionado da mesma região e com o de um grupo de médicos não intervencionados de outra região. Analisaram-‐se as prescrições de 12 meses antes e depois da intervenção. A análise do padrão de prescrição utilizou como metodologia a Dose Diária Definida (DDD) e a Dose Diária Definida por 1000 pacientes por dia (DHD). A análise estatística recorreu a métodos de regressão segmentada. Resultados: Houve uma diminuição no padrão de prescrição de benzodiazepinas no grupo intervencionado após a intervenção (p=0.005). Houve também uma redução no padrão de prescrição no grupo não intervencionada da mesma região (p=0.037) e no grupo não-intervencionado da região diferente (p=0.010). Analisando por género, prescritores do género feminino prescrevem uma quantidade maior de benzodiazepinas. Os clínicos gerais do género feminino intervencionados tiveram a maior redução na prescrição após a intervenção (p=0.008). Discussão: Os dados demonstraram que a intervenção reduziu a prescrição de benzodiazepinas após a intervenção. A diminuição geral do padrão de prescrição poderá ser explicada pelo efeito de Hawthorne ou pela contaminação entre os três grupos de clínicos gerais. Os dados disponíveis não explicam as diferenças nos padrões de prescrição por género. Conclusão: Este estudo demonstra como uma única intervenção tem um impacto positivo na melhoria dos padrões de prescrição. A replicação desta intervenção poderá representar uma oportunidade para alterar a prescrição de benzodiazepinas em Portugal. -----------------------------ABSTRACT: Introduction: Benzodiazepines are the most utilized anxiolytic and hypnotic drugs. The high consumption of benzodiazepines has been a concern due to the reported side effects of long-‐term use and dependence. Portugal has the highest benzodiazepine utilisation in Europe. This study aims to analyse the change in General Practitioners’ (GPs) benzodiazepine prescription pattern after na intervention period. Methods: An educational session was delivered to a group of intervened GPs. The benzodiazepine prescription pattern of the intervened group was compared to the pattern of a non-‐intervened matched group from the same region, and to the pattern of another non-‐intervened matched group from a diferente region. The research time frame was 12 month before and after intervention. The analysis of the prescription trends used the Defined Daily Dose (DDD) and Defined Daily Dose per 1000 patients per day (DHD) methodology. The statistical methods consisted of segmented regression analysis. Results: There was a decrease in benzodiazepine prescription pattern of intervened GPs after intervention (p=0.005). There was also a decrease in benzodiazepine prescription pattern for the non-‐intervened group from the same region (p=0.037) and for the non-‐ intervened group from a diferente region (p=0.010). Concerningthe analysis by gender, female gender prescribed a higher amount of benzodiazepines. The intervened female gender prescribers presented the highest decrease in prescription trend after intervention (p=0.008). Discussion: The data demonstrated that the intervention was effective in reducing benzodiazepine prescription after intervention. The general decrease in prescription trend might be explained by a Hawthorne effect or a contamination effect between the three groups of GPs. The available data couldn´t explain the diferences in prescription patterns by gender. Conclusion: This study demonstrates how a single intervention has a positive impact on improving prescription trends. The replication of this intervention might be an opportunity to changing the worrying benzodiazepine utilisation in Portugal.
Resumo:
Dissertação de mestrado em Engenharia Industrial
Resumo:
This work is focused on the development of a methodology for the use of chemical characteristic of tire traces to help answer the following question: "Is the offending tire at the origin of the trace found on the crime scene?". This methodology goes from the trace sampling on the road to statistical analysis of its chemical characteristics. Knowledge about the composition and manufacture of tread tires as well as a review of instrumental techniques used for the analysis of polymeric materials were studied to select, as an ansi vi cal technique for this research, pyrolysis coupled to a gas Chromatograph with a mass spectrometry detector (Py-GC/MS). An analytical method was developed and optimized to obtain the lowest variability between replicates of the same sample. Within-variability of the tread was evaluated regarding width and circumference with several samples taken from twelve tires of different brands and/or models. The variability within each of the treads (within-variability) and between the treads (between-variability) could be quantified. Different statistical methods have shown that within-variability is lower than between-variability, which helped differentiate these tires. Ten tire traces were produced with tires of different brands and/or models by braking tests. These traces have been adequately sampled using sheets of gelatine. Particles of each trace were analysed using the same methodology as for the tires at their origin. The general chemical profile of a trace or of a tire has been characterized by eighty-six compounds. Based on a statistical comparison of the chemical profiles obtained, it has been shown that a tire trace is not differentiable from the tire at its origin but is generally differentiable from tires that are not at its origin. Thereafter, a sample containing sixty tires was analysed to assess the discrimination potential of the developed methodology. The statistical results showed that most of the tires of different brands and models are differentiable. However, tires of the same brand and model with identical characteristics, such as country of manufacture, size and DOT number, are not differentiable. A model, based on a likelihood ratio approach, was chosen to evaluate the results of the comparisons between the chemical profiles of the traces and tires. The methodology developed was finally blindly tested using three simulated scenarios. Each scenario involved a trace of an unknown tire as well as two tires possibly at its origin. The correct results for the three scenarios were used to validate the developed methodology. The different steps of this work were useful to collect the required information to test and validate the underlying assumption that it is possible to help determine if an offending tire » or is not at the origin of a trace, by means of a statistical comparison of their chemical profile. This aid was formalized by a measure of the probative value of the evidence, which is represented by the chemical profile of the trace of the tire. - Ce travail s'est proposé de développer une méthodologie pour l'exploitation des caractéristiques chimiques des traces de pneumatiques dans le but d'aider à répondre à la question suivante : «Est-ce que le pneumatique incriminé est ou n'est pas à l'origine de la trace relevée sur les lieux ? ». Cette méthodologie s'est intéressée du prélèvement de la trace de pneumatique sur la chaussée à l'exploitation statistique de ses caractéristiques chimiques. L'acquisition de connaissances sur la composition et la fabrication de la bande de roulement des pneumatiques ainsi que la revue de techniques instrumentales utilisées pour l'analyse de matériaux polymériques ont permis de choisir, comme technique analytique pour la présente recherche, la pyrolyse couplée à un chromatographe en phase gazeuse avec un détecteur de spectrométrie de masse (Py-GC/MS). Une méthode analytique a été développée et optimisée afin d'obtenir la plus faible variabilité entre les réplicas d'un même échantillon. L'évaluation de l'intravariabilité de la bande de roulement a été entreprise dans sa largeur et sa circonférence à l'aide de plusieurs prélèvements effectués sur douze pneumatiques de marques et/ou modèles différents. La variabilité au sein de chacune des bandes de roulement (intravariabilité) ainsi qu'entre les bandes de roulement considérées (intervariabilité) a pu être quantifiée. Les différentes méthodes statistiques appliquées ont montré que l'intravariabilité est plus faible que l'intervariabilité, ce qui a permis de différencier ces pneumatiques. Dix traces de pneumatiques ont été produites à l'aide de pneumatiques de marques et/ou modèles différents en effectuant des tests de freinage. Ces traces ont pu être adéquatement prélevées à l'aide de feuilles de gélatine. Des particules de chaque trace ont été analysées selon la même méthodologie que pour les pneumatiques à leur origine. Le profil chimique général d'une trace de pneumatique ou d'un pneumatique a été caractérisé à l'aide de huitante-six composés. Sur la base de la comparaison statistique des profils chimiques obtenus, il a pu être montré qu'une trace de pneumatique n'est pas différenciable du pneumatique à son origine mais est, généralement, différenciable des pneumatiques qui ne sont pas à son origine. Par la suite, un échantillonnage comprenant soixante pneumatiques a été analysé afin d'évaluer le potentiel de discrimination de la méthodologie développée. Les méthodes statistiques appliquées ont mis en évidence que des pneumatiques de marques et modèles différents sont, majoritairement, différenciables entre eux. La méthodologie développée présente ainsi un bon potentiel de discrimination. Toutefois, des pneumatiques de la même marque et du même modèle qui présentent des caractéristiques PTD (i.e. pays de fabrication, taille et numéro DOT) identiques ne sont pas différenciables. Un modèle d'évaluation, basé sur une approche dite du likelihood ratio, a été adopté pour apporter une signification au résultat des comparaisons entre les profils chimiques des traces et des pneumatiques. La méthodologie mise en place a finalement été testée à l'aveugle à l'aide de la simulation de trois scénarios. Chaque scénario impliquait une trace de pneumatique inconnue et deux pneumatiques suspectés d'être à l'origine de cette trace. Les résultats corrects obtenus pour les trois scénarios ont permis de valider la méthodologie développée. Les différentes étapes de ce travail ont permis d'acquérir les informations nécessaires au test et à la validation de l'hypothèse fondamentale selon laquelle il est possible d'aider à déterminer si un pneumatique incriminé est ou n'est pas à l'origine d'une trace, par le biais d'une comparaison statistique de leur profil chimique. Cette aide a été formalisée par une mesure de la force probante de l'indice, qui est représenté par le profil chimique de la trace de pneumatique.
Resumo:
INTRODUCTION/OBJECTIVES: Detection rates for adenoma and early colorectal cancer (CRC) are insufficient due to low compliance towards invasive screening procedures, like colonoscopy.Available non-invasive screening tests have unfortunately low sensitivity and specificity performances.Therefore, there is a large unmet need calling for a cost-effective, reliable and non-invasive test to screen for early neoplastic and pre-neoplastic lesions AIMS & Methods: The objective is to develop a screening test able to detect early CRCs and adenomas.This test is based on a nucleic acids multi-gene assay performed on peripheral blood mononuclear cells (PBMCs).A colonoscopy-controlled feasibility study was conducted on 179 subjects.The first 92 subjects was used as training set to generate a statistical significant signature.Colonoscopy revealed 21 subjects with CRC,30 with adenoma bigger than 1 cm and 41 with no neoplastic or inflammatory lesions.The second group of 48 subjects (controls, CRC and polyps) was used as a test set and will be kept blinded for the entire data analysis.To determine the organ and disease specificity 38 subjects were used:24 with inflammatory bowel disease (IBD),14 with other cancers than CRC (OC).Blood samples were taken from each patient the day of the colonoscopy and PBMCs were purified. Total RNA was extracted following standard procedures.Multiplex RT-qPCR was applied on 92 different candidate biomarkers.Different univariate and multivariate statistical methods were applied on these candidates and among them 60 biomarkers with significant p-values (<0.01) were selected.These biomarkers are involved in several different biological functions as cellular movement,cell signaling and interaction,tissue and cellular development,cancer and cell growth and proliferation.Two distinct biomarker signatures are used to separate patients without lesion from those with cancer or with adenoma, named COLOX CRC and COLOX POL respectively.COLOX performances were validated using random resampling method, bootstrap. RESULTS: COLOX CRC and POL tests successfully separate patients without lesions from those with CRC (Se 67%,Sp 93%,AUC 0.87) and from those with adenoma bigger than 1cm (Se 63%,Sp 83%,AUC 0.77),respectively. 6/24 patients in the IBD group and 1/14 patients in the OC group have a positive COLOX CRC CONCLUSION: The two COLOX tests demonstrated a high sensitivity and specificity to detect the presence of CRCs and adenomas bigger than 1 cm.A prospective, multicenter, pivotal study is underway in order to confirm these promising results in a larger cohort.
Resumo:
The aim of this work is to evaluate the capabilities and limitations of chemometric methods and other mathematical treatments applied on spectroscopic data and more specifically on paint samples. The uniqueness of the spectroscopic data comes from the fact that they are multivariate - a few thousands variables - and highly correlated. Statistical methods are used to study and discriminate samples. A collection of 34 red paint samples was measured by Infrared and Raman spectroscopy. Data pretreatment and variable selection demonstrated that the use of Standard Normal Variate (SNV), together with removal of the noisy variables by a selection of the wavelengths from 650 to 1830 cm−1 and 2730-3600 cm−1, provided the optimal results for infrared analysis. Principal component analysis (PCA) and hierarchical clusters analysis (HCA) were then used as exploratory techniques to provide evidence of structure in the data, cluster, or detect outliers. With the FTIR spectra, the Principal Components (PCs) correspond to binder types and the presence/absence of calcium carbonate. 83% of the total variance is explained by the four first PCs. As for the Raman spectra, we observe six different clusters corresponding to the different pigment compositions when plotting the first two PCs, which account for 37% and 20% respectively of the total variance. In conclusion, the use of chemometrics for the forensic analysis of paints provides a valuable tool for objective decision-making, a reduction of the possible classification errors, and a better efficiency, having robust results with time saving data treatments.
Resumo:
The purpose of the study was to determine reference percentiles for the urinary (U) oxalate (Ox) and urate (Ura) to creatinine (Cr) concentration ratios in the second morning urine of healthy infants, children, and adolescents. The urinary oxalate and urate to creatinine ratios were determined in the spontaneously voided second morning urine sample. To test reproducibility, two urine samples were analyzed on 2 consecutive weeks in 63% of the subjects. Three hundred eighty-four healthy children (181 girls, 203 boys), aged 1 month to 17 years, from nurseries, kindergartens, and schools of Lausanne, Switzerland, were studied. The 5th and 95th percentiles were determined from the total number of urine samples (627) after confirmation that there was no order effect between repeated measurements and there were no significant sex differences. A nonlinear regression analysis in terms of age was used to smooth the calculated percentiles. In this manner, curves were obtained from which the reference values can be read at any given age. The 95th percentiles decreased with age: for UOx/Cr from 0.175 mg/mg (0.22 mol/mol) at 1 to 6 months to 0.048 mg/mg (0.06 mol/mol) from 7 years and beyond; and UUra/Cr from 2.378 mg/mg (1.6 mol/mol) at 1 to 6 months to 0.594 mg/mg (0.4 mol/mol) in adolescence. We provide 5th and 95th percentile curves for the UOx/Cr and UUra/Cr ratios determined from the second morning urine samples in a large cohort of healthy infants, children, and adolescents. Values were determined by standard analytical chemical techniques and were analyzed by powerful statistical methods. The calculated 95th percentile for the UOx/Cr values fell rather rapidly and reached normal adult values by the age of 7 years, whereas for UUra/Cr, the 95th percentile decreased slowly and stabilized in adolescence.
Resumo:
1. Species distribution modelling is used increasingly in both applied and theoretical research to predict how species are distributed and to understand attributes of species' environmental requirements. In species distribution modelling, various statistical methods are used that combine species occurrence data with environmental spatial data layers to predict the suitability of any site for that species. While the number of data sharing initiatives involving species' occurrences in the scientific community has increased dramatically over the past few years, various data quality and methodological concerns related to using these data for species distribution modelling have not been addressed adequately. 2. We evaluated how uncertainty in georeferences and associated locational error in occurrences influence species distribution modelling using two treatments: (1) a control treatment where models were calibrated with original, accurate data and (2) an error treatment where data were first degraded spatially to simulate locational error. To incorporate error into the coordinates, we moved each coordinate with a random number drawn from the normal distribution with a mean of zero and a standard deviation of 5 km. We evaluated the influence of error on the performance of 10 commonly used distributional modelling techniques applied to 40 species in four distinct geographical regions. 3. Locational error in occurrences reduced model performance in three of these regions; relatively accurate predictions of species distributions were possible for most species, even with degraded occurrences. Two species distribution modelling techniques, boosted regression trees and maximum entropy, were the best performing models in the face of locational errors. The results obtained with boosted regression trees were only slightly degraded by errors in location, and the results obtained with the maximum entropy approach were not affected by such errors. 4. Synthesis and applications. To use the vast array of occurrence data that exists currently for research and management relating to the geographical ranges of species, modellers need to know the influence of locational error on model quality and whether some modelling techniques are particularly robust to error. We show that certain modelling techniques are particularly robust to a moderate level of locational error and that useful predictions of species distributions can be made even when occurrence data include some error.
Resumo:
OBJECTIVES: The aim of the study was to assess whether prospective follow-up data within the Swiss HIV Cohort Study can be used to predict patients who stop smoking; or among smokers who stop, those who start smoking again. METHODS: We built prediction models first using clinical reasoning ('clinical models') and then by selecting from numerous candidate predictors using advanced statistical methods ('statistical models'). Our clinical models were based on literature that suggests that motivation drives smoking cessation, while dependence drives relapse in those attempting to stop. Our statistical models were based on automatic variable selection using additive logistic regression with component-wise gradient boosting. RESULTS: Of 4833 smokers, 26% stopped smoking, at least temporarily; because among those who stopped, 48% started smoking again. The predictive performance of our clinical and statistical models was modest. A basic clinical model for cessation, with patients classified into three motivational groups, was nearly as discriminatory as a constrained statistical model with just the most important predictors (the ratio of nonsmoking visits to total visits, alcohol or drug dependence, psychiatric comorbidities, recent hospitalization and age). A basic clinical model for relapse, based on the maximum number of cigarettes per day prior to stopping, was not as discriminatory as a constrained statistical model with just the ratio of nonsmoking visits to total visits. CONCLUSIONS: Predicting smoking cessation and relapse is difficult, so that simple models are nearly as discriminatory as complex ones. Patients with a history of attempting to stop and those known to have stopped recently are the best candidates for an intervention.
Resumo:
This paper examines the results of spatial (microgeographical) water contact/schistosomiasis studies in two African (Egyptian and Kenyan) and one Brazilian communities. All three studies used traditional cartographic and statistical methods but one of them emploeyd also GIS (geographical information systems) tools. The advantage of GIS and their potential role in schistosomiasis control are briefly described. The three cases revealed considerable variation in the spatial distribution of water contact, transmission parameters and infection levels at the household and individual levels. All studies showed considerable variation in the prevalence and intensity of infection between households. They also show a variable influence of distance on water contact behavior associated with type of activity, age, sex, socioeconomic level, perception of water quality, season and availability of water in the home. Water contact behavior and schistosomiasis were evaluated in the Brazilian village of Nova União within the context of water sharing between household and age/sex groups. Recommendations are made for further spatial studies on the transmission and control of schistosomiasis.
Resumo:
Report for the scientific sojourn at the University of Reading, United Kingdom, from January until May 2008. The main objectives have been firstly to infer population structure and parameters in demographic models using a total of 13 microsatellite loci for genotyping approximately 30 individuals per population in 10 Palinurus elephas populations both from Mediterranean and Atlantic waters. Secondly, developing statistical methods to identify discrepant loci, possibly under selection and implement those methods using the R software environment. It is important to consider that the calculation of the probability distribution of the demographic and mutational parameters for a full genetic data set is numerically difficult for complex demographic history (Stephens 2003). The Approximate Bayesian Computation (ABC), based on summary statistics to infer posterior distributions of variable parameters without explicit likelihood calculations, can surmount this difficulty. This would allow to gather information on different demographic prior values (i.e. effective population sizes, migration rate, microsatellite mutation rate, mutational processes) and assay the sensitivity of inferences to demographic priors by assuming different priors.
Resumo:
In Alzheimer's disease (AD), synaptic alterations play a major role and are often correlated with cognitive changes. In order to better understand synaptic modifications, we compared alterations in NMDA receptors and postsynaptic protein PSD-95 expression in the entorhinal cortex (EC) and frontal cortex (FC; area 9) of AD and control brains. We combined immunohistochemical and image analysis methods to quantify on consecutive sections the distribution of PSD-95 and NMDA receptors GluN1, GluN2A and GluN2B in EC and FC from 25 AD and control cases. The density of stained receptors was analyzed using multivariate statistical methods to assess the effect of neurodegeneration. In both regions, the number of neuronal profiles immunostained for GluN1 receptors subunit and PSD-95 protein was significantly increased in AD compared to controls (3-6 fold), while the number of neuronal profiles stained for GluN2A and GluN2B receptors subunits was on the contrary decreased (3-4 fold). The increase in marked neuronal profiles was more prominent in a cortical band corresponding to layers 3 to 5 with large pyramidal cells. Neurons positive for GluN1 or PSD-95 staining were often found in the same localization on consecutive sections and they were also reactive for the anti-tau antibody AD2, indicating a neurodegenerative process. Differences in the density of immunoreactive puncta representing neuropile were not statistically significant. Altogether these data indicate that GluN1 and PSD-95 accumulate in the neuronal perikarya, but this is not the case for GluN2A and GluN2B, while the neuropile compartment is less subject to modifications. Thus, important variations in the pattern of distribution of the NMDA receptors subunits and PSD-95 represent a marker in AD and by impairing the neuronal network, contribute to functional deterioration.
Resumo:
La aplicación Log2XML tiene como objeto principal la transformación de archivos log en formato texto con separador de campos a un formato XML estandarizado. Para permitir que la aplicación pueda trabajar con logs de diferentes sistemas o aplicaciones, dispone de un sistema de plantillas (indicación de orden de campos y carácter separador) que permite definir la estructura mínima para poder extraer la información de cualquier tipo de log que se base en separadores de campo. Por último, la aplicación permite el procesamiento de la información extraída para la generación de informes y estadísticas.Por otro lado, en el proyecto se profundiza en la tecnología Grails.
Resumo:
Analyzing the relationship between the baseline value and subsequent change of a continuous variable is a frequent matter of inquiry in cohort studies. These analyses are surprisingly complex, particularly if only two waves of data are available. It is unclear for non-biostatisticians where the complexity of this analysis lies and which statistical method is adequate.With the help of simulated longitudinal data of body mass index in children,we review statistical methods for the analysis of the association between the baseline value and subsequent change, assuming linear growth with time. Key issues in such analyses are mathematical coupling, measurement error, variability of change between individuals, and regression to the mean. Ideally, it is better to rely on multiple repeated measurements at different times and a linear random effects model is a standard approach if more than two waves of data are available. If only two waves of data are available, our simulations show that Blomqvist's method - which consists in adjusting for measurement error variance the estimated regression coefficient of observed change on baseline value - provides accurate estimates. The adequacy of the methods to assess the relationship between the baseline value and subsequent change depends on the number of data waves, the availability of information on measurement error, and the variability of change between individuals.