928 resultados para Binary hypothesis testing


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Sequences of timestamped events are currently being generated across nearly every domain of data analytics, from e-commerce web logging to electronic health records used by doctors and medical researchers. Every day, this data type is reviewed by humans who apply statistical tests, hoping to learn everything they can about how these processes work, why they break, and how they can be improved upon. To further uncover how these processes work the way they do, researchers often compare two groups, or cohorts, of event sequences to find the differences and similarities between outcomes and processes. With temporal event sequence data, this task is complex because of the variety of ways single events and sequences of events can differ between the two cohorts of records: the structure of the event sequences (e.g., event order, co-occurring events, or frequencies of events), the attributes about the events and records (e.g., gender of a patient), or metrics about the timestamps themselves (e.g., duration of an event). Running statistical tests to cover all these cases and determining which results are significant becomes cumbersome. Current visual analytics tools for comparing groups of event sequences emphasize a purely statistical or purely visual approach for comparison. Visual analytics tools leverage humans' ability to easily see patterns and anomalies that they were not expecting, but is limited by uncertainty in findings. Statistical tools emphasize finding significant differences in the data, but often requires researchers have a concrete question and doesn't facilitate more general exploration of the data. Combining visual analytics tools with statistical methods leverages the benefits of both approaches for quicker and easier insight discovery. Integrating statistics into a visualization tool presents many challenges on the frontend (e.g., displaying the results of many different metrics concisely) and in the backend (e.g., scalability challenges with running various metrics on multi-dimensional data at once). I begin by exploring the problem of comparing cohorts of event sequences and understanding the questions that analysts commonly ask in this task. From there, I demonstrate that combining automated statistics with an interactive user interface amplifies the benefits of both types of tools, thereby enabling analysts to conduct quicker and easier data exploration, hypothesis generation, and insight discovery. The direct contributions of this dissertation are: (1) a taxonomy of metrics for comparing cohorts of temporal event sequences, (2) a statistical framework for exploratory data analysis with a method I refer to as high-volume hypothesis testing (HVHT), (3) a family of visualizations and guidelines for interaction techniques that are useful for understanding and parsing the results, and (4) a user study, five long-term case studies, and five short-term case studies which demonstrate the utility and impact of these methods in various domains: four in the medical domain, one in web log analysis, two in education, and one each in social networks, sports analytics, and security. My dissertation contributes an understanding of how cohorts of temporal event sequences are commonly compared and the difficulties associated with applying and parsing the results of these metrics. It also contributes a set of visualizations, algorithms, and design guidelines for balancing automated statistics with user-driven analysis to guide users to significant, distinguishing features between cohorts. This work opens avenues for future research in comparing two or more groups of temporal event sequences, opening traditional machine learning and data mining techniques to user interaction, and extending the principles found in this dissertation to data types beyond temporal event sequences.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Scharff-technique is used for eliciting information from human sources. At the very core of the technique is the “illusion of knowing it all” tactic, which aims to inflate a source's perception of how much knowledge an interviewer holds about the event to be discussed. For the current study, we mapped the effects following two different ways of introducing this particular tactic; a traditional way of implementation where the interviewer explicitly states that s/he already knows most of the important information (the traditional condition), and a new way of implementation where the interviewer just starts to present the information that s/he holds (the just start condition). The two versions were compared in two separate experiments. In Experiment 1 (N = 60), we measured the participants’ perceptions of the interviewer's knowledge, and in Experiment 2 (N = 60), the participants’ perceptions of the interviewer's knowledge gaps. We found that participants in the just start condition (a) believed the interviewer had more knowledge (Experiment 1), and (b) searched less actively for gaps in the interviewer's knowledge (Experiment 2), compared to the traditional condition. We will discuss the current findings and how sources test and perceive the knowledge his or her interviewer possesses within a framework of social hypothesis testing.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Objective: To compare the eficacy and safety of 4 mg of ondansetron vs. 4 mg of nalbuphine for the treatment of neuraxial morphine-induced pruritus, in patients at the “Dr. José Eleuterio González” University Hospital from September 2012 to August 2013. Material and methods: A controlled, prospective, randomized study of 28 patients (14 per group) receiving neuraxial morphine analgesia was conducted, which was registered and approved by the ethics Committee of the Institution and patients agreed to participate in the study under informed consent. The results were segmented and contrasted (according to drug) by hypothesis testing; the association was determined by X2 with a 95% conidence interval (CI). Results: Pruritus was effectively resolved in both groups and no signiicant difference was found in the rest of the variables. An increase in the visual analogue scale (eVA) was observed at 6 and 12 hours for the ondansetron group, which was statistically signiicant (p≤0.05), however both groups had an eVA of less than 3. Conclusions: When comparing the eficacy and safety of ondansetron 4 mg vs. nalbuphine 4 mg for the treatment of neuraxial morphine induced pruritus, the only signiicant difference found was the mean eVA at 6 and 12 hours, favoring the ondansetron group. However, both groups scored less than 3 on the eVA. Therefore, we consider that both treatments are effective and safe in the treatment of pruritus caused by neuraxial morphine.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Face ao paradigma atual onde são constantemente impostas às entidades públicas medidas para a racionalização de recursos, os Estabelecimentos de Ensino Superior Público Universitário Militar não são exceção tornando-se cada vez mais premente a aposta numa gestão eficiente e eficaz. Neste âmbito, a Contabilidade Analítica assume de forma crescente um papel dominante na análise e controlo dos custos por atividade. O presente Trabalho de Investigação Aplicada encontra-se subordinado ao tema “A Formação de Oficiais de Administração: Oportunidades, Especificidades e Contingências na senda de uma Carreira Profissional”. Assim, o objetivo geral do presente trabalho passa pelo cálculo do custos de formação dos alunos de Administração dos três ramos das Forças Armadas e desta forma, optar pelo modelo mais rentável economicamente. Para o cálculo do custo, de entre as inúmeras opções existentes relativamente a sistemas de custeio, baseámo-nos no método das Secções Homogéneas ou Centros de Custos. A estrutura do trabalho pode ser dividida em duas partes, a primeira de cariz teórico e a segunda uma vertente prática. A metodologia adotada teve como referência o método de investigação em Ciência Sociais, isto é, partindo de uma pergunta central de investigação, que origina perguntas derivadas, procuram-se respostas através da formulação, exploração e teste de hipóteses. De acordo com os resultados do presente estudo podemos verificar que é o modelo de formação utilizada na Academia Militar o mais rentável economicamente. Desta forma, dadas as evidentes afinidades científicas existentes entre os cursos seria pertinente uma reconfiguração da estrutura científica, durações e do perfil formativo dos diferentes cursos. Assim, uma reorganização que elimine redundâncias e promova a partilha de recursos possibilitará ganhos de eficiência na gestão e consequentemente redução de custos.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Students may need explicit training in informal statistical reasoning in order to design experiments or use formal statistical tests effectively. By using scientific scandals and media misinterpretation, we can explore the need for good experimental design in an informal way. This article describes the use of a paper that reviews the measles mumps rubella vaccine and autism controversy in the UK to illustrate a number of threshold concepts underlying good study design and interpretation of scientific evidence. These include the necessity of sufficient sample size, representative and random sampling, appropriate controls and inferring causation.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Reorganizing a dataset so that its hidden structure can be observed is useful in any data analysis task. For example, detecting a regularity in a dataset helps us to interpret the data, compress the data, and explain the processes behind the data. We study datasets that come in the form of binary matrices (tables with 0s and 1s). Our goal is to develop automatic methods that bring out certain patterns by permuting the rows and columns. We concentrate on the following patterns in binary matrices: consecutive-ones (C1P), simultaneous consecutive-ones (SC1P), nestedness, k-nestedness, and bandedness. These patterns reflect specific types of interplay and variation between the rows and columns, such as continuity and hierarchies. Furthermore, their combinatorial properties are interlinked, which helps us to develop the theory of binary matrices and efficient algorithms. Indeed, we can detect all these patterns in a binary matrix efficiently, that is, in polynomial time in the size of the matrix. Since real-world datasets often contain noise and errors, we rarely witness perfect patterns. Therefore we also need to assess how far an input matrix is from a pattern: we count the number of flips (from 0s to 1s or vice versa) needed to bring out the perfect pattern in the matrix. Unfortunately, for most patterns it is an NP-complete problem to find the minimum distance to a matrix that has the perfect pattern, which means that the existence of a polynomial-time algorithm is unlikely. To find patterns in datasets with noise, we need methods that are noise-tolerant and work in practical time with large datasets. The theory of binary matrices gives rise to robust heuristics that have good performance with synthetic data and discover easily interpretable structures in real-world datasets: dialectical variation in the spoken Finnish language, division of European locations by the hierarchies found in mammal occurrences, and co-occuring groups in network data. In addition to determining the distance from a dataset to a pattern, we need to determine whether the pattern is significant or a mere occurrence of a random chance. To this end, we use significance testing: we deem a dataset significant if it appears exceptional when compared to datasets generated from a certain null hypothesis. After detecting a significant pattern in a dataset, it is up to domain experts to interpret the results in the terms of the application.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In this paper, we test the Prebish-Singer (PS) hypothesis, which states that real commodity prices decline in the long run, using two recent powerful panel data stationarity tests accounting for cross-sectional dependence and a structural break. We find that the hypothesis cannot be rejected for most commodities other than oil.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In this paper, we re-examine two important aspects of the dynamics of relative primary commodity prices, namely the secular trend and the short run volatility. To do so, we employ 25 series, some of them starting as far back as 1650 and powerful panel data stationarity tests that allow for endogenous multiple structural breaks. Results show that all the series are stationary after allowing for endogenous multiple breaks. Test results on the Prebisch–Singer hypothesis, which states that relative commodity prices follow a downward secular trend, are mixed but with a majority of series showing negative trends. We also make a first attempt at identifying the potential drivers of the structural breaks. We end by investigating the dynamics of the volatility of the 25 relative primary commodity prices also allowing for endogenous multiple breaks. We describe the often time-varying volatility in commodity prices and show that it has increased in recent years.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper considers methods for testing for superiority or non-inferiority in active-control trials with binary data, when the relative treatment effect is expressed as an odds ratio. Three asymptotic tests for the log-odds ratio based on the unconditional binary likelihood are presented, namely the likelihood ratio, Wald and score tests. All three tests can be implemented straightforwardly in standard statistical software packages, as can the corresponding confidence intervals. Simulations indicate that the three alternatives are similar in terms of the Type I error, with values close to the nominal level. However, when the non-inferiority margin becomes large, the score test slightly exceeds the nominal level. In general, the highest power is obtained from the score test, although all three tests are similar and the observed differences in power are not of practical importance. Copyright (C) 2007 John Wiley & Sons, Ltd.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The purpose of this study was to apply and compare two time-domain analysis procedures in the determination of oxygen uptake (VO2) kinetics in response to a pseudorandom binary sequence (PRBS) exercise test. PRBS exercise tests have typically been analysed in the frequency domain. However, the complex interpretation of frequency responses may have limited the application of this procedure in both sporting and clinical contexts, where a single time measurement would facilitate subject comparison. The relative potential of both a mean response time (MRT) and a peak cross-correlation time (PCCT) was investigated. This study was divided into two parts: a test-retest reliability study (part A), in which 10 healthy male subjects completed two identical PRBS exercise tests, and a comparison of the VO2 kinetics of 12 elite endurance runners (ER) and 12 elite sprinters (SR; part B). In part A, 95% limits of agreement were calculated for comparison between MRT and PCCT. The results of part A showed no significant difference between test and retest as assessed by MRT [mean (SD) 42.2 (4.2) s and 43.8 (6.9) s] or by PCCT [21.8 (3.7) s and 22.7 (4.5) s]. Measurement error (%) was lower for MRT in comparison with PCCT (16% and 25%, respectively). In part B of the study, the VO2 kinetics of ER were significantly faster than those of SR, as assessed by MRT [33.4 (3.4) s and 39.9 (7.1) s, respectively; P<0.01] and PCCT [20.9 (3.8) s and 24.8 (4.5) s; P < 0.05]. It is possible that either analysis procedure could provide a single test measurement Of VO2 kinetics; however, the greater reliability of the MRT data suggests that this method has more potential for development in the assessment Of VO2 kinetics by PRBS exercise testing.