953 resultados para Statistical Inference
Resumo:
Membrane proteins, which constitute approximately 20% of most genomes, form two main classes: alpha helical and beta barrel transmembrane proteins. Using methods based on Bayesian Networks, a powerful approach for statistical inference, we have sought to address beta-barrel topology prediction. The beta-barrel topology predictor reports individual strand accuracies of 88.6%. The method outlined here represents a potentially important advance in the computational determination of membrane protein topology.
Resumo:
2000 Mathematics Subject Classification: 60J80.
Resumo:
2000 Mathematics Subject Classification: 62G32, 62G20.
Resumo:
Multiple linear regression model plays a key role in statistical inference and it has extensive applications in business, environmental, physical and social sciences. Multicollinearity has been a considerable problem in multiple regression analysis. When the regressor variables are multicollinear, it becomes difficult to make precise statistical inferences about the regression coefficients. There are some statistical methods that can be used, which are discussed in this thesis are ridge regression, Liu, two parameter biased and LASSO estimators. Firstly, an analytical comparison on the basis of risk was made among ridge, Liu and LASSO estimators under orthonormal regression model. I found that LASSO dominates least squares, ridge and Liu estimators over a significant portion of the parameter space for large dimension. Secondly, a simulation study was conducted to compare performance of ridge, Liu and two parameter biased estimator by their mean squared error criterion. I found that two parameter biased estimator performs better than its corresponding ridge regression estimator. Overall, Liu estimator performs better than both ridge and two parameter biased estimator.
Resumo:
This work's objective is the development of a methodology to represent an unknown soil through a stratified horizontal multilayer soil model, from which the engineer may carry out eletrical grounding projects with high precision. The methodology uses the experimental electrical apparent resistivity curve, obtained through measurements on the ground, using a 4-wire earth ground resistance tester kit, along with calculations involving the measured resistance. This curve is then compared with the theoretical electrical apparent resistivity curve, obtained through calculations over a horizontally strati ed soil, whose parameters are conjectured. This soil model parameters, such as the number of layers, in addition to the resistivity and the thickness of each layer, are optimized by Differential Evolution method, with enhanced performance through parallel computing, in order to both apparent resistivity curves get close enough, and it is possible to represent the unknown soil through the multilayer horizontal soil model fitted with optimized parameters. In order to assist the Differential Evolution method, in case of a stagnation during an arbitrary amount of generations, an optimization process unstuck methodology is proposed, to expand the search space and test new combinations, allowing the algorithm to nd a better solution and/or leave the local minima. It is further proposed an error improvement methodology, in order to smooth the error peaks between the apparent resistivity curves, by giving opportunities for other more uniform solutions to excel, in order to improve the whole algorithm precision, minimizing the maximum error. Methodologies to verify the polynomial approximation of the soil characteristic function and the theoretical apparent resistivity calculations are also proposed by including middle points among the approximated ones in the verification. Finally, a statistical evaluation prodecure is presented, in order to enable the classication of soil samples. The soil stratification methodology is used in a control group, formed by horizontally stratified soils. By using statistical inference, one may calculate the amount of soils that, within an error margin, does not follow the horizontal multilayer model.
Resumo:
Dengue is an important vector-borne virus that infects on the order of 400 million individuals per year. Infection with one of the virus's four serotypes (denoted DENV-1 to 4) may be silent, result in symptomatic dengue 'breakbone' fever, or develop into the more severe dengue hemorrhagic fever/dengue shock syndrome (DHF/DSS). Extensive research has therefore focused on identifying factors that influence dengue infection outcomes. It has been well-documented through epidemiological studies that DHF is most likely to result from a secondary heterologous infection, and that individuals experiencing a DENV-2 or DENV-3 infection typically are more likely to present with more severe dengue disease than those individuals experiencing a DENV-1 or DENV-4 infection. However, a mechanistic understanding of how these risk factors affect disease outcomes, and further, how the virus's ability to evolve these mechanisms will affect disease severity patterns over time, is lacking. In the second chapter of my dissertation, I formulate mechanistic mathematical models of primary and secondary dengue infections that describe how the dengue virus interacts with the immune response and the results of this interaction on the risk of developing severe dengue disease. I show that only the innate immune response is needed to reproduce characteristic features of a primary infection whereas the adaptive immune response is needed to reproduce characteristic features of a secondary dengue infection. I then add to these models a quantitative measure of disease severity that assumes immunopathology, and analyze the effectiveness of virological indicators of disease severity. In the third chapter of my dissertation, I then statistically fit these mathematical models to viral load data of dengue patients to understand the mechanisms that drive variation in viral load. I specifically consider the roles that immune status, clinical disease manifestation, and serotype may play in explaining viral load variation observed across the patients. With this analysis, I show that there is statistical support for the theory of antibody dependent enhancement in the development of severe disease in secondary dengue infections and that there is statistical support for serotype-specific differences in viral infectivity rates, with infectivity rates of DENV-2 and DENV-3 exceeding those of DENV-1. In the fourth chapter of my dissertation, I integrate these within-host models with a vector-borne epidemiological model to understand the potential for virulence evolution in dengue. Critically, I show that dengue is expected to evolve towards intermediate virulence, and that the optimal virulence of the virus depends strongly on the number of serotypes that co-circulate. Together, these dissertation chapters show that dengue viral load dynamics provide insight into the within-host mechanisms driving differences in dengue disease patterns and that these mechanisms have important implications for dengue virulence evolution.
Resumo:
Extensive investigation has been conducted on network data, especially weighted network in the form of symmetric matrices with discrete count entries. Motivated by statistical inference on multi-view weighted network structure, this paper proposes a Poisson-Gamma latent factor model, not only separating view-shared and view-specific spaces but also achieving reduced dimensionality. A multiplicative gamma process shrinkage prior is implemented to avoid over parameterization and efficient full conditional conjugate posterior for Gibbs sampling is accomplished. By the accommodating of view-shared and view-specific parameters, flexible adaptability is provided according to the extents of similarity across view-specific space. Accuracy and efficiency are tested by simulated experiment. An application on real soccer network data is also proposed to illustrate the model.
Resumo:
Uncertainty quantification (UQ) is both an old and new concept. The current novelty lies in the interactions and synthesis of mathematical models, computer experiments, statistics, field/real experiments, and probability theory, with a particular emphasize on the large-scale simulations by computer models. The challenges not only come from the complication of scientific questions, but also from the size of the information. It is the focus in this thesis to provide statistical models that are scalable to massive data produced in computer experiments and real experiments, through fast and robust statistical inference.
Chapter 2 provides a practical approach for simultaneously emulating/approximating massive number of functions, with the application on hazard quantification of Soufri\`{e}re Hills volcano in Montserrate island. Chapter 3 discusses another problem with massive data, in which the number of observations of a function is large. An exact algorithm that is linear in time is developed for the problem of interpolation of Methylation levels. Chapter 4 and Chapter 5 are both about the robust inference of the models. Chapter 4 provides a new criteria robustness parameter estimation criteria and several ways of inference have been shown to satisfy such criteria. Chapter 5 develops a new prior that satisfies some more criteria and is thus proposed to use in practice.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-08
Resumo:
O presente projeto de Mestrado assenta no estudo dos principais fatores motivacionais na compra de livros de ficção, aprofundando a dicotomia uso próprio versus oferta, englobando duas perspetivas complementares: a sustentação bibliográfica e uma pesquisa quantitativa que foi concretizada através da distribuição de um inquérito por questionário, em formato digital, que permitiu obter 487 respostas válidas e completas. Os dados obtidos foram tratados no SPSS (Statistical Package for the Social Sciences), utilizando análises descritivas, que foram complementadas com técnicas de inferência estatística. Os resultados permitiram concluir que, apesar de homens e mulheres oferecerem um número idêntico de livros, as mulheres compram e leem mais. Além disso, cerca de um terço dos livros comprados destina-se a oferta. Ficou ainda provado que a interação com os autores nas redes sociais apresenta uma associação positiva com a importância atribuida a esse fator na hora de comprar o(s) livro(s). As mulheres são mais sensíveis às “Políticas de Marketing” (Atendimento Personalizado; Facilidade de Pagamento; Local com Cartão de Fidelização; Preços e Descontos; Elevada quantidade de livros), enquanto critérios de seleção do local de compra. Importa referir que a compra é concretizada de forma menos impulsiva quando o livro é para oferta. Os atributos da capa mais relevantes no processo de decisão de compra são o “Título” e a “Sinopse”. Contudo, os diferentes fatores da capa estudados não apresentam diferenças significativas na compra para uso próprio comparativamente à oferta, no que respeita à sua importância. “Autores com prémios literários” e “Nobel” e “Ser uma novidade” (livro) são outros fatores (não da capa) que apresentam diferenças significativas favoráveis à compra para oferta. Por outro lado, “Recomendação de Amigos e Familiares” e “Presença/Ligação/Interação com o Autor nas redes sociais” são mais favoráveis à compra para uso próprio. As conclusões são relevantes para enquadrar as preferências dos consumidores, possibilitando aos profissionais do marketing algumas referências para definir estratégias que respondam às suas necessidades e ambições. Além disso, ao explorar a dimensão dos submercados (Uso próprio versus Oferta), será possível ir ao encontro das exigências dos compradores, tornando a subjetividade da aquisição/utilização numa ferramenta eficaz da própria idealização e personalização dos livros, permitindo conceber e implementar ideias empreendedoras que visem estimular a compra e, consequentemente, melhorar os hábitos de leitura de todos os portugueses.
Resumo:
The aim of this thesis is to review and augment the theory and methods of optimal experimental design. In Chapter I the scene is set by considering the possible aims of an experimenter prior to an experiment, the statistical methods one might use to achieve those aims and how experimental design might aid this procedure. It is indicated that, given a criterion for design, a priori optimal design will only be possible in certain instances and, otherwise, some form of sequential procedure would seem to be indicated. In Chapter 2 an exact experimental design problem is formulated mathematically and is compared with its continuous analogue. Motivation is provided for the solution of this continuous problem, and the remainder of the chapter concerns this problem. A necessary and sufficient condition for optimality of a design measure is given. Problems which might arise in testing this condition are discussed, in particular with respect to possible non-differentiability of the criterion function at the design being tested. Several examples are given of optimal designs which may be found analytically and which illustrate the points discussed earlier in the chapter. In Chapter 3 numerical methods of solution of the continuous optimal design problem are reviewed. A new algorithm is presented with illustrations of how it should be used in practice. It is shown that, for reasonably large sample size, continuously optimal designs may be approximated to well by an exact design. In situations where this is not satisfactory algorithms for improvement of this design are reviewed. Chapter 4 consists of a discussion of sequentially designed experiments, with regard to both the philosophies underlying, and the application of the methods of, statistical inference. In Chapter 5 we criticise constructively previous suggestions for fully sequential design procedures. Alternative suggestions are made along with conjectures as to how these might improve performance. Chapter 6 presents a simulation study, the aim of which is to investigate the conjectures of Chapter 5. The results of this study provide empirical support for these conjectures. In Chapter 7 examples are analysed. These suggest aids to sequential experimentation by means of reduction of the dimension of the design space and the possibility of experimenting semi-sequentially. Further examples are considered which stress the importance of the use of prior information in situations of this type. Finally we consider the design of experiments when semi-sequential experimentation is mandatory because of the necessity of taking batches of observations at the same time. In Chapter 8 we look at some of the assumptions which have been made and indicate what may go wrong where these assumptions no longer hold.
Resumo:
International audience
Resumo:
Let (X, Y) be bivariate normal random vectors which represent the responses as a result of Treatment 1 and Treatment 2. The statistical inference about the bivariate normal distribution parameters involving missing data with both treatment samples is considered. Assuming the correlation coefficient ρ of the bivariate population is known, the MLE of population means and variance (ξ, η, and σ2) are obtained. Inferences about these parameters are presented. Procedures of constructing confidence interval for the difference of population means ξ – η and testing hypothesis about ξ – η are established. The performances of the new estimators and testing procedure are compared numerically with the method proposed in Looney and Jones (2003) on the basis of extensive Monte Carlo simulation. Simulation studies indicate that the testing power of the method proposed in this thesis study is higher.
Resumo:
Loftus (Memory & Cognition 6:312-319, 1978) distinguished between interpretable and uninterpretable interactions. Uninterpretable interactions are ambiguous, because they may be due to two additive main effects (no interaction) and a nonlinear relationship between the (latent) outcome variable and its indicator. Interpretable interactions can only be due to the presence of a true interactive effect in the outcome variable, regardless of the relationship that it establishes with its indicator. In the present article, we first show that same problem can arise when an unmeasured mediator has a nonlinear effect on the measured outcome variable. Then we integrate Loftus's arguments with a seemingly contradictory approach to interactions suggested by Rosnow and Rosenthal (Psychological Bulletin 105:143-146, 1989). We show that entire data patterns, not just interaction effects alone, produce interpretable or noninterpretable interactions. Next, we show that the same problem of interpretability can apply to main effects. Lastly, we give concrete advice on what researchers can do to generate data patterns that provide unambiguous evidence for hypothesized interactions.
Resumo:
Introduction: There has been a continuous development of new technologies in healthcare that are derived from national quality registries. However, this innovation needs to be translated into the workflow of healthcare delivery, to enable children with long-term conditions to get the best support possible to manage their health during everyday life. Since children living with long-term conditions experience different interference levels in their lives, healthcare professionals need to assess the impact of care on children’s day-to-day lives, as a complement to biomedical assessments. Aim: The overall aim of this thesis was to explore and describe the use of instruments about health-related quality of life (HRQOL) in outpatient care for children with long-term conditions on the basis of a national quality registry system. Methods: The research was conducted by using comparative, cross-sectional and explorative designs and data collection was performed by using different methods. The questionnaire DISABKIDS Chronic Generic Measure -37 was used as well as semi-structured interviews and video-recordings from consultations. Altogether, 156 children (8–18 years) and nine healthcare professionals participated in the studies. Children with Type 1 Diabetes (T1D) (n 131) answered the questionnaire DISABKIDS and children with rheumatic diseases, kidney diseases and T1D (n 25) were interviewed after their consultation at the outpatient clinic after the web-DISABKIDS had been used. In total, nine healthcare professionals used the HRQOL instrument as an assessment tool during the encounters which was video-recorded (n 21). Quantitative deductive content analysis was used to describe content in different HRQOL instruments. Statistical inference was used to analyse results from DISABKIDS and qualitative content analysis was used to analyse the interviews and video-recordings. Results: The findings showed that based on a biopsychosocial perspective, both generic and disease-specific instruments should be used to gain a comprehensive evaluation of the child’s HRQOL. The DISABKIDS instrument is applicable when describing different aspects of health concerning children with T1D. When DISABKIDS was used in the encounters, children expressed positive experiences about sharing their results with the healthcare professional. It was discovered that different approaches led to different outcomes for the child when the healthcare professionals were using DISABKIDS during the encounter. When an instructing approach is used, the child’s ability to learn more about their health and how to improve their health is limited. When an inviting or engaging approach is used by the professional, the child may become more involved during the conversations. Conclusions: It could be argued that instruments of HRQOL could be used as a complement to biomedical variables, to promote a biopsychosocial perspective on the child’s health. According to the children in this thesis, feedback on their results after answering to web-DISABKIDS is important, which implies that healthcare professionals need to prioritize time for discussions about results from HRQOL instruments in the encounters. If healthcare professionals involve the child in the discussion of the results of the HRQOL, misinterpreted answers could be corrected during the conversation. Concurrently, this claims that healthcare professionals invite and engage the child.