25 resultados para exceedance probabilities


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Background: Genome-wide association studies (GWAS) require large sample sizes to obtain adequate statistical power, but it may be possible to increase the power by incorporating complementary data. In this study we investigated the feasibility of automatically retrieving information from the medical literature and leveraging this information in GWAS. Methods: We developed a method that searches through PubMed abstracts for pre-assigned keywords and key concepts, and uses this information to assign prior probabilities of association for each single nucleotide polymorphism (SNP) with the phenotype of interest - the Adjusting Association Priors with Text (AdAPT) method. Association results from a GWAS can subsequently be ranked in the context of these priors using the Bayes False Discovery Probability (BFDP) framework. We initially tested AdAPT by comparing rankings of known susceptibility alleles in a previous lung cancer GWAS, and subsequently applied it in a two-phase GWAS of oral cancer. Results: Known lung cancer susceptibility SNPs were consistently ranked higher by AdAPT BFDPs than by p-values. In the oral cancer GWAS, we sought to replicate the top five SNPs as ranked by AdAPT BFDPs, of which rs991316, located in the ADH gene region of 4q23, displayed a statistically significant association with oral cancer risk in the replication phase (per-rare-allele log additive p-value [p(trend)] = 2.5 x 10(-3)). The combined OR for having one additional rare allele was 0.83 (95% CI: 0.76-0.90), and this association was independent of previously identified susceptibility SNPs that are associated with overall UADT cancer in this gene region. We also investigated if rs991316 was associated with other cancers of the upper aerodigestive tract (UADT), but no additional association signal was found. Conclusion: This study highlights the potential utility of systematically incorporating prior knowledge from the medical literature in genome-wide analyses using the AdAPT methodology. AdAPT is available online (url: http://services.gate.ac.uk/lld/gwas/service/config).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A set of predictor variables is said to be intrinsically multivariate predictive (IMP) for a target variable if all properly contained subsets of the predictor set are poor predictors of the. target but the full set predicts the target with great accuracy. In a previous article, the main properties of IMP Boolean variables have been analytically described, including the introduction of the IMP score, a metric based on the coefficient of determination (CoD) as a measure of predictiveness with respect to the target variable. It was shown that the IMP score depends on four main properties: logic of connection, predictive power, covariance between predictors and marginal predictor probabilities (biases). This paper extends that work to a broader context, in an attempt to characterize properties of discrete Bayesian networks that contribute to the presence of variables (network nodes) with high IMP scores. We have found that there is a relationship between the IMP score of a node and its territory size, i.e., its position along a pathway with one source: nodes far from the source display larger IMP scores than those closer to the source, and longer pathways display larger maximum IMP scores. This appears to be a consequence of the fact that nodes with small territory have larger probability of having highly covariate predictors, which leads to smaller IMP scores. In addition, a larger number of XOR and NXOR predictive logic relationships has positive influence over the maximum IMP score found in the pathway. This work presents analytical results based on a simple structure network and an analysis involving random networks constructed by computational simulations. Finally, results from a real Bayesian network application are provided. (C) 2012 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper, the effects of uncertainty and expected costs of failure on optimum structural design are investigated, by comparing three distinct formulations of structural optimization problems. Deterministic Design Optimization (DDO) allows one the find the shape or configuration of a structure that is optimum in terms of mechanics, but the formulation grossly neglects parameter uncertainty and its effects on structural safety. Reliability-based Design Optimization (RBDO) has emerged as an alternative to properly model the safety-under-uncertainty part of the problem. With RBDO, one can ensure that a minimum (and measurable) level of safety is achieved by the optimum structure. However, results are dependent on the failure probabilities used as constraints in the analysis. Risk optimization (RO) increases the scope of the problem by addressing the compromising goals of economy and safety. This is accomplished by quantifying the monetary consequences of failure, as well as the costs associated with construction, operation and maintenance. RO yields the optimum topology and the optimum point of balance between economy and safety. Results are compared for some example problems. The broader RO solution is found first, and optimum results are used as constraints in DDO and RBDO. Results show that even when optimum safety coefficients are used as constraints in DDO, the formulation leads to configurations which respect these design constraints, reduce manufacturing costs but increase total expected costs (including expected costs of failure). When (optimum) system failure probability is used as a constraint in RBDO, this solution also reduces manufacturing costs but by increasing total expected costs. This happens when the costs associated with different failure modes are distinct. Hence, a general equivalence between the formulations cannot be established. Optimum structural design considering expected costs of failure cannot be controlled solely by safety factors nor by failure probability constraints, but will depend on actual structural configuration. (c) 2011 Elsevier Ltd. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The number of citations received by authors in scientific journals has become a major parameter to assess individual researchers and the journals themselves through the impact factor. A fair assessment therefore requires that the criteria for selecting references in a given manuscript should be unbiased with regard to the authors or journals cited. In this paper, we assess approaches for citations considering two recommendations for authors to follow while preparing a manuscript: (i) consider similarity of contents with the topics investigated, lest related work should be reproduced or ignored; (ii) perform a systematic search over the network of citations including seminal or very related papers. We use formalisms of complex networks for two datasets of papers from the arXiv and the Web of Science repositories to show that neither of these two criteria is fulfilled in practice. By representing the texts as complex networks we estimated a similarity index between pieces of texts and found that the list of references did not contain the most similar papers in the dataset. This was quantified by calculating a consistency index, whose maximum value is one if the references in a given paper are the most similar in the dataset. For the areas of "complex networks" and "graphenes", the consistency index was only 0.11-0.23 and 0.10-0.25, respectively. To simulate a systematic search in the citation network, we employed a traditional random walk search (i.e. diffusion) and a random walk whose probabilities of transition are proportional to the number of the ingoing edges of the neighbours. The frequency of visits to the nodes (papers) in the network had a very small correlation with either the actual list of references in the papers or with the number of downloads from the arXiv repository. Therefore, apparently the authors and users of the repository did not follow the criterion related to a systematic search over the network of citations. Based on these results, we propose an approach that we believe is fairer for evaluating and complementing citations of a given author, effectively leading to a virtual scientometry.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Questions Does the spatial association between isolated adult trees and understorey plants change along a gradient of sand dunes? Does this association depend on the life form of the understorey plant? Location Coastal sand dunes, southeast Brazil. Methods We recorded the occurrence of understorey plant species in 100 paired 0.25 m2 plots under adult trees and in adjacent treeless sites along an environmental gradient from beach to inland. Occurrence probabilities were modelled as a function of the fixed variables of the presence of a neighbour, distance from the seashore and life form, and a random variable, the block (i.e. the pair of plots). Generalized linear mixed models (GLMM) were fitted in a backward step-wise procedure using Akaike's information criterion (AIC) for model selection. Results The occurrence of understorey plants was affected by the presence of an adult tree neighbour, but the effect varied with the life form of the understorey species. Positive spatial association was found between isolated adult neighbour and young trees, whereas a negative association was found for shrubs. Moreover, a neutral association was found for lianas, whereas for herbs the effect of the presence of an adult neighbour ranged from neutral to negative, depended on the subgroup considered. The strength of the negative association with forbs increased with distance from the seashore. However, for the other life forms, the associational pattern with adult trees did not change along the gradient. Conclusions For most of the understorey life forms there is no evidence that the spatial association between isolated adult trees and understorey plants changes with the distance from the seashore, as predicted by the stress gradient hypothesis, a common hypothesis in the literature about facilitation in plant communities. Furthermore, the positive spatial association between isolated adult trees and young trees identified along the entire gradient studied indicates a positive feedback that explains the transition from open vegetation to forest in subtropical coastal dune environments.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Deficiencies in calcium (Ca) and magnesium (Mg) are associated with various complications during pregnancy. To test the hypothesis that the status of these minerals is inadequate in pregnancy, a cross-sectional study was conducted of the dietary intake and status of Ca and Mg in pregnant women (n = 50) attending a general public university hospital in Brazil. Dietary intake was assessed from 4-day food records; levels of plasma Mg, erythrocyte Mg, and urinary Ca and Mg excretion were determined by flame atomic absorption spectroscopy; and type I collagen C-telopeptides were evaluated by enzyme-linked immunosorbent assay. Probabilities of inadequate Ca and Mg intake were exhibited by 58 and 98% of the study population, respectively. The mean levels of urinary Ca and Mg excretion were 8.55 and 3.77 mmol/L, respectively. Plasma C-telopeptides, plasma Mg, and erythrocyte Mg were within normal levels. Multiple linear regression analysis revealed positive relationships among urinary Ca excretion, Ca intake (P = .002) and urinary Mg excretion (P < .001) and between erythrocyte Mg and Mg intake (P = .023). It is concluded that the Ca and Mg status of participants was adequate even though the intake of Ca and Mg was lower than the recommended level. (C) 2012 Elsevier Inc. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Objectives To evaluate the accuracy and probabilities of different fetal ultrasound parameters to predict neonatal outcome in isolated congenital diaphragmatic hernia (CDH). Methods Between January 2004 and December 2010, we evaluated prospectively 108 fetuses with isolated CDH (82 left-sided and 26 right-sided). The following parameters were evaluated: gestational age at diagnosis, side of the diaphragmatic defect, presence of polyhydramnios, presence of liver herniated into the fetal thorax (liver-up), lung-to-head ratio (LHR) and observed/expected LHR (o/e-LHR), observed/expected contralateral and total fetal lung volume (o/e-ContFLV and o/e-TotFLV) ratios, ultrasonographic fetal lung volume/fetal weight ratio (US-FLW), observed/expected contralateral and main pulmonary artery diameter (o/e-ContPA and o/eMPA) ratios and the contralateral vascularization index (Cont-VI). The outcomes were neonatal death and severe postnatal pulmonary arterial hypertension (PAH). Results Neonatal mortality was 64.8% (70/108). Severe PAH was diagnosed in 68 (63.0%) cases, of which 63 died neonatally (92.6%) (P < 0.001). Gestational age at diagnosis, side of the defect and polyhydramnios were not associated with poor outcome (P > 0.05). LHR, o/eLHR, liver-up, o/e-ContFLV, o/e-TotFLV, US-FLW, o/eContPA, o/e-MPA and Cont-VI were associated with both neonatal death and severe postnatal PAH (P < 0.001). Receiver-operating characteristics curves indicated that measuring total lung volumes (o/e-TotFLV and US-FLW) was more accurate than was considering only the contralateral lung sizes (LHR, o/e-LHR and o/e-ContFLV; P < 0.05), and Cont-VI was the most accurate ultrasound parameter to predict neonatal death and severe PAH (P < 0.001). Conclusions Evaluating total lung volumes is more accurate than is measuring only the contralateral lung size. Evaluating pulmonary vascularization (Cont-VI) is the most accurate predictor of neonatal outcome. Estimating the probability of survival and severe PAH allows classification of cases according to prognosis. Copyright (C) 2011 ISUOG. Published by John Wiley & Sons, Ltd.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Statistical methods have been widely employed to assess the capabilities of credit scoring classification models in order to reduce the risk of wrong decisions when granting credit facilities to clients. The predictive quality of a classification model can be evaluated based on measures such as sensitivity, specificity, predictive values, accuracy, correlation coefficients and information theoretical measures, such as relative entropy and mutual information. In this paper we analyze the performance of a naive logistic regression model (Hosmer & Lemeshow, 1989) and a logistic regression with state-dependent sample selection model (Cramer, 2004) applied to simulated data. Also, as a case study, the methodology is illustrated on a data set extracted from a Brazilian bank portfolio. Our simulation results so far revealed that there is no statistically significant difference in terms of predictive capacity between the naive logistic regression models and the logistic regression with state-dependent sample selection models. However, there is strong difference between the distributions of the estimated default probabilities from these two statistical modeling techniques, with the naive logistic regression models always underestimating such probabilities, particularly in the presence of balanced samples. (C) 2012 Elsevier Ltd. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The aim of this study was to evaluate the immunoexpression of MMP-2, MMP-9 and CD31/microvascular density in squamous cell carcinomas of the floor of the mouth and to correlate the results with demographic, survival, clinical (TNM staging) and histopathological variables (tumor grade, perineural invasion, embolization and bone invasion). Data from medical records and diagnoses of 41 patients were reviewed. Histological sections were subjected to immunostaining using primary antibodies for human MMP-2, MMP-9 and CD31 and streptavidin-biotin-immunoperoxidase system. Histomorphometric analyses quantified positivity for MMPs (20 fields per slide, 100?points grade, ×200) and for CD31 (microvessels <50?µm in the area of the highest vascularization, 5 fields per slide, 100?points grade, ×400). Statistical design was composed by non-parametric Mann-Whitney U test (investigating the association between numerical variables and immunostainings), chi-square frequency test (in contingency tables), Fisher's exact test (when at least one expected frequency was less than 5 in 2×2 tables), Kaplan-Meier method (estimated probabilities of overall survival) and Iogrank test (comparison of survival curves), all with a significance level of 5%. There was a statistically significant correlation between immunostaining for MMP-2 and lymph node metastasis. Factors associated negatively with survival were N stage, histopathological grade, perineural invasion and immunostaining for MMP-9. There was no significant association between immunoexpression of CD31 and the other variables. The intensity of immunostaining for MMP-2 can be indicative of metastasis in lymph nodes and for MMP-9 of a lower probability of survival

Relevância:

10.00% 10.00%

Publicador:

Resumo:

CONTEXT AND OBJECTIVE: Epidemiology may help educators to face the challenge of establishing content guidelines for the curricula in medical schools. The aim was to develop learning objectives for a medical curriculum from an epidemiology database. DESIGN AND SETTING: Descriptive study assessing morbidity and mortality data, conducted in a private university in São Paulo. METHODS: An epidemiology database was used, with mortality and morbidity recorded as summaries of deaths and the World Health Organization's Disability-Adjusted Life Year (DALY). The scoring took into consideration probabilities for mortality and morbidity. RESULTS: The scoring presented a classification of health conditions to be used by a curriculum design committee, taking into consideration its highest and lowest quartiles, which corresponded respectively to the highest and lowest impact on morbidity and mortality. Data from three countries were used for international comparison and showed distinct results. The resulting scores indicated topics to be developed through educational taxonomy. CONCLUSION: The frequencies of the health conditions and their statistical treatment made it possible to identify topics that should be fully developed within medical education. The classification also suggested limits between topics that should be developed in depth, including knowledge and development of skills and attitudes, regarding topics that can be concisely presented at the level of knowledge.