889 resultados para Regression Trees
Resumo:
The objective of this paper is to compare the performance of twopredictive radiological models, logistic regression (LR) and neural network (NN), with five different resampling methods. One hundred and sixty-seven patients with proven calvarial lesions as the only known disease were enrolled. Clinical and CT data were used for LR and NN models. Both models were developed with cross validation, leave-one-out and three different bootstrap algorithms. The final results of each model were compared with error rate and the area under receiver operating characteristic curves (Az). The neural network obtained statistically higher Az than LR with cross validation. The remaining resampling validation methods did not reveal statistically significant differences between LR and NN rules. The neural network classifier performs better than the one based on logistic regression. This advantage is well detected by three-fold cross-validation, but remains unnoticed when leave-one-out or bootstrap algorithms are used.
Resumo:
This paper proposes a common and tractable framework for analyzingdifferent definitions of fixed and random effects in a contant-slopevariable-intercept model. It is shown that, regardless of whethereffects (i) are treated as parameters or as an error term, (ii) areestimated in different stages of a hierarchical model, or whether (iii)correlation between effects and regressors is allowed, when the sameinformation on effects is introduced into all estimation methods, theresulting slope estimator is also the same across methods. If differentmethods produce different results, it is ultimately because differentinformation is being used for each methods.
Resumo:
This paper shows how recently developed regression-based methods for thedecomposition of health inequality can be extended to incorporateindividual heterogeneity in the responses of health to the explanatoryvariables. We illustrate our method with an application to the CanadianNPHS of 1994. Our strategy for the estimation of heterogeneous responsesis based on the quantile regression model. The results suggest that thereis an important degree of heterogeneity in the association of health toexplanatory variables which, in turn, accounts for a substantial percentageof inequality in observed health. A particularly interesting finding isthat the marginal response of health to income is zero for healthyindividuals but positive and significant for unhealthy individuals. Theheterogeneity in the income response reduces both overall health inequalityand income related health inequality.
Resumo:
Temporal variation in the composition of ant assemblages (Hymenoptera, Formicidae) on trees in the Pantanal floodplain, Mato Grosso do Sul, Brazil. In this paper we investigate how seasonal flooding influences the composition of assemblages of ants foraging on trees in the Pantanal of Mato Grosso do Sul. During the flood in the Pantanal, a large area is covered by floods that are the main forces that regulate the pattern of diversity in these areas. However, the effects of such natural disturbances in the ant communities are poorly known. In this sense, the objective of this study was to evaluate the effect of temporal variation in assemblages of ants foraging on trees in the Pantanal of Miranda. Samples were collected during a year in two adjacent areas, one who suffered flooding during the wet period and another that did not suffer flooding throughout the year. In 10 sites for each evaluated habitat, five pitfall traps were installed at random in trees 25 m apart from each other. In the habitat with flooding, the highest richness was observed during the flooding period, while there was no significant change in richness in the area that does not suffer flooding. The diversity of species between the two evaluated habitats varied significantly during the two seasons. Most ants sampled belong to species that forage and nest in soil. This suggests that during the flood in flooded habitats, ants that did not migrate to higher areas without flooding adopt the strategy to search for resources in the tree canopy.
Resumo:
Summary points: - The bias introduced by random measurement error will be different depending on whether the error is in an exposure variable (risk factor) or outcome variable (disease) - Random measurement error in an exposure variable will bias the estimates of regression slope coefficients towards the null - Random measurement error in an outcome variable will instead increase the standard error of the estimates and widen the corresponding confidence intervals, making results less likely to be statistically significant - Increasing sample size will help minimise the impact of measurement error in an outcome variable but will only make estimates more precisely wrong when the error is in an exposure variable
Resumo:
Senescent stem-galls in trees of Eremanthus erythropappus as a resource for arboreal ants. Members of the dipteran families Tephritidae and Cecidomyiidae are inducers of stem-galls in Eremanthus erythropappus (DC.) MacLeish (Asteraceae), a tree common in the state of Minas Gerais, Brazil. When senescent, these galls become available to other organisms, such as ants. The present study describes a community of ants having benefitted from this process of ecosystem-engineering. The colonies in question inhabit the senescent stem-galls of trees of E. erythropappus and were examined in view of answering the following questions: i) whether the presence of stem-galls had any bearing on the richness, composition, or size of the ant colonies therein; and ii) whether the ants displayed any preferences regarding the shape and/or size of the galls. The study was conducted in populations of E. erythropappus trees near the city of Ouro Preto, MG. A total of 227 galls were collected, 14% of which were occupied by ants, belonging to eight different species. Half of the species occupied galls of both morphotypes (fusiform and globular), although we observed a marked preference for larger, globular shapes. Overall, our results showed the galls to be an effective and abundant resource, helping to maintain the diversity of the ants in the canopy. We also observed the occurrence of outstations and polydomic nests, although an in-depth examination of the influence of galls on this type of structuring has not been investigated.
Resumo:
The predictive potential of six selected factors was assessed in 72 patients with primary myelodysplastic syndrome using univariate and multivariate logistic regression analysis of survival at 18 months. Factors were age (above median of 69 years), dysplastic features in the three myeloid bone marrow cell lineages, presence of chromosome defects, all metaphases abnormal, double or complex chromosome defects (C23), and a Bournemouth score of 2, 3, or 4 (B234). In the multivariate approach, B234 and C23 proved to be significantly associated with a reduction in the survival probability. The similarity of the regression coefficients associated with these two factors means that they have about the same weight. Consequently, the model was simplified by counting the number of factors (0, 1, or 2) present in each patient, thus generating a scoring system called the Lausanne-Bournemouth score (LB score). The LB score combines the well-recognized and easy-to-use Bournemouth score (B score) with the chromosome defect complexity, C23 constituting an additional indicator of patient outcome. The predicted risk of death within 18 months calculated from the model is as follows: 7.1% (confidence interval: 1.7-24.8) for patients with an LB score of 0, 60.1% (44.7-73.8) for an LB score of 1, and 96.8% (84.5-99.4) for an LB score of 2. The scoring system presented here has several interesting features. The LB score may improve the predictive value of the B score, as it is able to recognize two prognostic groups in the intermediate risk category of patients with B scores of 2 or 3. It has also the ability to identify two distinct prognostic subclasses among RAEB and possibly CMML patients. In addition to its above-described usefulness in the prognostic evaluation, the LB score may bring new insights into the understanding of evolution patterns in MDS. We used the combination of the B score and chromosome complexity to define four classes which may be considered four possible states of myelodysplasia and which describe two distinct evolutional pathways.
Resumo:
Summary
Resumo:
Uncertainty quantification of petroleum reservoir models is one of the present challenges, which is usually approached with a wide range of geostatistical tools linked with statistical optimisation or/and inference algorithms. The paper considers a data driven approach in modelling uncertainty in spatial predictions. Proposed semi-supervised Support Vector Regression (SVR) model has demonstrated its capability to represent realistic features and describe stochastic variability and non-uniqueness of spatial properties. It is able to capture and preserve key spatial dependencies such as connectivity, which is often difficult to achieve with two-point geostatistical models. Semi-supervised SVR is designed to integrate various kinds of conditioning data and learn dependences from them. A stochastic semi-supervised SVR model is integrated into a Bayesian framework to quantify uncertainty with multiple models fitted to dynamic observations. The developed approach is illustrated with a reservoir case study. The resulting probabilistic production forecasts are described by uncertainty envelopes.
Resumo:
The new techniques proposed for agriculture in the Amazon region include rotational fallow systems enriched with leguminous trees and the replacement of biomass burning by mulching. Decomposition and nutrient release from mulch were studied using fine-mesh litterbags with five different leguminous species and the natural fallow vegetation as control. Samples from each treatment were analyzed for total C, N, P, K, Ca, Mg, lignin, cellulose content and soluble polyphenol at different sampling times over the course of one year. The decomposition rate constant varied with species and time. Weight loss from the decomposed litter bag material after 96 days was 30.1 % for Acacia angustissima, 32.7 % for Sclerolobium paniculatum, 33.9 % for Iinga edulis and the Fallow vegetation, 45.2 % for Acacia mangium and 63.6 % for Clitoria racemosa. Immobilization of N and P was observed in all studied treatments. Nitrogen mineralization was negatively correlated with phenol, C-to-N ratio, lignin + phenol/N ratio, and phenol/phosphorus ratios and with N content in the litterbag material. After 362 days of field incubation, an average (of all treatments), 3.3 % K, 32.2 % Ca and 22.4 % Mg remained in the mulch. Results confirm that low quality and high amount of organic C as mulch application are limiting for the quantity of energy available for microorganisms and increase the nutrient immobilization for biomass decomposition, which results in competition for nutrients with the crop plants.
Resumo:
Experimental and clinical evidence indicates that non-steroidal anti-inflammatory drugs and cyclooxygenase-2 inhibitors may have anti-cancer activities. Here we report on a patient with a metastatic melanoma of the leg who experienced a complete and sustained regression of skin metastases upon continuous single treatment with the cyclooxygenase-2 inhibitor rofecoxib. Our observations indicate that the inhibition of cyclooxygenase-2 can lead to the regression of disseminated skin melanoma metastases, even after failure of chemotherapy.
Resumo:
Understanding adaptive genetic responses to climate change is a main challenge for preserving biological diversity. Successful predictive models for climate-driven range shifts of species depend on the integration of information on adaptation, including that derived from genomic studies. Long-lived forest trees can experience substantial environmental change across generations, which results in a much more prominent adaptation lag than in annual species. Here, we show that candidate-gene SNPs (single nucleotide polymorphisms) can be used as predictors of maladaptation to climate in maritime pine (Pinus pinaster Aiton), an outcrossing long-lived keystone tree. A set of 18 SNPs potentially associated with climate, 5 of them involving amino acid-changing variants, were retained after performing logistic regression, latent factor mixed models, and Bayesian analyses of SNP-climate correlations. These relationships identified temperature as an important adaptive driver in maritime pine and highlighted that selective forces are operating differentially in geographically discrete gene pools. The frequency of the locally advantageous alleles at these selected loci was strongly correlated with survival in a common garden under extreme (hot and dry) climate conditions, which suggests that candidate-gene SNPs can be used to forecast the likely destiny of natural forest ecosystems under climate change scenarios. Differential levels of forest decline are anticipated for distinct maritime pine gene pools. Geographically defined molecular proxies for climate adaptation will thus critically enhance the predictive power of range-shift models and help establish mitigation measures for long-lived keystone forest trees in the face of impending climate change.
Resumo:
Tumor-regressions following tumor-associated-antigen vaccination in animal models contrast with the limited clinical outcomes in cancer patients. Most animal studies however used subcutaneous-tumor-models and questions arise as whether these are relevant for tumors growing in mucosae; whether specific mucosal-homing instructions are required; and how this may be influenced by the tumor.
Resumo:
La regressió basada en distàncies és un mètode de predicció que consisteix en dos passos: a partir de les distàncies entre observacions obtenim les variables latents, les quals passen a ser els regressors en un model lineal de mínims quadrats ordinaris. Les distàncies les calculem a partir dels predictors originals fent us d'una funció de dissimilaritats adequada. Donat que, en general, els regressors estan relacionats de manera no lineal amb la resposta, la seva selecció amb el test F usual no és possible. En aquest treball proposem una solució a aquest problema de selecció de predictors definint tests estadístics generalitzats i adaptant un mètode de bootstrap no paramètric per a l'estimació dels p-valors. Incluim un exemple numèric amb dades de l'assegurança d'automòbils.