974 resultados para Classification Tree Pruning


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Pós-graduação em Agronomia (Ciência do Solo) - FCAV

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Objective Various nonvalidated criteria for disease flare have been used in studies of gout. Our objective was to develop empirical definitions for a gout flare from patient-reported features. Methods Possible elements for flare criteria were previously reported. Data were collected from 210 gout patients at 8 international sites to evaluate potential gout flare criteria against the gold standard of an expert rheumatologist definition. Flare definitions based on the presence of the number of criteria independently associated with the flare and classification and regression tree approaches were developed. Results The mean +/- SD age of the study participants was 56.2 +/- 15 years, 207 of them (98%) were men, and 54 of them (26%) had flares of gout. The presence of any patient-reported warm joint, any patient-reported swollen joint, patient-reported pain at rest score of >3 (010 scale), and patient-reported flare were independently associated with the study gold standard. The greatest discriminating power was noted for the presence of 3 or more of the above 4 criteria (sensitivity 91% and specificity 82%). Requiring all 4 criteria provided the highest specificity (96%) and positive predictive value (85%). A classification tree identified pain at rest with a score of >3, followed by patient self-reported flare, as the rule associated with the gold standard (sensitivity 83% and specificity 90%). Conclusion We propose definitions for a disease flare based on self-reported items in patients previously diagnosed as having gout. Patient-reported flare, joint pain at rest, warm joints, and swollen joints were most strongly associated with presence of a gout flare. These provisional definitions will next be validated in clinical trials.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Cesarean Delivery (CD) rates are rising in many parts of the world. In order to define strategies to reduce them, it is important to explore the role of clinical and organizational factors. This thesis has the objective to describe the contemporary CD practice and study clinical and organizational variables as determinants of CD in all women who gave birth between 2005 and June 2010 in the Emilia Romagna region (Italy). All hospital discharge abstracts of women who delivered between 2005 and mid 2010 in the region were selected and linked with birth certificates. In addition to descriptive statistics, in order to study the role of clinical and organizational variables (teaching or non-teaching hospital, birth volumes, time and day of delivery) multilevel Poisson regression models and a classification tree were used. A substantial inter-hospital variability in CD rate was found, and this was only partially explained by the considered variables. The most important risk factors of CD were: previous CD (RR 4,95; 95%CI: 4,85-5,05), cord prolapse (RR 3,51; 95% CI:2,96-4,16), and malposition/malpresentation (RR 2,72; 95%CI: 2,66-2,77). Delivery between 7 pm and 7 am and during non working days protect against CD in all subgroups including those with a small number of elective CDs while delivery at a teaching hospital and birth volumes were not statistically significant risk factors. The classification tree shows that previous CD and malposition/malpresentation are the most important variables discriminating between high and low risk of CD. These results indicate that other not considered factors might explain CD variability and do not provide clear evidence that small hospitals have a poor performance in terms of CD rate. Some strategies to reduce CD could be found by focusing on the differences in delivery practice between day and night and between working and no-working day deliveries.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

BACKGROUND & AIMS: Age is frequently discussed as negative host factor to achieve a sustained virological response (SVR) to antiviral therapy of chronic hepatitis C. However, elderly patients often show advanced fibrosis/cirrhosis as known negative predictive factor. The aim of this study was to assess age as an independent predictive factor during antiviral therapy. METHODS: Overall, 516 hepatitis C patients were treated with pegylated interferon-α and ribavirin, thereof 66 patients ≥60 years. We analysed the impact of host factors (age, gender, fibrosis, haemoglobin, previous hepatitis C treatment) and viral factors (genotype, viral load) on SVR per therapy course by performing a generalized estimating equations (GEE) regression modelling, a matched pair analysis and a classification tree analysis. RESULTS: Overall, SVR per therapy course was 42.9 and 26.1%, respectively, in young and elderly patients with hepatitis C virus (HCV) genotypes 1/4/6. The corresponding figures for HCV genotypes 2/3 were 74.4 and 84%. In the GEE model, age had no significant influence on achieving SVR. In matched pair analysis, SVR was not different in young and elderly patients (54.2 and 55.9% respectively; P = 0.795 in binominal test). In classification tree analysis, age was not a relevant splitting variable. CONCLUSIONS: Age is not a significant predictive factor for achieving SVR, when relevant confounders are taken into account. As life expectancy in Western Europe at age 60 is more than 20 years, it is reasonable to treat chronic hepatitis C in selected elderly patients with relevant fibrosis or cirrhosis but without major concomitant diseases, as SVR improves survival and reduces carcinogenesis.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

INTRODUCTION Every joint registry aims to improve patient care by identifying implants that have an inferior performance. For this reason, each registry records the implant name that has been used in the individual patient. In most registries, a paper-based approach has been utilized for this purpose. However, in addition to being time-consuming, this approach does not account for the fact that failure patterns are not necessarily implant specific but can be associated with design features that are used in a number of implants. Therefore, we aimed to develop and evaluate an implant product library that allows both time saving barcode scanning on site in the hospital for the registration of the implant components and a detailed description of implant specifications. MATERIALS AND METHODS A task force consisting of representatives of the German Arthroplasty Registry, industry, and computer specialists agreed on a solution that allows barcode scanning of implant components and that also uses a detailed standardized classification describing arthroplasty components. The manufacturers classified all their components that are sold in Germany according to this classification. The implant database was analyzed regarding the completeness of components by algorithms and real-time data. RESULTS The implant library could be set up successfully. At this point, the implant database includes more than 38,000 items, of which all were classified by the manufacturers according to the predefined scheme. Using patient data from the German Arthroplasty Registry, several errors in the database were detected, all of which were corrected by the respective implant manufacturers. CONCLUSIONS The implant library that was developed for the German Arthroplasty Registry allows not only on-site barcode scanning for the registration of the implant components but also its classification tree allows a sophisticated analysis regarding implant characteristics, regardless of brand or manufacturer. The database is maintained by the implant manufacturers, thereby allowing registries to focus their resources on other areas of research. The database might represent a possible global model, which might encourage harmonization between joint replacement registries enabling comparisons between joint replacement registries.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Species distribution models (SDM) predict species occurrence based on statistical relationships with environmental conditions. The R-package biomod2 which includes 10 different SDM techniques and 10 different evaluation methods was used in this study. Macroalgae are the main biomass producers in Potter Cove, King George Island (Isla 25 de Mayo), Antarctica, and they are sensitive to climate change factors such as suspended particulate matter (SPM). Macroalgae presence and absence data were used to test SDMs suitability and, simultaneously, to assess the environmental response of macroalgae as well as to model four scenarios of distribution shifts by varying SPM conditions due to climate change. According to the averaged evaluation scores of Relative Operating Characteristics (ROC) and True scale statistics (TSS) by models, those methods based on a multitude of decision trees such as Random Forest and Classification Tree Analysis, reached the highest predictive power followed by generalized boosted models (GBM) and maximum-entropy approaches (Maxent). The final ensemble model used 135 of 200 calculated models (TSS > 0.7) and identified hard substrate and SPM as the most influencing parameters followed by distance to glacier, total organic carbon (TOC), bathymetry and slope. The climate change scenarios show an invasive reaction of the macroalgae in case of less SPM and a retreat of the macroalgae in case of higher assumed SPM values.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Areas of the landscape that are priorities for conservation should be those that are both vulnerable to threatening processes and that if lost or degraded, will result in conservation targets being compromised. While much attention is directed towards understanding the patterns of biodiversity, much less is given to determining the areas of the landscape most vulnerable to threats. We assessed the relative vulnerability of remaining areas of native forest to conversion to plantations in the ecologically significant temperate rainforest region of south central Chile. The area of the study region is 4.2 million ha and the extent of plantations is approximately 200000 ha. First, the spatial distribution of native forest conversion to plantations was determined. The variables related to the spatial distribution of this threatening process were identified through the development of a classification tree and the generation of a multivariate. spatially explicit, statistical model. The model of native forest conversion explained 43% of the deviance and the discrimination ability of the model was high. Predictions were made of where native forest conversion is likely to occur in the future. Due to patterns of climate, topography, soils and proximity to infrastructure and towns, remaining forest areas differ in their relative risk of being converted to plantations. Another factor that may increase the vulnerability of remaining native forest in a subset of the study region is the proposed construction of a highway. We found that 90% of the area of existing plantations within this region is within 2.5 km of roads. When the predictions of native forest conversion were recalculated accounting for the construction of this highway, it was found that: approximately 27000 ha of native forest had an increased probability of conversion. The areas of native forest identified to be vulnerable to conversion are outside of the existing reserve network. (C) 2004 Elsevier Ltd. All tights reserved.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

An expanding human population and associated demands for goods and services continues to exert an increasing pressure on ecological systems. Although the rate of expansion of agricultural lands has slowed since 1960, rapid deforestation still occurs in many tropical countries, including Colombia. However, the location and extent of deforestation and associated ecological impacts within tropical countries is often not well known. The primary aim of this study was to obtain an understanding of the spatial patterns of forest conversion for agricultural land uses in Colombia. We modeled native forest conversion in Colombia at regional and national-levels using logistic regression and classification trees. We investigated the impact of ignoring the regional variability of model parameters, and identified biophysical and socioeconomic factors that best explain the current spatial pattern and inter-regional variation in forest cover. We validated our predictions for the Amazon region using MODIS satellite imagery. The regional-level classification tree that accounted for regional heterogeneity had the greatest discrimination ability. Factors related to accessibility (distance to roads and towns) were related to the presence of forest cover, although this relationship varied regionally. In order to identify areas with a high risk of deforestation, we used predictions from the best model, refined by areas with rural population growth rates of > 2%. We ranked forest ecosystem types in terms of levels of threat of conversion. Our results provide useful inputs to planning for biodiversity conservation in Colombia, by identifying areas and ecosystem types that are vulnerable to deforestation. Several of the predicted deforestation hotspots coincide with areas that are outstanding in terms of biodiversity value.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

La chirurgia conservativa o l’esofagectomia, possono essere indicate per il trattamento della disfagia nell’acalasia scompensata. L’esofagectomia è inoltre finalizzata alla prevenzione dello sviluppo del carcinoma esofageo. Gli obiettivi erano: a) definire prevalenza e fattori di rischio per il carcinoma epidermoidale; b) confrontare i risultati clinici e funzionali di Heller-Dor con pull-down della giunzione esofagogastrica (PD-HD) ed esofagectomia. I dati in analisi, ricavati da un database istituito nel 1973 e finalizzato alla ricerca prospettica, sono stati: a) le caratteristiche cliniche, radiologiche ed endoscopiche di 573 pazienti acalasici; b) il risultato oggettivo e la qualità della vita, definita mediante questionario SF-36, dopo intervento di PD-HD (29 pazienti) e dopo esofagectomia per acalasia scompensata o carcinoma (20 pazienti). Risultati: a) sono stati diagnosticati 17 carcinomi epidermoidali ed un carcinosarcoma (3.14%). Fattori di rischio sono risultati essere: il diametro esofageo (p<0.001), il ristagno esofageo (p<0.01) e la durata dei sintomi dell’acalasia (p<0.01). Secondo l’albero di classificazione, soltanto i pazienti con esito insufficiente del trattamento ai controlli clinico-strumentali ed acalasia sigmoidea presentavano un rischio di sviluppare il carcinoma squamocellulare del 52.9%. b) Non sono state riscontrate differenze statisticamente significative tra i pazienti sottoposti ad intervento conservativo e quelli trattati con esofagectomia per quanto concerne l’esito dell’intervento valutato mediante parametri oggettivi (p=0.515). L’analisi della qualità della vita non ha evidenziato differenze statisticamente significative per quanto concerne i domini GH, RP, PF e BP. Punteggi significativamente più elevati nei domini RE (p=0.012), VT (p<0.001), MH (p=0.001) e SF (p=0.014) sono stati calcolati per PD-HD rispetto alle esofagectomie. In conclusione, PD-HD determina una miglior qualità della vita, ed è pertanto la procedura di scelta per i pazienti con basso rischio di cancro. A coloro che abbiano già raggiunto i parametri di rischio, si offrirà l’esofagectomia o l'opzione conservativa seguita da protocolli di follow-up.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The impellent global environmental issues related to plastic materials can be addressed by following two different approaches: i) the development of synthetic strategies towards novel bio-based polymers, deriving from biomasses and thus identifiable as CO2-neutral materials, and ii) the development of new plastic materials, such as biocomposites, which are bio-based and biodegradable and therefore able to counteract the accumulation of plastic waste. In this framework, this dissertation presents extensive research efforts have been devoted to the synthesis and characterization of polyesters based on various bio-based monomers, including ω-pentadecalactone, vanillic acid, 2,5-furan dicarboxylic acid, and 5-hydroxymethylfurfural. With the aim of achieving high molecular weight polyesters, different synthetic strategies have been used as melt polycondensation, enzymatic polymerization, ring-opening polymerization and chain extension reaction. In particular, poly(ethylene vanillate) (PEV), poly(ω-pentadecalactone) (PPDL), poly(ethylene vanillate-co-pentadecalactone) (P(EV-co-PDL)), poly(2-hydroxymethyl 5-furancarboxylate) (PHMF), poly(ethylene 2,5-furandicarboxylate) (PEF) with different amount of diethylene glycol (DEG) unit amount, poly(propylene 2,5-furandicarboxylate) (PPF), poly(hexamethylene 2,5-furandicarboxylate), (PHF) have been prepared and extensively characterized. To improve the lacks of poly(hydroxybutyrate-co-valerate) (PHBV), its minimal formulations with natural additives and its blending with medium chain length PHAs (mcl-PHAs) have been tested. Additionally, this dissertation presents new biocomposites based on polylactic acid (PLA), poly(butylene succinate) (PBS), and PHBV, which are polymers both bio-based and biodegradable. To maintain their biodegradability only bio-fillers have been taken into account as reinforcing agents. Moreover, the commitment to sustainability has further limited the selection and led to the exclusive use of agricultural waste as fillers. Detailly, biocomposites have been obtained and discussed by using the following materials: PLA and agro-wastes like tree pruning, potato peels, and hay leftovers; PBS and exhausted non-compliant coffee green beans; PHBV and industrial starch extraction residues.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

The Prism family of algorithms induces modular classification rules which, in contrast to decision tree induction algorithms, do not necessarily fit together into a decision tree structure. Classifiers induced by Prism algorithms achieve a comparable accuracy compared with decision trees and in some cases even outperform decision trees. Both kinds of algorithms tend to overfit on large and noisy datasets and this has led to the development of pruning methods. Pruning methods use various metrics to truncate decision trees or to eliminate whole rules or single rule terms from a Prism rule set. For decision trees many pre-pruning and postpruning methods exist, however for Prism algorithms only one pre-pruning method has been developed, J-pruning. Recent work with Prism algorithms examined J-pruning in the context of very large datasets and found that the current method does not use its full potential. This paper revisits the J-pruning method for the Prism family of algorithms and develops a new pruning method Jmax-pruning, discusses it in theoretical terms and evaluates it empirically.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

The Prism family of algorithms induces modular classification rules in contrast to the Top Down Induction of Decision Trees (TDIDT) approach which induces classification rules in the intermediate form of a tree structure. Both approaches achieve a comparable classification accuracy. However in some cases Prism outperforms TDIDT. For both approaches pre-pruning facilities have been developed in order to prevent the induced classifiers from overfitting on noisy datasets, by cutting rule terms or whole rules or by truncating decision trees according to certain metrics. There have been many pre-pruning mechanisms developed for the TDIDT approach, but for the Prism family the only existing pre-pruning facility is J-pruning. J-pruning not only works on Prism algorithms but also on TDIDT. Although it has been shown that J-pruning produces good results, this work points out that J-pruning does not use its full potential. The original J-pruning facility is examined and the use of a new pre-pruning facility, called Jmax-pruning, is proposed and evaluated empirically. A possible pre-pruning facility for TDIDT based on Jmax-pruning is also discussed.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In this paper, we consider active sampling to label pixels grouped with hierarchical clustering. The objective of the method is to match the data relationships discovered by the clustering algorithm with the user's desired class semantics. The first is represented as a complete tree to be pruned and the second is iteratively provided by the user. The active learning algorithm proposed searches the pruning of the tree that best matches the labels of the sampled points. By choosing the part of the tree to sample from according to current pruning's uncertainty, sampling is focused on most uncertain clusters. This way, large clusters for which the class membership is already fixed are no longer queried and sampling is focused on division of clusters showing mixed labels. The model is tested on a VHR image in a multiclass classification setting. The method clearly outperforms random sampling in a transductive setting, but cannot generalize to unseen data, since it aims at optimizing the classification of a given cluster structure.