934 resultados para Recursive Partitioning and Regression Trees (RPART)
Resumo:
En écologie, dans le cadre par exemple d’études des services fournis par les écosystèmes, les modélisations descriptive, explicative et prédictive ont toutes trois leur place distincte. Certaines situations bien précises requièrent soit l’un soit l’autre de ces types de modélisation ; le bon choix s’impose afin de pouvoir faire du modèle un usage conforme aux objectifs de l’étude. Dans le cadre de ce travail, nous explorons dans un premier temps le pouvoir explicatif de l’arbre de régression multivariable (ARM). Cette méthode de modélisation est basée sur un algorithme récursif de bipartition et une méthode de rééchantillonage permettant l’élagage du modèle final, qui est un arbre, afin d’obtenir le modèle produisant les meilleures prédictions. Cette analyse asymétrique à deux tableaux permet l’obtention de groupes homogènes d’objets du tableau réponse, les divisions entre les groupes correspondant à des points de coupure des variables du tableau explicatif marquant les changements les plus abrupts de la réponse. Nous démontrons qu’afin de calculer le pouvoir explicatif de l’ARM, on doit définir un coefficient de détermination ajusté dans lequel les degrés de liberté du modèle sont estimés à l’aide d’un algorithme. Cette estimation du coefficient de détermination de la population est pratiquement non biaisée. Puisque l’ARM sous-tend des prémisses de discontinuité alors que l’analyse canonique de redondance (ACR) modélise des gradients linéaires continus, la comparaison de leur pouvoir explicatif respectif permet entre autres de distinguer quel type de patron la réponse suit en fonction des variables explicatives. La comparaison du pouvoir explicatif entre l’ACR et l’ARM a été motivée par l’utilisation extensive de l’ACR afin d’étudier la diversité bêta. Toujours dans une optique explicative, nous définissons une nouvelle procédure appelée l’arbre de régression multivariable en cascade (ARMC) qui permet de construire un modèle tout en imposant un ordre hiérarchique aux hypothèses à l’étude. Cette nouvelle procédure permet d’entreprendre l’étude de l’effet hiérarchisé de deux jeux de variables explicatives, principal et subordonné, puis de calculer leur pouvoir explicatif. L’interprétation du modèle final se fait comme dans une MANOVA hiérarchique. On peut trouver dans les résultats de cette analyse des informations supplémentaires quant aux liens qui existent entre la réponse et les variables explicatives, par exemple des interactions entres les deux jeux explicatifs qui n’étaient pas mises en évidence par l’analyse ARM usuelle. D’autre part, on étudie le pouvoir prédictif des modèles linéaires généralisés en modélisant la biomasse de différentes espèces d’arbre tropicaux en fonction de certaines de leurs mesures allométriques. Plus particulièrement, nous examinons la capacité des structures d’erreur gaussienne et gamma à fournir les prédictions les plus précises. Nous montrons que pour une espèce en particulier, le pouvoir prédictif d’un modèle faisant usage de la structure d’erreur gamma est supérieur. Cette étude s’insère dans un cadre pratique et se veut un exemple pour les gestionnaires voulant estimer précisément la capture du carbone par des plantations d’arbres tropicaux. Nos conclusions pourraient faire partie intégrante d’un programme de réduction des émissions de carbone par les changements d’utilisation des terres.
Resumo:
Contexte: L'obésité chez les jeunes représente aujourd’hui un problème de santé publique à l’échelle mondiale. Afin d’identifier des cibles potentielles pour des stratégies populationnelles de prévention, les liens entre les caractéristiques du voisinage, l’obésité chez les jeunes et les habitudes de vie font de plus en plus l’objet d’études. Cependant, la recherche à ce jour comporte plusieurs incohérences. But: L’objectif général de cette thèse est d’étudier la contribution de différentes caractéristiques du voisinage relativement à l’obésité chez les jeunes et les habitudes de vie qui y sont associées. Les objectifs spécifiques consistent à: 1) Examiner les associations entre la présence de différents commerces d’alimentation dans les voisinages résidentiels et scolaires des enfants et leurs habitudes alimentaires; 2) Examiner comment l’exposition à certaines caractéristiques du voisinage résidentiel détermine l’obésité au niveau familial (chez le jeune, la mère et le père), ainsi que l’obésité individuelle pour chaque membre de la famille; 3) Identifier des combinaisons de facteurs de risque individuels, familiaux et du voisinage résidentiel qui prédisent le mieux l’obésité chez les jeunes, et déterminer si ces profils de facteurs de risque prédisent aussi un changement dans l’obésité après un suivi de deux ans. Méthodes: Les données proviennent de l’étude QUALITY, une cohorte québécoise de 630 jeunes, âgés de 8-10 ans au temps 1, avec une histoire d’obésité parentale. Les voisinages de 512 participants habitant la Région métropolitaine de Montréal ont été caractérisés à l’aide de : 1) données spatiales provenant du recensement et de bases de données administratives, calculées pour des zones tampons à partir du réseau routier et centrées sur le lieu de la résidence et de l’école; et 2) des observations menées par des évaluateurs dans le voisinage résidentiel. Les mesures du voisinage étudiées se rapportent aux caractéristiques de l’environnement bâti, social et alimentaire. L’obésité a été estimée aux temps 1 et 2 à l’aide de l’indice de masse corporelle (IMC) calculé à partir du poids et de la taille mesurés. Les habitudes alimentaires ont été mesurées au temps 1 à l'aide de trois rappels alimentaires. Les analyses effectuées comprennent, entres autres, des équations d'estimation généralisées, des régressions multiniveaux et des analyses prédictives basées sur des arbres de décision. Résultats: Les résultats démontrent la présence d’associations avec l’obésité chez les jeunes et les habitudes alimentaires pour certaines caractéristiques du voisinage. En particulier, la présence de dépanneurs et de restaurants-minutes dans le voisinage résidentiel et scolaire est associée avec de moins bonnes habitudes alimentaires. La présence accrue de trafic routier, ainsi qu’un faible niveau de prestige et d’urbanisation dans le voisinage résidentiel sont associés à l’obésité familiale. Enfin, les résultats montrent qu’habiter un voisinage obésogène, caractérisé par une défavorisation socioéconomique, la présence de moins de parcs et de plus de dépanneurs, prédit l'obésité chez les jeunes lorsque combiné à la présence de facteurs de risque individuels et familiaux. Conclusion: Cette thèse contribue aux écrits sur les voisinages et l’obésité chez les jeunes en considérant à la fois l'influence potentielle du voisinage résidentiel et scolaire ainsi que l’influence de l’environnement familial, en utilisant des méthodes objectives pour caractériser le voisinage et en utilisant des méthodes statistiques novatrices. Les résultats appuient en outre la notion que les efforts de prévention de l'obésité doivent cibler les multiples facteurs de risque de l'obésité chez les jeunes dans les environnements bâtis, sociaux et familiaux de ces jeunes.
Resumo:
Type and rate of fertilizers influence the level of soil organic carbon (Corg) and total nitrogen (Nt) markedly, but the effect on C and N partitioning into different pools is open to question. The objectives of the present work were to: (i) quantify the impact of fertilizer type and rate on labile, intermediate and passive C and N pools by using a combination of biological, chemical and mathematical methods; (ii) explain previously reported differences in the soil organic matter (SOM) levels between soils receiving farmyard manure with or without biodynamic preparations by using Corg time series and information on SOM partitioning; and (iii) quantify the long-term and short-term dynamics of SOM in density fractions and microbial biomass as affected by fertilizer type and rate and determine the incorporation of crop residues into labile SOM fractions. Samples were taken from a sandy Cambisol from the long-term fertilization trial in Darmstadt, Germany, founded in 1980. The nine treatments (four field replicates) were: straw incorporation plus application of mineral fertilizer (MSI) and application of rotted farmyard manure with (DYN) or without (FYM) addition of biodynamic preparations, each at high (140 – 150 kg N ha-1 year-1; MSIH, DYNH, FYMH), medium (100 kg N ha-1 year-1; MSIM, DYNM, FYMM) and low (50 – 60 kg N ha-1 year-1; MSIL, DYNL, FYML) rates. The main findings were: (i) The stocks of Corg (t ha-1) were affected by fertilizer type and rate and increased in the order MSIL (23.6), MSIM (23.7), MSIH (24.2) < FYML (25.3) < FYMM (28.1), FYMH (28.1). Stocks of Nt were affected in the same way (C/N ratio: 11). Storage of C and N in the modelled labile pools (turnover times: 462 and 153 days for C and N, respectively) were not influenced by the type of fertilizer (FYM and MSI) but depended significantly (p ≤ 0.05) on the application rate and ranged from 1.8 to 3.2 t C ha 1 (7 – 13% of Corg) and from 90 to 140 kg N ha-1 (4-5% of Nt). In the calculated intermediate pool (C/N ratio 7), stocks of C were markedly higher in FYM treatments (15-18 t ha-1) compared to MSI treatments (12-14 t ha-1). This showed that differences in SOM stocks in the sandy Cambisol induced by fertilizer rate may be short-lived in case of changing management, but differences induced by fertilizer type may persist for decades. (ii) Crop yields, estimated C inputs (1.5 t ha-1 year-1) with crop residue, microbial bio¬mass C (Cmic, 118 – 150 mg kg-1), microbial biomass N (17 – 20 mg kg-1) and labile C and N pools did not differ significantly between FYM and DYN treatments. However, labile C increased linearly with application rate (R2 = 0.53) from 7 to 11% of Corg. This also applied for labile N (3.5 to 4.9% of Nt). The higher contents of Corg in DYN treatments existed since 1982, when the first sampling was conducted for all individual treatments. Contents of Corg between DYN and FYM treatments con-verged slightly since then. Furthermore, at least 30% of the difference in Corg was located in the passive pool where a treatment effect could be excluded. Therefore, the reported differences in Corg contents existed most likely since the beginning of the experiment and, as a single factor of biodynamic agriculture, application of bio-dynamic preparations had no effect on SOM stocks. (iii) Stocks of SOM, light fraction organic C (LFOC, ρ ≤ 2.0 g cm-3), light fraction organic N and Cmic decreased in the order FYMH > FYML > MSIH, MSIL for all sampling dates in 2008 (March, May, September, December). However, statistical significance of treatment effects differed between the dates, probably due to dif-ferences in the spatial variation throughout the year. The high proportion of LFOC on total Corg stocks (45 – 55%) highlighted the importance of selective preservation of OM as a stabilization mechanism in this sandy Cambisol. The apparent turnover time of LFOC was between 21 and 32 years, which agreed very well with studies with substantially longer vegetation change compared to our study. Overall, both approaches; (I) the combination of incubation, chemical fractionation and simple modelling and (II) the density fractionation; provided complementary information on the partitioning of SOM into pools of different stability. The density fractionation showed that differences in Corg stocks between FYM and MSI treatments were mainly located in the light fraction, i.e. induced by higher recalcitrance of the organic input in the FYM treatments. Moreover, the use of the combination of biological, chemical and mathematical methods indicated that effects of fertilizer rate on total Corg and Nt stocks may be short-lived, but that the effect of fertilizer type may persist for longer time spans in the sandy Cambisol.
Resumo:
The present essay’s central argument or hypothesis is, consequently, that the mechanisms accelerating a wealth concentrating and exclusionary economy centred on the benefit and overprotection of big business—with a corresponding plundering of resources that are vital for life—generated forms of loss and regression in the right to healthcare and the dismantling of institutional protections. These are all expressed in indicators from 1990-2005, which point not only to the deterioration of healthcare programs and services but also to the undermining of the general conditions of life (social reproduction) and, in contrast to the reports and predictions of the era’s governments, a stagnation or deterioration in health indicators, especially for those most sensitive to the crisis. The present study’s argument is linked together across distinct chapters. First, we undertake the necessary clarification of the categories central to the understanding of a complex issue; clarifying the concept of health itself and its determinants, emphasizing the necessity of taking on an integral understanding as a fundamental prerequisite to unravelling what documents and reports from this era either leave unsaid or distort. Based on that analysis, we will explain the harmful effects of global economic acceleration, the monopolization and pillaging of strategic healthcare goods; not only those which directly place obstacles on the access to health services, but also those like the destructuration of small economies, linked to the impoverishment and worsening of living modes. Thinking epidemiologically, we intend to show signs of the deterioration of broad collectivities’ ways of life as a result of the mechanisms of acceleration and pillage. We will then collect disparate evidence of the deterioration of human health and ecosystems to, finally, establish the most urgent conclusions about this unfortunate period of our social and medical history.
Resumo:
In order to evaluate the flying capacity and nest site selection of Angiopolybia pallens (Lepeletier, 1836), we made 17 incursions (136 hours of sample efforts) in Atlantic Rain Forest environments in Bahia state. Our data show this wasp prefers to nest on wide leaves of bushes and short trees (nests between 0.30 and 3m from the ground) placed in half-shady environments (clearings and shadowed cultivations). The logistic regression model using Quasi-Newton method provided a good description of the flying capacity observed in A. pallens (x 2 = 91.52; p≪0.001). According to the logistic regression model, the A. pallens flight autonomy is low, flying for short distances and with an effective radius of action of about 24m measured from their nests, which means a foraging area of nearly 1,800 m 2.
Resumo:
This study's objective is to evaluate the photosynthates partitioning and source-sink relationships, in vegetative and reproductive twigs of sweet passion fruit, using the carbon-13, stable isotope of carbon. The leaves of vegetative and reproductive twigs were placed in a sealed chamber and 13CO 2 were injected for 30 minutes. After six hours, the different organs of the twig were collected and immersed in liquid nitrogen (-196°C). All plant samples were oven-dried, powdered and combusted in an elemental analyzer. The results showed that the methodology was efficient to evaluate the partitioning of photosynthates. In the vegetative twigs, the new open leaves, the closed leaves and growth meristem were the principal sink for the source leaves enriched with 13CO 2. In the reproductive twigs with only flowers buds, the source-sink relationship was changed and the flowers buds turn to be the principal sink and the growing leaves were secondary sinks. The presence of young fruits changed again the source-sink relationship of the twig, and they were the principal sinks. The apical leaves, with 60% of final leaf area were also a sink for photosynthates.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The main scope of my PhD is the reconstruction of the large-scale bivalve phylogeny on the basis of four mitochondrial genes, with samples taken from all major groups of the class. To my knowledge, it is the first attempt of such a breadth in Bivalvia. I decided to focus on both ribosomal and protein coding DNA sequences (two ribosomal encoding genes -12s and 16s -, and two protein coding ones - cytochrome c oxidase I and cytochrome b), since either bibliography and my preliminary results confirmed the importance of combined gene signals in improving evolutionary pathways of the group. Moreover, I wanted to propose a methodological pipeline that proved to be useful to obtain robust results in bivalves phylogeny. Actually, best-performing taxon sampling and alignment strategies were tested, and several data partitioning and molecular evolution models were analyzed, thus demonstrating the importance of molding and implementing non-trivial evolutionary models. In the line of a more rigorous approach to data analysis, I also proposed a new method to assess taxon sampling, by developing Clarke and Warwick statistics: taxon sampling is a major concern in phylogenetic studies, and incomplete, biased, or improper taxon assemblies can lead to misleading results in reconstructing evolutionary trees. Theoretical methods are already available to optimize taxon choice in phylogenetic analyses, but most involve some knowledge about genetic relationships of the group of interest, or even a well-established phylogeny itself; these data are not always available in general phylogenetic applications. The method I proposed measures the "phylogenetic representativeness" of a given sample or set of samples and it is based entirely on the pre-existing available taxonomy of the ingroup, which is commonly known to investigators. Moreover, it also accounts for instability and discordance in taxonomies. A Python-based script suite, called PhyRe, has been developed to implement all analyses.
Resumo:
Gregarine apicomplexans are a diverse group of single-celled parasites that have feeding stages (trophozoites) and gamonts that generally inhabit the extracellular spaces of invertebrate hosts living in marine, freshwater, and terrestrial environments. Inferences about the evolutionary morphology of gregarine apicomplexans are being incrementally refined by molecular phylogenetic data, which suggest that several traits associated with the feeding cells of gregarines arose by convergent evolution. The study reported here supports these inferences by showing how molecular data reveals traits that are phylogenetically misleading within the context of comparative morphology alone. We examined the ultrastructure and molecular phylogenetic positions of two gregarine species isolated from the spaghetti worm Thelepus japonicus: Selenidium terebellae Ray 1930 and S. melongena n. sp. The ultrastructural traits of S. terebellae were very similar to other species of Selenidium sensu stricto, such as having vermiform trophozoites with an apical complex, few epicytic folds, and a dense array of microtubules underlying the trilayered pellicle. By contrast, S. melongena n. sp. lacked a comparably discrete assembly of subpellicular microtubules, instead employing a system of fibrils beneath the cell surface that supported a relatively dense array of helically arranged epicytic folds. Molecular phylogenetic analyses of small subunit rDNA sequences derived from single-cell PCR unexpectedly demonstrated that these two gregarines are close sister species. The ultrastructural differences between these two species were consistent with the fact that S. terebellae infects the inner lining of the host intestines, and S. melongena n. sp. primarily inhabits the coelom, infecting the outside wall of the host intestine. Altogether, these data demonstrate a compelling case of niche partitioning and associated morphological divergence in marine gregarine apicomplexans. (C) 2014 Elsevier GmbH. All rights reserved.
Resumo:
Background mortality is an essential component of any forest growth and yield model. Forecasts of mortality contribute largely to the variability and accuracy of model predictions at the tree, stand and forest level. In the present study, I implement and evaluate state-of-the-art techniques to increase the accuracy of individual tree mortality models, similar to those used in many of the current variants of the Forest Vegetation Simulator, using data from North Idaho and Montana. The first technique addresses methods to correct for bias induced by measurement error typically present in competition variables. The second implements survival regression and evaluates its performance against the traditional logistic regression approach. I selected the regression calibration (RC) algorithm as a good candidate for addressing the measurement error problem. Two logistic regression models for each species were fitted, one ignoring the measurement error, which is the “naïve” approach, and the other applying RC. The models fitted with RC outperformed the naïve models in terms of discrimination when the competition variable was found to be statistically significant. The effect of RC was more obvious where measurement error variance was large and for more shade-intolerant species. The process of model fitting and variable selection revealed that past emphasis on DBH as a predictor variable for mortality, while producing models with strong metrics of fit, may make models less generalizable. The evaluation of the error variance estimator developed by Stage and Wykoff (1998), and core to the implementation of RC, in different spatial patterns and diameter distributions, revealed that the Stage and Wykoff estimate notably overestimated the true variance in all simulated stands, but those that are clustered. Results show a systematic bias even when all the assumptions made by the authors are guaranteed. I argue that this is the result of the Poisson-based estimate ignoring the overlapping area of potential plots around a tree. Effects, especially in the application phase, of the variance estimate justify suggested future efforts of improving the accuracy of the variance estimate. The second technique implemented and evaluated is a survival regression model that accounts for the time dependent nature of variables, such as diameter and competition variables, and the interval-censored nature of data collected from remeasured plots. The performance of the model is compared with the traditional logistic regression model as a tool to predict individual tree mortality. Validation of both approaches shows that the survival regression approach discriminates better between dead and alive trees for all species. In conclusion, I showed that the proposed techniques do increase the accuracy of individual tree mortality models, and are a promising first step towards the next generation of background mortality models. I have also identified the next steps to undertake in order to advance mortality models further.
Resumo:
Pollen stratigraphy of a core 270 cm long from Lake Dalgoto at 2310 m in the Northern Pirin Mountains, southern Bulgaria, was treated by optimal partitioning and compared to a broken-stick model to reveal statistically significant pollen zones. The vegetational reconstructions presented here are based on pollen percentages and pollen influx, on comparisons of modern and fossil pollen spectra, and on macrofossil dates from other sites in the mountains. During the Younger Dryas (11000–10200 14C yr BP), an open xerophytic herb vegetation with Artemisia and Chenopodiaceae was widely developed around the lake. Deciduous trees growing at lower elevations contributed to the pollen rain deposited at the higher-elevation sampling sites. Specifically, from 10200 to 8500 yr BP, Quercus, Ulmus, Tilia and Betula expanded rapidly at low and intermediate elevations, and between 8500 and 6500 yr BP they extended to higher elevations close to the upper forest limit, which was formed by Betula pendula at about 1900 m. Coniferous species were limited in the region at this time. After 6500 yr BP, the expansion of conifers (Pinus peuce, P. sylvestris, P. mugo, Abies alba) at high elevations forced the deciduous trees downward. Between 6500 and 3000 yr BP, the forest limit at 2200 m was formed by P. peuce, and A. alba had its maximum vertical range up to 1900 m. Later the abundance and vertical range of P. peuce and A. alba were reduced. After 3000 yr BP, Picea expanded.
Resumo:
Background: Squamous cell carcinoma of the oropharynx (SCCOP) is characterized by local tumor aggressiveness, high recurrence rates, a high incidence of second primary tumors, and medical comorbidities. Significant trends in demographic and clinical characteristics as well as survival among SCCOP patients have been observed over time, likely owing to the changing etiology of the disease. Human papillomavirus type 16 (HPV16) infection is associated with a significant proportion of these cancers. Biomarkers that may aid in identifying patients that are at higher risk of recurrence and death are important so that these patients may be followed more closely to improve their quality of life. ^ Study population and methods: The retrospective review (Specific Aim 2) included 3891 newly diagnosed, previously untreated patients presenting to our institution between 1955 and 2004. A total of 2299 patients treated at our institution were included in survival and recursive partitioning analysis. The prospective cohort study (Specific Aim 3) included 266 patients presenting to our institution between January 2006 and September 2009. ^ Results: The results from the retrospective review showed that over time, patients presented at younger ages and were more likely to have base of tongue/tonsil tumors and to be never/former smokers and moreover survival improved significantly over time. In survival and recursive partitioning analyses, the TNM staging system was efficient in prognosticating patients prior to 1995. However, in the recent decade, the TNM staging system was completely inadequate. The factors having the greatest positive effect on overall survival since 1995 were those common to HPV-associated SCCOP. The results from the prospective cohort study indicate that patients with high nodal stage and those with late stage disease have increased levels of pretreatment serum HPV DNA. ^ Conclusions: We saw a distinct improvement in survival among SCCOP patients over the past 50 years at our institution. The main factors contributing to this were changes in clinical characteristics, in particular surrogates for HPV status. The current TNM staging system for SCCOP is inadequate and incorporation of HPV status (and perhaps smoking status) is encouraged. Furthermore, although pretreatment circulating levels of HPV DNA was associated with higher N category and overall disease stage, it has limited utility as a marker for recurrence among SCCOP patients.^
Resumo:
Some verification and validation techniques have been evaluated both theoretically and empirically. Most empirical studies have been conducted without subjects, passing over any effect testers have when they apply the techniques. We have run an experiment with students to evaluate the effectiveness of three verification and validation techniques (equivalence partitioning, branch testing and code reading by stepwise abstraction). We have studied how well able the techniques are to reveal defects in three programs. We have replicated the experiment eight times at different sites. Our results show that equivalence partitioning and branch testing are equally effective and better than code reading by stepwise abstraction. The effectiveness of code reading by stepwise abstraction varies significantly from program to program. Finally, we have identified project contextual variables that should be considered when applying any verification and validation technique or to choose one particular technique.
Resumo:
"List of books referred to..." v. 1, p. cxc-ccxxvi.