956 resultados para random forest regression
Resumo:
Effects of conspecific neighbours on survival and growth of trees have been found to be related to species abundance. Both positive and negative relationships may explain observed abundance patterns. Surprisingly, it is rarely tested whether such relationships could be biased or even spurious due to transforming neighbourhood variables or influences of spatial aggregation, distance decay of neighbour effects and standardization of effect sizes. To investigate potential biases, communities of 20 identical species were simulated with log-series abundances but without species-specific interactions. No relationship of conspecific neighbour effects on survival or growth with species abundance was expected. Survival and growth of individuals was simulated in random and aggregated spatial patterns using no, linear, or squared distance decay of neighbour effects. Regression coefficients of statistical neighbourhood models were unbiased and unrelated to species abundance. However, variation in the number of conspecific neighbours was positively or negatively related to species abundance depending on transformations of neighbourhood variables, spatial pattern and distance decay. Consequently, effect sizes and standardized regression coefficients, often used in model fitting across large numbers of species, were also positively or negatively related to species abundance depending on transformation of neighbourhood variables, spatial pattern and distance decay. Tests using randomized tree positions and identities provide the best benchmarks by which to critically evaluate relationships of effect sizes or standardized regression coefficients with tree species abundance. This will better guard against potential misinterpretations.
Finite mixture regression model with random effects: application to neonatal hospital length of stay
Resumo:
A two-component mixture regression model that allows simultaneously for heterogeneity and dependency among observations is proposed. By specifying random effects explicitly in the linear predictor of the mixture probability and the mixture components, parameter estimation is achieved by maximising the corresponding best linear unbiased prediction type log-likelihood. Approximate residual maximum likelihood estimates are obtained via an EM algorithm in the manner of generalised linear mixed model (GLMM). The method can be extended to a g-component mixture regression model with the component density from the exponential family, leading to the development of the class of finite mixture GLMM. For illustration, the method is applied to analyse neonatal length of stay (LOS). It is shown that identification of pertinent factors that influence hospital LOS can provide important information for health care planning and resource allocation. (C) 2002 Elsevier Science B.V. All rights reserved.
Resumo:
Context Understanding connectivity patterns in relation to habitat fragmentation is essential to landscape management. However, connectivity is often judged from expert opinion or species occurrence patterns, with very few studies considering the actual movements of individuals. Path selection functions provide a promising tool to infer functional connectivity from animal movement data, but its practical application remains scanty. Objectives We aimed to describe functional connectivity patterns in a forest carnivore using path-level analysis, and to explore how connectivity is affected by land cover patterns and road networks. Methods We radiotracked 22 common genets in a mixed forest-agricultural landscape of southern Portugal. We developed path selection functions discriminating between observed and random paths in relation to landscape variables. These functions were used together with land cover information to map conductance surfaces. Results Genets moved preferentially within forest patches and close to riparian habitats. Functional connectivity declined with increasing road density, but increased with the proximity of culverts, viaducts and bridges. Functional connectivity was favoured by large forest patches, and by the presence of riparian areas providing corridors within open agricultural land. Roads reduced connectivity by dissecting forest patches, but had less effect on riparian corridors due to the presence of crossing structures. Conclusions Genet movements were jointly affected by the spatial distribution of suitable habitats, and the presence of a road network dissecting such habitats and creating obstacles in areas otherwise permeable to animal movement. Overall, the study showed the value of path-level analysis to assess functional connectivity patterns in human-modified landscapes.
Resumo:
Every year, autochthonous cases of Plasmodium vivax malaria occur in low-endemicity areas of Vale do Ribeira in the south-eastern part of the Atlantic Forest, state of São Paulo, where Anopheles cruzii and Anopheles bellator are considered the primary vectors. However, other species in the subgenus Nyssorhynchus of Anopheles (e.g., Anopheles marajoara) are abundant and may participate in the dynamics of malarial transmission in that region. The objectives of the present study were to assess the spatial distribution of An. cruzii, An. bellator and An. marajoara and to associate the presence of these species with malaria cases in the municipalities of the Vale do Ribeira. Potential habitat suitability modelling was applied to determine both the spatial distribution of An. cruzii, An. bellator and An. marajoara and to establish the density of each species. Poisson regression was utilized to associate malaria cases with estimated vector densities. As a result, An. cruzii was correlated with the forested slopes of the Serra do Mar, An. bellator with the coastal plain and An. marajoara with the deforested areas. Moreover, both An. marajoara and An. cruzii were positively associated with malaria cases. Considering that An. marajoara was demonstrated to be a primary vector of human Plasmodium in the rural areas of the state of Amapá, more attention should be given to the species in the deforested areas of the Atlantic Forest, where it might be a secondary vector.
Resumo:
We consider the problem of interaction neighborhood estimation from the partial observation of a finite number of realizations of a random field. We introduce a model selection rule to choose estimators of conditional probabilities among natural candidates. Our main result is an oracle inequality satisfied by the resulting estimator. We use then this selection rule in a two-step procedure to evaluate the interacting neighborhoods. The selection rule selects a small prior set of possible interacting points and a cutting step remove from this prior set the irrelevant points. We also prove that the Ising models satisfy the assumptions of the main theorems, without restrictions on the temperature, on the structure of the interacting graph or on the range of the interactions. It provides therefore a large class of applications for our results. We give a computationally efficient procedure in these models. We finally show the practical efficiency of our approach in a simulation study.
Resumo:
To test whether plant species influence greenhouse gas production in diverse ecosystems, we measured wet season soil CO(2) and N(2)O fluxes close to similar to 300 large (>35 cm in diameter at breast height (DBH)) trees of 15 species at three clay-rich forest sites in central Amazonia. We found that soil CO(2) fluxes were 38% higher near large trees than at control sites >10 m away from any tree (P < 0.0001). After adjusting for large tree presence, a multiple linear regression of soil temperature, bulk density, and liana DBH explained 19% of remaining CO(2) flux variability. Soil N(2)O fluxes adjacent to Caryocar villosum, Lecythis lurida, Schefflera morototoni, and Manilkara huberi were 84%-196% greater than Erisma uncinatum and Vochysia maxima, both Vochysiaceae. Tree species identity was the most important explanatory factor for N(2)O fluxes, accounting for more than twice the N(2)O flux variability as all other factors combined. Two observations suggest a mechanism for this finding: (1) sugar addition increased N(2)O fluxes near C. villosum twice as much (P < 0.05) as near Vochysiaceae and (2) species mean N(2)O fluxes were strongly negatively correlated with tree growth rate (P = 0.002). These observations imply that through enhanced belowground carbon allocation liana and tree species can stimulate soil CO(2) and N(2)O fluxes (by enhancing denitrification when carbon limits microbial metabolism). Alternatively, low N(2)O fluxes potentially result from strong competition of tree species with microbes for nutrients. Species-specific patterns in CO(2) and N(2)O fluxes demonstrate that plant species can influence soil biogeochemical processes in a diverse tropical forest.
Resumo:
Using data from a logging experiment in the eastern Brazilian Amazon region, we develop a matrix growth and yield model that captures the dynamic effects of harvest system choice on forest structure and composition. Multinomial logistic regression is used to estimate the growth transition parameters for a 10-year time step, while a Poisson regression model is used to estimate recruitment parameters. The model is designed to be easily integrated with an economic model of decisionmaking to perform tropical forest policy analysis. The model is used to compare the long-run structure and composition of a stand arising from the choice of implementing either conventional logging techniques or more carefully planned and executed reduced-impact logging (RIL) techniques, contrasted against a baseline projection of an unlogged forest. Results from log and leave scenarios show that a stand logged according to Brazilian management requirements will require well over 120 years to recover its initial commercial volume, regardless of logging technique employed. Implementing RIL, however, accelerates this recovery. Scenarios imposing a 40-year cutting cycle raise the possibility of sustainable harvest volumes, although at significantly lower levels than is implied by current regulations. Meeting current Brazilian forest policy goals may require an increase in the planned total area of permanent production forest or the widespread adoption of silvicultural practices that increase stand recovery and volume accumulation rates after RIL harvests. Published by Elsevier B.V.
Resumo:
A significant problem in the collection of responses to potentially sensitive questions, such as relating to illegal, immoral or embarrassing activities, is non-sampling error due to refusal to respond or false responses. Eichhorn & Hayre (1983) suggested the use of scrambled responses to reduce this form of bias. This paper considers a linear regression model in which the dependent variable is unobserved but for which the sum or product with a scrambling random variable of known distribution, is known. The performance of two likelihood-based estimators is investigated, namely of a Bayesian estimator achieved through a Markov chain Monte Carlo (MCMC) sampling scheme, and a classical maximum-likelihood estimator. These two estimators and an estimator suggested by Singh, Joarder & King (1996) are compared. Monte Carlo results show that the Bayesian estimator outperforms the classical estimators in almost all cases, and the relative performance of the Bayesian estimator improves as the responses become more scrambled.
Resumo:
The majority of past and current individual-tree growth modelling methodologies have failed to characterise and incorporate structured stochastic components. Rather, they have relied on deterministic predictions or have added an unstructured random component to predictions. In particular, spatial stochastic structure has been neglected, despite being present in most applications of individual-tree growth models. Spatial stochastic structure (also called spatial dependence or spatial autocorrelation) eventuates when spatial influences such as competition and micro-site effects are not fully captured in models. Temporal stochastic structure (also called temporal dependence or temporal autocorrelation) eventuates when a sequence of measurements is taken on an individual-tree over time, and variables explaining temporal variation in these measurements are not included in the model. Nested stochastic structure eventuates when measurements are combined across sampling units and differences among the sampling units are not fully captured in the model. This review examines spatial, temporal, and nested stochastic structure and instances where each has been characterised in the forest biometry and statistical literature. Methodologies for incorporating stochastic structure in growth model estimation and prediction are described. Benefits from incorporation of stochastic structure include valid statistical inference, improved estimation efficiency, and more realistic and theoretically sound predictions. It is proposed in this review that individual-tree modelling methodologies need to characterise and include structured stochasticity. Possibilities for future research are discussed. (C) 2001 Elsevier Science B.V. All rights reserved.
Resumo:
This paper addresses the investment decisions considering the presence of financial constraints of 373 large Brazilian firms from 1997 to 2004, using panel data. A Bayesian econometric model was used considering ridge regression for multicollinearity problems among the variables in the model. Prior distributions are assumed for the parameters, classifying the model into random or fixed effects. We used a Bayesian approach to estimate the parameters, considering normal and Student t distributions for the error and assumed that the initial values for the lagged dependent variable are not fixed, but generated by a random process. The recursive predictive density criterion was used for model comparisons. Twenty models were tested and the results indicated that multicollinearity does influence the value of the estimated parameters. Controlling for capital intensity, financial constraints are found to be more important for capital-intensive firms, probably due to their lower profitability indexes, higher fixed costs and higher degree of property diversification.
Resumo:
A two-component survival mixture model is proposed to analyse a set of ischaemic stroke-specific mortality data. The survival experience of stroke patients after index stroke may be described by a subpopulation of patients in the acute condition and another subpopulation of patients in the chronic phase. To adjust for the inherent correlation of observations due to random hospital effects, a mixture model of two survival functions with random effects is formulated. Assuming a Weibull hazard in both components, an EM algorithm is developed for the estimation of fixed effect parameters and variance components. A simulation study is conducted to assess the performance of the two-component survival mixture model estimators. Simulation results confirm the applicability of the proposed model in a small sample setting. Copyright (C) 2004 John Wiley Sons, Ltd.
Resumo:
1. Chrysophtharta bimaculata is a native chrysomelid species that can cause chronic defoliation of plantation and regrowth Eucalyptus forests in Tasmania, Australia. Knowledge of the dispersion pattern of C. bimaculata was needed in order to assess the efficiency of an integrated pest management (IPM) programme currently used for its control. 2. Using data from yellow flight traps, local populations of C. bimaculata adults were monitored over a season at spatial scales relevant to commercial forestry: within a 50-ha operational management unit (a forestry 'coupe') and between coupes. In addition, oviposition was monitored over a season at a subset of the between-coupe sites. 3. Dispersion indices (Taylor's Power Law and Iwao's Mean Crowding regression method) demonstrated that C. bimaculata adults were spatially aggregated within and between coupes, although the number of egg-batches laid at the between-coupe scale was uniform. Spatial autocorrelation analysis showed that trap-catches at the within-coupe level were similar (positively autocorrelated) to a radius distance of approximately 110 m, and then dissimilar (negatively autocorrelated) at approximately 250 m. At the between-coupe scale, no repeatable spatial autocorrelation patterns were observed. 4. For any individual site, rapid changes in beetle density were observed to be associated with loosely aggregated flights of beetles into and out of that site. Peak adult catches (> the weekly mean plus standard deviation trap-catch) for a site occurred for a period of 2.0 +/- 0.22 weeks at a time (n = 37), with normally only one or two peaks per site per season. Peak oviposition events for a site occurred on average 1.4 +/- 0.11 times per season and lasted 1.5 +/- 0.12 weeks. 5. Analysis of an extensive data set (n = 417) demonstrated that adult abundance at a site was positively correlated with egg density, but negatively correlated with tree damage (caused by conspecifics) and the presence of conspecific larvae. There was no relationship between adult abundance and a visual estimate of the amount of young foliage on trees. 6. Adults of C. bimaculata are show n to occur in relatively small, mobile aggregations. This means that pest surveys must be both regular (less than 2 weeks apart) and intensive (with sampling points no more than 150 m apart) if beetle populations are to be monitored with confidence. Further refinement of the current IPM strategy must recognize the problems posed by this temporal and spatial patchiness, particularly with regard to the use of biological insecticides, such as Bacillus thuringiensis, for which only a very short operational window exists.
Resumo:
A mixture model incorporating long-term survivors has been adopted in the field of biostatistics where some individuals may never experience the failure event under study. The surviving fractions may be considered as cured. In most applications, the survival times are assumed to be independent. However, when the survival data are obtained from a multi-centre clinical trial, it is conceived that the environ mental conditions and facilities shared within clinic affects the proportion cured as well as the failure risk for the uncured individuals. It necessitates a long-term survivor mixture model with random effects. In this paper, the long-term survivor mixture model is extended for the analysis of multivariate failure time data using the generalized linear mixed model (GLMM) approach. The proposed model is applied to analyse a numerical data set from a multi-centre clinical trial of carcinoma as an illustration. Some simulation experiments are performed to assess the applicability of the model based on the average biases of the estimates formed. Copyright (C) 2001 John Wiley & Sons, Ltd.
Resumo:
In this paper, we consider testing for additivity in a class of nonparametric stochastic regression models. Two test statistics are constructed and their asymptotic distributions are established. We also conduct a small sample study for one of the test statistics through a simulated example. (C) 2002 Elsevier Science (USA).
Resumo:
The composition of an open-forest lizard assemblage in eastern Australia was examined before and after a low-intensity controlled fire and concurrently compared with that in an adjoining unburnt area. The effect of fire on the available structural environment and the habitat used by two focal species, Carlia vivax and Lygisaurus foliorum, was also examined. Lizard species richness was unaffected by the controlled burn as was the abundance of most species. C. vivax was the only species to display a significant reduction in abundance after fire. While the low-intensity fire resulted in significant changes to the available structural environment, there were no compensatory shifts in the habitat preferences of either C. vivax or L. foliorum. The reduction in abundance of C. vivax was congruent with this species' avoidance of burnt areas. C. vivax displayed a non-random preference for ground cover and litter cover, which were reduced in burnt areas. Changes in the availability of preferred structural habitat features are likely to contribute to changes in the abundance of some lizard species. Therefore, even low-intensity disturbances can have an impact on lizard assemblages if critical habitat features are lost or become limiting.