212 results for random forest regression
in Scielo Saúde Pública - SP
Methodology based on data mining techniques to support the certification of sheep breeds
Abstract:
The objective of this work was to develop a methodology based on data mining techniques to select the main SNP (Single Nucleotide Polymorphism) markers for the sheep breeds Crioula, Morada Nova, and Santa Inês. The data were obtained from the International Sheep Consortium and comprise 72 animals of these breeds, each with 49,034 SNP markers. Because the number of attributes (markers) is much larger than the number of observations (animals), the prediction techniques LASSO (Least Absolute Shrinkage and Selection Operator), Random Forest, and Boosting were applied to generate predictive models that incorporate attribute selection methods. The results showed that the predictive models selected the main SNP markers for identifying the breeds studied. The LASSO model selected a total of 29 relevant markers. The Random Forest and Boosting models yielded 27 and 20 important markers, respectively. Intersecting the generated models identified a subset of 18 markers with the greatest potential for identifying the breeds.
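The final step described above, intersecting the marker sets chosen by each model, can be sketched in a few lines of Python. This is only an illustration: the marker IDs and the `consensus_markers` helper are hypothetical, since the abstract does not list the actual SNPs or the software used.

```python
# Hypothetical marker IDs; the abstract does not name the selected SNPs.
lasso_markers = {"snp_001", "snp_002", "snp_003", "snp_004"}
rf_markers = {"snp_002", "snp_003", "snp_004", "snp_005"}
boosting_markers = {"snp_003", "snp_004", "snp_006"}


def consensus_markers(*selections):
    """Return the markers selected by every model, sorted for stable output."""
    if not selections:
        return []
    common = set(selections[0])
    for selection in selections[1:]:
        common &= set(selection)  # keep only markers present in all sets so far
    return sorted(common)


consensus = consensus_markers(lasso_markers, rf_markers, boosting_markers)
print(consensus)
```

With real model output, each set would hold the markers with non-zero LASSO coefficients or the top-ranked importance scores from the tree-based models.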
Abstract:
INTRODUCTION: Malaria is a serious problem in the Brazilian Amazon region, and the detection of possible risk factors could be of great interest for public health authorities. The objective of this article was to investigate the association between environmental variables and the yearly registers of malaria in the Amazon region using Bayesian spatiotemporal methods. METHODS: We used Poisson spatiotemporal regression models to analyze the Brazilian Amazon forest malaria count for the period from 1999 to 2008. In this study, we included some covariates that could be important in the yearly prediction of malaria, such as deforestation rate. We obtained the inferences using a Bayesian approach and Markov Chain Monte Carlo (MCMC) methods to simulate samples for the joint posterior distribution of interest. The discrimination of different models was also discussed. RESULTS: The model proposed here suggests that deforestation rate, the number of inhabitants per km², and the human development index (HDI) are important in the prediction of malaria cases. CONCLUSIONS: It is possible to conclude that human development, population growth, deforestation, and their associated ecological alterations are conducive to increasing malaria risk. We conclude that the use of Poisson regression models that capture the spatial and temporal effects under the Bayesian paradigm is a good strategy for modeling malaria counts.
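The abstract does not state the model specification. As a rough guide, a Poisson spatiotemporal regression of this kind is often written in the generic log-linear form below. The symbols are assumptions introduced for illustration: $y_{it}$ is the malaria count in area $i$ and year $t$, $x_{kit}$ are covariates such as deforestation rate, inhabitants per km², and HDI, and $u_i$ and $v_t$ are spatial and temporal random effects.

```latex
\begin{align}
y_{it} \mid \mu_{it} &\sim \mathrm{Poisson}(\mu_{it}), \\
\log \mu_{it} &= \beta_0 + \sum_{k=1}^{K} \beta_k x_{kit} + u_i + v_t .
\end{align}
```

Under the Bayesian approach described, priors would be placed on the $\beta_k$, $u_i$, and $v_t$, and MCMC would draw samples from their joint posterior distribution.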
Abstract:
The objective of this work was to compare random regression models for the estimation of genetic parameters for Guzerat milk production, using orthogonal Legendre polynomials. Records (20,524) of test-day milk yield (TDMY) from 2,816 first-lactation Guzerat cows were used. TDMY grouped into 10 monthly classes were analyzed for additive genetic effect and for permanent environmental and residual effects (random effects), whereas the contemporary group, calving age (linear and quadratic effects) and mean lactation curve were analyzed as fixed effects. Trajectories for the additive genetic and permanent environmental effects were modeled by means of a covariance function employing orthogonal Legendre polynomials ranging from the second to the fifth order. Residual variances were considered in one, four, six, or ten variance classes. The best model had six residual variance classes. The heritability estimates for the TDMY records varied from 0.19 to 0.32. The random regression model that used a second-order Legendre polynomial for the additive genetic effect and a fifth-order polynomial for the permanent environmental effect was adequate according to the main comparison criteria employed. The model with a second-order Legendre polynomial for the additive genetic effect and a fourth-order polynomial for the permanent environmental effect could also be employed in these analyses.
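To make the Legendre covariates concrete, here is a minimal Python sketch that evaluates normalized Legendre polynomials at a standardized time point. It rests on assumptions not given in the abstract: `order` is taken as the polynomial degree (some animal-breeding papers use "order" for the number of coefficients), and the 5 to 305 days-in-milk range is a hypothetical example.

```python
import math


def standardize(t, t_min, t_max):
    """Map a time point t in [t_min, t_max] onto the interval [-1, 1]."""
    return -1.0 + 2.0 * (t - t_min) / (t_max - t_min)


def legendre_basis(x, order):
    """Normalized Legendre polynomials phi_0..phi_order evaluated at x in [-1, 1].

    Uses Bonnet's recursion (n + 1) P_{n+1} = (2n + 1) x P_n - n P_{n-1}
    and the normalization phi_n = sqrt((2n + 1) / 2) * P_n.
    """
    p = [1.0, x]
    for n in range(1, order):
        p.append(((2 * n + 1) * x * p[n] - n * p[n - 1]) / (n + 1))
    return [math.sqrt((2 * n + 1) / 2.0) * p[n] for n in range(order + 1)]


# Hypothetical lactation window of 5 to 305 days in milk (an assumption).
x_mid = standardize(155, 5, 305)
print(legendre_basis(x_mid, 2))
```

In a random regression model, each row of the design matrix for an animal's test-day record would hold these basis values for that record's day in milk.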
Abstract:
Few studies have been conducted to verify how the structure of the forest affects the occurrence and abundance of neotropical birds. Our research was undertaken between January 2002 and July 2004 at the Reserva Ducke, near Manaus (02º55',03º01'S; 59º53',59º59'W) in central Amazonia, to verify how the forest structure affects the occurrence and abundance of two bird species: the Plain-brown Woodcreeper Dendrocincla fuliginosa and the White-chinned Woodcreeper Dendrocincla merula. Bird species occurrence was recorded using lines of 20 mist-nets (one sample unit), along 51 1-km transects distributed along 9 parallel 8 km trails covering an area of 6400 ha. Along these transects, we placed 50 x 50 m plots where we recorded forest structure components (tree abundance, canopy openness, leaf litter, standing dead trees, logs, proximity to streams, and altitude). We then related these variables to bird occurrence and abundance using multiple logistic and multiple linear regression models, respectively. We found that D. fuliginosa frequently used plateau areas and was more abundant in areas with more trees. On the other hand, D. merula occurred more frequently and was more abundant in areas with low tree abundance. Our results suggest that although both species overlap in the reserve (both were recorded in at least 68% of the sampled sites), they differ in the way they use the forest microhabitats. Therefore, local variation in the forest structure may contribute to the coexistence of congeneric species and may help to maintain local alpha diversity.
Abstract:
This study analyzed the influence of forest structural components on the occurrence, size and density of groups of Bare-face Tamarin (Saguinus bicolor) - the most threatened species in the Amazon - and produced the first large-scale map of the distribution of groups within an area of continuous forest. Population censuses were conducted between November 2002 and July 2003, covering 6400 hectares in the Ducke Reserve, Manaus-AM, Brazil. Groups of S. bicolor were recorded 41 times, distributed among the environments as follows: plateau (20); slopes (12); and lowlands (9). The mean group size was 4.8 individuals per group, ranging from 2 to 11 individuals. In the sites where the groups were recorded, and in an equivalent number of sites where no tamarins were found, located at least 500 m from those where they had been recorded, we placed 50 m x 50 m plots to record the following forest structural components: abundance of trees; abundance of lianas; abundance of fruiting trees and lianas; abundance of snags; abundance of logs; percentage of canopy opening; leaf litter depth; and altitude. Bare-face Tamarin more often uses areas with a lower abundance of logs, a smaller canopy opening and a higher abundance of snags; areas of the forest with a smaller canopy opening have a higher density of S. bicolor groups. Apparently this species does not use the forest in a random way, and may select areas for its daily activities depending on the micro-environmental heterogeneity produced by the forest structural components.
Abstract:
Species distribution modeling has relevant implications for the studies of biodiversity, decision making about conservation and knowledge about ecological requirements of the species. The aim of this study was to evaluate whether the use of forest inventories can improve the estimation of occurrence probability and identify the limits of the potential distribution and habitat preference of a group of timber tree species. The environmental predictor variables were: elevation, slope, aspect, normalized difference vegetation index (NDVI) and height above the nearest drainage (HAND). To estimate the distribution of species we used the maximum entropy method (Maxent). In comparison with a random distribution, using topographic variables and vegetation index as features, the Maxent method predicted the geographical distribution of the studied species with an average accuracy of 86%. Altitude and NDVI were the most important variables. The models had limitations when interpolated to non-sampled locations that lie outside the elevation gradient associated with the occurrence data, which account for approximately 7% of the basin area. Ceiba pentandra (samaúma), Castilla ulei (caucho) and Hura crepitans (assacu) are more likely to occur in areas near watercourses. Clarisia racemosa (guariúba), Amburana acreana (cerejeira), Aspidosperma macrocarpon (pereiro), Apuleia leiocarpa (cumaru cetim), Aspidosperma parvifolium (amarelão) and Astronium lecointei (aroeira) can also occur in upland forest and well drained soils. This modeling approach has potential for application to other tropical species that remain less studied, especially those that are under pressure from logging.
Abstract:
Every year, autochthonous cases of Plasmodium vivax malaria occur in low-endemicity areas of Vale do Ribeira in the south-eastern part of the Atlantic Forest, state of São Paulo, where Anopheles cruzii and Anopheles bellator are considered the primary vectors. However, other species in the subgenus Nyssorhynchus of Anopheles (e.g., Anopheles marajoara) are abundant and may participate in the dynamics of malarial transmission in that region. The objectives of the present study were to assess the spatial distribution of An. cruzii, An. bellator and An. marajoara and to associate the presence of these species with malaria cases in the municipalities of the Vale do Ribeira. Potential habitat suitability modelling was applied both to determine the spatial distribution of An. cruzii, An. bellator and An. marajoara and to establish the density of each species. Poisson regression was used to associate malaria cases with estimated vector densities. As a result, An. cruzii was correlated with the forested slopes of the Serra do Mar, An. bellator with the coastal plain and An. marajoara with the deforested areas. Moreover, both An. marajoara and An. cruzii were positively associated with malaria cases. Considering that An. marajoara was demonstrated to be a primary vector of human Plasmodium in the rural areas of the state of Amapá, more attention should be given to the species in the deforested areas of the Atlantic Forest, where it might be a secondary vector.
Abstract:
Polistine wasps are important in Neotropical ecosystems due to their ubiquity and diversity. Inventories have not adequately considered spatial attributes of collected specimens. Spatial data on biodiversity are important for the study and mitigation of anthropogenic impacts on natural ecosystems and for protecting species. We described and analyzed local-scale spatial patterns of collecting records of wasp species, as well as spatial variation of diversity descriptors in a 2500-hectare area of an Amazon forest in Brazil. Rare species comprised the largest fraction of the fauna. Close range spatial effects were detected for most of the more common species, with clustering of presence-data at short distances. Larger spatial lag effects could also be identified in some species, probably constituting cases of exogenous autocorrelation and candidates for explanations based on environmental factors. In a few cases, significant or near significant correlations were found between five species (of Agelaia, Angiopolybia, and Mischocyttarus) and three studied environmental variables: distance to nearest stream, terrain altitude, and the type of forest canopy. However, associations between these factors and biodiversity variables were generally low. When they were used as predictors of polistine richness in a multiple linear regression, only the coefficient for the forest canopy variable was significant. Some level of prediction of wasp diversity variables can be attained based on environmental variables, especially vegetation structure. Large-scale landscape and regional studies should be scheduled to address this issue.
Abstract:
Litter fall consists of all organic material deposited on the forest floor and is extremely important for the structure and maintenance of the ecosystem through nutrient cycling. This study aimed to evaluate the production and decomposition of litter fall in a fragment of secondary Atlantic Forest at the Guarapiranga Ecological Park, in São Paulo, SP. The litter samples were taken monthly from May 2012 to May 2013. To assess the contribution of litter fall, forty collectors were installed randomly within an area of 0.5 ha. The collected material was sent to the laboratory to be dried at 65 °C for 72 hours, subsequently separated into fractions of leaves, twigs, reproductive parts and miscellaneous, and weighed to obtain the dry biomass. Litterbags were placed and tied close to the collectors to estimate the decomposition rate by evaluating the loss of dry biomass at 30, 60, 90, 120 and 150 days. After collection, the material was sent to the laboratory to be dried and weighed again. Total litter fall throughout the year reached 5.7 Mg ha⁻¹ yr⁻¹, and most of the material was collected from September to March. Leaves contributed most to total litter fall (72%), followed by twigs (14%), reproductive parts (11%) and miscellaneous (3%). Reproductive parts peaked during the wet season. A positive correlation was observed between total litter and precipitation, temperature and radiation (r = 0.66, p<0.05; r = 0.76, p<0.05; r = 0.58, p<0.05, respectively). The multiple regression showed that precipitation and radiation contributed significantly to litter fall production. The decomposition rate was within the interval expected for secondary tropical forest and was correlated with rainfall.
It was concluded that this fragment of secondary forest showed a seasonality effect driven mainly by precipitation and radiation, both important components of foliage renewal for the plant community, and that decomposition proceeded at an intermediate rate.
Abstract:
This study aimed to evaluate the natural durability of Eucalyptus dunnii, Eucalyptus robusta, Eucalyptus tereticornis and Hovenia dulcis woods subjected to a deterioration test in two environments, field and forest. The test samples were buried to half of their length (150 mm). Evaluations were carried out every 45 days over a 405-day period, with three replicates of each species withdrawn per environment, totaling nine samples from each environment and making up 24 test samples for evaluation. After the percentage of mass loss was calculated and the degree of resistance classified, the deterioration index was adopted to evaluate decomposition and determine the fungal decay potential of the test samples. The study was carried out in a completely randomized design (CRD) and evaluated through analysis of variance (ANOVA) with subsequent comparison of means by Tukey's test at the 5% probability level of error, along with regression analysis. Eucalyptus tereticornis wood presented lower mass loss in both environments. Hovenia dulcis presented a lower deterioration probability in both environments. Test samples from the forest environment presented greater mass loss percentages and a lower deterioration index.
Abstract:
The objectives of this study were to identify anthophilous butterflies on psychophilous flowers of four Asteraceae species in an Atlantic Forest fragment in Viçosa, Minas Gerais State, Southeastern Brazil, and to determine whether there are species in common with other lepidopteran inventories of the Southeastern and Midwestern regions of Brazil. It is the first inventory of anthophilous butterflies of a semideciduous forest fragment in Zona da Mata, State of Minas Gerais. A total of 108 species were recorded, representing the fourth largest lepidopteran survey in this State. The results demonstrated that Asteraceae species may be important tools for monitoring anthophilous butterflies. The similarity with other inventories ranged from 1 to 92.55%. Fifteen species were reported for the first time in the State of Minas Gerais, and among them, Melanis alena and Thisbe irenea were observed in this study only.
Abstract:
The objectives of this work were to verify the accuracy of the Genome-Wide Selection (GWS) method in maize breeding under nutritional stress conditions and to propose new breeding methods based on GWS. The two components of nitrogen and phosphorus use efficiency (uptake efficiency and utilization efficiency) were estimated in 41 hybrid combinations, in two experiments, under low and high N and P availability. For genotyping the estimation population, 80 microsatellite markers were used. Estimates of the genetic parameters were obtained via REML/BLUP, and genomic breeding values were predicted via random regression (RR) applied to genome-wide selection (RR-BLUP/GWS). For the traits in which GWS showed high accuracy values, it was compared with the Intra- and Interpopulation Recurrent Selection methods. The use of GWS led to a significant increase in selective accuracy and in genetic gains per unit of time.
Abstract:
Survival analysis is applied when the time until the occurrence of an event is of interest. Such data are routinely collected in plant disease studies, although applications of the method are uncommon. The objective of this study was to use two studies on post-harvest diseases of peaches, considering two harvests together and the existence of a random effect shared by fruits from the same tree, in order to describe the main techniques in survival analysis. The nonparametric Kaplan-Meier method, the log-rank test and the semi-parametric Cox's proportional hazards model were used to estimate the effect of cultivars and the number of days after full bloom on the survival to the brown rot symptom and the instantaneous risk of expressing it in two consecutive harvests. The joint analysis with a baseline effect varying between harvests, and the confirmation of the tree effect as a grouping factor with random effect, were appropriate to interpret the phenomenon (disease) evaluated and can be important tools to replace or complement the conventional analysis, respecting the nature of the variable and the phenomenon.
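As an illustration of the first technique named above, the Kaplan-Meier product-limit estimator can be written in plain Python. This is a minimal sketch under stated assumptions: the `kaplan_meier` function and the five observations are hypothetical and are not taken from the peach studies, and the shared tree effect is not modeled.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where an event occurred.

    times:  observed time for each unit (e.g. days until symptom or censoring)
    events: 1 if the event was observed at that time, 0 if the unit was censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        observed = 0  # events at time t
        removed = 0   # events plus censored units leaving the risk set at t
        while i < len(data) and data[i][0] == t:
            observed += data[i][1]
            removed += 1
            i += 1
        if observed > 0:
            survival *= 1.0 - observed / at_risk
            curve.append((t, survival))
        at_risk -= removed
    return curve


# Hypothetical data: days until brown rot symptoms; 0 marks a censored fruit.
days = [2, 3, 3, 5, 6]
symptom = [1, 1, 0, 1, 0]
for t, s in kaplan_meier(days, symptom):
    print(t, round(s, 3))
```

Units censored at the same time as an event are still counted in the risk set for that event, which is the usual convention.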
Abstract:
The objective of this work was to evaluate the dynamics of the decomposition process in a chopped secondary forest system previously enriched with the legumes Inga velutina Willd. and Stryphnodendron pulcherrimum (Willd.) Hochr., and the contribution of this process to the nutrient input for the cultivation of corn and bean under no-tillage. The experimental design was a randomized block, split plot with four replications. The plots were the two species (I. velutina and S. pulcherrimum) and the subplots were seven evaluation times (0, 7, 28, 63, 189, 252, 294 days after experiment installation). There was no difference (p ≥ 0.05) between the enriched secondary forest systems and no interaction with time for residue biomass, decomposition constant and half-life. The residue of S. pulcherrimum trees had a higher (p < 0.05) C/N ratio than that of I. velutina. However, the latter was higher (p < 0.05) in lignin content. Nevertheless, the dynamics of residue decomposition were similar. The corn yield was higher (p < 0.05) when cultivated under I. velutina residue. Meanwhile, beans planted after corn showed similar (p > 0.05) yield in both areas, regardless of the residue origin.
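The abstract reports a decomposition constant and a half-life without saying how they were computed. A common choice, assumed here purely for illustration, is the single exponential decay model X_t = X_0 · exp(-k·t), which gives a half-life of ln(2)/k. The function names and the mass values below are hypothetical.

```python
import math


def decomposition_constant(initial_mass, remaining_mass, days):
    """Decay constant k (per day) from X_t = X_0 * exp(-k * t).

    Assumes a single exponential model; the abstract does not name the model.
    """
    return -math.log(remaining_mass / initial_mass) / days


def half_life(k):
    """Time (days) for the residue mass to fall to half its initial value."""
    return math.log(2) / k


# Hypothetical example: 100 g of residue reduced to 50 g after 294 days.
k = decomposition_constant(100.0, 50.0, 294)
print(k, half_life(k))
```

With repeated weighings, as in the seven evaluation times described, k would more typically be estimated by regressing the logarithm of remaining mass on time rather than from a single pair of values.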
Abstract:
This paper analyses the commercial and socio-demographic antecedents of the importance of price in buyers' decisions. The study uses ordinal regression in order to analyze the data obtained from a random sample of consumers of frequently purchased products; these consumers were surveyed in different stores. The results demonstrate that shopping enjoyment and brand loyalty have an influence over the importance of price. However, responsibility for shopping (purchase frequency) does not show a significant relationship. Furthermore, some interesting socio-demographic characteristics were found in the context of the study that can be analyzed in future research.