67 resultados para multiple linear regression models
em Scielo Saúde Pública - SP
Resumo:
Statistical models allow the representation of data sets and the estimation and/or prediction of the behavior of a given variable through its interaction with the other variables involved in a phenomenon. Among other different statistical models, are the autoregressive state-space models (ARSS) and the linear regression models (LR), which allow the quantification of the relationships among soil-plant-atmosphere system variables. To compare the quality of the ARSS and LR models for the modeling of the relationships between soybean yield and soil physical properties, Akaike's Information Criterion, which provides a coefficient for the selection of the best model, was used in this study. The data sets were sampled in a Rhodic Acrudox soil, along a spatial transect with 84 points spaced 3 m apart. At each sampling point, soybean samples were collected for yield quantification. At the same site, soil penetration resistance was also measured and soil samples were collected to measure soil bulk density in the 0-0.10 m and 0.10-0.20 m layers. Results showed autocorrelation and a cross correlation structure of soybean yield and soil penetration resistance data. Soil bulk density data, however, were only autocorrelated in the 0-0.10 m layer and not cross correlated with soybean yield. The results showed the higher efficiency of the autoregressive space-state models in relation to the equivalent simple and multiple linear regression models using Akaike's Information Criterion. The resulting values were comparatively lower than the values obtained by the regression models, for all combinations of explanatory variables.
Resumo:
Is it possible to build predictive models (PMs) of soil particle-size distribution (psd) in a region with complex geology and a young and unstable land-surface? The main objective of this study was to answer this question. A set of 339 soil samples from a small slope catchment in Southern Brazil was used to build PMs of psd in the surface soil layer. Multiple linear regression models were constructed using terrain attributes (elevation, slope, catchment area, convergence index, and topographic wetness index). The PMs explained more than half of the data variance. This performance is similar to (or even better than) that of the conventional soil mapping approach. For some size fractions, the PM performance can reach 70 %. Largest uncertainties were observed in geologically more complex areas. Therefore, significant improvements in the predictions can only be achieved if accurate geological data is made available. Meanwhile, PMs built on terrain attributes are efficient in predicting the particle-size distribution (psd) of soils in regions of complex geology.
Resumo:
OBJECTIVE: To assess the association between health-related behaviors and quality of life among the elderly. METHODS: A population-based cross-sectional study was carried out including 1,958 elderly living in four areas in the state of São Paulo, southeastern Brazil, 2001/2002. Quality of life was assessed using the Medical Outcomes Study SF-36-Item Short Form Health Survey instrument. This instrument's eight subscales and two components were the dependent variables. Independent variables were physical activity, weekly frequency of alcohol consumption and smoking. Multiple linear regression models were used to control for the effect of gender, age, schooling, work, area of residence and number of chronic conditions. RESULTS: Physical activity was positively associated with the eight SF-36 subscales. The stronger associations were found for role-physical (β=11.9), physical functioning (β=11.3) and physical component. Elderly individuals who consumed alcohol at least once a week showed a better quality of life than those did not consume alcohol. Compared to non-smokers, smokers had a poorer quality of life for the mental component (β=-2.4). CONCLUSIONS: The study results showed that physical activity, moderate alcohol consumption and no smoking are positively associated with a better quality of life in the elderly.
Resumo:
OBJECTIVE To analyze the spatial distribution of risk for tuberculosis and its socioeconomic determinants in the city of Rio de Janeiro, Brazil.METHODS An ecological study on the association between the mean incidence rate of tuberculosis from 2004 to 2006 and socioeconomic indicators of the Censo Demográfico (Demographic Census) of 2000. The unit of analysis was the home district registered in the Sistema de Informação de Agravos de Notificação (Notifiable Diseases Information System) of Rio de Janeiro, Southeastern Brazil. The rates were standardized by sex and age group, and smoothed by the empirical Bayes method. Spatial autocorrelation was evaluated by Moran’s I. Multiple linear regression models were studied and the appropriateness of incorporating the spatial component in modeling was evaluated.RESULTS We observed a higher risk of the disease in some neighborhoods of the port and north regions, as well as a high incidence in the slums of Rocinha and Vidigal, in the south region, and Cidade de Deus, in the west. The final model identified a positive association for the variables: percentage of permanent private households in which the head of the house earns three to five minimum wages; percentage of individual residents in the neighborhood; and percentage of people living in homes with more than two people per bedroom.CONCLUSIONS The spatial analysis identified areas of risk of tuberculosis incidence in the neighborhoods of the city of Rio de Janeiro and also found spatial dependence for the incidence of tuberculosis and some socioeconomic variables. However, the inclusion of the space component in the final model was not required during the modeling process.
Resumo:
The prevalence of antibodies against Equine Influenza Virus (EIV) was determined in 529 equines living on ranches in the municipality of Poconé, Pantanal area of Brazil, by means of the hemagglutination inhibition test, using subtype H3N8 as antigen. The distribution and possible association among positive animal and ranches were evaluated by the chi-square test, spatial autoregressive and multiple linear regression models. The prevalence of antibodies against EIV was estimated at 45.2% (95% CI 30.2 - 61.1%) with titers ranging from 20 to 1,280 HAU. Seropositive equines were found on 92.0% of the surveyed ranches. Equine from non-flooded ranches (66.5%) and negativity in equine infectious anemia virus (EIAV) (61.7%) were associated with antibodies against EIV. No spatial correlation was found among the ranches, but the ones located in non-flooded areas were associated with antibodies against EIV. A negative correlation was found between the prevalence of antibodies against EIV and the presence of EIAV positive animals on the ranches. The high prevalence of antibodies against EIV detected in this study suggests that the virus is circulating among the animals, and this statistical analysis indicates that the movement and aggregation of animals are factors associated to the transmission of the virus in the region.
Resumo:
Few studies have been conducted to verify how the structure of the forest affects the occurence and abundance of neotropical birds. Our research was undertaken between January 2002 and July 2004 at the Reserva Ducke, near Manaus (02º55',03º01'S; 59º53',59º59'W) in central Amazonia, to verify how the forest structure affects the occurrence and abundance of two bird species: the Plain-brown Woodcreeper Dendrocincla fuliginosa and the White-chinned Woodcreeper Dendrocincla merula. Bird species occurrence was recorded using lines of 20 mist-nets (one sample unit), along 51 1-km transects distributed along 9 pararel 8 km trails covering an area of 6400 ha. Along these transects, we placed 50 x 50m plots where we recorded forest structure components (tree abundance, canopy openness, leaf litter, standing dead trees, logs, proximity to streams, and altitude). We then related these variables to bird occurence and abundance using multiple logistic and multiple linear regression models, respectively. We found that D. fuliginosa frequently used plateau areas; being more abundant in areas with more trees. On the other hand, D. merula occurred more frequently and was more abundant in areas with low tree abundance. Our results suggest that although both species overlap in the reserve (both were recorded in at least 68% of the sampled sites), they differ in the way they use the forest microhabitats. Therefore, local variation in the forest structure may contribute to the coexistence of congeneric species and may help to maintain local alpha diversity.
Resumo:
PURPOSE: To verify the predictors of intravasation rate during hysteroscopy.METHODS: Prospective observational study (Canadian Task Force classification II-1). All cases (n=200 women; 22 to 86 years old) were treated in an operating room setting. Considering respective bag overfill to calculate water balance, we tested two multiple linear regression models: one for total intravasation (mL) and the other for absorption rate (mL.min-1). The predictors tested (independent variables) were energy (mono/bipolar), tube patency (with/without tubal ligation), hysterometry (cm), age≤50 years, body surface area (m2), surgical complexity (with/without myomectomy) and duration (min).RESULTS: Mean intravasation was significantly higher when myomectomy was performed (442±616 versus 223±332 mL; p<0.01). In the proposed multiple linear regression models for total intravasation (adjusted R2=0.44; p<0.01), the only significant predictors were myomectomy and duration (p<0.01).In the proposed model for intravasation rate (R2=0.39; p<0.01), only myomectomy and hysterometry were significant predictors (p=0.02 and p<0.01, respectively).CONCLUSIONS: Not only myomectomy but also hysterometry were significant predictors of intravasation rate during operative hysteroscopy.
Resumo:
We studied the ability of patients not experienced in the use of peak expiratory flow meters to assess the severity of their asthma exacerbations and compared it to the assessment of experienced clinicians. We also evaluated which data of physical examination and medical history are used by physicians to subjectively evaluate the severity of asthma attacks. Fifty-seven adult patients (15 men and 42 women, with a mean (± SD) age of 37.3 ± 14.5 years and 24.0 ± 17.9 years of asthma symptoms) with asthma exacerbations were evaluated in a University Hospital Emergency Department. Patients and physicians independently evaluated the severity of the asthma attack using a linear scale. Patient score, physician score and forced expiratory volume at the first second (FEV1) were correlated with history and physical examination variables, and were also considered as dependent variables in multiple linear regression models. FEV1 correlated significantly with the physician score (rho = 0.42, P = 0.001), but not with patient score (rho = 0.03; P = 0.77). Use of neck accessory muscles, expiratory time and wheezing intensity were the explanatory variables in the FEV1 regression model and were also present in the physician score model. We conclude that physicians evaluate asthma exacerbation severity better than patients and that physician's scoring of asthma severity correlated significantly with objective measures of airway obstruction (FEV1). Some variables (the use of neck accessory muscles, expiratory time and wheezing intensity) persisted as explanatory variables in physician score and FEV1 regression models, and should be emphasized in medical schools and emergency settings.
Resumo:
The aim of this work is to establish a relationship between schistosomiasis prevalence and social-environmental variables, in the state of Minas Gerais, Brazil, through multiple linear regression. The final regression model was established, after a variables selection phase, with a set of spatial variables which contains the summer minimum temperature, human development index, and vegetation type variables. Based on this model, a schistosomiasis risk map was built for Minas Gerais.
Resumo:
Soil properties have an enormous impact on economic and environmental aspects of agricultural production. Quantitative relationships between soil properties and the factors that influence their variability are the basis of digital soil mapping. The predictive models of soil properties evaluated in this work are statistical (multiple linear regression-MLR) and geostatistical (ordinary kriging and co-kriging). The study was conducted in the municipality of Bom Jardim, RJ, using a soil database with 208 sampling points. Predictive models were evaluated for sand, silt and clay fractions, pH in water and organic carbon at six depths according to the specifications of the consortium of digital soil mapping at the global level (GlobalSoilMap). Continuous covariates and categorical predictors were used and their contributions to the model assessed. Only the environmental covariates elevation, aspect, stream power index (SPI), soil wetness index (SWI), normalized difference vegetation index (NDVI), and b3/b2 band ratio were significantly correlated with soil properties. The predictive models had a mean coefficient of determination of 0.21. Best results were obtained with the geostatistical predictive models, where the highest coefficient of determination 0.43 was associated with sand properties between 60 to 100 cm deep. The use of a sparse data set of soil properties for digital mapping can explain only part of the spatial variation of these properties. The results may be related to the sampling density and the quantity and quality of the environmental covariates and predictive models used.
Resumo:
The objective of this work was to compare random regression models for the estimation of genetic parameters for Guzerat milk production, using orthogonal Legendre polynomials. Records (20,524) of test-day milk yield (TDMY) from 2,816 first-lactation Guzerat cows were used. TDMY grouped into 10-monthly classes were analyzed for additive genetic effect and for environmental and residual permanent effects (random effects), whereas the contemporary group, calving age (linear and quadratic effects) and mean lactation curve were analized as fixed effects. Trajectories for the additive genetic and permanent environmental effects were modeled by means of a covariance function employing orthogonal Legendre polynomials ranging from the second to the fifth order. Residual variances were considered in one, four, six, or ten variance classes. The best model had six residual variance classes. The heritability estimates for the TDMY records varied from 0.19 to 0.32. The random regression model that used a second-order Legendre polynomial for the additive genetic effect, and a fifth-order polynomial for the permanent environmental effect is adequate for comparison by the main employed criteria. The model with a second-order Legendre polynomial for the additive genetic effect, and that with a fourth-order for the permanent environmental effect could also be employed in these analyses.
Resumo:
INTRODUCTION: Malaria is a serious problem in the Brazilian Amazon region, and the detection of possible risk factors could be of great interest for public health authorities. The objective of this article was to investigate the association between environmental variables and the yearly registers of malaria in the Amazon region using Bayesian spatiotemporal methods. METHODS: We used Poisson spatiotemporal regression models to analyze the Brazilian Amazon forest malaria count for the period from 1999 to 2008. In this study, we included some covariates that could be important in the yearly prediction of malaria, such as deforestation rate. We obtained the inferences using a Bayesian approach and Markov Chain Monte Carlo (MCMC) methods to simulate samples for the joint posterior distribution of interest. The discrimination of different models was also discussed. RESULTS: The model proposed here suggests that deforestation rate, the number of inhabitants per km², and the human development index (HDI) are important in the prediction of malaria cases. CONCLUSIONS: It is possible to conclude that human development, population growth, deforestation, and their associated ecological alterations are conducive to increasing malaria risk. We conclude that the use of Poisson regression models that capture the spatial and temporal effects under the Bayesian paradigm is a good strategy for modeling malaria counts.
Resumo:
Geographical information systems (GIS) are tools that have been recently tested for improving our understanding of the spatial distribution of disease. The objective of this paper was to further develop the GIS technology to model and control schistosomiasis using environmental, social, biological and remote-sensing variables. A final regression model (R² = 0.39) was established, after a variable selection phase, with a set of spatial variables including the presence or absence of Biomphalaria glabrata, winter enhanced vegetation index, summer minimum temperature and percentage of houses with water coming from a spring or well. A regional model was also developed by splitting the state of Minas Gerais (MG) into four regions and establishing a linear regression model for each of the four regions: 1 (R² = 0.97), 2 (R² = 0.60), 3 (R² = 0.63) and 4 (R² = 0.76). Based on these models, a schistosomiasis risk map was built for MG. In this paper, geostatistics was also used to make inferences about the presence of Biomphalaria spp. The result was a map of species and risk areas. The obtained risk map permits the association of uncertainties, which can be used to qualify the inferences and it can be thought of as an auxiliary tool for public health strategies.
Resumo:
The present paper aims to bring under discussion some theoretical and practical aspects about the proposition, validation and analysis of QSAR models based on multiple linear regression. A comprehensive approach for the derivation of extrathermodynamic equations is reviewed. Some examples of QSAR models published in the literature are analyzed and criticized.
Resumo:
Genetic algorithm was used for variable selection in simultaneous determination of mixtures of glucose, maltose and fructose by mid infrared spectroscopy. Different models, using partial least squares (PLS) and multiple linear regression (MLR) with and without data pre-processing, were used. Based on the results obtained, it was verified that a simpler model (multiple linear regression with variable selection by genetic algorithm) produces results comparable to more complex methods (partial least squares). The relative errors obtained for the best model was around 3% for the sugar determination, which is acceptable for this kind of determination.