910 resultados para weighted linear regression
Resumo:
RATIONALE: The interaction between lungs and chest wall influences lung volume, that determines lung history during respiration cycle. In this study, the influence of chest wall mechanics on respiratory system is assessed by the evaluation of inspiration pressure-volume curve (PV curve) under three different situations: closed-chest, open-chest and isolated lung. The PV curve parameters in each situation allow us to further understand the role played by different chest wall elements in the respiratory function. Methods: Twenty-four male Wistar rats (236 ± 29 g) were used. The animals were weighted and then anesthetized with xylazine 2% (0,5mL/kg) and ketamine 10% (0,9mL/kg), exsanguinated and later tracheostomized with a metallic cannula (14 gauge). The cannula was connected to an automatic small animal insufflator. This setup was connected to a pressure transducer (32 samples/s). The 24 animals were randomly separated in three groups: (i) closed chest, (ii) open chest and (iii) isolated lung. The rats were insufflated with 20mL quasi-statically (constant speed of 0,1mL/s). Insufflated volume and measured pressure data were kept and PV curves were obtained for all animals. The PV curves were fitted (non-linear least squares) against the sigmoid equation (1) to obtain the sigmoid equation parameters (a,b,c,d). Elastance measurements were obtained from linear regression of pressure/volume measurements in a 0,8s interval before and after the calculated point. Results: The parameters a, b and c showed no significant change, but the parameter d showed a significant variation among the three groups. The initial elastance also varied between open and closed chest, indicating the need of a higher pressure for the lung expansion, as can be seen in Table 1. Table 1: Mean and Standard Deviation of parameters obtained for each protocol. Protocol: Closed Chest – a (mL) -0.35±0.33; b (mL) 13.93±0.89; c (cm H2O) 21.28±2.37; d (cm H2O) 6.17±0.84; r²** (%) 99.4±0.14; Initial Elastance* (cm H2)/mL) 12.72±6.66; Weight (g) 232.33±5.72. Open Chest - a (mL) 0.01±0.28; b (mL) 14.79±0.54; c (cm H2O) 19.47±1.41; d (cm H2O) 3.50±0.28; r²** (%) 98.8±0.34; Initial Elastance* (cm H2)/mL) 28.68±2.36; Weight (g) 217.33±7.97. Isolated Lung - a (mL) -0.09±0.46; b (mL) 14.22±0.75; c (cm H2O) 21.76±1.43; d (cm H2O) 4.24±0.50; r²** (%) 98.9±0.19; Initial Elastance* (cm H2)/mL) 7.13±8.85; Weight (g) 224.33±16.66. * Elastance measures in the 0-0,1 mL range. ** Goodness of sigmoid fit versus measured data Conclusion: A supporting effect of the chest wall was observed at the initial moments of inspiration, observed as a higher initial elastance in open chest situations than in closed chest situations (p=0,00001). The similar initial elastance for the isolated lung and closed chest may be explained by the specific method used for the isolated lung experiment. As the isolated lung is supported by the trachea vertically, the weight of the tissue may have a similar effect of the residual negative pressure in the thorax, responsible for maintaining the residual volume.
Resumo:
Indoor radon is regularly measured in Switzerland. However, a nationwide model to predict residential radon levels has not been developed. The aim of this study was to develop a prediction model to assess indoor radon concentrations in Switzerland. The model was based on 44,631 measurements from the nationwide Swiss radon database collected between 1994 and 2004. Of these, 80% randomly selected measurements were used for model development and the remaining 20% for an independent model validation. A multivariable log-linear regression model was fitted and relevant predictors selected according to evidence from the literature, the adjusted R², the Akaike's information criterion (AIC), and the Bayesian information criterion (BIC). The prediction model was evaluated by calculating Spearman rank correlation between measured and predicted values. Additionally, the predicted values were categorised into three categories (50th, 50th-90th and 90th percentile) and compared with measured categories using a weighted Kappa statistic. The most relevant predictors for indoor radon levels were tectonic units and year of construction of the building, followed by soil texture, degree of urbanisation, floor of the building where the measurement was taken and housing type (P-values <0.001 for all). Mean predicted radon values (geometric mean) were 66 Bq/m³ (interquartile range 40-111 Bq/m³) in the lowest exposure category, 126 Bq/m³ (69-215 Bq/m³) in the medium category, and 219 Bq/m³ (108-427 Bq/m³) in the highest category. Spearman correlation between predictions and measurements was 0.45 (95%-CI: 0.44; 0.46) for the development dataset and 0.44 (95%-CI: 0.42; 0.46) for the validation dataset. Kappa coefficients were 0.31 for the development and 0.30 for the validation dataset, respectively. The model explained 20% overall variability (adjusted R²). In conclusion, this residential radon prediction model, based on a large number of measurements, was demonstrated to be robust through validation with an independent dataset. The model is appropriate for predicting radon level exposure of the Swiss population in epidemiological research. Nevertheless, some exposure misclassification and regression to the mean is unavoidable and should be taken into account in future applications of the model.
Resumo:
BACKGROUND Lead exposure is associated with low birth-weight. The objective of this study is to determine whether lead exposure is associated with lower body weight in children, adolescents and adults. METHODS We analyzed data from NHANES 1999-2006 for participants aged ≥3 using multiple logistic and multivariate linear regression. Using age- and sex-standardized BMI Z-scores, overweight and obese children (ages 3-19) were classified by BMI ≥85 th and ≥95 th percentiles, respectively. The adult population (age ≥20) was classified as overweight and obese with BMI measures of 25-29.9 and ≥30, respectively. Blood lead level (BLL) was categorized by weighted quartiles. RESULTS Multivariate linear regressions revealed a lower BMI Z-score in children and adolescents when the highest lead quartile was compared to the lowest lead quartile (β (SE)=-0.33 (0.07), p<0.001), and a decreased BMI in adults (β (SE)=-2.58 (0.25), p<0.001). Multiple logistic analyses in children and adolescents found a negative association between BLL and the percentage of obese and overweight with BLL in the highest quartile compared to the lowest quartile (OR=0.42, 95% CI: 0.30-0.59; and OR=0.67, 95% CI: 0.52-0.88, respectively). Adults in the highest lead quartile were less likely to be obese (OR=0.42, 95% CI: 0.35-0.50) compared to those in the lowest lead quartile. Further analyses with blood lead as restricted cubic splines, confirmed the dose-relationship between blood lead and body weight outcomes. CONCLUSIONS BLLs are associated with lower body mass index and obesity in children, adolescents and adults.
Resumo:
robreg provides a number of robust estimators for linear regression models. Among them are the high breakdown-point and high efficiency MM-estimator, the Huber and bisquare M-estimator, and the S-estimator, each supporting classic or robust standard errors. Furthermore, basic versions of the LMS/LQS (least median of squares) and LTS (least trimmed squares) estimators are provided. Note that the moremata package, also available from SSC, is required.
Resumo:
Experimentally renal tissue hypoxia appears to play an important role in the pathogenesis of chronic kidney disease (CKD) and arterial hypertension (AHT). In this study we measured renal tissue oxygenation and its determinants in humans using blood oxygenation level-dependent magnetic resonance imaging (BOLD-MRI) under standardized hydration conditions. Four coronal slices were selected, and a multi gradient echo sequence was used to acquire T2* weighted images. The mean cortical and medullary R2* values ( = 1/T2*) were calculated before and after administration of IV furosemide, a low R2* indicating a high tissue oxygenation. We studied 195 subjects (95 CKD, 58 treated AHT, and 42 healthy controls). Mean cortical R2 and medullary R2* were not significantly different between the groups at baseline. In stimulated conditions (furosemide injection), the decrease in R2* was significantly blunted in patients with CKD and AHT. In multivariate linear regression analyses, neither cortical nor medullary R2* were associated with eGFR or blood pressure, but cortical R2* correlated positively with male gender, blood glucose and uric acid levels. In conclusion, our data show that kidney oxygenation is tightly regulated in CKD and hypertensive patients at rest. However, the metabolic response to acute changes in sodium transport is altered in CKD and in AHT, despite preserved renal function in the latter group. This suggests the presence of early renal metabolic alterations in hypertension. The correlations between cortical R2* values, male gender, glycemia and uric acid levels suggest that these factors interfere with the regulation of renal tissue oxygenation.
Resumo:
Parameter estimates from commonly used multivariable parametric survival regression models do not directly quantify differences in years of life expectancy. Gaussian linear regression models give results in terms of absolute mean differences, but are not appropriate in modeling life expectancy, because in many situations time to death has a negative skewed distribution. A regression approach using a skew-normal distribution would be an alternative to parametric survival models in the modeling of life expectancy, because parameter estimates can be interpreted in terms of survival time differences while allowing for skewness of the distribution. In this paper we show how to use the skew-normal regression so that censored and left-truncated observations are accounted for. With this we model differences in life expectancy using data from the Swiss National Cohort Study and from official life expectancy estimates and compare the results with those derived from commonly used survival regression models. We conclude that a censored skew-normal survival regression approach for left-truncated observations can be used to model differences in life expectancy across covariates of interest.
Resumo:
Climatic relationships were established in two 210Pb dated pollen sequences from small mires closely surrounded by forest just below actual forest limits (but about 300 m below potential climatic forest limits) in the northern Swiss Alps (suboceanic in climate; mainly with Picea) and the central Swiss Alps (subcontinental; mainly Pinus cembra and Larix) at annual or near-annual resolution from ad 1901 to 1996. Effects of vegetational succession were removed by splitting the time series into early and late periods and by linear detrending. Both pollen concentrations detrended by the depth-age model and modified percentages (in which counts of dominant pollen types are down-weighted) are correlated by simple linear regression with smoothed climatic parameters with one-and two-year timelags, including average monthly and April/September daylight air temperatures and with seasonal and annual precipitation sums. Results from detrended pollen concentrations suggest that peat accumulation is favoured in the northern-Alpine mire either by early snowmelt or by summer precipitation, but in the central-Alpine mire by increased precipitation and cooler summers, suggesting a position of the northern-Alpine mire near the upper altitudinal limit of peat formation, but of the central-Alpine mire near the lower limit. Results from modified pollen percentages indicate that pollen pro duction by plants growing near their upper altitudinal limit is limited by insufficient warmth in summer, and pollen production by plants growing near their lower altitudinal limit is limited by too-high temperatures. Only weakly significant pollen/climate relationships were found for Pinus cembra and Larix, probably because they experience little climatic stress growing 300 m below the potential climatic forest limit.
Resumo:
Interaction effect is an important scientific interest for many areas of research. Common approach for investigating the interaction effect of two continuous covariates on a response variable is through a cross-product term in multiple linear regression. In epidemiological studies, the two-way analysis of variance (ANOVA) type of method has also been utilized to examine the interaction effect by replacing the continuous covariates with their discretized levels. However, the implications of model assumptions of either approach have not been examined and the statistical validation has only focused on the general method, not specifically for the interaction effect.^ In this dissertation, we investigated the validity of both approaches based on the mathematical assumptions for non-skewed data. We showed that linear regression may not be an appropriate model when the interaction effect exists because it implies a highly skewed distribution for the response variable. We also showed that the normality and constant variance assumptions required by ANOVA are not satisfied in the model where the continuous covariates are replaced with their discretized levels. Therefore, naïve application of ANOVA method may lead to an incorrect conclusion. ^ Given the problems identified above, we proposed a novel method modifying from the traditional ANOVA approach to rigorously evaluate the interaction effect. The analytical expression of the interaction effect was derived based on the conditional distribution of the response variable given the discretized continuous covariates. A testing procedure that combines the p-values from each level of the discretized covariates was developed to test the overall significance of the interaction effect. According to the simulation study, the proposed method is more powerful then the least squares regression and the ANOVA method in detecting the interaction effect when data comes from a trivariate normal distribution. The proposed method was applied to a dataset from the National Institute of Neurological Disorders and Stroke (NINDS) tissue plasminogen activator (t-PA) stroke trial, and baseline age-by-weight interaction effect was found significant in predicting the change from baseline in NIHSS at Month-3 among patients received t-PA therapy.^
Resumo:
Strategies are compared for the development of a linear regression model with stochastic (multivariate normal) regressor variables and the subsequent assessment of its predictive ability. Bias and mean squared error of four estimators of predictive performance are evaluated in simulated samples of 32 population correlation matrices. Models including all of the available predictors are compared with those obtained using selected subsets. The subset selection procedures investigated include two stopping rules, C$\sb{\rm p}$ and S$\sb{\rm p}$, each combined with an 'all possible subsets' or 'forward selection' of variables. The estimators of performance utilized include parametric (MSEP$\sb{\rm m}$) and non-parametric (PRESS) assessments in the entire sample, and two data splitting estimates restricted to a random or balanced (Snee's DUPLEX) 'validation' half sample. The simulations were performed as a designed experiment, with population correlation matrices representing a broad range of data structures.^ The techniques examined for subset selection do not generally result in improved predictions relative to the full model. Approaches using 'forward selection' result in slightly smaller prediction errors and less biased estimators of predictive accuracy than 'all possible subsets' approaches but no differences are detected between the performances of C$\sb{\rm p}$ and S$\sb{\rm p}$. In every case, prediction errors of models obtained by subset selection in either of the half splits exceed those obtained using all predictors and the entire sample.^ Only the random split estimator is conditionally (on $\\beta$) unbiased, however MSEP$\sb{\rm m}$ is unbiased on average and PRESS is nearly so in unselected (fixed form) models. When subset selection techniques are used, MSEP$\sb{\rm m}$ and PRESS always underestimate prediction errors, by as much as 27 percent (on average) in small samples. Despite their bias, the mean squared errors (MSE) of these estimators are at least 30 percent less than that of the unbiased random split estimator. The DUPLEX split estimator suffers from large MSE as well as bias, and seems of little value within the context of stochastic regressor variables.^ To maximize predictive accuracy while retaining a reliable estimate of that accuracy, it is recommended that the entire sample be used for model development, and a leave-one-out statistic (e.g. PRESS) be used for assessment. ^
Resumo:
We propose a linear regression method for estimating Weibull parameters from life tests. The method uses stochastic models of the unreliability at each failure instant. As a result, a heteroscedastic regression problem arises that is solved by weighted least squares minimization. The main feature of our method is an innovative s-normalization of the failure data models, to obtain analytic expressions of centers and weights for the regression. The method has been Monte Carlo contrasted with Benard?s approximation, and Maximum Likelihood Estimation; and it has the highest global scores for its robustness, and performance.
Resumo:
Specific cutting energy (SE) has been widely used to assess the rock cuttability for mechanical excavation purposes. Some prediction models were developed for SE through correlating rock properties with SE values. However, some of the textural and compositional rock parameters i.e. texture coefficient and feldspar, mafic, and felsic mineral contents were not considered. The present study is to investigate the effects of previously ignored rock parameters along with engineering rock properties on SE. Mineralogical and petrographic analyses, rock mechanics, and linear rock cutting tests were performed on sandstone samples taken from sites around Ankara, Turkey. Relationships between SE and rock properties were evaluated using bivariate correlation and linear regression analyses. The tests and subsequent analyses revealed that the texture coefficient and feldspar content of sandstones affected rock cuttability, evidenced by significant correlations between these parameters and SE at a 90% confidence level. Felsic and mafic mineral contents of sandstones did not exhibit any statistically significant correlation against SE. Cementation coefficient, effective porosity, and pore volume had good correlations against SE. Poisson's ratio, Brazilian tensile strength, Shore scleroscope hardness, Schmidt hammer hardness, dry density, and point load strength index showed very strong linear correlations against SE at confidence levels of 95% and above, all of which were also found suitable to be used in predicting SE individually, depending on the results of regression analysis, ANOVA, Student's t-tests, and R2 values. Poisson's ratio exhibited the highest correlation with SE and seemed to be the most reliable SE prediction tool in sandstones.
Resumo:
Specific cutting energy (SE) has been widely used to assess the rock cuttability for mechanical excavation purposes. Some prediction models were developed for SE through correlating rock properties with SE values. However, some of the textural and compositional rock parameters i.e. texture coefficient and feldspar, mafic, and felsic mineral contents were not considered. The present study is to investigate the effects of previously ignored rock parameters along with engineering rock properties on SE. Mineralogical and petrographic analyses, rock mechanics, and linear rock cutting tests were performed on sandstone samples taken from sites around Ankara, Turkey. Relationships between SE and rock properties were evaluated using bivariate correlation and linear regression analyses. The tests and subsequent analyses revealed that the texture coefficient and feldspar content of sandstones affected rock cuttability, evidenced by significant correlations between these parameters and SE at a 90% confidence level. Felsic and mafic mineral contents of sandstones did not exhibit any statistically significant correlation against SE. Cementation coefficient, effective porosity, and pore volume had good correlations against SE. Poisson's ratio, Brazilian tensile strength, Shore scleroscope hardness, Schmidt hammer hardness, dry density, and point load strength index showed very strong linear correlations against SE at confidence levels of 95% and above, all of which were also found suitable to be used in predicting SE individually, depending on the results of regression analysis, ANOVA, Student's t-tests, and R-2 values. Poisson's ratio exhibited the highest correlation with SE and seemed to be the most reliable SE prediction tool in sandstones.
Resumo:
Background: The residue-wise contact order (RWCO) describes the sequence separations between the residues of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered as a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable important information to reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. Results: We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences. Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. Conclusion: The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.
Resumo:
Correlation and regression are two of the statistical procedures most widely used by optometrists. However, these tests are often misused or interpreted incorrectly, leading to erroneous conclusions from clinical experiments. This review examines the major statistical tests concerned with correlation and regression that are most likely to arise in clinical investigations in optometry. First, the use, interpretation and limitations of Pearson's product moment correlation coefficient are described. Second, the least squares method of fitting a linear regression to data and for testing how well a regression line fits the data are described. Third, the problems of using linear regression methods in observational studies, if there are errors associated in measuring the independent variable and for predicting a new value of Y for a given X, are discussed. Finally, methods for testing whether a non-linear relationship provides a better fit to the data and for comparing two or more regression lines are considered.
Resumo:
Fitting a linear regression to data provides much more information about the relationship between two variables than a simple correlation test. A goodness of fit test of the line should always be carried out. Hence, ‘r squared’ estimates the strength of the relationship between Y and X, ANOVA whether a statistically significant line is present, and the ‘t’ test whether the slope of the line is significantly different from zero. In addition, it is important to check whether the data fit the assumptions for regression analysis and, if not, whether a transformation of the Y and/or X variables is necessary.