82 results for Heterogeneous regression
Abstract:
This article investigates the impact of exposure to a serious, unusual, and unforeseen malaria epidemic in northeast Brazil in 1938–40 on subsequent human capital attainment and income. Arguing the event was exogenous, the article exploits cohort and regional heterogeneity in exposure to identify effects. Results are consistent with differential mortality rates according to gender and socioeconomic status, such that heterogeneous selection and scarring effects are observed. Analyzing by gender alone, positive (selection) effects are found for men, and mixed (positive and negative) effects for women. Allowing for heterogeneity by race, selection effects persist for men. In contrast, positive (selection) effects are observed for nonwhite women, and negative (scarring) effects for white women. Results contribute to evidence suggesting that exposure to negative environmental shocks affects human capital attainment, while also suggesting it heterogeneously affects cohort composition.
Abstract:
Forecasting wind power is an important part of a successful integration of wind power into the power grid. Forecasts with lead times longer than 6 h are generally made by using statistical methods to post-process forecasts from numerical weather prediction systems. Two major problems that complicate this approach are the non-linear relationship between wind speed and power production and the limited range of power production between zero and nominal power of the turbine. In practice, these problems are often tackled by using non-linear non-parametric regression models. However, such an approach ignores valuable and readily available information: the power curve of the turbine's manufacturer. Much of the non-linearity can be directly accounted for by transforming the observed power production into wind speed via the inverse power curve so that simpler linear regression models can be used. Furthermore, the fact that the transformed power production has a limited range can be taken care of by employing censored regression models. In this study, we evaluate quantile forecasts from a range of methods: (i) using parametric and non-parametric models, (ii) with and without the proposed inverse power curve transformation and (iii) with and without censoring. The results show that with our inverse (power-to-wind) transformation, simpler linear regression models with censoring perform as well as or better than non-linear models with or without the frequently used wind-to-power transformation.
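The power-to-wind transformation described in this abstract can be illustrated with a minimal sketch. The power curve values below are hypothetical (not from any real turbine or from the study), and the names `CURVE_SPEED`, `CURVE_POWER`, `power_to_wind` and `censoring_flags` are illustrative assumptions. The sketch only maps observed power onto the wind-speed scale and marks which observations sit at the censoring limits; it does not fit the censored regression itself.

```python
import numpy as np

# Hypothetical manufacturer power curve (illustrative numbers only):
# wind speed in m/s -> power as a fraction of nominal power.
# The curve is strictly increasing between cut-in and rated speed,
# so it can be inverted by interpolation.
CURVE_SPEED = np.array([3.0, 5.0, 7.0, 9.0, 11.0, 13.0])
CURVE_POWER = np.array([0.0, 0.1, 0.3, 0.6, 0.9, 1.0])


def power_to_wind(power):
    """Map observed power (fraction of nominal) to the wind-speed scale."""
    power = np.clip(np.asarray(power, dtype=float), 0.0, 1.0)
    # Swapping the roles of the two arrays inverts the monotone curve.
    return np.interp(power, CURVE_POWER, CURVE_SPEED)


def censoring_flags(power):
    """Flag observations at the lower (zero) and upper (nominal) limits."""
    power = np.asarray(power, dtype=float)
    return power <= 0.0, power >= 1.0


observed = np.array([0.0, 0.2, 0.6, 1.0])
speeds = power_to_wind(observed)
left_censored, right_censored = censoring_flags(observed)
```

In this sketch the transformed values can never fall below the cut-in speed or rise above the rated speed, which is the limited range that a censored regression model would have to account for.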
Abstract:
We use sunspot group observations from the Royal Greenwich Observatory (RGO) to investigate the effects of intercalibrating data from observers with different visual acuities. The tests are made by counting the number of groups RB above a variable cut-off threshold of observed total whole-spot area (uncorrected for foreshortening) to simulate what a lower acuity observer would have seen. The synthesised annual means of RB are then re-scaled to the full observed RGO group number RA using a variety of regression techniques. It is found that a very high correlation between RA and RB (rAB > 0.98) does not prevent large errors in the intercalibration (for example sunspot maximum values can be over 30% too large even for such levels of rAB). In generating the backbone sunspot number (RBB), Svalgaard and Schatten (2015, this issue) force regression fits to pass through the scatter plot origin which generates unreliable fits (the residuals do not form a normal distribution) and causes sunspot cycle amplitudes to be exaggerated in the intercalibrated data. It is demonstrated that the use of Quantile-Quantile (“Q-Q”) plots to test for a normal distribution is a useful indicator of erroneous and misleading regression fits. Ordinary least squares linear fits, not forced to pass through the origin, are sometimes reliable (although the optimum method used is shown to be different when matching peak and average sunspot group numbers). However, other fits are only reliable if non-linear regression is used. From these results it is entirely possible that the inflation of solar cycle amplitudes in the backbone group sunspot number as one goes back in time, relative to related solar-terrestrial parameters, is entirely caused by the use of inappropriate and non-robust regression techniques to calibrate the sunspot data.
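The effect of forcing a fit through the origin can be shown with a small sketch. The data are synthetic and noise-free (they are not RGO values), and the variable names are illustrative assumptions. The assumed relation has a non-zero intercept, so the two series are perfectly correlated, yet the through-origin fit still overestimates the largest value.

```python
import numpy as np

# Synthetic, noise-free data (illustrative only, not RGO values).
# Assumed relation with a non-zero intercept: r_a = 2 + 1.0 * r_b.
r_b = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
r_a = 2.0 + 1.0 * r_b

# Least-squares fit forced through the origin: slope = sum(x*y) / sum(x*x).
slope_origin = np.sum(r_b * r_a) / np.sum(r_b * r_b)

# Ordinary least squares with a free intercept.
slope_ols, intercept_ols = np.polyfit(r_b, r_a, 1)

# Residuals of each fit.
resid_origin = r_a - slope_origin * r_b
resid_ols = r_a - (slope_ols * r_b + intercept_ols)

# Predicted value at the largest r_b (a stand-in for a cycle maximum).
peak_origin = slope_origin * r_b.max()
peak_ols = slope_ols * r_b.max() + intercept_ols

# The linear correlation is perfect even though one fit is biased.
corr = np.corrcoef(r_b, r_a)[0, 1]
```

Under these assumptions the through-origin residuals fall steadily from positive to negative as `r_b` grows, while the ordinary least squares residuals are zero. A high correlation coefficient alone therefore says nothing about whether the chosen fit is appropriate.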
Abstract:
The contraction of a species’ distribution range, which results from the extirpation of local populations, generally precedes its extinction. Therefore, understanding drivers of range contraction is important for conservation and management. Although there are many processes that can potentially lead to local extirpation and range contraction, three main null models have been proposed: demographic, contagion, and refuge. The first two models postulate that the probability of local extirpation for a given area depends on its relative position within the range; but these models generate distinct spatial predictions because they assume either a ubiquitous (demographic) or a clinal (contagion) distribution of threats. The third model (refuge) postulates that extirpations are determined by the intensity of human impacts, leading to heterogeneous spatial predictions potentially compatible with those made by the other two null models. A few previous studies have explored the generality of some of these null models, but we present here the first comprehensive evaluation of all three models. Using descriptive indices and regression analyses, we contrast the predictions made by each of the null models using empirical spatial data describing range contraction in 386 terrestrial vertebrates (mammals, birds, amphibians, and reptiles) distributed across the world. Observed contraction patterns do not consistently conform to the predictions of any of the three models, suggesting that these may not be adequate null models to evaluate range contraction dynamics among terrestrial vertebrates. Instead, our results support alternative null models that account for both relative position and intensity of human impacts. These new models provide a better multifactorial baseline to describe range contraction patterns in vertebrates. This general baseline can be used to explore how additional factors influence contraction, and ultimately extinction for particular areas or species as well as to predict future changes in light of current and new threats.
Abstract:
Understanding how wildlife responds to roads and traffic is essential for effective conservation. Yet, not many studies have evaluated how roads influence wildlife in protected areas, particularly within the large iconic African National Parks where tourism is mainly based on sightings from motorized vehicles with the consequent development and intense use of roads. To reduce this knowledge gap, we studied the behavioral response and local spatial distribution of impala Aepyceros melampus along the heterogeneous (with variation in road surface type and traffic intensity) road-network of Kruger National Park (KNP, South Africa). We surveyed different types of roads (paved and unpaved) recording the occurrence of flight responses among sighted impala and describing their local spatial distribution (in relation to the roads). We observed relatively few flight responses (19.5% of 118 observations), suggesting impalas could be partly habituated to vehicles in KNP. In addition, impala local distribution is apparently unaffected by unpaved roads, yet animals seem to avoid the close proximity of paved roads. Overall, our results suggest a negative, albeit small, effect of traffic intensity, and of presence of pavement on roads on the behavior of impala at KNP. Future studies would be necessary to understand how roads influence other species, but our results show that even within a protected area that has been well-visited for a long time, wildlife can still be affected by roads and traffic. This result has ecological (e.g., changes in spatial distribution of fauna) and management implications (e.g., challenges of facilitating wildlife sightings while minimizing disturbance) for protected areas where tourist activities are largely based on driving.
Abstract:
Turbulent surface fluxes of momentum and sensible and latent heat as well as surface temperature, air temperature, air humidity, and wind speed were measured by the German Falcon research aircraft over the marginal ice zone (MIZ) of the northern Baltic Sea and the Fram Strait. Applying the bulk formulas and the stability functions to the measurements, the roughness lengths for momentum z0, sensible heat zT, and latent heat zq were calculated. As mean values over a wide range of sea ice conditions, we obtain z0 = 5 × 10⁻⁴ m, zT = 1 × 10⁻⁸ m, and zq = 1 × 10⁻⁷ m. These correspond to the following mean values (± standard deviations) of neutral transfer coefficients reduced to 10 m height, CDN10 = (1.9 ± 0.8) × 10⁻³, CHN10 = (0.9 ± 0.3) × 10⁻³, and CEN10 = (1.0 ± 0.2) × 10⁻³. An average ratio of z0/zT ≈ 10⁴ was observed over the range of 10⁻⁶ m < z0 < 10⁻² m and differs from previously published results over compact sea ice (10⁻¹ < z0/zT < 10³). Other observational results over heterogeneous sea ice do not exist. However, our z0/zT ratio approximately agrees with observations over heterogeneous land surfaces. Flux parameterizations based on commonly used roughness lengths ratios (z0 = zT = zq) overestimate the surface heat fluxes compared to our measurements by more than 100%.
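The link between roughness lengths and neutral transfer coefficients can be sketched with the standard neutral logarithmic-profile relation, C = κ² / [ln(z/z0) · ln(z/zs)], where zs is z0 for momentum and zT or zq for heat. This is a minimal illustration, not the authors' processing chain: the von Kármán constant of 0.4 and the function name `neutral_coeff` are assumptions, and because the abstract reports averages of individual coefficients, plugging in the mean roughness lengths is not expected to reproduce the quoted means exactly.

```python
import math

KAPPA = 0.4  # von Karman constant (assumed value)


def neutral_coeff(z_ref, z0, z_scalar=None):
    """Neutral bulk transfer coefficient at height z_ref (metres).

    With z_scalar=None this returns the drag coefficient (z_scalar = z0);
    otherwise z_scalar is the roughness length for heat or moisture.
    """
    if z_scalar is None:
        z_scalar = z0
    return KAPPA ** 2 / (math.log(z_ref / z0) * math.log(z_ref / z_scalar))


# Mean roughness lengths quoted in the abstract (metres).
Z0, ZT, ZQ = 5e-4, 1e-8, 1e-7

cdn10 = neutral_coeff(10.0, Z0)       # momentum
chn10 = neutral_coeff(10.0, Z0, ZT)   # sensible heat
cen10 = neutral_coeff(10.0, Z0, ZQ)   # latent heat

# Common simplification z0 = zT: the heat coefficient equals the drag coefficient.
chn10_equal = neutral_coeff(10.0, Z0, Z0)
overestimate_ratio = chn10_equal / chn10
```

With these inputs the drag coefficient comes out near 1.6 × 10⁻³ and the heat coefficient near 0.8 × 10⁻³, the same order as the means in the abstract. Setting zT equal to z0 roughly doubles the neutral heat coefficient in this sketch, which points the same way as the reported overestimate, although the actual fluxes also depend on stability.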
Abstract:
Tensor clustering is an important tool that exploits intrinsically rich structures in real-world multiarray or tensor datasets. Often in dealing with those datasets, standard practice is to use subspace clustering that is based on vectorizing multiarray data. However, vectorization of tensorial data does not exploit complete structure information. In this paper, we propose a subspace clustering algorithm without adopting any vectorization process. Our approach is based on a novel heterogeneous Tucker decomposition model taking into account cluster membership information. We propose a new clustering algorithm that alternates between different modes of the proposed heterogeneous tensor model. All but the last mode have closed-form updates. Updating the last mode reduces to optimizing over the multinomial manifold, for which we investigate second-order Riemannian geometry and propose a trust-region algorithm. Numerical experiments show that our proposed algorithm competes effectively with state-of-the-art clustering algorithms that are based on tensor factorization.