9 resultados para Quantile regressions

em AMS Tesi di Dottorato - Alm@DL - Università di Bologna


Relevância:

60.00% 60.00%

Publicador:

Resumo:

The aim of my dissertation is to study the gender wage gap with a specific focus on developing and transition countries. In the first chapter I present the main existing theories proposed to analyse the gender wage gap and I review the empirical literature on the gender wage gap in developing and transition countries and its main findings. Then, I discuss the overall empirical issues related to the estimation of the gender wage gap and the issues specific to developing and transition countries. The second chapter is an empirical analysis of the gender wage gap in a developing countries, the Union of Comoros, using data from the multidimensional household budget survey “Enquete integrale auprès des ménages” (EIM) run in 2004. The interest of my work is to provide a benchmark analysis for further studies on the situation of women in the Comorian labour market and to contribute to the literature on gender wage gap in Africa by making available more information on the dynamics and mechanism of the gender wage gap, given the limited interest on the topic in this area of the world. The third chapter is an applied analysis of the gender wage gap in a transition country, Poland, using data from the Labour Force Survey (LSF) collected for the years 1994 and 2004. I provide a detailed examination of how gender earning differentials have changed over the period starting from 1994 to a more advanced transition phase in 2004, when market elements have become much more important in the functioning of the Polish economy than in the earlier phase. The main contribution of my dissertation is the application of the econometrical methodology that I describe in the beginning of the second chapter. First, I run a preliminary OLS and quantile regression analysis to estimate and describe the raw and conditional wage gaps along the distribution. Second, I estimate quantile regressions separately for males and females, in order to allow for different rewards to characteristics. Third, I proceed to decompose the raw wage gap estimated at the mean through the Oaxaca-Blinder (1973) procedure. In the second chapter I run a two-steps Heckman procedure by estimating a model of participation in the labour market which shows a significant selection bias for females. Forth, I apply the Machado-Mata (2005) techniques to extend the decomposition analysis at all points of the distribution. In Poland I can also implement the Juhn, Murphy and Pierce (1991) decomposition over the period 1994-2004, to account for effects to the pay gap due to changes in overall wage dispersion beyond Oaxaca’s standard decomposition.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this work, we explore and demonstrate the potential for modeling and classification using quantile-based distributions, which are random variables defined by their quantile function. In the first part we formalize a least squares estimation framework for the class of linear quantile functions, leading to unbiased and asymptotically normal estimators. Among the distributions with a linear quantile function, we focus on the flattened generalized logistic distribution (fgld), which offers a wide range of distributional shapes. A novel naïve-Bayes classifier is proposed that utilizes the fgld estimated via least squares, and through simulations and applications, we demonstrate its competitiveness against state-of-the-art alternatives. In the second part we consider the Bayesian estimation of quantile-based distributions. We introduce a factor model with independent latent variables, which are distributed according to the fgld. Similar to the independent factor analysis model, this approach accommodates flexible factor distributions while using fewer parameters. The model is presented within a Bayesian framework, an MCMC algorithm for its estimation is developed, and its effectiveness is illustrated with data coming from the European Social Survey. The third part focuses on depth functions, which extend the concept of quantiles to multivariate data by imposing a center-outward ordering in the multivariate space. We investigate the recently introduced integrated rank-weighted (IRW) depth function, which is based on the distribution of random spherical projections of the multivariate data. This depth function proves to be computationally efficient and to increase its flexibility we propose different methods to explicitly model the projected univariate distributions. Its usefulness is shown in classification tasks: the maximum depth classifier based on the IRW depth is proven to be asymptotically optimal under certain conditions, and classifiers based on the IRW depth are shown to perform well in simulated and real data experiments.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This doctoral thesis aims at contributing to the literature on transition economies focusing on the Russian Federations and in particular on regional income convergence and fertility patterns. The first two chapter deal with the issue of income convergence across regions. Chapter 1 provides an historical-institutional analysis of the period between the late years of the Soviet Union and the last decade of economic growth and a presentation of the sample with a description of gross regional product composition, agrarian or industrial vocation, labor. Chapter 2 contributes to the literature on exploratory spatial data analysis with a application to a panel of 77 regions in the period 1994-2008. It provides an analysis of spatial patterns and it extends the theoretical framework of growth regressions controlling for spatial correlation and heterogeneity. Chapter 3 analyses the national demographic patterns since 1960 and provides a review of the policies on maternity leave and family benefits. Data sources are the Statistical Yearbooks of USSR, the Statistical Yearbooks of the Russian Soviet Federative Socialist Republic and the Demographic Yearbooks of Russia. Chapter 4 analyses the demographic patterns in light of the theoretical framework of the Becker model, the Second Demographic Transition and an economic-crisis argument. With national data from 1960, the theoretically issue of the pro or countercyclical relation between income and fertility is graphically analyzed and discussed, together with female employment and education. With regional data after 1994 different panel data models are tested. Individual level data from the Russian Longitudinal Monitoring Survey are employed using the logit model. Chapter 5 employs data from the Generations and Gender Survey by UNECE to focus on postponement and second births intentions. Postponement is studied through cohort analysis of mean maternal age at first birth, while the methodology used for second birth intentions is the ordered logit model.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The aim of this thesis is to apply multilevel regression model in context of household surveys. Hierarchical structure in this type of data is characterized by many small groups. In last years comparative and multilevel analysis in the field of perceived health have grown in size. The purpose of this thesis is to develop a multilevel analysis with three level of hierarchy for Physical Component Summary outcome to: evaluate magnitude of within and between variance at each level (individual, household and municipality); explore which covariates affect on perceived physical health at each level; compare model-based and design-based approach in order to establish informativeness of sampling design; estimate a quantile regression for hierarchical data. The target population are the Italian residents aged 18 years and older. Our study shows a high degree of homogeneity within level 1 units belonging from the same group, with an intraclass correlation of 27% in a level-2 null model. Almost all variance is explained by level 1 covariates. In fact, in our model the explanatory variables having more impact on the outcome are disability, unable to work, age and chronic diseases (18 pathologies). An additional analysis are performed by using novel procedure of analysis :"Linear Quantile Mixed Model", named "Multilevel Linear Quantile Regression", estimate. This give us the possibility to describe more generally the conditional distribution of the response through the estimation of its quantiles, while accounting for the dependence among the observations. This has represented a great advantage of our models with respect to classic multilevel regression. The median regression with random effects reveals to be more efficient than the mean regression in representation of the outcome central tendency. A more detailed analysis of the conditional distribution of the response on other quantiles highlighted a differential effect of some covariate along the distribution.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

During my Doctoral study I researched about the remote detection of canopy N concentration in forest stands, its potentials and problems, under many overlapping perspectives. The study consisted of three parts. In S. Rossore 2000 dataset analysis, I tested regressions between N concentration and NIR reflectances derived from different sources (field samples, airborne and satellite sensors). The analysis was further expanded using a larger dataset acquired in year 2009 as part of a new campaign funded by the ESA. In both cases, a good correlation was observed between Landsat NIR, using both TM (2009) and ETM+ (2000) imagery, and N concentration measured by a CHN elemental analyzer. Concerning airborne sensors I did not obtain the same good results, mainly because of the large FOV of the two instruments, and to the anisotropy of vegetation reflectance. We also tested the relation between ground based ASD measures and nitrogen concentration, obtaining really good results. Thus, I decided to expand my study to the regional level, focusing only on field and satellite measures. I analyzed a large dataset for the whole of Catalonia, Spain; MODIS imagery was used, in consideration of its spectral characteristics and despite its rather poor spatial resolution. Also in this case a regression between nitrogen concentration and reflectances was found, but not so good as in previous experiences. Moreover, vegetation type was found to play an important role in the observed relationship. We concluded that MODIS is not the most suitable satellite sensor in realities like Italy and Catalonia, which present a patchy and inhomogeneous vegetation cover; so it could be utilized for the parameterization of eco-physiological and biogeochemical models, but not for really local nitrogen estimate. Thus multispectral sensors similar to Landsat Thematic Mapper, with better spatial resolution, could be the most appropriate sensors to estimate N concentration.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Systemic risk is the protagonist of the recent financial crisis. This thesis proposes a definition and a propagation mechanism for systemic risk. Risk management has a direct linkage with capital management, when addressing the question that the risk handled by a financial institution is compatible with the amount of equity available. This thesis proposes a risk management of liquid market variables, which compose the assets of a bank, based on the statistical tool of PCA. The principal component analysis will define the PCR, or Principal Components of Risk. Such definition of Risk will be adopted to test if the risk represented by PCR is explanatory of the movements of equity and/or debt for the banks included in the in the index Itraxx financial senior: the results of these regressions will be compared with a formal Capital Adequacy test in order to assess the financial soundness of the main financial European institutions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Spatial prediction of hourly rainfall via radar calibration is addressed. The change of support problem (COSP), arising when the spatial supports of different data sources do not coincide, is faced in a non-Gaussian setting; in fact, hourly rainfall in Emilia-Romagna region, in Italy, is characterized by abundance of zero values and right-skeweness of the distribution of positive amounts. Rain gauge direct measurements on sparsely distributed locations and hourly cumulated radar grids are provided by the ARPA-SIMC Emilia-Romagna. We propose a three-stage Bayesian hierarchical model for radar calibration, exploiting rain gauges as reference measure. Rain probability and amounts are modeled via linear relationships with radar in the log scale; spatial correlated Gaussian effects capture the residual information. We employ a probit link for rainfall probability and Gamma distribution for rainfall positive amounts; the two steps are joined via a two-part semicontinuous model. Three model specifications differently addressing COSP are presented; in particular, a stochastic weighting of all radar pixels, driven by a latent Gaussian process defined on the grid, is employed. Estimation is performed via MCMC procedures implemented in C, linked to R software. Communication and evaluation of probabilistic, point and interval predictions is investigated. A non-randomized PIT histogram is proposed for correctly assessing calibration and coverage of two-part semicontinuous models. Predictions obtained with the different model specifications are evaluated via graphical tools (Reliability Plot, Sharpness Histogram, PIT Histogram, Brier Score Plot and Quantile Decomposition Plot), proper scoring rules (Brier Score, Continuous Rank Probability Score) and consistent scoring functions (Root Mean Square Error and Mean Absolute Error addressing the predictive mean and median, respectively). Calibration is reached and the inclusion of neighbouring information slightly improves predictions. All specifications outperform a benchmark model with incorrelated effects, confirming the relevance of spatial correlation for modeling rainfall probability and accumulation.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This dissertation consists of three papers. The first paper "Ethnicity, Migration and Conflict: Evidence from Contemporary South Africa” exploits some of the institutional changes intervened in South Africa during the end of apartheid to investigate the relationship between ethnic diversity and conflict. I find within-ethnic polarization to be significantly related to the intensity of armed confrontations among black-dominated groups. My investigation thus gives strong and robust empirical support to the theoretical arguments which identify ethnic diversity as one of the determinants of civil conflict. The second chapter, "Pre-Colonial Centralization, Colonial Activities and Development in Latin America", investigates the hypothesis that pre-colonial ethnic institutions shaped contemporary regional development in Latin America. I document a strong and positive relationship between pre-colonial centralization and regional development. Results are in line with the view that highly centralized pre-colonial societies acted as a persistent force of agglomeration of economic activities and a strong predictor of colonial state capacity. The results provide a first evidence of the existence of a link between pre-colonial centralization, colonial institutional arrangements and contemporary economic development. The third paper "Bite and Divide: Malaria and Ethnic Diversity” investigates the role of malaria as a fundamental determinant of modern ethnic diversity. This paper explores the hypothesis, that a large exposure to malaria has fostered differential interactions that reduced contacts between groups and increased interactions within them Results document that malaria increases the number of ethnic groups at all levels of spatial disaggregation and time periods (exploiting historical and current ethnic diversity). Regressions' results show that endogamous marriages are more frequent in areas with higher geographic suitability to malaria. The results are in line with the view that malaria increases intra-ethnic interactions while decreasing inter-ethnic ones.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In the first chapter, I develop a panel no-cointegration test which extends Pesaran, Shin and Smith (2001)'s bounds test to the panel framework by considering the individual regressions in a Seemingly Unrelated Regression (SUR) system. This allows to take into account unobserved common factors that contemporaneously affect all the units of the panel and provides, at the same time, unit-specific test statistics. Moreover, the approach is particularly suited when the number of individuals of the panel is small relatively to the number of time series observations. I develop the algorithm to implement the test and I use Monte Carlo simulation to analyze the properties of the test. The small sample properties of the test are remarkable, compared to its single equation counterpart. I illustrate the use of the test through a test of Purchasing Power Parity in a panel of EU15 countries. In the second chapter of my PhD thesis, I verify the Expectation Hypothesis of the Term Structure in the repurchasing agreements (repo) market with a new testing approach. I consider an "inexact" formulation of the EHTS, which models a time-varying component in the risk premia and I treat the interest rates as a non-stationary cointegrated system. The effect of the heteroskedasticity is controlled by means of testing procedures (bootstrap and heteroskedasticity correction) which are robust to variance and covariance shifts over time. I fi#nd that the long-run implications of EHTS are verified. A rolling window analysis clarifies that the EHTS is only rejected in periods of turbulence of #financial markets. The third chapter introduces the Stata command "bootrank" which implements the bootstrap likelihood ratio rank test algorithm developed by Cavaliere et al. (2012). The command is illustrated through an empirical application on the term structure of interest rates in the US.