871 results for panel data analysis
Abstract:
Objective. The goal of this study is to characterize the current workforce of CIHs and the lengths of the professional practice careers of past and current CIHs. Methods. This is a secondary data analysis of data compiled from all of the nearly 50 annual roster listings of the American Board of Industrial Hygiene (ABIH) for Certified Industrial Hygienists active in each year since 1960. Survival analysis was used to measure the primary outcome of interest, with the Kaplan-Meier method used to estimate the survival function. Study subjects. The population studied is all Certified Industrial Hygienists (CIHs). A CIH is defined by the ABIH as an individual who has met the minimum requirements for education and working experience and, through examination, has demonstrated a minimum level of knowledge and competency in the prevention of occupational illnesses. Results. A Cox proportional hazards model analysis was performed by start-time cohort of CIHs, with cohort 1 as the reference cohort. The estimated relative risks of the event (defined as retirement, or absence from 5 consecutive years of listings) for CIHs in cohorts 2, 3, 4, and 5 relative to cohort 1 are 0.385, 0.214, 0.234, and 0.299, respectively. The results show that cohort 2 (CIHs certified in 1970-1980) has the lowest hazard ratio, which indicates the lowest retirement rate. Conclusion. The number of CIHs still actively practicing at the end of 2009 increased sharply starting in 1980 and has plateaued in recent decades. This indicates that the supply and demand of the profession may have reached equilibrium. More demographic information and variables are needed to actually predict the future number of CIHs needed.
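The Kaplan-Meier method named in this abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' code: the function name `kaplan_meier` and the career lengths below are hypothetical, and a censored observation stands for a CIH still listed at the end of follow-up.

```python
from collections import Counter

def kaplan_meier(durations, observed):
    """Return [(time, survival_probability)] at each distinct event time.

    durations: follow-up time for each person (e.g. years on the roster)
    observed:  True if the event occurred, False if the record is censored
    """
    events = Counter(t for t, e in zip(durations, observed) if e)
    exits = Counter(durations)  # events and censorings both leave the risk set
    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = events.get(t, 0)
        if d:
            # Product-limit step: multiply by the share surviving time t
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]
    return curve

# Hypothetical career lengths in years, NOT data from the study
durations = [5, 8, 8, 12, 15, 20]
observed = [True, True, False, True, False, True]
curve = kaplan_meier(durations, observed)
```

People censored at a given time are counted as still at risk at that time, which is the usual convention.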
Abstract:
When choosing among models to describe categorical data, the necessity to consider interactions makes selection more difficult. With just four variables, considering all interactions, there are 166 different hierarchical models and many more non-hierarchical models. Two procedures have been developed for categorical data which will produce the "best" subset or subsets of each model size, where size refers to the number of effects in the model. Both procedures are patterned after the Leaps and Bounds approach used by Furnival and Wilson for continuous data and do not generally require fitting all models. For hierarchical models, likelihood ratio statistics ($G^2$) are computed using iterative proportional fitting, and "best" is determined by comparing, among models with the same number of effects, $\Pr(\chi^2_k \ge G^2_{ij})$, where $k$ is the degrees of freedom for the $i$th model of size $j$. To fit non-hierarchical as well as hierarchical models, a weighted least squares procedure has been developed. The procedures are applied to published occupational data relating to the occurrence of byssinosis. These results are compared to previously published analyses of the same data. Also, the procedures are applied to published data on symptoms in psychiatric patients and again compared to previously published analyses. These procedures will make categorical data analysis more accessible to researchers who are not statisticians. The procedures should also encourage more complex exploratory analyses of epidemiologic data and contribute to the development of new hypotheses for study.
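To make the fitting step concrete, here is a small sketch of iterative proportional fitting for one hierarchical model (all two-way interactions, no three-way interaction) on a three-way table, followed by the likelihood ratio statistic. The function names and the counts are assumptions for illustration; this is not the thesis's subset-selection procedure, only the building block it describes.

```python
import math
from itertools import product

def fit_no_three_way(table, cycles=100):
    """IPF fit of the hierarchical model [AB][AC][BC] to a 3-way count table.

    Assumes every two-way margin of `table` is positive.
    """
    I, J, K = len(table), len(table[0]), len(table[0][0])
    fit = [[[1.0] * K for _ in range(J)] for _ in range(I)]
    for _ in range(cycles):
        for i, j in product(range(I), range(J)):      # match the AB margin
            scale = sum(table[i][j]) / sum(fit[i][j])
            fit[i][j] = [v * scale for v in fit[i][j]]
        for i, k in product(range(I), range(K)):      # match the AC margin
            scale = (sum(table[i][j][k] for j in range(J))
                     / sum(fit[i][j][k] for j in range(J)))
            for j in range(J):
                fit[i][j][k] *= scale
        for j, k in product(range(J), range(K)):      # match the BC margin
            scale = (sum(table[i][j][k] for i in range(I))
                     / sum(fit[i][j][k] for i in range(I)))
            for i in range(I):
                fit[i][j][k] *= scale
    return fit

def g_squared(table, fit):
    """Likelihood ratio statistic G^2 = 2 * sum(obs * ln(obs / fitted))."""
    total = 0.0
    for obs_plane, fit_plane in zip(table, fit):
        for obs_row, fit_row in zip(obs_plane, fit_plane):
            for o, e in zip(obs_row, fit_row):
                if o > 0:
                    total += o * math.log(o / e)
    return 2.0 * total

# Hypothetical 2x2x2 counts, NOT the byssinosis data
observed = [[[10, 20], [30, 40]], [[15, 25], [35, 60]]]
fitted = fit_no_three_way(observed)
g2 = g_squared(observed, fitted)
```

For a 2x2x2 table this model leaves one degree of freedom, so `g2` would be referred to a chi-square distribution with k = 1.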
Abstract:
The purpose of this study is to descriptively analyze the current Pediatric Weight Management Program at Ben Taub in Houston, Texas, a program designed to help overweight children ages three to eighteen lose weight. In Texas, approximately one in every three children is overweight or obese. Obesity is seen at an even greater level within Ben Taub due to the hospital's high rate of service for underserved minority populations (Dehghan et al., 2005; Tyler and Horner, 2008; Hunt, 2009). The weight management program consists of nutritional, behavioral, physical activity, and medical counseling. Analysis will focus on changes in weight, BMI, cholesterol levels, and blood pressure from 2007–2010 for all participants who attended at least two weight management sessions. Recommendations will be given in response to the results of the data analysis.
Abstract:
Objective: In this secondary data analysis, three statistical methodologies were implemented to handle cases with missing data in a motivational interviewing and feedback study. The aim was to evaluate the impact that these methodologies have on the data analysis. Methods: We first evaluated whether the assumption of missing completely at random held for this study. We then conducted a secondary data analysis using a mixed linear model, handling missing data with three methodologies: (a) complete case analysis, (b) multiple imputation with an explicit model containing outcome variables, time, and the interaction of time and treatment, and (c) multiple imputation with an explicit model containing outcome variables, time, the interaction of time and treatment, and additional covariates (e.g., age, gender, smoking status, years in school, marital status, housing, race/ethnicity, and whether participants play on an athletic team). Several comparisons were conducted, including the following: 1) the motivational interviewing with feedback group (MIF) vs. the assessment only group (AO), the motivational interviewing only group (MIO) vs. AO, and the feedback only group (FBO) vs. AO, 2) MIF vs. FBO, and 3) MIF vs. MIO. Results: We first evaluated the patterns of missingness in this study, which indicated that about 13% of participants showed monotone missing patterns, and about 3.5% showed non-monotone missing patterns. We then evaluated the assumption of missing completely at random with Little's missing completely at random (MCAR) test, in which the chi-square test statistic was 167.8 with 125 degrees of freedom and the associated p-value was p=0.006, which indicated that the data could not be assumed to be missing completely at random. After that, we compared whether the three strategies reached the same results. 
For the comparison between MIF and AO as well as the comparison between MIF and FBO, only the multiple imputation with additional covariates by uncongenial and congenial models reached different results. For the comparison between MIF and MIO, all the methodologies for handling missing values obtained different results. Discussion: The study indicated that, first, missingness was crucial in this study. Second, understanding the assumptions of the model was important because we could not determine whether the data were missing at random or missing not at random. Therefore, future research should focus on exploring more sensitivity analyses under the missing-not-at-random assumption.
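The distinction between monotone and non-monotone missing patterns reported in the results can be illustrated with a short sketch. The helper name `missing_pattern` and the example rows are hypothetical; a monotone pattern here means that once a repeated measure is missing, every later measure is missing too (dropout).

```python
def missing_pattern(row):
    """Classify one participant's repeated measures (None marks a missing value)."""
    missing = [value is None for value in row]
    if not any(missing):
        return "complete"
    first_missing = missing.index(True)
    # Monotone: everything from the first missing visit onward is missing
    if all(missing[first_missing:]):
        return "monotone"
    return "non-monotone"

# Hypothetical outcome values at four visits, NOT data from the study
participants = {
    "p1": [12, 10, 9, 8],
    "p2": [14, 11, None, None],   # dropped out after the second visit
    "p3": [13, None, 10, 9],      # skipped one visit, then returned
}
patterns = {pid: missing_pattern(row) for pid, row in participants.items()}
```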
Abstract:
These three manuscripts are presented as a PhD dissertation on the use of GeoVis applications to evaluate telehealth programs. The primary aim of this research was to understand how GeoVis applications can be designed and developed using a combination of the Human Centered (HC) approach and Cognitive Fit Theory (CFT), and in turn used to evaluate a telehealth program in Brazil. First manuscript: The first manuscript presents background on the use of GeoVisualization to facilitate visual exploration of public health data. It covers the challenges associated with the adoption of existing GeoVis applications. It combines the principles of the HC approach and CFT to develop a framework that lays the foundation of this research. The framework is then used to propose the design, development and evaluation of “the SanaViz” to evaluate telehealth data in Brazil, as a proof of concept. Second manuscript: The second manuscript is a methods paper that describes the approaches that can be employed to design and develop “the SanaViz” based on the proposed framework. By defining the various elements of the HC approach and CFT, a mixed methods approach is utilized for the card sorting and sketching techniques. A representative sample of 20 study participants currently involved in the telehealth program at the NUTES telehealth center at UFPE, Recife, Brazil was enrolled. The findings of this manuscript helped us understand the needs of the diverse group of telehealth users and the tasks that they perform, and helped us determine the essential features that might need to be included in the proposed GeoVis application “the SanaViz”. 
Third manuscript: The third manuscript used a mixed methods approach to compare the effectiveness and usefulness of the HC GeoVis application “the SanaViz” against a conventional GeoVis application, “Instant Atlas”. The same group of 20 study participants who had earlier participated during Aim 2 was enrolled, and a combination of quantitative and qualitative assessments was done. Effectiveness was gauged by the time that the participants took to complete the tasks using both GeoVis applications, the ease with which they completed the tasks and the number of attempts taken to complete each task. Usefulness was assessed by the System Usability Scale (SUS), a validated questionnaire tested in prior studies. In-depth interviews were conducted to gather opinions about both GeoVis applications. This manuscript helped us demonstrate the usefulness and effectiveness of HC GeoVis applications in facilitating visual exploration of telehealth data, as a proof of concept. Together, these three manuscripts address the challenges of combining the principles of the Human Centered approach and Cognitive Fit Theory to design and develop GeoVis applications as a method to evaluate telehealth data. To our knowledge, this is the first study to explore the usefulness and effectiveness of GeoVis to facilitate visual exploration of telehealth data. The results of the research enabled us to develop a framework for the design and development of GeoVis applications related to the areas of public health and especially telehealth. The results of our study showed the variety of users involved with the telehealth program and the tasks that they performed. Further, it enabled us to identify the components that might be essential to include in these GeoVis applications. 
The results of our research showed the following: (a) Telehealth users vary in their level of understanding of GeoVis. (b) Interaction features such as zooming, sorting, linking and multiple views, and representation features such as bar charts and choropleth maps, were considered the most essential features of the GeoVis applications. (c) Comparing and sorting were two important tasks that the telehealth users would perform for exploratory data analysis. (d) An HC GeoVis prototype application is more effective and useful for exploration of telehealth data than a conventional GeoVis application. Future studies should incorporate the proposed HC GeoVis framework to enable comprehensive assessment of the users and the tasks they perform, and so identify the features that might need to be part of the GeoVis applications. The results of this study demonstrate a novel approach to comprehensively and systematically enhance the evaluation of telehealth programs using the proposed GeoVis framework.
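For readers unfamiliar with the System Usability Scale used to assess usefulness, the standard scoring rule can be sketched as follows. The function name `sus_score` and the sample responses are illustrative assumptions; the rule itself (odd items contribute the response minus 1, even items contribute 5 minus the response, and the sum is multiplied by 2.5) is the usual SUS scoring.

```python
def sus_score(responses):
    """Score one SUS questionnaire: ten responses, each from 1 to 5."""
    if len(responses) != 10 or any(r < 1 or r > 5 for r in responses):
        raise ValueError("SUS needs ten responses between 1 and 5")
    total = 0
    for item, response in enumerate(responses, start=1):
        if item % 2 == 1:
            total += response - 1      # positively worded items
        else:
            total += 5 - response      # negatively worded items
    return total * 2.5                 # rescale 0-40 to 0-100

# Hypothetical responses from one participant, NOT data from the study
example = sus_score([4, 2, 5, 1, 4, 2, 4, 1, 5, 2])
```

Scores from the two applications could then be compared per participant, since the same 20 people used both.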
Abstract:
Objectives: This study included two overarching objectives. Through a systematic review of the literature published between 1990 and 2012, the first objective aimed to assess whether insuring the uninsured would result in higher costs compared to insuring the currently insured. Studies that quantified the actual costs associated with insuring the uninsured in the U.S. were included. Based upon 2009 data from the Medical Expenditure Panel Survey (MEPS), the second objective aimed to assess and compare the self-reported health of populations with four different insurance statuses. The second part of this study involved a secondary data analysis of both currently insured and currently uninsured individuals who participated in the MEPS in 2009. The null hypothesis was that there were no differences across the four categories of health insurance status for self-reported health status and healthcare service use. The alternative hypothesis was that there were differences across the four categories of health insurance status for self-reported health status and healthcare service use. Methods: For the systematic review, three databases were searched using search terms to identify studies that actually quantified the cost of insuring the uninsured. Thirteen studies were selected, discussed, and summarized in tables. For the secondary data analysis of MEPS data, this study compared four categories of health insurance status: (1) currently uninsured persons who will become eligible for Medicaid under the Patient Protection and Affordable Care Act (PPACA) healthcare reforms in 2014; (2) currently uninsured persons who will be required to buy private insurance through the PPACA health insurance exchanges in 2014; (3) persons currently insured under Medicaid or SCHIP; and (4) persons currently insured with private insurance. The four categories were compared on the basis of demographic information, health status information, and health conditions with relatively high prevalence. 
Chi-square tests were run to determine if there were differences between the four groups in regard to health insurance status and health status. With some exceptions, the two currently insured groups had worse self-reported health status compared to the two currently uninsured groups. Results: The thirteen studies that met the inclusion criteria for the systematic review included: (1) three cost studies from 1993, 1995, and 1997; (2) four cost studies from 2001, 2003, and 2004; (3) one study of disabilities and one study of immigrants; (4) two state specific studies of uninsured status; and (5) two current studies of healthcare reform. Of the thirteen studies reviewed, four directly addressed the study question about whether insuring the uninsured was more or less expensive than insuring the currently insured. All four of the studies provided support for the study finding that the cost of insuring the uninsured would generally not be higher than insuring those already insured. One study indicated that the cost of insuring the uninsured would be less expensive than insuring the population currently covered by Medicaid, but more expensive to insure than the populations of those covered by employer-sponsored insurance and non-group private insurance. While the nine other studies included in the systematic review discussed the costs associated with insuring the uninsured population, they did not directly compare the costs of insuring the uninsured population with the costs associated with insuring the currently insured population. For the MEPS secondary data analysis, the results of the chi-square tests indicated that there were differences in the distribution of disease status by health insurance status. As anticipated, with some exceptions, the uninsured reported lower rates of disease and healthcare service use. However, for the variable attention deficit disorder, the uninsured reported higher disease rates than the two insured groups. 
Additionally, for the variables high blood pressure, high cholesterol, and joint pain, the currently insured under Medicaid or SCHIP group reported a lower rate of disease than the two currently insured groups. This result may be due to the lower mean age of the currently insured under Medicaid or SCHIP group. Conclusion: Based on this study, with some exceptions, the costs for insuring the uninsured should not exceed healthcare-related costs for insuring the currently insured. The results of the systematic review indicated that the U.S. is already paying some of the costs associated with insuring the uninsured. PPACA will expand health insurance coverage to millions of Americans who are currently uninsured, as the individual mandate and insurance market reforms will require. Because many of the currently uninsured are relatively healthy young persons, the costs associated with expanding insurance coverage to the uninsured are anticipated to be relatively modest. However, when interpreting these results, it is important to note that once individuals obtain insurance, it is anticipated that they will use more healthcare services, which will increase costs. (Abstract shortened by UMI.)
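The chi-square comparisons across the four insurance groups can be illustrated with a minimal sketch of the Pearson statistic for a contingency table. The function name and all counts below are hypothetical, and the sketch ignores the MEPS survey weights, which a real analysis of that survey would normally account for.

```python
def chi_square_independence(table):
    """Return (Pearson chi-square statistic, degrees of freedom) for a 2-D count table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df

# Hypothetical counts, NOT MEPS data.
# Rows: four insurance-status groups; columns: condition reported yes / no.
counts = [
    [30, 170],
    [25, 175],
    [60, 140],
    [55, 145],
]
statistic, df = chi_square_independence(counts)
```

The statistic would be compared with a chi-square distribution on `df` degrees of freedom (3 for a 4x2 table).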
Abstract:
To evaluate the possible reduction of added sodium chloride through the addition of spice oleoresins, a meat batter was prepared using 85 % beef and 15 % fat. It was divided into three batches, to which were added 1000 mg/kg of Origanum x Majoricum oleoresin, or 750 mg/kg of Capsicum annum oleoresin or of Acantholippia seriphioides oleoresin, respectively. Each batch was then divided into five equal portions and salt was added to reach contents of 0.00 %, 0.25 %, 0.50 %, 0.75 % and 1.00 %. The portions were homogenized and formed into 100 g patties, which were oven-cooked to an internal temperature of 72 °C. A sensory evaluation was carried out with 15 semi-trained judges, who were asked to assign scores on structured seven-point scales and to indicate which patty or patties they would reject. The scores assigned by the panel were subjected to exploratory data analysis, Page's test and multiple-comparison tests. For the salt doses tested, the patties with the three types of oleoresin showed differences (α = 0.05): the 0 % and 1 % doses were the least accepted and the 0.25 % dose was the most accepted. Fifty percent of the judges rejected the patty with 0 % salt, for all three types of oleoresin. Under the conditions tested, incorporating oleoresins at doses of 1000 mg/kg for oregano or 750 mg/kg for pepper and tomillo mendocino makes it possible to formulate beef patties with low salt content and high acceptance.
Abstract:
New tomato cultivars, in colors other than the traditional red, are suited to making alternative products such as preserves. Consumer acceptability was studied for jams made from the varieties Victoria FCA, Don Armando FCA and Santa Rosa FCA. Their fruits (yellow, orange and red, respectively) were characterized by color, weight, acidity (titratable and pH), and soluble solids. The jams, flavored with clove, were prepared in an experimental plant to a concentration of 67-69 % soluble solids. A panel of 39 consumers, classified as under or over 30 years of age, evaluated appearance, color, aroma, texture and flavor using unstructured scales. The evaluations of the two groups differed. For all sensory characteristics, the Friedman test indicated differences among the three products (α = 0.001). On a five-category scale, more than 50 % of the judges placed the three jams in the highest categories: "like" and "like very much". Analysis of the categorical preference data ranked the red variety first, followed by the orange and the yellow. There may be a segment of consumers interested in the development of yellow tomato preserves, but in the specific case of jam, the product with a color equal or similar to the traditional one had greater acceptability.
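As an illustration of the Friedman test applied to panel scores, the sketch below ranks the products within each judge and computes the usual statistic. The function names and the scores are hypothetical, and the sketch uses the basic formula without a tie correction, which is an assumption rather than a description of the authors' analysis.

```python
def average_ranks(values):
    """Rank values from 1 (smallest), giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        avg = (pos + end) / 2 + 1
        for p in range(pos, end + 1):
            ranks[order[p]] = avg
        pos = end + 1
    return ranks

def friedman_statistic(scores):
    """scores[judge][product] -> Friedman chi-square statistic (no tie correction)."""
    n, k = len(scores), len(scores[0])
    rank_sums = [0.0] * k
    for judge_scores in scores:
        for j, r in enumerate(average_ranks(judge_scores)):
            rank_sums[j] += r
    return 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)

# Hypothetical scores for three jams from four judges, NOT data from the study
scores = [
    [6.1, 7.0, 8.2],
    [5.5, 6.8, 7.9],
    [6.0, 6.4, 8.8],
    [7.2, 6.9, 8.1],
]
q = friedman_statistic(scores)
```

The statistic is referred to a chi-square distribution with k - 1 degrees of freedom, here 2 for three products.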
Abstract:
This paper attempts to identify a pathway out of poverty over generations in the rural Philippines, based on long-term panel data spanning nearly a quarter of a century. Specifically, it sequentially examines the determinants of schooling, subsequent occupational choices, and current non-farm earnings for the same individuals. We found that an initial rise in parental income, brought about by the land reform and the Green Revolution, among other things, improves the schooling of children, which later allows them to obtain remunerative non-farm jobs. These results suggest that increased agricultural income, improved human capital through schooling and the development of non-farm sectors are the keys to reducing poverty in the long run. It must also be pointed out that the recent development of the rural non-farm sector offers ample employment opportunities for the less educated, which has also contributed significantly to poverty reduction.
Abstract:
This paper analyzes the operational behavior of and technical progress among Philippine domestic banks, using micro-level data on individual banks. First, we summarize their major business activities and gain insight into how the structure is changing. Then, we formally estimate the cost function of Philippine domestic banks using panel data covering a seven-year period (1990-96). The presence of economies of scale and economies of scope is investigated, and technical progress in the banking industry is measured. In addition, the results of the analysis for the Philippines are compared with those of similar studies on Thailand previously conducted by the author.
Abstract:
International production fragmentation has been a global trend for decades, becoming especially important in Asia, where the manufacturing process is fragmented into stages and dispersed around the region. This paper examines the effects of input and output tariff reductions on labor demand elasticities at the firm level. For this purpose, we consider a simple heterogeneous firm model in which firms are allowed to export their products and to use imported intermediate inputs. The model predicts that only productive firms can use imported intermediate inputs (outsourcing) and that these firms tend to have larger constant-output labor demand elasticities. Input tariff reductions would lower the factor shares of labor for these productive firms and raise conditional labor demand elasticities further. We test these predictions empirically by constructing Chinese firm-level panel data over the 2000-2006 period. Controlling for potential tariff endogeneity with instruments, our empirical results generally support these predictions.
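The link between labor's factor share and the constant-output (conditional) labor demand elasticity can be seen in the standard two-input textbook case. This is an illustrative formula under that assumption, not the paper's own heterogeneous firm model; $s_L$ denotes labor's share of total cost and $\sigma$ the elasticity of substitution between labor and the other input.

```latex
\eta_{LL} \;=\; \left.\frac{\partial \ln L}{\partial \ln w}\right|_{\text{output constant}}
\;=\; -\,(1 - s_L)\,\sigma
```

Holding $\sigma$ fixed, a lower labor share $s_L$ makes $|\eta_{LL}|$ larger, which is the direction of the effect the abstract attributes to input tariff reductions.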
Abstract:
We estimate the economic impacts of irrigation using a panel data set from rural Thailand. We employed difference-in-differences estimation and showed that tertiary irrigation has unexpected impacts. Contrary to local experts' predictions that it should have substantial productivity impacts because it allows farmers better water control, we found largely zero profitability impacts. Another unexpected finding is that, while profitability is not affected, we see an increase in cultivation probability with the construction of tertiary canals. This is observed in both wet and dry seasons. This finding suggests that Thai farmers are willing to expand their scale of operation once they get water.
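The difference-in-differences idea can be sketched in its simplest two-group, two-period form. The function name and the profit figures are hypothetical; the paper's actual specification (covariates, fixed effects, standard errors) is not described in the abstract and is not reproduced here.

```python
from statistics import mean

def did_estimate(treated_before, treated_after, control_before, control_after):
    """Change in the treated group minus change in the control group."""
    treated_change = mean(treated_after) - mean(treated_before)
    control_change = mean(control_after) - mean(control_before)
    return treated_change - control_change

# Hypothetical plot-level profits, NOT data from the Thai panel
effect = did_estimate(
    treated_before=[10.0, 12.0],   # plots that later receive a tertiary canal
    treated_after=[15.0, 17.0],
    control_before=[9.0, 11.0],    # plots that never receive one
    control_after=[12.0, 14.0],
)
```

The estimate is interpretable as a causal effect only under the parallel-trends assumption, that is, if treated and control plots would have changed by the same amount without the canal.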
Abstract:
An important competence of human data analysts is to interpret and explain the meaning of the results of data analysis to end-users. However, existing automatic solutions for intelligent data analysis provide limited help in interpreting and communicating information to non-expert users. In this paper we present a general approach to generating explanatory descriptions of the meaning of quantitative sensor data. We propose a type of web application: a virtual newspaper with automatically generated news stories that describe the meaning of sensor data. This solution integrates a variety of techniques from intelligent data analysis into a web-based multimedia presentation system. We validated our approach on a real-world problem and demonstrate its generality using data sets from several domains. Our experience shows that this solution can facilitate the use of sensor data by general users and, therefore, can increase the utility of sensor network infrastructures.
Abstract:
Water scarcity is becoming a major concern in many parts of the world. Population growth, increasing needs for food production, socio-economic development and climate change represent pressures on water resources that many countries around the world will have to deal with in the coming years. The Mediterranean region is one of the most water-scarce regions of the world and is considered a climate change hotspot. Most projections of climate change envisage an increase in temperatures and a decrease in precipitation, with growing water scarcity as a consequence of both reduced water availability and increased irrigation demands. Current policy development processes require the integration of climate change concerns into sectoral policies. However, sector-oriented studies often fail to address all the dimensions of climate change implications. 
Climate change research in recent years has shown the need for more integrated studies and methodologies that are capable of addressing the multi-scale and multi-dimensional nature of climate change. This research attempts to provide a comprehensive view of water scarcity and climate change impacts, vulnerability and adaptation in Mediterranean contexts. It presents an integrated modelling framework that is progressively enlarged in a sequential multi-scale process in which a new dimension of climate change and water resources is addressed at every stage. It comprises four stages, each one explained in a different chapter. The first stage explores farm-level economic vulnerability in the Spanish Guadiana basin using a mathematical programming model in combination with an econometric model. Then, in a second stage, the use of a hydro-economic modelling framework that includes a crop growth model allows for the analysis of crop, farm and basin level processes, taking into account different geographical and decision-making scales. This integrated tool is used for the analysis of climate change scenarios and for the assessment of potential adaptation options. The third stage includes the analysis of barriers to the effective implementation of adaptation processes based on socio-institutional network analysis. Finally, a regional and country level perspective of water scarcity and climate change is provided, focusing on different possible socio-economic development pathways and the effect of policies on future water scarcity. For this analysis, a panel-data econometric model and a hydro-economic model are applied to the Mediterranean region and to country level case studies in Spain and Jordan. The overall results of the study demonstrate the value of considering multiple scales and multiple dimensions in water management and climate change adaptation in the Mediterranean water-scarce contexts analysed. 
Results show that climate change impacts in the Guadiana basin and in Spain may compromise the sustainability of irrigation systems and ecosystems. The analysis at the basin level highlights the prominent role of interactions between different water users and irrigation districts and the need to strengthen institutional capacity and common understanding in the basin to enhance the implementation of adaptation processes. The results of this research also illustrate the relevance of water policies in achieving sustainable development and climate change adaptation in water-scarce areas such as the Mediterranean region. Specifically, the EU Water Framework Directive emerges as a powerful trigger for climate change adaptation. However, in Jordan, more ambitious sustainable development strategies are required in addition to climate change adaptation to reduce the future risk of water scarcity.
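Since the abstract mentions a panel-data econometric model without giving its form, the following is only a generic sketch of one common choice, the fixed-effects (within) estimator with a single regressor. The function name, variable roles and numbers are all hypothetical and should not be read as the thesis's specification.

```python
from collections import defaultdict

def within_estimator(entity_ids, x, y):
    """Fixed-effects slope: regress entity-demeaned y on entity-demeaned x."""
    groups = defaultdict(list)
    for entity, xi, yi in zip(entity_ids, x, y):
        groups[entity].append((xi, yi))
    sxy = sxx = 0.0
    for observations in groups.values():
        x_bar = sum(xi for xi, _ in observations) / len(observations)
        y_bar = sum(yi for _, yi in observations) / len(observations)
        for xi, yi in observations:
            # Demeaning removes each entity's time-invariant intercept
            sxy += (xi - x_bar) * (yi - y_bar)
            sxx += (xi - x_bar) ** 2
    return sxy / sxx

# Hypothetical panel: two countries observed over three periods each
countries = ["A", "A", "A", "B", "B", "B"]
x = [1.0, 2.0, 3.0, 2.0, 4.0, 6.0]
y = [12.0, 14.0, 16.0, -1.0, 3.0, 7.0]   # y = country intercept + 2 * x
slope = within_estimator(countries, x, y)
```

Because each country's own mean is subtracted, differences in level between countries do not affect the estimated slope.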
Abstract:
We can say without hesitation that in energy markets a thorough data analysis is crucial when designing sophisticated models that are able to capture most of the critical market drivers. In this study we investigate the structure of Spanish natural gas prices, to improve understanding of the role they play in determining electricity prices and to inform future decisions about price modelling. To further understand the potential for modelling, this study focuses on the nature and characteristics of the different gas price data available. Because the existing gas market in Spain lacks sufficient trading liquidity, it is even more critical to analyze the available gas price data in detail, since it will ultimately provide the information needed to understand how electricity prices are affected by natural gas markets. Representative Spanish gas prices are typically difficult to explore because there is not yet a transparent gas market and all the gas imported into the country is negotiated and purchased by private companies on confidential terms.