Biblioteca Digital

905 resultados para LONGITUDINAL DATA-ANALYSIS

Parametric analysis of discrepant sets of data

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper a new parametric method to deal with discrepant experimental results is developed. The method is based on the fit of a probability density function to the data. This paper also compares the characteristics of different methods used to deduce recommended values and uncertainties from a discrepant set of experimental data. The methods are applied to the (137)Cs and (90)Sr published half-lives and special emphasis is given to the deduced confidence intervals. The obtained results are analyzed considering two fundamental properties expected from an experimental result: the probability content of confidence intervals and the statistical consistency between different recommended values. The recommended values and uncertainties for the (137)Cs and (90)Sr half-lives are 10,984 (24) days and 10,523 (70) days, respectively. (C) 2009 Elsevier B.V. All rights reserved.

Leverage analysis for linear mixed models

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We consider a generalized leverage matrix useful for the identification of influential units and observations in linear mixed models and show how a decomposition of this matrix may be employed to identify high leverage points for both the marginal fitted values and the random effect component of the conditional fitted values. We illustrate the different uses of the two components of the decomposition with a simulated example as well as with a real data set.

Bayesian Analysis for Nonlinear Regression Model Under Skewed Errors, with Application in Growth Curves

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We have considered a Bayesian approach for the nonlinear regression model by replacing the normal distribution on the error term by some skewed distributions, which account for both skewness and heavy tails or skewness alone. The type of data considered in this paper concerns repeated measurements taken in time on a set of individuals. Such multiple observations on the same individual generally produce serially correlated outcomes. Thus, additionally, our model does allow for a correlation between observations made from the same individual. We have illustrated the procedure using a data set to study the growth curves of a clinic measurement of a group of pregnant women from an obstetrics clinic in Santiago, Chile. Parameter estimation and prediction were carried out using appropriate posterior simulation schemes based in Markov Chain Monte Carlo methods. Besides the deviance information criterion (DIC) and the conditional predictive ordinate (CPO), we suggest the use of proper scoring rules based on the posterior predictive distribution for comparing models. For our data set, all these criteria chose the skew-t model as the best model for the errors. These DIC and CPO criteria are also validated, for the model proposed here, through a simulation study. As a conclusion of this study, the DIC criterion is not trustful for this kind of complex model.

A competitividade internacional de firmas transnacionais na nova geografia econômica das nações em processo de integração político-econômica regional: uma análise longitudinal de empresas globais em macrorregiões supranacionais

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Devido ao fenômeno da globalização, alguns aspectos em Economia Internacional e Política têm sido amplamente debatidos entre os estudiosos de Negócios Internacionais, dada a extensão do seu impacto sobre a competitividade operacional e estratégica das empresas multinacionais. Em conjunto com a realidade mais regional da maioria dos acordos preferenciais de comércio e de investimentos – que promovem uma integração regional mais profunda dos mercados, ao contrário do teórico mercado "global" – as abordagens teóricas mais globalizantes em estratégia de negócios internacionais têm sido mais questionadas. Enquanto alguns estudiosos, como Pankaj Ghemawat, (2007), propõem abordagens para a chamada "semi-globalização"; outros, por exemplo, com Alan Rugman e Alain Verbeke (inter alias 2004, 2007), por outro lado, defendem abordagens regionalistas mais restritas em negócios internacionais e estratégia de empresas internacionais. Tais posições sobre o desempenho das empresas transnacionais, no entanto, não foram amplamente testadas, deixando, assim, outras questões relevantes sem soluções. Assim sendo, identificou-se um espaço na literatura quanto à questão de as regiões – em vez de países individualmente considerados – serem ou não relevantes ao desempenho global das empresas multinacionais e em que medida. Nesse sentido, foi utilizada uma metodologia quantitativa longitudinal a fim de avaliar a evidência histórica da repercussão de presença em regiões selecionadas e/ou países sobre o desempenho das empresas transnacionais. Foram coletados dados no Compustat Global (2009) com vistas a uma análise econométrica de dados em painel. Nossos resultados consistem, brevemente, em três aspectos. Em primeiro lugar, quando ambas as variáveis (país e região) são simultaneamente consideradas influentes sobre o desempenho, existe significância estatística. Em segundo lugar, ao contrastar ambas as variáveis (país e região) entre si, em relação ao maior nível de impacto no desempenho, ainda que tenhamos encontrado relevância estatística para os países individualmente considerados, suspeitou-se de algum problema nos dados brutos. Em terceiro lugar, ao assumir uma correlação positiva entre o desempenho da empresa multinacional e do número de regiões geográficas onde essas corporações possuem operações significativas, foi encontrada significância estatística. Nossa conclusão, portanto, consiste no fato de que, dado que a maioria dos países são signatários de pelo menos um acordo de integração regional, as regiões devem ser o foco principal dos negócios internacionais e estratégia corporativa internacional, tanto nos propósitos teóricos (tendo em vista as conclusões desta pesquisa e a literatura sobre o assunto), quanto nos aspectos práticos (em vez de da customização da gestão e da estratégia para cada país individual).

Poverty dynamics in Costa Rica with panel data from cross-sections

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Includes bibliography

Semi-supervised dimensionality reduction based on partial least squares for visual analysis of high dimensional data

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Dimensionality reduction is employed for visual data analysis as a way to obtaining reduced spaces for high dimensional data or to mapping data directly into 2D or 3D spaces. Although techniques have evolved to improve data segregation on reduced or visual spaces, they have limited capabilities for adjusting the results according to user's knowledge. In this paper, we propose a novel approach to handling both dimensionality reduction and visualization of high dimensional data, taking into account user's input. It employs Partial Least Squares (PLS), a statistical tool to perform retrieval of latent spaces focusing on the discriminability of the data. The method employs a training set for building a highly precise model that can then be applied to a much larger data set very effectively. The reduced data set can be exhibited using various existing visualization techniques. The training data is important to code user's knowledge into the loop. However, this work also devises a strategy for calculating PLS reduced spaces when no training data is available. The approach produces increasingly precise visual mappings as the user feeds back his or her knowledge and is capable of working with small and unbalanced training sets.

Institutional Processes and Discursive Strategies: Rhetoric and Vocabulary Analysis of CSR and Sustainability

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The candidate tackled an important issue in contemporary management: the role of CSR and Sustainability. The research proposal focused on a longitudinal and inductive research, directed to specify the evolution of CSR and contribute to the new institutional theory, in particular institutional work framework, and to the relation between institutions and discourse analysis. The documental analysis covers all the evolution of CSR, focusing also on a number of important networks and associations. Some of the methodologies employed in the thesis have been employed as a consequence of data analysis, in a truly inductive research process. The thesis is composed by two section. The first section mainly describes the research process and the analyses results. The candidates employed several research methods: a longitudinal content analysis of documents, a vocabulary research with statistical metrics as cluster analysis and factor analysis, a rhetorical analysis of justifications. The second section puts in relation the analysis results with theoretical frameworks and contributions. The candidate confronted with several frameworks: Actor-Network-Theory, Institutional work and Boundary Work, Institutional Logic. Chapters are focused on different issues: a historical reconstruction of CSR; a reflection about symbolic adoption of recurrent labels; two case studies of Italian networks, in order to confront institutional and boundary works; a theoretical model of institutional change based on contradiction and institutional complexity; the application of the model to CSR and Sustainability, proposing Sustainability as a possible institutional logic.

The Effects of Airbags and Seatbelts on Occupant Injury in Longitudinal Barrier Crashes

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Introduction: Longitudinal barriers, such as guardrails, are designed to prevent a vehicle that leaves the roadway from impacting a more dangerous object while minimizing the risk of injury to the vehicle occupants. Current full-scale test procedures for these devices do not consider the effect of occupant restraints such as seatbelts and airbags. The purpose of this study was to determine the extent to which restraints are used or deployed in longitudinal barrier collisions and their subsequent effect on occupant injury. Methods: Binary logistic regression models were generated to predict occupant injury risk using data from the National Automotive Sampling System / Crashworthiness Data System from 1997 through 2007. Results: In tow-away longitudinal barrier crashes, airbag deployment rates were 70% for airbag-equipped vehicles. Compared with unbelted occupants without an airbag available, seat belt restrained occupants with an airbag available had a dramatically decreased risk of receiving a serious (MAIS 3+) injury (odds-ratio (OR)=0.03; 95% CI: 0.004- 0.24). A similar decrease was observed among those restrained by seat belts, but without an airbag available (OR=0.03; 95% CI: 0.001- 0.79). No significant differences in risk of serious injuries were observed between unbelted occupants with an airbag available compared with unbelted occupants without an airbag available (OR=0.53; 95% CI=0.10-2.68). Impact on Industry: This study refutes the perception in the roadside safety community that airbags rarely deploy in frontal barrier crashes, and suggests that current longitudinal barrier occupant risk criteria may over-estimate injury potential for restrained occupants involved in a longitudinal barrier crash.

Feature-Specific Penalized Latent Class Analysis for Genomic Data

Relevância:

100.00% 100.00%

Publicador:

Systemic analysis of microarray data sets towards identifying genes regulating root development

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Nitrogen and water are essential for plant growth and development. In this study, we designed experiments to produce gene expression data of poplar roots under nitrogen starvation and water deprivation conditions. We found low concentration of nitrogen led first to increased root elongation followed by lateral root proliferation and eventually increased root biomass. To identify genes regulating root growth and development under nitrogen starvation and water deprivation, we designed a series of data analysis procedures, through which, we have successfully identified biologically important genes. Differentially Expressed Genes (DEGs) analysis identified the genes that are differentially expressed under nitrogen starvation or drought. Protein domain enrichment analysis identified enriched themes (in same domains) that are highly interactive during the treatment. Gene Ontology (GO) enrichment analysis allowed us to identify biological process changed during nitrogen starvation. Based on the above analyses, we examined the local Gene Regulatory Network (GRN) and identified a number of transcription factors. After testing, one of them is a high hierarchically ranked transcription factor that affects root growth under nitrogen starvation. It is very tedious and time-consuming to analyze gene expression data. To avoid doing analysis manually, we attempt to automate a computational pipeline that now can be used for identification of DEGs and protein domain analysis in a single run. It is implemented in scripts of Perl and R.

From big smartphone data to worldwide research: The Mobile Data Challenge

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents an overview of the Mobile Data Challenge (MDC), a large-scale research initiative aimed at generating innovations around smartphone-based research, as well as community-based evaluation of mobile data analysis methodologies. First, we review the Lausanne Data Collection Campaign (LDCC), an initiative to collect unique longitudinal smartphone dataset for the MDC. Then, we introduce the Open and Dedicated Tracks of the MDC, describe the specific datasets used in each of them, discuss the key design and implementation aspects introduced in order to generate privacy-preserving and scientifically relevant mobile data resources for wider use by the research community, and summarize the main research trends found among the 100+ challenge submissions. We finalize by discussing the main lessons learned from the participation of several hundred researchers worldwide in the MDC Tracks.

Streaming Support for Data Intensive Cloud-Based Sequence Analysis

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.

A longitudinal comparison of cardiovascular disease risk factors among children and adolescents in the U.S. and Japan: Project HeartBeat!

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Blood cholesterol and blood pressure development in childhood and adolescence have important impact on the future adult level of cholesterol and blood pressure, and on increased risk of cardiovascular diseases. The U.S. has higher mortality rates of coronary heart diseases than Japan. A longitudinal comparison in children of risk factor development in the two countries provides more understanding about the causes of cardiovascular disease and its prevention. Such comparisons have not been reported in the past. ^ In Project HeartBeat!, 506 non-Hispanic white, 136 black and 369 Japanese children participated in the study in the U.S. and Japan from 1991 to 1995. A synthetic cohort of ages 8 to 18 years was composed by three cohorts with starting ages at 8, 11, and 14. A multilevel regression model was used for data analysis. ^ The study revealed that the Japanese children had significantly higher slopes of mean total cholesterol (TC) and high density lipoprotein (HDL) cholesterol levels than the U.S. children after adjusting for age and sex. The mean TC level of Japanese children was not significantly different from white and black children. The mean HDL level of Japanese children was significantly higher than white and black children after adjusting for age and sex. The ratio of HDL/TC in Japanese children was significantly higher than in U.S. whites, but not significantly different from the black children. The Japanese group had significantly lower mean diastolic blood pressure phase IV (DBP4) and phase V (DBP5) than the two U.S. groups. The Japanese group also showed significantly higher slopes in systolic blood pressure, DBP5 and DBP4 during the study period than both U.S. groups. The differences were independent from height and body mass index. ^ The study provided the first longitudinal comparison of blood cholesterol and blood pressure between the U.S. and Japanese children and adolescents. It revealed the dynamic process of these factors in the three ethnic groups. ^

Quality Data is Key to Improving Education

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Data Quality Campaign (DQC) has been focused since 2005 on advocating for states to build robust state longitudinal data systems (SLDS). While states have made great progress in their data infrastructure, and should continue to emphasize this work, t data systems alone will not improve outcomes. It is time for both DQC and states to focus on building capacity to use the information that these systems are producing at every level – from classrooms to state houses. To impact system performance and student achievement, the ingrained culture must be replaced with one that focuses on data use for continuous improvement. The effective use of data to inform decisions, provide transparency, improve the measurement of outcomes, and fuel continuous improvement will not come to fruition unless there is a system wide focus on building capacity around the collection, analysis, dissemination, and use of this data, including through research.

New methods for quantification and analysis of quantitative real-time polymerase chain reaction data

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Quantitative real-time polymerase chain reaction (qPCR) is a sensitive gene quantitation method that has been widely used in the biological and biomedical fields. The currently used methods for PCR data analysis, including the threshold cycle (CT) method, linear and non-linear model fitting methods, all require subtracting background fluorescence. However, the removal of background fluorescence is usually inaccurate, and therefore can distort results. Here, we propose a new method, the taking-difference linear regression method, to overcome this limitation. Briefly, for each two consecutive PCR cycles, we subtracted the fluorescence in the former cycle from that in the later cycle, transforming the n cycle raw data into n-1 cycle data. Then linear regression was applied to the natural logarithm of the transformed data. Finally, amplification efficiencies and the initial DNA molecular numbers were calculated for each PCR run. To evaluate this new method, we compared it in terms of accuracy and precision with the original linear regression method with three background corrections, being the mean of cycles 1-3, the mean of cycles 3-7, and the minimum. Three criteria, including threshold identification, max R2, and max slope, were employed to search for target data points. Considering that PCR data are time series data, we also applied linear mixed models. Collectively, when the threshold identification criterion was applied and when the linear mixed model was adopted, the taking-difference linear regression method was superior as it gave an accurate estimation of initial DNA amount and a reasonable estimation of PCR amplification efficiencies. When the criteria of max R2 and max slope were used, the original linear regression method gave an accurate estimation of initial DNA amount. Overall, the taking-difference linear regression method avoids the error in subtracting an unknown background and thus it is theoretically more accurate and reliable. This method is easy to perform and the taking-difference strategy can be extended to all current methods for qPCR data analysis.^

«
1
2
...
22
23
24
25
26
27
28
...
60
61
»