139 results for Data migration


Relevance:

20.00%

Publisher:

Abstract:

Geographic Data Warehouses (GDW) are one of the main technologies used in decision-making processes and spatial analysis, and the literature proposes several conceptual and logical data models for GDW. However, little effort has been devoted to studying how spatial data redundancy affects SOLAP (Spatial On-Line Analytical Processing) query performance over GDW. In this paper, we investigate this issue. First, we compare redundant and non-redundant GDW schemas and conclude that redundancy is associated with high performance losses. We also analyze indexing as a means of improving SOLAP query performance on a redundant GDW. Comparisons of the SB-index approach, the star-join aided by R-tree, and the star-join aided by GiST indicate that the SB-index reduces elapsed query-processing time by 25% to 99% for SOLAP queries defined over the spatial predicates of intersection, enclosure, and containment and applied to roll-up and drill-down operations. We also investigate the impact of increasing data volume on performance. The increase did not impair the performance of the SB-index, which still greatly improved the elapsed time in query processing. Performance tests also show that the SB-index is far more compact than the star-join, requiring at most 0.20% of its volume. Moreover, we propose a specific enhancement of the SB-index to deal with spatial data redundancy. This enhancement improved performance by 80% to 91% for redundant GDW schemas.
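The three spatial predicates named in this abstract can be illustrated on axis-aligned bounding rectangles. The sketch below is only an illustration under stated assumptions: the `Rect` type and the function names are hypothetical, the reading of "containment" as "the query window contains the object" and of "enclosure" as "the object encloses the query window" is an assumption, and the code shows the predicates alone, not the SB-index or any star-join.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle (hypothetical stand-in for an object's bounding box)."""
    xmin: float
    ymin: float
    xmax: float
    ymax: float


def intersects(window: Rect, obj: Rect) -> bool:
    # True when the two rectangles share at least one point.
    return (window.xmin <= obj.xmax and obj.xmin <= window.xmax
            and window.ymin <= obj.ymax and obj.ymin <= window.ymax)


def contains(window: Rect, obj: Rect) -> bool:
    # Assumed meaning of "containment": the window fully contains the object.
    return (window.xmin <= obj.xmin and obj.xmax <= window.xmax
            and window.ymin <= obj.ymin and obj.ymax <= window.ymax)


def encloses(window: Rect, obj: Rect) -> bool:
    # Assumed meaning of "enclosure": the object fully encloses the window.
    return contains(obj, window)


# Made-up geometries for illustration only.
window = Rect(0, 0, 10, 10)
city = Rect(2, 2, 4, 4)
region = Rect(-5, -5, 20, 20)
road = Rect(8, 8, 15, 9)
```

Here `city` lies inside the window, `road` only overlaps it, and `region` surrounds it, so each predicate selects a different subset of objects.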

Relevance:

20.00%

Publisher:

Abstract:

Due to the imprecise nature of biological experiments, biological data are often characterized by redundancy and noise. This may be due to errors that occurred during data collection, such as contamination of laboratory samples. This is the case for gene expression data, where the equipment and tools currently used frequently produce noisy biological data. Machine Learning algorithms have been successfully used in gene expression data analysis. Although many Machine Learning algorithms can deal with noise, detecting and removing noisy instances from the training data set can help the induction of the target hypothesis. This paper evaluates the use of distance-based pre-processing techniques for noise detection in gene expression data classification problems. This evaluation analyzes the effectiveness of the techniques investigated in removing noisy data, measured by the accuracy obtained by different Machine Learning classifiers over the pre-processed data.
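The abstract does not say which distance-based techniques were evaluated. As one possible illustration, the sketch below removes every instance whose label disagrees with the majority label of its k nearest neighbours (an edited-nearest-neighbour style filter). The function name `knn_noise_filter`, the Euclidean distance, and the toy data are assumptions made for this example.

```python
import math
from collections import Counter


def knn_noise_filter(X, y, k=3):
    """Return indices of instances whose label matches the majority of their k nearest neighbours."""
    keep = []
    for i, xi in enumerate(X):
        # Euclidean distances from instance i to every other instance.
        dists = sorted(
            (math.dist(xi, xj), j) for j, xj in enumerate(X) if j != i
        )
        neighbour_labels = [y[j] for _, j in dists[:k]]
        majority_label, _ = Counter(neighbour_labels).most_common(1)[0]
        if majority_label == y[i]:
            keep.append(i)
    return keep


# Toy data: two clusters, with one "b" label planted inside the "a" cluster.
X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.15, 0.15),
     (5.0, 5.0), (5.1, 5.2), (5.2, 5.1)]
y = ["a", "a", "a", "b", "b", "b", "b"]
kept = knn_noise_filter(X, y, k=3)
```

On this toy set the filter drops only the planted instance at index 3. A classifier would then be trained on the retained instances.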

Relevance:

20.00%

Publisher:

Abstract:

OBJECTIVE: To estimate the spatial intensity of urban violence events using wavelet-based methods and emergency room data. METHODS: Information on victims attended at the emergency room of a public hospital in the city of São Paulo, Southeastern Brazil, from January 1, 2002 to January 11, 2003 was obtained from hospital records. The spatial distribution of 3,540 events was recorded, and a uniform random procedure was used to allocate records with incomplete addresses. Point processes and wavelet analysis techniques were used to estimate the spatial intensity, defined as the expected number of events per unit area. RESULTS: Of all georeferenced points, 59% were accidents and 40% were assaults. The spatial distribution of the events is non-homogeneous, with a high concentration in two districts and three large avenues in the southern area of the city of São Paulo. CONCLUSIONS: Hospital records combined with methodological tools to estimate the intensity of events are useful for studying urban violence. Wavelet analysis is useful for computing the expected number of events and their respective confidence bands for any sub-region and, consequently, for specifying risk estimates that could be used in decision-making processes for public policies.
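The definition of spatial intensity as the expected number of events per unit area can be made concrete with a much simpler estimator than the wavelet method used in the study. The sketch below counts events in square grid cells and divides by the cell area. It is a plain histogram estimator, not a wavelet estimator, and the function name `grid_intensity` and the coordinates are invented for the example.

```python
from collections import Counter


def grid_intensity(points, cell_size):
    """Estimate intensity as event count per unit area in each square grid cell."""
    counts = Counter(
        (int(x // cell_size), int(y // cell_size)) for x, y in points
    )
    cell_area = cell_size * cell_size
    return {cell: n / cell_area for cell, n in counts.items()}


# Made-up event coordinates (for example, kilometres on a local grid).
events = [(0.2, 0.3), (0.8, 0.1), (1.5, 0.5), (0.4, 0.9), (3.2, 3.9)]
intensity = grid_intensity(events, cell_size=2.0)
```

Multiplying each cell's intensity by the cell area and summing recovers the total number of events, which is the property that lets an intensity surface yield expected counts for any sub-region.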

Relevance:

20.00%

Publisher:

Abstract:

This is a case study of a fifteen-year-old adolescent girl, a victim of incest perpetrated by her stepfather, which resulted in her pregnancy and the birth of a child. The main objective is to discuss the adolescent's family reorganization and her silence, and that of her family, regarding the sexual abuse. The research setting was the Social Assistance Reference Center (Centro de Referência em Assistência Social, CRAS) of a city on the urban periphery. The method used was participant observation. Organizing the information made it possible to construct Zones of Meaning that constitute indicators of vulnerability: her relationship with the violence she suffered; her relationship with her family; her relationship with her daughter; and her relationship with school. In this case, silence, isolation, and migration to another city were protective options. The study of pregnancy under these circumstances requires a particularized understanding, distinct from the study of pregnant adolescents in general.

Relevance:

20.00%

Publisher:

Abstract:

River mouths are highly dynamic environments subject to a complex interaction between stabilizing and destabilizing factors. Depending on this interaction, river mouths may tend to migrate along sandy barriers. One of the most efficient mechanisms of alongshore sediment transport, and consequently of channel migration, is the longshore current generated by waves approaching the coast obliquely. The motivation of the present work is to understand the morphodynamic behavior of the Itapocú river mouth system, located in north-central Santa Catarina (SC), in response to the forcing processes that drive its migration along the shoreline. The morphology of the sand spits was obtained from morphological surveys using DGPS. To analyze wave refraction, the numerical model MIKE 21 SW was used, with wave data for the year 2002 and forecast wave data for the survey period as boundary conditions. The model output was used to estimate the potential longshore drift in the region. The morphological results showed a northward migration of the river mouth during the period analyzed, more intense during winter and summer. Waves incident from the southern quadrant underwent more refraction, and waves from the east showed less angular variation as they approached the coast. The annual potential longshore drift for the 2002 wave data was directed from north to south, with a reversal of direction during autumn. Using the forecast wave data for the survey period, the estimated potential longshore drift was directed from south to north, in agreement with the observed migration. In the region near the river mouth, at the sand spits, the potential drift was directed northward during all seasons. River discharge data showed no influence on channel migration, but were seasonally related to channel width. The morphology data, together with the longshore drift data for the 2004/2005 waves, clearly showed the northward migration of the channel, with longshore drift being the main contributor to the migration of the river mouth.

Relevance:

20.00%

Publisher:

Abstract:

The mature larva and pupa of Fulgeochlizus bruchi (Candèze, 1896) are described and illustrated. Bioluminescent patterns are also given. Comments, new data on the first instar larva and natural history data are presented. The first instar larvae differ from the mature larvae mainly in their chaetotaxy, which is sparse and more symmetrically distributed.

Relevance:

20.00%

Publisher:

Abstract:

The objective of this study was to estimate the calibration regressions for the dietary data measured using the quantitative food frequency questionnaire (QFFQ) in the Natural History of HPV Infection in Men: the HIM Study in Brazil. A sample of 98 individuals from the HIM study answered one QFFQ and three 24-hour recalls (24HR) at interviews. The calibration was performed using linear regression analysis in which the 24HR was the dependent variable and the QFFQ was the independent variable. Age, body mass index, physical activity, income, and schooling were used as adjustment variables in the models. The geometric means of the 24HR and the calibration-corrected QFFQ were statistically equal. The scatter plots between the instruments show increased correlation after the correction, although the points are more dispersed where the models have lower explanatory power. Identifying the calibration regressions for the dietary data of the HIM study will make it possible to estimate the effect of diet on HPV infection, corrected for the measurement error of the QFFQ.
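The calibration step described here, regressing the 24HR on the QFFQ and using the fitted line to correct questionnaire values, can be sketched with ordinary least squares. This is a simplified illustration: it leaves out the adjustment variables (age, body mass index, physical activity, income, schooling) and any transformation the authors may have applied, and the helper `fit_line` and all numbers are made up.

```python
def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Made-up intakes (not data from the study), e.g. grams per day.
qffq = [100.0, 200.0, 300.0, 400.0]       # questionnaire values (independent variable)
recall_24h = [90.0, 140.0, 190.0, 240.0]  # mean 24HR values (dependent variable)

intercept, slope = fit_line(qffq, recall_24h)

# Calibration-corrected QFFQ: the 24HR intake predicted from each QFFQ value.
calibrated = [intercept + slope * q for q in qffq]
```

A slope below 1 pulls extreme questionnaire values toward the mean, which is how the correction shrinks the measurement error of the QFFQ.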

Relevance:

20.00%

Publisher:

Abstract:

Information on fruit and vegetable consumption in Brazil at three levels of dietary data was analyzed and compared. Data on national supply came from Food Balance Sheets compiled by the FAO; household availability information was obtained from the Brazilian National Household Budget Survey (HBS); and actual intake information came from a large individual dietary intake survey that was representative of the adult population of São Paulo city. All sources of information were collected between 2002 and 2003. A subset of the HBS, representative of São Paulo city, was used in our analysis to improve the quality of the comparison with actual intake data. The ratio of national supply to household availability of fruits and vegetables was 2.6, while the ratio of national supply to actual intake was 4.0. The discrepancy ratio in the comparison between household availability and actual intake was smaller, 1.6. While the use of supply and availability data has advantages, such as lower cost, it must be taken into account that these sources tend to overestimate actual intake of fruits and vegetables.

Relevance:

20.00%

Publisher:

Abstract:

study-specific results, their findings should be interpreted with caution

Relevance:

20.00%

Publisher:

Abstract:

Diagnostic methods have been an important tool in regression analysis for detecting anomalies in fitted models, such as departures from error assumptions and the presence of outliers and influential observations. Assuming censored data, we considered a classical analysis and a Bayesian analysis with noninformative priors for the parameters of a model with a cure fraction. The Bayesian approach used Markov chain Monte Carlo methods with Metropolis-Hastings algorithm steps to obtain the posterior summaries of interest. Some influence methods, such as the local influence, total local influence of an individual, local influence on predictions, and generalized leverage, were derived, analyzed, and discussed for survival data with a cure fraction and covariates. The relevance of the approach was illustrated with a real data set, where it is shown that removing the most influential observations changes the decision about which model best fits the data.
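For readers unfamiliar with Metropolis-Hastings steps, the sketch below shows a random-walk variant on a one-dimensional toy target. It is not the cure-fraction posterior from the paper: the target (a standard normal), the Gaussian proposal, the step size, and the function names are all assumptions chosen to keep the example short.

```python
import math
import random


def metropolis_hastings(log_target, start, n_samples, step=1.0, seed=0):
    """Random-walk Metropolis-Hastings sampler for a one-dimensional target."""
    rng = random.Random(seed)
    current = start
    current_logp = log_target(current)
    samples = []
    for _ in range(n_samples):
        # Symmetric Gaussian proposal, so the Hastings ratio reduces to the target ratio.
        proposal = current + rng.gauss(0.0, step)
        proposal_logp = log_target(proposal)
        log_accept = proposal_logp - current_logp
        if log_accept >= 0 or rng.random() < math.exp(log_accept):
            current, current_logp = proposal, proposal_logp
        # A rejected proposal repeats the current state in the chain.
        samples.append(current)
    return samples


def log_standard_normal(x):
    # Toy log-density up to an additive constant; not the cure-fraction posterior.
    return -0.5 * x * x


draws = metropolis_hastings(log_standard_normal, start=0.0, n_samples=20000)
burned = draws[2000:]  # discard an initial burn-in segment
posterior_mean = sum(burned) / len(burned)
```

Posterior summaries such as means or quantiles are then computed from the retained draws, as `posterior_mean` is here.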

Relevance:

20.00%

Publisher:

Abstract:

We consider a nontrivial one-species population dynamics model with finite and infinite carrying capacities. Time-dependent intrinsic and extrinsic growth rates are considered in these models. Through the model per capita growth rate we obtain a heuristic general procedure to generate scaling functions to collapse data into a simple linear behavior even if an extrinsic growth rate is included. With this data collapse, all the models studied become independent from the parameters and initial condition. Analytical solutions are found when time-dependent coefficients are considered. These solutions allow us to perceive nontrivial transitions between species extinction and survival and to calculate the transition's critical exponents. Considering an extrinsic growth rate as a cancer treatment, we show that the relevant quantity depends not only on the intensity of the treatment, but also on when the cancerous cell growth is maximum.
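The idea of collapsing data onto a single line can be illustrated with the simplest finite-carrying-capacity case. The sketch assumes the Verhulst logistic model with constant rate r, carrying capacity K, and initial size N0, which may be simpler than the models in the paper. Its solution satisfies ln[(K/N - 1)/(K/N0 - 1)] = -r t, so plotting that quantity against r t gives the same line for every parameter choice. The function names and parameter values are hypothetical.

```python
import math


def logistic(t, r, K, N0):
    """Closed-form solution of dN/dt = r * N * (1 - N / K)."""
    return K / (1.0 + (K / N0 - 1.0) * math.exp(-r * t))


def collapse(N, K, N0):
    """Scaling function that maps the logistic curve onto the line y = -r * t."""
    return math.log((K / N - 1.0) / (K / N0 - 1.0))


# Two hypothetical parameter sets (r, K, N0) with very different growth curves.
parameter_sets = [(0.5, 100.0, 5.0), (2.0, 1e4, 10.0)]
times = [0.5, 1.0, 2.0]

# Pairs (r * t, collapsed value); every pair should satisfy y = -(r * t).
collapsed = [
    (r * t, collapse(logistic(t, r, K, N0), K, N0))
    for r, K, N0 in parameter_sets
    for t in times
]
```

After the rescaling, both parameter sets fall on the same straight line, which is what independence from the parameters and the initial condition means in this simple case.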

Relevance:

20.00%

Publisher:

Abstract:

Introduction: The successful integration of stem cells in adult brain has become a central issue in modern neuroscience. In this study we sought to test the hypothesis that survival and neurodifferentiation of mesenchymal stem cells (MSCs) may be dependent upon microenvironmental conditions according to the site of implant in the brain. Methods: MSCs were isolated from adult rats and labeled with enhanced-green fluorescent protein (eGFP) lentivirus. A cell suspension was implanted stereotactically into the brain of 50 young rats, into one neurogenic area (hippocampus), and into another nonneurogenic area (striatum). Animals were sacrificed 6 or 12 weeks after surgery, and brains were stained for mature neuronal markers. Cells coexpressing NeuN (neuronal specific nuclear protein) and GFP (green fluorescent protein) were counted stereologically at both targets. Results: The isolated cell population was able to generate neurons positive for microtubule-associated protein 2 (MAP2), neuronal-specific nuclear protein (NeuN), and neurofilament 200 (NF200) in vitro. Electrophysiology confirmed expression of voltage-gated ionic channels. Once implanted into the hippocampus, cells survived for up to 12 weeks, migrated away from the graft, and gave rise to mature neurons able to synthesize neurotransmitters. By contrast, massive cell degeneration was seen in the striatum, with no significant migration. Induction of neuronal differentiation with increased cyclic adenosine monophosphate in the culture medium before implantation favored differentiation in vivo. Conclusions: Our data demonstrated that survival and differentiation of MSCs is strongly dependent upon a permissive microenvironment. Identification of the pro-neurogenic factors present in the hippocampus could subsequently allow for the integration of stem cells into nonpermissive areas of the central nervous system.

Relevance:

20.00%

Publisher:

Abstract:

Background: The vascular endothelial growth factor (VEGF) is a major promoter of endothelial growth and migration. Some studies have shown a correlation between expression of this growth factor and prognosis in several cancers, including well-differentiated thyroid cancer. Aim: We studied VEGF expression, local invasiveness, and other prognostic factors in papillary thyroid carcinoma (PTC) to test the hypothesis that the expression of VEGF is correlated with the degree of invasion of PTC. Patients and Methods: Clinical and pathological data of 76 patients with PTC were retrospectively reviewed. Group 1 consisted of patients with gross locally invasive tumors, group 2 consisted of patients with only invasion of the thyroid capsule, and group 3 consisted of patients with noninvasive PTC. Results: VEGF expression was noted within the tumor in all groups of PTC patients but was absent in the surrounding normal tissue. Older patients had higher expression of VEGF than younger patients. The age of patients with strong reaction to VEGF was 46 +/- 14 (mean +/- standard deviation), and that in patients with a weaker reaction was 39 +/- 16 (p<0.05). Only 20% of patients with a follicular variant of PTC had a strong reaction to VEGF compared with 68% of patients with classical PTC (p<0.01). Conclusions: VEGF expression appears to be an early event in the development of PTC. Whether VEGF expression promotes the progression of PTC is not known, but the answer to this question may be important in view of its greater expression in older patients, a group whose prognosis in PTC is worse.

Relevance:

20.00%

Publisher:

Abstract:

Background: The inherent complexity of statistical methods and clinical phenomena compels researchers with diverse domains of expertise to work in interdisciplinary teams, where none of them has complete knowledge of their counterparts' fields. As a result, knowledge exchange may often be characterized by miscommunication leading to misinterpretation, ultimately resulting in errors in research and even clinical practice. Although communication has a central role in interdisciplinary collaboration and miscommunication can have a negative impact on research processes, to the best of our knowledge, no study has yet explored how data analysis specialists and clinical researchers communicate over time. Methods/Principal Findings: We conducted a qualitative analysis of encounters between clinical researchers and data analysis specialists (epidemiologist, clinical epidemiologist, and data mining specialist). These encounters were recorded and systematically analyzed using a grounded theory methodology for extraction of emerging themes, followed by data triangulation and analysis of negative cases for validation. A policy analysis was then performed using a system dynamics methodology, looking for potential interventions to improve this process. Four major emerging themes were found. Definitions using lay language were frequently employed as a way to bridge the language gap between the specialties. Thought experiments presented a series of "what if" situations that helped clarify how the method or information from the other field would behave if exposed to alternative situations, ultimately aiding in explaining their main objective. Metaphors and analogies were used to translate concepts across fields, from the unfamiliar to the familiar. Prolepsis was used to anticipate study outcomes, thus helping specialists understand the current context based on an understanding of their final goal.
Conclusion/Significance: The communication between clinical researchers and data analysis specialists presents multiple challenges that can lead to errors.

Relevance:

20.00%

Publisher:

Abstract:

Introduction: Work disability is a major consequence of rheumatoid arthritis (RA), associated not only with traditional disease activity variables, but also more significantly with demographic, functional, occupational, and societal variables. Recent reports suggest that the use of biologic agents offers potential for reduced work disability rates, but the conclusions are based on surrogate disease activity measures derived from studies primarily from Western countries. Methods: The Quantitative Standard Monitoring of Patients with RA (QUEST-RA) multinational database of 8,039 patients in 86 sites in 32 countries, 16 with high gross domestic product (GDP) (>24K US dollars (USD) per capita) and 16 low-GDP countries (<11K USD), was analyzed for work and disability status at onset and over the course of RA and clinical status of patients who continued working or had stopped working in high-GDP versus low-GDP countries according to all RA Core Data Set measures. Associations of work disability status with RA Core Data Set variables and indices were analyzed using descriptive statistics and regression analyses. Results: At the time of first symptoms, 86% of men (range 57%-100% among countries) and 64% (19%-87%) of women <65 years were working. More than one third (37%) of these patients reported subsequent work disability because of RA. Among 1,756 patients whose symptoms had begun during the 2000s, the probabilities of continuing to work were 80% (95% confidence interval (CI) 78%-82%) at 2 years and 68% (95% CI 65%-71%) at 5 years, with similar patterns in high-GDP and low-GDP countries. Patients who continued working versus stopped working had significantly better clinical status for all clinical status measures and patient self-report scores, with similar patterns in high-GDP and low-GDP countries. However, patients who had stopped working in high-GDP countries had better clinical status than patients who continued working in low-GDP countries. 
The most significant identifier of work disability in all subgroups was Health Assessment Questionnaire (HAQ) functional disability score. Conclusions: Work disability rates remain high among people with RA during this millennium. In low-GDP countries, people remain working with high levels of disability and disease activity. Cultural and economic differences between societies affect work disability as an outcome measure for RA.