9 resultados para Towards Seamless Integration of Geoscience Models and Data

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Abstract Background The study and analysis of gene expression measurements is the primary focus of functional genomics. Once expression data is available, biologists are faced with the task of extracting (new) knowledge associated to the underlying biological phenomenon. Most often, in order to perform this task, biologists execute a number of analysis activities on the available gene expression dataset rather than a single analysis activity. The integration of heteregeneous tools and data sources to create an integrated analysis environment represents a challenging and error-prone task. Semantic integration enables the assignment of unambiguous meanings to data shared among different applications in an integrated environment, allowing the exchange of data in a semantically consistent and meaningful way. This work aims at developing an ontology-based methodology for the semantic integration of gene expression analysis tools and data sources. The proposed methodology relies on software connectors to support not only the access to heterogeneous data sources but also the definition of transformation rules on exchanged data. Results We have studied the different challenges involved in the integration of computer systems and the role software connectors play in this task. We have also studied a number of gene expression technologies, analysis tools and related ontologies in order to devise basic integration scenarios and propose a reference ontology for the gene expression domain. Then, we have defined a number of activities and associated guidelines to prescribe how the development of connectors should be carried out. Finally, we have applied the proposed methodology in the construction of three different integration scenarios involving the use of different tools for the analysis of different types of gene expression data. Conclusions The proposed methodology facilitates the development of connectors capable of semantically integrating different gene expression analysis tools and data sources. The methodology can be used in the development of connectors supporting both simple and nontrivial processing requirements, thus assuring accurate data exchange and information interpretation from exchanged data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: schizophrenia's endophenotipic profile is not only generally complex, but often varies from case to case. The perspective of trying to define specific anatomic correlates of the syndrome has led to disappointing results. In that context, neurophysiologic hypotheses (e. g. glutamatergic hypothesis) and connectivity hypotheses became prominent. Nevertheless, despite their commitment to the principle of denying 'localist' views and approaching the syndrome's endophenotype from a whole brain perspective, efforts to integrate both have not flourished at this moment in time. Objectives: This paper aims to introduce a new etiological model that integrates the glutamatergic and the WM (WM) hypotheses of schizophrenia's etiology. This model proposes to serve as a framework in order to relate to patterns of brain abnormalities from the onset of the syndrome to stages of advanced chronification. Highlights: Neurotransmitter abnormalities forego noticeable WM abnormalities. The former, chiefly represented by NMDAR hypo-function and associated molecular cascades, is related to the first signs of cell loss. This process is both directly and indirectly integrated to the underpinning of WM structural abnormalities; not only is the excess of glutamate toxic to the WM, but its disruption is associated to the expression of known genetic risk factors (e. g., NRG-1). A second level of the model develops the idea that abnormal neurotransmission within specific neural populations ('motifs') impair particular cognitive abilities, while subsequent WM structural abnormalities impair the integration of brain functions and multimodality. As a result of this two-stage dynamic, the affected individual progresses from experiencing specific cognitive and psychological deficits, to a condition of cognitive and existential fragmentation, linked to hardly reversible decreases in psychosocial functioning.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objective of this work was to evaluate extreme water table depths in a watershed, using methods for geographical spatial data analysis. Groundwater spatio-temporal dynamics was evaluated in an outcrop of the Guarani Aquifer System. Water table depths were estimated from monitoring of water levels in 23 piezometers and time series modeling available from April 2004 to April 2011. For generation of spatial scenarios, geostatistical techniques were used, which incorporated into the prediction ancillary information related to the geomorphological patterns of the watershed, using a digital elevation model. This procedure improved estimates, due to the high correlation between water levels and elevation, and aggregated physical sense to predictions. The scenarios showed differences regarding the extreme levels - too deep or too shallow ones - and can subsidize water planning, efficient water use, and sustainable water management in the watershed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background The use of the knowledge produced by sciences to promote human health is the main goal of translational medicine. To make it feasible we need computational methods to handle the large amount of information that arises from bench to bedside and to deal with its heterogeneity. A computational challenge that must be faced is to promote the integration of clinical, socio-demographic and biological data. In this effort, ontologies play an essential role as a powerful artifact for knowledge representation. Chado is a modular ontology-oriented database model that gained popularity due to its robustness and flexibility as a generic platform to store biological data; however it lacks supporting representation of clinical and socio-demographic information. Results We have implemented an extension of Chado – the Clinical Module - to allow the representation of this kind of information. Our approach consists of a framework for data integration through the use of a common reference ontology. The design of this framework has four levels: data level, to store the data; semantic level, to integrate and standardize the data by the use of ontologies; application level, to manage clinical databases, ontologies and data integration process; and web interface level, to allow interaction between the user and the system. The clinical module was built based on the Entity-Attribute-Value (EAV) model. We also proposed a methodology to migrate data from legacy clinical databases to the integrative framework. A Chado instance was initialized using a relational database management system. The Clinical Module was implemented and the framework was loaded using data from a factual clinical research database. Clinical and demographic data as well as biomaterial data were obtained from patients with tumors of head and neck. We implemented the IPTrans tool that is a complete environment for data migration, which comprises: the construction of a model to describe the legacy clinical data, based on an ontology; the Extraction, Transformation and Load (ETL) process to extract the data from the source clinical database and load it in the Clinical Module of Chado; the development of a web tool and a Bridge Layer to adapt the web tool to Chado, as well as other applications. Conclusions Open-source computational solutions currently available for translational science does not have a model to represent biomolecular information and also are not integrated with the existing bioinformatics tools. On the other hand, existing genomic data models do not represent clinical patient data. A framework was developed to support translational research by integrating biomolecular information coming from different “omics” technologies with patient’s clinical and socio-demographic data. This framework should present some features: flexibility, compression and robustness. The experiments accomplished from a use case demonstrated that the proposed system meets requirements of flexibility and robustness, leading to the desired integration. The Clinical Module can be accessed in http://dcm.ffclrp.usp.br/caib/pg=iptrans webcite.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper we continue the development of the differential calculus started in Aragona et al. (Monatsh. Math. 144: 13-29, 2005). Guided by the so-called sharp topology and the interpretation of Colombeau generalized functions as point functions on generalized point sets, we introduce the notion of membranes and extend the definition of integrals, given in Aragona et al. (Monatsh. Math. 144: 13-29, 2005), to integrals defined on membranes. We use this to prove a generalized version of the Cauchy formula and to obtain the Goursat Theorem for generalized holomorphic functions. A number of results from classical differential and integral calculus, like the inverse and implicit function theorems and Green's theorem, are transferred to the generalized setting. Further, we indicate that solution formulas for transport and wave equations with generalized initial data can be obtained as well.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: In the analysis of effects by cell treatment such as drug dosing, identifying changes on gene network structures between normal and treated cells is a key task. A possible way for identifying the changes is to compare structures of networks estimated from data on normal and treated cells separately. However, this approach usually fails to estimate accurate gene networks due to the limited length of time series data and measurement noise. Thus, approaches that identify changes on regulations by using time series data on both conditions in an efficient manner are demanded. Methods: We propose a new statistical approach that is based on the state space representation of the vector autoregressive model and estimates gene networks on two different conditions in order to identify changes on regulations between the conditions. In the mathematical model of our approach, hidden binary variables are newly introduced to indicate the presence of regulations on each condition. The use of the hidden binary variables enables an efficient data usage; data on both conditions are used for commonly existing regulations, while for condition specific regulations corresponding data are only applied. Also, the similarity of networks on two conditions is automatically considered from the design of the potential function for the hidden binary variables. For the estimation of the hidden binary variables, we derive a new variational annealing method that searches the configuration of the binary variables maximizing the marginal likelihood. Results: For the performance evaluation, we use time series data from two topologically similar synthetic networks, and confirm that our proposed approach estimates commonly existing regulations as well as changes on regulations with higher coverage and precision than other existing approaches in almost all the experimental settings. For a real data application, our proposed approach is applied to time series data from normal Human lung cells and Human lung cells treated by stimulating EGF-receptors and dosing an anticancer drug termed Gefitinib. In the treated lung cells, a cancer cell condition is simulated by the stimulation of EGF-receptors, but the effect would be counteracted due to the selective inhibition of EGF-receptors by Gefitinib. However, gene expression profiles are actually different between the conditions, and the genes related to the identified changes are considered as possible off-targets of Gefitinib. Conclusions: From the synthetically generated time series data, our proposed approach can identify changes on regulations more accurately than existing methods. By applying the proposed approach to the time series data on normal and treated Human lung cells, candidates of off-target genes of Gefitinib are found. According to the published clinical information, one of the genes can be related to a factor of interstitial pneumonia, which is known as a side effect of Gefitinib.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The scope of this study was to estimate calibrated values for dietary data obtained by the Food Frequency Questionnaire for Adolescents (FFQA) and illustrate the effect of this approach on food consumption data. The adolescents were assessed on two occasions, with an average interval of twelve months. In 2004, 393 adolescents participated, and 289 were then reassessed in 2005. Dietary data obtained by the FFQA were calibrated using the regression coefficients estimated from the average of two 24-hour recalls (24HR) of the subsample. The calibrated values were similar to the the 24HR reference measurement in the subsample. In 2004 and 2005 a significant difference was observed between the average consumption levels of the FFQA before and after calibration for all nutrients. With the use of calibrated data the proportion of schoolchildren who had fiber intake below the recommended level increased. Therefore, it is seen that calibrated data can be used to obtain adjusted associations due to reclassification of subjects within the predetermined categories.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents the offorts to calculate the geoid model for Brazil. It is limited by 6 degrees N and 35 degrees S in latitude and 30 degrees W and 75 degrees W in longitude. The terrestrial gravity data for the continent have been updated by means of the most recent surveys in Brazil and in the neighbour countries. The complete Bouguer and Helmert gravity anomalies have been derived through the Canadian package SHGEO. The short wavelength component was estimated via FFT. The geopotential model EGM2008 was used as a reference field restricted to degree and order 150. The model was validated over 844 GPS observations on Bench Marks of the spirit leveling network. The height anomalies plus a topography dependent correction term derived from EGM2008 (degree 2190 and order 2159), GO_CONS_GCF_2_DIR_R2 (degree and order 240), GOCO02S (degree and order 250), EIGEN 51C (degree and order 359) and EIGEN 6C (degree and order 1420), geoidal height derived from MAPGEO2004 (old official geoid model in Brazil) have also been compared to the GPS points on Bench Marks.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Produced water in oil fields is one of the main sources of wastewater generated in the industry. It contains several organic compounds, such as benzene, toluene, ethyl benzene and xylene (BTEX), whose disposal is regulated by law. The aim of this study is to investigate a treatment of produced water integrating two processes, i.e., induced air flotation (IAF) and photo-Fenton. The experiments were conducted in a column flotation and annular lamp reactor for flotation and photodegradation steps, respectively. The first order kinetic constant of IAF for the wastewater studied was determined to be 0.1765 min(-1) for the surfactant EO 7. Degradation efficiencies of organic loading were assessed using factorial planning. Statistical data analysis shows that H2O2 concentration is a determining factor in process efficiency. Degradations above 90% were reached in all cases after 90 min of reaction, attaining 100% mineralization in the optimized concentrations of Fenton reagents. Process integration was adequate with 100% organic load removal in 20 min. The results of the integration of the IAF with the photo-Fenton allowed to meet the effluent limits established by Brazilian legislation for disposal. (C) 2011 Elsevier B.V. All rights reserved.