Biblioteca Digital

934 resultados para Data anonymization and sanitization

Integrated data model and DSL modifications

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Companies are increasingly more and more dependent on distributed web-based software systems to support their businesses. This increases the need to maintain and extend software systems with up-to-date new features. Thus, the development process to introduce new features usually needs to be swift and agile, and the supporting software evolution process needs to be safe, fast, and efficient. However, this is usually a difficult and challenging task for a developer due to the lack of support offered by programming environments, frameworks, and database management systems. Changes needed at the code level, database model, and the actual data contained in the database must be planned and developed together and executed in a synchronized way. Even under a careful development discipline, the impact of changing an application data model is hard to predict. The lifetime of an application comprises changes and updates designed and tested using data, which is usually far from the real, production, data. So, coding DDL and DML SQL scripts to update database schema and data, is the usual (and hard) approach taken by developers. Such manual approach is error prone and disconnected from the real data in production, because developers may not know the exact impact of their changes. This work aims to improve the maintenance process in the context of Agile Platform by Outsystems. Our goal is to design and implement new data-model evolution features that ensure a safe support for change and a sound migration process. Our solution includes impact analysis mechanisms targeting the data model and the data itself. This provides, to developers, a safe, simple, and guided evolution process.

Combining data mining and evolutionary computation for multi-criteria optimization of earthworks

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Earthworks tasks aim at levelling the ground surface at a target construction area and precede any kind of structural construction (e.g., road and railway construction). It is comprised of sequential tasks, such as excavation, transportation, spreading and compaction, and it is strongly based on heavy mechanical equipment and repetitive processes. Under this context, it is essential to optimize the usage of all available resources under two key criteria: the costs and duration of earthwork projects. In this paper, we present an integrated system that uses two artificial intelligence based techniques: data mining and evolutionary multi-objective optimization. The former is used to build data-driven models capable of providing realistic estimates of resource productivity, while the latter is used to optimize resource allocation considering the two main earthwork objectives (duration and cost). Experiments held using real-world data, from a construction site, have shown that the proposed system is competitive when compared with current manual earthwork design.

Reconstructing transcriptional regulatory networks using data integration and text mining

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Transcriptional Regulatory Networks (TRNs) are powerful tool for representing several interactions that occur within a cell. Recent studies have provided information to help researchers in the tasks of building and understanding these networks. One of the major sources of information to build TRNs is biomedical literature. However, due to the rapidly increasing number of scientific papers, it is quite difficult to analyse the large amount of papers that have been published about this subject. This fact has heightened the importance of Biomedical Text Mining approaches in this task. Also, owing to the lack of adequate standards, as the number of databases increases, several inconsistencies concerning gene and protein names and identifiers are common. In this work, we developed an integrated approach for the reconstruction of TRNs that retrieve the relevant information from important biological databases and insert it into a unique repository, named KREN. Also, we applied text mining techniques over this integrated repository to build TRNs. However, was necessary to create a dictionary of names and synonyms associated with these entities and also develop an approach that retrieves all the abstracts from the related scientific papers stored on PubMed, in order to create a corpora of data about genes. Furthermore, these tasks were integrated into @Note, a software system that allows to use some methods from the Biomedical Text Mining field, including an algorithms for Named Entity Recognition (NER), extraction of all relevant terms from publication abstracts, extraction relationships between biological entities (genes, proteins and transcription factors). And finally, extended this tool to allow the reconstruction Transcriptional Regulatory Networks through using scientific literature.

Fishery statistics of the United States / prepared by Data Management and Statistics Division.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

1975

A tale of two logits, compositional data analysis and zero observations

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The application of compositional data analysis through log ratio trans-formations corresponds to a multinomial logit model for the shares themselves.This model is characterized by the property of Independence of Irrelevant Alter-natives (IIA). IIA states that the odds ratio in this case the ratio of shares is invariant to the addition or deletion of outcomes to the problem. It is exactlythis invariance of the ratio that underlies the commonly used zero replacementprocedure in compositional data analysis. In this paper we investigate using thenested logit model that does not embody IIA and an associated zero replacementprocedure and compare its performance with that of the more usual approach ofusing the multinomial logit model. Our comparisons exploit a data set that com-bines voting data by electoral division with corresponding census data for eachdivision for the 2001 Federal election in Australia

Phylogeny and circumscription of Sapindaceae revisited: molecular sequence data, morphology and biogeography support recognition of a new family, Xanthoceraceae

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background and aims Recent studies have adopted a broad definition of Sapindaceae that includes taxa traditionally placed in Aceraceae and Hippocastanaceae, achieving monophyly but yielding a family difficult to characterize and for which no obvious morphological synapomorphy exists. This expanded circumscription was necessitated by the finding that the monotypic, temperate Asian genus Xanthoceras, historically placed in Sapindaceae tribe Harpullieae, is basal within the group. Here we seek to clarify the relationships of Xanthoceras based on phylogenetic analyses using a dataset encompassing nearly 3/4 of sapindaceous genera, comparing the results with information from morphology and biogeography, in particular with respect to the other taxa placed in Harpullieae. We then re-examine the appropriateness of maintaining the current broad, morphologically heterogeneous definition of Sapindaceae and explore the advantages of an alternative family circumscription. Methods Using 243 samples representing 104 of the 142 currently recognized genera of Sapindaceae s. lat. (including all in Harpullieae), sequence data were analyzed for nuclear (ITS) and plastid (matK, rpoB, trnD-trnT, trnK-matK, trnL-trnF and trnS-trnG) markers, adopting the methodology of a recent family-wide study, performing single-gene and total evidence analyses based on maximum likelihood (ML) and maximum parsimony (MP) criteria, and applying heuristic searches developed for large datasets, viz, a new strategy implemented in RAxML (for ML) and the parsimony ratchet (for MP). Bootstrap analyses were performed for each method to test for congruence between markers. Key results Our findings support earlier suggestions that Harpullieae are polyphyletic: Xanthoceras is confirmed as sister to all other sampled taxa of Sapindaceae s. lat.; the remaining members belong to three other clades within Sapindaceae s. lat., two of which correspond respectively to the groups traditionally treated as Aceraceae and Hippocastanaceae, together forming a clade sister to the largely tropical Sapindaceae s. str., which is monophyletic and morphologically coherent provided Xanthoceras is excluded. Conclusion To overcome the difficulties of a broadly circumscribed Sapindaceae, we resurrect the historically recognized temperate families Aceraceae and Hippocastanaceae, and describe a new family, Xanthoceraceae, thus adopting a monophyletic and easily characterized circumscription of Sapindaceae nearly identical to that used for over a century.

Liver safety assessment: required data elements and best practices for data collection and standardization in clinical trials.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A workshop was convened to discuss best practices for the assessment of drug-induced liver injury (DILI) in clinical trials. In a breakout session, workshop attendees discussed necessary data elements and standards for the accurate measurement of DILI risk associated with new therapeutic agents in clinical trials. There was agreement that in order to achieve this goal the systematic acquisition of protocol-specified clinical measures and lab specimens from all study subjects is crucial. In addition, standard DILI terms that address the diverse clinical and pathologic signatures of DILI were considered essential. There was a strong consensus that clinical and lab analyses necessary for the evaluation of cases of acute liver injury should be consistent with the US Food and Drug Administration (FDA) guidance on pre-marketing risk assessment of DILI in clinical trials issued in 2009. A recommendation that liver injury case review and management be guided by clinicians with hepatologic expertise was made. Of note, there was agreement that emerging DILI signals should prompt the systematic collection of candidate pharmacogenomic, proteomic and/or metabonomic biomarkers from all study subjects. The use of emerging standardized clinical terminology, CRFs and graphic tools for data review to enable harmonization across clinical trials was strongly encouraged. Many of the recommendations made in the breakout session are in alignment with those made in the other parallel sessions on methodology to assess clinical liver safety data, causality assessment for suspected DILI, and liver safety assessment in special populations (hepatitis B, C, and oncology trials). Nonetheless, a few outstanding issues remain for future consideration.

Truncated robust distance for clinical laboratory safety data monitoring and assessment.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Laboratory safety data are routinely collected in clinical studies for safety monitoring and assessment. We have developed a truncated robust multivariate outlier detection method for identifying subjects with clinically relevant abnormal laboratory measurements. The proposed method can be applied to historical clinical data to establish a multivariate decision boundary that can then be used for future clinical trial laboratory safety data monitoring and assessment. Simulations demonstrate that the proposed method has the ability to detect relevant outliers while automatically excluding irrelevant outliers. Two examples from actual clinical studies are used to illustrate the use of this method for identifying clinically relevant outliers.

@neurIST infrastructure for advanced disease management through integration of heterogeneous data, computing, and complex processing services

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The increasing volume of data describing humandisease processes and the growing complexity of understanding, managing, and sharing such data presents a huge challenge for clinicians and medical researchers. This paper presents the@neurIST system, which provides an infrastructure for biomedical research while aiding clinical care, by bringing together heterogeneous data and complex processing and computing services. Although @neurIST targets the investigation and treatment of cerebral aneurysms, the system’s architecture is generic enough that it could be adapted to the treatment of other diseases.Innovations in @neurIST include confining the patient data pertaining to aneurysms inside a single environment that offers cliniciansthe tools to analyze and interpret patient data and make use of knowledge-based guidance in planning their treatment. Medicalresearchers gain access to a critical mass of aneurysm related data due to the system’s ability to federate distributed informationsources. A semantically mediated grid infrastructure ensures that both clinicians and researchers are able to seamlessly access andwork on data that is distributed across multiple sites in a secure way in addition to providing computing resources on demand forperforming computationally intensive simulations for treatment planning and research.

What is "clinical data"? Why and how can they be collected during field surveys on medicinal plants?

Relevância:

100.00% 100.00%

Publicador:

Resumo:

ETHNOPHARMACOLOGICAL RELEVANCE: "Reverse pharmacology", also called "bedside-to-bench" or "field to pharmacy" approach, is a research process starting with documentation of clinical outcome as observed by patients with different therapeutic regimens. The treatment most significantly associated with cure is selected for future studies: first, clinical safety and efficacy; then in vivo and vitro studies. Some clinical data, i.e. details on patient status and progress, can be collected during ethnobotanical surveys; they will help clinical researchers and, once effectiveness and safety are established, will also help users of traditional medicine make safer and more effective choices. To gather clinical data successfully, ethnopharmacologists need to be backed by an appropriate team of specialists in medicine and epidemiology. Ethnopharmacologists can also gather important data on traditional medicine safety. MATERIALS AND METHODS: The first step is to create a consensus on the meaning of "clinical data", their interest and importance. An understanding of why "a cure is not a proof of effectiveness" is a starting point to avoid faulty interpretation of the clinical observations. RESULTS: Experience showed that, with the "bedside-to-bench" approach, a treatment derived from traditional recipe can be scientifically validated (in terms of safety and effectiveness) with a cost of less than a million euros, thus providing an end-product that is affordable, available and sustainable. CONCLUSIONS: With rigorous clinical study results, medicinal plant users gain the possibility to refine heath strategies. The field surveyor may gain a better relationship with the population, once she/he is seen as bringing information useful for the quality of care in the community.

Field Experiments of Current Concrete Pavement Surface Characteristics Practices: Iowa Data Collection and Analysis, December 2005

Relevância:

100.00% 100.00%

Publicador:

Resumo:

One of the most important issues in portland cement concrete pavement research today is surface characteristics. The issue is one of balancing surface texture construction with the need for durability, skid resistance, and noise reduction. The National Concrete Pavement Technology Center at Iowa State University, in conjunction with the Federal Highway Administration, American Concrete Pavement Association, International Grinding and Grooving Association, Iowa Highway Research Board, and other states, have entered into a three-part National Surface Characteristics Program to resolve the balancing problem. As a portion of Part 2, this report documents the construction of 18 separate pavement surfaces for use in the first level of testing for the national project. It identifies the testing to be done and the limitations observed in the construction process. The results of the actual tests will be included in the subsequent national study reports.

The Anarak, Jandaq and Posht-e-Badam metamorphic complexes in central Iran: New geological data, relationships and tectonic implications

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Anarak, Jandaq and Posht-e-Badam metamorphic complexes occupy the NW part of the Central-East Iranian Microcontinent and are juxtaposed with the Great Kavir block and Sanandaj-Sirjan zone. Our recent findings redefine the origin of these complexes, so far attributed to the Precambrian-Early Paleozoic orogenic episodes, and now directly related to the tectonic evolution of the Paleo-Tethys Ocean. This tectonic evolution was initiated by Late Ordovician-Early Devonian rifting events and terminated in the Triassic by the Eocimmerian collision event due to the docking of the Cimmerian blocks with the Asiatic Turan block. The ``Variscan accretionary complex'' is a new name we proposed for the most widely distributed metamorphic rocks connected to the Anarak and Jandaq complexes. This accretionary complex exposed from SW of Jandaq to the Anarak and Kabudan areas is a thick and fine grain siliciclastic sequence accompanied by marginal-sea ophiolitic remnants, including gabbro-basalts with a supra-subduction-geochemical signature. New Ar-40/Ar-39 ages are obtained as 333-320 Ma for the metamorphism of this sequence under greenschist to amphibolite facies. Moreover, the limy intercalations in the volcano-sedimentary part of this complex in Godar-e-Siah yielded Upper Devonian-Tournaisian conodonts. The northeastern part of this complex in the Jandaq area was intruded by 215 +/- 15 Ma arc to collisional granite and pegmatites dated by ID-TIMS and its metamorphic rocks are characterized by Some Ar-40/Ar-39 radiometric ages of 163-156 Ma. The ``Variscan'' accretionary complex was northwardly accreted to the Airekan granitic terrane dated at 549 +/- 15 Ma. Later, from the Late Carboniferous to Triassic, huge amounts of oceanic material were accreted to its southern side and penetrated by several seamounts such as the Anarak and Kabudan. This new period of accretion is supported by the 280-230 Ma Ar-40/Ar-39 ages for the Anarak mild high-pressure metamorphic rocks and a 262 Ma U-Pb age for the trondhjemite-rhyolite association of that area. The Triassic Bayazeh flysch filled the foreland basin during the final closure of the Paleo-Tethys Ocean and was partly deposited and/or thrusted onto the Cimmerian Yazd block. The Paleo-Tethys magmatic arc products have been well-preserved in the Late Devonian-Carboniferous Godar-e-Siah intra-arc deposits and the Triassic Nakhlak fore-arc succession. On the passive margin of the Cimmerian block, in the Yazd region, the nearly continuous Upper Paleozoic platform-type deposition was totally interrupted during the Middle to Late Triassic. Local erosion, down to Lower Paleozoic levels, may be related to flexural bulge erosion. The platform was finally unconformably covered by Liassic continental molassic deposits of the Shemshak. One of the extensional periods related to Neo-Tethyan back-arc rifting in Late Cretaceous time finally separated parts of the Eocimmerian collisional domain from the Eurasian Turan domain. The opening and closing of this new ocean, characterized by the Nain and Sabzevar ophiolitic melanges, finally transported the Anarak-Jandaq composite terrane to Central Iran, accompanied by large scale rotation of the Central-East Iranian Microcontinent (CEIM). Due to many similarities between the Posht-e-Badam metamorphic complex and the Anarak-Jandaq composite terrane, the former could be part of the latter, if it was transported further south during Tertiary time. (C) 2007 Elsevier B.V. All rights reserved.

How many diagnosis fields are needed to capture safety events in administrative data? Findings and recommendations from the WHO ICD-11 Topic Advisory Group on Quality and Safety

Relevância:

100.00% 100.00%

Publicador:

Resumo:

OBJECTIVE: As part of the WHO ICD-11 development initiative, the Topic Advisory Group on Quality and Safety explores meta-features of morbidity data sets, such as the optimal number of secondary diagnosis fields. DESIGN: The Health Care Quality Indicators Project of the Organization for Economic Co-Operation and Development collected Patient Safety Indicator (PSI) information from administrative hospital data of 19-20 countries in 2009 and 2011. We investigated whether three countries that expanded their data systems to include more secondary diagnosis fields showed increased PSI rates compared with six countries that did not. Furthermore, administrative hospital data from six of these countries and two American states, California (2011) and Florida (2010), were analysed for distributions of coded patient safety events across diagnosis fields. RESULTS: Among the participating countries, increasing the number of diagnosis fields was not associated with any overall increase in PSI rates. However, high proportions of PSI-related diagnoses appeared beyond the sixth secondary diagnosis field. The distribution of three PSI-related ICD codes was similar in California and Florida: 89-90% of central venous catheter infections and 97-99% of retained foreign bodies and accidental punctures or lacerations were captured within 15 secondary diagnosis fields. CONCLUSIONS: Six to nine secondary diagnosis fields are inadequate for comparing complication rates using hospital administrative data; at least 15 (and perhaps more with ICD-11) are recommended to fully characterize clinical outcomes. Increasing the number of fields should improve the international and intra-national comparability of data for epidemiologic and health services research, utilization analyses and quality of care assessment.

Three-dimensional interpolation of soil data: fertility and pedomorphological features in southern Brazil

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The graphical representation of spatial soil properties in a digital environment is complex because it requires a conversion of data collected in a discrete form onto a continuous surface. The objective of this study was to apply three-dimension techniques of interpolation and visualization on soil texture and fertility properties and establish relationships with pedogenetic factors and processes in a slope area. The GRASS Geographic Information System was used to generate three-dimensional models and ParaView software to visualize soil volumes. Samples of the A, AB, BA, and B horizons were collected in a regular 122-point grid in an area of 13 ha, in Pinhais, PR, in southern Brazil. Geoprocessing and graphic computing techniques were effective in identifying and delimiting soil volumes of distinct ranges of fertility properties confined within the soil matrix. Both three-dimensional interpolation and the visualization tool facilitated interpretation in a continuous space (volumes) of the cause-effect relationships between soil texture and fertility properties and pedological factors and processes, such as higher clay contents following the drainage lines of the area. The flattest part with more weathered soils (Oxisols) had the highest pH values and lower Al3+ concentrations. These techniques of data interpolation and visualization have great potential for use in diverse areas of soil science, such as identification of soil volumes occurring side-by-side but that exhibit different physical, chemical, and mineralogical conditions for plant root growth, and monitoring of plumes of organic and inorganic pollutants in soils and sediments, among other applications. The methodological details for interpolation and a three-dimensional view of soil data are presented here.

Design, data management, and population baseline characteristics of the PERFORM magnetic resonance imaging project.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Quantitative information from magnetic resonance imaging (MRI) may substantiate clinical findings and provide additional insight into the mechanism of clinical interventions in therapeutic stroke trials. The PERFORM study is exploring the efficacy of terutroban versus aspirin for secondary prevention in patients with a history of ischemic stroke. We report on the design of an exploratory longitudinal MRI follow-up study that was performed in a subgroup of the PERFORM trial. An international multi-centre longitudinal follow-up MRI study was designed for different MR systems employing safety and efficacy readouts: new T2 lesions, new DWI lesions, whole brain volume change, hippocampal volume change, changes in tissue microstructure as depicted by mean diffusivity and fractional anisotropy, vessel patency on MR angiography, and the presence of and development of new microbleeds. A total of 1,056 patients (men and women ≥ 55 years) were included. The data analysis included 3D reformation, image registration of different contrasts, tissue segmentation, and automated lesion detection. This large international multi-centre study demonstrates how new MRI readouts can be used to provide key information on the evolution of cerebral tissue lesions and within the macrovasculature after atherothrombotic stroke in a large sample of patients.

«
1
2
3
4
5
6
7
8
...
62
63
»