Biblioteca Digital

969 resultados para on-disk data layout

On the simulation of the UPMSAT-2 microsatellite power

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Simulation of satellite subsystems behaviour is extramely important in the design at early stages. The subsystems are normally simulated in the both ways : isolated and as part of more complex simulation that takes into account imputs from other subsystems (concurrent design). In the present work, a simple concurrent simulation of the power subsystem of a microsatellite, UPMSat-2, is described. The aim of the work is to obtain the performance profile of the system (battery charging level, power consumption by the payloads, power supply from solar panels....). Different situations such as battery critical low or high level, effects of high current charging due to the low temperature of solar panels after eclipse,DoD margins..., were analysed, and different safety strategies studied using the developed tool (simulator) to fulfil the mission requirements. Also, failure cases were analysed in order to study the robustness of the system. The mentioned simulator has been programed taking into account the power consumption performances (average and maximum consumptions per orbit/day) of small part of the subsystem (SELEX GALILEO SPVS modular generators built with Azur Space solar cells, SAFT VES16 6P4S Li-ion battery, SSBV magnetometers, TECNOBIT and DATSI/UPM On Board Data Handling -OBDH-...). The developed tool is then intended to be a modular simulator, with the chance of use any other components implementing some standard data.

A Focused Crawler in order to Get Semantic Web Resources (CSR)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents a Focused Crawler in order to Get Semantic Web Resources (CSR). Structured data web are available in formats such as Extensible Markup Language (XML), Resource Description Framework (RDF) and Ontology Web Language (OWL) that can be used for processing. One of the main challenges for performing a manual search and download semantic web resources is that this task consumes a lot of time. Our research work propose a focused crawler which allow to download these resources automatically and store them on disk in order to have a collection that will be used for data processing. CRS consists of three layers: (a) The User Interface Layer, (b) The Focus Crawler Layer and (c) The Base Crawler Layer. CSR uses as a selection policie the Shark-Search method. CSR was conducted with two experiments. The first one starts on December 15 2012 at 7:11 am and ends on December 16 2012 at 4:01 were obtained 448,123,537 bytes of data. The CSR ends by itself after to analyze 80,4375 seeds with an unlimited depth. CSR got 16,576 semantic resources files where the 89 % was RDF, the 10 % was XML and the 1% was OWL. The second one was based on the Web Data Commons work of the Research Group Data and Web Science at the University of Mannheim and the Institute AIFB at the Karlsruhe Institute of Technology. This began at 4:46 am of June 2 2013 and 1:37 am June 9 2013. After 162.51 hours of execution the result was 285,279 semantic resources where predominated the XML resources with 99 % and OWL and RDF with 1 % each one.

Linked-data based domain-specific sentiment lexicons

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper we present a dataset componsed of domain-specific sentiment lexicons in six languages for two domains. We used existing collections of reviews from Trip Advisor, Amazon, the Stanford Network Analysis Project and the OpinRank Review Dataset. We use an RDF model based on the lemon and Marl formats to represent the lexicons. We describe the methodology that we applied to generate the domain-specific lexicons and we provide access information to our datasets.

MedVir: an interactive representation system of multidimensional medical data applied to Traumatic Brain Injury's rehabilitation prediction

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Clinicians could model the brain injury of a patient through his brain activity. However, how this model is defined and how it changes when the patient is recovering are questions yet unanswered. In this paper, the use of MedVir framework is proposed with the aim of answering these questions. Based on complex data mining techniques, this provides not only the differentiation between TBI patients and control subjects (with a 72% of accuracy using 0.632 Bootstrap validation), but also the ability to detect whether a patient may recover or not, and all of that in a quick and easy way through a visualization technique which allows interaction.

Query Rewriting Optimisation Techniques for Ontology-Based Data Access

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Ontology-Based Data Access (OBDA) permite el acceso a diferentes tipos de fuentes de datos (tradicionalmente bases de datos) usando un modelo más abstracto proporcionado por una ontología. La reescritura de consultas (query rewriting) usa una ontología para reescribir una consulta en una consulta reescrita que puede ser evaluada en la fuente de datos. Las consultas reescritas recuperan las respuestas que están implicadas por la combinación de los datos explicitamente almacenados en la fuente de datos, la consulta original y la ontología. Al trabajar sólo sobre las queries, la reescritura de consultas permite OBDA sobre cualquier fuente de datos que puede ser consultada, independientemente de las posibilidades para modificarla. Sin embargo, producir y evaluar las consultas reescritas son procesos costosos que suelen volverse más complejos conforme la expresividad y tamaño de la ontología y las consultas aumentan. En esta tesis exploramos distintas optimizaciones que peuden ser realizadas tanto en el proceso de reescritura como en las consultas reescritas para mejorar la aplicabilidad de OBDA en contextos realistas. Nuestra contribución técnica principal es un sistema de reescritura de consultas que implementa las optimizaciones presentadas en esta tesis. Estas optimizaciones son las contribuciones principales de la tesis y se pueden agrupar en tres grupos diferentes: -optimizaciones que se pueden aplicar al considerar los predicados en la ontología que no están realmente mapeados con las fuentes de datos. -optimizaciones en ingeniería que se pueden aplicar al manejar el proceso de reescritura de consultas en una forma que permite reducir la carga computacional del proceso de generación de consultas reescritas. -optimizaciones que se pueden aplicar al considerar metainformación adicional acerca de las características de la ABox. En esta tesis proporcionamos demostraciones formales acerca de la corrección y completitud de las optimizaciones propuestas, y una evaluación empírica acerca del impacto de estas optimizaciones. Como contribución adicional, parte de este enfoque empírico, proponemos un banco de pruebas (benchmark) para la evaluación de los sistemas de reescritura de consultas. Adicionalmente, proporcionamos algunas directrices para la creación y expansión de esta clase de bancos de pruebas. ABSTRACT Ontology-Based Data Access (OBDA) allows accessing different kinds of data sources (traditionally databases) using a more abstract model provided by an ontology. Query rewriting uses such ontology to rewrite a query into a rewritten query that can be evaluated on the data source. The rewritten queries retrieve the answers that are entailed by the combination of the data explicitly stored in the data source, the original query and the ontology. However, producing and evaluating the rewritten queries are both costly processes that become generally more complex as the expressiveness and size of the ontology and queries increase. In this thesis we explore several optimisations that can be performed both in the rewriting process and in the rewritten queries to improve the applicability of OBDA in real contexts. Our main technical contribution is a query rewriting system that implements the optimisations presented in this thesis. These optimisations are the core contributions of the thesis and can be grouped into three different groups: -optimisations that can be applied when considering the predicates in the ontology that are actually mapped to the data sources. -engineering optimisations that can be applied by handling the process of query rewriting in a way that permits to reduce the computational load of the query generation process. -optimisations that can be applied when considering additional metainformation about the characteristics of the ABox. In this thesis we provide formal proofs for the correctness of the proposed optimisations, and an empirical evaluation about the impact of the optimisations. As an additional contribution, part of this empirical approach, we propose a benchmark for the evaluation of query rewriting systems. We also provide some guidelines for the creation and expansion of this kind of benchmarks.

Design and implementation of an adaptive dashboard and visualization interface for the dynamic optimization and control of Data Centers

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Over the last few years, the Data Center market has increased exponentially and this tendency continues today. As a direct consequence of this trend, the industry is pushing the development and implementation of different new technologies that would improve the energy consumption efficiency of data centers. An adaptive dashboard would allow the user to monitor the most important parameters of a data center in real time. For that reason, monitoring companies work with IoT big data filtering tools and cloud computing systems to handle the amounts of data obtained from the sensors placed in a data center.Analyzing the market trends in this field we can affirm that the study of predictive algorithms has become an essential area for competitive IT companies. Complex algorithms are used to forecast risk situations based on historical data and warn the user in case of danger. Considering that several different users will interact with this dashboard from IT experts or maintenance staff to accounting managers, it is vital to personalize it automatically. Following that line of though, the dashboard should only show relevant metrics to the user in different formats like overlapped maps or representative graphs among others. These maps will show all the information needed in a visual and easy-to-evaluate way. To sum up, this dashboard will allow the user to visualize and control a wide range of variables. Monitoring essential factors such as average temperature, gradients or hotspots as well as energy and power consumption and savings by rack or building would allow the client to understand how his equipment is behaving, helping him to optimize the energy consumption and efficiency of the racks. It also would help him to prevent possible damages in the equipment with predictive high-tech algorithms.

Dynamic regulation of the tryptophan operon: A modeling study and comparison with experimental data

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A mathematical model for regulation of the tryptophan operon is presented. This model takes into account repression, feedback enzyme inhibition, and transcriptional attenuation. Special attention is given to model parameter estimation based on experimental data. The model's system of delay differential equations is numerically solved, and the results are compared with experimental data on the temporal evolution of enzyme activity in cultures of Escherichia coli after a nutritional shift (minimal + tryptophan medium to minimal medium). Good agreement is obtained between the numeric simulations and the experimental results for wild-type E. coli, as well as for two different mutant strains.

Lifespan depends on month of birth

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Month of birth influences adult life expectancy at ages 50+. Why? In two countries of the Northern Hemisphere–Austria and Denmark–people born in autumn (October–December) live longer than those born in spring (April–June). Data for Australia show that, in the Southern Hemisphere, the pattern is shifted by half a year. The lifespan pattern of British immigrants to Australia is similar to that of Austrians and Danes and significantly different from that of Australians. These findings are based on population data with more than a million observations and little or no selectivity. The differences in lifespan are independent of the seasonal distribution of deaths and the social differences in the seasonal distribution of births. In the Northern Hemisphere, the excess mortality in the first year of life of infants born in spring does not support the explanation of selective infant survival. Instead, remaining life expectancy at age 50 appears to depend on factors that arise in utero or early in infancy and that increase susceptibility to diseases later in life. This result is consistent with the finding that, at the turn of the last century, infants born in autumn had higher birth weights than those born in other seasons. Furthermore, differences in adult lifespan by month of birth decrease over time and are significantly smaller in more recent cohorts, which benefited from substantial improvements in maternal and infant health.

PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information

Relevância:

100.00% 100.00%

Publicador:

Resumo:

As the number of protein folds is quite limited, a mode of analysis that will be increasingly common in the future, especially with the advent of structural genomics, is to survey and re-survey the finite parts list of folds from an expanding number of perspectives. We have developed a new resource, called PartsList, that lets one dynamically perform these comparative fold surveys. It is available on the web at http://bioinfo.mbb.yale.edu/partslist and http://www.partslist.org. The system is based on the existing fold classifications and functions as a form of companion annotation for them, providing ‘global views’ of many already completed fold surveys. The central idea in the system is that of comparison through ranking; PartsList will rank the approximately 420 folds based on more than 180 attributes. These include: (i) occurrence in a number of completely sequenced genomes (e.g. it will show the most common folds in the worm versus yeast); (ii) occurrence in the structure databank (e.g. most common folds in the PDB); (iii) both absolute and relative gene expression information (e.g. most changing folds in expression over the cell cycle); (iv) protein–protein interactions, based on experimental data in yeast and comprehensive PDB surveys (e.g. most interacting fold); (v) sensitivity to inserted transposons; (vi) the number of functions associated with the fold (e.g. most multi-functional folds); (vii) amino acid composition (e.g. most Cys-rich folds); (viii) protein motions (e.g. most mobile folds); and (ix) the level of similarity based on a comprehensive set of structural alignments (e.g. most structurally variable folds). The integration of whole-genome expression and protein–protein interaction data with structural information is a particularly novel feature of our system. We provide three ways of visualizing the rankings: a profiler emphasizing the progression of high and low ranks across many pre-selected attributes, a dynamic comparer for custom comparisons and a numerical rankings correlator. These allow one to directly compare very different attributes of a fold (e.g. expression level, genome occurrence and maximum motion) in the uniform numerical format of ranks. This uniform framework, in turn, highlights the way that the frequency of many of the attributes falls off with approximate power-law behavior (i.e. according to V–b, for attribute value V and constant exponent b), with a few folds having large values and most having small values.

The Mice at play in the CALIFA survey. A case study of a gas-rich major merger between first passage and coalescence

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present optical integral field spectroscopy (IFS) observations of the Mice, a major merger between two massive (≳10^11 M_⊙) gas-rich spirals NGC 4676A and B, observed between first passage and final coalescence. The spectra provide stellar and gas kinematics, ionised gas properties, and stellar population diagnostics, over the full optical extent of both galaxies with ~1.6 kpc spatial resolution. The Mice galaxies provide a perfect case study that highlights the importance of IFS data for improving our understanding of local galaxies. The impact of first passage on the kinematics of the stars and gas has been significant, with strong bars most likely induced in both galaxies. The barred spiral NGC 4676B exhibits a strong twist in both its stellar and ionised gas disk. The edge-on disk galaxy NGC 4676A appears to be bulge free, with a strong bar causing its “boxy” light profile. On the other hand, the impact of the merger on the stellar populations has been minimal thus far. By combining the IFS data with archival multiwavelength observations we show that star formation induced by the recent close passage has not contributed significantly to the total star formation rate or stellar mass of the galaxies. Both galaxies show bicones of high ionisation gas extending along their minor axes. In NGC 4676A the high gas velocity dispersion and Seyfert-like line ratios at large scaleheight indicate a powerful outflow. Fast shocks (vs ~ 350 km s^-1) extend to ~6.6 kpc above the disk plane. The measured ram pressure (P/k = 4.8 × 10^6 K cm^-3) and mass outflow rate (~8−20 M_⊙ yr^-1) are similar to superwinds from local ultra-luminous infrared galaxies, although NGC 4676A only has a moderate infrared luminosity of 3 × 10^10 L_⊙. Energy beyond what is provided by the mechanical energy of the starburst appears to be required to drive the outflow. Finally, we compare the observations to mock kinematic and stellar population maps extracted from a hydrodynamical merger simulation. The models show little enhancement in star formation during and following first passage, in agreement with the observations. We highlight areas where IFS data could help further constrain the models.

Pipe3D, a pipeline to analyze integral field spectroscopy DATA: II. Analysis sequence and CALIFA dataproducts

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present PIPE3D, an analysis pipeline based on the FIT3D fitting tool, developed to explore the properties of the stellar populations and ionized gas of integral field spectroscopy (IFS) data. PIPE3D was created to provide coherent, simple to distribute, and comparable dataproducts, independently of the origin of the data, focused on the data of the most recent IFU surveys (e.g., CALIFA, MaNGA, and SAMI), and the last generation IFS instruments (e.g., MUSE). In this article we describe the different steps involved in the analysis of the data, illustrating them by showing the dataproducts derived for NGC 2916, observed by CALIFA and P-MaNGA. As a practical example of the pipeline we present the complete set of dataproducts derived for the 200 datacubes that comprises the V500 setup of the CALIFA Data Release 2 (DR2), making them freely available through the network. Finally, we explore the hypothesis that the properties of the stellar populations and ionized gas of galaxies at the effective radius are representative of the overall average ones, finding that this is indeed the case.

Forensic analysis of buildings affected by mining subsidence based on Differential Interferometry (Part III)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This work presents a forensic analysis of buildings affected by mining subsidence, which is based on deformation data obtained by Differential Interferometry (DInSAR). The proposed test site is La Union village (Murcia, SE Spain) where subsidence was triggered in an industrial area due to the collapse of abandoned underground mining labours occurred in 1998. In the first part of this work the study area was introduced, describing the spatial and temporal evolution of ground subsidence, through the elaboration of a cracks map on the buildings located within the affected area. In the second part, the evolution of the most significant cracks found in the most damaged buildings was monitored using biaxial extensometric units and inclinometers. This article describes the work performed in the third part, where DInSAR processing of satellite radar data, available between 1998 and 2008, has permitted to determine the spatial and temporal evolution of the deformation of all the buildings of the study area in a period when no continuous in situ instrumental data is available. Additionally, the comparison of these results with the forensic data gathered in the 2005–2008 period, reveal that there is a coincidence between damaged buildings, buildings where extensometers register significant movements of cracks, and buildings deformation estimated from radar data. As a result, it has been demonstrated that the integration of DInSAR data into forensic analysis methodologies contributes to improve significantly the assessment of the damages of buildings affected by mining subsidence.

Towards a reverse engineering approach for guiding user in applying data mining

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Comunicación presentada en las XVI Jornadas de Ingeniería del Software y Bases de Datos, JISBD 2011, A Coruña, 5-7 septiembre 2011.

Enrichment of the Phenotypic and Genotypic Data Warehouse analysis using Question Answering systems to facilitate the decision making process in cereal breeding programs

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Currently there are an overwhelming number of scientific publications in Life Sciences, especially in Genetics and Biotechnology. This huge amount of information is structured in corporate Data Warehouses (DW) or in Biological Databases (e.g. UniProt, RCSB Protein Data Bank, CEREALAB or GenBank), whose main drawback is its cost of updating that makes it obsolete easily. However, these Databases are the main tool for enterprises when they want to update their internal information, for example when a plant breeder enterprise needs to enrich its genetic information (internal structured Database) with recently discovered genes related to specific phenotypic traits (external unstructured data) in order to choose the desired parentals for breeding programs. In this paper, we propose to complement the internal information with external data from the Web using Question Answering (QA) techniques. We go a step further by providing a complete framework for integrating unstructured and structured information by combining traditional Databases and DW architectures with QA systems. The great advantage of our framework is that decision makers can compare instantaneously internal data with external data from competitors, thereby allowing taking quick strategic decisions based on richer data.

A Sociocybernetics Data Analysis Using Causality in Tourism Networks

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The aim of this paper is to propose a mathematical model to determine invariant sets, set covering, orbits and, in particular, attractors in the set of tourism variables. Analysis was carried out based on a pre-designed algorithm and applying our interpretation of chaos theory developed in the context of General Systems Theory. This article sets out the causal relationships associated with tourist flows in order to enable the formulation of appropriate strategies. Our results can be applied to numerous cases. For example, in the analysis of tourist flows, these findings can be used to determine whether the behaviour of certain groups affects that of other groups and to analyse tourist behaviour in terms of the most relevant variables. Unlike statistical analyses that merely provide information on current data, our method uses orbit analysis to forecast, if attractors are found, the behaviour of tourist variables in the immediate future.

«
1
2
...
27
28
29
30
31
32
33
...
64
65
»