77 results for literature-data integration


Relevance: 30.00%

Abstract:

The past years have shown an enormous advancement in sequencing and array-based technologies, producing supplementary or alternative views of the genome stored in various formats and databases. Their sheer volume and differing data scope make it a challenge to jointly visualize and integrate diverse data types. We present AmalgamScope, a new interactive software tool that assists scientists with the annotation of the human genome and, in particular, with the integration of annotation files from multiple data types using gene identifiers and genomic coordinates. Supported platforms include next-generation sequencing and microarray technologies. The available features of AmalgamScope range from the annotation of diverse data types across the human genome to integration of the data based on the annotation information and visualization of the merged files within chromosomal regions or across the whole genome. Additionally, users can define custom transcriptome library files for any species and use the tool's remote-server file-exchange options.
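Integration by genomic coordinates, as described in this abstract, can be illustrated with a minimal interval-overlap sketch. This is not AmalgamScope's implementation: the `Feature` type, the `integrate` function, the 1-based inclusive coordinate convention and all sample records are assumptions made up for illustration.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """A hypothetical annotated region; coordinates assumed 1-based, inclusive."""
    chrom: str
    start: int
    end: int
    label: str


def overlaps(a: Feature, b: Feature) -> bool:
    """True if both features lie on the same chromosome and share at least one base."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end


def integrate(genes: list[Feature], probes: list[Feature]) -> dict[str, list[str]]:
    """Map each gene label to the labels of probes that overlap it."""
    merged: dict[str, list[str]] = {g.label: [] for g in genes}
    for g in genes:
        for p in probes:
            if overlaps(g, p):
                merged[g.label].append(p.label)
    return merged


# Made-up example records (not real annotation data).
genes = [
    Feature("chr1", 100, 500, "GENE_A"),
    Feature("chr2", 100, 500, "GENE_B"),
]
probes = [
    Feature("chr1", 450, 600, "probe_1"),  # overlaps GENE_A
    Feature("chr1", 501, 700, "probe_2"),  # starts just past GENE_A
    Feature("chr2", 50, 99, "probe_3"),    # ends just before GENE_B
]
result = integrate(genes, probes)
print(result)
```

The nested loop is quadratic in the number of features; for genome-scale files a sorted sweep or an interval tree would be the usual choice.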

Relevance: 30.00%

Abstract:

The mineralogy of airborne dust affects the impact of dust particles on direct and indirect radiative forcing, on atmospheric chemistry and on biogeochemical cycling. It is determined partly by the mineralogy of the dust-source regions and partly by size-dependent fractionation during erosion and transport. Here we present a data set that characterizes the clay and silt-sized fractions of global soil units in terms of the abundance of 12 minerals that are important for dust–climate interactions: quartz, feldspars, illite, smectite, kaolinite, chlorite, vermiculite, mica, calcite, gypsum, hematite and goethite. The basic mineralogical information is derived from the literature, and is then expanded following explicit rules, in order to characterize as many soil units as possible. We present three alternative realizations of the mineralogical maps, taking the uncertainties in the mineralogical data into account. We examine the implications of the new database for calculations of the single scattering albedo of airborne dust and thus for dust radiative forcing.

Relevance: 30.00%

Abstract:

This paper reviews the literature concerning the practice of using Online Analytical Processing (OLAP) systems to recall information stored by Online Transactional Processing (OLTP) systems. Such a review provides a basis for discussion on the need for the information that is recalled through OLAP systems to maintain the contexts of transactions with the data captured by the respective OLTP system. The paper observes an industry trend in which OLTP systems process information into data that are then stored in databases without the business rules that were used to process them. This necessitates a practice whereby sets of business rules are used to extract, cleanse, transform and load data from disparate OLTP systems into OLAP databases to support the requirements for complex reporting and analytics. These sets of business rules are usually not the same as the business rules used to capture data in particular OLTP systems. The paper argues that differences between the business rules used to interpret these same data sets risk gaps in semantics between information captured by OLTP systems and information recalled through OLAP systems. Literature concerning the modelling of business transaction information as facts with context, as part of the modelling of information systems, was reviewed to identify design trends that are contributing to the design quality of OLTP and OLAP systems. The paper then argues that the quality of OLTP and OLAP systems design has a critical dependency on the capture of facts with associated context, the encoding of facts with contexts into data with business rules, the storage and sourcing of data with business rules, the decoding of data with business rules into facts with context, and the recall of facts with associated contexts.
The paper proposes UBIRQ, a design model to aid the co-design of data and business rules storage for OLTP and OLAP purposes. The proposed design model provides the opportunity for the implementation and use of multi-purpose databases and business rules stores for OLTP and OLAP systems. Such implementations would enable OLTP systems to record and store data together with the executions of business rules, which would allow both OLTP and OLAP systems to query data along with the business rules used to capture them. This would ensure that information recalled via OLAP systems preserves the contexts of transactions as per the data captured by the respective OLTP system.

Relevance: 30.00%

Abstract:

Purpose: To investigate the relationship between research data management (RDM) and data sharing in the formulation of RDM policies and development of practices in higher education institutions (HEIs). Design/methodology/approach: Two strands of work were undertaken sequentially: firstly, content analysis of 37 RDM policies from UK HEIs; secondly, two detailed case studies of institutions with different approaches to RDM based on semi-structured interviews with staff involved in the development of RDM policy and services. The data are interpreted using insights from Actor Network Theory. Findings: RDM policy formation and service development have created a complex set of networks within and beyond institutions involving different professional groups with widely varying priorities shaping activities. Data sharing is considered an important activity in the policies and services of the HEIs studied, but its prominence can in most cases be attributed to the positions adopted by large research funders. Research limitations/implications: The case studies, as research based on qualitative data, cannot be assumed to be universally applicable but do illustrate a variety of issues and challenges experienced more generally, particularly in the UK. Practical implications: The research may help to inform development of policy and practice in RDM in HEIs and funder organisations. Originality/value: This paper makes an early contribution to the RDM literature on the specific topic of the relationship between RDM policy and services, and openness – a topic which to date has received limited attention.

Relevance: 30.00%

Abstract:

This paper seeks to use the increasingly influential citation and impact data to explore the contours of the social and environmental accounting (SEA) literature. Our ambitions are fourfold. First, we offer a more nuanced understanding of the journals in which we tend to publish SEA research. Second, we tease out what might plausibly be thought to be one indication of the ‘most influential’ SEA papers. Third, we offer a substantive cautionary note about the dangers of the careless use of citations as singular measures of ‘quality’ or ‘importance’, etc. Finally, we place the growing SEA literature in a wider context which both flatters and challenges the community that SEAJ seeks to serve.

Relevance: 30.00%

Abstract:

We propose a geoadditive negative binomial model (Geo-NB-GAM) for regional count data that allows us to address simultaneously some important methodological issues, such as spatial clustering, nonlinearities, and overdispersion. This model is applied to the study of location determinants of inward greenfield investments that occurred during 2003–2007 in 249 European regions. After presenting the data set and showing the presence of overdispersion and spatial clustering, we review the theoretical framework that motivates the choice of the location determinants included in the empirical model, and we highlight some reasons why the relationship between some of the covariates and the dependent variable might be nonlinear. The subsequent section first describes the solutions proposed by previous literature to tackle spatial clustering, nonlinearities, and overdispersion, and then presents the Geo-NB-GAM. The empirical analysis shows the good performance of Geo-NB-GAM. Notably, the inclusion of a geoadditive component (a smooth spatial trend surface) permits us to control for spatial unobserved heterogeneity that induces spatial clustering. Allowing for nonlinearities reveals, in keeping with theoretical predictions, that the positive effect of agglomeration economies fades as the density of economic activities reaches some threshold value. However, no matter how dense the economic activity becomes, our results suggest that congestion costs never overcome positive agglomeration externalities.
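The overdispersion that motivates a negative binomial model over a Poisson one can be checked with a variance-to-mean ratio. The sketch below is not the Geo-NB-GAM itself: the function names and the sample counts are made up, and the moment estimate assumes the common NB2 parameterisation, var = mu + mu^2 / theta.

```python
from statistics import mean, variance


def dispersion_index(counts: list[int]) -> float:
    """Sample variance divided by sample mean; about 1 for Poisson-like data."""
    return variance(counts) / mean(counts)


def nb_theta_moments(counts: list[int]) -> float:
    """Method-of-moments estimate of theta, assuming var = mu + mu**2 / theta.

    Only defined when the sample variance exceeds the sample mean.
    """
    mu = mean(counts)
    var = variance(counts)
    if var <= mu:
        raise ValueError("no overdispersion: variance does not exceed mean")
    return mu ** 2 / (var - mu)


# Made-up regional counts with a few large values (not data from the paper).
counts = [0, 1, 0, 2, 15, 0, 3, 40, 1, 0]
index = dispersion_index(counts)
theta = nb_theta_moments(counts)
print(f"dispersion index = {index:.2f}, theta = {theta:.3f}")
```

An index well above 1 suggests that a Poisson model would understate the variance; a smaller theta corresponds to stronger overdispersion.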

Relevance: 30.00%

Abstract:

Recent research suggests Eurasian snow-covered area (SCA) influences the Arctic Oscillation (AO) via the polar vortex. This could be important for Northern Hemisphere winter season forecasting. A fairly strong negative correlation between October SCA and the AO, based on both monthly and daily observational data, has been noted in the literature. While reproducing these previous links when using the same data, we find no further evidence of the link when using an independent satellite data source, or when using a climate model.

Relevance: 30.00%

Abstract:

Clustering methods are increasingly being applied to residential smart meter data, providing a number of important opportunities for distribution network operators (DNOs) to manage and plan low-voltage networks. Clustering has a number of potential advantages for DNOs, including identifying suitable candidates for demand response and improving energy profile modelling. However, due to the high stochasticity and irregularity of household-level demand, detailed analytics are required to define appropriate attributes to cluster. In this paper we present an in-depth analysis of customer smart meter data to better understand peak demand and the major sources of variability in customer behaviour. We find four key time periods in which the data should be analysed and use these to form relevant attributes for our clustering. We present a finite-mixture-model-based clustering in which we discover 10 distinct behaviour groups describing customers based on their demand and their variability. Finally, using an existing bootstrapping technique, we show that the clustering is reliable. To the authors' knowledge, this is the first time in the power systems literature that the sample robustness of the clustering has been tested.

Relevance: 30.00%

Abstract:

The size and complexity of data sets generated within ecosystem-level programmes merit their capture, curation, storage, analysis, synthesis and visualisation using Big Data approaches. This review looks at previous attempts to organise and analyse such data through the International Biological Programme and draws on the mistakes made and the lessons learned for effective Big Data approaches to current Research Councils United Kingdom (RCUK) ecosystem-level programmes, using Biodiversity and Ecosystem Service Sustainability (BESS) and Environmental Virtual Observatory Pilot (EVOp) as exemplars. The challenges raised by such data are identified and explored, and suggestions are made for the two major issues of extending analyses across different spatio-temporal scales and effectively integrating quantitative and qualitative data.

Relevance: 30.00%

Abstract:

We compare measurements of integrated water vapour (IWV) over a subarctic site (Kiruna, Northern Sweden) from five different sensors and retrieval methods: radiosondes, Global Positioning System (GPS), ground-based Fourier-transform infrared (FTIR) spectrometer, ground-based microwave radiometer, and satellite-based microwave radiometer (AMSU-B). Additionally, we compare these with ERA-Interim model reanalysis data. GPS-based IWV data have the highest temporal coverage and resolution and are chosen as the reference data set. All data sets agree reasonably well, although the ground-based microwave instrument does so only if the data are cloud-filtered. We also address two issues that are general to such intercomparison studies: the impact of different lower altitude limits for the IWV integration, and the impact of representativeness error. We develop methods for correcting for the former and for estimating the random error contribution of the latter. A literature survey reveals that reported systematic differences between different techniques are study-dependent and show no overall consistent pattern. Further improving the absolute accuracy of IWV measurements and providing climate-quality time series therefore remain challenging problems.
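The effect of the lower integration limit on IWV can be illustrated with the standard pressure-coordinate definition, IWV = (1/g) ∫ q dp, evaluated with the trapezoidal rule. This is only a sketch: the function name, the constant-gravity assumption and the profile values are made up, and it is not the correction method developed in the paper.

```python
G = 9.80665  # standard gravity in m/s^2, assumed constant with height


def iwv(pressure_pa: list[float], specific_humidity: list[float]) -> float:
    """Integrated water vapour in kg/m^2 via the trapezoidal rule.

    Levels are assumed ordered from the lowest level (highest pressure) upward;
    pressure in Pa, specific humidity in kg/kg.
    """
    total = 0.0
    for i in range(len(pressure_pa) - 1):
        dp = pressure_pa[i] - pressure_pa[i + 1]
        q_mean = 0.5 * (specific_humidity[i] + specific_humidity[i + 1])
        total += q_mean * dp
    return total / G


# Made-up profile (not measurements from Kiruna).
p = [100000.0, 85000.0, 70000.0, 50000.0, 30000.0]
q = [0.004, 0.003, 0.002, 0.0005, 0.0001]

iwv_full = iwv(p, q)            # integration starts at the lowest level
iwv_raised = iwv(p[1:], q[1:])  # integration starts one level higher
print(f"full column: {iwv_full:.2f} kg/m^2, raised lower limit: {iwv_raised:.2f} kg/m^2")
```

Because most water vapour sits in the lowest layers, raising the lower limit by a single level removes a large share of the column in this example, which is why instruments at different altitudes need a correction before they are compared.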

Relevance: 30.00%

Abstract:

The weak-constraint inverse for nonlinear dynamical models is discussed and derived in terms of a probabilistic formulation. The well-known result that for Gaussian error statistics the minimum of the weak-constraint inverse is equal to the maximum-likelihood estimate is rederived. Then several methods based on ensemble statistics that can be used to find the smoother (as opposed to the filter) solution are introduced and compared to traditional methods. A strong point of the new methods is that they avoid the integration of adjoint equations, which is a complex task for real oceanographic or atmospheric applications. They also avoid iterative searches in a Hilbert space, and error estimates can be obtained without much additional computational effort. The feasibility of the new methods is illustrated in a two-layer quasigeostrophic model.
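For orientation, a generic discrete-time form of the weak-constraint cost function is sketched below. The notation is an assumption, not taken from the paper: $x_k$ is the model state at time $k$, $x_b$ the prior initial state, $M$ the nonlinear model, $H$ the observation operator, $y_k$ the observations, and $B$, $Q$, $R$ the covariances of the initial, model and observation errors.

```latex
J(x_0,\dots,x_N) =
  \tfrac{1}{2}\,(x_0 - x_b)^{\mathsf T} B^{-1} (x_0 - x_b)
  + \tfrac{1}{2}\sum_{k=1}^{N} \bigl(x_k - M(x_{k-1})\bigr)^{\mathsf T} Q^{-1} \bigl(x_k - M(x_{k-1})\bigr)
  + \tfrac{1}{2}\sum_{k=0}^{N} \bigl(y_k - H(x_k)\bigr)^{\mathsf T} R^{-1} \bigl(y_k - H(x_k)\bigr)
```

Under Gaussian error statistics the conditional density of the state trajectory given the observations is proportional to $\exp(-J)$, so the trajectory that minimises $J$ is also the one that maximises that density.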

Relevance: 30.00%

Abstract:

Demand for organic milk is partially driven by consumer perceptions that it is more nutritious. However, there is still considerable uncertainty over whether the use of organic production standards affects milk quality. Here we report results of meta-analyses based on 170 published studies comparing the nutrient content of organic and conventional bovine milk. There were no significant differences in total SFA and MUFA concentrations between organic and conventional milk. However, concentrations of total PUFA and n-3 PUFA were significantly higher in organic milk, by an estimated 7 (95 % CI −1, 15) % and 56 (95 % CI 38, 74) %, respectively. Concentrations of α-linolenic acid (ALA), very long-chain n-3 fatty acids (EPA+DPA+DHA) and conjugated linoleic acid were also significantly higher in organic milk, by an estimated 69 (95 % CI 53, 84) %, 57 (95 % CI 27, 87) % and 41 (95 % CI 14, 68) %, respectively. As there were no significant differences in total n-6 PUFA and linoleic acid (LA) concentrations, the n-6:n-3 and LA:ALA ratios were lower in organic milk, by an estimated 71 (95 % CI −122, −20) % and 93 (95 % CI −116, −70) %. It is concluded that organic bovine milk has a more desirable fatty acid composition than conventional milk. Meta-analyses also showed that organic milk has significantly higher α-tocopherol and Fe, but lower I and Se concentrations. Redundancy analysis of data from a large cross-European milk quality survey indicates that the higher grazing/conserved forage intakes in organic systems were the main reason for milk composition differences.

Relevance: 30.00%

Abstract:

Demand for organic meat is partially driven by consumer perceptions that organic foods are more nutritious than non-organic foods. However, there have been no systematic reviews comparing specifically the nutrient content of organic and conventionally produced meat. In this study, we report results of a meta-analysis based on sixty-seven published studies comparing the composition of organic and non-organic meat products. For many nutritionally relevant compounds (e.g. minerals, antioxidants and most individual fatty acids (FA)), the evidence base was too weak for meaningful meta-analyses. However, significant differences in FA profiles were detected when data from all livestock species were pooled. Concentrations of SFA and MUFA were similar or slightly lower, respectively, in organic compared with conventional meat. Larger differences were detected for total PUFA and n-3 PUFA, which were an estimated 23 (95 % CI 11, 35) % and 47 (95 % CI 10, 84) % higher in organic meat, respectively. However, for these and many other composition parameters, for which meta-analyses found significant differences, heterogeneity was high, and this could be explained by differences between animal species/meat types. Evidence from controlled experimental studies indicates that the high grazing/forage-based diets prescribed under organic farming standards may be the main reason for differences in FA profiles. Further studies are required to enable meta-analyses for a wider range of parameters (e.g. antioxidant, vitamin and mineral concentrations) and to improve both precision and consistency of results for FA profiles for all species. Potential impacts of composition differences on human health are discussed.

Relevance: 30.00%

Abstract:

This paper discusses how global financial institutions are using big data analytics within their compliance operations. Much previous research has focused on the strategic implications of big data, but little has considered how such tools are entwined with regulatory breaches and investigations in financial services. Our work covers two in-depth qualitative case studies, each addressing a distinct type of analytics. The first case focuses on analytics which manage everyday compliance breaches and so are expected by managers. The second case focuses on analytics which facilitate investigation and litigation where serious unexpected breaches may have occurred. In doing so, the study focuses on the micro/data to understand how these tools are influencing operational risks and practices. The paper draws on two bodies of literature, the social studies of information systems and of finance, to guide our analysis and practitioner recommendations. The cases illustrate how technologies are implicated in multijurisdictional challenges and regulatory conflicts at each end of the operational risk spectrum. We find that compliance analytics are both shaping and reporting regulatory matters, yet firms often have difficulty recruiting individuals with relevant but diverse skill sets. The cases also underscore the increasing need for financial organizations to adopt robust information governance policies and processes to ease future remediation efforts.

Relevance: 30.00%

Abstract:

Estrogen is a ligand for the estrogen receptor (ER), which, on binding 17beta-estradiol, functions as a ligand-activated transcription factor and regulates the transcription of target genes. This is the slow genomic mode of action. However, rapid non-genomic actions of estrogen also exist at the cell membrane. Using a novel two-pulse paradigm in which the first pulse rapidly initiates non-genomic actions using a membrane-limited estrogen conjugate (E-BSA), while the second pulse promotes genomic transcription from a consensus estrogen response element (ERE), we have demonstrated that rapid actions of estrogen potentiate the slower transcriptional response from an ERE-reporter in neuroblastoma cells. Since rapid actions of estrogen activate kinases, we used selective inhibitors in the two-pulse paradigm to determine the intracellular signaling cascades important in such potentiation. Inhibition of protein kinase A (PKA), PKC, mitogen activated protein kinase (MAPK) or phosphatidylinositol 3-OH kinase (PI-3K) in the first pulse decreases potentiation of transcription. Also, our data with both dominant negative and constitutive mutants of Galpha subunits show that Galpha(q) initiates the rapid signaling cascade at the membrane in SK-N-BE(2)C neuroblastoma cells. We discuss two models of multiple kinase activation at the membrane. Pulses of estrogen induce lordosis behavior in female rats. Infusion of E-BSA into the ventromedial hypothalamus followed by 17beta-estradiol in the second pulse could induce lordosis behavior, demonstrating the applicability of this paradigm in vivo. A model where non-genomic actions of estrogen couple to genomic actions unites both aspects of hormone action.