17 resultados para modeling of data sources

em Publishing Network for Geoscientific


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Visual cluster analysis provides valuable tools that help analysts to understand large data sets in terms of representative clusters and relationships thereof. Often, the found clusters are to be understood in context of belonging categorical, numerical or textual metadata which are given for the data elements. While often not part of the clustering process, such metadata play an important role and need to be considered during the interactive cluster exploration process. Traditionally, linked-views allow to relate (or loosely speaking: correlate) clusters with metadata or other properties of the underlying cluster data. Manually inspecting the distribution of metadata for each cluster in a linked-view approach is tedious, specially for large data sets, where a large search problem arises. Fully interactive search for potentially useful or interesting cluster to metadata relationships may constitute a cumbersome and long process. To remedy this problem, we propose a novel approach for guiding users in discovering interesting relationships between clusters and associated metadata. Its goal is to guide the analyst through the potentially huge search space. We focus in our work on metadata of categorical type, which can be summarized for a cluster in form of a histogram. We start from a given visual cluster representation, and compute certain measures of interestingness defined on the distribution of metadata categories for the clusters. These measures are used to automatically score and rank the clusters for potential interestingness regarding the distribution of categorical metadata. Identified interesting relationships are highlighted in the visual cluster representation for easy inspection by the user. We present a system implementing an encompassing, yet extensible, set of interestingness scores for categorical metadata, which can also be extended to numerical metadata. Appropriate visual representations are provided for showing the visual correlations, as well as the calculated ranking scores. Focusing on clusters of time series data, we test our approach on a large real-world data set of time-oriented scientific research data, demonstrating how specific interesting views are automatically identified, supporting the analyst discovering interesting and visually understandable relationships.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A critical problem in radiocarbon dating is the spatial and temporal variability of marine reservoir ages (MRAs). We assessed the MRA evolution during the last deglaciation by numerical modeling, applying a self-consistent iteration scheme in which an existing radiocarbon chronology (derived by Hughen et al., Quat. Sci. Rev., 25, pp. 3216-3227, 2006) was readjusted by transient, 3-D simulations of marine and atmospheric Delta14C. To estimate the uncertainties regarding the ocean ventilation during the last deglaciation, we considered various ocean overturning scenarios which are based on different climatic background states (PD: modern climate, GS: LGM climate conditions). Minimum and maximum MRAs are included in file 'MRAminmax_21-14kaBP.nc'. Three further files include MRAs according to equilibrium simulations of the preindustrial ocean (file 'C14age_preindustrial.nc'; this is an update of our results published in 2005) and of the glacial ocean (files 'C14age_spinupLGM_GS.nc' and 'C14age_spinupLGM_PD.nc').

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Today's digital libraries (DLs) archive vast amounts of information in the form of text, videos, images, data measurements, etc. User access to DL content can rely on similarity between metadata elements, or similarity between the data itself (content-based similarity). We consider the problem of exploratory search in large DLs of time-oriented data. We propose a novel approach for overview-first exploration of data collections based on user-selected metadata properties. In a 2D layout representing entities of the selected property are laid out based on their similarity with respect to the underlying data content. The display is enhanced by compact summarizations of underlying data elements, and forms the basis for exploratory navigation of users in the data space. The approach is proposed as an interface for visual exploration, leading the user to discover interesting relationships between data items relying on content-based similarity between data items and their respective metadata labels. We apply the method on real data sets from the earth observation community, showing its applicability and usefulness.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The exponential growth of studies on the biological response to ocean acidification over the last few decades has generated a large amount of data. To facilitate data comparison, a data compilation hosted at the data publisher PANGAEA was initiated in 2008 and is updated on a regular basis (doi:10.1594/PANGAEA.149999). By January 2015, a total of 581 data sets (over 4 000 000 data points) from 539 papers had been archived. Here we present the developments of this data compilation five years since its first description by Nisumaa et al. (2010). Most of study sites from which data archived are still in the Northern Hemisphere and the number of archived data from studies from the Southern Hemisphere and polar oceans are still relatively low. Data from 60 studies that investigated the response of a mix of organisms or natural communities were all added after 2010, indicating a welcomed shift from the study of individual organisms to communities and ecosystems. The initial imbalance of considerably more data archived on calcification and primary production than on other processes has improved. There is also a clear tendency towards more data archived from multifactorial studies after 2010. For easier and more effective access to ocean acidification data, the ocean acidification community is strongly encouraged to contribute to the data archiving effort, and help develop standard vocabularies describing the variables and define best practices for archiving ocean acidification data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This data set contains a time series of plant height measurements (vegetative and reproductive) from the main experiment plots of a large grassland biodiversity experiment (the Jena Experiment; see further details below). In addition, data on species specific plant heights for the main experiment are available from 2002. In the main experiment, 82 grassland plots of 20 x 20 m were established from a pool of 60 species belonging to four functional groups (grasses, legumes, tall and small herbs). In May 2002, varying numbers of plant species from this species pool were sown into the plots to create a gradient of plant species richness (1, 2, 4, 8, 16 and 60 species) and functional richness (1, 2, 3, 4 functional groups). Plots were maintained by bi-annual weeding and mowing. 1. Plant height was recorded, generally, twice a year just before biomass harvest (during peak standing biomass in late May and in late August). Methodologies of measuring height have varied somewhat over the years. In earlier year the streched plant height was measured, while in later years the standing height without streching the plant was measured. Vegetative height was measured either as the height of the highest leaf or as the length of the main axis of non-flowering plants. Regenerating height was measured either as the height of the highest flower on a plant or as the height of the main axis of flowering. Sampled plants were either randomly selected in the core area of plots or along transects in defined distances. For details refer to the description of individual years. Starting in 2006, also the plots of the management experiment, that altered mowing frequency and fertilized subplots (see further details in the general description of the Jena Experiment) were sampled. 2. Species specific plant height was recorded two times in 2002: in late July (vegetative height) and just before biomass harvest during peak standing biomass in late August (vegetative and regenerative height). For each plot and each sown species in the species pool, 3 plant individuals (if present) from the central area of the plots were randomly selected and used to measure vegetative height (non-flowering indviduals) and regenerative height (flowering individuals) as stretched height. Provided are the means over the three measuremnts per plant species per plot.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This data set contains three time series of measurements of soil carbon (particular and dissolved) from the main experiment plots of a large grassland biodiversity experiment (the Jena Experiment; see further details below). In the main experiment, 82 grassland plots of 20 x 20 m were established from a pool of 60 species belonging to four functional groups (grasses, legumes, tall and small herbs). In May 2002, varying numbers of plant species from this species pool were sown into the plots to create a gradient of plant species richness (1, 2, 4, 8, 16 and 60 species) and functional richness (1, 2, 3, 4 functional groups). Plots were maintained by bi-annual weeding and mowing. 1. Particulate soil carbon: Stratified soil sampling was performed every two years since before sowing in April 2002 and was repeated in April 2004, 2006 and 2008 to a depth of 30 cm segmented to a depth resolution of 5 cm giving six depth subsamples per core. Total carbon concentration was analyzed on ball-milled subsamples by an elemental analyzer at 1150°C. Inorganic carbon concentration was measured by elemental analysis at 1150°C after removal of organic carbon for 16 h at 450°C in a muffle furnace. Organic carbon concentration was calculated as the difference between both measurements of total and inorganic carbon. 2. Particulate soil carbon (high intensity sampling): In one block of the Jena Experiment soil samples were taken to a depth of 1 m (segmented to a depth resolution of 5 cm giving 20 depth subsamples per core) with three replicates per block ever 5 years starting before sowing in April 2002. Samples were processed as for the more frequent sampling. 3. Dissolved organic carbon: Suction plates installed on the field site in 10, 20, 30 and 60 cm depth were used to sample soil pore water. Cumulative soil solution was sampled biweekly and analyzed for dissolved organic carbon concentration by a high TOC elemental analyzer. Annual mean values of DOC are provided.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This data set contains two time series of measurements of dissolved phosphorus (organic, inorganic and total with a biweekly resolution) and dissolved inorganic phosphorus with a seasonal resolution. In addition, data on phosphorus from soil samples measured in 2007 and fractionated by different acid-extrations (Hedley fractions) are provided. All data measured at the main experiment plots of a large grassland biodiversity experiment (the Jena Experiment; see further details below). In the main experiment, 82 grassland plots of 20 x 20 m were established from a pool of 60 species belonging to four functional groups (grasses, legumes, tall and small herbs). In May 2002, varying numbers of plant species from this species pool were sown into the plots to create a gradient of plant species richness (1, 2, 4, 8, 16 and 60 species) and functional richness (1, 2, 3, 4 functional groups). Plots were maintained by bi-annual weeding and mowing. 1. Dissolved phosphorus in soil solution: Suction plates installed on the field site in 10, 20, 30 and 60 cm depth were used to sample soil pore water. Cumulatively extracted soil solution was collected every two weeks from October 2002 to May 2006. The biweekly samples from 2002, 2003 and 2004 were analyzed for dissolved organic phosphorus (DOP), dissolved inorganic phosphorus (PO4P) and dissolved total phosphorus (TDP) by Continuous Flow Analyzer (CFA SAN ++, SKALAR [Breda, The Netherlands]). 2. Seasonal values of dissolved inorganic phosphorus in soil solution were calculated as volume-weighted mean values of the biweekly measurements (spring = March to May, summer = June to August, fall = September to November, winter = December to February). 3. Phosphorus fractions in soil: Five independent soil samples per plot were taken in a depth of 0-15 cm using a soil corer with an inner diameter of 1 cm. The five samples per plot were combined to one composite sample per plot. A four-step sequential P fractionation (Hedley fractions) was applied and concentrations of P fractions in soil were measured photometrically (molybdenum blue-reactive P) with a Continuous Flow Analyzer (Bran&Luebbe, Germany).