48 results for Open Research Data
in Publishing Network for Geoscientific
                                
                                
Abstract:
This thesis is the first part of a reference book on the Tertiary flora of Saxony. All taxa based on leaves of angiosperms and on Ginkgo are included in this compendium. After an overview of the state of geological knowledge of the Tertiary in Saxony, phytostratigraphic concepts are introduced and a historical survey of Tertiary paleobotanical research in Saxony is given. All plant macrofossils published from the Saxonian Tertiary until the end of 2013 and their sites of discovery (primary data) were recorded. These data were supplemented by additional attributes and unified through project-based M.Sc. theses. Subsequently, taxa of fossil leaves were selected, and their data were evaluated and brought to a consistent state of research. Data sheets for 187 of the 235 examined taxa were prepared for a determination atlas. Macro- and micromorphological attributes are described in this atlas, and information is given on systematics, synonymy, palaeoecology, and spatial and temporal distribution. The descriptive part is illustrated by images and instructive drawings. In the results part, the documented data are surveyed and their quality within the literature is discussed. A bibliography of the extensive palaeobotanical literature on plant fossils of Saxony completes the work. The taxon- and locality-related data are implemented in an open-source geographical information system (GIS) in order to visualize and manage them effectively. For the first time, the results of this thesis, as implemented in the GIS, allow the generation of distribution maps for the leaf taxa of Tertiary angiosperms and Ginkgo in Saxony. Furthermore, the GIS makes it possible to query topographical, geological and paleobotanical information about the fossil sites. A determination key was developed for the fossil material that allows a rough determination of finds in the field. The compendium will be available for free use in both a printed and a digital version.
                                
Abstract:
Increasing amounts of data are collected in most areas of research and application. The degree to which these data can be accessed, analyzed, and retrieved is decisive for progress in fields such as scientific research or industrial production. We present a novel methodology supporting content-based retrieval and exploratory search in repositories of multivariate research data. In particular, our methods are able to describe two-dimensional functional dependencies in research data, e.g. the relationship between inflation and unemployment in economics. Our basic idea is to use feature vectors based on the goodness-of-fit of a set of regression models to describe the data mathematically. We call this approach Regressional Features and use it for content-based search and, since our approach motivates an intuitive definition of interestingness, for exploring the most interesting data. We apply our method to sizeable real-world research datasets, showing the usefulness of our approach for user-centered access to research data in a Digital Library system.
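The feature-vector idea in the abstract above can be illustrated with a minimal sketch. The abstract does not list which regression models are used or how goodness-of-fit is measured, so the four model families below, the use of R² in the transformed space, and all function names are assumptions made for illustration only. The sketch also assumes strictly positive x and y values so that the logarithms are defined.

```python
import math

def r_squared(xs, ys):
    """R^2 of the least-squares line ys ~ a + b * xs (squared correlation)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        return 0.0  # a constant series carries no fit information
    return (sxy * sxy) / (sxx * syy)

# Hypothetical model set: each model is a pair of transforms (on x, on y)
# that turns the model into a straight line.
MODELS = {
    "linear": (lambda x: x, lambda y: y),       # y = a + b*x
    "logarithmic": (math.log, lambda y: y),     # y = a + b*ln(x)
    "exponential": (lambda x: x, math.log),     # ln(y) = a + b*x
    "power": (math.log, math.log),              # ln(y) = a + b*ln(x)
}

def regression_features(xs, ys):
    """Return one goodness-of-fit value per model, in MODELS order."""
    features = []
    for tx, ty in MODELS.values():
        features.append(r_squared([tx(x) for x in xs], [ty(y) for y in ys]))
    return features
```

Two bivariate data sets could then be compared by a distance between their feature vectors, which is one plausible way to support content-based search.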
                                
Abstract:
Visual cluster analysis provides valuable tools that help analysts to understand large data sets in terms of representative clusters and the relationships between them. Often, the clusters found are to be understood in the context of associated categorical, numerical or textual metadata given for the data elements. While often not part of the clustering process, such metadata play an important role and need to be considered during the interactive cluster exploration process. Traditionally, linked views allow the analyst to relate (or loosely speaking: correlate) clusters with metadata or other properties of the underlying cluster data. Manually inspecting the distribution of metadata for each cluster in a linked-view approach is tedious, especially for large data sets, where a large search problem arises. A fully interactive search for potentially useful or interesting cluster-to-metadata relationships may be a cumbersome and long process. To remedy this problem, we propose a novel approach for guiding users in discovering interesting relationships between clusters and associated metadata. Its goal is to guide the analyst through the potentially huge search space. We focus in our work on metadata of categorical type, which can be summarized for a cluster in the form of a histogram. We start from a given visual cluster representation and compute certain measures of interestingness defined on the distribution of metadata categories for the clusters. These measures are used to automatically score and rank the clusters for potential interestingness regarding the distribution of categorical metadata. Identified interesting relationships are highlighted in the visual cluster representation for easy inspection by the user. We present a system implementing an encompassing, yet extensible, set of interestingness scores for categorical metadata, which can also be extended to numerical metadata. Appropriate visual representations are provided for showing the visual correlations, as well as the calculated ranking scores. Focusing on clusters of time series data, we test our approach on a large real-world data set of time-oriented scientific research data, demonstrating how specific interesting views are automatically identified, supporting the analyst in discovering interesting and visually understandable relationships.
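As an illustration of scoring and ranking clusters by their categorical metadata histograms, the following sketch uses one possible measure. The abstract does not name its interestingness scores, so the entropy-based purity score, the function names, and the input layout (a mapping from cluster name to a list of category labels) are all assumptions.

```python
import math
from collections import Counter

def purity_score(labels, n_categories):
    """1 minus the normalized Shannon entropy of the category histogram.

    Returns 1.0 when a cluster holds a single category and 0.0 when all
    n_categories are equally frequent. This is an assumed measure, not
    one taken from the abstract.
    """
    n = len(labels)
    if n == 0 or n_categories <= 1:
        return 0.0
    counts = Counter(labels)  # histogram of categories in this cluster
    entropy = -sum((c / n) * math.log(c / n) for c in counts.values())
    return 1.0 - entropy / math.log(n_categories)

def rank_clusters(clusters):
    """Score every cluster and return (name, score) pairs, best first."""
    all_categories = {label for labels in clusters.values() for label in labels}
    scored = [
        (name, purity_score(labels, len(all_categories)))
        for name, labels in clusters.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

A system of the kind described could highlight the top-ranked clusters in the visual representation, leaving the analyst to inspect only those histograms.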
                                
Abstract:
The data collection "Deep Drilling of Glaciers: Soviet-Russian projects in Arctic, 1975-1995" was compiled according to the following basic considerations:
- compilation of deep (>100 m) drilling projects on Arctic glaciers, using data from (a) publications; (b) archives of IGRAN; (c) personal communication with project participants;
- documentation of parameters and references. The accuracy of the data and the techniques applied to determine different parameters are not evaluated. The accuracy of some geochemical parameters (up to 1984 and heavy metals) is uncertain. Most reconstructions of ice core age and of annual layer thickness are discussed;
- digitizing of published diagrams (in cases where the original numerical data were lost) and subsequent conversion of the data to equal-range series and adjustment to common units.
Therefore, the equal-range series were calculated from original data or converted from digitized chart values, as indicated in the metadata. For methodological purposes, the equal-range series obtained from original and reconstructed data were compared repeatedly; the systematic difference was less than 5-7%. Special attention should be given to the fact that the data for individual ice core parameters vary, because some parameters were originally measured or registered. Parameters were converted into equal-range series using 2 m steps:
- if two or more parameter values were determined, then the weighted mean (i.e. weighted by sample length) is assigned to the entire interval;
- if one parameter value was determined, measured or registered independently of the parameter values in the depth intervals that overlie and underlie it, then the value is assigned to the entire interval;
- if one parameter value was determined, measured or registered for two adjoining depth intervals, then the value is assigned to the depth interval that represents >75% of the sample length; if each of the adjoining depth intervals represents <75% of the sample length, then the corresponding parameter value is assigned to both depth intervals.
This collection of ice core data (version 2000) was made available through the EU funded QUEEN project by S.M. Arkhipov, Moscow.
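The conversion rules for the 2 m equal-range series described above can be sketched as follows. This is an illustrative reading of the abstract, not the original processing code: the sample layout (top depth, bottom depth, value), the function name, and the choice to weight every assigned sample by its full length are assumptions.

```python
import math
from collections import defaultdict

STEP = 2.0  # interval length in metres, as stated in the abstract

def to_equal_range(samples, step=STEP, dominant_share=0.75):
    """Convert (top_m, bottom_m, value) samples to a fixed-step series.

    A sample lying in one interval counts there. A sample spanning
    adjoining intervals counts only in the interval holding more than
    dominant_share of its length, or in all of them if none does.
    Values sharing an interval are averaged, weighted by sample length.
    """
    sums = defaultdict(float)
    weights = defaultdict(float)
    for top, bottom, value in samples:
        length = bottom - top
        if length <= 0:
            continue  # skip malformed samples
        first = int(math.floor(top / step))
        last = int(math.ceil(bottom / step)) - 1
        shares = {}
        for i in range(first, last + 1):
            overlap = min(bottom, (i + 1) * step) - max(top, i * step)
            if overlap > 0:
                shares[i] = overlap / length
        dominant = [i for i, share in shares.items() if share > dominant_share]
        targets = dominant if dominant else list(shares)
        for i in targets:
            sums[i] += value * length   # assumed weight: full sample length
            weights[i] += length
    return {
        (i * step, (i + 1) * step): sums[i] / weights[i]
        for i in sorted(sums)
    }
```

The result maps each depth interval, given as a (top, bottom) pair in metres, to its assigned value; intervals without any sample are simply absent.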
                                
                                
Abstract:
The analysis of research data plays a key role in data-driven areas of science. A wide variety of mixed research data sets exists, and scientists aim to derive or validate hypotheses to find undiscovered knowledge. Many analysis techniques identify relations across an entire dataset only. This may level out the characteristic behavior of different subgroups in the data. As in automatic subspace clustering, we aim at identifying interesting subgroups and attribute sets. We present a visual-interactive system that supports scientists in exploring interesting relations between aggregated bins of multivariate attributes in mixed data sets. The abstraction of data to bins enables the application of statistical dependency tests as the measure of interestingness. An overview matrix view shows all attributes, ranked with respect to the interestingness of bins. In addition, a node-link view reveals multivariate bin relations by positioning dependent bins close to each other. The system supports information drill-down based on both expert knowledge and algorithmic support. Finally, visual-interactive subset clustering assigns multivariate bin relations to groups. A list-based cluster result representation enables the scientist to communicate multivariate findings at a glance. We demonstrate the applicability of the system with two case studies from the earth observation domain and the prostate cancer research domain. In both cases, the system enabled us to identify the most interesting multivariate bin relations, to validate already published results, and, moreover, to discover unexpected relations.
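To make the idea of a dependency test between two bins concrete, here is a minimal sketch. The abstract does not say which statistical tests the system applies, so the Pearson chi-square statistic on a 2x2 table, the half-open bin definition, and the function names are assumptions for illustration.

```python
def bin_membership(values, low, high):
    """Flag, per record, whether its value falls in the bin [low, high)."""
    return [low <= v < high for v in values]

def chi_square_2x2(in_a, in_b):
    """Pearson chi-square statistic for the dependency of two bins.

    in_a and in_b are equal-length lists of booleans stating whether
    each record falls in bin A (of one attribute) and bin B (of another).
    A larger value indicates a stronger dependency between the bins.
    """
    n = len(in_a)
    n11 = sum(1 for a, b in zip(in_a, in_b) if a and b)
    n10 = sum(1 for a, b in zip(in_a, in_b) if a and not b)
    n01 = sum(1 for a, b in zip(in_a, in_b) if not a and b)
    n00 = n - n11 - n10 - n01
    row1, row0 = n11 + n10, n01 + n00
    col1, col0 = n11 + n01, n10 + n00
    denominator = row1 * row0 * col1 * col0
    if denominator == 0:
        return 0.0  # an empty or full bin cannot show a dependency
    return n * (n11 * n00 - n10 * n01) ** 2 / denominator
```

Computing this statistic for every pair of bins would give a matrix of scores that could drive a ranking or a layout in which dependent bins are placed close together.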
                                
                                
Abstract:
Today's digital libraries (DLs) archive vast amounts of information in the form of text, videos, images, data measurements, etc. User access to DL content can rely on similarity between metadata elements, or similarity between the data itself (content-based similarity). We consider the problem of exploratory search in large DLs of time-oriented data. We propose a novel approach for overview-first exploration of data collections based on user-selected metadata properties. In a 2D layout, entities of the selected property are arranged based on their similarity with respect to the underlying data content. The display is enhanced by compact summarizations of the underlying data elements and forms the basis for exploratory navigation of users in the data space. The approach is proposed as an interface for visual exploration, leading the user to discover interesting relationships between data items, relying on content-based similarity between data items and their respective metadata labels. We apply the method to real data sets from the earth observation community, showing its applicability and usefulness.
                                
                                
Abstract:
We present 30 new planktonic foraminiferal census data sets of surface sediment samples from the South China Sea, recovered between 630 and 2883 m water depth. These new data, together with the 131 earlier published data sets from the western Pacific, are used for calibrating the SIMMAX-28 transfer function to estimate past sea-surface temperatures. This regional SIMMAX method offers a slightly better understanding of the marginal sea conditions of the South China Sea than the linear transfer function FP-12E, which is based only on open-ocean data. However, both methods are biased toward the tropical temperature regime because of the very limited data from temperate to subpolar regions. The SIMMAX formula was applied to sediment core 17940 from the northeastern South China Sea, with sedimentation rates of 20-80 cm/ka. Results revealed nearly unchanged summer temperatures around 28°C for the last 30 ky, while winter temperatures varied between 19.5°C in the last glacial maximum and 26°C during the Holocene. During Termination 1A, the winter estimates show a Younger Dryas cooling by 3°C subsequent to a temperature optimum of 24°C during the Bölling-Alleröd. Estimates of winter temperature differences between 0 and 100 m water depth document the seasonal variations in the thickness of the mixed layer and provide a new proxy for estimating past changes in the strength of the winter monsoon.
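The abstract does not give the SIMMAX-28 formula, so the following is only a generic modern-analog sketch of how census data can be turned into a temperature estimate: a fossil assemblage is compared with calibrated surface samples, and the temperatures of the most similar ones are averaged with similarity weights. The cosine similarity, the number of analogs `k`, the function names, and the example numbers in the usage note are assumptions, and any distance weighting a real transfer function might apply is omitted.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two species-count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def estimate_sst(sample, reference, k=3):
    """Similarity-weighted mean temperature of the k best analogs.

    sample is a list of species counts for the fossil assemblage;
    reference is a list of (species_counts, sea_surface_temperature)
    pairs from calibrated surface samples (hypothetical layout).
    """
    scored = sorted(
        ((cosine_similarity(sample, counts), sst) for counts, sst in reference),
        reverse=True,
    )[:k]
    total = sum(similarity for similarity, _ in scored)
    if total == 0:
        raise ValueError("no reference sample resembles the input")
    return sum(similarity * sst for similarity, sst in scored) / total
```

With made-up reference data such as `[([80, 15, 5], 28.0), ([10, 30, 60], 20.0)]`, a sample identical to the first entry and `k=1` yields an estimate of about 28 °C.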
                                
Abstract:
Data on amphibians, reptiles and birds surveyed from February 2016 to May 2016 in the UNESCO Sheka forest biosphere reserve are provided as an online open access data file.
                                
Abstract:
In 2008, the 50th anniversary of the IGY (International Geophysical Year), WDCMARE presents with this CD publication 3632 data sets in Open Access as part of the most important results from 73 cruises of the research vessel METEOR between 1964 and 1985. The archive is a coherently organized collection of published and unpublished data sets produced by scientists of all marine research disciplines who participated in Meteor expeditions, measured environmental parameters during cruises and investigated sample material after the cruises in the labs of the participating institutions. In most cases, the data were gathered from the Meteor Forschungsergebnisse, published by the Deutsche Forschungsgemeinschaft (DFG). A second important data source is the set of time series and radiosonde ascents from more than 20 years of ship's weather observations, which were provided by the Deutscher Wetterdienst, Hamburg. The final inclusion of all data in the PANGAEA information system ensures secure archiving, future updates, and widespread distribution in electronic, machine-readable form with long-term access via the Internet. To produce this publication, all data sets with metadata were extracted from PANGAEA and organized in a directory structure on a CD together with a search capability.
                                
                                
 
                    