18 resultados para multivariate binary data

em Publishing Network for Geoscientific


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Increasing amounts of data is collected in most areas of research and application. The degree to which this data can be accessed, analyzed, and retrieved, is a decisive in obtaining progress in fields such as scientific research or industrial production. We present a novel methodology supporting content-based retrieval and exploratory search in repositories of multivariate research data. In particular, our methods are able to describe two-dimensional functional dependencies in research data, e.g. the relationship between ination and unemployment in economics. Our basic idea is to use feature vectors based on the goodness-of-fit of a set of regression models to describe the data mathematically. We denote this approach Regressional Features and use it for content-based search and, since our approach motivates an intuitive definition of interestingness, for exploring the most interesting data. We apply our method on considerable real-world research datasets, showing the usefulness of our approach for user-centered access to research data in a Digital Library system.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This collection contains measurements of abundance and diversity of different groups of aboveground invertebrates sampled on the plots of the different sub-experiments at the field site of a large grassland biodiversity experiment (the Jena Experiment; see further details below). In the main experiment, 82 grassland plots of 20 x 20 m were established from a pool of 60 species belonging to four functional groups (grasses, legumes, tall and small herbs). In May 2002, varying numbers of plant species from this species pool were sown into the plots to create a gradient of plant species richness (1, 2, 4, 8, 16 and 60 species) and functional richness (1, 2, 3, 4 functional groups). Plots were maintained by bi-annual weeding and mowing. The following series of datasets are contained in this collection: 1. Measurements of ant abundance (number of individuals attracted to baits) and ant occurrence (binary data) in the Main Experiment in 2006 and 2013. Ants where sampled using two types of baited traps receiving ~10g of Tuna or ~10g of honey/Sucrose. After 30min the occurrence (presence = 1 / absence = 0) and abundance (number) of ants at the two types of baits was recorded and pooled per plot.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This data set contains measurements of ant abundance (number of individuals observed at the baits) and ant occurrence (binary data) measured in the Main Experiment plots of a large grassland biodiversity experiment (the Jena Experiment; see further details below). Ants were sampled in 80 plots of the Main Experiment using baited traps in July 2006. In each plot two petri dishes were set on the ground, one received ~10g of Tuna the other ~10g of sugar (Sucrose). After 30min the occurrence (presence = 1 / absence = 0) and abundance (number) of ants at the two baits was recorded. Given is, per plot, the sum of ants attracted to the two different baits. In the Main Experiment, 82 grassland plots of 20 x 20 m were established from a pool of 60 species belonging to four functional groups (grasses, legumes, tall and small herbs). In May 2002, varying numbers of plant species from this species pool were sown in the plots to create a gradient of plant species richness (1, 2, 4, 8, 16 and 60 species) and functional richness (1, 2, 3, or 4 functional groups). Plots were maintained by bi-annual weeding and mowing.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This data set contains measurements of ant abundance (number of individuals attracted to baits) and ant occurrence (binary data) measured in the Main Experiment plots of a large grassland biodiversity experiment (the Jena Experiment; see further details below). In the Main Experiment, 82 grassland plots of 20 x 20 m were established from a pool of 60 species belonging to four functional groups (grasses, legumes, tall and small herbs). In May 2002, varying numbers of plant species from this species pool were sown in the plots to create a gradient of plant species richness (1, 2, 4, 8, 16 and 60 species) and functional richness (1, 2, 3, or 4 functional groups). Plots were maintained by bi-annual weeding and mowing. Ants where sampled in 80 plots of the Main Experiment using baited traps end of July/ beginning of August 2013. Sampling took place 36 days after the end of a major flooding of the field site that lasted for several weeks (see DOI flood descriptor). In each plot two petri dishes were set on the ground, one received ~10g of Tuna the other ~10g of Honey. After 30min the occurrence (presence = 1 / absence = 0) and abundance (number) of ants at the two baits was recorded. Given is, per plot, the sum of ants attracted to the two different baits.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The reduction in sea ice along the SE Greenland coast during the last century has severely impacted ice-rafting to this area. In order to reconstruct ice-rafting and oceanographic conditions in the area of Denmark Strait during the last ~150 years, we conducted a multiproxy study on three short (20 cm) sediment cores from outer Kangerdlugssuaq Trough (~300 m water depth). The proxy-based data obtained have been compared with historical and instrumental data to gain a better understanding of the ice sheet-ocean interactions in the area. A robust chronology has been developed based on 210Pb and 137Cs measurements on core PO175GKC#9 (~66.2°N, 32°W) and expanded to the two adjacent cores based on correlations between calcite weight percent records. Our proxy records include sea-ice and phytoplankton biomarkers, and a variety of mineralogical determinations based on the <2 mm sediment fraction, including identification with quantitative x-ray diffraction, ice-rafted debris counts on the 63-150 µm sand fraction, and source identifications based on the composition of Fe oxides in the 45-250 µm fraction. A multivariate statistical analysis indicated significant correlations between our proxy records and historical data, especially with the mean annual temperature data from Stykkishólmur (Iceland) and the storis index (historical observations of sea-ice export via the East Greenland Current). In particular, the biological proxies (calcite weight percent, IP25, and total organic carbon %) showed significant linkage with the storis index. Our records show two distinct intervals in the recent history of the SE Greenland coast. The first of these (ad 1850-1910) shows predominantly perennial sea-ice conditions in the area, while the second (ad 1910-1990) shows more seasonally open water conditions.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The operator effect is a well-known methodological bias already quantified in some taphonomic studies. However, the replicability effect, i.e., the use of taphonomic attributes as a replicable scientific method, has not been taken into account to the present. Here, we quantified for the first time this replicability bias using different multivariate statistical techniques, testing if the operator effect is related to the replicability effect. We analyzed the results reported by 15 operators working on the same dataset. Each operator analyzed 30 biological remains (bivalve shells) from five different sites, considering the attributes fragmentation, edge rounding, corrasion, bioerosion and secondary color. The operator effect followed the same pattern reported in previous studies, characterized by a worse correspondence for those attributes having more than two levels of damage categories. However, the effect did not appear to have relation with the replicability effect, because nearly all operators found differences among sites. Despite the binary attribute bioerosion exhibited 83% of correspondence among operators it was the taphonomic attributes that showed the highest dispersion among operators (28%). Therefore, we conclude that binary attributes (despite showing a reduction of the operator effect) diminish replicability, resulting in different interpretations of concordant data. We found that a variance value of nearly 8% among operators, was enough to generate a different taphonomic interpretation, in a Q-mode cluster analysis. The results reported here showed that the statistical method employed influences the level of replicability and comparability of a study and that the availability of results may be a valid alternative to reduce bias.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A simple method for efficient inversion of arbitrary radiative transfer models for image analysis is presented. The method operates by representing the shape of the function that maps model parameters to spectral reflectance by an adaptive look-up tree (ALUT) that evenly distributes the discretization error of tabulated reflectances in spectral space. A post-processing step organizes the data into a binary space partitioning tree that facilitates an efficient inversion search algorithm. In an example shallow water remote sensing application, the method performs faster than an implementation of previously published methodology and has the same accuracy in bathymetric retrievals. The method has no user configuration parameters requiring expert knowledge and minimizes the number of forward model runs required, making it highly suitable for routine operational implementation of image analysis methods. For the research community, straightforward and robust inversion allows research to focus on improving the radiative transfer models themselves without the added complication of devising an inversion strategy.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Sites 1085, 1086 and 1087 were drilled off South Africa during Ocean Drilling Program (ODP) Leg 175 to investigate the Benguela Current System. While previous studies have focused on reconstructing the Neogene palaeoceanographic and palaeoclimatic history of these sites, palynology has been largely ignored, except for the Late Pliocene and Quaternary. This study presents palynological data from the upper Middle Miocene to lower Upper Pliocene sediments in Holes 1085A, 1086A and 1087C that provide complementary information about the history of the area. Abundant and diverse marine palynomorphs (mainly dinoflagellate cysts), rare spores and pollen, and dispersed organic matter have been recovered. Multivariate statistical analysis of dispersed organic matter identified three palynofacies assemblages (A, B, C) in the most continuous hole (1085A), and they were defined primarily by amorphous organic matter (AOM), and to a lesser extent black debris, structured phytoclasts, degraded phytoclasts, and marine palynomorphs. Ecostratigraphic interpretation based on dinoflagellate cyst, spore-pollen and palynofacies data allowed us to identify several palaeoceanographic and palaeoclimatic signals. First, the late Middle Miocene was subtropical, and sediments contained the highest percentages of land-derived organic matter, even though they are rich in AOM (palynofacies assemblage A). Second, the Late Miocene was cool-temperate and characterized by periods of intensified upwelling, increase in productivity, abundant and diverse oceanic dinoflagellate cysts, and the highest percentages of AOM (palynofacies assemblage C). Third, the Early to early Late Pliocene was warm-temperate with some dry intervals (increase in grass pollen) and intensified upwelling. Fourth, the Neogene "carbonate crash" identified in other southern oceans was recognized in two palynofacies A samples in Hole 1085A that are nearly barren of dinoflagellate cysts: one Middle Miocene sample (590 mbsf, 13.62 Ma) and one Upper Miocene sample (355 mbsf, 6.5 Ma). Finally, the extremely low percentages of pollen suggest sparse vegetation on the adjacent landmass, and Namib desert conditions were already in existence during the late Middle Miocene.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Global Ocean Sampling (GOS) expedition is currently the largest and geographically most comprehensive metagenomic dataset, including samples from the Atlantic, Pacific, and Indian Oceans. This study makes use of the wide range of environmental conditions and habitats encompassed within the GOS sites in order to investigate the ecological structuring of bacterial and archaeal taxon ranks. Community structures based on taxonomically classified 16S ribosomal RNA (rRNA) gene fragments at phylum, class, order, family, and genus rank levels were examined using multivariate statistical analysis, and the results were inspected in the context of oceanographic environmental variables and structured habitat classifications. At all taxon rank levels, community structures of neritic, oceanic, estuarine biomes, as well as other exotic biomes (salt marsh, lake, mangrove), were readily distinguishable from each other. A strong structuring of the communities with chlorophyll a concentration and a weaker yet significant structuring with temperature and salinity were observed. Furthermore, there were significant correlations between community structures and habitat classification. These results were used for further investigation of one-to-one relationships between taxa and environment and provided indications for ecological preferences shaped by primary production for both cultured and uncultured bacterial and archaeal clades.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The relative paleointensity (RPI) method assumes that the intensity of post depositional remanent magnetization (PDRM) depends exclusively on the magnetic field strength and the concentration of the magnetic carriers. Sedimentary remanence is regarded as an equilibrium state between aligning geomagnetic and randomizing interparticle forces. Just how strong these mechanical and electrostatic forces are, depends on many petrophysical factors related to mineralogy, particle size and shape of the matrix constituents. We therefore test the hypothesis that variations in sediment lithology modulate RPI records. For 90 selected Late Quaternary sediment samples from the subtropical and subantarctic South Atlantic Ocean a combined paleomagnetic and sedimentological dataset was established. Misleading alterations of the magnetic mineral fraction were detected by a routine Fe/kappa test (Funk, J., von Dobeneck, T., Reitz, A., 2004. Integrated rock magnetic and geochemical quantification of redoxomorphic iron mineral diagenesis in Late Quaternary sediments from the Equatorial Atlantic. In: Wefer, G., Mulitza, S., Ratmeyer, V. (Eds.), The South Atlantic in the Late Quaternary: reconstruction of material budgets and current systems. Springer-Verlag, Berlin/Heidelberg/New York/Tokyo, pp. 239-262). Samples with any indication of suboxic magnetite dissolution were excluded from the dataset. The parameters under study include carbonate, opal and terrigenous content, grain size distribution and clay mineral composition. Their bi- and multivariate correlations with the RPI signal were statistically investigated using standard techniques and criteria. While several of the parameters did not yield significant results, clay grain size and chlorite correlate weakly and opal, illite and kaolinite correlate moderately to the NRM/ARM signal used here as a RPI measure. The most influential single sedimentological factor is the kaolinite/illite ratio with a Pearson's coefficient of 0.51 and 99.9% significance. A three-member regression model suggests that matrix effects can make up over 50% of the observed RPI dynamics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Eleven sediment samples taken downcore and representing the past 26 kyr of deposition at MANOP site C (0°57.2°N, 138°57.3°W) were analyzed for lipid biomarker composition. Biomarkers of both terrestrial and marine sources of organic carbon were identified. In general, concentration profiles for these biomarkers and for total organic carbon (TOC) displayed three common stratigraphic features in the time series: (1) a maximum within the surface sediment mixed layer (<=4 ka); (2) a broad minimum extending throughout the interglacial deposit; and (3) a deep, pronounced maximum within the glacial deposit. Using the biomarker records, a simple binary mixing model is described that assesses the proportion of terrestrial to marine TOC in these sediments. Best estimates from this model suggest that ~20% of the TOC is land-derived, introduced by long-range eolian transport, and the remainder is derived from marine productivity. The direct correlation between the records for terrestrial and marine TOC with depth in this core fits an interpretation that primary productivity at site C has been controlled by wind-driven upwelling at least over the last glacial/interglacial cycle. The biomarker records place the greatest wind strength and highest primary productivity within the time frame of 18 to 22 kyr B.P. Diagenetic effects limit our ability to ascertain directly from the biomarker records the absolute magnitude that different types of primary productivity have changed at this ocean location over the past 26 kyr.