853 results for Data stream mining
Abstract:
The recent development of in-situ monitoring devices, such as UV spectrometers, makes the study of short-term variation in stream chemistry relevant, especially the study of diurnal cycles, which are not yet fully understood. Our study is based on high-frequency data from an agricultural catchment (Studienlandschaft Schwingbachtal, Germany). We propose a novel approach, the combination of cluster analysis and Linear Discriminant Analysis, to mine nitrate behavior patterns from these data. As a result, we observe a seasonality of nitrate diurnal cycles which differs from the most common cycle seasonality described in the literature, i.e. pre-dawn peaks in spring. Our cycles appear in summer, and the maximum and minimum shift to a later time of day in late summer and autumn. This is observed in both water-limited and energy-limited years, thus potentially stressing the role of evapotranspiration. This concluding hypothesis on the role of evapotranspiration in stream nitrate concentration, which was obtained through data mining, broadens the perspective on the diurnal cycling of stream nitrate concentrations.
Abstract:
The planktonic haptophyte Phaeocystis has been suggested to play a fundamental role in the global biogeochemical cycling of carbon and sulphur, but little is known about its global biomass distribution. We have collected global microscopy data of the genus Phaeocystis and converted abundance data to carbon biomass using species-specific carbon conversion factors. Microscopic counts of single-celled and colonial Phaeocystis were obtained both through the mining of online databases and by accepting direct submissions (both published and unpublished) from Phaeocystis specialists. We recorded abundance data from a total of 1595 depth-resolved stations sampled between 1955 and 2009. The quality-controlled dataset includes 5057 counts of individual Phaeocystis cells resolved to species level and information regarding life stages from 3526 samples. 83% of stations were located in the Northern Hemisphere, while 17% were located in the Southern Hemisphere. Most data were located in the latitude range of 50-70° N. While the seasonal distribution of Northern Hemisphere data was well balanced, Southern Hemisphere data were biased towards summer months. Mean species- and form-specific cell diameters were determined from previously published studies. Cell diameters were used to calculate the cellular biovolume of Phaeocystis cells, assuming spherical geometry. Cell biomass was calculated using a carbon conversion factor for Prymnesiophytes (Menden-Deuer and Lessard, 2000). For colonies, the number of cells per colony was derived from the colony volume. Cell numbers were then converted to carbon concentrations. An estimate of colonial mucus carbon was included a posteriori, assuming a mean colony size for each species. Carbon content per cell ranged from 9 pg (single-celled Phaeocystis antarctica) to 29 pg (colonial Phaeocystis globosa).
Non-zero Phaeocystis cell biomasses (without mucus carbon) range from 2.9 × 10⁻⁵ µg l⁻¹ to 5.4 × 10³ µg l⁻¹, with a mean of 45.7 µg l⁻¹ and a median of 3.0 µg l⁻¹. Highest biomasses occur in the Southern Ocean below 70° S (up to 783.9 µg l⁻¹), and in the North Atlantic around 50° N (up to 5.4 × 10³ µg l⁻¹).
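
The conversion chain described in this abstract (cell diameter to spherical biovolume, biovolume to carbon per cell, cell counts to carbon concentration) can be illustrated with a minimal Python sketch. It is not the authors' code. The allometric form C = a · V^b, the coefficient values, the example diameter, and the example abundance are all placeholders chosen for illustration, and the function names are hypothetical. The actual conversion factors should be taken from Menden-Deuer and Lessard (2000).

```python
import math


def sphere_volume_um3(diameter_um: float) -> float:
    """Volume of a sphere in cubic micrometres, given its diameter in micrometres."""
    return math.pi / 6.0 * diameter_um ** 3


def carbon_per_cell_pg(volume_um3: float, a: float, b: float) -> float:
    """Carbon per cell (pg) from biovolume via an assumed allometric form C = a * V**b."""
    return a * volume_um3 ** b


def biomass_ug_per_l(cells_per_l: float, carbon_pg: float) -> float:
    """Carbon concentration (µg per litre) from abundance (cells per litre)."""
    return cells_per_l * carbon_pg * 1e-6  # 1 pg = 1e-6 µg


if __name__ == "__main__":
    # All values below are illustrative placeholders, not taken from the dataset.
    A_COEFF = 0.2      # hypothetical allometric coefficient
    B_EXP = 0.9        # hypothetical allometric exponent
    diameter = 5.0     # hypothetical cell diameter, µm
    abundance = 1.0e6  # hypothetical abundance, cells per litre

    volume = sphere_volume_um3(diameter)
    carbon = carbon_per_cell_pg(volume, A_COEFF, B_EXP)
    print(f"biovolume: {volume:.1f} µm^3")
    print(f"carbon per cell: {carbon:.2f} pg")
    print(f"biomass: {biomass_ug_per_l(abundance, carbon):.2f} µg per litre")
```

As a unit check against the figures quoted above, a carbon content of 9 pg per cell at an abundance of 10⁶ cells per litre corresponds to 9 µg l⁻¹.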