2 results for multiclass classification problems

in Plymouth Marine Science Electronic Archive (PlyMSEA)


Relevance:

30.00%

Publisher:

Abstract:

Agglomerative cluster analyses encompass many techniques that have been widely used in various fields of science. In biology, and specifically ecology, datasets are generally highly variable and may contain outliers, which make it harder to identify the number of clusters. Here we present a new criterion to determine statistically the optimal level of partition in a classification tree. The robustness of the criterion is tested against perturbed data (outliers) by adding an observation or a variable with randomly generated values. The technique, called the Random Simulation Test (RST), is tested on (1) the well-known Iris dataset [Fisher, R.A., 1936. The use of multiple measurements in taxonomic problems. Ann. Eugenic. 7, 179–188], (2) simulated data with predetermined numbers of clusters following Milligan and Cooper [Milligan, G.W., Cooper, M.C., 1985. An examination of procedures for determining the number of clusters in a data set. Psychometrika 50, 159–179], and finally (3) applied to real copepod community data previously analyzed in Beaugrand et al. [Beaugrand, G., Ibanez, F., Lindley, J.A., Reid, P.C., 2002. Diversity of calanoid copepods in the North Atlantic and adjacent seas: species associations and biogeography. Mar. Ecol. Prog. Ser. 232, 179–195]. The technique is compared with several standard techniques. RST generally performed better than existing algorithms on simulated data and proved especially efficient with highly variable datasets.
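
To make the idea of "choosing the level of partition in a classification tree" concrete, here is a minimal sketch in plain Python. It is not the RST criterion, which the abstract does not specify in enough detail to reproduce. As a stand-in it uses average-linkage agglomeration and the common "largest gap in merge distance" heuristic. The function names `agglomerate` and `cut_at_largest_gap` and the toy points are hypothetical, chosen only for illustration.

```python
import math
from itertools import combinations


def agglomerate(points):
    """Average-linkage agglomerative clustering (illustrative, O(n^3) or worse).

    Returns one (merge_distance, clusters_after_merge) tuple per merge,
    in the order the merges happen. Clusters are lists of point indices.
    """
    clusters = [[i] for i in range(len(points))]
    history = []
    while len(clusters) > 1:
        best = None
        for a, b in combinations(range(len(clusters)), 2):
            # Average pairwise Euclidean distance between the two clusters.
            total = sum(math.dist(points[i], points[j])
                        for i in clusters[a] for j in clusters[b])
            d = total / (len(clusters[a]) * len(clusters[b]))
            if best is None or d < best[0]:
                best = (d, a, b)
        d, a, b = best
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
        history.append((d, [sorted(c) for c in clusters]))
    return history


def cut_at_largest_gap(history):
    """Return the partition that exists just before the largest jump in
    merge distance. Assumes at least two merges (three or more points).
    This is a simple stand-in heuristic, NOT the RST criterion."""
    distances = [d for d, _ in history]
    gaps = [distances[k + 1] - distances[k] for k in range(len(distances) - 1)]
    k = max(range(len(gaps)), key=gaps.__getitem__)
    return history[k][1]


# Hypothetical toy data: three well-separated groups in the plane.
points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.0),
          (5.0, 5.0), (5.1, 5.2),
          (10.0, 0.0), (10.2, 0.1)]
partition = cut_at_largest_gap(agglomerate(points))
print(sorted(partition))
```

A criterion of the kind described in the abstract would replace `cut_at_largest_gap` with a statistical test, and its robustness could be probed by appending a randomly generated observation to `points` and checking whether the chosen partition changes.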

Relevance:

30.00%

Publisher:

Abstract:

There is a multitude of ecosystem service classifications available within the literature, each with its own advantages and drawbacks. Elements of them have been used to tailor a generic ecosystem service classification for the marine environment and then for a case study site within the North Sea: the Dogger Bank. Indicators for each of the ecosystem services, deemed relevant to the case study site, were identified. Each indicator was then assessed against a set of agreed criteria to ensure its relevance and applicability to environmental management. This paper identifies the need to distinguish between indicators of ecosystem services that are entirely ecological in nature (and largely reveal the potential of an ecosystem to provide ecosystem services), indicators for the ecological processes contributing to the delivery of these services, and indicators of benefits that reveal the realized human use or enjoyment of an ecosystem service. It highlights some of the difficulties faced in selecting meaningful indicators, such as problems of specificity, spatial disconnect and the considerable uncertainty about marine species, habitats and the processes, functions and services they contribute to.