3 resultados para Databases and Health Information systems

em Repositório Científico do Instituto Politécnico de Lisboa - Portugal


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Doutoramento em Gestão

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Synthetic dyes are xenobiotic compounds that are being increasingly used in several industries, with special emphasis in the paper, textile and leather industries. Over 100,000 commercial dyes exist today and more than 7 × 105 tons of dyestuff is produced annually, of which 1–1.5 × 105 tons is released into the wastewaters (Rai et al in Crit Rev Environ Sci Tecnhol 35:219–238, 2005). Among these, azo dyes, characterized by the presence of one or more azo groups (–N=N–), and anthraquinonic dyes represent the largest and most versatile groups.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Feature discretization (FD) techniques often yield adequate and compact representations of the data, suitable for machine learning and pattern recognition problems. These representations usually decrease the training time, yielding higher classification accuracy while allowing for humans to better understand and visualize the data, as compared to the use of the original features. This paper proposes two new FD techniques. The first one is based on the well-known Linde-Buzo-Gray quantization algorithm, coupled with a relevance criterion, being able perform unsupervised, supervised, or semi-supervised discretization. The second technique works in supervised mode, being based on the maximization of the mutual information between each discrete feature and the class label. Our experimental results on standard benchmark datasets show that these techniques scale up to high-dimensional data, attaining in many cases better accuracy than existing unsupervised and supervised FD approaches, while using fewer discretization intervals.