5 resultados para Truncated robust multivariate outlier detection

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Given a large image set, in which very few images have labels, how to guess labels for the remaining majority? How to spot images that need brand new labels different from the predefined ones? How to summarize these data to route the user’s attention to what really matters? Here we answer all these questions. Specifically, we propose QuMinS, a fast, scalable solution to two problems: (i) Low-labor labeling (LLL) – given an image set, very few images have labels, find the most appropriate labels for the rest; and (ii) Mining and attention routing – in the same setting, find clusters, the top-'N IND.O' outlier images, and the 'N IND.R' images that best represent the data. Experiments on satellite images spanning up to 2.25 GB show that, contrasting to the state-of-the-art labeling techniques, QuMinS scales linearly on the data size, being up to 40 times faster than top competitors (GCap), still achieving better or equal accuracy, it spots images that potentially require unpredicted labels, and it works even with tiny initial label sets, i.e., nearly five examples. We also report a case study of our method’s practical usage to show that QuMinS is a viable tool for automatic coffee crop detection from remote sensing images.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We report a morphology-based approach for the automatic identification of outlier neurons, as well as its application to the NeuroMorpho.org database, with more than 5,000 neurons. Each neuron in a given analysis is represented by a feature vector composed of 20 measurements, which are then projected into a two-dimensional space by applying principal component analysis. Bivariate kernel density estimation is then used to obtain the probability distribution for the group of cells, so that the cells with highest probabilities are understood as archetypes while those with the smallest probabilities are classified as outliers. The potential of the methodology is illustrated in several cases involving uniform cell types as well as cell types for specific animal species. The results provide insights regarding the distribution of cells, yielding single and multi-variate clusters, and they suggest that outlier cells tend to be more planar and tortuous. The proposed methodology can be used in several situations involving one or more categories of cells, as well as for detection of new categories and possible artifacts.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Multivariate analyses of UV-Vis spectral data from cachaca wood extracts provide a simple and robust model to classify aged Brazilian cachacas according to the wood species used in the maturation barrels. The model is based on inspection of 93 extracts of oak and different Brazilian wood species by a non-aged cachaca used as an extraction solvent. Application of PCA (Principal Components Analysis) and HCA (Hierarchical Cluster Analysis) leads to identification of 6 clusters of cachaca wood extracts (amburana, amendoim, balsamo, castanheira, jatoba, and oak). LDA (Linear Discriminant Analysis) affords classification of 10 different wood species used in the cachaca extracts (amburana, amendoim, balsamo, cabreuva-parda, canela-sassafras, castanheira, jatoba, jequitiba-rosa, louro-canela, and oak) with an accuracy ranging from 80% (amendoim and castanheira) to 100% (balsamo and jequitiba-rosa). The methodology provides a low-cost alternative to methods based on liquid chromatography and mass spectrometry to classify cachacas aged in barrels that are composed of different wood species.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract Background Lower respiratory tract infection (LRTI) is a major cause of pediatric morbidity and mortality, especially among non-affluent communities. In this study we determine the impact of respiratory viruses and how viral co-detections/infections can affect clinical LRTI severity in children in a hospital setting. Methods Patients younger than 3 years of age admitted to a tertiary hospital in Brazil during the months of high prevalence of respiratory viruses had samples collected from nasopharyngeal aspiration. These samples were tested for 13 different respiratory viruses through real-time PCR (rt-PCR). Patients were followed during hospitalization, and clinical data and population characteristics were collected during that period and at discharge to evaluate severity markers, especially length of hospital stay and oxygen use. Univariate regression analyses identified potential risk factors and multivariate logistic regressions were used to determine the impact of specific viral detections as well as viral co-detections in relation to clinical outcomes. Results We analyzed 260 episodes of LRTI with a viral detection rate of 85% (n = 222). Co-detection was observed in 65% of all virus-positive episodes. The most prevalent virus was Respiratory Syncytial Virus (RSV) (54%), followed by Human Metapneumovirus (hMPV) (32%) and Human Rhinovirus (HRV) (21%). In the multivariate models, infants with co-detection of HRV + RSV stayed 4.5 extra days (p = 0.004), when compared to infants without the co-detection. The same trends were observed for the outcome of days of supplemental oxygen use. Conclusions Although RSV remains as the main cause of LRTI in infants our study indicates an increase in the length of hospital stay and oxygen use in infants with HRV detected by RT-PCR compared to those without HRV. Moreover, one can speculate that when HRV is detected simultaneously with RSV there is an additive effect that may be reflected in more severe clinical outcome. Also, our study identified a significant number of children infected by recently identified viruses, such as hMPV and Human Bocavirus (HBov), and this is a novel finding for poor communities from developing countries.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Over the last decade, molecular phylogenetics has called into question some fundamental aspects of coral systematics. Within the Scleractinia, most families composed exclusively by zooxanthellate species are polyphyletic on the basis of molecular data, and the second most speciose coral family, the Caryophylliidae (most members of which are azooxanthellate), is an unnatural grouping. As part of the process of resolving taxonomic affinities of caryophylliids', here a new Robust' scleractinian family (Deltocyathiidae fam. n.) is proposed on the basis of combined molecular (CO1 and 28S rDNA) and morphological data, accommodating the early-diverging clade of traditional caryophylliids (represented today by the genus Deltocyathus). Whereas this family captures the full morphological diversity of the genus Deltocyathus, one species, Deltocyathus magnificus, is an outlier in terms of molecular data, and groups with the Complex coral family Turbinoliidae. Ultrastructural data, however, place D.magnificus within Deltocyathiidae fam. nov. Unfortunately, limited ultrastructural data are as yet available for turbinoliids, but D.magnificus may represent the first documented case of morphological convergence at the microstructural level among scleractinian corals. Marcelo V.Kitahara, Centro de Biologia Marinha, Universidade de SAo Paulo, SAo SebastiAo, S.P. 11600-000, Brazil. E-mail:kitahara@usp.br