58 resultados para Semi-supervised classification

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Aims. In this work, we describe the pipeline for the fast supervised classification of light curves observed by the CoRoT exoplanet CCDs. We present the classification results obtained for the first four measured fields, which represent a one-year in-orbit operation. Methods. The basis of the adopted supervised classification methodology has been described in detail in a previous paper, as is its application to the OGLE database. Here, we present the modifications of the algorithms and of the training set to optimize the performance when applied to the CoRoT data. Results. Classification results are presented for the observed fields IRa01, SRc01, LRc01, and LRa01 of the CoRoT mission. Statistics on the number of variables and the number of objects per class are given and typical light curves of high-probability candidates are shown. We also report on new stellar variability types discovered in the CoRoT data. The full classification results are publicly available.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The design of binary morphological operators that are translation-invariant and locally defined by a finite neighborhood window corresponds to the problem of designing Boolean functions. As in any supervised classification problem, morphological operators designed from a training sample also suffer from overfitting. Large neighborhood tends to lead to performance degradation of the designed operator. This work proposes a multilevel design approach to deal with the issue of designing large neighborhood-based operators. The main idea is inspired by stacked generalization (a multilevel classifier design approach) and consists of, at each training level, combining the outcomes of the previous level operators. The final operator is a multilevel operator that ultimately depends on a larger neighborhood than of the individual operators that have been combined. Experimental results show that two-level operators obtained by combining operators designed on subwindows of a large window consistently outperform the single-level operators designed on the full window. They also show that iterating two-level operators is an effective multilevel approach to obtain better results.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This work proposes and discusses an approach for inducing Bayesian classifiers aimed at balancing the tradeoff between the precise probability estimates produced by time consuming unrestricted Bayesian networks and the computational efficiency of Naive Bayes (NB) classifiers. The proposed approach is based on the fundamental principles of the Heuristic Search Bayesian network learning. The Markov Blanket concept, as well as a proposed ""approximate Markov Blanket"" are used to reduce the number of nodes that form the Bayesian network to be induced from data. Consequently, the usually high computational cost of the heuristic search learning algorithms can be lessened, while Bayesian network structures better than NB can be achieved. The resulting algorithms, called DMBC (Dynamic Markov Blanket Classifier) and A-DMBC (Approximate DMBC), are empirically assessed in twelve domains that illustrate scenarios of particular interest. The obtained results are compared with NB and Tree Augmented Network (TAN) classifiers, and confinn that both proposed algorithms can provide good classification accuracies and better probability estimates than NB and TAN, while being more computationally efficient than the widely used K2 Algorithm.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Na primeira semana de maio de 2008, durante quatro dias, um ciclone em superfície permaneceu semi-estacionário na costa da região sul do Brasil. Este sistema foi responsável por chuvas e ventos fortes no Rio Grande do Sul e Santa Catarina, os quais causaram muitos danos (queda de árvores, enchentes e desabamentos). O objetivo deste trabalho é avaliar o processo de formação e entender os mecanismos responsáveis pelo lento deslocamento do ciclone, já que a maioria dos ciclones nesta região possui deslocamento mais rápido. A equação de desenvolvimento de Sutcliffe mostrou que a advecção de vorticidade absoluta ciclônica na média troposfera e a advecção de ar quente na camada entre 1000-500 hPa foram mecanismos importantes para a ciclogênese. Neste período, o intenso aquecimento diabático também contribuiu para a ciclogênese, à medida que se contrapôs ao intenso resfriamento adiabático devido aos movimentos verticais ascendentes. A advecção de vorticidade absoluta ciclônica que favoreceu a ciclogênese esteve associada a um Vórtice Ciclônico em Altos Níveis (VCAN), que se formou numa região de anomalia de vorticidade potencial. O VCAN se manteve semi-estacionário e compôs o setor norte de um bloqueio do tipo dipolo. Tal bloqueio intensificou um anticiclone em superfície, situado a sul/leste do ciclone, o que contribuiu para o ciclone se manter semi-estacionário. O movimento atípico e lento do ciclone para sul, e em alguns períodos para sudoeste, esteve associado com advecções de vorticidade absoluta ciclônica na média troposfera e de ar quente no seu setor sul. Somente quando o bloqueio em níveis médios e a anomalia de vorticidade potencial em níveis médios/altos se enfraqueceram, o ciclone em superfície se afastou da costa sul do Brasil.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

É apresentado um estudo sobre sistemas convectivos linearmente organizados e observados por um radar meteorológico banda-C na região semi-árida do Nordeste do Brasil. São analisados três dias (27 a 29) de março de 1985, com ênfase na investigação do papel desempenhado por fatores locais e de grande escala no desenvolvimento dos sistemas. No cenário de grande escala, a área de cobertura do radar foi influenciada por um cavado de ar superior austral no dia 27 e por um vórtice ciclônico de altos níveis no dia 29. A convergência de umidade próxima à superfície favoreceu a atividade convectiva nos dias 27 e 29, enquanto que divergência de umidade próxima à superfície inibiu a atividade convectiva no dia 28. No cenário de mesoescala, foi observado que o aquecimento diurno é um fator importante para a formação de células convectivas, somando-se a ele o papel determinante da orografia na localização dos ecos. De maneira geral, as imagens de radar mostram os sistemas convectivos linearmente organizados em áreas elevadas e núcleos convectivos intensos envolvidos por uma área de precipitação estratiforme. Os resultados indicam que convergência do fluxo de umidade em grande escala e aquecimento radiativo, são fatores determinantes na evolução e desenvolvimento dos ecos na área de estudo.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Saving our science from ourselves: the plight of biological classification. Biological classification ( nomenclature, taxonomy, and systematics) is being sold short. The desire for new technologies, faster and cheaper taxonomic descriptions, identifications, and revisions is symptomatic of a lack of appreciation and understanding of classification. The problem of gadget-driven science, a lack of best practice and the inability to accept classification as a descriptive and empirical science are discussed. The worst cases scenario is a future in which classifications are purely artificial and uninformative.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Due to the imprecise nature of biological experiments, biological data is often characterized by the presence of redundant and noisy data. This may be due to errors that occurred during data collection, such as contaminations in laboratorial samples. It is the case of gene expression data, where the equipments and tools currently used frequently produce noisy biological data. Machine Learning algorithms have been successfully used in gene expression data analysis. Although many Machine Learning algorithms can deal with noise, detecting and removing noisy instances from the training data set can help the induction of the target hypothesis. This paper evaluates the use of distance-based pre-processing techniques for noise detection in gene expression data classification problems. This evaluation analyzes the effectiveness of the techniques investigated in removing noisy data, measured by the accuracy obtained by different Machine Learning classifiers over the pre-processed data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

PURPOSE: The main goal of this study was to develop and compare two different techniques for classification of specific types of corneal shapes when Zernike coefficients are used as inputs. A feed-forward artificial Neural Network (NN) and discriminant analysis (DA) techniques were used. METHODS: The inputs both for the NN and DA were the first 15 standard Zernike coefficients for 80 previously classified corneal elevation data files from an Eyesys System 2000 Videokeratograph (VK), installed at the Departamento de Oftalmologia of the Escola Paulista de Medicina, São Paulo. The NN had 5 output neurons which were associated with 5 typical corneal shapes: keratoconus, with-the-rule astigmatism, against-the-rule astigmatism, "regular" or "normal" shape and post-PRK. RESULTS: The NN and DA responses were statistically analyzed in terms of precision ([true positive+true negative]/total number of cases). Mean overall results for all cases for the NN and DA techniques were, respectively, 94% and 84.8%. CONCLUSION: Although we used a relatively small database, results obtained in the present study indicate that Zernike polynomials as descriptors of corneal shape may be a reliable parameter as input data for diagnostic automation of VK maps, using either NN or DA.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The spatial and temporal retention of metals has been studied in water and sediments of the Gavião River, Anagé and Tremedal Reservoirs, located in the semi-arid region, Bahia - Brazil, in order to identify trends in the fluxes of metals from the sediments to the water column. The determination of metals was made by ICP OES and ET AAS. The application of statistical methods showed that this aquatic system presents suitable conditions to move Cd2+ and Pb2+ from the water column to the sediment.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present a molecular phylogenetic analysis of caenophidian (advanced) snakes using sequences from two mitochondrial genes (12S and 16S rRNA) and one nuclear (c-mos) gene (1681 total base pairs), and with 131 terminal taxa sampled from throughout all major caenophidian lineages but focussing on Neotropical xenodontines. Direct optimization parsimony analysis resulted in a well-resolved phylogenetic tree, which corroborates some clades identified in previous analyses and suggests new hypotheses for the composition and relationships of others. The major salient points of our analysis are: (1) placement of Acrochordus, Xenodermatids, and Pareatids as successive outgroups to all remaining caenophidians (including viperids, elapids, atractaspidids, and all other "colubrid" groups); (2) within the latter group, viperids and homalopsids are sucessive sister clades to all remaining snakes; (3) the following monophyletic clades within crown group caenophidians: Afro-Asian psammophiids (including Mimophis from Madagascar), Elapidae (including hydrophiines but excluding Homoroselaps), Pseudoxyrhophiinae, Colubrinae, Natricinae, Dipsadinae, and Xenodontinae. Homoroselaps is associated with atractaspidids. Our analysis suggests some taxonomic changes within xenodontines, including new taxonomy for Alsophis elegans, Liophis amarali, and further taxonomic changes within Xenodontini and the West Indian radiation of xenodontines. Based on our molecular analysis, we present a revised classification for caenophidians and provide morphological diagnoses for many of the included clades; we also highlight groups where much more work is needed. We name as new two higher taxonomic clades within Caenophidia, one new subfamily within Dipsadidae, and, within Xenodontinae five new tribes, six new genera and two resurrected genera. We synonymize Xenoxybelis and Pseudablabes with Philodryas; Erythrolamprus with Liophis; and Lystrophis and Waglerophis with Xenodon.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Os motivos para as diferenças epidemiológicas e para a adesão ao tratamento da tuberculose em relação a homens e mulheres são desconhecidos. Este trabalho tem como objetivo verificar diferenças na adesão ao tratamento da tuberculose em relação ao sexo; identificar aspectos facilitadores e dificultadores para a adesão ao tratamento da tuberculose em relação ao sexo; analisar as crenças consideradas importantes para a adesão ao tratamento da tuberculose. Foi utilizado o referencial teórico do Modelo de Crenças em Saúde de Rosenstock e a técnica da Análise de Conteúdos de Bardin. Foram realizadas 28 entrevistas semiestruturadas com homens e mulheres em tratamento supervisionado de tuberculose do Distrito de Saúde da Freguesia do Ó/Brasilândia. Os resultados mostraram que o perfil daqueles que falharam na terapêutica da tuberculose em relação ao sexo foi: mulher - solteira e separada, com atividade remunerada não comprovada, nível de escolaridade entre fundamental I completo e ensino médio completo; homem - casado, com atividade remunerada comprovada, nível de escolaridade entre ensino fundamental II completo e ensino médio completo. Os aspectos facilitadores encontrados para a boa adesão residem no bom atendimento dos profissionais de saúde e na percepção, por parte do paciente, da sua melhora de saúde. As crenças para a boa adesão ao tratamento no sexo masculino e feminino foram: bom atendimento do serviço de saúde e bom tratamento (em relação aos medicamentos).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes a new food classification which assigns foodstuffs according to the extent and purpose of the industrial processing applied to them. Three main groups are defined: unprocessed or minimally processed foods (group 1), processed culinary and food industry ingredients (group 2), and ultra-processed food products (group 3). The use of this classification is illustrated by applying it to data collected in the Brazilian Household Budget Survey which was conducted in 2002/2003 through a probabilistic sample of 48,470 Brazilian households. The average daily food availability was 1,792 kcal/person being 42.5% from group 1 (mostly rice and beans and meat and milk), 37.5% from group 2 (mostly vegetable oils, sugar, and flours), and 20% from group 3 (mostly breads, biscuits, sweets, soft drinks, and sausages). The share of group 3 foods increased with income, and represented almost one third of all calories in higher income households. The impact of the replacement of group 1 foods and group 2 ingredients by group 3 products on the overall quality of the diet, eating patterns and health is discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Bovine coronavirus (BCoV) is a member of the group 2 of the Coronavirus (Nidovirales: Coronaviridae) and the causative agent of enteritis in both calves and adult bovine, as well as respiratory disease in calves. The present study aimed to develop a semi-nested RT-PCR for the detection of BCoV based on representative up-to-date sequences of the nucleocapsid gene, a conserved region of coronavirus genome. Three primers were designed, the first round with a 463bp and the second (semi-nested) with a 306bp predicted fragment. The analytical sensitivity was determined by 10-fold serial dilutions of the BCoV Kakegawa strain (HA titre: 256) in DEPC treated ultra-pure water, in fetal bovine serum (FBS) and in a BCoV-free fecal suspension, when positive results were found up to the 10-2, 10-3 and 10-7 dilutions, respectively, which suggests that the total amount of RNA in the sample influence the precipitation of pellets by the method of extraction used. When fecal samples was used, a large quantity of total RNA serves as carrier of BCoV RNA, demonstrating a high analytical sensitivity and lack of possible substances inhibiting the PCR. The final semi-nested RT-PCR protocol was applied to 25 fecal samples from adult cows, previously tested by a nested RT-PCR RdRp used as a reference test, resulting in 20 and 17 positives for the first and second tests, respectively, and a substantial agreement was found by kappa statistics (0.694). The high sensitivity and specificity of the new proposed method and the fact that primers were designed based on current BCoV sequences give basis to a more accurate diagnosis of BCoV-caused diseases, as well as to further insights on protocols for the detection of other Coronavirus representatives of both Animal and Public Health importance.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This work proposes a new approach using a committee machine of artificial neural networks to classify masses found in mammograms as benign or malignant. Three shape factors, three edge-sharpness measures, and 14 texture measures are used for the classification of 20 regions of interest (ROIs) related to malignant tumors and 37 ROIs related to benign masses. A group of multilayer perceptrons (MLPs) is employed as a committee machine of neural network classifiers. The classification results are reached by combining the responses of the individual classifiers. Experiments involving changes in the learning algorithm of the committee machine are conducted. The classification accuracy is evaluated using the area A. under the receiver operating characteristics (ROC) curve. The A, result for the committee machine is compared with the A, results obtained using MLPs and single-layer perceptrons (SLPs), as well as a linear discriminant analysis (LDA) classifier Tests are carried out using the student's t-distribution. The committee machine classifier outperforms the MLP SLP, and LDA classifiers in the following cases: with the shape measure of spiculation index, the A, values of the four methods are, in order 0.93, 0.84, 0.75, and 0.76; and with the edge-sharpness measure of acutance, the values are 0.79, 0.70, 0.69, and 0.74. Although the features with which improvement is obtained with the committee machines are not the same as those that provided the maximal value of A(z) (A(z) = 0.99 with some shape features, with or without the committee machine), they correspond to features that are not critically dependent on the accuracy of the boundaries of the masses, which is an important result. (c) 2008 SPIE and IS&T.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The in vitro activity of the crude hydroalcoholic extract of the aerial parts of Miconia langsdorffii Cogn. was evaluated against the promastigote forms of L. amazonensis, the causative agent of cutaneous leishmaniasis in humans. The bioassay-guided fractionation of this extract led to identification of the triterpenes ursolic acid and oleanolic acid as the major compounds in the fraction that displayed the highest activity. Several ursolic acid semi-synthetic derivatives were prepared, to find out whether more active compounds could be obtained. Among these ursolic acid-derived substances, the C-28 methyl ester derivative exhibited the best antileishmanial activity.