858 results for Data pre-processing
Abstract:
The amount of textual information stored digitally grows every day. However, our capacity to process and analyze that information is not growing at the same pace. To overcome this limitation, it is important to develop semiautomatic processes that extract relevant knowledge from textual information, such as the text mining process. One of the main and most expensive stages of text mining is text pre-processing, in which unstructured text is transformed into a structured format such as an attribute-value table. Stemming, i.e., linguistic normalization, is usually used to find the attributes of this table. However, stemming depends strongly on the language of the original text. Furthermore, for most languages, the stemming algorithms proposed in the literature are computationally expensive. In this work, several improvements to the well-known Porter stemming algorithm for the Portuguese language are proposed, which exploit the characteristics of this language. Experimental results show that the proposed algorithm executes in far less time without affecting the quality of the generated stems.
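As a rough illustration of the pre-processing step described above, the following sketch builds an attribute-value table whose attributes are stems. The suffix list and the `crude_stem` and `attribute_value_table` helpers are hypothetical and deliberately tiny; they are not the Porter-style Portuguese stemmer the abstract improves on, only a stand-in that shows how stemming merges word forms into shared attributes.

```python
from collections import Counter

# Hypothetical, deliberately small suffix list (checked in this order).
# A real Portuguese stemmer applies far more rules than this.
SUFFIXES = ("mente", "ções", "ção", "ando", "endo", "indo",
            "em", "es", "as", "os", "a", "e", "o", "s")


def crude_stem(word, min_stem=3):
    """Strip the first matching suffix, keeping at least min_stem characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[: -len(suffix)]
    return word


def attribute_value_table(documents):
    """Return (sorted stems, one row of stem counts per document)."""
    counts = [Counter(crude_stem(w) for w in doc.lower().split())
              for doc in documents]
    attributes = sorted(set().union(*counts))
    rows = [[c[a] for a in attributes] for c in counts]
    return attributes, rows


docs = ["gatos correm rapidamente", "gato corre"]
attributes, rows = attribute_value_table(docs)
for doc, row in zip(docs, rows):
    print(doc, dict(zip(attributes, row)))
```

In this toy input, "gatos" and "gato" collapse into one attribute, as do "correm" and "corre", so the table has three columns instead of five.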
Abstract:
An Intelligent Transportation System (ITS) is a system that builds a safe, effective and integrated transportation environment based on advanced technologies. Road sign detection and recognition is an important part of ITS, offering ways to collect real-time traffic data for processing at a central facility. This project implements a road sign recognition model based on AI and image analysis technologies, which applies a machine learning method, Support Vector Machines (SVM), to recognize road signs. We focus on recognizing seven categories of road sign shapes and five categories of speed limit signs. Two kinds of features, binary images and Zernike moments, are used to represent the data to the SVM for training and testing. We compared and analyzed the performance of the SVM recognition model using different features and different kernels. Moreover, the performance of two different recognition models, SVM and Fuzzy ARTMAP, is compared.
Abstract:
In recent years, the number of industrial applications for Augmented Reality (AR) and Virtual Reality (VR) environments has increased significantly. Optical tracking systems are an important component of AR/VR environments. In this work, a low-cost optical tracking system with adequate attributes for professional use is proposed. The system works in the infrared spectral region to reduce optical noise. A high-speed camera, equipped with a daylight blocking filter and infrared flash strobes, transfers uncompressed grayscale images to a regular PC, where image pre-processing software and the PTrack tracking algorithm recognize a set of retro-reflective markers and extract their 3D position and orientation. This work also includes a comprehensive study of image pre-processing and tracking algorithms. A testbed was built to perform accuracy and precision tests. Results show that the system reaches accuracy and precision levels slightly worse than, but still comparable to, those of professional systems. Due to its modularity, the system can be expanded by using several one-camera tracking modules linked by a sensor fusion algorithm, in order to obtain a larger working range. A setup with two modules was built and tested, resulting in performance similar to the stand-alone configuration.
Abstract:
This article presents a detailed study of the application of different additive manufacturing technologies (sintering, three-dimensional printing, extrusion and stereolithography) in the design process of a complex geometry model and its moving parts. The fabrication sequence was evaluated in terms of pre-processing conditions (model generation and model STL/SLI), generation strategy and physical model post-processing operations. Dimensional verification of the obtained models was undertaken by projecting structured light (optical scanning), a relatively new technology of major importance for metrology and reverse engineering. Manufacturing time and production costs were also studied, which allowed the definition of a more comprehensive evaluation matrix of additive technologies.
Abstract:
The occurrence of transients in electrocardiogram (ECG) signals indicates an electrical phenomenon outside the heart. Thus, the identification of transients has been the most-used methodology in medical analysis since the invention of the electrocardiograph (the device responsible for recording electrocardiogram signals). There are few papers related to this subject, which motivates the creation of an architecture to pre-process this signal in order to identify transients. This paper proposes a method based on the signal energy of the Hilbert transform of the electrocardiogram, as an alternative to methods based on the morphology of the signal. This information will determine the creation of frames of the MP-HA protocol, which is responsible for transmitting the ECG signals through an IEEE 802.3 network to a computing device. That device, in turn, may automatically classify the signal or present it to a physician, who can classify it manually.
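The idea of using the energy of the Hilbert transform to locate transients can be sketched as follows. The abstract gives neither the energy formula nor the detection rule, so this is only one plausible reading: the Hilbert transform is taken as the imaginary part of the analytic signal, and the energy is a short sliding-window sum of squares. The `analytic_signal` and `windowed_energy` helpers, the window width, and the synthetic spike are all assumptions made for illustration.

```python
import cmath
import math


def analytic_signal(x):
    """Analytic signal via a naive O(n^2) DFT; adequate for short examples."""
    n = len(x)
    spectrum = [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
                for k in range(n)]
    for k in range(n):
        if k == 0 or (n % 2 == 0 and k == n // 2):
            gain = 1.0   # keep DC (and Nyquist when n is even)
        elif k < (n + 1) // 2:
            gain = 2.0   # double the positive frequencies
        else:
            gain = 0.0   # discard the negative frequencies
        spectrum[k] *= gain
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def windowed_energy(values, half_width):
    """Sum of squares in a window of 2 * half_width + 1 samples, clipped at the edges."""
    return [sum(v * v for v in values[max(0, i - half_width): i + half_width + 1])
            for i in range(len(values))]


# Synthetic stand-in for an ECG trace (not real data): a flat line with one spike.
n = 32
signal = [0.0] * n
signal[16] = 1.0

hilbert = [z.imag for z in analytic_signal(signal)]
energy = windowed_energy(hilbert, half_width=1)
peak = max(range(n), key=lambda i: energy[i])
print("highest Hilbert-transform energy at sample", peak)
```

A detector along these lines would flag samples whose windowed energy exceeds some threshold; how that threshold is chosen, and how the result drives frame creation, is not specified in the abstract.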
Abstract:
Currently, great emphasis is placed on seed meters that meet rigorous demands regarding the longitudinal distribution of seeds, as well as the rates of spacing failures, broken seeds and double seeds. Evaluating these variables demands much time and effort in data collection and processing. The objective of this work was to propose the use of normal probability plots to facilitate the treatment of the data and reduce processing time. The evaluation methodology consists of counting broken seeds, spacing failures and double seeds by measuring the spacing between seeds. Preliminary experiments were carried out with combinations of treatments whose factors of variation were the level of the seed reservoir, the leveling of the seed meter, the travel speed and the seed dosage. The evaluation was carried out in two parts: first through preliminary experiments to construct the normal probability plots, and later through experiments with larger samples to evaluate the influence of the most important factors. A seed meter with a rotating internal ring was evaluated, and the amount of data needed for the evaluation was greatly reduced by the normal probability plots, which made it possible to prioritize only the significant factors. Seed dosage (factor D) was the most important factor, having the greatest significance.
Abstract:
The physicochemical characteristics of yellow passion fruit at three peel color stages (1/3 yellow, 2/3 yellow and fully yellow) were evaluated at four times during the 1999 harvest season, in order to establish the harvest point that gives the best fruit quality for industrial processing. Fruit size, peel and pulp color, pulp extraction yield, total soluble solids content, pH, total acidity, ratio and vitamin C content were determined. A completely randomized experimental design was used, with 8 replicates, 5 fruits per plot and 3 treatments. The results showed that pulp yield did not differ significantly between treatments or between harvests. Regarding pulp color, there was a significant difference among the three stages only in the 1st harvest. Fruits with 1/3 yellow peel had a significantly lower soluble solids content than the others only in the 3rd and 4th harvests, but the highest mean value occurred in the 4th harvest. The total acidity of the fully yellow fruits was significantly lower than those of harvests 2, 3 and 4. The highest vitamin C values were obtained in the 1st harvest, and fruits with 1/3 yellow peel had significantly lower contents in harvests 1 and 2. In general, the results indicated that, although some differences in the physicochemical characteristics of the fruits occurred among the peel color stages, fruits harvested at all the peel color stages studied were suitable for industrial processing.
Abstract:
A computer system named ICADPLUS is presented, developed for building databases, tabulating data, calculating the CPO index (decayed, missing and filled teeth) and performing statistical analysis to estimate confidence intervals and compare the results of two populations. Its objective is to provide a simplified method that meets the needs of dental health services by processing the forms used by dental surgeons in epidemiological surveys of dental caries. The main feature of the system is that it dispenses with the need for a professional specialized in dentistry and computing, requiring only minimal typing skills from the user, since it offers simple and clear menus as well as standardized reports, with no possibility of error. It has options for CPO forms according to Klein and Palmer, CPO as proposed by the WHO, CPOS according to Klein, Palmer and Knutson, and ceo. The system was validated by comparison with other methods, which allows its adoption to be recommended.
Abstract:
This paper proposes a target tracking algorithm able to identify the position of moving targets in digital video sequences and to pursue them. The proposed approach aims to track moving targets inside the field of view of a digital camera. The position and trajectory of the target are identified using a neural network with a competitive learning technique. The winning neuron is trained to approach the target and then pursue it. A digital camera provides a sequence of images, and the algorithm processes those frames in real time, tracking the moving target. The algorithm is run with both black-and-white and multi-colored images to simulate real-world situations. Results show the effectiveness of the proposed algorithm, since the neurons tracked the moving targets even without any image pre-processing. Single and multiple moving targets are followed in real time.
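A minimal sketch of the winner-take-all update behind competitive learning is shown below. It assumes, purely for illustration, that each frame has already been reduced to one observed target coordinate; the paper itself works on the camera frames directly, and its network size, learning rate and input encoding are not given in the abstract. The `track` function, the neuron grid and the learning rate of 0.5 are hypothetical.

```python
def track(observations, neurons, rate=0.5):
    """For each observation, move only the closest neuron (the winner) toward it."""
    neurons = [list(p) for p in neurons]  # copy so the caller's list is untouched
    path = []
    for tx, ty in observations:
        # Competition: the neuron nearest to the target wins.
        winner = min(neurons, key=lambda w: (w[0] - tx) ** 2 + (w[1] - ty) ** 2)
        # Learning: the winner moves a fraction of the way toward the target.
        winner[0] += rate * (tx - winner[0])
        winner[1] += rate * (ty - winner[1])
        path.append((winner[0], winner[1]))
    return path


# Hypothetical setup: four neurons at the corners of a 100 x 100 field of view
# and a target drifting to the right by one pixel per frame.
grid = [(0.0, 0.0), (100.0, 0.0), (0.0, 100.0), (100.0, 100.0)]
observations = [(10.0 + t, 20.0) for t in range(40)]
path = track(observations, grid)
print("last target:", observations[-1], "last winner position:", path[-1])
```

Because only the winner is updated, one neuron locks onto the target and trails it by a lag that depends on the learning rate and the target speed; the other neurons stay free to win other targets.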
Abstract:
This paper presents a novel segmentation method for cuboidal cell nuclei in images of prostate tissue stained with hematoxylin and eosin. The proposed method allows segmenting normal, hyperplastic and cancerous prostate images in three steps: pre-processing, segmentation of cuboidal cell nuclei and post-processing. The pre-processing step consists of applying contrast stretching to the red (R) channel to highlight the contrast of cuboidal cell nuclei. The aim of the second step is to apply global thresholding based on minimum cross entropy to generate a binary image with candidate regions for cuboidal cell nuclei. In the post-processing step, false positives are removed using the connected component method. The proposed segmentation method was applied to an image bank with 105 samples and measures of sensitivity, specificity and accuracy were compared with those provided by other segmentation approaches available in the specialized literature. The results are promising and demonstrate that the proposed method allows the segmentation of cuboidal cell nuclei with a mean accuracy of 97%. © 2013 Elsevier Ltd. All rights reserved.
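The three steps can be sketched on a single-channel image as follows. This is an illustrative reading rather than the authors' implementation: it assumes the input is the red channel as a list of integer rows, that nuclei are darker than the background in that channel, that the minimum cross entropy criterion is evaluated exhaustively over all thresholds, and that post-processing discards 4-connected components below a hypothetical `min_area`. None of these details are stated in the abstract.

```python
import math


def stretch(img):
    """Contrast stretching: map [min, max] linearly onto [0, 255]."""
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    if hi == lo:
        return [[0] * len(row) for row in img]
    return [[round(255 * (v - lo) / (hi - lo)) for v in row] for row in img]


def min_cross_entropy_threshold(img):
    """Exhaustive search for the threshold minimizing -sum(s_c * log(mean_c)).

    Returns None when the image has a single gray level.
    """
    hist = [0] * 256
    for row in img:
        for v in row:
            hist[v] += 1
    best_t, best_cost = None, math.inf
    for t in range(1, 256):
        n_lo, n_hi = sum(hist[:t]), sum(hist[t:])
        if n_lo == 0 or n_hi == 0:
            continue
        s_lo = sum(g * hist[g] for g in range(t))
        s_hi = sum(g * hist[g] for g in range(t, 256))
        cost = -s_hi * math.log(s_hi / n_hi)
        if s_lo > 0:  # an all-zero class contributes nothing
            cost -= s_lo * math.log(s_lo / n_lo)
        if cost < best_cost:
            best_t, best_cost = t, cost
    return best_t


def remove_small_components(mask, min_area):
    """Keep only 4-connected regions with at least min_area pixels."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    out = [[False] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            if not mask[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            stack, component = [(r, c)], []
            while stack:
                y, x = stack.pop()
                component.append((y, x))
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        stack.append((ny, nx))
            if len(component) >= min_area:
                for y, x in component:
                    out[y][x] = True
    return out


# Synthetic 6 x 6 red channel (not real tissue): bright background,
# one dark 3 x 3 "nucleus" and one isolated dark pixel.
image = [[200 + 10 * ((r + c) % 2) for c in range(6)] for r in range(6)]
for r in range(1, 4):
    for c in range(1, 4):
        image[r][c] = 40 + 10 * ((r + c) % 2)
image[5][5] = 40

stretched = stretch(image)                                   # pre-processing
threshold = min_cross_entropy_threshold(stretched)           # segmentation
candidates = [[v < threshold for v in row] for row in stretched]
cleaned = remove_small_components(candidates, min_area=4)    # post-processing
for row in cleaned:
    print("".join("#" if v else "." for v in row))
```

On this toy input the isolated dark pixel passes the threshold but is dropped by the size filter, which is the role the abstract assigns to the connected component step.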
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Abstract:
Pós-graduação em Engenharia Elétrica - FEIS
Abstract:
Pós-graduação em Televisão Digital: Informação e Conhecimento - FAAC
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
The central aim of this work is to demonstrate the variability that exists in the forest with respect to above-ground forest biomass and carbon stocks, based on the identification and characterization, using remote sensing techniques, of landscape units in an area located in the municipality of Belterra, in the western region of the State of Pará, within the theoretical and conceptual framework of the Landscape Ecology approach. To this end, the methodology comprised a literature review on the topic, acquisition of cartographic and orbital data, use of remote sensing techniques, field data collection, and statistical treatment and analysis. The work is divided into four chapters, followed by its general considerations. Starting from the theoretical and methodological framework of Landscape Ecology, it analyzes the socio-environmental dynamics of the municipality of Belterra, which is currently experiencing an expansion of agricultural activities, notably mechanized soybean farming. Multitemporal analysis of Landsat images of the municipality made it possible to assess the distribution of its existing forest cover, as well as the spatial distribution pattern of the main landscape units identified. Within this scope, field data were collected through a forest inventory in four forest typologies (high plateau forest, low plateau forest, secondary vegetation and ecological tension) to obtain morphometric parameters of the vegetation and then quantify the biomass and carbon stocks contained in each unit, as well as to observe the structural behavior of the forest in each of them. Adopting the landscape as the spatial scale of analysis proved quite satisfactory for quantifying forest biomass and carbon stocks, since it allowed the influence of socioeconomic dynamics on the reduction of these stocks to be considered.
In addition, it made it possible to establish that recognizing the heterogeneity of the forest cover is fundamental to obtaining carbon estimates consistent with the structural characteristics of the vegetation, which vary with the topography of the terrain, the species present and the geographic characteristics, which involve the climate type and the geomorphological, pedological and geological characteristics of the area.