986 resultados para Road image databases
Resumo:
Scale not given.
Resumo:
The size of online image datasets is constantly increasing. Considering an image dataset with millions of images, image retrieval becomes a seemingly intractable problem for exhaustive similarity search algorithms. Hashing methods, which encodes high-dimensional descriptors into compact binary strings, have become very popular because of their high efficiency in search and storage capacity. In the first part, we propose a multimodal retrieval method based on latent feature models. The procedure consists of a nonparametric Bayesian framework for learning underlying semantically meaningful abstract features in a multimodal dataset, a probabilistic retrieval model that allows cross-modal queries and an extension model for relevance feedback. In the second part, we focus on supervised hashing with kernels. We describe a flexible hashing procedure that treats binary codes and pairwise semantic similarity as latent and observed variables, respectively, in a probabilistic model based on Gaussian processes for binary classification. We present a scalable inference algorithm with the sparse pseudo-input Gaussian process (SPGP) model and distributed computing. In the last part, we define an incremental hashing strategy for dynamic databases where new images are added to the databases frequently. The method is based on a two-stage classification framework using binary and multi-class SVMs. The proposed method also enforces balance in binary codes by an imbalance penalty to obtain higher quality binary codes. We learn hash functions by an efficient algorithm where the NP-hard problem of finding optimal binary codes is solved via cyclic coordinate descent and SVMs are trained in a parallelized incremental manner. For modifications like adding images from an unseen class, we propose an incremental procedure for effective and efficient updates to the previous hash functions. Experiments on three large-scale image datasets demonstrate that the incremental strategy is capable of efficiently updating hash functions to the same retrieval performance as hashing from scratch.
Resumo:
Image (Video) retrieval is an interesting problem of retrieving images (videos) similar to the query. Images (Videos) are represented in an input (feature) space and similar images (videos) are obtained by finding nearest neighbors in the input representation space. Numerous input representations both in real valued and binary space have been proposed for conducting faster retrieval. In this thesis, we present techniques that obtain improved input representations for retrieval in both supervised and unsupervised settings for images and videos. Supervised retrieval is a well known problem of retrieving same class images of the query. We address the practical aspects of achieving faster retrieval with binary codes as input representations for the supervised setting in the first part, where binary codes are used as addresses into hash tables. In practice, using binary codes as addresses does not guarantee fast retrieval, as similar images are not mapped to the same binary code (address). We address this problem by presenting an efficient supervised hashing (binary encoding) method that aims to explicitly map all the images of the same class ideally to a unique binary code. We refer to the binary codes of the images as `Semantic Binary Codes' and the unique code for all same class images as `Class Binary Code'. We also propose a new class based Hamming metric that dramatically reduces the retrieval times for larger databases, where only hamming distance is computed to the class binary codes. We also propose a Deep semantic binary code model, by replacing the output layer of a popular convolutional Neural Network (AlexNet) with the class binary codes and show that the hashing functions learned in this way outperforms the state of the art, and at the same time provide fast retrieval times. In the second part, we also address the problem of supervised retrieval by taking into account the relationship between classes. For a given query image, we want to retrieve images that preserve the relative order i.e. we want to retrieve all same class images first and then, the related classes images before different class images. We learn such relationship aware binary codes by minimizing the similarity between inner product of the binary codes and the similarity between the classes. We calculate the similarity between classes using output embedding vectors, which are vector representations of classes. Our method deviates from the other supervised binary encoding schemes as it is the first to use output embeddings for learning hashing functions. We also introduce new performance metrics that take into account the related class retrieval results and show significant gains over the state of the art. High Dimensional descriptors like Fisher Vectors or Vector of Locally Aggregated Descriptors have shown to improve the performance of many computer vision applications including retrieval. In the third part, we will discuss an unsupervised technique for compressing high dimensional vectors into high dimensional binary codes, to reduce storage complexity. In this approach, we deviate from adopting traditional hyperplane hashing functions and instead learn hyperspherical hashing functions. The proposed method overcomes the computational challenges of directly applying the spherical hashing algorithm that is intractable for compressing high dimensional vectors. A practical hierarchical model that utilizes divide and conquer techniques using the Random Select and Adjust (RSA) procedure to compress such high dimensional vectors is presented. We show that our proposed high dimensional binary codes outperform the binary codes obtained using traditional hyperplane methods for higher compression ratios. In the last part of the thesis, we propose a retrieval based solution to the Zero shot event classification problem - a setting where no training videos are available for the event. To do this, we learn a generic set of concept detectors and represent both videos and query events in the concept space. We then compute similarity between the query event and the video in the concept space and videos similar to the query event are classified as the videos belonging to the event. We show that we significantly boost the performance using concept features from other modalities.
Resumo:
In lead pencil: Aug. 91.
Resumo:
"3d day of October 1854."
Resumo:
The usage of Optical Character Recognition’s (OCR, systems is a widely spread technology into the world of Computer Vision and Machine Learning. It is a topic that interest many field, for example the automotive, where becomes a specialized task known as License Plate Recognition, useful for many application from the automation of toll road to intelligent payments. However, OCR systems need to be very accurate and generalizable in order to be able to extract the text of license plates under high variable conditions, from the type of camera used for acquisition to light changes. Such variables compromise the quality of digitalized real scenes causing the presence of noise and degradation of various type, which can be minimized with the application of modern approaches for image iper resolution and noise reduction. Oneclass of them is known as Generative Neural Networks, which are very strong ally for the solution of this popular problem.
Resumo:
Diabetic Retinopathy (DR) is a complication of diabetes that can lead to blindness if not readily discovered. Automated screening algorithms have the potential to improve identification of patients who need further medical attention. However, the identification of lesions must be accurate to be useful for clinical application. The bag-of-visual-words (BoVW) algorithm employs a maximum-margin classifier in a flexible framework that is able to detect the most common DR-related lesions such as microaneurysms, cotton-wool spots and hard exudates. BoVW allows to bypass the need for pre- and post-processing of the retinographic images, as well as the need of specific ad hoc techniques for identification of each type of lesion. An extensive evaluation of the BoVW model, using three large retinograph datasets (DR1, DR2 and Messidor) with different resolution and collected by different healthcare personnel, was performed. The results demonstrate that the BoVW classification approach can identify different lesions within an image without having to utilize different algorithms for each lesion reducing processing time and providing a more flexible diagnostic system. Our BoVW scheme is based on sparse low-level feature detection with a Speeded-Up Robust Features (SURF) local descriptor, and mid-level features based on semi-soft coding with max pooling. The best BoVW representation for retinal image classification was an area under the receiver operating characteristic curve (AUC-ROC) of 97.8% (exudates) and 93.5% (red lesions), applying a cross-dataset validation protocol. To assess the accuracy for detecting cases that require referral within one year, the sparse extraction technique associated with semi-soft coding and max pooling obtained an AUC of 94.2 ± 2.0%, outperforming current methods. Those results indicate that, for retinal image classification tasks in clinical practice, BoVW is equal and, in some instances, surpasses results obtained using dense detection (widely believed to be the best choice in many vision problems) for the low-level descriptors.
Resumo:
The consumption of dietary supplements is highest among athletes and it can represent potential a health risk for consumers. The aim of this study was to determine the prevalence of consumption of dietary supplements by road runners. We interviewed 817 volunteers from four road races in the Brazilian running calendar. The sample consisted of 671 male and 146 female runners with a mean age of 37.9 ± 12.4 years. Of the sample, 28.33% reported having used some type of dietary supplement. The main motivation for this consumption is to increase in stamina and improve performance. The probability of consuming dietary supplements increased 4.67 times when the runners were guided by coaches. The consumption of supplements was strongly correlated (r = 0.97) with weekly running distance, and also highly correlated (r = 0.86) with the number of years the sport had been practiced. The longer the runner had practiced the sport, the higher the training volume and the greater the intake of supplements. The five most frequently cited reasons for consumption were: energy enhancement (29.5%), performance improvement (17.1%), increased level of endurance (10.3%), nutrient replacement (11.1%), and avoidance of fatigue (10.3%). About 30% of the consumers declared more than one reason for taking dietary supplements. The most consumed supplements were: carbohydrates (52.17%), vitamins (28.70%), and proteins (13.48%). Supplement consumption by road runners in Brazil appeared to be guided by the energy boosting properties of the supplement, the influence of coaches, and the experience of the user. The amount of supplement intake seemed to be lower among road runners than for athletes of other sports. We recommend that coaches and nutritionists emphasise that a balanced diet can meet the needs of physically active people.
Resumo:
Universidade Estadual de Campinas . Faculdade de Educação Física
Resumo:
Universidade Estadual de Campinas . Faculdade de Educação Física
Resumo:
Universidade Estadual de Campinas . Faculdade de Educação Física
Resumo:
PURPOSE: Juvenile idiopathic arthritis (JIA) has unknown etiology, and the involvement of the temporomandibular joint (TMJ) is rare in the early phase of the disease. The present article describes the use of computed tomography (CT) and magnetic resonance (MRI) images for the diagnosis of affected TMJ in JIA. CASE DESCRIPTION: A 12-year-old, female, Caucasian patient, with systemic rheumathoid arthritis and involvement of multiple joints was referred to the Imaging Center for TMJ assessment. The patient reported TMJ pain and limited opening of the mouth. The helical CT examination of the TMJ region showed asymmetric mandibular condyles, erosion of the right condyle and osteophyte-like formation. The MRI examination showed erosion of the right mandibular condyle, osteophytes, displacement without reduction and disruption of the articular disc. CONCLUSION: The disorders of the TMJ as a consequence of JIA must be carefully assessed by modern imaging methods such as CT and MRI. CT is very useful for the evaluation of discrete bone changes, which are not identified by conventional radiographs in the early phase of JIA. MRI allows the evaluation of soft tissues, the identification of acute articular inflammation and the differentiation between pannus and synovial hypertrophy.
Resumo:
Background: The present work aims at the application of the decision theory to radiological image quality control ( QC) in diagnostic routine. The main problem addressed in the framework of decision theory is to accept or reject a film lot of a radiology service. The probability of each decision of a determined set of variables was obtained from the selected films. Methods: Based on a radiology service routine a decision probability function was determined for each considered group of combination characteristics. These characteristics were related to the film quality control. These parameters were also framed in a set of 8 possibilities, resulting in 256 possible decision rules. In order to determine a general utility application function to access the decision risk, we have used a simple unique parameter called r. The payoffs chosen were: diagnostic's result (correct/incorrect), cost (high/low), and patient satisfaction (yes/no) resulting in eight possible combinations. Results: Depending on the value of r, more or less risk will occur related to the decision-making. The utility function was evaluated in order to determine the probability of a decision. The decision was made with patients or administrators' opinions from a radiology service center. Conclusion: The model is a formal quantitative approach to make a decision related to the medical imaging quality, providing an instrument to discriminate what is really necessary to accept or reject a film or a film lot. The method presented herein can help to access the risk level of an incorrect radiological diagnosis decision.
Resumo:
Background: Chrysotile is considered less harmful to human health than other types of asbestos fibers. Its clearance from the lung is faster and, in comparison to amphibole forms of asbestos, chrysotile asbestos fail to accumulate in the lung tissue due to a mechanism involving fibers fragmentation in short pieces. Short exposure to chrysotile has not been associated with any histopathological alteration of lung tissue. Methods: The present work focuses on the association of small chrysotile fibers with interphasic and mitotic human lung cancer cells in culture, using for analyses confocal laser scanning microscopy and 3D reconstructions. The main goal was to perform the analysis of abnormalities in mitosis of fibers-containing cells as well as to quantify nuclear DNA content of treated cells during their recovery in fiber-free culture medium. Results: HK2 cells treated with chrysotile for 48 h and recovered in additional periods of 24, 48 and 72 h in normal medium showed increased frequency of multinucleated and apoptotic cells. DNA ploidy of the cells submitted to the same chrysotile treatment schedules showed enhanced aneuploidy values. The results were consistent with the high frequency of multipolar spindles observed and with the presence of fibers in the intercellular bridge during cytokinesis. Conclusion: The present data show that 48 h chrysotile exposure can cause centrosome amplification, apoptosis and aneuploid cell formation even when long periods of recovery were provided. Internalized fibers seem to interact with the chromatin during mitosis, and they could also interfere in cytokinesis, leading to cytokinesis failure which forms aneuploid or multinucleated cells with centrosome amplification.