848 results for Content Based Image Retrieval (CBIR)


Relevance:

100.00% 100.00%

Publisher:

Abstract:

In this thesis, an image enhancement application is developed for low-vision patients who use iPhones to view images and watch videos. The thesis makes two contributions. The first is a new image enhancement algorithm that incorporates features of human vision. It is modified from a wavelet-transform-based image enhancement algorithm developed by Dr. Jinshan Tang; unlike the original, the new algorithm incorporates human visual features, which makes it more effective. Experimental simulation results show that the proposed algorithm produces better visual results than the algorithm without visual features. The second contribution is a mobile image enhancement application, with which low-vision users can see clearer images on an iPhone running the application I have developed.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Sketches are a unique way to communicate: drawing a simple sketch does not require any training, sketches convey information that is hard to describe with words, they are powerful enough to represent almost any concept, and nowadays it is possible to draw directly on mobile devices. Motivated by the unique characteristics of sketches and fascinated by the human ability to imagine 3D objects from drawings, this thesis focuses on automatically associating geometric information with sketches. The main research directions of the thesis can be summarized as obtaining geometric information from freehand scene sketches to improve 2D sketch-based tasks, and investigating Vision-Language models to overcome the limitations of 3D sketch-based tasks. The first part of the thesis concerns the prediction of geometric information from scene sketches, which improves scene-sketch-to-image generation and unlocks new creative effects. The thesis then presents a study of the Vision-Language model embedding space, considering sketches, line renderings and RGB renderings of 3D shapes, with the aim of avoiding supervised datasets for 3D sketch-based tasks, which are limited and hard to acquire. Following the resulting observations, Vision-Language models are applied to Sketch Based Shape Retrieval without the need for training on supervised datasets. We then analyze the use of Vision-Language models for sketch-based 3D reconstruction in an unsupervised manner. In the final chapter we report the results of an additional project carried out during the PhD, which has led to the development of a framework for learning an embedding space of neural networks that can be navigated to obtain ready-to-use models with desired characteristics.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In this paper, we describe the Vannotea system - an application designed to enable collaborating groups to discuss and annotate collections of high quality images, video, audio or 3D objects. The system has been designed specifically to capture and share scholarly discourse and annotations about multimedia research data by teams of trusted colleagues within a research or academic environment. As such, it provides: authenticated access to a web browser search interface for discovering and retrieving media objects; a media replay window that can incorporate a variety of embedded plug-ins to render different scientific media formats; an annotation authoring, editing, searching and browsing tool; and session logging and replay capabilities. Annotations are personal remarks, interpretations, questions or references that can be attached to whole files, segments or regions. Vannotea enables annotations to be attached either synchronously (using Jabber message passing and audio/video conferencing) or asynchronously and stand-alone. The annotations are stored on an Annotea server, extended for multimedia content. Their access, retrieval and re-use are controlled via Shibboleth identity management and XACML access policies.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In this paper, we describe a model of the human visual system (HVS) based on the wavelet transform. This model is largely based on a previously proposed model, but has a number of modifications that make it more amenable to potential integration into a wavelet-based image compression scheme. These modifications include the use of a separable wavelet transform instead of the cortex transform, the application of a wavelet contrast sensitivity function (CSF), and a simplified definition of subband contrast that allows us to predict noise visibility directly from wavelet coefficients. Initially, we outline the luminance, frequency, and masking sensitivities of the HVS and discuss how these can be incorporated into the wavelet transform. We then outline a number of limitations of the wavelet transform as a model of the HVS, namely the lack of translational invariance and poor orientation sensitivity. In order to investigate the efficacy of this wavelet-based model, a wavelet visible difference predictor (WVDP) is described. The WVDP is then used to predict visible differences between an original and a compressed (or noisy) image. Results are presented to emphasize the limitations of commonly used measures of image quality and to demonstrate the performance of the WVDP. The paper concludes with suggestions on how the WVDP can be used to determine a visually optimal quantization strategy for wavelet coefficients and produce a quantitative measure of image quality.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Dissertation submitted to the Escola Superior de Comunicação Social in partial fulfillment of the requirements for the degree of Master in Audiovisual and Multimedia.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In visual sensor networks, local feature descriptors can be computed at the sensing nodes, which work collaboratively on the acquired data to perform efficient visual analysis. With minimal computational effort, the detection and extraction of local features, such as binary descriptors, can provide a reliable and compact image representation. This paper proposes extracting and coding binary descriptors to meet the energy and bandwidth constraints at each sensing node. The major contribution is a binary descriptor coding technique that exploits correlation using two different coding modes: Intra, which exploits the correlation between the elements that compose a descriptor; and Inter, which exploits the correlation between descriptors of the same image. The experimental results show bitrate savings of up to 35% without any impact on the performance of the image retrieval task. © 2014 EURASIP.
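
To make the two coding modes named in the abstract above more concrete, here is a minimal Python sketch. It is an illustration only and not the paper's method: the abstract does not say how prediction is done, so the sketch assumes a simple XOR-based predictor for both modes. Descriptors are modeled as plain integers, and all function names (`intra_residual`, `inter_encode`, and so on) are hypothetical. The idea is that a well-predicted descriptor leaves a residual with few set bits, which an entropy coder (not shown) could then represent more cheaply.

```python
def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two descriptors differ."""
    return bin(a ^ b).count("1")


def intra_residual(desc: int) -> int:
    """Assumed Intra mode: predict each bit from its higher neighbor.

    The residual bit is 1 only where adjacent bits of the descriptor differ.
    """
    return desc ^ (desc >> 1)


def intra_decode(residual: int) -> int:
    """Invert intra_residual by accumulating XORs of shifted residuals."""
    desc = 0
    while residual:
        desc ^= residual
        residual >>= 1
    return desc


def inter_encode(desc: int, already_coded: list[int]) -> tuple[int, int]:
    """Assumed Inter mode: XOR against the closest already-coded descriptor.

    Returns the index of the chosen reference and the residual bits.
    """
    ref_index = min(range(len(already_coded)),
                    key=lambda i: hamming(desc, already_coded[i]))
    return ref_index, desc ^ already_coded[ref_index]


def inter_decode(ref_index: int, residual: int, already_coded: list[int]) -> int:
    """Recover the descriptor from its reference index and residual."""
    return already_coded[ref_index] ^ residual


if __name__ == "__main__":
    coded = [0b00001111, 0b11110000]       # toy 8-bit descriptors
    new = 0b11110100
    idx, res = inter_encode(new, coded)
    print(idx, bin(res).count("1"))        # reference index, set bits in residual
    print(bin(intra_residual(0b11110000)).count("1"))
```

Both transforms are lossless, so the decoder recovers the exact descriptor; any bitrate saving would come from the entropy coding stage applied to the sparser residuals.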

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The expansion of Digital Television and the convergence between conventional broadcasting and television over IP have contributed to a gradual increase in the number of available channels and in on-demand video content. Moreover, the spread of mobile devices such as laptops, smartphones and tablets in everyday activities has shifted the traditional television viewing paradigm from the couch to everywhere, anytime, from any device. Although this new scenario enables a great improvement in viewing experiences, it also brings new challenges given the overload of information that the viewer faces. Recommendation systems stand out as a possible solution to help viewers select the content that best fits their preferences. This paper describes a web-based system that helps users navigate broadcast and online television content by implementing recommendations based on collaborative and content-based filtering. The algorithms developed estimate the similarity between items and users and predict the rating that a user would assign to a particular item (television program, movie, etc.). To enable interoperability between different systems, program characteristics (title, genre, actors, etc.) are stored according to the TV-Anytime standard. The set of recommendations produced is presented through a Web Application that allows the user to interact with the system based on the obtained recommendations.
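
The similarity estimation and rating prediction described in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the system's actual algorithm: the abstract does not name a similarity measure, so cosine similarity between items and a similarity-weighted average are assumed here. The ratings, user names and function names are all hypothetical.

```python
from math import sqrt

# Hypothetical toy ratings: user -> {item: rating on a 1-5 scale}.
ratings = {
    "ana":   {"news": 5, "football": 1, "drama": 4},
    "bruno": {"news": 4, "football": 2, "drama": 5},
    "carla": {"news": 1, "football": 5},
}


def item_vector(item):
    """Ratings given to `item`, keyed by user."""
    return {user: r[item] for user, r in ratings.items() if item in r}


def cosine_similarity(item_a, item_b):
    """Cosine similarity between two items over the users who rated both."""
    va, vb = item_vector(item_a), item_vector(item_b)
    common = va.keys() & vb.keys()
    if not common:
        return 0.0
    dot = sum(va[u] * vb[u] for u in common)
    norm_a = sqrt(sum(va[u] ** 2 for u in common))
    norm_b = sqrt(sum(vb[u] ** 2 for u in common))
    return dot / (norm_a * norm_b)


def predict_rating(user, item):
    """Similarity-weighted average of the user's ratings for other items.

    Returns None when no rated item has any similarity to `item`.
    """
    rated = ratings[user]
    weights = {other: cosine_similarity(item, other)
               for other in rated if other != item}
    total = sum(weights.values())
    if total == 0:
        return None
    return sum(weights[o] * rated[o] for o in weights) / total


if __name__ == "__main__":
    # Predict how "carla" might rate "drama", which she has not rated yet.
    print(predict_rating("carla", "drama"))
```

Plain cosine similarity on non-negative ratings tends to produce similarities close to 1, so a practical system would more likely use mean-centered ratings or a correlation-based measure; the sketch keeps the simplest form for readability.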

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Tourist guides today comprise several modules, notably recommendation and user-modeling modules. These help tailor the recommendations given to users according to their preferences. The need to adapt tourist guides to users' possible health needs motivated this dissertation. When visiting an unfamiliar place, people usually consider its accommodation and food conditions. However, if for some reason they need health care, they are not prepared for it. Recommending a health-care institution suited to the tourist is therefore a possible solution to this problem. The aim was to develop a hybrid recommendation module that provides information related to the tourist's possible health needs, taking the tourist's profile into account. Its implementation followed a content-based approach together with classification techniques for the health-care institutions to be recommended to the user. The functionality of the resulting prototype was tested with a few users. Finally, the prototype is intended to be tested with more users who differ in mobility conditions, clinical history and needs. These tests will make it possible to evaluate the quality of the recommendations the prototype provides. It will then be possible to meet the goal of integrating the prototype into a tourist-support recommendation system used by the Câmara Municipal do Porto.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

A large share of online traffic originates from search engine results pages. Search engines are today a fundamental tool that tourists rely on to search for and filter the information they need to plan their trips, and tourism-related organizations therefore give them considerable weight when defining their marketing strategies. This document describes research into how the Google search engine works and the metrics it uses to evaluate websites and web pages. This research led to the implementation of a website with content for the travel and tourism market in Portugal, focused on the foreign tourism market: All About Portugal. Drawing on SEO guidelines, the implementation of the website aims to show that content dissemination based solely on search engines is viable, thereby confirming their importance. The usage data from this website introduce new elements that may serve as a basis for further studies.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Studies in Computational Intelligence, 616

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Students have different ways of learning and processing information. Some students prefer learning through seeing while others prefer learning through listening; some students prefer doing activities while others prefer reflecting. Some students reason logically, while others reason intuitively, etc. Identifying the learning style of each student, and providing learning content based on these styles, is a good way to enhance learning quality. However, there have been no efforts to detect students' learning styles in mobile computer supported collaborative learning (MCSCL) environments. In this paper we present new ways of automatically detecting the learning styles of students in MCSCL environments, based on the Felder-Silverman learning style model. The identified learning styles can then be stored and used at any time to assign each student to his/her appropriate learning group.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The professional development of mathematics teachers, through national programs and in-service training, should provide experiences involving inquiry, thinking, planning, practice and reflection. In the case of technology, we should focus not on the tools themselves but on how teachers use them in the classroom. There are taxonomies of content-based learning activities grounded in the idea of the teacher as a curriculum builder who, in order to integrate educational technology successfully into lessons, develops technological pedagogical content knowledge (TPACK); the taxonomy for mathematics is presented here. Drawing on several national and international studies, the text reflects that teachers should use technologies in line with specific objectives, content and pedagogies if these are to have a positive effect on student learning, and on the content-based activities that best fit those technologies.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This project deals with the automatic indexing of television content. The task will gain importance with the imminent changes to television as we know it. The arrival of the new digital television will bring much more fluid interaction between the viewer and the broadcaster, as well as large numbers of channels, each with entirely different types of programs. All of this will make content-based search methods for these programs absolutely essential. Our project is therefore focused entirely on extracting some of the descriptors that will make it possible to categorize the different television programs.