969 resultados para visual search
Resumo:
Relative (comparative) attributes are promising for thematic ranking of visual entities, which also aids in recognition tasks. However, attribute rank learning often requires a substantial amount of relational supervision, which is highly tedious, and apparently impractical for real-world applications. In this paper, we introduce the Semantic Transform, which under minimal supervision, adaptively finds a semantic feature space along with a class ordering that is related in the best possible way. Such a semantic space is found for every attribute category. To relate the classes under weak supervision, the class ordering needs to be refined according to a cost function in an iterative procedure. This problem is ideally NP-hard, and we thus propose a constrained search tree formulation for the same. Driven by the adaptive semantic feature space representation, our model achieves the best results to date for all of the tasks of relative, absolute and zero-shot classification on two popular datasets. © 2013 IEEE.
Resumo:
Recovering a volumetric model of a person, car, or other object of interest from a single snapshot would be useful for many computer graphics applications. 3D model estimation in general is hard, and currently requires active sensors, multiple views, or integration over time. For a known object class, however, 3D shape can be successfully inferred from a single snapshot. We present a method for generating a ``virtual visual hull''-- an estimate of the 3D shape of an object from a known class, given a single silhouette observed from an unknown viewpoint. For a given class, a large database of multi-view silhouette examples from calibrated, though possibly varied, camera rigs are collected. To infer a novel single view input silhouette's virtual visual hull, we search for 3D shapes in the database which are most consistent with the observed contour. The input is matched to component single views of the multi-view training examples. A set of viewpoint-aligned virtual views are generated from the visual hulls corresponding to these examples. The 3D shape estimate for the input is then found by interpolating between the contours of these aligned views. When the underlying shape is ambiguous given a single view silhouette, we produce multiple visual hull hypotheses; if a sequence of input images is available, a dynamic programming approach is applied to find the maximum likelihood path through the feasible hypotheses over time. We show results of our algorithm on real and synthetic images of people.
Resumo:
Some WWW image engines allow the user to form a query in terms of text keywords. To build the image index, keywords are extracted heuristically from HTML documents containing each image, and/or from the image URL and file headers. Unfortunately, text-based image engines have merely retro-fitted standard SQL database query methods, and it is difficult to include images cues within such a framework. On the other hand, visual statistics (e.g., color histograms) are often insufficient for helping users find desired images in a vast WWW index. By truly unifying textual and visual statistics, one would expect to get better results than either used separately. In this paper, we propose an approach that allows the combination of visual statistics with textual statistics in the vector space representation commonly used in query by image content systems. Text statistics are captured in vector form using latent semantic indexing (LSI). The LSI index for an HTML document is then associated with each of the images contained therein. Visual statistics (e.g., color, orientedness) are also computed for each image. The LSI and visual statistic vectors are then combined into a single index vector that can be used for content-based search of the resulting image database. By using an integrated approach, we are able to take advantage of possible statistical couplings between the topic of the document (latent semantic content) and the contents of images (visual statistics). This allows improved performance in conducting content-based search. This approach has been implemented in a WWW image search engine prototype.
Resumo:
A common challenge that users of academic databases face is making sense of their query outputs for knowledge discovery. This is exacerbated by the size and growth of modern databases. PubMed, a central index of biomedical literature, contains over 25 million citations, and can output search results containing hundreds of thousands of citations. Under these conditions, efficient knowledge discovery requires a different data structure than a chronological list of articles. It requires a method of conveying what the important ideas are, where they are located, and how they are connected; a method of allowing users to see the underlying topical structure of their search. This paper presents VizMaps, a PubMed search interface that addresses some of these problems. Given search terms, our main backend pipeline extracts relevant words from the title and abstract, and clusters them into discovered topics using Bayesian topic models, in particular the Latent Dirichlet Allocation (LDA). It then outputs a visual, navigable map of the query results.
Resumo:
A rapidly increasing number of Web databases are now become accessible via
their HTML form-based query interfaces. Query result pages are dynamically generated
in response to user queries, which encode structured data and are displayed for human
use. Query result pages usually contain other types of information in addition to query
results, e.g., advertisements, navigation bar etc. The problem of extracting structured data
from query result pages is critical for web data integration applications, such as comparison
shopping, meta-search engines etc, and has been intensively studied. A number of approaches
have been proposed. As the structures of Web pages become more and more complex, the
existing approaches start to fail, and most of them do not remove irrelevant contents which
may a®ect the accuracy of data record extraction. We propose an automated approach for
Web data extraction. First, it makes use of visual features and query terms to identify data
sections and extracts data records in these sections. We also represent several content and
visual features of visual blocks in a data section, and use them to ¯lter out noisy blocks.
Second, it measures similarity between data items in di®erent data records based on their
visual and content features, and aligns them into di®erent groups so that the data in the
same group have the same semantics. The results of our experiments with a large set of
Web query result pages in di®erent domains show that our proposed approaches are highly
e®ective.
Resumo:
PURPOSE: Glaucoma patients are still at risk of becoming blind. It is of clinical significance to determine the risk of blindness and its causes to prevent its occurrence. This systematic review estimates the number of treated glaucoma patients with end-of-life visual impairment (VI) and blindness and the factors that are associated with this.
METHODS: A systematic literature search in relevant databases was conducted in August 2014 on end-of-life VI. A total of 2574 articles were identified, of which 5 on end-of-life VI. Several data items were extracted from the reports and presented in tables.
RESULTS: All studies had a retrospective design. A considerable number of glaucoma patients were found to be blind at the end of their life; with up to 24% unilateral and 10% bilateral blindness. The following factors were associated with blindness: (1) baseline severity of visual field loss: advanced stage of glaucoma or substantial visual field loss at the initial visit; (2) factors influencing progression: fluctuation of intraocular pressure (IOP) during treatment, presence of pseudoexfoliation, poor patient compliance, higher IOP; (3) longer time period: longer duration of disease and older age at death because of a longer life expectancy; and (4) coexistence of other ocular pathology.
CONCLUSIONS: Further prevention of blindness in glaucoma patients is needed. To reach this goal, it is important to address the risk factors for blindness identified in this review, especially those that can be modified, such as advanced disease at diagnosis, high and fluctuating IOP, and poor compliance.
Resumo:
Anticipating the increase in video information in future, archiving of news is an important activity in the visual media industry. When the volume of archives increases, it will be difficult for journalists to find the appropriate content using current search tools. This paper provides the details of the study we conducted about the news extraction systems used in different news channels in Kerala. Semantic web technologies can be used effectively since news archiving share many of the characteristics and problems of WWW. Since visual news archives of different media resources follow different metadata standards, interoperability between the resources is also an issue. World Wide Web Consortium has proposed a draft for an ontology framework for media resource which addresses the intercompatiblity issues. In this paper, the w3c proposed framework and its drawbacks is also discussed
Resumo:
The ISO norm line 9241 states some criteria for ergonomics of human system interaction. In markets with a huge variety of offers and little possibility of differentiation, providers can gain a decisive competitive advantage by user oriented interfaces. A precondition for this is that relevant information can be obtained for entrepreneurial decisions in this regard. To test how users of universal search result pages use those pages and pay attention to different elements, an eye tracking experiment with a mixed design has been developed. Twenty subjects were confronted with search engine result pages (SERPs) and were instructed to make a decision while conditions “national vs. international city” and “with vs. without miniaturized Google map” were used. Different parameters like fixation count, duration and time to first fixation were computed from the eye tracking raw data and supplemented by click rate data as well as data from questionnaires. Results of this pilot study revealed some remarkable facts like a vampire effect on miniaturized Google maps. Furthermore, Google maps did not shorten the process of decision making, Google ads were not fixated, visual attention on SERPs was influenced by position of the elements on the SERP and by the users’ familiarity with the search target. These results support the theory of Amount of Invested Mental Effort (AIME) and give providers empirical evidence to take users’ expectations into account. Furthermore, the results indicated that the task oriented goal mode of participants was a moderator for the attention spent on ads. Most important, SERPs with images attracted the viewers’ attention much longer than those without images. This unique selling proposition may lead to a distortion of competition on markets.
Resumo:
The crisis of the national project in the early 1990s, caused by a short-lived but disastrous government, led Brazilian art cinema, for the first time, to look at itself as periphery and re-approach the old colonial center, Portugal. Terra estrangeira/Foreign Land (Walter Salles & Daniela Thomas, Brazil/Portugal, 1995), a film about Brazilian exiles in Portugal, is the best illustration of this perspective shift which provides a new sense of Brazil’s scale and position within a global context. Shot mainly on location in São Paulo, Lisbon and Cape Verde, it promotes the encounter of Lusophone peoples who find a common ground in their marginal situation. Rather than as a former empire, Portugal is defined by its situation at the edge of Europe and by beliefs such as Sebastianism, whose origins go back to the time when the country was dominated by Spain. As a result, notions of “core” or “center” are devolved to the realm of myth. The film’s carefully crafted dialogue combines Brazilian, Portuguese and Creole linguistic peculiarities into a common dialect of exclusion, while language puns trigger visual rhymes which refer back to the Cinema Novo (the Brazilian New Wave) repertoire and restage the imaginary of the discovery turned into unfulfilled utopia. The main characters also acquire historical resonances, as they are depicted as descendants of Iberian conquistadors turned into smugglers of precious stones in the present. Their activities define a circuit of international exchange which resonates with that of globalized cinema, a realm in which Foreign Land, made up of citations and homage to other cinemas, tries to retrieve a sense of belonging.
Resumo:
The crisis of the national project in the early 1990s, caused by a short-lived but disastrous government, led Brazilian art cinema, for the first time, to look at itself as periphery and re-approach the old colonial centre, Portugal. Terra estrangeira/Foreign Land (Walter Salles & Daniela Thomas, Brazil/Portugal, 1995), a film about Brazilian exiles in Portugal, is the best illustration of this perspective shift aimed at providing a new sense of Brazil’s scale and position within a global context. Shot mainly on location in São Paulo, Lisbon and Cape Verde, it promotes the encounter of Lusophone peoples who find a common ground in their marginal situation. Even Portugal is defined by its location at the edge of Europe and by beliefs such as Sebastianism, whose origins go back to the time when the country was dominated by Spain. As a result, notions of ‘core’ or ‘centre’ are devolved to the realm of myth. The film’s carefully crafted dialogues combine Brazilian, Portuguese and Creole linguistic peculiarities into a common dialect of exclusion, while language puns trigger visual rhymes which refer back to the Cinema Novo (the Brazilian New Wave) repertoire and restage the imaginary of the discovery turned into unfulfilled utopia. The main characters also acquire historical resonances, as they are depicted as descendants of Iberian conquistadors turned into smugglers of precious stones in the present. Their activities define a circuit of international exchange which resonates with that of globalized cinema, a realm in which Foreign Land, made up of citations and homage to other cinemas, tries to retrieve a sense of belonging.
Resumo:
Spatial memory is important for locating objects in hierarchical data structures, such as desktop folders. There are, however, some contradictions in literature concerning the effectiveness of 3D user interfaces when compared to their 2D counterparts. This paper uses a task-based approach in order to investigate the effectiveness of adding a third dimension to specific user tasks, i.e. the impact of depth on navigation in a 3D file manager. Results highlight issues and benefits of using 3D interfaces for visual and verbal tasks, and introduces the possible existence of a correlation between aptitude scores achieved on the Guilford- Zimmerman Orientation Survey and Electroencephalography- measured brainwave activity as participants search for targets of variable perceptual salience in 2D and 3D environments.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The maintenance of a given body orientation is obtained by the complex relation between sensory information and muscle activity. Therefore, this study purpose was to review the role of visual, somatosensory, vestibular and auditory information in the maintenance and control of the posture. Method. a search by papers for the last 24 years was done in the PubMed and CAPES databases. The following keywords were used: postural control, sensory information, vestibular system, visual system, somatosensory system, auditory system and haptic system. Results. the influence of each sensory system and its integration were analyzed for the maintenance and control of the posture. Conclusion. the literature showed that there is information redundancy provided by sensory channels. Thus, the central nervous system chooses the main source for the posture control.
Resumo:
Pós-graduação em Artes - IA
Resumo:
Pós-graduação em Ciências da Motricidade - IBRC